Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

QUESTION 1 The following table indicates the expression of 3 genes (rows) at two

ID: 282149 • Letter: Q

Question

QUESTION 1

The following table indicates the expression of 3 genes (rows) at two different time points (columns)

g1 2 0

g2 4 0

g3 8 0

Let's say a k-means clustering, on this data, identifies 2 clusters with centroids locations as (2,0) and (10,0). what is the square error distortion of this clustering result ?

-----------------------

Question 3

How many leaf nodes are there in the suffix tree of the text "AAAGG" ?

5

7

4
-------------------------

Question 4

The following table indicates the expression of 7 genes (rows) at two different time points (columns)

g1 2 2
g2 4 4
g3 6 6
g4 0 4
g5 4 0
g6 5 5
g7 9 9

In the first iteration of k-means three clusters are assigned as C1 = {g1,g2,g3}, C2 = {g4,g5}, and C3 ={g6,g7}. What are the centroids of clusters C1, C2, and C3 ?

(3,3), (2,2), and (7,7)

(4,4), (2,2), and (7,7)

(4,4), (2,2), and (8,8)

(4,4), (5,5), and (7,7)

1.50

1.67

1.00

2.50

Explanation / Answer

Answer :square error distortion =8

Leaf nodes will be 5

Question4.

Centroids clusters are

C1=(g1,g2,g3)=2+4+6/3,2+4+6/3)(0+4/2,0+4/2)(5+9/2,5+9/2)

=(4,4)(2,2)(7,7) option b

Hire Me For All Your Tutoring Needs
Integrity-first tutoring: clear explanations, guidance, and feedback.
Drop an Email at
drjack9650@gmail.com
Chat Now And Get Quote