QUESTION 1 The following table indicates the expression of 3 genes (rows) at two
ID: 282149 • Letter: Q
Question
QUESTION 1
The following table indicates the expression of 3 genes (rows) at two different time points (columns)
g1 2 0
g2 4 0
g3 8 0
Let's say a k-means clustering, on this data, identifies 2 clusters with centroids locations as (2,0) and (10,0). what is the square error distortion of this clustering result ?
-----------------------
Question 3
How many leaf nodes are there in the suffix tree of the text "AAAGG" ?
5
7
4
-------------------------
Question 4
The following table indicates the expression of 7 genes (rows) at two different time points (columns)
g1 2 2
g2 4 4
g3 6 6
g4 0 4
g5 4 0
g6 5 5
g7 9 9
In the first iteration of k-means three clusters are assigned as C1 = {g1,g2,g3}, C2 = {g4,g5}, and C3 ={g6,g7}. What are the centroids of clusters C1, C2, and C3 ?
(3,3), (2,2), and (7,7)
(4,4), (2,2), and (7,7)
(4,4), (2,2), and (8,8)
(4,4), (5,5), and (7,7)
1.50
1.67
1.00
2.50
Explanation / Answer
Answer :square error distortion =8
Leaf nodes will be 5
Question4.
Centroids clusters are
C1=(g1,g2,g3)=2+4+6/3,2+4+6/3)(0+4/2,0+4/2)(5+9/2,5+9/2)
=(4,4)(2,2)(7,7) option b
Related Questions
drjack9650@gmail.com
Navigate
Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.