Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

TEXT ANALYTICS 1. PLSA is a generalization of the two component mixture model to

ID: 3864139 • Letter: T

Question

TEXT ANALYTICS

1. PLSA is a generalization of the two component mixture model to discover more than one topic from text data. (T or F?)

2. Single-link merges the two clusters with the smallest minimum distance. This results

in “looser” clusters, since we only need to find two individually close elements in each

cluster in order to perform the merge. (T or F?)

3. Complete-link merges the two clusters with the smallest maximum distance between

elements. This results in very “tight” and “compact” clusters since the cluster diameter

is kept small (i.e., the average distance between all elements low). (T or F?)

Explanation / Answer

Answers are

1) True, PLSA is a generalistion of the simple two componet mixture model to morethan two components.

2) True, Yes in single link we can merge two clusters whose two close set members have the smallest distance.

3) True, Complete-link merges the two clusters with the smallest maximum distance between
elements