In class, we discussed the importance of the order of the data mining method. El
Question
In class, we discussed the importance of the order in which data mining methods are applied. Elaborate on the differences between running K-means

Say you were given an unlabelled data set of a company's website customers. Propose an experimental design to predict the loyalty of an unknown customer.

In class, we discussed two approaches for "pipelining" data mining techniques with PCA and K-means. One was to apply PCA first on the raw data and then apply K-means. The other was to apply K-means on the raw data and then apply PCA. While both approaches are valid, they give very different results. Explain the differences between the two approaches.

Explanation / Answer
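The two pipeline orders from the question can be compared on a toy data set. The sketch below is only an illustration under stated assumptions: the eight points, the starting indices in `init`, and the helper names `kmeans` and `pca_first_component_2d` are all made up for this example, and PCA is done with the closed-form angle for 2-D data rather than a library call. The points sit near the four corners of a wide rectangle, so the first principal component is the x-axis and the projection throws away the y separation.

```python
import math

def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, init_indices, max_iter=100):
    """Plain K-means; the initial centres are the points at init_indices."""
    centres = [points[i] for i in init_indices]
    k = len(centres)
    labels = None
    for _ in range(max_iter):
        # Assign every point to its nearest centre.
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centres[j]))
                      for p in points]
        if new_labels == labels:
            break
        labels = new_labels
        # Move every centre to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its old centre
                centres[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels

def pca_first_component_2d(points):
    """Project 2-D points onto their first principal component."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points) / n
    syy = sum((y - my) ** 2 for _, y in points) / n
    sxy = sum((x - mx) * (y - my) for x, y in points) / n
    # Angle of the major axis of a 2x2 covariance matrix.
    theta = 0.5 * math.atan2(2 * sxy, sxx - syy)
    ux, uy = math.cos(theta), math.sin(theta)
    return [((x - mx) * ux + (y - my) * uy,) for x, y in points]

# Hypothetical data: two points near each corner of a wide rectangle.
points = [(2.9, 1.0), (3.1, 1.0), (2.9, -1.0), (3.1, -1.0),
          (-2.9, 1.0), (-3.1, 1.0), (-2.9, -1.0), (-3.1, -1.0)]
init = [0, 3, 4, 7]  # same starting points for both pipelines

# Pipeline A: K-means on the raw data, PCA afterwards (e.g. for plotting).
labels_kmeans_first = kmeans(points, init)
projected = pca_first_component_2d(points)

# Pipeline B: PCA first, then K-means on the reduced data.
labels_pca_first = kmeans(projected, init)

print(labels_kmeans_first)  # [0, 0, 1, 1, 2, 2, 3, 3]
print(labels_pca_first)     # [0, 1, 0, 1, 2, 3, 2, 3]
```

With K-means first, the clusters are found using every feature and PCA only changes how they are displayed, so each corner gets its own cluster. With PCA first, K-means only sees the retained component, so points that differ only in the discarded direction (here y = 1 versus y = -1) end up in the same cluster.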
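K-means: a clustering algorithm that partitions the data into k clusters, assigning each point to the cluster with the nearest centre.
Lloyd's algorithm: the standard iterative procedure for K-means. It alternates between assigning each point to its nearest centre and moving each centre to the mean of its assigned points, so the within-cluster sum of squared distances never increases.
ANN algorithm: a K-means variant that uses approximate nearest neighbours, via a best-bin-first randomized KD-tree, to speed up finding the nearest cluster centre for each point.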
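Lloyd's algorithm from the list above can be sketched in a few lines. This is a minimal illustration, not a reference implementation: the function name `lloyd`, the six sample points, and the deliberately poor starting centres are assumptions chosen for the example. It records the within-cluster sum of squares after each iteration so the optimisation is visible.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def lloyd(points, initial_centres, max_iter=100):
    """Return (centres, labels, history); history holds the objective per iteration."""
    centres = list(initial_centres)
    k = len(centres)
    labels, history = None, []
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centre.
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centres[j]))
                      for p in points]
        if new_labels == labels:
            break  # no point changed cluster, so the algorithm has converged
        labels = new_labels
        # Update step: each centre moves to the mean of its assigned points.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its old centre
                centres[j] = tuple(sum(c) / len(members) for c in zip(*members))
        # Objective: within-cluster sum of squared distances.
        history.append(sum(sq_dist(p, centres[lab])
                           for p, lab in zip(points, labels)))
    return centres, labels, history

# Hypothetical data: two well-separated groups of three points.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1),
          (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]

# Both starting centres sit in the first group on purpose.
centres, labels, history = lloyd(points, [points[0], points[1]])

print(labels)   # [0, 0, 0, 1, 1, 1]
print(history)  # the objective shrinks from one iteration to the next
```

Even from a poor start the two groups are recovered here, but Lloyd's algorithm only finds a local optimum in general, so in practice it is usually run from several starting points.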