

ID: 3800976 • Letter: D

Question

Data Mining

Which of these is not normally found to be useful in outlier / anomaly detection?

1. Fuzzy-based techniques

2. Model-based techniques

3. Proximity-based techniques

4. Density-based techniques

If you were to attempt to mine a set of digitized Hollywood-produced films for major thematic references, what kind of data would you be working with from a data mining perspective?

1. Mixed format

2. Problematic

3. Complex

4. Visual

If we want to discover what set of games a given Facebook user is likely to choose to play:

1. We need to talk to the user

2. We need to perform a link prediction analysis, looking at the games the user currently plays or recently played in relation to the games the user’s Facebook friends currently play or recently played

3. We are embarking on an impossible task

4. We need to perform a cluster analysis of the games played by the user and the user’s friends, looking for outliers

Which is not a known cause of anomalies?

1. A data object in a group of data objects that does not belong to the same class as all the other objects

2. Data objects with one or more given attributes whose values differ significantly from the norm for that class of data objects

3. Data errors

4. Infrequent, random changes in the data caused by natural phenomena such as gamma rays, meteor strikes, etc.

Explanation / Answer

1a. Fuzzy-based techniques are not normally found to be useful in outlier/anomaly detection. The approaches usually listed for anomaly detection are model-based, proximity-based, and density-based. (Ans: 1st option)
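
To make one of the listed families concrete, here is a minimal sketch of a proximity-based detector. It is an illustration only, not a method prescribed by the question: the function name `knn_outlier_scores`, the sample points, and the choice of k = 2 are all assumptions. Each point is scored by the distance to its k-th nearest neighbor, so isolated points receive large scores.

```python
import math


def knn_outlier_scores(points, k=2):
    """Score each point by its distance to its k-th nearest neighbor.

    Hypothetical helper for illustration; larger score = more isolated.
    """
    scores = []
    for i, p in enumerate(points):
        # Distances from p to every other point, sorted ascending.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        scores.append(dists[k - 1])
    return scores


# Made-up data: four points close together and one far away.
points = [(1.0, 1.0), (1.2, 0.9), (0.9, 1.1), (1.1, 1.0), (8.0, 8.0)]
scores = knn_outlier_scores(points, k=2)

# The point with the largest score is the strongest outlier candidate.
most_anomalous = max(range(len(points)), key=lambda i: scores[i])
print(points[most_anomalous])  # (8.0, 8.0)
```

A density-based method would instead compare the local density around each point with that of its neighbors, and a model-based method would fit a statistical model and flag points the model considers unlikely.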

1b. If we were to attempt to mine a set of digitized Hollywood-produced films for major thematic references, we would be working with visual data from a data mining perspective. (Ans: 4th option)

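1c. If we want to discover what set of games a given Facebook user is likely to choose to play, we need to perform a link prediction analysis, relating the games the user plays or recently played to the games the user’s Facebook friends play or recently played. A cluster analysis looking for outliers would flag unusual play patterns rather than predict likely new choices. (Ans: 2nd option)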
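
The link prediction analysis described in option 2 of this question can be sketched roughly as follows. This is a simplified, assumed formulation: the function `predict_games`, the user names, and the game names are hypothetical, and the score is just a count of friends who play a game the user has not yet played. Real link prediction methods typically use richer scores, such as weighting by recency or by similarity between users.

```python
from collections import Counter


def predict_games(user, friends, plays, top_n=2):
    """Rank games the user has not played by how many friends play them.

    Hypothetical helper: predicts likely new user-to-game links from
    the existing friend and play relationships.
    """
    already = plays.get(user, set())
    counts = Counter()
    for friend in friends.get(user, ()):
        # Only games the user does not already play are candidate links.
        for game in plays.get(friend, set()) - already:
            counts[game] += 1
    return [game for game, _ in counts.most_common(top_n)]


# Made-up social graph and play history.
friends = {"ana": ["ben", "cho", "dev"]}
plays = {
    "ana": {"chess"},
    "ben": {"chess", "farm"},
    "cho": {"farm", "poker"},
    "dev": {"farm", "poker", "chess"},
}

print(predict_games("ana", friends, plays))  # ['farm', 'poker']
```

Here "farm" ranks first because three friends play it, and "poker" second because two do.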

1d. Infrequent, random changes in the data caused by natural phenomena such as gamma rays, meteor strikes, etc. are not a known cause of anomalies. (Ans: 4th option)