Question
1)Classify the following attributes as binary, discrete, or continuous. Also classify them as qualitative (nominal or ordinal) or quantitative (interval or ratio). Some cases may have more than one interpretation, so briefly indicate your reasoning if you think there may be some ambiguity. Example: Age in years. Answer: Discrete, quantitative, ratio
a)Time in terms of AM or PM.
b)Brightness as measured by a light meter.
c)Brightness as measured by people's judgments.
d)Angles as measured in degrees between 0 and 360.
e)Bronze, Silver, and Gold medals as awarded at the Olympics.
f)Height above sea level.
g)Number of patients in a hospital.
h)ISBN numbers for books. (Look up the format on the Web.)
i)Ability to pass light in terms of the following values: opaque, translucent, transparent.
j)Military rank.
k)Distance from the center of campus.
l)Density of a substance in grams per cubic centimeter.
m)Coat check number. (When you attend an event, you can often give your coat to someone who, in turn, gives you a number that you can use to claim your coat when you leave.)
2)Can you think of a situation in which identification numbers would be useful for prediction?
3)An educational psychologist wants to use association analysis to analyze test results. The test consists of 100 questions with four possible answers each.
a)How would you convert this data into a form suitable for association analysis?
b)In particular, what type of attributes would you have and how many of them are there?
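One common way to prepare such data for association analysis is to turn every (question, answer) pair into its own binary attribute, which gives 100 × 4 = 400 binary attributes, exactly 100 of which are 1 for any student who answers every question. The sketch below illustrates that encoding; the function name `to_binary_attributes`, the choice labels A to D, and the attribute naming scheme `Q1=A` are assumptions made for illustration, not part of the exercise.

```python
def to_binary_attributes(answers, choices="ABCD"):
    """One-hot encode answers: one binary attribute per (question, choice) pair."""
    row = {}
    for q, given in enumerate(answers, start=1):
        for c in choices:
            # The attribute is 1 only for the choice the student actually selected.
            row[f"Q{q}={c}"] = 1 if given == c else 0
    return row


# Hypothetical student who answers "A" on all 100 questions.
student = to_binary_attributes(["A"] * 100)
print(len(student))           # number of binary attributes
print(sum(student.values()))  # number of attributes set to 1
```

Only the attributes set to 1 carry information here, which is why such attributes are usually treated as asymmetric binary attributes.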
4)Which of the following quantities is likely to show more temporal autocorrelation: daily rainfall or daily temperature? Why?
5)Many sciences rely on observation instead of (or in addition to) designed experiments. Compare the data quality issues involved in observational science with those of experimental science and data mining.
6)Discuss the difference between the precision of a measurement and the terms single and double precision, as they are used in computer science, typically to represent floating-point numbers that require 32 and 64 bits, respectively.
7)Give at least two advantages to working with data stored in text files instead of in a binary format.
8)Distinguish between noise and outliers. Be sure to consider the following questions.
a)Is noise ever interesting or desirable? Outliers?
b)Can noise objects be outliers?
c)Are noise objects always outliers?
d)Are outliers always noise objects?
e)Can noise make a typical value into an unusual one, or vice versa?
9)For the following vectors, x and y, calculate the indicated similarity or distance measures.
(a) x : (0,0,1,1), y : (2,2,2,2) cosine, correlation, Euclidean
(b) x : (0,1,0,1), y : (0,1,0,1) cosine, correlation, Euclidean, Jaccard
(c) x : (1,1,0,1), y : (-1,0,-1,0) cosine, correlation, Euclidean
(d) x : (1,0,0,1,0,1), y : (0,1,1,0,0,1) cosine, correlation, Jaccard
(e) x : (2,1,0,2,0,3), y : (1,1,1,0,0,1) cosine, correlation
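The measures named above can be sketched in plain Python as follows. This is a minimal illustration using the standard textbook definitions; the function names are assumptions, and returning NaN when correlation is undefined (one vector has zero variance, as with y in part (a)) is a design choice rather than a requirement of the exercise.

```python
import math


def cosine(x, y):
    """Cosine similarity: dot product divided by the product of the vector lengths."""
    dot = sum(a * b for a, b in zip(x, y))
    nx = math.sqrt(sum(a * a for a in x))
    ny = math.sqrt(sum(b * b for b in y))
    return dot / (nx * ny)


def correlation(x, y):
    """Pearson correlation; NaN if either vector has zero variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return math.nan
    return cov / (sx * sy)


def euclidean(x, y):
    """Euclidean (L2) distance."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def jaccard(x, y):
    """Jaccard similarity for binary vectors: f11 / (f01 + f10 + f11)."""
    f11 = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)
    not_both_zero = sum(1 for a, b in zip(x, y) if a == 1 or b == 1)
    return f11 / not_both_zero if not_both_zero else math.nan


# Part (b): identical vectors, so similarity is maximal and distance is zero.
x, y = (0, 1, 0, 1), (0, 1, 0, 1)
print(round(cosine(x, y), 4), correlation(x, y), euclidean(x, y), jaccard(x, y))
```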
10)This exercise compares and contrasts some similarity and distance measures. For binary data, the L1 distance corresponds to the Hamming distance; that is, the number of bits that are different between two binary vectors. The Jaccard similarity is a measure of the similarity between two binary vectors. Compute the Hamming distance and the Jaccard similarity between the following two binary vectors.
x: 0111010101
y : 0110011010
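As a rough sketch of how the two measures differ, the snippet below computes both for the bit strings exactly as printed above. The function names `hamming` and `jaccard_bits` are illustrative assumptions. Hamming distance counts every position where the bits differ, while Jaccard similarity ignores positions where both bits are 0.

```python
def hamming(x, y):
    """Number of positions at which two equal-length bit strings differ."""
    return sum(1 for a, b in zip(x, y) if a != b)


def jaccard_bits(x, y):
    """Jaccard similarity: 1-1 matches divided by positions that are not 0-0."""
    f11 = sum(1 for a, b in zip(x, y) if a == "1" and b == "1")
    not_both_zero = sum(1 for a, b in zip(x, y) if a == "1" or b == "1")
    return f11 / not_both_zero


x = "0111010101"
y = "0110011010"
print(hamming(x, y))       # 5
print(jaccard_bits(x, y))  # 0.375, i.e. 3/8
```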
Explanation / Answer
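Answer (8):
a) No. Noise is by definition an undesirable, unwanted distortion of the signal, so it is not interesting in itself. Outliers, however, can be interesting, for example when detecting fraud.
b) Yes. Random distortion of the data can push an object far from the typical values, so a noise object can appear as an outlier.
c) No. Noise can also produce objects or values that look entirely normal, so not every noise object is an outlier.
d) No. An outlier can be a legitimate data point that holds a true signal, such as a genuinely unusual object.
e) Yes. Noise can distort a typical value into an unusual one, and the reverse is also possible: noise can distort an unusual value so that it looks typical.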
Answer (7):
1. Operating systems such as Unix and Linux use plain text far more often than binary formats, so text files can be processed with the standard tools those systems already provide, without a program written for one specific binary format.
2. Text files are human-readable. As in Solaris, plain text serves as a human interface to the data, so a person can read and modify it directly.
Answer (2):
1. Yes. In a public transportation system, identification numbers can be used to predict the revenue generated in a month. For example, if an employee travels by public transportation five days a week, that rider's identification number lets us track the trips and estimate the rider's future contribution to monthly revenue.
2. Identification numbers can also be used to predict how many users will take a more direct route.
3. Identification numbers can also be used to track how many people from each country or city have used public transportation.
Answer (6): The precision of a measurement describes how close two or more repeated measurements are to each other, and it is independent of accuracy. For example, if you weigh an object 10 times and get the same weight every time, the measurement is precise, but it may still be wrong.
Single precision, in contrast, refers to a floating-point value stored in a 32-bit (4-byte) binary format with 24 bits of precision.
Double precision refers to a floating-point value stored in a 64-bit (8-byte) binary format with 53 bits of precision.
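The two meanings of "precision" can be illustrated with a short sketch. The readings and the true mass of 100 g are hypothetical values chosen for illustration, and `to_float32` is an assumed helper name. The helper uses the standard `struct` module to round a Python float (a double on typical platforms) to the nearest single-precision value.

```python
import statistics
import struct
import sys

# Measurement precision: hypothetical repeated weighings (grams) of an object
# whose true mass is assumed to be 100 g.
readings = [102.1, 102.0, 102.1, 102.0, 102.1]
spread = statistics.stdev(readings)        # small spread: the scale is precise
bias = statistics.mean(readings) - 100.0   # large offset: the scale is inaccurate
print(f"spread={spread:.3f} g, bias={bias:.2f} g")


def to_float32(value):
    """Round a Python float to the nearest IEEE 754 single-precision value."""
    return struct.unpack("<f", struct.pack("<f", value))[0]


# Floating-point precision: the same decimal value stored in 32 and 64 bits.
single = to_float32(0.1)
double = 0.1
print(repr(single))             # differs from 0.1 after about 8 significant digits
print(repr(double))
print(sys.float_info.mant_dig)  # significand bits of a Python float (53 for doubles)
```

The first half concerns the repeatability of a measuring process, while the second concerns how many significand bits are available to represent a number, whatever its origin.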