Question 1 please show work and explanation for answers found Which of the foll
Question
Question 1: please show work and explanation for answers found
Which of the following reasons is responsible for the increase in the use of data-mining techniques in business?
The dearth of information to analyze and interpret
The lack of methods to electronically track data
The ability to manually analyze all the data
The ability to electronically warehouse data
Question 2: please show work and explanation for answers found
Observation refers to the:
goal of predicting a categorical outcome based on a set of variables
estimated continuous outcome variable
set of recorded values of variables associated with a single entity
mean of all variable values associated with one particular entity
Question 3: please show work and explanation for answers found
In which of the following data-mining process steps is the data manipulated to make it suitable for formal modeling?
Data preparation
Model construction
Data sampling
Model assessment
Question 4: please show work and explanation for answers found
The process of reducing the number of variables to consider in a data-mining approach without losing any crucial information is termed _____
data augmentation
dimension reduction
aggregation
data sampling
Question 5: please show work and explanation for answers found
Which of the following is true of bottom-up hierarchical clustering?
It starts with each observation in its own cluster and then iteratively combines the two most similar clusters
All observations are put in a mega-cluster to begin with
At the end of the process, observations in the same cluster have maximum distance
Each of the large clusters is broken down iteratively
Question 6: please show work and explanation for answers found
The simplest measure of similarity between observations consisting solely of categorical variables is given by _____
the Euclidean distance
the standardized Euclidean distance
matching coefficient
Jaccard's coefficient
Question 7: please show work and explanation for answers found
Single linkage measures dissimilarity between two clusters by:
considering the average dissimilarity over all pairs of observations between these clusters
considering only the two most distant observations in these clusters
considering only the two closest observations in these clusters
considering the distance between the cluster centroids
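The four dissimilarity definitions listed in these options can be compared side by side with a minimal sketch. The two clusters below are hypothetical points chosen only for illustration; they do not come from the question. Single linkage is the variant that uses only the two closest observations.

```python
import math
from itertools import product

def single_linkage(a, b):
    # Distance between the two CLOSEST observations, one from each cluster.
    return min(math.dist(p, q) for p, q in product(a, b))

def complete_linkage(a, b):
    # Distance between the two MOST DISTANT observations.
    return max(math.dist(p, q) for p, q in product(a, b))

def average_linkage(a, b):
    # Average distance over all pairs of observations between the clusters.
    distances = [math.dist(p, q) for p, q in product(a, b)]
    return sum(distances) / len(distances)

def centroid_linkage(a, b):
    # Distance between the cluster centroids (per-variable means).
    centroid_a = [sum(col) / len(col) for col in zip(*a)]
    centroid_b = [sum(col) / len(col) for col in zip(*b)]
    return math.dist(centroid_a, centroid_b)

# Hypothetical clusters, for illustration only.
cluster_a = [(0.0, 0.0), (1.0, 0.0)]
cluster_b = [(3.0, 0.0), (5.0, 0.0)]

print(single_linkage(cluster_a, cluster_b))    # closest pair: (1, 0) and (3, 0)
print(complete_linkage(cluster_a, cluster_b))  # farthest pair: (0, 0) and (5, 0)
print(average_linkage(cluster_a, cluster_b))
print(centroid_linkage(cluster_a, cluster_b))
```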
Question 8: please show work and explanation for answers found
_____ is the vector of the averages computed for each variable across all cluster observations
Euclidean distance
Matching coefficient
Centroid
Jaccard's coefficient
Question 9: please show work and explanation for answers found
The lift ratio of an association rule with a confidence value of 0.43 and in which the consequent occurs in 2 out of 10 cases is
1
0.72
2.15
1.75
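The work for this question can be sketched as follows, assuming the usual definition of the lift ratio as the rule's confidence divided by the support of the consequent (the fraction of all cases in which the consequent occurs). Here that is 0.43 / (2/10) = 2.15. The function name `lift_ratio` is illustrative.

```python
def lift_ratio(confidence, consequent_support):
    # Lift ratio = confidence of the rule / support of the consequent.
    return confidence / consequent_support

confidence = 0.43            # given in the question
consequent_support = 2 / 10  # consequent occurs in 2 out of 10 cases

lift = lift_ratio(confidence, consequent_support)
print(round(lift, 2))
```

A lift ratio above 1 suggests the antecedent makes the consequent more likely than it is in the data overall.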
Question 10: please show work and explanation for answers found
Exhibit 4-1. A large data set on Toledo workers was collected and the first three workers are characterized by:

Worker  Age  Hourly Wage  Female  Union  High School
1       33   $20          0       0      1
2       30   $24          0       1      1
3       36   $16          1       0      0

where Female = 1 if a worker is female, Female = 0 otherwise;
Union = 1 if a worker belongs to a union, Union = 0 otherwise;
High School = 1 if a worker has a high school degree, High School = 0 otherwise.
For the entire data set, the average age is 32, the standard deviation of the age is 8, the average hourly wage is $18, and the standard deviation of the hourly wage is $5.
Refer to Exhibit 4-1. What is the Euclidean distance for Workers 1 and 3 based on Age, Hourly Wage, and gender?
5.000
4.050
2.050
5.099
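A minimal sketch of the calculation, assuming the unstandardized values from Exhibit 4-1 are used directly and gender is represented by the Female indicator. The differences are 3 (Age), 4 (Hourly Wage), and 1 (Female), so the distance is the square root of 9 + 16 + 1 = 26, or about 5.099.

```python
import math

# (Age, Hourly Wage, Female) taken from Exhibit 4-1.
worker1 = (33, 20, 0)
worker3 = (36, 16, 1)

# Euclidean distance: square root of the sum of squared differences.
squared_differences = [(a - b) ** 2 for a, b in zip(worker1, worker3)]
distance = math.sqrt(sum(squared_differences))

print(squared_differences)
print(round(distance, 3))
```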
Question 11: please show work and explanation for answers found
Refer to Exhibit 4-1. What is the centroid of the cluster consisting of Workers 1 and 3 based on Age and Hourly Wage?
(33, $20)
(31.5, $22)
(32, $22)
(34.5, $18)
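The centroid is the per-variable average over the observations in the cluster, so the work here can be sketched as below. The helper name `centroid` is illustrative; the values come from Exhibit 4-1. The averages are (33 + 36) / 2 = 34.5 for Age and (20 + 16) / 2 = 18 for Hourly Wage.

```python
def centroid(cluster):
    # Average each variable (column) across all observations in the cluster.
    return tuple(sum(column) / len(column) for column in zip(*cluster))

# (Age, Hourly Wage) for Workers 1 and 3 from Exhibit 4-1.
cluster_1_3 = [(33, 20), (36, 16)]

center = centroid(cluster_1_3)
print(center)
```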
Question 12: please show work and explanation for answers found
Refer to Exhibit 4-1. What is the normalized (standardized) Euclidean distance for Workers 1 and 2 based on Age and Hourly Wage?
2.392
0.884
0.251
1.767
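One way to show the work, assuming each variable is standardized as a z-score with the data-set mean and standard deviation given in Exhibit 4-1 (Age: 32 and 8; Hourly Wage: 18 and 5). The standardized values are (0.125, 0.4) for Worker 1 and (-0.25, 1.2) for Worker 2, so the distance is the square root of 0.375² + 0.8² = 0.780625, or about 0.884.

```python
import math

# Data-set statistics from Exhibit 4-1: (mean, standard deviation).
AGE_STATS = (32, 8)
WAGE_STATS = (18, 5)

def standardize(age, wage):
    # z-score = (value - mean) / standard deviation, per variable.
    z_age = (age - AGE_STATS[0]) / AGE_STATS[1]
    z_wage = (wage - WAGE_STATS[0]) / WAGE_STATS[1]
    return (z_age, z_wage)

z_worker1 = standardize(33, 20)
z_worker2 = standardize(30, 24)

# Euclidean distance computed on the standardized values.
standardized_distance = math.dist(z_worker1, z_worker2)

print(z_worker1, z_worker2)
print(round(standardized_distance, 3))
```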
Question 13: please show work and explanation for answers found
Refer to Exhibit 4-1. What is the centroid of the cluster consisting of Workers 1 and 2 based on normalized (standardized) data on Age and Hourly Wage?
(-0.0625, 0.8)
(0.884, 1.2)
(-0.25, 0.8)
(0.125, 0.4)
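A sketch of this calculation, again assuming z-score standardization with the means and standard deviations from Exhibit 4-1. Averaging the standardized values gives (0.125 + (-0.25)) / 2 = -0.0625 for Age and (0.4 + 1.2) / 2 = 0.8 for Hourly Wage. The helper names are illustrative.

```python
def z_score(value, mean, std_dev):
    # Standardize a single value.
    return (value - mean) / std_dev

def centroid(cluster):
    # Average each variable (column) across all observations in the cluster.
    return tuple(sum(column) / len(column) for column in zip(*cluster))

# Standardized (Age, Hourly Wage) for Workers 1 and 2 from Exhibit 4-1.
# Age: mean 32, standard deviation 8. Hourly Wage: mean 18, standard deviation 5.
z_worker1 = (z_score(33, 32, 8), z_score(20, 18, 5))
z_worker2 = (z_score(30, 32, 8), z_score(24, 18, 5))

z_centroid = centroid([z_worker1, z_worker2])
print(tuple(round(v, 4) for v in z_centroid))
```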
Question 14: please show work and explanation for answers found
Refer to Exhibit 4-1. What is the matching coefficient for Workers 1 and 3 based on Female, Union, and High School?
1
1/3
2/3
0
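The matching coefficient is the number of variables on which the two observations agree divided by the total number of variables. A minimal sketch with the Exhibit 4-1 values follows; the function name is illustrative. Workers 1 and 3 agree only on Union (both 0), which gives 1 match out of 3 variables.

```python
from fractions import Fraction

def matching_coefficient(u, v):
    # Count variables with matching values (both 1 or both 0),
    # then divide by the total number of variables.
    matches = sum(1 for a, b in zip(u, v) if a == b)
    return Fraction(matches, len(u))

# (Female, Union, High School) from Exhibit 4-1.
worker1 = (0, 0, 1)
worker3 = (1, 0, 0)

coefficient = matching_coefficient(worker1, worker3)
print(coefficient)  # prints 1/3
```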
Question 15: please show work and explanation for answers found
Refer to Exhibit 4-1. What is Jaccard’s coefficient for Workers 2 and 3 based on Female, Union, and High School?
1/3
2/3
1/2
0
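Jaccard's coefficient ignores variables on which both observations are 0: it divides the number of variables where both are 1 by the number of variables that are not 0-0 matches. A sketch with the Exhibit 4-1 values follows. Workers 2 and 3 share no variable where both equal 1 and have no 0-0 matches, so the result is 0 / 3 = 0. The function name is illustrative, and returning 0 when every variable is a 0-0 match is an assumption made only to avoid dividing by zero.

```python
from fractions import Fraction

def jaccard_coefficient(u, v):
    # Numerator: variables where both observations equal 1.
    both_one = sum(1 for a, b in zip(u, v) if a == 1 and b == 1)
    # Denominator: all variables except those where both equal 0.
    both_zero = sum(1 for a, b in zip(u, v) if a == 0 and b == 0)
    denominator = len(u) - both_zero
    if denominator == 0:
        # Assumption: treat the undefined all-zero case as 0.
        return Fraction(0)
    return Fraction(both_one, denominator)

# (Female, Union, High School) from Exhibit 4-1.
worker2 = (0, 1, 1)
worker3 = (1, 0, 0)

jaccard = jaccard_coefficient(worker2, worker3)
print(jaccard)  # prints 0
```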
Explanation / Answer
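Question 1) Which of the following reasons is responsible for the increase in the use of data-mining techniques in business?
Answer: The ability to electronically warehouse data. Data mining depends on having large volumes of stored data to analyze, so a dearth of information, a lack of tracking methods, or manual analysis would not drive its growth.
Question 2) Observation refers to the:
Answer: set of recorded values of variables associated with a single entity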