Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

Question 1 please show work and explantation for answers found Which of the foll

ID: 3180037 • Letter: Q

Question

Question 1 please show work and explantation for answers found

Which of the following reasons is responsible for the increase in the use of data-mining techniques in business?

The dearth of information to analyze and interpret

The lack of methods to electronically track data

The ability to manually analyze all the data

The ability to electronically warehouse data

Queshtion 2 please show work and explantation for answers found

Observation refers to the:

goal of predicting a categorical outcome based on a set of variables

estimated continuous outcome variable

set of recorded values of variables associated with a single entity

mean of all variable values associated with one particular entity

Question 3 please show work and explantation for answers found

In which of the following data-mining process steps is the data manipulated to make it suitable for formal modeling?

Data preparation

Model construction

Data sampling

Model assessment

Question 4 please show work and explantation for answers found

The process of reducing the number of variables to consider in a data-mining approach without losing any crucial information is termed as _____

data augmentation

dimension reduction

aggregation

data sampling

Question 5 please show work and explantation for answers found

Which of the following is true of bottom-up hierarchical clustering?

It starts with each observation in its own cluster and then iteratively combine two most similar clusters

All observations are put in a mega-cluster to begin with

At the end of the process, observations in the same cluster have maximum distance

Each of the large clusters is broken down iteratively

Question 6 please show work and explantation for answers found

The simplest measure of similarity between observations consisting solely of categorical variables is given by _____

the Euclidean distance

the standardized Euclidean distance

matching coefficient

Jaccard's coefficient

Question 7 please show work and explantation for answers found

Single linkage measures dissimilarity between two clusters by:

considering the average dissimilarity over all pairs of observations between these clusters

considering only the two most distant observations in these clusters

considering only the two closest observations in these clusters

considering the distance between the cluster centroids

Question 8 please show work and explantation for answers found

_____ is the vector of the averages computed for each variable across all cluster observations

Euclidean distance

Matching coefficient

Centroid

Jaccard's coefficient

Question 9 please show work and explantation for answers found

The lift ratio of an association rule with a confidence value of 0.43 and in which the consequent occurs in 2 out of 10 cases is

1

0.72

2.15

1.75

Question 10 please show work and explantation for answers found

Exhibit 4-1. A large data set on Toledo workers was collected and the first three workers are characterized by

Worker

Age

Hourly Wage

Female

Union

High School

1

33

$20

0

0

1

2

30

$24

0

1

1

3

36

$16

1

0

0


where Female=1 if a worker is a female Female=0 otherwise

   Union=1 if a worker belongs to a union    Union=0 otherwise

   High school=1 if a worker has a high school degree High school=0 otherwise

For the entire data set the average age is 32, the standard deviation of the age is 8, the average hourly wage is $18, and the standard deviation of the hourly wage is $5

Refer to Exhibit 4-1. What is the Euclidean distance for Workers 1 and 3 based on Age, Hourly Wage, and gender?

5.000

4.050

2.050

5.099

Question 11 please show work and explantation for answers found

Refer to Exhibit 4-1. What is the centroid of the cluster consisting of Workers 1 and 3 based on Age and Hourly Wage?

(33, $20)

(31.5, $22)

(32, $22)

(34.5, $18)

Question 12 please show work and explantation for answers found

Refer to Exhibit 4-1. What is the normalized (standardized) Euclidean distance for Workers 1 and 2 based on Age and Hourly Wage?

2.392

0.884

0.251

1.767

Question 13 please show work and explantation for answers found

Refer to Exhibit 4-1. What is the centroid of the cluster consisting of Workers 1 and 2 based on normalized (standardized) data on Age and Hourly Wage?

(-0.0625, 0.8)

(0.884, 1.2)

(-0.25, 0.8)

(0.125, 0.4)

Question 14 please show work and explantation for answers found

Refer to Exhibit 4-1. What is the matching coefficient for Workers 1 and 3 based on Female, Union, and High School?

1

1/3

2/3

0

Question 15 please show work and explantation for answers found

Refer to Exhibit 4-1. What is Jaccard’s coefficient for Workers 2 and 3 based on Female, Union, and High School?

1/3

2/3

1/2

0

A.

The dearth of information to analyze and interpret

B.

The lack of methods to electronically track data

C.

The ability to manually analyze all the data

D.

The ability to electronically warehouse data

Explanation / Answer

Question 1 )Which of the following reasons is responsible for the increase in the use of data-mining techniques in business?

Answer : The dearth of information to analyze and interpret

Observation refers to the:

Answer : set of recorded values of variables associated with a single entity

Hire Me For All Your Tutoring Needs
Integrity-first tutoring: clear explanations, guidance, and feedback.
Drop an Email at
drjack9650@gmail.com
Chat Now And Get Quote