1. A. Given the data set below, apply the k-Nearest Neighbor algorithm to classi

ID: 3844432 • Letter: 1

Question

1. A. Given the data set below, apply the k-Nearest Neighbor algorithm to classify the test data for k=1 and k=3. Use the Euclidean distance metric.

Training Set

true label

0.453705

-0.0106

3.258589

0.169734

3.184656

-0.83691

-0.42561

1.385033

0.658765

-1.87715

-0.40507

-1.9574

-4.52775

4.123102

2.538689

-1.5386

-1.04649

-3.59664

2.967113

0.505111

Testing Set

true label

predicted label

-4.69237

-4.77898

-2.1147

-1.81277

4.277164

-4.83136

-1.33862

-0.93995

-4.02728

-4.96129

4.968125

3.757161

-2.19987

-3.48712

2.849136

-3.33965

-4.30273

2.530094

4.690116

-0.36379

B. Compute the confusion matrix, accuracy, precision, recall, and F1 measures given your answers to problem 1.

· C. Assume you have the data set given below, which provides hypothetical examples of instances when people did or did not get hired for a job. It consists of three categorical attributes and a label that indicates "hired" or "not hired". Using this data, induce a decision tree using information gain for splitting the nodes, showing the calculations at each step.

Training Set

Experience (EXP)

Sufficient Qualifications? (QUAL)

Opinions of References (REFOP)

true label

good

Yes

favorable

excellent

Yes

favorable

none

favorable

good

not favorable

good

Yes

not favorable

excellent

Yes

not favorable

excellent

Yes

favorable

good

Yes

favorable

none

Yes

favorable

none

Yes

not favorable

Training Set

true label

0.453705

-0.0106

3.258589

0.169734

3.184656

-0.83691

-0.42561

1.385033

0.658765

-1.87715

-0.40507

-1.9574

-4.52775

4.123102

2.538689

-1.5386

-1.04649

-3.59664

2.967113

0.505111

Explanation / Answer

Solution :-

General type of syntax is as follows:-

label = predict(Mdl,X)

[label,score,cost] = predict(Mdl,X)

Based on above syntax, we will now fill the below predicted label and it is also based on k = 1 and 3

Testing Set # x1 x2 true label predicted label 11 -4.69237 -4.77898 1 1 12 -2.1147 -1.81277 0 1 13 4.277164 -4.83136 1 0 14 -1.33862 -0.93995 0 0 15 -4.02728 -4.96129 1 0 16 4.968125 3.757161 1 0 17 -2.19987 -3.48712 0 1 18 2.849136 -3.33965 0 1 19 -4.30273 2.530094 1 1 20 4.690116 -0.36379 1 0

Navigate

1. A. Economics different than other social sciences. Explain why using John May

1. A. HCl does not appear in the equilibrium reaction between barium ion and chr

Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.

1. A. Given the data set below, apply the k-Nearest Neighbor algorithm to classi

Question

Explanation / Answer

Related Questions

Navigate