Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

1) a worksheet labeled ‘ Scatterplot ’containing a labeled (males and females in

ID: 3170278 • Letter: 1

Question

1)a worksheet labeled ‘Scatterplot’containing a labeled (males and females in different colors) scatterplot similar to the one you created in the Learn by Doing on p. 51 of the OLI module, but this one will use the attached dataset height.weight.S17.xls. Make sure your title and axis labels are appropriate. If necessary, change the range of the x-axis so the data points are not all “scrunched” up. Don’t forget title and axis labels.

Dataset Note: gender: 1 = male, 2 = female

height: in inches

weight: in pounds

2) a worksheet labeled ‘Measures’ containing your answer to the following question:

What determines which numerical measures of center and spread are appropriate for describing a distribution of a quantitative variable? Which measures will you use in each case?

Hints:

The question is asking about numerical measures of both center and spread. Make sure you know what the possible measures of center and the possible measures of spread are.

It is also asking about how you would decide which measures are appropriate for a distribution, and what your decision would be in different cases.

Make sure your answer addresses all parts of the question.

Answer:

Every distribution graph is setup with four parts that we usually see at the center of the graph which are variability, shape, outliers and finally clusters. In fact, there are two important features along the distribution of a quantitative variable can be described using numerical measures. For instance, the measures of center use the mean and median. Where the range and Inter-Quarrel Range and standard deviation help to enumerate the variability of a distribution of measures of a range. Whereas the shape of the distribution is what determines what measures of center and spread are appropriate to apply. By describing the center, we would use the mean if the distribution is symmetrical and if the graph is skewed, then we would use the median. Nevertheless, for the spread, and for a skewed spread would use the IQR. Lastly, for the symmetrical spread then would use the standard deviation.

Please let me know if I answer these questions correctly. Thank You!

Gender Height Weight 1 66.00 140.00 1 66.00 135.00 1 66.00 135.00 1 66.00 130.00 1 67.00 145.00 1 67.00 140.00 1 67.00 123.00 1 68.00 155.00 1 68.00 150.00 1 68.00 145.00 1 68.00 155.00 1 69.00 155.00 1 69.00 175.00 1 69.00 170.00 1 69.00 160.00 1 69.00 150.00 1 69.00 136.00 1 69.50 150.00 1 70.00 153.00 1 70.00 157.00 1 70.00 130.00 1 70.00 155.00 1 70.00 150.00 1 71.00 138.00 1 71.00 170.00 1 71.00 170.00 1 71.00 155.00 1 71.00 150.00 1 71.00 140.00 1 71.50 164.00 1 72.00 145.00 1 72.00 150.00 1 72.00 195.00 1 72.00 155.00 1 72.00 175.00 1 72.00 215.00 1 72.00 180.00 1 72.00 142.00 1 73.00 190.00 1 73.00 165.00 1 73.00 170.00 1 73.00 155.00 1 73.00 155.00 1 73.00 180.00 1 73.00 155.00 1 73.50 160.00 1 73.50 155.00 1 74.00 190.00 1 74.00 160.00 1 74.00 180.00 1 74.00 190.00 1 74.00 148.00 1 75.00 185.00 1 75.00 160.00 1 75.00 190.00 2 61.00 100.00 2 61.75 108.00 2 62.00 131.00 2 62.00 120.00 2 62.00 108.00 2 62.00 110.00 2 62.75 112.00 2 63.00 121.00 2 63.00 118.00 2 63.00 116.00 2 63.00 95.00 2 64.00 102.00 2 64.00 125.00 2 65.00 135.00 2 65.00 118.00 2 65.00 122.00 2 65.00 115.00 2 65.50 120.00 2 66.00 120.00 2 66.00 130.00 2 66.00 130.00 2 66.00 125.00 2 67.00 125.00 2 67.00 115.00 2 67.00 150.00 2 68.00 130.00 2 68.00 138.00 2 68.00 116.00 2 68.00 125.00 2 68.00 110.00 2 68.00 133.00 2 69.00 145.00 2 69.00 150.00 2 69.00 150.00 2 70.00 125.00 Height and Weight Based on Gender 230.00 210.00 190.00 170.00 150.00 130.00 110.00 90.00 70.00 70.00 80.00 60.00 65.00 75.00 Height Male Female

Explanation / Answer

In addition to it, mean is used as a measure of central tendency, when the variable is measured at interval-ratio (except when the variable is highly skewed), on eis interested to report the typical score, and at the same time is looking for further analysis.

Median is used when the variable is measured at the ordinal level, and when on eis interested to report the central score, as median always lies at the center of the distribution.

Mode is used when the variable is measured at the nominal level, and wants to report the most common score.