For this week’s assignment, you will complete the following tasks: Define each o
ID: 427337 • Letter: F
Question
For this week’s assignment, you will complete the following tasks:
Define each of these terms: Normal Distribution, Standard Normal Distribution, z-Score, Central Limit Theorem, Exponential Distribution, Sampling Distribution, Standard Error of the Mean
Explain the Empirical Rule and its use.
Complete the following: A new college is looking for housing for their potential students. Below is a table for how many persons a sample of potential buildings can house. Using this table, calculate the sample mean and sample standard deviation for maximum capacity of for these venues. Looking at these values, about 68% fall between ___ and ___, about 95% fall between ___ and ____, and about 99% fall between ___ and ___.
50
45
65
43
63
23
55
44
34
54
65
34
34
65
66
34
34
54
55
35
63
34
23
23
26
45
33
35
67
68
43
63
23
55
54
59
50
58
58
57
65
66
34
34
55
30
26
45
33
47
38
26
45
33
67
43
63
23
55
48
37
39
40
30
39
65
66
34
34
49
38
38
59
26
45
33
40
45
47
55
4. Create two different graphs using Excel to display this data. One must be the normal distribution curve
5. Write a scenario that this data may represent. It does not have to be a factual scenario.
50
45
65
43
63
23
55
44
34
54
65
34
34
65
66
34
34
54
55
35
63
34
23
23
26
45
33
35
67
68
43
63
23
55
54
59
50
58
58
57
65
66
34
34
55
30
26
45
33
47
38
26
45
33
67
43
63
23
55
48
37
39
40
30
39
65
66
34
34
49
38
38
59
26
45
33
40
45
47
55
Explanation / Answer
The central limit theorem states that if a large enough sample is taken (typically n > 30) then the sampling distribution of x¯x is approximately a normal distribution with a mean of ? and a standard deviation of ?n??n. Since in practice we usually do not know ? or ? we estimate these by x¯x and sn?sn respectively. In this case s is the estimate of ? and is the standard deviation of the sample. The expression sn?sn is known as the standard error of the mean, labeled SE(x¯x)
Simulation: Generate 500 samples of size heights of 4 men. Assume the distribution of male heights is normal with mean ? = 70" and standard deviation ? = 3.0". Then find the mean of each of 500 samples of size 4.
Here are the first 10 sample means:
70.4 72.0 72.3 69.9 70.5 70.0 70.5 68.1 69.2 71.8
Theory says that the mean of ( x¯x ) = ? = 70 which is also the Population Mean and SE(x¯)=?n?=34?=1.50SEx?n341.50
Simulation shows: Average (500 x¯x's) = 69.957 and SE(of 500 x¯x's) = 1.496
Change the sample size from n = 4 to n = 25 and get descriptive statistics:
Theory says that the mean of ( x¯x) = ? = 70 which is also the Population Mean and SE(x¯)=?n?=325?=0.60SEx?n3250.60
Simulation shows: Average (500 x¯x's) = 69.983 and SE(of 500 x¯x's) = 0.592
Sampling Distribution of Sample Mean x¯x from a Non-Normal Population
Simulation: Below is a Histogram of Number of Cds Owned by PSU Students. The distribution is strongly skewed to the right.
Assume the Population Mean Number of CDs owned is ? = 84 and ? = 96
Let's obtain 500 samples of size 4 from this population and look at the distribution of the 500 x-bars:
Theory says that the mean of ( x¯x) = ? = 84 which is also the Population Mean the SE(x¯)=48=964?SEx48964
Simulation shows Average(500 x¯x's) = 81.11 and SE(500 x¯x's for samples of size 4) = 45.1
Change the sample size from n = 4 to n = 25 and get descriptive statistics and curve:
Theory says that the mean of ( x¯x) = ? = 84 which is also the Population Mean and the SE(x¯)=9625?=19.2SEx962519.2 Simulation shows Average(500 x¯x's) = 83.281 and SE(500 x¯x's for samples of size 25) = 18.268. A histogram of the 500 x¯x's computed from samples of size 25 is beginning to look a lot like a normal curve.
i. The Law of Large Numbers says that as the sample size increases the sample mean will approach the population mean.
ii. The Central Limit Theorem says that as the sample size increases the sampling distribution of X¯X (read x-bar) approaches the normal distribution. We see this effect here for n = 25. Generally, we assume that a sample size of n = 30 is sufficient to get an approximate normal distribution for the distribution of the sample mean.
iii. The Central Limit Theorem is important because it enables us to calculate probabilities about sample means.
Example. Find the approximate probability that the average number of CDs owned when 100 students are asked is between 70 and 90.
Solution. Since the sample size is greater than 30, we assume the sampling distribution of x¯x is about normal with mean ? = 84 and SE(x¯)=?n?=96100?=9.6SEx?n961009.6. We are asked to find Prob( 70 < X¯X < 90). The z-scores for the two values are
for 90: z = (90 - 84)/ 9.6 = 0.625 and for 70: z = (70-84)/9.6 = -1.46. From tables of the normal distribution we get P( -1.46 < Z < 0.625) = .734 - .072 = .662.
Suppose the sample size was 1600 instead of 100. Then the distribution of x¯x would be about normal with mean 84 and standard deviation ?n?=961600?=9640=2.4?n96160096402.4. From the empirical rule we know that almost all x-bars for samples of size 1600 will be in the interval 84 ± (3)(2.4) or in the interval 84 ± 7.2 or between 76.8 and 91.2. The Law of Large Numbers says that as we increase the sample size the probability that the sample mean approaches the population mean is 1.00!
‹ 4.1 - Sampling Distributions for Sample Proportion, p-hatup4.3 - Review of Sampling Distributions ›
Printer-friendly version
Related Questions
drjack9650@gmail.com
Navigate
Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.