20
Introduction to Introduction to Probability Probability and Statistics and Statistics Chapter 7 Sampling Distributions

Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Embed Size (px)

Citation preview

Page 1: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Introduction to Probability Introduction to Probability and Statisticsand Statistics

Chapter 7

Sampling Distributions

Page 2: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

IntroductionIntroduction• Parameters are numerical descriptive measures

for populations.– For the normal distribution, the location and

shape are described by and – For a binomial population, the important

parameter is the binomial proportion p. • Often the values of parameters that specify the

exact form of a distribution are unknown. • You must rely on the sample to learn about

these parameters.

Page 3: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Sampling DistributionsSampling Distributions

• Numerical descriptive measures calculated from a sample are called statisticsstatistics.

• Statistics vary from sample to sample and hence are random variables.

• The probability distributions for statistics are called sampling distributionssampling distributions.

Page 4: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Possible samples3, 5, 23, 5, 13, 2, 15, 2, 1

Possible samples3, 5, 23, 5, 13, 2, 15, 2, 1

x

ExampleExample

Prob.

3.33 1/4

3 1/4

2 1/4

2.67 1/4

x

Population: 3, 5, 2, 1

Draw samples of size n = 3

without replacement

Population: 3, 5, 2, 1

Draw samples of size n = 3

without replacement

67.23/8

23/6

33/9

33.33/10

x

p(x)

1/4

2 3

75.24

1253

MeanPopulation

75.24

167.2

4

12

4

13

4

133.3)(

ofMean

xE

x

75.24

1253

MeanPopulation

75.24

167.2

4

12

4

13

4

133.3)(

ofMean

xE

x

Page 5: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

ExampleExampleToss a fair die n = 1 time. What is the distribution of the number x?

x p(x)

1 1/6

2 1/6

3 1/6

4 1/6

5 1/6

6 1/6Flat

Uniform

Page 6: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

ExampleExample

1 2 3 4 5 6

1 1 1.5 2 2.5 3 3.5

2 1.5 2 2.5 3 3.5 4

3 2 2.5 3 3.5 4 4.5

4 2.5 3 3.5 4 4.5 5

5 3 3.5 4 4.5 5 5.5

6 3.5 4 4.5 5 5.5 6

xToss a fair die n = 2 times. Sampling distribution of average on the two numbers?

Mound shaped

Page 7: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

ExampleExampleToss a fair coin n = 3 times. Sampling distribution of the average of the three numbers?

x

Approximately Normal

Page 8: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

ExampleExample

Page 9: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Central Limit Theorem: If random samples of n observations are drawn from a non-normal population with mean and standard deviation, then, when n is large, the sampling distribution of the sample mean is approximately normally distributed, with mean and standard deviation . The approximation becomes more accurate as n becomes large.

Central Limit Theorem: If random samples of n observations are drawn from a non-normal population with mean and standard deviation, then, when n is large, the sampling distribution of the sample mean is approximately normally distributed, with mean and standard deviation . The approximation becomes more accurate as n becomes large.

x

n/

Sampling DistributionsSampling Distributions

Page 10: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

How Large is Large?How Large is Large? If the sampled population is normalnormal, then the sampling distribution of will also be normal, no matter what the sample size.

If the sampled population is symmetricsymmetric, the distribution becomes approximately normal for relatively small values of n.

If the sampled population is skewedskewed, the sample size must be at least 30at least 30 before the sampling distribution of becomes approximately normal.

x

x

Page 11: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

n/x

Sampling Distribution of Sampling Distribution of Sample MeanSample Mean

A random sample of size n is selected from a population with mean and standard deviation

he sampling distribution of the sample mean will have mean and standard deviation .

If the original population is normal, normal, the sampling distribution will be normal for any sample size.

If the original population is non-normal, non-normal, the sampling distribution will be normal when n is large (CLT).

The standard deviation of x-bar is sometimes called the STANDARD ERROR (SE).

Page 12: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Finding Probabilities for Finding Probabilities for Sample MeanSample Mean

1587.8413.1)1(

)16/8

1012()12(

zP

zPxP

1587.8413.1)1(

)16/8

1012()12(

zP

zPxP

If the sampling distribution of is normal or approximately normal standardize or rescale the interval of interest in terms of

Find the appropriate area using Table 3.

If the sampling distribution of is normal or approximately normal standardize or rescale the interval of interest in terms of

Find the appropriate area using Table 3.

x

n

xz

/

Example: Example: A random sample of size n = 16 from a normal distribution with = 10 and = 8.

Page 13: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

ExampleExampleA soda filling machine is supposed to fill cans of soda with 12 fluid ounces. Suppose that the fills are actually normally distributed with a mean of 12.1 oz and a standard deviation of 0.2 oz. What is the probability that the average fill for a 6-pack of soda is less than 12 oz?

)12(xP

)6/2.

1.1212

/(

n

xP

1112.)22.1( zP

Page 14: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Sampling Distribution of Sampling Distribution of Sample ProportionSample Proportion

The Central Limit TheoremCentral Limit Theorem can be used to conclude that the binomial random variable x is approximately normal when n is large, with mean np and variance npq.

The sample proportion, is simply a rescaling of the binomial random variable x, dividing it by n.

From the Central Limit Theorem, the sampling distribution of will also be approximately approximately normal, normal, with a rescaled mean and standard deviation.

n

xp ˆ

Page 15: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

The Sampling Distribution The Sampling Distribution of the Sample Proportionof the Sample Proportion

The standard deviation of p-hat is sometimes called the STANDARD ERROR (SE) of p-hat.

A random sample of size n is selected from a binomial population with parameter p

he sampling distribution of the sample proportion,

will have mean p and standard deviation

If n is large, and p is not too close to zero or one, the sampling distribution of will be approximately approximately normal. (normal. (npnp>5, >5, nqnq>5)>5)

n

xp ˆ

n

pq

Page 16: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Finding Probabilities for Finding Probabilities for the Sample Proportionthe Sample Proportion

0207.9793.1)04.2(

)

100)6(.4.

4.5.()5.ˆ(

zP

zPpP

0207.9793.1)04.2(

)

100)6(.4.

4.5.()5.ˆ(

zP

zPpP

If the sampling distribution of is normal or approximately normal standardize or rescale the interval of interest in terms of

Find the appropriate area using Table 3.

If the sampling distribution of is normal or approximately normal standardize or rescale the interval of interest in terms of

Find the appropriate area using Table 3.

npq

ppz

ˆ

Example: Example: A random sample of size n = 100 from a binomial population with p = .4.

Page 17: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

ExampleExampleThe soda bottler in the previous example claims that only 5% of the soda cans are underfilled. A quality control technician randomly samples 200 cans of soda. What is the probability that more than 10% of the cans are underfilled?

)10.ˆ( pP

)24.3()

200)95(.05.

05.10.(

zPzP

0006.9994.1 This would be very unusual,

if indeed p = .05!

n = 200

S: underfilled can

p = P(S) = .05

q = .95

np = 10 nq = 190

n = 200

S: underfilled can

p = P(S) = .05

q = .95

np = 10 nq = 190OK to use the normal

approximation

Page 18: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Key ConceptsKey ConceptsI.I. Statistics and Sampling DistributionsStatistics and Sampling Distributions

1. Sampling distributions describe the possible values of a statistic and how often they occur in repeated sampling.

2. Sampling distributions can be derived mathematically,approximated empirically, or found using statistical theorems.

3. The Central Limit Theorem states that sums and averages of measurements from a non-normal population with finite mean and standard deviation have approximately normal distributions for large samples of size n.

Page 19: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Key ConceptsKey ConceptsII.II. Sampling Distribution of the Sample MeanSampling Distribution of the Sample Mean

1. When samples of size n are drawn from a normal populationwith mean and variance 2, the sample mean has a normal distribution with mean and variance 2n.

2. When samples of size n are drawn from a non-normal population with mean and variance 2, the Central Limit Theorem ensures that the sample mean will have an approximately normal distribution with mean and variance2n when n is large (n 30).

3. Probabilities involving the sample mean can be calculated

by standardizing the value of using

n

xz

/

n

xz

/

x

x

x

Page 20: Introduction to Probability and Statistics Chapter 7 Sampling Distributions

Key ConceptsKey ConceptsIII.III. Sampling Distribution of the Sample ProportionSampling Distribution of the Sample Proportion

1. When samples of size n are drawn from a binomial population with parameter p, the sample proportion

will have an approximately normal distribution with mean p and variance pq n if np 5 and nq 5.

2. Probabilities involving the sample proportion can be calculated by standardizing the value using

npq

ppz

ˆ

npq

ppz

ˆ