31
BPS - 5th Ed. BPS - 5th Ed. Chapter 11 Chapter 11 1 Chapter 11 Chapter 11 Sampling Distributions Sampling Distributions

BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Embed Size (px)

Citation preview

Page 1: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

BPS - 5th Ed.BPS - 5th Ed. Chapter 11Chapter 11 11

Chapter 11Chapter 11

Sampling Distributions Sampling Distributions

Page 2: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 22BPS - 5th Ed.BPS - 5th Ed.

Sampling TerminologySampling Terminology ParameterParameter

fixed, unknown number that describes the fixed, unknown number that describes the populationpopulation

StatisticStatistic known value calculated from a sampleknown value calculated from a sample a statistic is often used to estimate a parametera statistic is often used to estimate a parameter

VariabilityVariability different samples from the same population may different samples from the same population may

yield different values of the sample statisticyield different values of the sample statistic Sampling DistributionSampling Distribution

tells what values a statistic takes and how often it tells what values a statistic takes and how often it takes those values in repeated samplingtakes those values in repeated sampling

Page 3: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 33BPS - 5th Ed.BPS - 5th Ed.

Parameter vs. StatisticParameter vs. Statistic

A properly chosen sample of 1600 people across the United States was asked if they regularly watch a certain television program (Real Housewives of New Jersey), and 24% said yes. The parameter of interest here is the true proportion of all people in the U.S. who watch the program, while the statistic is the value 24% obtained from the sample of 1600 people.

Page 4: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 44BPS - 5th Ed.BPS - 5th Ed.

Parameter vs. StatisticParameter vs. Statistic

The mean of a population is denoted by µ – this is a parameter.

The mean of a sample is denoted by – this is a statistic. is used to estimate µ.

xx

The true proportion of a population with a certain trait is denoted by p – this is a parameter.

The proportion of a sample with a certain trait is denoted by (“p-hat”) – this is a statistic. is used to estimate p.

p̂ p̂

Page 5: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 55BPS - 5th Ed.BPS - 5th Ed.

The Law of Large NumbersThe Law of Large Numbers

Consider sampling at random from a Consider sampling at random from a population with true mean population with true mean µµ. As the . As the number of (independent) number of (independent) observations sampled increases, the observations sampled increases, the mean of the sample gets closer and mean of the sample gets closer and closer to the true mean of the closer to the true mean of the population.population.( gets closer to µ ) x

Page 6: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 66BPS - 5th Ed.BPS - 5th Ed.

The Law of Large NumbersThe Law of Large NumbersGamblingGambling

The “house” in a gambling operation is not The “house” in a gambling operation is not gambling at allgambling at all the games are defined so that the gambler has a the games are defined so that the gambler has a

negative expected gain per play (the true mean negative expected gain per play (the true mean gain after all possible plays is negative)gain after all possible plays is negative)

each play is independent of previous plays, so each play is independent of previous plays, so the the law of large numberslaw of large numbers guarantees that the guarantees that the average winnings of a large number of average winnings of a large number of customers will be close the (negative) true customers will be close the (negative) true averageaverage

Page 7: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 77BPS - 5th Ed.BPS - 5th Ed.

Sampling DistributionSampling DistributionThe The sampling distributionsampling distribution of a of a

statistic is the distribution of values statistic is the distribution of values taken by the statistic in all possible taken by the statistic in all possible samples of the same size (samples of the same size (nn) from ) from the same populationthe same populationto describe a distribution we need to to describe a distribution we need to

specify the shape, center, and spreadspecify the shape, center, and spreadwe will discuss the distribution of the we will discuss the distribution of the

sample mean (x-bar) in this chaptersample mean (x-bar) in this chapter

Page 8: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 88BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase Study

Does This Cheese Smell Bad?

As cheese ages bacteria grows causing “off-odors”. Cheese makers want to know the odor threshold – the lowest concentration of a smell that the human nose can detect. Different people have different thresholds, and of interest is the mean threshold in the population of all adults.

Page 9: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 99BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase Study

Suppose the mean threshold of all adults is =25 micrograms of bacteria per ounce of cheese, with a standard deviation of =7 micrograms per ounce and the threshold values follow a bell-shaped (normal) curve.

Does This Cheese Smell Bad?

Page 10: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 1010BPS - 5th Ed.BPS - 5th Ed.

Where should 95% of all Where should 95% of all individualindividual threshold values fall?threshold values fall?

mean plus or minus two standard mean plus or minus two standard deviationsdeviations

25 25 2(7) = 11 2(7) = 11

25 + 2(7) = 3925 + 2(7) = 39

95% should fall between 11 & 3995% should fall between 11 & 39

What about the What about the meanmean (average) of a sample (average) of a sample of of n n adults? What values would be expected?adults? What values would be expected?

Page 11: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 1111BPS - 5th Ed.BPS - 5th Ed.

Sampling DistributionSampling Distribution What about the What about the meanmean (average) of a sample of (average) of a sample of

n n adults? What values would be expected?adults? What values would be expected?

Answer this by thinking: Answer this by thinking: “What would “What would happen if we took many samples of happen if we took many samples of nn subjects from this population?”subjects from this population?” (let’s say (let’s say that n=10 subjects make up a sample)that n=10 subjects make up a sample) take a large number of samples of take a large number of samples of nn=10 =10

subjects from the populationsubjects from the population calculate the sample mean (x-bar) for each calculate the sample mean (x-bar) for each

samplesample make a histogram (or stemplot) of the values make a histogram (or stemplot) of the values

of x-barof x-bar examine the graphical display for shape, examine the graphical display for shape,

center, spreadcenter, spread

Page 12: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 1212BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase Study

Mean threshold of all adults is =25 micrograms per ounce, with a standard deviation of =7 micrograms per ounce and the threshold values follow a bell-shaped (normal) curve.

Many (1000) repetitions of sampling n=10 adults from the population were simulated and the resulting histogram of the 1000x-bar values is on the next slide.

Does This Cheese Smell Bad?

Page 13: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 1313BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase StudyDoes This Cheese Smell Bad?

Page 14: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 1414BPS - 5th Ed.BPS - 5th Ed.

Mean and Standard Deviation of Mean and Standard Deviation of Sample MeansSample Means

OR OR “Means of the Means”“Means of the Means”

If numerous samples of size n are taken from

a population with mean and standard

deviation , then the mean of the sampling

distribution of is (the population mean)

and the standard deviation is:

( is the population s.d.) n

X

Page 15: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 1515BPS - 5th Ed.BPS - 5th Ed.

Mean and Standard Deviation of Mean and Standard Deviation of Sample MeansSample Means

Since the mean of is , we say that is

an unbiased estimator of X X

Individual observations have standard deviation , but sample means from samples of size n have standard deviation

. Averages are less variable than individual observations.

X

n

Page 16: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 1616BPS - 5th Ed.BPS - 5th Ed.

Sampling Distribution ofSampling Distribution ofSample MeansSample Means

If individual observations have the N(µ, )

distribution, then the sample mean of n

independent observations has the N(µ, / )

distribution.

X

n

“If measurements in the population follow a Normal distribution, then so does the sample mean.”

Page 17: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

When do I use the adjusted standard When do I use the adjusted standard deviation? deviation?

When evaluating Samples: When evaluating Samples: When evaluating an When evaluating an individual:individual:

n

We can compare an individual to the entire population. For example, what is the probability that one student from MHS will get a 2000 on the SAT?

We can compare a sample to the performance of past samples of the population. For example: what is the probability that a sample of 20 students from MHS will get an average of 1800 on the SAT?

Page 18: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 1818BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase Study

Mean threshold of all adults is =25 with a standard deviation of =7, and the threshold values follow a bell-shaped (normal) curve.

Does This Cheese Smell Bad?

(Population distribution)

Page 19: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 1919BPS - 5th Ed.BPS - 5th Ed.

Central Limit TheoremCentral Limit Theorem

“No matter what distribution the population values follow, the sample mean will follow a Normal distribution if the sample size is large.”

If a random sample of size n is selected from ANY population with mean and standard

deviation , then when n is large the sampling distribution of the sample mean is approximately Normal:

is approximately N(µ, / )

X

nX

Page 20: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2020BPS - 5th Ed.BPS - 5th Ed.

Central Limit Theorem:Central Limit Theorem:Sample SizeSample Size

How large must How large must nn be for the CLT to be for the CLT to hold?hold?depends on how far the population depends on how far the population

distribution is from Normaldistribution is from Normalthe further from Normal, the larger the the further from Normal, the larger the

sample size neededsample size neededa sample size of a sample size of 25 or 3025 or 30 is typically large is typically large

enough for any population distribution enough for any population distribution encountered in practice encountered in practice

recall: if the population is Normal, any recall: if the population is Normal, any sample size will work (sample size will work (nn≥1)≥1)

Page 21: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2121BPS - 5th Ed.BPS - 5th Ed.

Central Limit Theorem:Central Limit Theorem:Sample Size and Distribution of Sample Size and Distribution of x-x-

barbar

n=1

n=25n=10

n=2

Page 22: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2222BPS - 5th Ed.BPS - 5th Ed.

Statistical Process ControlStatistical Process Control

Goal is to make a process stable over Goal is to make a process stable over time and keep it stable unless there time and keep it stable unless there are planned changesare planned changes

All processes have variationAll processes have variationStatistical description of stability over Statistical description of stability over

time: the time: the pattern of variationpattern of variation remains stable (does not say that remains stable (does not say that there is there is nono variation) variation)

Page 23: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2323BPS - 5th Ed.BPS - 5th Ed.

Statistical Process ControlStatistical Process Control A variable described by the same A variable described by the same

distribution over time is said to be distribution over time is said to be in controlin control To see if a process has been disturbed and To see if a process has been disturbed and

to signal when the process is to signal when the process is out of controlout of control, , control chartscontrol charts are used to monitor the are used to monitor the processprocessdistinguish natural variation in the process from distinguish natural variation in the process from

additional variation that suggests a changeadditional variation that suggests a changemost common application: industrial processesmost common application: industrial processes

Page 24: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2424BPS - 5th Ed.BPS - 5th Ed.

ChartsCharts There is a true mean There is a true mean that describes the that describes the

center or aim of the processcenter or aim of the process Monitor the process by plotting the meansMonitor the process by plotting the means

(x-bars) of small samples taken from the (x-bars) of small samples taken from the process at regular intervals over timeprocess at regular intervals over time

Process-monitoring conditionsProcess-monitoring conditions:: measure quantitative variable measure quantitative variable xx that is Normal that is Normal process has been operating in control for a long process has been operating in control for a long

periodperiod know process mean know process mean and standard deviation and standard deviation

that describe distribution of that describe distribution of xx when process is in when process is in controlcontrol

x

Page 25: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2525BPS - 5th Ed.BPS - 5th Ed.

Control ChartsControl ChartsPlot the means (x-bars) of regular Plot the means (x-bars) of regular

samples of size n against timesamples of size n against timeDraw a horizontal Draw a horizontal center linecenter line at at

x

Draw horizontal Draw horizontal control limitscontrol limits at at ± 3 ± 3//n almost all (99.7%) of the values of x-bar should almost all (99.7%) of the values of x-bar should

be within the mean plus or minus 3 standard be within the mean plus or minus 3 standard deviationsdeviations

Any x-bar that does not fall between the Any x-bar that does not fall between the control limits is evidence that the control limits is evidence that the process is out of controlprocess is out of control

Page 26: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2626BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase Study

Need to control the tension in millivolts Need to control the tension in millivolts (mV) on the mesh of fine wires behind the (mV) on the mesh of fine wires behind the surface of the screen. surface of the screen.

Proper tension is Proper tension is 275275 mV ( mV (target mean target mean ))When in control, the standard deviation of When in control, the standard deviation of

the tension readings is the tension readings is =43=43 mV mV

Making Computer Monitors

Page 27: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2727BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase Study

Proper tension is Proper tension is 275275 mV ( mV (target mean target mean ). When in control, ). When in control, the standard deviation of the tension readings is the standard deviation of the tension readings is =43=43 mV. mV.

Making Computer Monitors

Take samples of Take samples of nn=4 screens and =4 screens and calculate the means of these samplescalculate the means of these samples

the control limits of the x-bar control chart the control limits of the x-bar control chart would bewould be

339.5 and 210.5

64.527532753μ4

43n

σ

Page 28: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2828BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase Study

Making Computer Monitors

(data)

Page 29: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 2929BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase Study

Making Computer Monitors( chart)x

(in control)

Page 30: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 3030BPS - 5th Ed.BPS - 5th Ed.

Case StudyCase StudyMaking Computer Monitors

(examples of out of control processes)

Page 31: BPS - 5th Ed. Chapter 11 1 Sampling Distributions

Chapter 11Chapter 11 3131BPS - 5th Ed.BPS - 5th Ed.

Natural TolerancesNatural Tolerances

For x-bar charts, the For x-bar charts, the control limitscontrol limits for for the the meanmean of the process are of the process are ± 3 ± 3//n almost all (99.7%) of the values of almost all (99.7%) of the values of x-barx-bar should should

be within the mean plus or minus 3 standard be within the mean plus or minus 3 standard deviationsdeviations

When monitoring a process, the When monitoring a process, the natural natural tolerancestolerances for for individualindividual products are products are ± 3± 3 almost all (99.7%) of the almost all (99.7%) of the individual individual

measurementsmeasurements should be within the mean plus should be within the mean plus or minus 3 standard deviationsor minus 3 standard deviations