139
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 1 0

Sfs4e ppt 10

Embed Size (px)

Citation preview

Page 1: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.

Chapter

Hypothesis Tests Regarding a Parameter

10

Page 2: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.

Section

The Language of Hypothesis Testing

10.1

Page 3: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-3

1. Determine the null and alternative hypotheses

2. Explain Type I and Type II errors

3. State conclusions to hypothesis tests

Objectives

Page 4: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-4

Objective 1

• Determine the Null and Alternative Hypotheses

Page 5: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-5

A hypothesis is a statement regarding a characteristic of one or more populations.

In this chapter, we look at hypotheses regarding a single population parameter.

Page 6: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-6

Examples of Claims Regarding a Characteristic of a Single Population

• In 2008, 62% of American adults regularly volunteered their time for charity work. A researcher believes that this percentage is different today.

Source: ReadersDigest.com poll created on 2008/05/02

Page 7: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-7

• In 2008, 62% of American adults regularly volunteered their time for charity work. A researcher believes that this percentage is different today.

• According to a study published in March, 2006 the mean length of a phone call on a cellular telephone was 3.25 minutes. A researcher believes that the mean length of a call has increased since then.

Examples of Claims Regarding a Characteristic of a Single Population

Page 8: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-8

• In 2008, 62% of American adults regularly volunteered their time for charity work. A researcher believes that this percentage is different today.

• According to a study published in March, 2006 the mean length of a phone call on a cellular telephone was 3.25 minutes. A researcher believes that the mean length of a call has increased since then.

• Using an old manufacturing process, the standard deviation of the amount of wine put in a bottle was 0.23 ounces. With new equipment, the quality control manager believes the standard deviation has decreased.

Examples of Claims Regarding a Characteristic of a Single Population

Page 9: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-9

CAUTION!

We test these types of statements using sample data because it is usually impossible or impractical to gain access to the entire population. If population data are available, there is no need for inferential statistics.

Page 10: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-10

Hypothesis testing is a procedure, based on sample evidence and probability, used to test statements regarding a characteristic of one or more populations.

Page 11: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-11

1. Make a statement regarding the nature of the population.

2. Collect evidence (sample data) to test the statement.

3. Analyze the data to assess the plausibility of the statement.

Steps in Hypothesis Testing

Page 12: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-12

The null hypothesis, denoted H0, is a statement to be tested. The null hypothesis is a statement of no change, no effect or no difference and is assumed true until evidence indicates otherwise.

Page 13: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-13

The alternative hypothesis, denoted H1, is a statement that we are trying to find evidence to support.

Page 14: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-14

In this chapter, there are three ways to set up the null and alternative hypotheses:

1.Equal versus not equal hypothesis (two-tailed test)

H0: parameter = some value

H1: parameter ≠ some value

Page 15: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-15

In this chapter, there are three ways to set up the null and alternative hypotheses:

1. Equal versus not equal hypothesis (two-tailed test)

H0: parameter = some value

H1: parameter ≠ some value

2. Equal versus less than (left-tailed test)

H0: parameter = some value

H1: parameter < some value

Page 16: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-16

In this chapter, there are three ways to set up the null and alternative hypotheses:

1. Equal versus not equal hypothesis (two-tailed test)

H0: parameter = some value

H1: parameter ≠ some value

2. Equal versus less than (left-tailed test)

H0: parameter = some value

H1: parameter < some value

3. Equal versus greater than (right-tailed test)

H0: parameter = some value

H1: parameter > some value

Page 17: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-17

“In Other Words”

The null hypothesis is a statement of status quo or no difference and always contains a statement of equality. The null hypothesis is assumed to be true until we have evidence to the contrary. We seek evidence that supports the statement in the alternative hypothesis.

Page 18: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-18

For each of the following claims, determine the null and alternative hypotheses. State whether the test is two-tailed, left-tailed or right-tailed.

a)In 2008, 62% of American adults regularly volunteered their time for charity work. A researcher believes that this percentage is different today.

b)According to a study published in March, 2006 the mean length of a phone call on a cellular telephone was 3.25 minutes. A researcher believes that the mean length of a call has increased since then.

c)Using an old manufacturing process, the standard deviation of the amount of wine put in a bottle was 0.23 ounces. With new equipment, the quality control manager believes the standard deviation has decreased.

Parallel Example 2: Forming Hypotheses

Page 19: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-19

a) In 2008, 62% of American adults regularly volunteered their time for charity work. A researcher believes that this percentage is different today.

The hypothesis deals with a population proportion, p. If the percentage participating in charity work is no different than in 2008, it will be 0.62 so the null hypothesis is H0: p=0.62.

Since the researcher believes that the percentage is different today, the alternative hypothesis is a two-tailed hypothesis: H1: p≠0.62.

Solution

Page 20: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-20

b) According to a study published in March, 2006 the mean length of a phone call on a cellular telephone was 3.25 minutes. A researcher believes that the mean length of a call has increased since then.

The hypothesis deals with a population mean, μ. If the mean call length on a cellular phone is no different than in 2006, it will be 3.25 minutes so the null hypothesis isH0: μ = 3.25.

Since the researcher believes that the mean call length has increased, the alternative hypothesis is:H1: μ > 3.25, a right-tailed test.

Solution

Page 21: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-21

c) Using an old manufacturing process, the standard deviation of the amount of wine put in a bottle was 0.23 ounces. With new equipment, the quality control manager believes the standard deviation has decreased.

The hypothesis deals with a population standard deviation, σ. If the standard deviation with the new equipment has not changed, it will be 0.23 ounces so the null hypothesis is H0: σ = 0.23.

Since the quality control manager believes that the standard deviation has decreased, the alternative hypothesis is: H1: σ < 0.23, a left-tailed test.

Solution

Page 22: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-22

Objective 2

• Explain Type I and Type II Errors

Page 23: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-23

1. Reject the null hypothesis when the alternative hypothesis is true. This decision would be correct.

Four Outcomes from Hypothesis Testing

Page 24: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-24

1. Reject the null hypothesis when the alternative hypothesis is true. This decision would be correct.

2. Do not reject the null hypothesis when the null hypothesis is true. This decision would be correct.

Four Outcomes from Hypothesis Testing

Page 25: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-25

1. Reject the null hypothesis when the alternative hypothesis is true. This decision would be correct.

2. Do not reject the null hypothesis when the null hypothesis is true. This decision would be correct.

3. Reject the null hypothesis when the null hypothesis is true. This decision would be incorrect. This type of error is called a Type I error.

Four Outcomes from Hypothesis Testing

Page 26: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-26

1. Reject the null hypothesis when the alternative hypothesis is true. This decision would be correct.

2. Do not reject the null hypothesis when the null hypothesis is true. This decision would be correct.

3. Reject the null hypothesis when the null hypothesis is true. This decision would be incorrect. This type of error is called a Type I error.

4. Do not reject the null hypothesis when the alternative hypothesis is true. This decision would be incorrect. This type of error is called a Type II error.

Four Outcomes from Hypothesis Testing

Page 27: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-27

For each of the following claims, explain what it would mean to make a Type I error. What would it mean to make a Type II error?

a)In 2008, 62% of American adults regularly volunteered their time for charity work. A researcher believes that this percentage is different today.

b)According to a study published in March, 2006 the mean length of a phone call on a cellular telephone was 3.25 minutes. A researcher believes that the mean length of a call has increased since then.

Parallel Example 3: Type I and Type II Errors

Page 28: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-28

a) In 2008, 62% of American adults regularly volunteered their time for charity work. A researcher believes that this percentage is different today.

A Type I error is made if the researcher concludes thatp ≠ 0.62 when the true proportion of Americans 18 years or older who participated in some form of charity work is currently 62%.

A Type II error is made if the sample evidence leads the researcher to believe that the current percentage of Americans 18 years or older who participated in some form of charity work is still 62% when, in fact, this percentage differs from 62%.

Solution

Page 29: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-29

b) According to a study published in March, 2006 the mean length of a phone call on a cellular telephone was 3.25 minutes. A researcher believes that the mean length of a call has increased since then.

A Type I error occurs if the sample evidence leads the researcher to conclude that μ > 3.25 when, in fact, the actual mean call length on a cellular phone is still 3.25 minutes.

A Type II error occurs if the researcher fails to reject the hypothesis that the mean length of a phone call on a cellular phone is 3.25 minutes when, in fact, it is longer than 3.25 minutes.

Solution

Page 30: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-30

α = P(Type I Error)

= P(rejecting H0 when H0 is true)

β = P(Type II Error)

= P(not rejecting H0 when H1 is true)

Page 31: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-31

The probability of making a Type I error, α, is chosen by the researcher before the sample data is collected.

The level of significance, α, is the probability of making a Type I error.

Page 32: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-32

“In Other Words” As the probability of a Type I error increases, the probability of a Type II error decreases, and vice-versa.

Page 33: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-33

Objective 3

• State Conclusions to Hypothesis Tests

Page 34: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-34

CAUTION!

We never “accept” the null hypothesis, because, without having access to the entire population, we don’t know the exact value of the parameter stated in the null. Rather, we say that we do not reject the null hypothesis. This is just like the court system. We never declare a defendant “innocent”, but rather say the defendant is “not guilty”.

Page 35: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-35

According to a study published in March, 2006 the mean length of a phone call on a cellular telephone was 3.25 minutes. A researcher believes that the mean length of a call has increased since then.

a)Suppose the sample evidence indicates that the null hypothesis should be rejected. State the wording of the conclusion.

b)Suppose the sample evidence indicates that the null hypothesis should not be rejected. State the wording of the conclusion.

Parallel Example 4: Stating the Conclusion

Page 36: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-36

a) Suppose the sample evidence indicates that the null hypothesis should be rejected. State the wording of the conclusion.

The statement in the alternative hypothesis is that the mean call length is greater than 3.25 minutes. Since the null hypothesis (μ = 3.25) is rejected, there is sufficient evidence to conclude that the mean length of a phone call on a cell phone is greater than 3.25 minutes.

Solution

Page 37: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-37

b) Suppose the sample evidence indicates that the null hypothesis should not be rejected. State the wording of the conclusion.

Since the null hypothesis (μ = 3.25) is not rejected, there is insufficient evidence to conclude that the mean length of a phone call on a cell phone is greater than 3.25 minutes. In other words, the sample evidence is consistent with the mean call length equaling 3.25 minutes.

Solution

Page 38: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.

Section

Hypothesis Tests for a Population Proportion

10.2

Page 39: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-39

Objectives

1. Explain the logic of hypothesis testing

2. Test the hypotheses about a population proportion

3. Test hypotheses about a population proportion using the binomial probability distribution.

Page 40: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-40

Objective 1

• Explain the Logic of Hypothesis Testing

Page 41: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-41

A researcher obtains a random sample of 1000 people and finds that 534 are in favor of the banning cell phone use while driving, so = 534/1000. Does this suggest that more than 50% of people favor the policy? Or is it possible that the true proportion of registered voters who favor the policy is some proportion less than 0.5 and we just happened to survey a majority in favor of the policy? In other words, would it be unusual to obtain a sample proportion of 0.534 or higher from a population whose proportion is 0.5? What is convincing, or statistically significant, evidence?

Page 42: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-42

When observed results are unlikely under the assumption that the null hypothesis is true, we say the result is statistically significant. When results are found to be statistically significant, we reject the null hypothesis.

Page 43: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-43

To determine if a sample proportion of 0.534 is statistically significant, we build a probability model.

Since np(1 – p) = 100(0.5)(1 – 0.5) = 250 ≥ 10 and the sample size (n = 1000) is sufficiently smaller than the population size, we can use the normal model to describe the variability in . The mean of the distribution of is and the standard

deviation is

öp0.5

öp

0.5 1 0.5

10000.016.

Page 44: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-44

Sampling distribution of the sample proportion

Page 45: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-45

We may consider the sample evidence to be statistically significant (or sufficient) if the sample proportion is too many standard deviations, say 2, above the assumed population proportion of 0.5.

The Logic of the Classical Approach

Page 46: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-46

Recall that our simple random sample yielded a sample proportion of 0.534, so

standard deviations above the hypothesized proportion of 0.5.

Therefore, using our criterion, we would reject the null hypothesis.

z öp öp

öp

0.534 0.5

0.5 1 0.5 1000

2.15

Page 47: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-47

Why does it make sense to reject the null hypothesis if the sample proportion is more than 2 standard deviations away from the hypothesized proportion? The area under the standard normal curve to the right ofz = 2 is 0.0228.

Page 48: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-48

If the null hypothesis were true (population proportion is 0.5), then 1 – 0.0228 = 0.9772 = 97.72% of all sample proportions will be less than

.5 + 2(0.016) = 0.532and only 2.28% of the sample proportions will be more than 0.532.

Page 49: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-49

If the sample proportion is too many standard deviations from the proportion stated in the null hypothesis, we reject the null hypothesis.

Hypothesis Testing Usingthe Classical Approach

Page 50: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-50

A second criterion we may use for testing hypotheses is to determine how likely it is to obtain a sample proportion of 0.534 or higher from a population whose proportion is 0.5. If a sample proportion of 0.534 or higher is unlikely (or unusual), we have evidence against the statement in the null hypothesis. Otherwise, we do not have sufficient evidence against the statement in the null hypothesis.

The Logic of the P-Value Approach

Page 51: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-51

We can compute the probability of obtaining a sample proportion of 0.534 or higher from a population whose proportion is 0.5 using the normal model.

Page 52: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-52

Recall

So, we compute

The value 0.0158 is called the P-value, which means about 2 samples in 100 will give a sample proportion as high or higher than the one we obtained if the population proportion really is 0.5. Because these results are unusual, we take this as evidence against the statement in the null hypothesis.

z 0.534 0.5

0.5 1 0.5 1000

2.15

P öp 0.534 P(z 2.15) 0.0158

Page 53: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-53

If the probability of getting a sample proportion as extreme or more extreme than the one obtained is small under the assumption the statement in the null hypothesis is true, reject the null hypothesis.

Hypothesis TestingUsing the P-value Approach

Page 54: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-54

Objective 2

• Test hypotheses about a population proportion.

Page 55: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-55

Recall:

• The best point estimate of p, the proportion of the population with a certain characteristic, is given by

where x is the number of individuals in the sample with the specified characteristic and n is the sample size.

ˆ p x

n

Page 56: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-56

Recall:

• The sampling distribution of is approximately normal, with mean and standard deviation

provided that the following requirements are satisfied:

1. The sample is a simple random sample.2. np(1-p) ≥ 10.3. The sampled values are independent of each

other.

ˆ p

ˆ p p

ˆ p p(1 p)

n

Page 57: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-57

Testing Hypotheses Regarding a Population Proportion, p

To test hypotheses regarding the population proportion, we can use the steps that follow, provided that:•The sample is obtained by simple random sampling.• np0(1 – p0) ≥ 10.•The sampled values are independent of each other.

Page 58: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-58

Step 1: Determine the null and alternative hypotheses. The hypotheses can be structured in one of three ways:

Page 59: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-59

Step 2: Select a level of significance, α, based on the seriousness of making a Type I error.

Page 60: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-60

Step 3: Compute the test statistic

Note: We use p0 in computing the standard error rather than . This is because, when we test a hypothesis, the null hypothesis is always assumed true.

z0 ˆ p p0

p0(1 p0)

n

ˆ p

Classical Approach

Page 61: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-61

Classical Approach

Two-Tailed

(critical value)

Use Table V to determine the critical value.

Page 62: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-62

Classical Approach

Left-Tailed

(critical value)

Page 63: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-63

Classical Approach

Right-Tailed

(critical value)

Page 64: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-64

Step 4: Compare the critical value with the test statistic:

Classical Approach

Page 65: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-65

By Hand Step 3: Compute the test statistic.

P-Value Approach

z0 ˆ p p0

p0(1 p0)n

Page 66: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-66

P-Value Approach

Two-Tailed

Use Table V to estimate the P-value.

Page 67: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-67

P-Value Approach

Left-Tailed

Page 68: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-68

P-Value Approach

Right-Tailed

Page 69: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-69

Technology Step 3: Use a statistical spreadsheet or calculator with statistical capabilities to obtain the P-value.The directions for obtaining the P-value using the TI-83/84 Plus graphing calculator, MINITAB, Excel, and StatCrunch are in the Technology Step-by-Step in the text.

P-Value Approach

Page 70: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-70

Step 4: If the P-value < α, reject the null hypothesis.

P-Value Approach

Page 71: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-71

Step 5: State the conclusion.

Page 72: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-72

Parallel Example 1: Testing a Hypothesis about a Population Proportion: Large Sample Size

In 1997, 46% of Americans said they did not trust the media “when it comes to reporting the news fully, accurately and fairly”. In a 2007 poll of 1010 adults nationwide, 525 stated they did not trust the media. At the α = 0.05 level of significance, is there evidence to support the claim that the percentage of Americans that do not trust the media to report fully and accurately has increased since 1997?

Source: Gallup Poll

Page 73: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-73

Solution

We want to know if p > 0.46. First, we must verify the requirements to perform the hypothesis test:

1. This is a simple random sample.

2. np0(1 – p0) = 1010(0.46)(1 – 0.46) = 250.8 > 10

3. Since the sample size is less than 5% of the population size, the assumption of independence is met.

Page 74: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-74

Solution

Step 1: H0: p = 0.46 versus H1: p > 0.46

Step 2: The level of significance is α = 0.05.

Step 3: The sample proportion is .

The test statistic is then

ˆ p 525

10100.52

z0 0.52 0.46

0.46(1 0.46)

1010

3.83

Page 75: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-75

Solution: Classical Approach

Step 4: Since this is a right-tailed test, we determine the critical value at the α = 0.05 level of significance to be

z0.05 = 1.645.

Step 5: Since the test statistic, z0 = 3.83, is

greater than the critical value 1.645, we reject the null hypothesis.

Page 76: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-76

Solution: P-Value Approach

Step 4: Since this is a right-tailed test, the P- value is the area under the standard normal distribution to the right of the test statistic z0=3.83. That is, P-value =

P(Z > 3.83) ≈ 0.

Step 5: Since the P-value is less than the level of significance, we reject the null

hypothesis.

Page 77: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-77

Solution

Step 6: There is sufficient evidence at the α = 0.05 level of significance to

conclude that the percentage of Americans that do not trust the media to report fully and accurately has increased since 1997.

Page 78: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-78

Objective 3

• Test hypotheses about a population proportion using the binomial probability distribution.

Page 79: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-79

For the sampling distribution of to be approximately normal, we require np(1– p) be at least 10. What if this requirement is not met?

We stated that an event was unusual if the probability of observing the event was less than 0.05. This criterion is based on the P-value approach to testing hypotheses; the probability that we computed was the P-value. We use this same approach to test hypotheses regarding a population proportion for small samples.

ˆ p

Page 80: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-80

Parallel Example 4: Hypothesis Test for a Population Proportion: Small Sample Size

In 2006, 10.5% of all live births in the United States were to mothers under 20 years of age. A sociologist claims that births to mothers under 20 years of age is decreasing. She conducts a simple random sample of 34 births and finds that 3 of them were to mothers under 20 years of age. Test the sociologist’s claim at the α = 0.01 level of significance.

Page 81: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-81

Parallel Example 4: Hypothesis Test for a Population Proportion: Small Sample Size

Approach:

Step 1: Determine the null and alternative hypotheses

Step 2: Check whether np0(1–p0) is greater than or equal to 10, where p0 is the proportion stated in the null hypothesis. If it is, then the sampling distribution of is approximately normal and we can use the steps for a large sample size. Otherwise we use the following Steps 3 and 4.

öp

Page 82: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-82

Parallel Example 4: Hypothesis Test for a Population Proportion: Small Sample Size

Approach:

Step 3: Compute the P-value. For right-tailed tests, the P-value is the probability of obtaining x

or more successes. For left-tailed tests, the P- value is the probability of obtaining x or fewer successes. The P-value is always computed with the proportion given in the null hypothesis.

Step 4: If the P-value is less than the level of significance, α, we reject the null hypothesis.

Page 83: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-83

Solution

Step 1: H0: p = 0.105 versus H1: p < 0.105

Step 2: From the null hypothesis, we have p0 = 0.105. There were 34 mothers sampled, so np0(1– p0)=3.57 < 10.

Thus, the sampling distribution of is not approximately normal.

ˆ p

Page 84: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-84

Solution

Step 3: Let X represent the number of live births in the United States to mothers under 20

years of age. We have x = 3 successes in n = 34 trials so = 3/34= 0.088. We want to

determine whether this result is unusual if the population mean is truly 0.105. Thus,

P-value = P(X ≤ 3 assuming p=0.105 )

= P(X = 0) + P(X =1)

+ P(X =2) + P(X = 3)

= 0.51

ˆ p

Page 85: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-85

Solution

Step 4: The P-value = 0.51 is greater than the level of significance so we do not

reject H0. There is insufficient evidence to conclude that the percentage of live births in the United States to mothers under the age of 20 has decreased below the 2006 level of 10.5%.

Page 86: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.

Section

Hypothesis Tests for a Population Mean

10.3

Page 87: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-87

Objectives

1. Test hypotheses about a mean 2. Understand the difference between statistical

significance and practical significance.

Page 88: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-88

Objective 1

• Test Hypotheses about a Mean

Page 89: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-89

To test hypotheses regarding the population mean assuming the population standard deviation is unknown, we use the t-distribution rather than the Z-distribution. When we replace σ with s,

follows Student’s t-distribution with n –1 degrees of freedom.

x s

n

Page 90: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-90

1. The t-distribution is different for different degrees of freedom.

Properties of the t-Distribution

Page 91: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-91

1. The t-distribution is different for different degrees of freedom.

2. The t-distribution is centered at 0 and is symmetric about 0.

Properties of the t-Distribution

Page 92: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-92

1. The t-distribution is different for different degrees of freedom.

2. The t-distribution is centered at 0 and is symmetric about 0.

3. The area under the curve is 1. Because of the symmetry, the area under the curve to the right of 0 equals the area under the curve to the left of 0 equals 1/2.

Properties of the t-Distribution

Page 93: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-93

4. As t increases (or decreases) without bound, the graph approaches, but never equals, 0.

Properties of the t-Distribution

Page 94: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-94

4. As t increases (or decreases) without bound, the graph approaches, but never equals, 0.

5. The area in the tails of the t-distribution is a little greater than the area in the tails of the standard normal distribution because using s as an estimate of σ introduces more variability to the t-statistic.

Properties of the t-Distribution

Page 95: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-95

6. As the sample size n increases, the density curve of t gets closer to the standard normal density curve. This result occurs because as the sample size increases, the values of s get closer to the values of σ by the Law of Large Numbers.

Properties of the t-Distribution

Page 96: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-96

Testing Hypotheses Regarding aPopulation Mean

To test hypotheses regarding the population mean, we use the following steps, provided that:•The sample is obtained using simple random sampling.•The sample has no outliers, and the population from which the sample is drawn is normally distributed or the sample size is large (n ≥ 30).•The sampled values are independent of each other.

Page 97: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-97

Step 1: Determine the null and alternative hypotheses. Again, the hypotheses can be structured in one of three ways:

Page 98: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-98

Step 2: Select a level of significance, α, based on the seriousness of making a Type I error.

Page 99: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-99

Classical Approach

Step 3: Compute the test statistic

which follows the Student’s t-distribution with n – 1 degrees of freedom.

t0

x 0

s

n

Page 100: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-100

Classical Approach

Use Table VI to determine the critical value.

Page 101: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-101

Classical Approach

Two-Tailed

Page 102: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-102

Classical Approach

Left-Tailed

Page 103: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-103

Classical Approach

Right-Tailed

Page 104: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-104

Classical Approach

Step 4: Compare the critical value with the test statistic:

Page 105: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-105

P-Value Approach

By Hand Step 3: Compute the test statistic

which follows the Student’s t-distribution with n – 1 degrees of freedom.

t0

x 0

s

n

Page 106: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-106

P-Value Approach

Use Table VI to approximate the P-value.

Page 107: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-107

P-Value ApproachTwo-Tailed

Page 108: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-108

P-Value ApproachLeft-Tailed

Page 109: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-109

P-Value ApproachRight-Tailed

Page 110: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-110

P-Value Approach

Technology Step 3: Use a statistical spreadsheet or calculator with statistical capabilities to obtain the P-value. The directions for obtaining the P-value using the TI-83/84 Plus graphing calculator, MINITAB, Excel, and StatCrunch, are in the Technology Step-by-Step in the text.

Page 111: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-111

P-Value Approach

Step 4: If the P-value < α, reject the null hypothesis.

Page 112: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-112

Step 5: State the conclusion.

Page 113: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-113

The procedure is robust, which means that minor departures from normality will not adversely affect the results of the test. However, for small samples, if the data have outliers, the procedure should not be used.

Page 114: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-114

Parallel Example 1: Testing a Hypothesis about a Population Mean, Large Sample

Assume the resting metabolic rate (RMR) of healthy males in complete silence is 5710 kJ/day. Researchers measured the RMR of 45 healthy males who were listening to calm classical music and found their mean RMR to be 5708.07 with a standard deviation of 992.05.

At the α = 0.05 level of significance, is there evidence to conclude that the mean RMR of males listening to calm classical music is different than 5710 kJ/day?

Page 115: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-115

Solution

We assume that the RMR of healthy males is 5710 kJ/day. This is a two-tailed test since we are interested in determining whether the RMR differs from 5710 kJ/day.

Since the sample size is large, we follow the steps for testing hypotheses about a population mean for large samples.

Page 116: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-116

Solution

Step 1: H0: μ = 5710 versus H1: μ ≠ 5710

Step 2: The level of significance is α = 0.05.

Step 3: The sample mean is = 5708.07 and the sample standard deviation is s = 992.05. The test statistic is

t0 5708.07 5710

992.05 45 0.013

x

Page 117: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-117

Solution: Classical Approach

Step 4: Since this is a two-tailed test, we determine the critical values at the α = 0.05 level of significance with n –1 = 45 – 1 = 44 degrees of freedom to be approximately –t0.025 = –2.021

and t0.025 = 2.021.

Step 5: Since the test statistic, t0 = –0.013, is between

the critical values, we fail to reject the null hypothesis.

Page 118: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-118

Solution: P-Value Approach

Step 4: Since this is a two-tailed test, the P-value is the area under the t-distribution with n –1 = 45 – 1

= 44 degrees of freedom to the left of –t0.025= –0.013

and to the right of t0.025 = 0.013. That is, P-

value = P(t < -0.013) + P(t > 0.013) = 2 P(t > 0.013). 0.50 < P-value.

Step 5: Since the P-value is greater than the level of significance (0.05<0.5), we fail to reject the null hypothesis.

Page 119: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-119

Solution

Step 6: There is insufficient evidence at the α = 0.05 level of significance to conclude that the mean RMR of males listening to calm classical music differs from 5710 kJ/day.

Page 120: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-120

Parallel Example 3: Testing a Hypothesis about a Population Mean, Small Sample

According to the United States Mint, quarters weigh 5.67 grams. A researcher is interested in determining whether the “state” quarters have a weight that is different from 5.67 grams. He randomly selects 18 “state” quarters, weighs them and obtains the following data.

5.70 5.67 5.73 5.61 5.70 5.67

5.65 5.62 5.73 5.65 5.79 5.73

5.77 5.71 5.70 5.76 5.73 5.72

At the α = 0.05 level of significance, is there evidence to conclude that state quarters have a weight different than 5.67 grams?

Page 121: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-121

Solution

We assume that the weight of the state quarters is 5.67 grams. This is a two-tailed test since we are interested in determining whether the weight differs from 5.67 grams.

Since the sample size is small, we must verify that the data come from a population that is normally distributed with no outliers before proceeding to Steps 1-6.

Page 122: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-122

Assumption of normality appears reasonable.

Page 123: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-123

No outliers.

Page 124: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-124

Solution

Step 1: H0: μ = 5.67 versus H1: μ ≠ 5.67

Step 2: The level of significance is α = 0.05.

Step 3: From the data, the sample mean is calculated to be 5.7022 and the sample standard deviation is s = 0.0497. The test statistic is

t0 5.7022 5.67

.0497 182.75

Page 125: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-125

Solution: Classical Approach

Step 4: Since this is a two-tailed test, we determine the critical values at the α = 0.05 level of significance with n – 1 = 18 – 1 = 17 degrees of freedom to be –t0.025 = –2.11 and t0.025 = 2.11.

Step 5: Since the test statistic, t0 = 2.75, is greater than

the critical value 2.11, we reject the null hypothesis.

Page 126: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-126

Solution: P-Value Approach

Step 4: Since this is a two-tailed test, the P-value is the area under the t-distribution with n – 1 = 18 – 1

= 17 degrees of freedom to the left of –t0.025 = –2.75

and to the right of t0.025 = 2.75. That is, P-value

= P(t < -2.75) + P(t > 2.75) = 2 P(t > 2.75). 0.01 < P-value < 0.02.

Step 5: Since the P-value is less than the level of significance (0.02<0.05), we reject the null hypothesis.

Page 127: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-127

Solution

Step 6: There is sufficient evidence at the α = 0.05 level of significance to conclude that the mean weight of the state quarters differs from 5.67 grams.

Page 128: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-128

Objective 2

• Understand the Difference between Statistical Significance and Practical Significance

Page 129: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-129

When a large sample size is used in a hypothesis test, the results could be statistically significant even though the difference between the sample statistic and mean stated in the null hypothesis may have no practical significance.

Page 130: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-130

Practical significance refers to the idea that, while small differences between the statistic and parameter stated in the null hypothesis are statistically significant, the difference may not be large enough to cause concern or be considered important.

Page 131: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-131

Parallel Example 7: Statistical versus Practical Significance

In 2003, the average age of a mother at the time of her first childbirth was 25.2. To determine if the average age has increased, a random sample of 1200 mothers is taken and is found to have a sample mean age of 25.5 with a standard deviation of 4.8, determine whether the mean age has increased using a significance level of α = 0.05.

Page 132: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-132

Solution

Step 1: To determine whether the mean age has increased, this is a right-tailed test with

H0: μ = 25.2 versus H1: μ > 25.2.

Step 2: The level of significance is α = 0.05.

Step 3: Recall that the sample mean is 25.5. The test statistic is then

t0

25.5 25.2

4.8 12002.17

Page 133: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-133

Solution

Step 4: Since this is a right-tailed test, the critical value with α = 0.05 and 1200 – 1 = 1199 degrees of freedom is

P-value = P(t > 2.17).

From Table VI: .001 < P-value < .005

Step 5: Because the P-value is less than the level of significance, 0.05, we reject the null hypothesis.

Page 134: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-134

Solution

Step 6: There is sufficient evidence at the 0.05 significance level to conclude that the mean age of a mother at the time of her first childbirth is greater than 25.2.

Although we found the difference in age to be significant, there is really no practical significance in the age difference (25.2 versus 25.5).

Page 135: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-135

Large Sample Sizes

Large sample sizes can lead to statistically significant results while the difference between the statistic and parameter is not enough to be considered practically significant.

Page 136: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.

Section

Putting It Together: Which Method Do I Use?

10.4

Page 137: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-137

Objective

1. Determine the appropriate hypothesis test to perform

Page 138: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-138

Objective 1

• Determine the Appropriate Hypothesis Test to Perform

Page 139: Sfs4e ppt 10

Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc.10-139