44
Fundamentals of Hypothesis Testing

Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

  • View
    230

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Fundamentals of Hypothesis Testing

Page 2: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Identify the Population

Assume thepopulation

mean TV sets is 3.

(Null Hypothesis)

REJECT

Compute the Sample Mean to be 2.0

Take a Sample

Null Hypothesis

Hypothesis Testing Process

Do a statistical test and conclude

Page 3: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

1. State H0

State H1

2. Choose

3. Collect data

4. Compute test statistic (and/or the p value)

5. Make the decision

General Steps in Hypothesis Testing

Page 4: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Step 1

Define the null and alternative hypotheses

Page 5: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

A hypothesis is a claim about the population parameter.

Examples of a parameter are population mean or proportion

The mean number of TV sets per household is 3.0!

© 1984-1994 T/Maker Co.

What is a Hypothesis?

Page 6: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

States the assumption to be tested

e.g. The mean number of TV sets is 3

(H0: 3)

The null hypothesis is always about a

population parameter (H0: 3), notnot

about a sample statistic (H0: X 3)

The Null Hypothesis, H0

Page 7: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

The Null Hypothesis, H0

• Begins with the assumption that the null hypothesis is TRUETRUE

(Similar to the notion of innocent until proven guilty)

Always contains the “ = ” sign.

There may be enough evidence to reject the Null Hypothesis. Otherwise it is not rejected.

(continued)

Page 8: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

• Is the opposite of the null hypothesis

e.g. The mean number of TV sets is not 3 (H1: 3)

• Never contains an “=” sign.

The Alternative Hypothesis, H1

Page 9: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

One and Two Tail Hypotheses

H0: 3

H1: < 3

H0: 3

H1: > 3

H0: 3

H1: 3

Page 10: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Step 2

Set the level of significance

Page 11: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

H0: Innocent

The Truth The Truth

Verdict Innocent Guilty Decision H0 True H0 False

Innocent Correct ErrorDo NotReject

H0

1 - Type IIError ( )

Guilty Error Correct RejectH0

Type IError( )

Power(1 - )

Result Possibilities

Jury Trial Hypothesis Test

Page 12: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

• The (level of significance) is selected by the researcher at the start of the research. Typical values are 0.01, 0.05, and 0.10

• The defines unlikely values of sample statistic if null hypothesis is true. This is called rejection region.

Level of Significance,

Page 13: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Level of Significance, and the Rejection Region

H0: 3

H1: < 3

H0: 3

H1: > 3

H0: 3

H1: 3

/2

Critical Value(s)

Page 14: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Type I error

• Is when you reject the null

hypothesis when it is true.

• Probability of type I error is and is set by the researcher.

Errors in Making Decisions

Page 15: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Errors in Making Decisions

Type II Error

Is failing to Reject a False Null Hypothesis

Probability of Type II Error Is (beta)

The PowerPower of The Test Is (1-(1- ) )

Page 16: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Reduce probability of one error and the other one goes up holding everything else unchanged.

& Have an Inverse Relationship

Page 17: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

increases when difference between

hypothesized parameter & its true value decreases.

increases when you are willing to take a bigger chance of a type I error (decreases).

Factors Affecting Type II Error

Page 18: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

increases when population standard

deviation increases

increases when sample size n

decreases

Factors Affecting Type II Error

n

(continued)

Page 19: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

How to Choose between Type I and Type II Errors

• Choice depends on the costs of the errors.

• Choose smaller type I error when the cost

of rejecting the hypothesis is high. At a criminal trial, the presumption is

innocence. A type I error is convicting an innocent person.

Page 20: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Step 3

Gather the data

Page 21: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Step 4

Conduct a statistical test to measure the strength of the

evidence

Page 22: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Sample Mean = 3

Sampling Distribution of

we get a sample mean of this value ...

the population mean is

2

If H0 is true

Weighting the evidence

X

X

Page 23: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Step 5

Reach a conclusion

Page 24: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Sample Mean = 3

Sampling Distribution of

It is unlikely that we would get a sample mean of this value ... ... if in fact this were

the population mean.

... Therefore, we reject the

null hypothesis that = 3.

2

If H0 is true

Rejecting H0

X

X

Page 25: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

An Example of the 5 Steps

For the two tail test on the mean where the population standard

deviation is known.

Page 26: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

AssumptionsPopulation is normally distributed.

If not normal, use large samples.

Z test statistic:

Two-Tail Z Test for the Mean (Known)

/X

X

X XZ

n

Page 27: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Does an average box of cereal contain 368 grams of cereal? A random sample of 25 boxes showed X = 372.5. The company has specified to be 15 grams. Test at the 0.05 level

368 gm.

Example: Two Tail Test

H0: 368

H1: 368

Page 28: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

= 0.05

n = 25

Critical value: ±1.96

Test Statistic:

Decision:

Conclusion:

Do Not Reject at = .05

Cannot prove that the population mean is other

than 368Z0 1.96

.025

Reject

Example Solution: Two Tail

-1.96

.025

H0: 368

H1: 36850.1

2515

3685.372

n

XZ

Page 29: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

p Value Solution

(p Value = 0.1336) ( = 0.05) Do Not Reject.

01.50

Z

Reject

= 0.05

1.96

p Value = 2 x 0.0668

Look up 1.5 in the z table (.9332) and subtract from 1.00 to get .0668. Double if two tail test.

Reject

Page 30: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

The Z Test and Confidence Interval

You will find both give the same conclusion but the test is easier to

use and provides more information.

Page 31: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Connection to Confidence Intervals

For X = 372.5 oz, = 15 and n = 25,

The 95% Confidence Interval is (equation 8.1):

372.5 - (1.96) 15/ to 372.5 + (1.96) 15/ or

366.62 378.38

If this interval contains the hypothesized mean (368), we do not reject the null hypothesis.

_

25 25

Page 32: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

An Example of the 5 Steps

For the one tail test on the mean where the population standard

deviation is known.

Page 33: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Z0

Reject H0

Z0

Reject H0

H0: 0 H1: < 0

H0: 0 H1: > 0

Z Must Be Significantly Below to reject H0

Z Must Be Significantly Above to reject H0

One Tail Tests

Page 34: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Assumptions not changed:Population is normally distributed.

If not normal, use large samples.

Z test statistic not changed:

One-Tail Z Test for the Mean (Known)

/X

X

X XZ

n

Page 35: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Does an average box of cereal contain more than 368 grams of cereal? A random sample of 25 boxes showed X = 372.5. The company has specified to be 15 grams. Test at the 0.05 level

368 gm.

Example: One Tail Test

H0: 368 H1: > 368

Page 36: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Z .04 .06

1.6 .9495 .9505 .9515

1.7 .9591 .9599 .9608

1.8 .9671 .9678 .9686

.9738 .9750

Z0

Z = 1

1.645

.05

1.9 .9744

Standardized Cumulative Normal Distribution Table

(Portion)

What is Z given = 0.05?

= .05

Finding Critical Values: One Tail Tests

Critical Value = 1.645

.95

Page 37: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

= 0.5

n = 25

Critical value: 1.645

Decision:

Conclusion:

Do Not Reject at = .05

Cannot prove that the population mean is

more than 368Z0 1.645

.05

Reject

Example Solution: One Tail

H0: 368 H1: > 368 50.1

n

XZ

Page 38: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Z0 1.50

p Value= .0668

Z Value of Sample Statistic

From Z Table: Lookup 1.50 to Obtain .9332

Use the alternative hypothesis to find the direction of the rejection region.

1.000 - .9332 .0668

p Value is P(Z 1.50) = 0.0668

p Value Solution

Page 39: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

01.50

Z

Reject

(p Value = 0.0668) ( = 0.05) Do Not Reject.

p Value = 0.0668

= 0.05

Test Statistic 1.50 Is In the Do Not Reject Region

p Value Solution

1.645

Page 40: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

An Example of the 5 Steps

For the test on the mean where the population standard deviation

is not known.

Page 41: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Assumption:The population is normally distributed

or only slightly skewed & a large sample taken.

t test with n-1 degrees of freedom

t Test for the Mean (Unknown)

/

XtS n

Page 42: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

Example: One Tail t-Test

Does an average box of cereal contain more than 368 grams of cereal? A random sample of 36 boxes showed X = 372.5, ands 15. Test at the 0.01 level.

368 gm.

H0: 368 H1: 368

is not given

Page 43: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

= 0.01

n = 36, df = 35

Critical value: 2.4377

Test Statistic:

Decision:

Conclusion:

Do Not Reject at = .01

Cannot prove that the population mean is

more than 368t350 2.4377

.01

Reject

Example Solution: One Tail

H0: 368 H1: 368 80.1

3615

3685.372

nSX

t

1.80

Page 44: Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean

The t test and p-value

You will be unable to calculate the p-value for the t test as the book does not provide the necessary

table.