50
Horng-Chyi Horng Horng-Chyi Horng Statistics II Statistics II 1 Inference on the Mean of a Inference on the Mean of a Population Population - - Variance Known Variance Known H H 0 : : = = 0 H H 1 : : 0 , where , where 0 is a specified constant. is a specified constant. Sample mean is the unbiased point estimator for population Sample mean is the unbiased point estimator for population mean. mean. &4-4 (&8-2) 1 , 0 ~ then ), ( true is H if Therefore, . , ~ then , variance and mean with on distribut a from drawn samples are , , , If 0 0 0 0 2 2 2 1 N n X Z n N X X X X n

Inference on the Mean of a Population - Variance Known

  • Upload
    adanna

  • View
    51

  • Download
    0

Embed Size (px)

DESCRIPTION

Inference on the Mean of a Population - Variance Known. &4-4 (&8-2). H 0 : m = m 0 H 1 : m  m 0 , where m 0 is a specified constant. Sample mean is the unbiased point estimator for population mean. The Reasoning. - PowerPoint PPT Presentation

Citation preview

Page 1: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 11

Inference on the Mean of a PopulationInference on the Mean of a Population--Variance KnownVariance Known

HH00: : = = 00

HH11: : 00 , where , where 00 is a specified constant. is a specified constant.

Sample mean is the unbiased point estimator for population mean.Sample mean is the unbiased point estimator for population mean.

&4-4 (&8-2)

1,0~

then),( trueis H if Therefore,

.,~ then , varianceand mean with

ondistributi a fromdrawn samples are ,,, If

00

00

22

21

Nn

XZ

nNX

XXX n

Page 2: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 22

The ReasoningThe Reasoning

For HFor H00 to be true, the value of Z to be true, the value of Z00 can not be too large or too can not be too large or too

small.small.

Recall that 68.3% of ZRecall that 68.3% of Z00 should fall within (-1, +1) should fall within (-1, +1)

95.4% of Z95.4% of Z00 should fall within (-2, +2) should fall within (-2, +2)

99.7% of Z99.7% of Z00 should fall within (-3, +3) should fall within (-3, +3)

What values of ZWhat values of Z00 should we reject H should we reject H00? (based on ? (based on value) value)

What values of ZWhat values of Z00 should we conclude that there is not should we conclude that there is not

enough evidence to reject Henough evidence to reject H00??

Page 3: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 33

Page 4: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 44

Example 8-2Example 8-2

Aircrew escape systems are powered by a solid propellant. The burning rate of this propellant is an important product characteristic. Specifications require that the mean burning rate must be 50 cm/s. We know that the standard deviation of burning rate is 2 cm/s. The experimenter decides to specify a type I error probability or significance level of α = 0.05. He selects a random sample of n = 25 and obtains a sample average of the burning rate of x = 51.3 cm/s. What conclusions should be drawn?

Page 5: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 55

Page 6: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 66

Hypothesis Testing on Hypothesis Testing on - Variance Known- Variance Known

H1 Test Statistic Reject H0 if 0 Z0 > z or Z0 < - z

> 0 Z0 > z

< 0n

XZ

0

0

Z0 < -z

Page 7: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 77

P-Values in Hypothesis Tests(I)P-Values in Hypothesis Tests(I)

Where ZWhere Z00 is the test statistic, and is the test statistic, and (z) is the standard (z) is the standard

normal cumulative function.normal cumulative function. In example 8-2, ZIn example 8-2, Z00 = 3.25, P-Value = 2[1- = 3.25, P-Value = 2[1-(3.25)] = (3.25)] =

0.00120.0012

Page 8: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 88

P-Values of Hypothesis Testing on P-Values of Hypothesis Testing on - Variance Known- Variance Known

H1 Test Statistic P-Value

0 P = 2 * P(Z |Z0|)

> 0 P = P(Z Z0 )

< 0

n

XZ

0

0

P = P(Z Z0 )

Page 9: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 99

P-Values in Hypothesis Tests(II)P-Values in Hypothesis Tests(II)

-value is the maximum type I error allowed, while P--value is the maximum type I error allowed, while P-value is the real type I error calculated from the sample.value is the real type I error calculated from the sample.

-value is preset, while P-value is calculated from the -value is preset, while P-value is calculated from the sample.sample.

When P-value is less than When P-value is less than -value, we can safely make the -value, we can safely make the conclusion “Reject Hconclusion “Reject H00”. By doing so, the error we are ”. By doing so, the error we are

subjected to (P-value) is less than the maximum error subjected to (P-value) is less than the maximum error allowed (allowed (-value). -value).

Page 10: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1010

Type II ErrorType II Error- Fail to reject H- Fail to reject H00 while H while H00 is false is false

Page 11: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1111

How to calculate Type II Error? (I)How to calculate Type II Error? (I)(H(H00: : = = 00 Vs. H Vs. H11: : 00))

Under the circumstance of type II error, HUnder the circumstance of type II error, H00 is false. Supposed is false. Supposed

that the true value of the mean is that the true value of the mean is = = 00 + + , where , where > 0. The > 0. The

distribution of Zdistribution of Z00 is: is:

1 ,~ Therefore,

1 ,0

0

000

nNZ

nN

nn

X

n

XZ

Page 12: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1212

How to calculate Type II Error? (II)How to calculate Type II Error? (II) - refer to section &4.3 (&8.1) - refer to section &4.3 (&8.1)

Type II error occurred when (fail to reject HType II error occurred when (fail to reject H00 while H while H00 is is

false) false)

1 ,/

~ where 02

02 n

NZZZZ

nZ

nZ

nZZ

nZP

nZ

nZ

nZ

P

ZZZP

//

//

1/

1/

1/

2/2/

2/2/

2/02/

2/02/

Therefore, Therefore,

Page 13: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1313

The Sample Size (I)The Sample Size (I)

Given values of Given values of and and , find the required sample size n to , find the required sample size n to achieve a particular level of achieve a particular level of ....

02

222/

2/

2/

2/2/

re whe

/ Then,

Let

0 when /

// Since

ZZn

nZZ

Z

nZ

nZ

nZ

Page 14: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1414

The Sample Size (II)The Sample Size (II)

Two-sided Hypothesis TestingTwo-sided Hypothesis Testing

One-sided Hypothesis TestingOne-sided Hypothesis Testing

Page 15: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1515

Example 8-3Example 8-3

Page 16: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1616

The Operating Characteristic CurvesThe Operating Characteristic Curves- Normal test (z-test)- Normal test (z-test)

Use to performing sample size or type II error Use to performing sample size or type II error calculations.calculations.

The parameter d is defined as:The parameter d is defined as:

so that it can be used for all problems regardless of the so that it can be used for all problems regardless of the values of values of 00 and and ..

Chart VI a,b,c,d are for Z-test.Chart VI a,b,c,d are for Z-test.

|||| 0

d

Page 17: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1717

Example 8-5Example 8-5

Page 18: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1818

Page 19: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 1919

Page 20: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2020

Large Sample TestLarge Sample Test

If n If n 30, then the sample variance s 30, then the sample variance s22 will be close to will be close to 22 for most samples.for most samples.

Therefore, if population variance Therefore, if population variance 22 is unknown but n is unknown but n 30, we can substitute 30, we can substitute with s in the test procedure with with s in the test procedure with little harmful effect.little harmful effect.

Page 21: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2121

Large Sample Hypothesis Testing on Large Sample Hypothesis Testing on - Variance Unknown but n - Variance Unknown but n 30 30

H1 Test Statistic Reject H0 if 0 Z0 > z or Z0 < - z

> 0 Z0 > z

< 0ns

XZ 0

0

Z0 < -z

Page 22: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2222

Statistical Vs. Practical SignificanceStatistical Vs. Practical Significance

Practical Significance = 50.5-50 = 0.5Practical Significance = 50.5-50 = 0.5 Statistical Significance P-Value for each sample size n.Statistical Significance P-Value for each sample size n.

Page 23: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2323

NotesNotes

be careful when interpreting the results from hypothesis tebe careful when interpreting the results from hypothesis testing when the sample size is large, sting when the sample size is large, because any small depbecause any small departure from the hypothesized value arture from the hypothesized value 00 will probably be det will probably be det

ected, even when the difference is of little or no practical siected, even when the difference is of little or no practical significance.gnificance.

In general, two types of conclusion can be drawn:In general, two types of conclusion can be drawn:

1. At 1. At = 0.**, we have enough evidence to reject H= 0.**, we have enough evidence to reject H00..

2. At 2. At = 0.**, we do not have enough evidence to reject = 0.**, we do not have enough evidence to reject HH00..

Page 24: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2424

Confidence Interval on the Mean (I)Confidence Interval on the Mean (I)

Point Vs. Interval EstimationPoint Vs. Interval Estimation

The general form of interval estimate is The general form of interval estimate is

L L U U

in which we always attach a possible error in which we always attach a possible error such that such that

P(L P(L U) = 1- U) = 1-That is, we have 1-That is, we have 1- confidence that the true value of confidence that the true value of will fall within [L, U].will fall within [L, U].

Interval Estimate is also called Confidence Interval (C.I.).Interval Estimate is also called Confidence Interval (C.I.).

Page 25: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2525

Confidence Interval on the Mean (II)Confidence Interval on the Mean (II)

L is called the lower-confidence limit andL is called the lower-confidence limit and

U is the upper-confidence limit.U is the upper-confidence limit.

Two-sided C.I. Vs. One-sided C.I.Two-sided C.I. Vs. One-sided C.I.

Page 26: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2626

Construction of the C.I.Construction of the C.I.

From Central Limit Theory, From Central Limit Theory,

. ,~ ,25 and ,~ 22

nNXnXIf Use standardization and the properties of Z, Use standardization and the properties of Z,

1//

1

1 and

2/2/

2/2/

2/2/

nzXnzXP

zn

XzP

zZzPn

XZ

Page 27: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2727

Formula for C.I. on the Mean with Variance KnownFormula for C.I. on the Mean with Variance Known

Used whenUsed when

1. Variance known1. Variance known

2. n 2. n 30, use s to estimate 30, use s to estimate ..

Page 28: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2828

Example 8-6Example 8-6 Consider the rocket propellant problem in Example 8-2. Consider the rocket propellant problem in Example 8-2. Find a 95% C.I. on the mean burning rate?Find a 95% C.I. on the mean burning rate?

95% C.I => 95% C.I => = 0.05, = 0.05,

zz/2 /2 = z= z0.0250.025 = 1.96 = 1.96

Page 29: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 2929

Notes - C.I. Notes - C.I.

Relationship between Hypothesis Testing and C.I.sRelationship between Hypothesis Testing and C.I.s

Confidence level (1-Confidence level (1-) and precision of estimation (C.I. * ) and precision of estimation (C.I. * 1/2)1/2)

Sample size and C.I.sSample size and C.I.s

Page 30: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3030

Choice of Sample Size to Achieve Precision of EstimationChoice of Sample Size to Achieve Precision of Estimation

Page 31: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3131

Example 8-7Example 8-7

Page 32: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3232

One-Sided C.I.s on the MeanOne-Sided C.I.s on the Mean

Page 33: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3333

Inference on the Mean of a PopulationInference on the Mean of a Population--Variance UnknownVariance Unknown

HH00: : = = 00

HH11: : 00 , where , where 00 is a specified constant. is a specified constant.

Variance unknown, therefore, use s instead of Variance unknown, therefore, use s instead of in the test in the test statistic.statistic.

&4-5 (&8-3)

If n is large enough (If n is large enough ( 30), we can use the test procedure 30), we can use the test procedure in &4-4 (&8-2). However, n is usually small. In this case, in &4-4 (&8-2). However, n is usually small. In this case, T0 will not follow the standard normal distribution.T0 will not follow the standard normal distribution.

Page 34: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3434

Inference on the Mean of a PopulationInference on the Mean of a Population--Variance UnknownVariance Unknown

Let XLet X11, X, X22, …, X, …, Xnn be a random sample for a normal distribu be a random sample for a normal distribu

tion with unknown mean tion with unknown mean and unknown variance and unknown variance 22. The . The quantityquantity

has a t distribution with n - 1 degrees of freedom.has a t distribution with n - 1 degrees of freedom.

Page 35: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3535

freedom. of degrees ofnumber theis

1/

1

2/

2/1

:ondistributi t of pdf

2/12

k

kxkk

kxf k

Page 36: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3636

The ReasoningThe Reasoning

For HFor H00 to be true, the value of T to be true, the value of T00 can not be too large or too can not be too large or too

small.small.

What values of TWhat values of T00 should we reject H should we reject H00? (based on ? (based on value) value)

What values of TWhat values of T00 should we conclude that there is not should we conclude that there is not

enough evidence to reject Henough evidence to reject H00??

Although when n Although when n 30, we can use Z 30, we can use Z00 in section &8-2 to in section &8-2 to

perform the testing instead. We prefer using Tperform the testing instead. We prefer using T00 to more to more

accurately reflect the real testing result if t-table is available.accurately reflect the real testing result if t-table is available.

Page 37: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3737

Page 38: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3838

Example 8-8Example 8-8

Page 39: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 3939

Page 40: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4040

Testing for Normality (Example 8-8)Testing for Normality (Example 8-8)- t-test assumes that the data are a random sample from a - t-test assumes that the data are a random sample from a normal populationnormal population

(1) Box Plot(1) Box Plot (2) Normality Probability Plot(2) Normality Probability Plot

Page 41: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4141

Hypothesis Testing on Hypothesis Testing on - Variance Unknown- Variance Unknown

H1 Test Statistic Reject H0 if

0 T0 > t or T0 < - t

> 0 T0 > t

< 0

ns

XT 0

0

T0 < -t

Page 42: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4242

Finding P-ValuesFinding P-Values

Steps:Steps:

1. Find the degrees of freedom (k = n-1)in the t-table.1. Find the degrees of freedom (k = n-1)in the t-table.

2. Compare T2. Compare T00 to the values in that row and find the to the values in that row and find the

closest one.closest one.

3. Look the 3. Look the value associated with the one you pick. The value associated with the one you pick. The p-value of your test is equal to this p-value of your test is equal to this value. value.

In example 8-8, TIn example 8-8, T00 = 4.90, k = n-1 = 21, P-Value < 0.0005 = 4.90, k = n-1 = 21, P-Value < 0.0005

because the t value associated with (k = 21, because the t value associated with (k = 21, = 0.0005) is = 0.0005) is 3.819.3.819.

Page 43: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4343

P-Values of Hypothesis Testing on P-Values of Hypothesis Testing on - Variance Unknown- Variance Unknown

H1 Test Statistic P-Value

0 P = 2 * P(tn-1 |T0|)

> 0 P = P(tn-1 T0 )

< 0

ns

XT 0

0

P = P(tn-1 T0 )

Page 44: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4444

The Operating Characteristic CurvesThe Operating Characteristic Curves- t-test- t-test

Use to performing sample size or type II error calculations.Use to performing sample size or type II error calculations. The parameter d is defined as:The parameter d is defined as:

so that it can be used for all problems regardless of the valso that it can be used for all problems regardless of the values of ues of 00 and and ..

Chart VI e,f,g,h are used in t-test. (pp. A14-A15)Chart VI e,f,g,h are used in t-test. (pp. A14-A15)

|||| 0

d

Page 45: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4545

Example 8-9Example 8-9

In example 8-8, if the mean load at failure differs from 10 In example 8-8, if the mean load at failure differs from 10 MPa by as much as 1 MPa, is the sample size n = 22 adeqMPa by as much as 1 MPa, is the sample size n = 22 adequate to ensure that Huate to ensure that H00 will be rejected with probability at l will be rejected with probability at l

east 0.8?east 0.8?

s = 3.55, therefore, d = 1.0/3.55 = 0.28.s = 3.55, therefore, d = 1.0/3.55 = 0.28.

Appendix Chart VI g, for d = 0.28, n = 22 => Appendix Chart VI g, for d = 0.28, n = 22 => = 0.68 = 0.68

The probability of rejecting HThe probability of rejecting H00: : = 10 if the true mean exceeds this by 1.0 MP = 10 if the true mean exceeds this by 1.0 MP

a (reject Ha (reject H00 while H while H00 is false) is approximately 1 - is false) is approximately 1 - = 0.32, which is too smal = 0.32, which is too smal

l. Therefore n = 22 is not enough. l. Therefore n = 22 is not enough.

At the same chart, d = 0.28, At the same chart, d = 0.28, = 0.2 (1- = 0.2 (1-=0.8) => n = 75=0.8) => n = 75

Page 46: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4646

Page 47: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4747

Construction of the C.I. on the MeanConstruction of the C.I. on the Mean - Variance Unknown - Variance Unknown

In general, the distribution of In general, the distribution of

is t with n-1 d.f. is t with n-1 d.f.

ns

XT

Use the properties of t with n-1 d.f., Use the properties of t with n-1 d.f.,

1//

1

1

1,2/1,2/

1,2/1,2/

1,2/1,2/

nstXnstXP

tns

XtP

tTtP

nn

nn

nn

Page 48: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4848

Formula for C.I. on the Mean with Variance UnknownFormula for C.I. on the Mean with Variance Unknown

Page 49: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 4949

Example 8-10Example 8-10 Reconsider the tensile adhesive problem in Example 8-8. Reconsider the tensile adhesive problem in Example 8-8. Find a 95% C.I. on the mean?Find a 95% C.I. on the mean?

N = 22, sample mean = 13.71, s = 3.55, tN = 22, sample mean = 13.71, s = 3.55, t/2,n-1 /2,n-1 = t= t0.025,21 0.025,21 = 2.= 2.

080080

13.71 - 2.080 (3.55) / 13.71 - 2.080 (3.55) / 22 22 13.71 + 2.080 (3.55) / 13.71 + 2.080 (3.55) / 2222

13.71 - 1.57 13.71 - 1.57 13.71 + 1.57 13.71 + 1.57

12.14 12.14 15.28 15.28

The 95% C.I. On the mean is [12.14, 15.28] The 95% C.I. On the mean is [12.14, 15.28]

nstXnstX nn // 1,2/1,2/

Page 50: Inference on the Mean of a Population - Variance Known

Horng-Chyi HorngHorng-Chyi Horng Statistics IIStatistics II 5050

Final Note for the Inference on the MeanFinal Note for the Inference on the Mean

Variance Sample Size Population Type Testing Method

N > 30 All Z

Normal Z

Known

N < 30

Non-normal Chebyshev’s

Inequality

N > 30 All T or Z

Normal T

Unknown

N < 30

Non-normal Non-parametric