80
1 CASELETS 2. An investigation of the relative merits of two kinds of flash light batteries showed that a random sample 100 batteries of brand A lasted on the average 36.5 hrs with standard deviation of 1.8 hrs, while a random sample of 80 batteries of brand B lasted on the average 36.8 hrs with a standard deviation of 1.5 hrs. Use a Level of significance of 0.05 to test whether the observed difference between the average life time is significant. B A H : 0 Null Hypothesis ,i.e., there is no significant difference between the average life of two brands Alternative Hypothesis (two tailed) B A H : 1 Solution

Sampling and Sampling Methods 2

Embed Size (px)

Citation preview

Page 1: Sampling and Sampling Methods 2

1

CASELETS

2. An investigation of the relative merits of two kinds of flash light batteries showed that a random sample 100 batteries of brand A lasted on the average 36.5 hrs with standard deviation of 1.8 hrs, while a random sample of 80 batteries of brand B lasted on the average 36.8 hrs with a standard deviation of 1.5 hrs. Use a Level of significance of 0.05 to test whether the observed difference between the average life time is significant.

BAH :0Null Hypothesis ,i.e., there is no significant difference between the average life of two brandsAlternative Hypothesis

(two tailed)BAH :1

Solution

Page 2: Sampling and Sampling Methods 2

2

CASELETS(Solution con’t)

nn

BA

nn

xxZ

2

2B

1

2A

2

22

1

21

21

ss

2195.1

8025.2

10024.3

8.365.36Z

Since |Z| =1.2195 is less than the tabular value, i.e., 1.96 ( 5% level of significance). Hence the null hypothesis is accepted and we conclude that there is no significant difference between the average life times of the two brands A and B of the flashlight batteries.

05.0,5.1,8.36,80

8.1,5.36,100

2

1

sBn

sAn

B

A

Page 3: Sampling and Sampling Methods 2

3

CASELETS

3. The mean height of 50 male students who showed above average participation in college athletics was 68.2 inches with a standard deviation of 2.5 inches; while 50 male students who showed no interest in such participation had a mean height of 67.5 inches with a standard deviation of 2.8 inches. Use a Level of significance of 0.05 test the hypothesis that male students who participate in college athletics are taller than other male students.

BAH :0Null Hypothesis ,i.e., there is no significant difference between the mean heights of the male students who participate and who do not participate.

Alternative Hypothesis (one tailed)

BAH :1

Solution

Page 4: Sampling and Sampling Methods 2

4

CASELETS(Solution con’t)

nn

BA

nn

xxZ

2

2B

1

2A

2

22

1

21

21

ss

3188.1

5084.7

5025.6

5.672.68Z

Since |Z| =1.3188 is less than the tabular value, i.e., 1.645 ( 5% level of significance). Hence the null hypothesis is accepted and we conclude that there is no significant difference between the average height of two groups of students.

05.0,8.2,5.67,50

5.2,2.68,50

2

1

sBn

sAn

B

A

Page 5: Sampling and Sampling Methods 2

5

CASELETS

4. The mean yield of wheat from a district A was 200 lbs. With S.D = 10 lbs per acre from a sample of 100 plots. In other district B, the mean yield was 220 lbs with S.D = 12 lbs from a sample of 150 plots. Assuming that the standard deviation of the yield in the entire state was 11 lbs. Test whether there is any significant difference between the mean yield of crops in the two districts. Use a Level of significance of 0.01.

BAH :0Null Hypothesis ,i.e., the mean yield of crops in two districts do not differ significantly.Alternative Hypothesis

(two tailed)BAH :1

Solution

Page 6: Sampling and Sampling Methods 2

6

CASELETS(Solution con’t)

nn

BA

nn

xxZ

2

2

1

2

2

2

1

221

05.7

150121

100121

220210Z

Since |Z| =7.05 is greater than the tabular value, i.e., 2.58 ( 1% level of significance). Hence the null hypothesis is rejected and we conclude that the mean yields of crop in two districts differ significantly.

11

,12,220,150

01.0,10,210,100

2

1

ss

sBn

sAn

BA

B

A

Page 7: Sampling and Sampling Methods 2

7

CASELETS

5. Given the following information relating to two places, A and B, test whether there is any significant difference between their mean wages:

BAH :0Null Hypothesis ,i.e., the mean yield of crops in two districts do not differ significantly.Alternative Hypothesis

(two tailed)BAH :1

Solution

Mean Wages (Rs.)

Standard Deviation (Rs.)

Number of workers

A

47

28

1000

A

49

40

1500

Page 8: Sampling and Sampling Methods 2

8

CASELETS(Solution con’t)

47.1

15001600

1000784

4947Z

Since |Z| =1.47 is less than the tabular value, i.e., 1.96 ( 5% level of significance). Hence the null hypothesis is accepted and we conclude that there is no significant difference in the mean wages at places A and B.

05.0,40,49,1500

,28,47,1000

2

1

sBn

sAn

B

A

nn

BA

nn

xxZ

2

2B

1

2A

2

22

1

21

21

ss

Page 9: Sampling and Sampling Methods 2

9

Sampling Distributions ofSample Proportions

Sampling Distributions

Sampling Distribution of

Sample Mean

Sampling Distribution of

Sample Proportion

Page 10: Sampling and Sampling Methods 2

10

Population Proportions, P

P = the proportion of the population having some characteristic

• Sample proportion ( ) provides an estimate of P:

• 0 ≤ ≤ 1

• has a binomial distribution, but can be approximated by a normal distribution when nP(1 – P) ≥ 9

size sample

interest ofstic characteri the having sample the in itemsofnumber

n

XP ˆ

Page 11: Sampling and Sampling Methods 2

11

Sampling Distribution of P

• Normal approximation:

Properties:

and

(where P = population proportion)

Sampling Distribution

.3

.2

.1 0

0 . 2 .4 .6 8 1

p)P̂E( n

P)P(1

n

XVarσ2

P

ˆ

^

)PP( ˆ

Page 12: Sampling and Sampling Methods 2

12

Z-Value for Proportions

nP)P(1

PP̂

σ

PP̂Z

Standardize to a Z value with the formula:P̂

Page 13: Sampling and Sampling Methods 2

13

CONFIDENCE INTERVAL FOR A POPULATION PROPORTION

The confidence interval for a population proportion

P̂ZσP̂

where is the standard error of the proportion:P̂σ

n

P)-P(1ZP̂ /2

Or

Page 14: Sampling and Sampling Methods 2

14

CASELETS

1. A die was thrown 9000 times and of these 3220 yielded a 3 or 4. Can the die be regarded as unbiased?

3/1:0 PHNull Hypothesis ,i.e., the die is unbiased.

Alternative Hypothesis (two tailed)

3/1:1 PH

Solution: we are given that n = 9000. The observed proportion of successes (getting 3 or 4) in 9000 throws of a die is given by

= 3220/9000 = 0.3578

P = population proportion of success

= getting 3 or 4 in throw of an unbiased die = 2/6 = 1/3 =0.333

Page 15: Sampling and Sampling Methods 2

15

CASELETS(Solution con’t)

Since |Z| =4.94 is greater than 3, it is significant (at all levels of significance) . Hence the null hypothesis is rejected and we conclude that the die is certainly biased.

nP)P(1

PP̂

σ

PP̂Z

94.4

9000)3333.00.3333(1

3333.03578.0Z

Page 16: Sampling and Sampling Methods 2

16

CASELETS

2. In a random sample of 400 persons from a large population, 120 are females. Can it be said that males and females are in the ratio 5 : 3 in the population? Use a 1% level of significance?

375.0:0 PHNull Hypothesis ,i.e., the males and females in the population are in the ratio 5 : 3.

Alternative Hypothesis (two tailed)

375.0:1 PH

Solution: we are given that n = 9000 and X = No. of females in the sample = 120. The observed proportion of females in the sample is given by

= 120/400 = 0.3

P = Proportion of females in the population = 3/8 = 0.375

Page 17: Sampling and Sampling Methods 2

17

CASELETS(Solution con’t)

Since |Z| =3.125 is greater than the tabular value, i.e., 2.58 ( 1% level of significance). Hence the null hypothesis is rejected and we conclude that the males and females in the population are not in the ration 5 : 3.

nP)P(1

PP̂

σ

PP̂Z

125.3

400)375.00.375(1

375.03.0Z

Page 18: Sampling and Sampling Methods 2

18

CASELETS3. A manufacturer claimed that at least 95% of the equipments which he supplied to a factory conformed to specifications. An examination of a sample of 200 pieces of equipment revealed that 18 were faulty. Test this claim at a significance level of 5%.

95.0:0 PHNull Hypothesis ,i.e., Proportion of pieces conforming to specifications in the sample is at least 95%.

Solution: We are given that n = 200 and X = No. of pieces conforming to specifications in the sample = 200-18 = 182.

Proportion of pieces conforming to specifications in the sample

= 182/200 = 0.91

P = Proportion of pieces conforming to specifications in the population = 0.95

Page 19: Sampling and Sampling Methods 2

19

CASELETS(Solution con’t)

Since |Z| =2.6 is greater than the tabular value, i.e., 1.645 ( 5% level of significance). Hence the null hypothesis is rejected and we conclude that the manufacturer claim is rejected.

nP)P(1

PP̂

σ

PP̂Z

6.2

200)95.00.95(1

95.091.0Z

95.0:1 PHAlternative Hypothesis (Left- tailed)

Page 20: Sampling and Sampling Methods 2

20

CASELETS4. In a big city 325 men out of 600 men were found to be smokers. Does this information support the conclusion that the majority of men in the city are smokers. Test this claim at a significance level of 5%.

5.0:0 PHNull Hypothesis ,i.e., no. of smokers and non-smokers are equal in the city.

Solution: We are given that n = 600 and X = No. of smokers = 325.

Sample proportion of smokers is given by

= 325/600 = 0.5417

P = Population proportion of smokers is given by

= 0.5 (No. of smokers and non smokers are equal in population)

Page 21: Sampling and Sampling Methods 2

21

CASELETS(Solution con’t)

Since |Z| =2.04 is greater than the tabular value, i.e., 1.645 ( 5% level of significance for single tail). Hence the null hypothesis is rejected and we conclude that the the majority of men in the city are smokers.

nP)P(1

PP̂

σ

PP̂Z

04.2

600)5.00.5(1

5.05417.0Z

5.0:1 PHAlternative Hypothesis (right- tailed)

Page 22: Sampling and Sampling Methods 2

22

CASELETS5. In sample 400 parts manufactured by a factory, the number of defective parts was found to be 30. The company, however, claimed that at most 5% of their product is defective. Is the claim tenable?

5.0:0 PHNull Hypothesis ,i.e., proportion of defectives is at most 5%.

Solution: We are given that n = 400 and X = No. of defectives in the sample = 30.

Proportion of defectives in the sample is given by

= 30/400 = 0.075P̂

Page 23: Sampling and Sampling Methods 2

23

CASELETS(Solution con’t)

Since |Z| =2.27 is greater than the tabular value, i.e., 1.645 ( 5% level of significance for single tailed). Hence the null hypothesis is rejected and we conclude that the the company's claim of P = 0.05 is not tenable.

nP)P(1

PP̂

σ

PP̂Z

27.2

400)05.00.05(1

05.0075.0Z

5.0:1 PHAlternative Hypothesis (right- tailed)

Page 24: Sampling and Sampling Methods 2

24

6. A random sample of 700 units from a large consignment showed that 200 were damaged. Find (i) 95% and (ii) 99% confidence limits for the proportion of damaged units in the consignment.Solution: We are given n = 700.

= proportion of damaged units in the sample

= 200/700 = 0.286

CASELETS

(i) 95% Confidence Limits for ,i.e., proportion of damaged units in the consignment are given by:

n

P)-P(1ZP̂ /2

Page 25: Sampling and Sampling Methods 2

25

CASELETS(Solution con’t)

)319.0,253.0(

033.0286.0700

0.286)-0.286(196.1286.0

(i) 99% Confidence Limits for ,i.e., proportion of damaged units in the consignment are given by:

)330.0,242.0(

044.0286.0700

0.286)-0.286(158.2286.0

n

P)-P(1ZP̂ /2

Page 26: Sampling and Sampling Methods 2

26

Test for Significance of difference of Proportion

Let us consider two independent samples of size n1 and

n2 from the two populations A and B and let X1and X2

be the observed no. of successes (no. of units possessing the given attribute) in these samples respectively. Then = Observed proportion of successes in the

sample from population A = X1/ n1

= Observed proportion of successes in the sample from population B = X2/ n2

P1ˆ

P2ˆ

Page 27: Sampling and Sampling Methods 2

27

Test for Significance of difference of Proportion(con’t)

Null Hypothesis: H0 : ,i.e., the population proportion are same. In other words H0

is that the sample proportions and do not differ significantly.

Hence H0 : , the test statistic (Z-value) for difference of proportions becomes:

P1ˆ = P2

ˆ = P

P1ˆ P2

ˆ

P1ˆ = P2

ˆ

)11

)(1(

P̂-P̂Z

21

21

nnPP

Page 28: Sampling and Sampling Methods 2

28

Test for Significance of difference of Proportion(con’t)

Remark: In general P, the common population proportion is not known and we used its unbiased estimate provided by both the samples taken together which is given by

nn

P̂nP̂nP

21

221 1

Page 29: Sampling and Sampling Methods 2

29

Test for Significance of difference of Proportion(con’t)

Confidence Limits for difference of Proportions

n

)P-(1P

n

)P-(1PZ)P̂P̂(

2

22

1

11/221

Page 30: Sampling and Sampling Methods 2

30

CASELETS1. A Company has the head office at Kolkata and a branch at Mumbai. The personnel director wanted to know if the workers at the two places would like the introduction of a new plan of work and a survey was conducted for this purpose. Out of a sample of 500 workers at Kolkata, 62% favored the new plan. At Mumbai, out of a sample of 400 workers, 415 were against the new plan. Is there any significant difference between the two groups in their attitude towards the new plan at 5% level?Solution: Let and denote the sample proportion of workers favoring the new plan at Kolkata and Mumbai. We are given

P1ˆ P2

ˆ

P22 59.0ˆ 400,

62.0P̂ 500, 11

==

==

n

n

Page 31: Sampling and Sampling Methods 2

31

CASELETS(Solution con’t)

Null Hypothesis: We set the null hypothesis that the population proportion of workers favoring the new plan in Kolkata and Mumbai is same, i.e.,

H0:

Alternative Hypothesis H1: (Two- tailed)

P1ˆ = P2

ˆ

P1ˆ P2̂

)11

)(1(

P̂-P̂Z

21

21

nnPP

Page 32: Sampling and Sampling Methods 2

32

CASELETS(Solution con’t)

To get the Z-value we first calculate common population proportion using both sample proportion, i.e.,

607.0400500

59.0*40062.0*005

nn

P̂nP̂nP

21

221 1

9155.0

)4001

5001

)(607.01(607.0

0.59-0.62Z

Since |Z| =0.9155 is less than the tabular value, i.e., 1.96 ( 5% level of significance for two tailed). Hence the null hypothesis is accepted and we conclude that the there is no significant difference in the two groups in their attitude.

Page 33: Sampling and Sampling Methods 2

33

CASELETS2. A machine puts out 16 imperfect articles in a sample of 500. After the machine is overhauled, it puts out 3 imperfect articles in a batch of 100. Has the machine improved? (5% level)Solution: We are given n1 = 500 and n2 = 100.

= proportion of defective in the first sample = 16/500 = 0.032

= proportion of defective in the second sample = 3/100 = 0.03

P1ˆ

P2ˆ

Null Hypothesis: H0: , i.e., there is no significant difference in the machine before overhauling and after overhauling.

Alternative Hypothesis H1: (right- tailed)

P1ˆ P2

ˆ

P1ˆ = P2

ˆ

Page 34: Sampling and Sampling Methods 2

34

CASELETS(Solution con’t)

To get the Z-value we first calculate common population proportion using both sample proportion, i.e.,

032.0100500

03.0*100032.0*005

nn

P̂nP̂nP

21

221 1

04.1

)100

1500

1)(032.01(032.0

0.03-0.032Z

)11

)(1(

P̂-P̂Z

21

21

nnPP

Page 35: Sampling and Sampling Methods 2

35

CASELETS(Solution con’t)

Since |Z| =1.04 is less than the tabular value, i.e., 1.645 ( 5% level of significance for right tailed). Hence the null hypothesis is accepted and we conclude that the machine has not improved after overhauling.

Page 36: Sampling and Sampling Methods 2

36

CASELETS3. Before an increase in excise duty on tea, 400 people out of a sample of 500 persons were found to be tea drinkers. After an increase in duty, 400 people were tea drinkers in sample of 600 people. Using standard error of proportion, state whether there is a significant decrease in the consumption of tea. Test at 5% and 1% level of significance.Solution: We are given n1 = 500 and n2 = 600.

= Sample proportion of tea drinkers before increase in excise duty

=400/500 = 0.8

= Sample proportion of tea drinkers after increase in excise duty

=400/600 = 0.67

P1ˆ

P2ˆ

Page 37: Sampling and Sampling Methods 2

37

CASELETS(Solution con’t)

Null Hypothesis: H0: , i.e., there is no significant difference in the consumption of tea before and after the increase in excise duty.

P1ˆ = P2

ˆ

Alternative Hypothesis H1: (right- tailed)

P1ˆ P2

ˆ

To get the Z-value we first calculate common population proportion using both sample proportion, i.e.,

)11

)(1(

P̂-P̂Z

21

21

nnPP

Page 38: Sampling and Sampling Methods 2

38

CASELETS(Solution con’t)

11/8600500

67.0*6008.0*005

nn

P̂nP̂nP

21

221 1

81.4

)6001

5001

)(11/81(11/8

0.67-0.80Z

)11

)(1(

P̂-P̂Z

21

21

nnPP

Page 39: Sampling and Sampling Methods 2

39

CASELETS(Solution con’t)

Since |Z| =4.81 is greater than the tabular value, i.e., 1.645 ( 5% level of significance for right tailed) and 2.33 ( 1% level of significance for right tailed) . Hence the null hypothesis is rejected and we conclude that there is a significant decrease in the consumption of tea after increase in the excise duty.

Page 40: Sampling and Sampling Methods 2

40

CASELETS4. The subject under investigation is the measure of dependence of Tamil on words of Sanskrit origin. One newspaper article reporting the proceedings of the constituent assembly contained 2,025 words of which 729 words were declared by literary critic to be Sanskrit origin. A second article by the same author describing atomic research contained 1,600 words of which 640 words were declared by the same critic to be Sanskrit origin. Examine whether there is any significant difference in the dependence of this writer on words of Sanskrit origin in writing the two articles (5%).Solution: We are given n1 = 2025 and n2 = 1600.

= Sample proportion of Sanskrit origin words in first article

=729/2025 = 0.36

= Sample proportion of Sanskrit origin words in second article

=640/1600 = 0.40

P1ˆ

P2ˆ

Page 41: Sampling and Sampling Methods 2

41

CASELETS(Solution con’t)

Null Hypothesis: H0: , i.e., there is no significant difference in the proportion of Sanskrit words in the writers vocabulary in the two articles. In other words, Tamil does not depend on the words of Sanskrit origin.

P1ˆ = P2

ˆ

Alternative Hypothesis H1: (two- tailed)

P1ˆ P2̂

To get the Z-value we first calculate common population proportion using both sample proportion, i.e.,

)11

)(1(

P̂-P̂Z

21

21

nnPP

Page 42: Sampling and Sampling Methods 2

42

CASELETS(Solution con’t)

38.016002025

4.0*160036.0*2025

nn

P̂nP̂nP

21

221 1

469.2

)1600

12025

1)(38.01(38.0

0.40-0.36Z

)11

)(1(

P̂-P̂Z

21

21

nnPP

Page 43: Sampling and Sampling Methods 2

43

CASELETS(Solution con’t)

Since |Z| =2.469 is greater than the tabular value, i.e., 1.96 ( 5% level of significance for right tailed). Hence the null hypothesis is rejected and we conclude that the proportion of Sanskrit words in the writer’s two articles differ significantly.

Page 44: Sampling and Sampling Methods 2

44

Small Samples

• If the sample size is small, then the distributions of standardized statistics are far from normality and consequently normal test (Z-test) cannot be applied. Hence to deal with small samples, we use student’s t-test also known as Exact Sample Test.

Page 45: Sampling and Sampling Methods 2

45

Student’s t- test

If x1, x2 , …, xn is a random sample of size n from a normal population with mean and variance then student’s t statistic is defined as:

2

n

1iiX

n

1X

n

1ii

22 )X(X1-n

1Swhere and

n

Sμ)X(

t

1-n

sμ)X(

When standard deviation of sample is given

OrWhen standard deviation of sample calculated

Page 46: Sampling and Sampling Methods 2

46

ntx

2

)2/(1S

n

Student’s t- test( con’t)

Confidence Limits for

Page 47: Sampling and Sampling Methods 2

47

CASELETS

1. Ten cartoons are taken at random from an automatic filling machine. The mean net weight of the ten cartoons is 11.8 kg. and standard deviation is 0.15 kg. Does this sample mean differ significantly from the intended weight of 12 kg? You are given that v = 9, t0.05 = 2.26

Solution: We are given that n = 10, = 11.8kg, = 0.15kg

sx

Null Hypothesis: H0: , i.e., the sample mean =11.8 kg does not differ significantly from population mean

Alternative Hypothesis H1: (two- tailed)

kg12 xkg12

kg12

Page 48: Sampling and Sampling Methods 2

48

1-n

sμ)X(

n

Sμ)X(

t

The test statistic is given by

4

1-10

0.15)21(11.8

t

Since |t| =4 is greater than the tabular value of t , i.e., 2.26 (for 9 d.f at 5% level of significance). Hence the null hypothesis is rejected and we conclude that the sample mean differ significantly from the mean .

kg12

(Solution con’t)

CASELETS

Page 49: Sampling and Sampling Methods 2

49

CASELETS

2. A machine is designed to produce insulating washers for electrical devices of average thickness of 0.025 cm. A random sample of 10 washers was found to have an average thickness of 0.024 cm with a standard deviation of 0.002 cm. Test the significance of the deviation. (Value of t for 9 degree of freedom at 5% level is 2.26)Solution: We are given that n = 10, = 0.024 cm, = 0.002 cm

sx

Null Hypothesis: H0: , i.e., there is no significant deviation between sample mean = 0.024 and population mean

Alternative Hypothesis H1: (two- tailed)

cm025.0x

cm025.0

cm025.0

Page 50: Sampling and Sampling Methods 2

50

1-n

sμ)X(

n

Sμ)X(

t

The test statistic is given by

5.1

1-10

0.002)025.0(0.024

t

Since |t|=1.5 is less than the tabular value of t , i.e., 1.96 (for 9 d.f at 5% level of significance). Hence the null hypothesis is accepted and we conclude that there is no significant difference the sample and population mean .

CASELETS(Solution con’t)

Page 51: Sampling and Sampling Methods 2

51

CASELETS

3. The mean weekly sales of the chocolate bar in candy stores was 146.3 bars per store. After an advertising campaign the mean weekly sales in 22 stores for a typical week increased to 153.7 and showed a standard deviation of 17.2. Was the advertising campaign successful Test the significance of the deviation. (Value of t for 21 degree of freedom at 5% level for single tailed test is 1.721)Solution: We are given that n = 22, = 153.7, = 17.2

sx

Null Hypothesis: H0: , i.e., there is no significant deviation between sample mean = 153.7 and population mean

Alternative Hypothesis H1: (right- tailed)

3.146x

3.146

3.146

Page 52: Sampling and Sampling Methods 2

52

1-n

sμ)X(

n

Sμ)X(

t

The test statistic is given by

9716.1

1-22

17.2)3.146(153.7

t

Since |t|=1.97 is greater than the tabular value of t , i.e., 1.721 (for 21 d.f at 5% level of significance). Hence the null hypothesis is rejected and we conclude that the advertising campaign was successful in promoting sales. .

(Solution con’t)

CASELETS

Page 53: Sampling and Sampling Methods 2

53

CASELETS

4. A soap manufacturing company was distributing a particular brand of soap through a large number of retail shops. Before a heavy advertisement campaign, the mean sales per week per shop was 140 dozens. After the campaign, a sample of 26 shops was taken and the mean sales was found to be 147 dozens with a standard deviation 16. Can you consider the advertisement effective. (Value of t for 25 degree of freedom at 5% level for single tailed testis 1.721)Solution: We are given that n = 26, = 147 dozen, = 16 dozen

sx

Null Hypothesis: H0: dozen , i.e., there is no significant deviation between sample mean = 147 and population mean Alternative Hypothesis H1: (right- tailed)

140x

140140

Page 54: Sampling and Sampling Methods 2

54

1-n

sμ)X(

n

Sμ)X(

t

The test statistic is given by

19.2

1-26

16)140(147

t

Since |t|=2.19 is greater than the tabular value of t , i.e., 1.798 (for 21 d.f at 5% level of significance for single tailed). Hence the null hypothesis is rejected and we conclude that the advertising campaign was effective in promoting sales. .

CASELETS(Solution con’t)

Page 55: Sampling and Sampling Methods 2

55

5. Certain pesticides is packed into bags by a machine. A random sample of 10 bags is drawn and their contents are found to weigh (in kg.) as follows:

50, 49, 52, 44, 45, 48, 46, 45, 49, 45,

Test if the average packing can be taken to be 50 kg. (Value of t for 9 degree of freedom at 5% level for two tailed testis 2.262)

CASELETS

Null Hypothesis: H0: , i.e., the average packing is 50 kg.

kg50

Alternative Hypothesis H1: (Two- tailed)

50

Solution:

Page 56: Sampling and Sampling Methods 2

56

CASELETS

3.4710

473

10

45494546484544524950X

n

1X

n

1ii

12.7

})3.4745()3.4749()3.4745()3.4746()3.4748(

)3.4745()3.4744()3.4752()3.4749()3.4750({1-10

1

)X(X1-n

1S

22222

22222

n

1ii

22

(Solution con’t)

Page 57: Sampling and Sampling Methods 2

57

The test statistic is given by

n

Sμ)X(

t

2.3712.0

2.7-

10

12.7

)50(47.3t

Since |t|= 3.12 is greater than the tabular value of t , i.e., 2.262 (for 9 d.f at 5% level of significance for two tailed). Hence the null hypothesis is rejected and we conclude that the average packing cannot be taken to be 50 kg.

(Solution con’t)

CASELETS

Page 58: Sampling and Sampling Methods 2

58

6. A sample of size 9from a normal population gave and . Find a 99% interval for population mean.

CASELETS

99% confidence limits for the population mean are:

Since standard deviation is given, therefore we replace n by n-1 in the denominator in the root

Solution: We are given:

8.15X3.102 S

9,3.10,8.15 2 nSX

ntx

2

)2/(1S

n

1-ntx

2

)2/(1S

n

Page 59: Sampling and Sampling Methods 2

59

CASELETS(Solution con’t)

)6136.19,9864.11(

8136.38.151-9

3.1036.38.51

1-9

3.10t8.51 )2/01.0(19

Page 60: Sampling and Sampling Methods 2

Sampling Distribution for the Difference Between Two Means (small samples)

Independent Sampling

Page 61: Sampling and Sampling Methods 2

61

Small Samples

• To use the t-distribution, both sampled populations must be approximately normally distributed with equal population variances, and the random samples must be selected independently of each other.

• Since we assume that the two population variances are equal, we can construct a pooled sample estimator of for use in making inferences. 2

Page 62: Sampling and Sampling Methods 2

62

Let x1, x2, …, xn1 and y1, y2, …, yn2 be two independent random samples from the given normal populations. We set the Null Hypothesis H0 : ,i.e., the samples have been drawn from the normal populations with the same means. In other words, the sample means do not differ significantly.

21

211

)(

1

2

21

nns

xxt

p

Where is the pooled sample estimator for the two populations and it can be calculated as follows.

S P2

t-value the difference Between Two Population Means

Page 63: Sampling and Sampling Methods 2

63

2

)1()1(

)1()1(

)1()1(

21

222

211

21

222

2112

nn

snsn

nn

snsnsp

n

1ii

11

1X

n

1X

n

1ii

22

2Y

n

1X

Note: Here Degree of freedom for t test is (n1

+ n2 – 2)

t-value the difference Between Two Population Means

n

1ii

2

1

21

1)X(X

1-n

1s

n

1ii

2

2

22

2)X(X

1-n

1s

Page 64: Sampling and Sampling Methods 2

64

Student’s t- test( con’t)

Confidence Limits for Difference of Means )( 21

2

11)2/()(

1

2221 21 nn

stxx pnn

Page 65: Sampling and Sampling Methods 2

65

CASELETS

1. The nicotine content in milligrams of two samples of tobacco were found to be as follows:

Sample A: 24 27 26 21 25

Sample B: 27 30 28 31 22 36

Can it be said that two samples come from normal populations having the same mean? (5%)

Null Hypothesis: H0: , i.e., the two samples have been drawn from the normal populations with the same mean.Alternative Hypothesis H1: (two- tailed)

21

21

Page 66: Sampling and Sampling Methods 2

66

CASELETS( con’t)

6.245

123

)2521262724(5

1X

n

1X

n

1ii

11

1

296

174

)362231283027(6

1X

n

1X

n

1ii

22

2

Page 67: Sampling and Sampling Methods 2

67

CASELETS( con’t)

3.54

2.21

})6.2425()6.2421()6.2426()6.2427()6.2424({1-5

1

)X(X1-n

1

22222

n

1ii

2

1

21

1

s

6.215

108

})2936()2922()2931()2928()2930()2927({1-6

1

)X(X1-n

1

222222

n

1ii

2

2

22

2

s

Page 68: Sampling and Sampling Methods 2

68

36.149

2.129

265

6.21*53.5*4

2

)1()1(

)1()1(

)1()1(

21

222

211

21

222

2112

nn

snsn

nn

snsnsp

CASELETS( con’t)

92.1

61

51

36.14

)296.24(

211

)(

1

2

21

nns

xxt

p

Since |t|= 1.92 is less than the tabular value of t , i.e., 2.262 (for 9 d.f at 5% level of significance for two tailed). Hence the null hypothesis is accepted and we conclude that the samples come from normal populations with the same mean.

Page 69: Sampling and Sampling Methods 2

69

CASELETS

2. A group of 5 patients treated with medicine ‘A’ weigh 42, 39, 48, 60 and 41 kgs.: Second group of 7 patients from the same hospital treated with medicine ‘B’ weigh 38, 42, 56, 64, 68, 69 and 62 kgs. Do you agree with the claim that medicine ‘B’ increases the weight significantly? (The value of t at 5% level of significance for 10 degrees of freedom is 1.81 for one tail)

Null Hypothesis: H0: , i.e., there is no significant difference between the medicines A and B as regards their effect on increase in weight.

Alternative Hypothesis H1: (one- tailed)

21

21

Page 70: Sampling and Sampling Methods 2

70

CASELETS( con’t)

465

230

)4160483942(5

1X

n

1X

n

1ii

11

1

577

399

)62696864564238(7

1X

n

1X

n

1ii

22

2

Page 71: Sampling and Sampling Methods 2

71

CASELETS( con’t)

5.724

290

})4641()4660()4648()4639()4642({1-5

1

)X(X1-n

1

22222

n

1ii

2

1

21

1

s

34.1546

926

})5762()5769()5768()5764()5756()5742()5738({1-7

1

)X(X1-n

1

2222222

n

1ii

2

2

22

2

s

Page 72: Sampling and Sampling Methods 2

72

6.12110

1216

275

34.154*75.72*4

2

)1()1(

)1()1(

)1()1(

21

222

211

21

222

2112

nn

snsn

nn

snsnsp

CASELETS( con’t)

7.1

71

51

6.121

)5746(

211

)(

1

2

21

nns

xxt

p

Since |t|= 1.7 is less than the tabular value of t , i.e., 1.81 (for 10 d.f at 5% level of significance for one tailed). Hence the null hypothesis is accepted and we conclude that the medicines A and B do not differ significantly as regards their effect on increases weight

Page 73: Sampling and Sampling Methods 2

73

CASELETS

3. A random sample of 20 daily workers of State A was found to have average daily earnings of Rs. 44 with sample variance 900. Another sample of 20 daily workers from State B was found to earn on an average Rs. 30 per day with sample variance 400. Test whether the workers in State A are earning more than those in State B. (5% level of significance)

Null Hypothesis: H0: , i.e., there is no significant difference in the average daily earnings of the workers in States A and B.

Alternative Hypothesis H1: (one- tailed right)

BA

BA

Solution: We are given that

;400,30,20;900,44,20 22 sxnsxn BBBAAA

Page 74: Sampling and Sampling Methods 2

74

65038

24700

22020

400*19900*19

2

)1()1(

)1()1(

)1()1( 22222

BA

BBAA

BA

BBAAp nn

snsn

nn

snsns

CASELETS( con’t)

738.1

201

201

650

)3044(

211

)(

1

2

21

nns

xxt

p

Since |t|= 1.738 is greater than the tabular value of t , i.e., 1.645 (for 38 d.f at 5% level of significance for one tailed). Hence the null hypothesis is rejected and we conclude that the workers in the State A are earning more than those in State B.

Page 75: Sampling and Sampling Methods 2

75

CASELETS

4. The means of two random samples of size 9 and 7 are 196.42 and 198.82 respectively. The sum of the squares of the deviations from the mean are 26.94 and 18.73 respectively. Can the samples be considered to have been drawn from the same normal population? (5% level of significance)

Null Hypothesis: H0: , i.e., both the samples have been drawn from the same normal population.Alternative Hypothesis H1: (two-

tailed)

21

21

Solution: We are given that

73.18)(,82.198,7

;94.26)(,42.196,9

22

22

12

11

xxxn

xxxn

Page 76: Sampling and Sampling Methods 2

76

CASELETS( con’t)

3675.38

94.26

}94.26{1-9

1

)X(X1-n

1 n

1ii

2

1

21

1

s

1217.36

73.18

}73.18{1-7

1

)X(X1-n

1 n

1ii

2

2

22

2

s

Page 77: Sampling and Sampling Methods 2

77

26.314

67.45

279

1217.3*63675.3*8

2

)1()1(

)1()1(

)1()1(

21

222

211

21

222

2112

nn

snsn

nn

snsnsp

CASELETS( con’t)

64.2

71

91

26.3

)82.19842.196(

211

)(

1

2

21

nns

xxt

p

Since |t|= 2.64 is greater than the tabular value of t , i.e., 2.15 (for 14 d.f at 5% level of significance for two tailed). Hence the null hypothesis is rejected and we conclude that the samples cannot be considered to have come from the same normal population.

Page 78: Sampling and Sampling Methods 2

78

CASELETS

5. The average number of articles produced by two machines per day are 200 and 250 with standard deviations 20 and 25 respectively on the basis of records of 25 days’ production. Can you regard both the machines equally efficient at 15 level of significance?

Null Hypothesis: H0: , i.e., both the machines are equally efficient

Alternative Hypothesis H1: (two- tailed)

21

21

Solution: We are given that

;25,250,25;20,200,25 222111 sxnsxn

Page 79: Sampling and Sampling Methods 2

79

5.51248

24600

22525

625*24400*24

2

)1()1(

)1()1(

)1()1(

21

222

211

21

222

2112

nn

snsn

nn

snsnsp

CASELETS( con’t)

809.7

251

251

5.512

)250200(

211

)(

1

2

21

nns

xxt

p

Since |t|= 7.809 is greater than the tabular value of t , i.e., 2.58 (for 48 d.f at 1% level of significance for two tailed). Hence the null hypothesis is rejected and we conclude that both the machines are not equally efficient at 1% level of significance.

Page 80: Sampling and Sampling Methods 2

80

If the mean of these three values is 8.0, then X3 must be 9 (i.e., X3 is not free to vary)

Degrees of Freedom (df)

Here, n = 3, so degrees of freedom = n – 1 = 3 – 1 = 2

(2 values can be any numbers, but the third is not free to vary for a given mean)

Idea: Number of observations that are free to vary after sample mean has been calculated

Example: Suppose the mean of 3 numbers is 8.0

Let X1 = 7

Let X2 = 8

What is X3?