6
Page 1 of 6 Ramin Shamshiri STA 5328, HW #3 Due 07/20/09 STA 5328, Homework #3 Due July 20, 2009 Ramin Shamshiri UFID # 90213353 Note: Problem numbers are according to the 6 th text edition. 8.2. Suppose that 1 = 2 = , 1 = 1 2 , and 2 = 2 2 . Consider the estimator 3 = 1 + 1 βˆ’ 2 . a. Show that 3 is an unbiased estimator for . b. If 1 and 2 are independent, how should the constant a be chosen in order to minimize the variance of 3 ? Solution a: is an unbiased estimator if ( )= . Here we have ( 3 )= [ 1 + 1 βˆ’ 2 ] = ( 1 )+ 1 βˆ’ ( 2 ) = + 1 βˆ’ = + βˆ’ = β–  Solution b: Recall that: + = 2 + 2 () Therefore, 3 = 1 + 1 βˆ’ 2 = 2 1 + 1 βˆ’ 2 ( 2 ) 3 = 2 1 2 + 1 βˆ’ 2 2 2 In order to find the minimum value of 3 for a critical point a, we should set the first derivative of 3 with respect to a equal to zero to find a. ( 3 ) =0 2 1 2 βˆ’ 21 βˆ’ 2 2 =0 1 2 βˆ’ 2 2 + 2 2 =0 1 2 + 2 2 = 2 2 = + In order to check that the value of a will minimize 3 , we can either substitute a in 3 or we can use the second derivative test. 2 ( 3 ) 2 =2 1 2 +2 2 2 β‰₯ 0 Using the second derivative test, it is confirmed that = + β– 

STA5328_Ramin_Shamshiri_HW3

Embed Size (px)

DESCRIPTION

STA5328_Ramin_Shamshiri_HW3

Citation preview

Page 1: STA5328_Ramin_Shamshiri_HW3

Page 1 of 6

Ramin Shamshiri STA 5328, HW #3 Due 07/20/09

STA 5328, Homework #3

Due July 20, 2009

Ramin Shamshiri

UFID # 90213353 Note: Problem numbers are according to the 6

th text edition.

8.2. Suppose that 𝐸 πœƒ 1 = 𝐸 πœƒ 2 = πœƒ,𝑉 πœƒ 1 = 𝜎12, and 𝑉 πœƒ 2 = 𝜎2

2 .

Consider the estimator πœƒ 3 = π‘Žπœƒ 1 + 1 βˆ’ π‘Ž πœƒ 2.

a. Show that πœƒ 3is an unbiased estimator for πœƒ.

b. If πœƒ 1 and πœƒ 2 are independent, how should the constant a be chosen in order to minimize the variance of

πœƒ 3?

Solution a:

πœƒ is an unbiased estimator if 𝐸(πœƒ ) = πœƒ. Here we have

𝐸(πœƒ 3) = 𝐸[π‘Žπœƒ 1 + 1βˆ’ π‘Ž πœƒ 2]

= π‘ŽπΈ(πœƒ 1) + 1βˆ’ π‘Ž 𝐸(πœƒ 2)

= π‘Žπœƒ + 1 βˆ’ π‘Ž πœƒ

= π’‚πœ½+ πœ½βˆ’ π’‚πœ½ = 𝜽

β– 

Solution b:

Recall that:

𝑉 π‘Žπ‘‹ + π‘π‘Œ = π‘Ž2𝑉 𝑋 + 𝑏2𝑉(π‘Œ)

Therefore,

𝑉 πœƒ 3 = 𝑉 π‘Žπœƒ 1 + 1βˆ’ π‘Ž πœƒ 2

= π‘Ž2𝑉 πœƒ 1 + 1 βˆ’ π‘Ž 2𝑉(πœƒ 2)

𝑉 πœƒ 3 = π‘Ž2𝜎12 + 1βˆ’ π‘Ž 2𝜎2

2

In order to find the minimum value of 𝑉 πœƒ 3 for a critical point a, we should set the first derivative of 𝑉 πœƒ 3 with respect to a equal to zero to find a.

𝑑𝑉(πœƒ 3)

π‘‘π‘Ž= 0

2π‘ŽπœŽ12 βˆ’ 2 1βˆ’ π‘Ž 𝜎2

2 = 0

π‘ŽπœŽ12 βˆ’ 𝜎2

2 + π‘ŽπœŽ22 = 0

π‘Ž 𝜎12 + 𝜎2

2 = 𝜎22

𝒂 =𝝈𝟐𝟐

𝝈𝟏𝟐+𝝈𝟐

𝟐

In order to check that the value of a will minimize 𝑉 πœƒ 3 , we can either substitute a in 𝑉 πœƒ 3 or we can use the

second derivative test.

𝑑2𝑉(πœƒ 3)

π‘‘π‘Ž2= 2π‘ŽπœŽ1

2 + 2π‘ŽπœŽ22 β‰₯ 0

Using the second derivative test, it is confirmed that 𝐚 =𝝈𝟐𝟐

𝝈𝟏𝟐+𝝈𝟐

𝟐

β– 

Page 2: STA5328_Ramin_Shamshiri_HW3

Page 2 of 6

Ramin Shamshiri STA 5328, HW #3 Due 07/20/09

8.13. If Y has a binomial distribution with parameters n and p, then 𝑝 1 = π‘Œ/𝑛 is an unbiased estimator of p.

Another estimator of p is 𝑝 2 = (π‘Œ + 1)/(𝑛 + 2).

a. Derive the bias of 𝑝 2.

b. Derive 𝑀𝑆𝐸(𝑝 1) π‘Žπ‘›π‘‘ 𝑀𝑆𝐸(𝑝 2).

c. For what values of p is 𝑀𝑆𝐸(𝑝 1) < 𝑀𝑆𝐸(𝑝 2)?

Solution a:

The bias of a point estimator πœƒ is given by 𝐡 πœƒ = 𝐸 πœƒ βˆ’ πœƒ

Therefore, 𝐡 𝑝 2 = 𝐸 𝑝 2 βˆ’ 𝑝

= 𝐸 π‘Œ + 1

𝑛 + 2 βˆ’ 𝑝 =

1

𝑛 + 2𝐸 π‘Œ + 1 βˆ’ 𝑝

=1

𝑛 + 2𝐸 π‘Œ +

1

𝑛 + 2𝐸 1 βˆ’ 𝑝

=𝑛𝑝

𝑛 + 2+

1

𝑛 + 2βˆ’ 𝑝 =

𝑛𝑝 + 1 βˆ’ 𝑝(𝑛 + 2)

𝑛 + 2=𝟏 βˆ’ πŸπ’‘

𝒏+ 𝟐

β– 

Solution b:

𝑀𝑆𝐸(𝑝 1) = 𝑉 𝑝 1 + 𝐡 𝑝 1 2

𝑉 𝑝 1 =𝑝(1 βˆ’ 𝑝)

𝑛 , 𝐡 𝑝 1 = 0

Therefore: 𝑴𝑺𝑬(𝒑 𝟏) = 𝑽 𝒑 𝟏 =𝒑(πŸβˆ’π’‘)

𝒏

𝑀𝑆𝐸(𝑝 2) = 𝑉 𝑝 2 + 𝐡 𝑝 2 2

𝑉 𝑝 2 = 𝑉 π‘Œ+1

𝑛+2 , Recall that: 𝑉 π‘Žπ‘‹ + π‘π‘Œ = π‘Ž2𝑉 𝑋 + 𝑏2𝑉(π‘Œ)

𝑉 𝑝 2 =1

𝑛 + 2 2𝑉 π‘Œ +

1

𝑛 + 2 2𝑉 1 =

1

𝑛 + 2 2𝑉 π‘Œ + 0 =

𝑛𝑝(1βˆ’ 𝑝)

𝑛 + 2 2

𝑉 𝑝 2 =𝑛𝑝 1βˆ’ 𝑝

𝑛 + 2 2 π‘Žπ‘›π‘‘ 𝐡 𝑝 2

2 = 1 βˆ’ 2𝑝

𝑛 + 2

2

𝑀𝑆𝐸(𝑝 2) =𝑛𝑝 1βˆ’π‘

𝑛+ 2 2 +

1 βˆ’ 2𝑝

𝑛 + 2

2

=𝒏𝒑 𝟏 βˆ’ 𝒑 + 𝟏 βˆ’ πŸπ’‘ 𝟐

𝒏+ 𝟐 𝟐

β– 

Solution c:

𝑝(1 βˆ’ 𝑝)

𝑛<𝑛𝑝 1 βˆ’ 𝑝 + 1 βˆ’ 2𝑝 2

𝑛 + 2 2

𝑛 + 2 2𝑝 1 βˆ’ 𝑝 βˆ’ 𝑛2𝑝 1βˆ’ 𝑝 βˆ’ 𝑛 1 βˆ’ 𝑝2 < 0

𝑛 + 2 2 βˆ’ 𝑛 𝑝 1 βˆ’ 𝑝 βˆ’ 𝑛 1 βˆ’ 2𝑝 2 < 0

4𝑛𝑝 βˆ’ 4𝑛𝑝2 + 4𝑝 βˆ’ 4𝑝2 βˆ’ 𝑛 + 4𝑛𝑝 βˆ’ 4𝑛𝑝2 < 0

𝑝2 8𝑛 + 4 βˆ’ 𝑝 4 + 8𝑛 + 𝑛 > 0

βˆ†= 𝑏2 βˆ’ 4π‘Žπ‘ = 4 + 8𝑛 2 βˆ’ 4 8𝑛 + 4 𝑛

𝑝1,𝑝2 = 4 + 8𝑛 Β± 4 + 8𝑛 2 βˆ’ 4 8𝑛 + 4 𝑛

2 8𝑛 + 4 =

1

2Β±

[ 4 + 8𝑛 2 βˆ’ 4 8𝑛 + 4 𝑛]

4 8𝑛 + 4 2 =

𝟏

𝟐±

𝒏 + 𝟏

πŸ–π’ + πŸ’

Therefore 𝒑 β‰ˆπŸ

𝟐

β– 

Page 3: STA5328_Ramin_Shamshiri_HW3

Page 3 of 6

Ramin Shamshiri STA 5328, HW #3 Due 07/20/09

8.24. In a study of the relationship between birth order and college success, an investigator found that 126 in a

sample of 180 college graduates were first born or only children; in a sample of 100 nongraduates of comparable age and socioeconomic background, the number of firstborn or only children was 54. Estimate the

difference in the proportions of first-born or only children for the two populations from which these samples

were drawn. Give a bound for the error of estimation.

Solution:

𝑛1 = 180, π‘Œ1 = 126

𝑛2 = 100, π‘Œ2 = 54

𝑝 1 βˆ’ 𝑝 2 =?

𝑝 1 =π‘Œ1

𝑛1=

126

180= 0.7

𝑝 2 =π‘Œ2

𝑛2=

54

100= 0.54

Estimating the difference in the proportions:

𝑝 1 βˆ’ 𝑝 2 = 0.7βˆ’ 54 = 𝟎.πŸπŸ”

The bound error estimation using probability of 95%:

2πœŽπ‘ 1βˆ’π‘ 2 = 2 𝑝1(1βˆ’ 𝑝1)

𝑛1+𝑝2(1βˆ’ 𝑝2)

𝑛2= 2

0.7(0.3)

180+

0.54(0.44)

10= 2 0.6042 = 𝟎.πŸπŸπŸŽπŸ–πŸ’

β– 

Page 4: STA5328_Ramin_Shamshiri_HW3

Page 4 of 6

Ramin Shamshiri STA 5328, HW #3 Due 07/20/09

8.76. Do SAT scores for high school students fifer depending on the student’s intended field of study? Fifteen

students who intended to major in engineering were compared with 15 students who intended to major language and literature. Given in the accompanying table are the means and standard deviations of the scores on the

verbal and mathematics portion of the SAT for the two groups of students.

Verbal Math

Engineering 𝑦 = 446 s=42 𝑦 = 548 s=57

Language/Literature 𝑦 = 534 s=45 𝑦 = 517 s=52

a. Construct a 95% confidence interval for the difference in average verbal scores of students majoring in engineering and of those majoring in language/literature.

b. Construct a 95% confidence interval for the difference in average math scores of students majoring in

engineering and of those majoring in language/literature. c. Interpret the results obtained in (a) and (b).

d. What assumptions are necessary for the methods used previously to be valid?

Solution a:

𝑛1 = 15 and 𝑛2 = 15, 𝑠1 = 42 and 𝑠2 = 45, π‘Œ 1 = 446 and π‘Œ 2 = 534

𝑠𝑝 = 𝑛1 βˆ’ 1 𝑠1

2 + 𝑛2 βˆ’ 1 𝑠22

𝑛1 + 𝑛2 βˆ’ 2 =

14 42 2 + 14 45 2

28 = 43.525

95% confidence interval, (using t-table to find 𝑑(0.025 ,𝑑𝑓=28))

π‘Œ 1 βˆ’ π‘Œ 2 Β± 𝑑(𝛼/2,𝑑𝑓=28)𝑠𝑝 1

𝑛1+

1

𝑛2 = 446 βˆ’ 534 Β± 2.048 43.525

1

15+

1

15= βˆ’πŸ–πŸ–Β± πŸ‘πŸ.πŸ“πŸ’

= [βˆ’πŸπŸπŸŽ.πŸ“πŸ’,βˆ’πŸ“πŸ“.πŸ’πŸ”] Solution b:

𝑛1 = 15 and 𝑛2 = 15, 𝑠1 = 57 and 𝑠2 = 52, π‘Œ 1 = 548 and π‘Œ 2 = 517

𝑠𝑝 = 𝑛1 βˆ’ 1 𝑠1

2 + 𝑛2 βˆ’ 1 𝑠22

𝑛1 + 𝑛2 βˆ’ 2 =

14 57 2 + 14 52 2

28 = 54.557

95% confidence interval, (using t-table to find 𝑑(0.025 ,𝑑𝑓=28))

π‘Œ 1 βˆ’ π‘Œ 2 Β± 𝑑(𝛼/2,𝑑𝑓=28)𝑠𝑝 1

𝑛1+

1

𝑛2= 548 βˆ’ 517 Β± 2.048 54.557

1

15+

1

15= πŸ‘πŸΒ± πŸ’πŸŽ.πŸ•πŸ—πŸ•

= [βˆ’πŸ—.πŸ•πŸ—πŸ•,πŸ•πŸ.πŸ•πŸ—πŸ•] Solution c: It means that we are 95% confidence that the difference in average verbal scores of students

majoring in engineering and of those majoring in language/literature is in the interval = βˆ’120.54,βˆ’55.46 . Since this interval contains only negative values, we can claim that it appears to be difference in the two mean verbal scores achieved by engineering and language students.

Since the seconds interval, = [βˆ’9.797,71.797], contains both positive and negative values, we cannot claim

that there is difference in average math scores of engineering students and language students. In the other

words, we do not have 95% confidence to claim which group has a larger mean.

Solution d:

Samples are independent

Equal variance assumption (𝜎12 = 𝜎2

2)

Page 5: STA5328_Ramin_Shamshiri_HW3

Page 5 of 6

Ramin Shamshiri STA 5328, HW #3 Due 07/20/09

8.79. A factory operates with two machines of type A and one machine of type B. The weekly repair costs X for

type A machine are normally distributed with mean πœ‡1 and variance 𝜎2. The weekly repair costs Y for machines

of type B are also normally distributed but with mean πœ‡2 and variance 3𝜎2. The expected repair cost per week

for the factory is thus 2πœ‡1 + πœ‡2. If you are given a random sample X1,X2,…,Xn on costs of type A machines and

an independent random sample Y1, Y2, …,Ym on costs for type B machines, show how you would construct a

95% confidence interval for 2πœ‡1 + πœ‡2:

a. If 𝜎2 is known.

b. If 𝜎2 is not known.

Solution a:

Let πœƒ = 2πœ‡1 + πœ‡2 and πœƒ = 2𝑋 + π‘Œ The 95% Confidence interval for πœƒ = 2πœ‡1 + πœ‡2 is:

πœƒ βˆ’ 𝑍𝛼2

. (πœŽπœƒ ) < πœƒ < πœƒ + 𝑍𝛼2

. (πœŽπœƒ )

Where πœŽπœƒ = 𝜎(2𝑋 +π‘Œ ) = 4πœŽπ‘‹ + πœŽπ‘Œ = 4𝜎2

𝑛+

3𝜎2

π‘š and 𝑍𝛼

2= 𝑍0.25 = 1.96. Therefore:

2𝑋 + π‘Œ βˆ’ 1.96. 4𝜎2

𝑛+

3𝜎2

π‘š < πœƒ < (2𝑋 + π‘Œ ) + 1.96. (

4𝜎2

𝑛+

3𝜎2

π‘š)

πŸπ‘Ώ + 𝒀 βˆ’ 𝟏.πŸ—πŸ”πˆ πŸ’

𝒏+πŸ‘

π’Ž< 𝟐𝝁𝟏 + 𝝁𝟐 < (2𝑿 + 𝒀 ) + 𝟏.πŸ—πŸ”πˆ

πŸ’

𝒏+πŸ‘

π’Ž

β– 

Solution b:

When 𝜎2 is not known, we should use pooled variance and t-distribution.

Pooled variance, 𝑆𝑝 is calculated as follow:

𝑆𝑝 = 𝑛 βˆ’ 1 𝑠1

2 + π‘š βˆ’ 1 𝑠22

𝑛 + π‘šβˆ’ 2

(2𝑋 + π‘Œ ) Β± 𝑑𝛼/2. 𝑆𝑝 4

𝑛+

3

π‘š

In order to make an inference about the population variance 𝜎2 based on a random sample Y1, Y2, …,Ym from a

normal population, a good estimator of 𝜎2 is the sample variance.

𝑆12 =

1

𝑛 βˆ’ 1 𝑋𝑖 βˆ’ 𝑋

2

𝑛

𝑖=1

𝑆22 =

1π‘šβˆ’ 1

π‘Œπ‘– βˆ’ π‘Œ 2𝑛

𝑖=1

3

𝑆𝑝2 =

𝑋𝑖 βˆ’ 𝑋 2𝑛

𝑖=1 + π‘Œπ‘– βˆ’ π‘Œ

2𝑛𝑖=1

3𝑛 + π‘šβˆ’ 2

πŸπ‘Ώ + 𝒀 Β± 𝟏.πŸ—πŸ” π‘Ώπ’Š βˆ’π‘Ώ 𝟐

π’π’Š=𝟏 +

π’€π’Š βˆ’ 𝒀 πŸπ’Žπ’Š=𝟏

πŸ‘π’ +π’Žβˆ’πŸ

πŸ’

𝒏+πŸ‘

π’Ž

β– 

Page 6: STA5328_Ramin_Shamshiri_HW3

Page 6 of 6

Ramin Shamshiri STA 5328, HW #3 Due 07/20/09

6. Suppose on the average I will receive one phone call per evening. Show that under reasonable assumptions

that the probability y that I will receive exact one phone call tonight is e-1

Solution