Download ppt - 5.5.3 Means and Variances for Linear Combinations

5.5.3 Means and Variances for Linear Combinations

What is called a linear combination of random variables?

aX+b, aX+bY+c, etc. 2X+3 5X+9Y+10

If a and b are constants, then E(aX+b)=aE(X)+b.__________________________

• Corollary 1. E(b)=b. • Corollary 2. E(aX)=aE(X).

Let 𝑈= 𝑎0 + 𝑎1𝑋+ 𝑎2𝑌+ 𝑎3𝑍+ ⋯

Means of Linear Combinations

𝐸ሺ𝑈ሻ= 𝑎0 + 𝑎1𝐸𝑋+ 𝑎2𝐸𝑌+ 𝑎3𝐸𝑍+ ⋯

Example X, Y, Z = values of three dice You win 𝑈= 5+ 2𝑋+ 3𝑌+ 4𝑍 dollars 𝐸ሺ𝑋ሻ= 𝐸ሺ𝑌ሻ= 𝐸ሺ𝑍ሻ= 3.5 Your total expected winnings are 𝐸ሺ𝑈ሻ= 5+ 2ሺ3.5ሻ+ 3ሺ3.5ሻ+ 4ሺ3.5ሻ= 36.50

Expected value (constant) = constant 𝐸ሺ5ሻ= 5

𝐸ሺ∑ሻ= ∑ሺ𝑒𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑣𝑎𝑙𝑢𝑒𝑠ሻ 𝐸ሺ5+ 3𝑋+ 4𝑌+ 5𝑍ሻ= 𝐸ሺ5ሻ+ 𝐸ሺ3𝑋ሻ+ 𝐸ሺ4𝑌ሻ+ 𝐸ሺ5𝑍ሻ 𝐸ሺ𝑐𝑜𝑛𝑠𝑡𝑎𝑛𝑡∙𝑋ሻ= 𝑐𝑜𝑛𝑠𝑡𝑎𝑛𝑡 ∙𝐸𝑋 𝐸ሺ3𝑋ሻ= 3𝐸ሺ𝑋ሻ 𝐸ሺ4𝑌ሻ= 4𝐸ሺ𝑌ሻ Eሺ5𝑍ሻ= 5𝐸ሺ𝑍ሻ 𝐸ሺ𝑎0 + 𝑎1𝑋+ 𝑎2𝑌+ 𝑎3𝑍ሻ = 𝐸ሺ𝑎0ሻ+ 𝐸ሺ𝑎1𝑋ሻ+ 𝐸ሺ𝑎2𝑌ሻ+ 𝐸ሺ𝑎3𝑍ሻ = 𝑎0 + 𝑎1𝐸𝑋+ 𝑎2𝐸𝑌+ 𝑎3𝐸𝑍

Variances of Linear Combinations

𝑉𝑎𝑟ሺ𝑎0 + 𝑎1𝑋+ 𝑎2𝑌+ 𝑎3𝑍ሻ = 𝑎12𝑉𝑎𝑟ሺ𝑋ሻ+ 𝑎22𝑉𝑎𝑟ሺ𝑌ሻ+ 𝑎32𝑉𝑎𝑟ሺ𝑍ሻ when X, Y and Z are independent.

𝑉𝑎𝑟ሺ𝑎0ሻ= 0 𝑉𝑎𝑟ሺ𝑐𝑜𝑛𝑠𝑡𝑎𝑛𝑡ሻ= 0

𝑉𝑎𝑟ሺ𝑎0 + 𝑋ሻ= 𝑉𝑎𝑟ሺ𝑋ሻ Adding a constant doesn’t change variance

𝑉𝑎𝑟ሺ𝑎1𝑋ሻ= 𝑎12𝑉𝑎𝑟𝑋 Variances are in units2.

𝑆𝑡.𝐷𝑒𝑣ሺ𝑎1𝑋ሻ= ȁ&𝑎1ȁ& 𝑆𝑡.𝐷𝑒𝑣ሺ𝑋ሻ

If X, Y, Z are independent or uncorrelated. 𝑉𝑎𝑟ሺ𝑋+ 𝑌+ 𝑍ሻ= 𝑉𝑎𝑟ሺ𝑋ሻ+ 𝑉𝑎𝑟ሺ𝑌ሻ+ 𝑉𝑎𝑟ሺ𝑍ሻ Putting these facts together 𝑉𝑎𝑟ሺ𝑎0 + 𝑎1𝑋+ 𝑎2𝑌+ 𝑎3𝑍ሻ = ሺif independentሻ 𝑉𝑎𝑟ሺ𝑎0ሻ+ 𝑉𝑎𝑟ሺ𝑎1𝑋ሻ+ 𝑉𝑎𝑟ሺ𝑎2𝑌ሻ+ 𝑉𝑎𝑟ሺ𝑎3𝑍ሻ = 𝑎12𝑉𝑎𝑟ሺ𝑋ሻ+ 𝑎22𝑉𝑎𝑟ሺ𝑌ሻ+ 𝑎32𝑉𝑎𝑟ሺ𝑍ሻ For independent X, Y

𝜎𝑋+𝑌 = ඥ𝑉𝑎𝑟ሺ𝑋ሻ+ 𝑉𝑎𝑟ሺ𝑌ሻ

𝜎𝑋+𝑌2 = 𝜎𝑋2 + 𝜎𝑌2

𝜎𝑋+𝑌 = ට𝜎𝑋2 + 𝜎𝑌2

Like Pythagorean Theorem 𝑐2 = ȁ2 + b2, 𝑐= ξ𝑎2 + 𝑏2

Most useful facts:

If X1, X2, X3, … , Xn are independent measurements each with 𝐸(𝑋1) = 𝜇 and 𝑉𝑎𝑟ሺ𝑥𝑖ሻ= 𝜎2 then

𝐸ሺ𝑋തሻ= 𝜇 𝑉𝑎𝑟ሺ𝑋തሻ= 𝜎2𝑛 𝑆𝑡.𝐷𝑒𝑣ሺ𝑋തሻ= 𝜎ξ𝑛

To see this:

𝐸൬1𝑛 ∑ 𝑋𝑖൰= 1𝑛 𝐸ሺ∑ 𝑋𝑖ሻ= 1𝑛 ∑ 𝐸ሺ𝑋𝑖ሻ= 1𝑛 𝜇𝑛

𝑖=1 = 1𝑛ሺ𝑛𝜇ሻ= 𝜇

The sample mean 𝑋ത is an unbiased estimator of the population mean 𝜇.

𝑉𝑎𝑟൬1𝑛 ∑ 𝑋𝑖൰= ൬1𝑛൰2 𝑉𝑎𝑟ሺ∑ 𝑋𝑖ሻ= 1𝑛2 𝑉𝑎𝑟ሺ𝑋𝑖ሻ= 1𝑛2 𝜎2𝑛

𝑖=1 = 1𝑛2ሺ𝑛𝜎2ሻ= 𝜎2𝑛

5.5.5 The Central Limit Effect

• If X1, X2,…, Xn are independent random variables with mean m and variance s2, then for large n, the variable is approximately normally distributed.

If a population has the N(m,s) distribution, then the sample mean x of n independent observations has the distribution.

For ANY population with mean m and standard deviation s the sample mean of a random sample of size n is approximately

when n is LARGE.( , )Nn

sm

( , )Nn

sm

x

Central Limit Theorem

If the population is normal or sample size is large, mean follows a normal distribution

and

follows a standard normal distribution.

x( , )N

ns

m

n

xxzx

x

sm

sm

• The closer x’s distribution is to a normal distribution, the smaller n can be and have the sample mean nearly normal.

• If x is normal, then the sample mean is normal for any n, even n=1.

• Usually n=30 is big enough so that the sample mean is approximately normal unless the distribution of x is very asymmetrical.

• If x is not normal, there are often better procedures than using a normal approximation, but we won’t study those options.

• Even if the underlying distribution is not normal, probabilities involving means can be approximated with the normal table.

• HOWEVER, if transformation, e.g. , makes the distribution more normal or if another distribution, e.g. Weibull, fits better, then relying on the Central Limit Theorem, a normal approximation is less precise. We will do a better job if we use the more precise method.

If X1, X2, … , Xn are normal, then 𝑋ത is exactly normal.

Example: Bags of potatoes weigh

𝜇= 5.0 pounds

𝜎= 0.1 pounds

If we buy 4 bags, what is Pሺ𝑋ത< 4.9ሻ?

𝐸ሺ𝑋തሻ= 𝜇= 5 ඥ𝑉𝑎𝑟ሺ𝑋തሻ= 𝜎ξ𝑛 = 0.1ξ4 = 0.05

Pሺ𝑋ത< 4.9ሻ= Pቆ𝑋ത− 𝐸𝑋തඥVȁrሺXഥሻ< 4.9− 5.00.05 ቇ= Pሺ𝑧< −2ሻ= 0.0228

Example• X=ball bearing diameter• X is normal distributed with m=1.0cm and s=0.02cm• =mean diameter of n=25• Find out what is the probability that will be off by

less than 0.01 from the true population mean.

%76.98)5.25.2(

)004.0

0.101.1004.0

0.199.0()01.199.0(

004.02502.0

0.1

zP

zPxP

nx

x

ss

m

xx

Exercise

The mean of a random sample of size n=100 is going to be used to estimate the mean daily milk production of a very large herd of dairy cows. Given that the standard deviation of the population to be sampled is s=3.6 quarts, what can we assert about he probabilities that the error of this estimate will be more then 0.72 quart?