26
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Embed Size (px)

Citation preview

Page 1: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Stats Lunch: Day 3

The Basis of Hypothesis Testing w/ Parametric

Statistics

Page 2: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Back to the Normal Curve…

Page 3: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

The Central Limit Theorem in Nature

Page 4: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Samples vs. Populations

-The Central Limit Theorem says that if we have lots of people (scores), their distribution will approximate a normal curve

-However, it is often impractical or too expensive to run studies with lots of subjects…

-Population: Entire group of people (animals, etc.) that you are interested in (Ex: How ALL Americans will vote in election)

-Sample: The group of people that you actually study (Ex: the 100 people that you call on the phone to ask them how they’ll vote)

Page 5: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Samples vs. Populations

-We assume that our sample is REPRESENTATIVE of the population we’re interested in…

Random Selection: start with a list of entire population and randomly select people to study. This assures that sample represents population

Haphazard Selection: Taking whoever you can get...

So, our samples are often not truly RANDOM and therefore can have a (very) different mean and standard deviation than the true population:

-Our sample is less likely to resemble a true normal curve

-It is more difficult to estimate what percentage of scores fall above or below a given score...

Page 6: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

The Central Limit Theorem in Practice

Page 7: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Population Parameters vs. Statistics

-Although we want to, we almost never know the TRUE mean and standard deviation of a population

-Thus, we don’t know the “Population Parameters”: Mean, Variance and SD of a population

= population mean

2 = population variance, = population SD

-Samples ESTIMATE a population

-We call the mean (M), Variance (SD2), and Standard Deviation (SD) of a sample “Statistics”

Page 8: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Introduction to Hypothesis Testing

“All I know is I don’t know nothing”

If you don’t know “nothing”, then you must know something…

Logic of Hypothesis Testing (w/ N=1) We manipulate an independent variable (Ex: Let baby

listen to Mozart)

We obtain a sample score (Ex: IQ = 140 ; Z=2.67) Using what we know about the normal curve for that

variable’s distribution:

-We determine how likely it is that we got that sample score just by chance (in other words, how likely is it we got a score this high if the music had no effect)

Page 9: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

More than 99% of scores fall below Z =2.67

Z= 2.67

Introduction to Hypothesis Testing

Page 10: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

-It’s VERY unlikely (but possible) that we got a score (Z=2.67) this BIG just by chance

-With a score this high, there is less than a 2% chance that this score would occur if the drug had no effect

-We can reject the idea that the drug has no effect. Therefore, we accept the idea that it does have an effect.

“All I know is I don’t know nothing”…

We can only show how unlikely it is to get a score if our manipulation really had no effect (by seeing where our score falls on the normal distribution for that variable).

Introduction to Hypothesis Testing

Page 11: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

-It’s VERY unlikely (but possible) that we got a score (Z=2.67) this BIG just by chance

-With a score this high, there is less than a 2% chance that this score would occur if the drug had no effect

-We can reject the idea that the drug has no effect. Therefore, we accept the idea that it does have an effect.

“All I know is I don’t know nothing”…

We can only show how unlikely it is to get a score if our manipulation really had no effect (by seeing where our score falls on the comparison distribution for that variable)

Page 12: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Steps of Hypothesis Testing

1. Restate your question as a research hypothesis and a null hypothesis about the populations

2. Determine the characteristics of the comparison distribution

3. Determine the cutoff sample score on the comparison distribution at which the null hypothesis could be rejected

4. Determine your sample’s score on the comparison distribution

5. Decide whether to reject the null hypothesis

Page 13: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Experimental Group: Baby MozartRest of Population: All the other babies

Step 1. Create a Null Hypothesis and Research (Alternate) Hypothesis (about the population)

-Null Hypothesis (H0): Your manipulation has no effect (µexperimental group = µrest of population) -Research Hypothesis (H1): Your manipulation has some effect (µexperimental group > µrest of population)

The null is the exact opposite of your research hypothesis.

Statistically, we can only test how likely it is we got a score so big (or small) if the null is true.Thus, we try to disprove, or reject, the null hypothesis.

Page 14: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Ex: Null: Listening to Mozart DOES NOT increase babies’ IQResearch: Listening to Mozart does increase babies’ IQ

Step 2. Determine the characteristics of your Comparison Distribution (Which, in this case is the normal curve)

Comparison Distribution: What the distribution would look like if the null hypothesis were true. We compare our sample scores to this distribution.

-What score would we expect to get if the null were true?

-If null is true, then our experimental group and the rest of the population would look the same. The distributions would be equal.

Ex: in this case, if the null is true both groups would have a mean IQ of 100

Page 15: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Step 3. Determine when to reject null hypothesis

Cutoff Sample Score (Critical Value): How big will your sample score have to be before you reject the null?

In other words, the probability (2.5%, 5%, etc.) that you’re willing to take that the null hypothesis is really TRUE, even though you say it isn’t

This is most commonly called “alpha level” or “p”.

Ex: Cutoff = .05 -the point (Z score) on the normal curve where 95% of scores fall below it

95%

Page 16: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Step 5. Deciding to Reject Null or Not

If your sample score is larger than your cutoff sample score (critical value), you can reject the null

You have reached statistical significance

However, you can never ACCEPT the null.

If your sample score is SMALLER than your cutoff sample score (critical value), all you can say is that you “failed to reject the null”

-So, you never really “prove” anything…

-All you can say is how unlikely it is that what you found would occur randomly (your manipulation had no effect)

Page 17: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Directional (One-Tailed) vs. Nondirectional (Two-Tailed) Hypotheses

Directional Tests: Expect that your sample score will be bigger (or smaller) than the mean of your comparison distribution

Reject Null if your sample score falls in the top (or bottom…depending on your hypothesis) 5% of scores in the comparison distribution.

But: can only reject null if your data are in the direction you predicted…

Sample Falls above cutoff:

REJECT NULL

Sample Falls below cutoff:

FAIL TO REJECT NULL

Page 18: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Nondirectional Tests: Expect that your sample score will be different than the mean of your comparison distribution (either bigger or smaller)

Reject Null if your sample score falls in the top 2.5% OR bottom 2.5% of comparison distribution.

Harder to reject null.

Reject RejectFail to reject

null

Page 19: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Because, in the real world, we don’t do hypothesis testing w/ a N=1...

We aren’t interested in comparing one score to a distribution of individual scores

Instead, we are interested in means of samples, not individual scores (our N > 1)

Need a new comparison distribution (can’t compare the mean of a sample to a distribution of individual scores)

Distribution of Means: Comparison distribution composed of the MEANS of a large number of samples

-get a bunch of samples from the same population, and plot their means

This is also called a “Sampling Distribution of Means”

Page 20: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

(Sampling) Distribution of Means

-We can’t compare the mean of a sample (the group of subjects we’re studying) with a comparison distribution composed of individual scores.

-Instead we want to compare it to a distribution made up of a bunch of means from samples...

120, 110, 85, 100, 80, 100, 95, 105, 101, 99, 95, 107

120, 80, 100

M = 100

110, 80, 100

M = 96.67

107, 100, 101

M = 102.67 . . .

Page 21: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Characteristics of a Distribution of Means:

1. Mean will be close to the population mean (M = )

2. Shape of the Distribution will be approx. normal (N > 30)3. The spread of the distribution (i.e., the variance) is SMALLER than the population distribution

Calculating the Variance of a Distribution of Means:

-It is unlikely that more than one extreme score will be selected in each sample…-Thus, it is unlikely that a sample mean will be extreme-So, the variance for the Distribution of Means will be smaller than the population’s

-This is particularly true with a large N (sample size)

-The larger the N, the smaller the variance of a dist of means...

Page 22: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Mathematically it looks like this:

Population Variance / N (or… σ2/N)

Called the “Variance of Distributions of Means”

σ2M = σ2/N

IQ Example (Pop Variance=225, N=50)

σ2M = 225/50 = 4.5

Standard Deviation of Dist of Means (Standard Error)

σM = √ σ2M = √4.5 = 2.12

Page 23: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Estimating the Population Mean...

-We can never know the TRUE population mean

-However, we can estimate it from what we know about the sample mean and the Standard Error

-Point Estimate: when we estimate a particular population parameter

-The best point estimate of is M

-But how accurate is it going to be?

-Better off to estimate using a range of possible scores (some above and some below M)

-Interval Estimate

Page 24: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Confidence Intervals

We are 95% (or 99%…or 50%, etc.) confident that the population mean falls between Score A and Score B

Page 25: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

Figuring Confidence Intervals:

1. Find Standard Error (2.12)

2. Figure the Raw Scores above and Below M

-For 95%, figure raw scores +/- 1.96 Standard Error from M

-For 99%, figure raw score +/- 2.57 Standard Error from M

Standard Error for Baby Mozart = 2.12, M = 130

Interval = M +/- (Standard Error * Z cutoff)

SE * Z cutoff = (2.12 * 1.96) = 4.16

Interval = 130 +/- 4.16

130 - 4.16 = 125.84

130 + 4.16 = 134.16

95% Interval Example

Page 26: Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics

So, we are 95% confident that the population mean falls between 125.84 and 134.16

99% Interval Example

Standard Error for Baby Mozart = 2.12, M = 130

SE * Z cutoff = (2.12 * 2.57) = 5.45

Interval = 130 +/- 5.45

130 - 5.45 = 124.55

130 + 5.45 = 135.45

So, we are 99% confident that the population mean falls between 124.55 and 135.45