15
Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical significance ≠ substantive significance. The p value does not measure strength of relationship! So, how do we tell how strong a relationship is? Is the curiosity killing you?

Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Embed Size (px)

Citation preview

Page 1: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Overview

• Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other.

• But statistical significance ≠ substantive significance.

• The p value does not measure strength of relationship!

• So, how do we tell how strong a relationship is?

• Is the curiosity killing you?

Page 2: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Measures of Association(for nominal and ordinal variables)

• The Proportional Reduction in Error (PRE) Approach– How much better can we

predict the dependent variable by knowing the independent variable?

• An example– How do you feel about school

vouchers? (NES 2000)– Knowing only a d.v., how do

you predict an outcome?– Mode = 238– How many errors? # mis-

predictions. – E1 = 84 + 65 + 194 = 343

1. Favor strongly 2. Favor not strongly

4. Oppose not strongly

5. Oppose strongly

School vouchers

0

50

100

150

200

250

Fre

qu

ency

School vouchers

Page 3: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

A simple PRE example• Now suppose you have knowledge about how a nominal independent

variable relates to the dependent variable. Say, religion…

School vouchers Protestant Jewish Catholic Total

Favor strongly 55 (29.9%) 85 (39.5%) 98 (53.8%) 238 (41.0%)

Favor not strongly 22 (12.0%) 32 (14.9%) 30 (16.5) 84 (14.5)

Oppose not strongly 29 (15.8%) 22 (10.2%) 14 (7.7%) 65 (11.2%)

Oppose strongly 78 (42.2%) 76 (35.3%) 40 (22.0%) 194 (33.4%)

Total 184 (99.9%) 215 (99.9%) 182 (100.0%) 581 (100.1%)

•Now, choose the mode of each independent variable category. 78 + 85 + 98 = 261 correct predictions E2 = 320 BTW: χ2 = 31.7

Page 4: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

A simple PRE example (cont.)

• Proportional reduction in error = (E1 – E2)/E1

= (343 – 320)/343 = .067 • We call this a 6.7% reduction in error…

• This calculation is aka “Lambda.”

Note – Lambda can be used when one or both variables are nominal. It bombs when one dv category has a preponderance of the observations (Cramér’s V is useful then).

Page 5: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

PRE with two ordinal variables

• When both variables are ordinal, you have many options for measuring the strength of a relationship

• Gamma, Kendall’s tau-b, Kendall’s tau-c, etc.

• Choices, choices, choices…

Page 6: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Biblical Literalism and Education

• Is the Bible the word of God or of men? (NES 2000)

• Chi-sq = 105.4 at 4 df p = .000 reject the null hypothesis

Is the Bible the word of God or man? * Education: 3 categories Crosstabulation

96 230 274 600

56.1% 46.2% 26.2% 35.0%

58 227 583 868

33.9% 45.6% 55.7% 50.6%

17 41 189 247

9.9% 8.2% 18.1% 14.4%

171 498 1046 1715

100.0% 100.0% 100.0% 100.0%

Count

% within Education:3 categories

Count

% within Education:3 categories

Count

% within Education:3 categories

Count

% within Education:3 categories

God's word, literal

God's word, not literal

Man's word

Is the Bible theword of God orman?

Total

1. Lessthan HS 2. HS

3. Morethan HS

Education: 3 categories

Total

Page 7: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Gamma, Tau-b, Tau-c…

Symmetric Measures

.222 .022 10.099 .000

.188 .019 10.099 .000

.383 .036 10.099 .000

1715

Kendall's tau-b

Kendall's tau-c

Gamma

Ordinal byOrdinal

N of Valid Cases

ValueAsymp.

Std. Errora

Approx. Tb

Approx. Sig.

Not assuming the null hypothesis.a.

Using the asymptotic standard error assuming the null hypothesis.b.

So our independent variable, education, reduces our error in predicting Biblical literalism by either

22.2% (tau-b),

18.8% (tau-c) or

38.3 whopping % (gamma)

And, SPSS reports sign. level, but let me come back to that later.

Page 8: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Either of these might be considered a perfect relationship, depending on one’s reasoning about what relationships between variables look like.

• Why are there multiple measures of association?• Statisticians over the years have thought of

varying ways of characterizing what a perfect relationship is:

tau-b = 1, gamma = 1 tau-b <1, gamma = 1

55

10 25

3 7 30

55

35

40

Page 9: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

I’m so confused!!

Page 10: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Rule of Thumb

• Gamma tends to overestimate strength but gives an idea of upper boundary.

• If table is square use tau-b; if rectangular, use tau-c.

• Pollock (and we agree):

τ <.1 is weak; .1<τ<.2 is moderate; .2<τ<.3 moderately strong; .3< τ<1 strong.

Page 11: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

A last example

• Theory: People’s partisanship leads them to develop distinct ideas about public policies.

• A case in point: Dem’s, Ind’s, and Rep’s develop different ideas about whether immigration should be increased, kept the same, or decreased

Page 12: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Last example (cont.)

• Specifically, Dem’s have tended to favor minorities and those with less power. Therefore, I anticipate that Dem’s will be most in favor of increasing immigration, Rep’s will be most in favor of decreasing it.

• Let’s test this out using NES data.

Page 13: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Increase/decrease immigration * Party ID: 3 categories Crosstabulation

% within Party ID: 3 categories

4.2% 4.3% 2.5% 3.8%

6.0% 6.0% 5.1% 5.8%

47.3% 44.6% 44.4% 45.5%

13.8% 15.1% 16.4% 15.0%

28.7% 30.0% 31.5% 29.9%

100.0% 100.0% 100.0% 100.0%

1. Increased a lot

2. Increased a little

3. Left the same

4. Decreased a little

5. Decreased a lot

Increase/decreaseimmigration

Total

1. Democrat2.

independent 3. Republican

Party ID: 3 categories

Total

Symmetric Measures

.037 .021 1.749 .080

.037 .021 1.749 .080

.055 .032 1.749 .080

1708

Kendall's tau-b

Kendall's tau-c

Gamma

Ordinal byOrdinal

N of Valid Cases

ValueAsymp.

Std. Errora

Approx. Tb

Approx. Sig.

Not assuming the null hypothesis.a.

Using the asymptotic standard error assuming the null hypothesis.b.

Page 14: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical
Page 15: Overview Chi-square showed us how to determine whether two (nominal or ordinal) variables are statistically significantly related to each other. But statistical

Last example (cont.)

• Conclusion There is a relationship between partisan- ship and feelings about immigration— i.e., what we saw in the table is not a result of chance. The relationship is weak (tau-c = .04). Dem’s are only a little more likely to favor immigration, Reps only a little more likely to oppose it.