16
Regression Discontinuity Designs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL I NFERENCE Fall 2019 Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 1 / 16

Regression Discontinuity Designs - Harvard University

  • Upload
    others

  • View
    10

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Regression Discontinuity Designs - Harvard University

Regression Discontinuity Designs

Kosuke Imai

Harvard University

STAT186/GOV2002 CAUSAL INFERENCE

Fall 2019

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 1 / 16

Page 2: Regression Discontinuity Designs - Harvard University

Observational Studies

In many cases, we cannot randomize the treatment assignmentethical constraintslogistical constraints

But, some important questions demand empirical evidence eventhough we cannot conduct randomized experiments!

Designing observational studies find a setting where crediblecausal inference is possibleKey = Knowledge of treatment assignment mechanism

Regression discontinuiety design (RD Design):1 Sharp RD Design: treatment assignment is based on a

deterministic rule2 Fuzzy RD Design: encouragement to receive treatment is based on

a deterministic rule

Originates from a study of the effect of scholarships on students’career plans (Thistlethwaite and Campbell. 1960. J. of Educ. Psychol)

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 2 / 16

Page 3: Regression Discontinuity Designs - Harvard University

Regression Discontinuity Design

Idea: Find an arbitrary cutpoint c which determines the treatmentassignment such that Ti = 1{Xi ≥ c}Close elections as RD design (Lee et al. 2004. Q. J. Econ):

be a continuous and smooth function of vote shares everywhere,except at the threshold that determines party membership. Thereis a large discontinuous jump in ADA scores at the 50 percentthreshold. Compare districts where the Democrat candidatebarely lost in period t (for example, vote share is 49.5 percent),with districts where the Democrat candidate barely won (forexample, vote share is 50.5 percent). If the regression disconti-nuity design is valid, the two groups of districts should appear exante similar in every respect—on average. The difference will bethat in one group, the Democrats will be the incumbent for thenext election (t � 1), and in the other it will be the Republicans.Districts where the Democrats are the incumbent party for elec-tion t � 1 elect representatives who have much higher ADAscores, compared with districts where the Republican candidate

FIGURE ITotal Effect of Initial Win on Future ADA Scores:

This figure plots ADA scores after the election at time t � 1 against theDemocrat vote share, time t. Each circle is the average ADA score within 0.01intervals of the Democrat vote share. Solid lines are fitted values from fourth-order polynomial regressions on either side of the discontinuity. Dotted lines arepointwise 95 percent confidence intervals. The discontinuity gap estimates

� 0�P*t �1D � P*t �1

R � 1�P*t �1D � P*t �1

R .

“Affect” “Elect”

828 QUARTERLY JOURNAL OF ECONOMICS

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 3 / 16

Page 4: Regression Discontinuity Designs - Harvard University

Identification

Estimand:E(Yi(1)− Yi(0) | Xi = c)

Assumption: E(Yi(t) | Xi = x) is continuous in x for t = 0,1deterministic rather than stochastic treatment assignmentviolation of the overlap assumption: 0 < Pr(Ti | Xi = x) < 1 for all xRD design is all about extrapolation

Regression modeling:

E(Yi(1) | Xi = c) = limx↓c

E(Yi(1) | Xi = x) = limx↓c

E(Yi | Xi = x)

E(Yi(0) | Xi = c) = limx↑c

E(Yi(0) | Xi = x) = limx↑c

E(Yi | Xi = x)

Advantage: internal validityDisadvantage: external validityMake sure nothing else is going on at Xi = c

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 4 / 16

Page 5: Regression Discontinuity Designs - Harvard University

Analysis Methods under the RD Design

Simple linear regression within a windowHow should we choose a window in a principled manner?How should we relax the functional form assumption?higher-order polynomial regression not robust to outliers

Local linear regression (same h for both sides): better behavior atthe boundary than other nonparametric regressions

(α̂+, β̂+) = argminα,β

n∑i=1

1{Xi > c}{Yi − α− (Xi − c)β}2 · K(

Xi − ch

)

(α̂−, β̂−) = argminα,β

n∑i=1

1{Xi < c}{Yi − α− (Xi − c)β}2 · K(

Xi − ch

)Weighted regression with a kernel function of one’s choice:

uniform kernel: K (u) = 12 1{|u| < 1}

triangular kernel: K (u) = (1− |u|)1{|u| < 1}

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 5 / 16

Page 6: Regression Discontinuity Designs - Harvard University

Optimal Bandwidth (Imbens and Kalyanaraman. 2012. Rev. Econ. Stud.)

Choose the bandwidth by minimizing the MSE:

MSE = E[{(α̂+ − α̂−)− (α+ − α−)}2 | X]= E{(α̂+ − α+)

2 | X}+ E{(α̂− − α−)2 | X}−2 · E(α̂+ − α+ | X) · E(α̂− − α− | X)

= (Bias+ − Bias−)2 + Variance+ + Variance−

Bias and variance of local linear regression estimator at theboundary:

Bias = E(m̂(0) | X)−m(0), Variance = V(m̂(0) | X)

where m(x) = E(Yi | Xi = x), m̂(x) = α̂(x), and

(α̂(x), β̂(x)) = argminα,β

n∑i=1

(Yi − α− β(Xi − x))2 · K(

Xi − xh

)Refinements, e.g., bias correction (Calonico et al. 2014. Econometrica)

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 6 / 16

Page 7: Regression Discontinuity Designs - Harvard University

The “As-if Random” Assumption

RD design does NOT require the local randomization or “as-ifrandom” assumption within a window:

{Yi(1),Yi(0)}⊥⊥Ti | c0 ≤ Xi ≤ c1

The “as-if random” assumption implies zero slope of regressionlines difference-in-means within the windowThe assumption may be violated regardless of the window size

PL19CH20-Imai ARI 16 April 2016 8:58

a Democratic experience advantage b Share of total spending by Democratic candidate

–0.05 0.00 0.05

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Democratic margin

Prop

ortio

n ca

ndid

ates

with

adv

anta

ge

–0.05 0.00 0.05

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Democratic margin

Prop

ortio

n of

tota

l spe

ndin

g

Figure 1The problem of the local randomization assumption. Under the local randomization assumption, also called the as-if-randomassumption, the observations below and above the discontinuity threshold, a [−0.02, 0.02] window indicated by dotted lines in this case,are assumed to be identical on average. As a result, the estimated discontinuity is based on two flat lines with no slope (red dashed lines).In contrast, under the continuity assumption, the association with the forcing variable is not assumed to be absent (blue solid lines). Thetwo plots are based on the dataset on US House elections by Caughey & Sekhon (2011) using two pretreatment covariates: theexperience advantage of the Democratic candidate (a) and the proportion of total donations given to the Democrat (b). They show thatthe local randomization assumption can falsely discover a discontinuity (a) or overestimate one (b).

WHEN IS THE CONTINUITY ASSUMPTION VIOLATED?The discussion so far implies that imbalance in pretreatment covariates just below and above thethreshold does not necessarily imply the violation of the identification assumption for the RDdesign. Under the continuity assumption, such imbalance can exist so long as there is no discon-tinuous jump at the threshold. Lack of discontinuity in pretreatment covariates at the thresholdthen represents empirical evidence for the continuity of the expected potential outcomes so longas all pretreatment covariates relevant for the outcome of interest are measured and analyzed.

If covariate imbalance does not necessarily invalidate the RD design, what are the scenariosunder which the continuity assumption is violated? To answer this question, we must considerthe kind of sorting behavior that pushes would-be barely-losers up above the threshold or movespotential barely-winners down below the threshold.8 Such sorting will lead to a discontinuousjump at the threshold in the conditional expectation function of the potential outcomes. This inturn is likely to manifest as a discontinuous jump in pretreatment covariates, which are associatedwith the outcome.

Following the informative discussion of this issue by Eggers et al. (2015a), we consider two typesof sorting behavior. One is due to pre-election behavior or characteristics of candidates, whereasthe other is owing to postelection advantages in vote tallying, including the ability to engineerelectoral fraud. We argue that although the postelection sorting behavior clearly constitutes aviolation of the continuity assumption, the pre-election behavior may not. The occurrence ofelectoral fraud, for example, implies that a candidate who would have barely lost the election ends

8See Eggers et al. (2015b) for a discussion of sorting behavior when using population thresholds as the discontinuity cutoffs.

www.annualreviews.org • Regression Discontinuity Design 383

Ann

u. R

ev. P

olit.

Sci

. 201

6.19

:375

-396

. Dow

nloa

ded

from

ww

w.a

nnua

lrevi

ews.o

rg A

cces

s pro

vide

d by

Prin

ceto

n U

nive

rsity

Lib

rary

on

05/1

7/16

. For

per

sona

l use

onl

y.

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 7 / 16

Page 8: Regression Discontinuity Designs - Harvard University

Close Elections Controversy (de la Cuesta and Imai. 2016. Annu. Rev.

Political Sci)

Sorting?1 Pre-election

behavior orcharacteristicsof candidates,e.g., resourceadvantages steep slope

2 Post-electionadvantages invote tallying,e.g., electionfraud sorting

PL19C

H20-Im

aiA

RI

16A

pril20168:58

–1.5 –1.0 –0.5 0.0 0.5 1.0 1.5

a Difference-in-means in window

Estimated discontinuity(standard deviation unitsfor nonbinary measures)

Estimated discontinuity(standard deviation unitsfor nonbinary measures)

Estimated discontinuity(standard deviation unitsfor nonbinary measures)

Rep. # prev. terms

Inc.'s D1 nominate

Partisan swing

Rep. experience adv.

Rep. inc. in race

Dem. pres. margin

% Govt. worker

% Voter turnout

% Black

Rep.-held open seat

Dem.-held open seat

Dem. governor

% Foreign born

Open seat

Dem. sec. of state

Dem. inc. in race

Dem. win t − 1

Dem. experience adv.

Dem. margin t − 1

Dem. share t − 1

Dem. # prev. terms

% Urban

Dem. spending %

Dem. donation %

CQ rating

–1.5 –1.0 –0.5 0.0 0.5 1.0 1.5

b Linear regression in window

Inc.'s D1 nominate

Partisan swing

Rep. # prev. terms

Rep. experience adv.

Rep. inc. in race

Dem. governor

Dem. pres. margin

Rep.-held open seat

% Voter turnout

Open seat

Dem.-held open seat

% Black

% Foreign born

% Urban

Dem. sec. of state

Dem. # prev. terms

% Govt. worker

Dem. margin t − 1

Dem. share t − 1

Dem. spending %

Dem. donation %

Dem. inc. in race

Dem. experience adv.

Dem. win t − 1

CQ rating

−1.5 −1.0 −0.5 0.0 0.5 1.0 1.5

c Local linear regression

Rep. # prev. terms

Rep. experience adv.

% Govt. worker

Partisan swing

Inc.'s D1 nominate

Rep. inc. in race

Dem. pres. margin

% Voter turnout

Rep.-held open seat

Dem. sec. of state

Dem. inc. in race

% Foreign born

Dem.-held open seat

Dem. margin t − 1

% Black

Open seat

Dem. governor

Dem. win t − 1

Dem. experience adv.

Dem. share t − 1

% Urban

CQ rating

Dem. # prev. terms

Dem. donation %

Dem. spending %

Figure 2Comparison of estimated discontinuities in pretreatment covariates across three methods. Solid and dashed lines in each panel represent 95% confidence intervals, notcorrected for multiplicity. (a) Filled blue circles represent estimates based on the difference-in-means estimator within the 2-percentage-point window on either side ofthe threshold; open red circles represent estimates within the one-half-percentage-point window. Panel (b) shows the estimates based on the linear regression in the samesets of windows. Panel (c) presents the estimates based on the local linear regression proposed by Calonico et al. (2014). Abbreviations: adv., advantage; CQ, CongressionalQuarterly; Dem., Democratic; govt., government; inc., incumbent; pres., president; prev., previous; Rep., Republican; sec., secretary; t, time period.

388dela

Cuesta·

Imai

Annu. Rev. Polit. Sci. 2016.19:375-396. Downloaded from www.annualreviews.org Access provided by Princeton University Library on 05/17/16. For personal use only.

PL19C

H20-Im

aiA

RI

16A

pril20168:58

–1.5 –1.0 –0.5 0.0 0.5 1.0 1.5

a Difference-in-means in window

Estimated discontinuity(standard deviation unitsfor nonbinary measures)

Estimated discontinuity(standard deviation unitsfor nonbinary measures)

Estimated discontinuity(standard deviation unitsfor nonbinary measures)

Rep. # prev. terms

Inc.'s D1 nominate

Partisan swing

Rep. experience adv.

Rep. inc. in race

Dem. pres. margin

% Govt. worker

% Voter turnout

% Black

Rep.-held open seat

Dem.-held open seat

Dem. governor

% Foreign born

Open seat

Dem. sec. of state

Dem. inc. in race

Dem. win t − 1

Dem. experience adv.

Dem. margin t − 1

Dem. share t − 1

Dem. # prev. terms

% Urban

Dem. spending %

Dem. donation %

CQ rating

–1.5 –1.0 –0.5 0.0 0.5 1.0 1.5

b Linear regression in window

Inc.'s D1 nominate

Partisan swing

Rep. # prev. terms

Rep. experience adv.

Rep. inc. in race

Dem. governor

Dem. pres. margin

Rep.-held open seat

% Voter turnout

Open seat

Dem.-held open seat

% Black

% Foreign born

% Urban

Dem. sec. of state

Dem. # prev. terms

% Govt. worker

Dem. margin t − 1

Dem. share t − 1

Dem. spending %

Dem. donation %

Dem. inc. in race

Dem. experience adv.

Dem. win t − 1

CQ rating

−1.5 −1.0 −0.5 0.0 0.5 1.0 1.5

c Local linear regression

Rep. # prev. terms

Rep. experience adv.

% Govt. worker

Partisan swing

Inc.'s D1 nominate

Rep. inc. in race

Dem. pres. margin

% Voter turnout

Rep.-held open seat

Dem. sec. of state

Dem. inc. in race

% Foreign born

Dem.-held open seat

Dem. margin t − 1

% Black

Open seat

Dem. governor

Dem. win t − 1

Dem. experience adv.

Dem. share t − 1

% Urban

CQ rating

Dem. # prev. terms

Dem. donation %

Dem. spending %

Figure 2Comparison of estimated discontinuities in pretreatment covariates across three methods. Solid and dashed lines in each panel represent 95% confidence intervals, notcorrected for multiplicity. (a) Filled blue circles represent estimates based on the difference-in-means estimator within the 2-percentage-point window on either side ofthe threshold; open red circles represent estimates within the one-half-percentage-point window. Panel (b) shows the estimates based on the linear regression in the samesets of windows. Panel (c) presents the estimates based on the local linear regression proposed by Calonico et al. (2014). Abbreviations: adv., advantage; CQ, CongressionalQuarterly; Dem., Democratic; govt., government; inc., incumbent; pres., president; prev., previous; Rep., Republican; sec., secretary; t, time period.

388dela

Cuesta·

Imai

Annu. Rev. Polit. Sci. 2016.19:375-396. Downloaded from www.annualreviews.org Access provided by Princeton University Library on 05/17/16. For personal use only.

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 8 / 16

Page 9: Regression Discontinuity Designs - Harvard University

Density Test of Sorting (McCrary. 2008. J. Econom.)

r ¼ X 1;X 2; . . . ;X J . The binsize and bandwidth were again chosen subjectively after using the automaticprocedure. Much more so than the vote share density, the roll call density exhibits very specific features nearthe cutoff point that are hard for any automatic procedure to identify.27

The figure strongly suggests that the underlying density function is discontinuous at 50%. Outcomes withina handful of votes of the cutoff are much more likely to be won than lost; the first-step histogram indicatesthat the passage of a roll call vote by 1–2 votes is 2.6 times more likely than the failure of a roll call vote by 1–2

ARTICLE IN PRESS

0

30

60

90

120

150

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1Democratic Margin

Fre

quen

cy C

ount

0.00

0.20

0.40

0.60

0.80

1.00

1.20

1.40

1.60

Den

sity

Est

imat

e

Fig. 4. Democratic vote share relative to cutoff: popular elections to the House of Representatives, 1900–1990.

Table 2Log discontinuity estimates

Popular elections Roll call votes

#0.060 0.521(0.108) (0.079)

N 16,917 35,052

Note: Standard errors in parentheses. See text for details.

0

50

100

150

200

250

300

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1Percent Voting in Favor of Proposed Bill

Fre

quen

cy C

ount

0.00

0.50

1.00

1.50

2.00

2.50

Den

sity

Est

imat

e

Fig. 5. Percent voting yeay: roll call votes, U.S. House of Representatives, 1857–2004.

27I use a binsize of b ¼ 0:003 and a bandwidth of h ¼ 0:03. The automatic procedure would select b ¼ 0:0025 and h ¼ 0:114.

J. McCrary / Journal of Econometrics 142 (2008) 698–714710

1 Create histogram with a selected bin size2 Fit local linear regression to bin midpoints to smooth the histogram3 Estimate the difference in the logged histogram height at the

thresholdKosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 9 / 16

Page 10: Regression Discontinuity Designs - Harvard University

Placebo Test

V.C. Sensitivity to Alternative Measures of Voting Records

Our results so far are based on a particular voting index, theADA score. In this section we investigate whether our resultsgeneralize to other voting scores. We find that the findings do notchange when we use alternative interest groups scores, or othersummary measures of representatives’ voting records.

Table III is analogous to Table I, but instead of using ADAscores, it is based on two alternative measures of roll-call voting.The top panel is based on McCarty, Poole, and Rosenthal’s DW-NOMINATE scores. The bottom panel is based on the percent ofindividual roll-call votes cast that are in agreement with theDemocrat party leader. All the qualitative results obtained usingADA scores (Table I) hold up using these measures. When we usethe DW-NOMINATE scores, is �0.36, remarkably close to thecorresponding estimate of 1[Pt�1

D � Pt�1R ] in column (4), which

is �0.34. The estimates are negative here because, unlike ADAscores, higher Nominate scores correspond to a more conservativevoting record. When we use the measure “percent voting with theDemocrat leader,” is 0.13, almost indistinguishable from the

FIGURE VSpecification Test: Similarity of Historical Voting Patterns between Bare

Democrat and Republican DistrictsThe panel plots one time lagged ADA scores against the Democrat vote share.

Time t and t � 1 refer to congressional sessions. Each point is the average laggedADA score within intervals of 0.01 in Democrat vote share. The continuous line isfrom a fourth-order polynomial in vote share fitted separately for points above andbelow the 50 percent threshold. The dotted line is the 95 percent confidenceinterval.

838 QUARTERLY JOURNAL OF ECONOMICS

What is a good placebo?1 expected not to have any effect2 closely related to outcome of interest

Lagged outcome future cannot affect pastInterpretation: failure to reject the null 6= the null is correct

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 10 / 16

Page 11: Regression Discontinuity Designs - Harvard University

Fuzzy RD Design (Hahn et al. 2001. Econometrica)

Sharp regression discontinuity design: Ti = 1{Xi ≥ c}What happens if we have noncompliance?Forcing variable as an instrument: Zi = 1{Xi ≥ c}Potential outcomes: Ti(z) and Yi(z, t)Assumptions

1 Monotonicity: Ti(1) ≥ Ti(0)2 Exclusion restriction: Yi(0, t) = Yi(1, t)3 E(Ti(z) | Xi = x) and E(Yi(z,Ti(z)) | Xi = x) are continuous in x

Estimand:

E(Yi(1,Ti(1))− Yi(0,Ti(0)) | Complier ,Xi = c)

Estimator:

limx↓c E(Yi | Xi = x)− limx↑c E(Yi | Xi = x)limx↓c E(Ti | Xi = x)− limx↑c E(Ti | Xi = x)

Disadvantage: external validityKosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 11 / 16

Page 12: Regression Discontinuity Designs - Harvard University

Class Size Effect (Angrist and Lavy. 1999. Q. J. Econ)

Effect of class-size on student test scoresMaimonides’ Rule: Maximum class size = 40

f (z) =z⌊ z−1

40

⌋+ 1

0 50 100 150 200

1015

2025

3035

40

Enrollment Count

Cla

ss S

ize

Maimonides RuleActual class size

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 12 / 16

Page 13: Regression Discontinuity Designs - Harvard University

Empirical Analysis

Yi : class average verbal test scoreTi : classsizeWindow size: wConstruction of forcing variable:

Xi =

40− Zi if 40− w/2 ≤ Zi ≤ 40 + w/280− Zi if 80− w/2 ≤ Zi ≤ 80 + w/2

......

Linear models (cluster standard errors by schools):

Ti = δ1 + α1 × I{Xi ≥ 0}+ β1Xi + γ1Xi × I{Xi ≥ 0}+ ε1i

Yi = δ2 + α2 × I{Xi ≥ 0}+ β2Xi + γ2Xi × I{Xi ≥ 0}+ ε2i

where α̂1 = −7.90 (s.e. = 1.90) and α̂2 = −0.056 (s.e. = 2.08)Two-stage least squares estimate: est. = 0.007 (s.e. = 0.261)

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 13 / 16

Page 14: Regression Discontinuity Designs - Harvard University

Interrupted Time Series Design

Time as the forcing variablePossibility of multiple events at the same timeMust model time trend: seasonality, etc.

Effect of the “stand your ground” bill in Florida on homicide

(Humphreys, Gasparrini, and Wiebe. 2017. JAMA)

Use of other states difference-in-differences designs

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 14 / 16

Page 15: Regression Discontinuity Designs - Harvard University

Geographical RD Design (Keele and Titiunik. 2015. Political Anal.)

RD in two dimensions don’t use distance as forcing variableExample: media markets in Princeton, New Jersey

6.3 Compound Treatment Reduction

Before calculating each unit’s distance to each of the boundary points, we discuss how to minimize

the compound treatment assumption in this application. Figure 4 contains a map of the state of

New Jersey along with the location of the boundary between the New York and Philadelphia media

markets. We could calculate distances between voters and points along the entire media market

boundary and compare voters who are near each other along this border. However, it is first

important to reduce the incidence of compound treatments as much as possible. To do that, we

examined the boundaries of four different administrative units: U.S. congressional districts, state

senate districts, state house districts, and school districts. We found that for many parts of the

media market boundary, the boundaries of at least one of these units overlapped perfectly with the

media market boundary. In other words, in various boundary segments, not only did the media

market change at the boundary, but so did the school and/or the legislative districts. This is not

entirely surprising since, as we discussed above, the media market boundary in this area (and in

most of the United States) follows county boundaries.The overlap between media and county boundaries means that we cannot escape the problem of

compound treatments entirely, but we can minimize it by restricting our analysis to those segments

Fig. 4 Boundary between Philadelphia and New York City media markets. The dashed line represents

the boundary between the Philadelphia, PA, media market (located southwest of the boundary) and theNew York City, NY, media market (located northeast of the boundary), which divides the state ofNew Jersey. Area of detail is where legislative districts and school district are constant on both sides ofmedia market boundary. Figure 5 contains a detailed map of this area.

Geographic Boundaries as Regression Discontinuities 143

Dow

nloa

ded

from

htt

ps://

ww

w.c

ambr

idge

.org

/cor

e. U

niv

of M

ichi

gan

Law

Lib

rary

, on

19 A

pr 2

018

at 1

6:48

:50,

sub

ject

to th

e Ca

mbr

idge

Cor

e te

rms

of u

se, a

vaila

ble

at h

ttps

://w

ww

.cam

brid

ge.o

rg/c

ore/

term

s. h

ttps

://do

i.org

/10.

1093

/pan

/mpu

014

along the media market boundary where voters are in identical legislative districts and schooldistricts. Despite the length of the media market boundary, we found only one short segmentalong the border where both legislative district or school district boundaries did not also followthe county–media market boundary. The area of detail in Fig. 4 marks this boundary segment.Figure 5 contains a detailed map of this area. The area marked with gray hash lines is WestWindor-Plainsboro school district, which is split in two by the county–media market boundary.This school district also lies within a single U.S. House, state house, and state senate district.We restrict our analysis to residents in this school district since we can more plausibly assumethat areas on either side of this segment of the media market border are comparable—though wetest this assumption empirically below. Thus, while we cannot avoid compound treatments in theapplication, by finding an area where school and legislative districts are constant, we hope toincrease the plausibility of the compound treatment irrelevance assumption.

It is also worth noting that by holding units such as legislative and school districts constantin order to reduce compound treatments, we are making our analysis conditional on distance.That is, by removing compound treatments, we are already restricting our analysis to areaswithin a specific distance of the border. We suspect that, in many applications of the GRDdesign, addressing the compound treatment problem will prompt researchers to indirectly conditionon distance to the boundary and will be a useful first step to identify areas that are similar along thediscontinuity of interest.

New York-Philadelphia Media Market Boundary

New York Media Market

Philadelphia Media Market

Treated Area of Analysis

Control Area of Analysis

Montgomery Township School District

Princeton School District

Hopewell Valley School District

Frankling Township School District

South Brunswick School District

Cranbury Township School District

East Windsor School District

Robbinsville Township School DistrictMilestone Township School Distric

West Windsor-Plainsboro School District

Upper Freehold School District

Lawrence Township School District

Fig. 5 Detail of the boundary between Philadelphia and New York City media markets. Area marked with

gray hash lines indicates the West Windsor-Plainsboro school district, which straddles the media marketboundary. Empirical analysis is confined to the West Windsor-Plainsboro school district only, where legis-lative districts are also the same on both sides of the border. Treated area is southwest of media market

boundary, inside Philadelphia media market, where volume of political ads is high; control area is northeastof media market boundary, inside New York City media market, where volume of ads is zero.

Luke J. Keele and Rocı́o Titiunik144

Dow

nloa

ded

from

htt

ps://

ww

w.c

ambr

idge

.org

/cor

e. U

niv

of M

ichi

gan

Law

Lib

rary

, on

19 A

pr 2

018

at 1

6:48

:50,

sub

ject

to th

e Ca

mbr

idge

Cor

e te

rms

of u

se, a

vaila

ble

at h

ttps

://w

ww

.cam

brid

ge.o

rg/c

ore/

term

s. h

ttps

://do

i.org

/10.

1093

/pan

/mpu

014

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 15 / 16

Page 16: Regression Discontinuity Designs - Harvard University

Summary

Observational studies treatment assignment is not random“Design” observational studies for credible causal inference

Sharp regression discontinuity (RD) designs:deterministic (rather than stochastic) treatment assignment rulecontinuity assumption no sortingdoes not require “as-if random” assumptionlimited external validity extrapolation required for generalizationincumbency effects controversy (Eggers et al. 2015. Am. J. Political Sci)Importance of placebo tests

Fuzzy RD designs: noncomplianceOther RD designs: interrupted time series, geographical boundary

Suggested readings:ANGRIST AND PISCHKE, Chapter 6IMBENS AND LEMIEUX. (2008). J. of Econom

Kosuke Imai (Harvard) Regression Discontinuity Designs Stat186/Gov2002 Fall 2019 16 / 16