29
Model-Based and Design-Based Methods in “Empirical Micro” David Card, UC Berkeley

Model-Based and Design-Based

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Model-Based and Design-Based

Model-Based and Design-BasedMethods in “Empirical Micro”

David Card, UC Berkeley

Page 2: Model-Based and Design-Based

“Empirical micro” has emerged in the past 3 decades out of labor, public economics, and related fields

- undergraduate level: econometrics curriculum

- graduate level: “applied econometrics” courses, JMP’s

- individual fields:- big changes in development and health economics

- public perception: econ is not just about interest rates

Page 3: Model-Based and Design-Based

Today’s talk:

- a little background

- some lessons from the design-based perspective

- sources of the tension between design-based and model-based approaches

- thoughts on going forward

Page 4: Model-Based and Design-Based

I. Background Issuesa. Models and Causality- a primary goal in economic research is to draw causal inferences. What role do models play in this process?

- one view (e.g., Heckman): causality is model-based: causality only exists within the framework of a theory that says 'x causes y’

- an opposing view (e.g., Holland): causality is design-based: a claim of causality requires that you can design a manipulation in which ‘x causes y’

Page 5: Model-Based and Design-Based

b. Working Models in Economics- economic studies always have a “working model”

-the working model specifies (at least implicitly)-whose behavior are we modeling? -what are the constraints?-what are the endogenous choice variables?-what variables are known/observed?-what time horizon, level of aggregation...?

- successful working model constrains the focus of the study (and holds back the audience)

Page 6: Model-Based and Design-Based

BUT: wide debate about what is legitimate working model

- trend in fields outside “empirical micro” is toward more complex working models with dynamics, imperfect information, and GE

- come back to this later...

Page 7: Model-Based and Design-Based

Three main classes of working models in emp. micro:

1. explicit/approximating Archetype: Cowles Commission; consumer demand models (Deaton and Muellbauer, 1980)

-theory implies yi=fi(Xi), with restrictions on fi-researcher estimates approximating model:

yi = Xiβ + εi (White, 1980)

-error “tacked on”: meas. error, specification error, ...

-focus on testing ‘general predictions’ of theory

Page 8: Model-Based and Design-Based

2. explicit/exactArchetype: McFadden’s (1974) multinomial logit

- model gives complete d.g.p. (“all causes” model)e.g. utility function + unobserved heterogeneity in tastes

complete demand system

-focus on estimating model parameters and distribution of unobservables

-requires specific functional forms, but once estimated can be used for prediction, welfare analysis, etc.

Page 9: Model-Based and Design-Based

3. implicit working modelArchetype: program evaluation (training, etc.)

-no precisely specified behavioral model. Focus instead on a “structural” outcome equation with a signed coefficient

-can measure causal effect of existing policy, but cannot extrapolate very far

-widely extended to studies of non-market behavior

Page 10: Model-Based and Design-Based

c. Emergence of Design-based research

The “crisis” in econometric practice in the late 1970s/early 1980s

- Lucas (1976) critique - Hendry (1980) “Alchemy or Science” - Leamer (1983) “specification searches”- failure of replication efforts (JMCB study)-Lalonde (1985) critique of program eval. methods

Page 11: Model-Based and Design-Based

1980s/1990s Responses

in labor economics:- randomized instruments (draft lottery)- arguably exogenous instruments - quasi-experiments - Mariel boatlift, NJ min. wage - RD (Angrist-Lavy, Lee's vote share idea)

outside labor:- adoption of design-based approaches in public

finance, development, health economics- explosion in lab-based and field experiments

Page 12: Model-Based and Design-Based

III. Insights from Design-Based Approacha) counterfactuals and causality-widespread adoption of Rubin causal model-identification equated with research design (LATE)

b) substantive specification testing-randomization/balance tests-pre-treatment comparisons, falsification exercises,

multi-way differencing for observational designs-hierarchy: randomized design > RD > IV with

balanced design > event study > DD > OLS

Page 13: Model-Based and Design-Based

c) replicability and pre-specified designs

- growing emphasis on “replicability” as a condition for credibility (program/data achives) -- replication failures even in lab experiments (Psychology)- do we reward or punish replicators in Economics?- movement for pre-specified research plans, particularly for field experiments (multi-testing)

Page 14: Model-Based and Design-Based

d) how much “prior knowledge” do we have?

- Deaton and Cartwright (2018): randomized designs are less appealing when we have strong prior knowledge (Fisher vs. Student)

- design based studies often turn up small or even wrong-signed results (income experiments,....)

- especially in labor economics: standard models often fall short. Other fields may be better...

Page 15: Model-Based and Design-Based

IV. Tensions between model-based and design-based approaches

Background factors

1. DSGE model becomes the working model in macro. (Lucas, Prescott, ...). Models that ignore dynamics, imperfect information, and GE are seen as incomplete and potentially misleading

Page 16: Model-Based and Design-Based

2. growth of empirical research in IO, heavily influenced by McFadden’s logit framework

- limited data and sources of exogenous variation (e.g., only market shares directly observable)

- relatively complete model needed for anti-trust policy analysis, analysis of dynamic games (entry)

- structural model needed to infer much from observed bids in auctions

Page 17: Model-Based and Design-Based

3. Developments in econometric methodology

- mixed logit and extensions (BLP)

- Rust, Hotz-Miller, … adaptation of nested logit to intertemporal context

- advances in simulation/integration techniques

- partial identification

Page 18: Model-Based and Design-Based

Key Areas of Disagreement

a. Completeness/complexity of the working model

- design-based papers use simplified working models (often single equation model) with no formal derivation of the DGP. Emphasis is on “sharp” predictions and falsification.

- model-based papers work from the assumed model (though estimation may only use moment conditions). Emphasis on sophisticated agents, equilibrium behavior.

Page 19: Model-Based and Design-Based

•Sims (2010) comment on Angrist&Pischke:

... by promoting single-equation, linear, single-instrument modeling focused on single parameters ... they are helping create an environment in which applied economists emerge from PhD programs not knowing how to undertake deeper and more useful analyses of the data.

Page 20: Model-Based and Design-Based

b. What are the “parameters of interest”?

- design-based approach estimates E[y|X] = Xβ,with focus on “clean identification” of β

- Is β endogenous? (e.g., welfare takeup; retirement; monetary policy rules)

- model-based approach posits g(y,x,e; θ)=0with focus on estimating θ, sometimes using the

implied linear projection L(y|X) = Xβ(θ)

Page 21: Model-Based and Design-Based

c. Identification

- design-based practitioners emphasize "credible" identification, with testing/evaluation of assumptions

-model-based studies are often identified by combination of stochastic assumptions, functional form and exclusion restrictions, and “outside information” (calibration)

- differences reflect confidence in the model

Page 22: Model-Based and Design-Based

d. Inherent confidence in the working model

- some design-based studies attempt to test basic predictions. E.g., min wage studies, lab experiments

- in program evaluations: null hypothesis of a zeroeffect "proof of concept"

- model-based studies usually stipulate a relatively complete model. Goal of research is to estimate parameters and conduct welfare analysis. (Some attempts at out-of sample prediction)

Page 23: Model-Based and Design-Based

e. Choice of Topic

- design based approach has crossed over into many areas on the fringes of economics (including non-neoclassical topics/questions) - e.g. peer effects, happiness

- model-based approach tends to emphasize neoclassical topics, in part because of need for model development

- nevertheless: some model-based behavioral economics (DLM charitable giving study)

Page 24: Model-Based and Design-Based

V. Going Forward

a. Generalizability /Extrapolation- classic criticism of experiments: generalizability- simplest design-based studies recover some average treatment effect for the subjects in the study, relative to a counterfactual

Some new directions:1. MTE’s (Heckman and Vytlacil) or other generalized IV approaches - “mine” the available data in a given design

Page 25: Model-Based and Design-Based

2. richer model of the counterfactuals/treatmentsE.g., models with two treatments

- Kline-Walters - Head Start evaluation- Mountjoy - 2 year and 4 year colleges

3. combining evidence from multiple studies- works when there is a single treatment effect of interest that is estimated in different settings

e.g., treatment effects for ALMP’s- can potentially recover design biases (e.g. randomized v. non-randomized designs) and publication biases

Page 26: Model-Based and Design-Based

Medium Term Effect Sizes/Confidence Intervals

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Effect Size on P[Employment] 1-2 years out

Page 27: Model-Based and Design-Based

b. Ideas from Machine Learning in Stats/CS

- ML models provide ‘best possible’ predictions- can be useful for classifying/stratifying

- cross validation and related ideas helping address ‘overfitting’

- LASSO useful in model selection and many-instruments problems

- new work on multiple testing (Efron): Kline-Walters randomized audit model

Page 28: Model-Based and Design-Based

c. Really Big Data Sets- design-based approaches are highly

complementary to really big (population) data- can search for “rare” designs- or power up IV/RD/RKD designs

- but: many “big” data sets are sparse (e.g. tax data have limited behavioral measures) need for creative modeling

- budget and access limitations for big data are changing empirical micro into a lab-based model

Page 29: Model-Based and Design-Based

Final Thoughts- Unlike physics, there is no ‘standard model’ of microeconomic behavior- Many basic premises in conventional working

models – such as temporally separable preferences, risk neutrality, etc., – are probably wrong for many/most agents-

- There is a need for many kinds of applied work:“descriptive work” that can point to new facts“program evaluation”“simple hypothesis testing” (1st order predictions)“fit the best available model”