32
1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New York NY USA 1.1 Descriptive Statistics and Linear Regression

1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

Embed Size (px)

Citation preview

Page 1: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

1/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Microeconometric Modeling

William GreeneStern School of BusinessNew York UniversityNew York NY USA

1.1 Descriptive Statistics and Linear Regression

Page 2: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

2/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Data Description

• Basic Statistics• Tables• Histogram• Box Plot• Histogram• Kernel Density Estimator

Linear Regression Model

• Linear Model• Specification & Estimation

• Nonlinearities• Interactions

• Inference - Testing• Wald• F• LM

• Prediction and Model Fit• Endogeneity

• 2SLS• Control Function• Hausman Test

Page 3: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

3/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Cornwell and Rupert Panel DataCornwell and Rupert Returns to Schooling Data, 595 Individuals, 7 YearsVariables in the file are

EXP = work experienceWKS = weeks workedOCC = occupation, 1 if blue collar, IND = 1 if manufacturing industrySOUTH = 1 if resides in southSMSA = 1 if resides in a city (SMSA)MS = 1 if marriedFEM = 1 if femaleUNION = 1 if wage set by union contractED = years of educationLWAGE = log of wage = dependent variable in regressions

These data were analyzed in Cornwell, C. and Rupert, P., "Efficient Estimation with Panel Data: An Empirical Comparison of Instrumental Variable Estimators," Journal of Applied Econometrics, 3, 1988, pp. 149-155.

Page 4: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

4/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Page 5: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

5/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Objective: Impact of Education on (log) Wage

Specification: What is the right model to use to analyze this association?

Estimation Inference Analysis

Page 6: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

6/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Simple Linear Regression

LWAGE = 5.8388 + 0.0652*ED

Page 7: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

7/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Multiple Regression

Page 8: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

8/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Nonlinear Specification: Quadratic Effect of Experience

Page 9: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

9/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Partial EffectsCoefficients do not tell the story

Education: .05654Experience .04045 - 2*.00068*ExpFEM -.38922

Page 10: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

10/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Effect of Experience = .04045 - 2*.00068*ExpPositive from 1 to 30, negative after.

Page 11: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

11/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Model Implication: Effect of Experience and Male vs. Female

Page 12: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

12/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Interaction EffectGender Difference in Partial

Effects

Page 13: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

13/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Partial Effect of a Year of EducationE[logWage]/ED=ED + ED*FEM *FEMNote, the effect is positive. Effect is larger for women.

Page 14: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

14/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Gender Effect Varies by Years of Education

Page 15: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

15/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Endogeneity

y = X+ε, Definition: E[ε|x]≠0 Why not? The most common reasons:

Omitted variables Unobserved heterogeneity (equivalent to omitted variables) Measurement error on the RHS (equivalent to omitted

variables) Endogenous sampling and attrition

Page 16: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

16/69: Topic 1.1 - Descriptive Statistics and Linear Regression

The Effect of Education on LWAGE

1 2 3 4 ... ε

What is ε? ,... + everything else

= f( , , , ,

Ability, Motivation

Ability, Motiva .ti .on . )

EDUC

ED

LWAGE EXP

GENDER SMSA SOUTHUC

2EXP

Page 17: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

17/69: Topic 1.1 - Descriptive Statistics and Linear Regression

What Influences LWAGE?

1 2

3 4

Ability, Motivation

Ability, Motivat

( , ,...)

...

ε( )

Increased is associated with increases in

ion

Ability

Ability, Motivatio, ,n(

EDUC

E

LWAGE X

EXP

DUC X

2EXP

2

...) and ε( )

What looks like an effect due to increase in may

be an increase in . The estimate of picks up

the effect of and the hidden effect of .

Ability, Motivation

Ability

Ability

EDUC

EDUC

Page 18: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

18/69: Topic 1.1 - Descriptive Statistics and Linear Regression

An Exogenous Influence

1 2

3 4

( , , ,...)

...

ε( )

Increased is asso

Abili

ciate

ty, Motivation

Ability, Motivation

Ability, Motivation

d with increases in

( , , ,..

EDU ZC

EDUC

LWAGE X

EXP

Z

ZX

2EXP

2

.) and not ε( )

An effect due to the effect of an increase on will

only be an increase in . The estimate of picks up

the effect of only.

Ability, Motiv

ation

EDUC

EDUC

ED

Z

Z

UC

is an Instrumental Variable

Page 19: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

19/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Instrumental Variables Structure

LWAGE (ED,EXP,EXPSQ,WKS,OCC, SOUTH,SMSA,UNION)

ED (MS, FEM)

Reduced Form: LWAGE[ ED (MS, FEM), EXP,EXPSQ,WKS,OCC, SOUTH,SMSA,UNION ]

Page 20: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

20/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Two Stage Least Squares Strategy

Reduced Form: LWAGE[ ED (MS, FEM,X), EXP,EXPSQ,WKS,OCC, SOUTH,SMSA,UNION ]

Strategy (1) Purge ED of the influence of everything but

MS, FEM (and the other variables). Predict ED using all exogenous information in the sample (X and Z).

(2) Regress LWAGE on this prediction of ED and everything else.

Standard errors must be adjusted for the predicted ED

Page 21: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

21/69: Topic 1.1 - Descriptive Statistics and Linear Regression

OLS

Page 22: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

22/69: Topic 1.1 - Descriptive Statistics and Linear Regression

The extreme result for the coefficient on ED is probably due to the fact that the instruments, MS and FEM are dummy variables. There is not enough variation in these variables.

Page 23: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

23/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Source of Endogeneity

LWAGE = f(ED, EXP,EXPSQ,WKS,OCC, SOUTH,SMSA,UNION) +

ED = f(MS,FEM, EXP,EXPSQ,WKS,OCC, SOUTH,SMSA,UNION) + u

Page 24: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

24/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Remove the Endogeneity by Usinga Control Function

LWAGE = f(ED, EXP,EXPSQ,WKS,OCC, SOUTH,SMSA,UNION) + u +

LWAGE = f(ED, EXP,EXPSQ,WKS,OCC, SOUTH,SMSA,UNION) + u +

Strategy Estimate u Add u to the equation. ED is uncorrelated with

when u is in the equation.

Page 25: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

25/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Auxiliary Regression for ED to Obtain Residuals

Page 26: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

26/69: Topic 1.1 - Descriptive Statistics and Linear Regression

OLS with Residual (Control Function) Added

2SLS

Page 27: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

27/69: Topic 1.1 - Descriptive Statistics and Linear Regression

A Warning About Control Functions

Sum of squares is not computed correctly because U is in the regression.A general result. Control function estimators usually require a fix to the estimated covariance matrix for the estimator.

Page 28: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

28/69: Topic 1.1 - Descriptive Statistics and Linear Regression

An Endogeneity Test? (Hausman)

Exogenous Endogenous

OLS Consistent, Efficient Inconsistent 2SLS Consistent, Inefficient Consistent

Base a test on d = b2SLS - bOLS

Use a Wald statistic, d’[Var(d)]-1d

What to use for the variance matrix? Hausman: V2SLS - VOLS

Page 29: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

29/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Hausman Test

Chi squared with 1 degree of freedom

Page 30: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

30/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Hausman Test: One at a Time?

Page 31: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

31/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Endogeneity Test: Wu

Considerable complication in Hausman test (Greene (2012), pp. 234-237)

Simplification: Wu test. Regress y on X and estimated for the

endogenous part of X. Then use an ordinary Wald test.

Variable addition test

Page 32: 1/69: Topic 1.1 - Descriptive Statistics and Linear Regression Microeconometric Modeling William Greene Stern School of Business New York University New

32/69: Topic 1.1 - Descriptive Statistics and Linear Regression

Wu Test