14
Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Embed Size (px)

Citation preview

Page 1: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Conclusion to Bivariate Linear Regression

Economics 224 – Notes for November 19, 2008

Page 2: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Reporting regression results

• Equation format OR table format. For each of these:– Make sure you define x and y, with the units for each also

provided. In your report, make this accessible.– Report the sample size and units of observation.– Report the standard errors or t-statistics associated with

each of the regression coefficients.– Report the coefficient of determination, along with its

statistical significance. ANOVA table could be provided for a fuller report.

– Each step involves reorganizing the results from Excel or other statistical programs to one of these conventions.

– Don’t report too many or too few decimals!

Page 3: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Equation format

• Income and alcohol example. x is mean family income per capita in dollars, 1986 and y is alcohol consumption in litres of alcohol per capita for those aged 15 or over, 1985-86. n = 10 observations from the ten provinces of Canada.

• The regression equation, with standard errors reported in brackets, is as follows. R2 for this equation is 0.625 with a P-value of 0.0065.

• Alternatively, the t statistic could be reported in the brackets – make sure you indicate whether it is the standard errors or t statistics that are reported in the brackets.

(0.076) (2.332)

276.0835.0ˆ xy

Page 4: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

4

Table format

Dependent variable is wages and salaries

VariableEstimated

CoefficientStandard Error

Probability Value

Constant -13,493 23,211 0.568

Yrs schooling 4,181 1,606 0.017

R2 = 0.253, P = 0.017

(+ Other equation test-statistics)

Could be t-statistics

Page 5: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

5

Presenting Multiple Results

Dependent variable is wages and salaries

Variable Equation I Equation II Equation III

Constant-13,493

(23.211)…. ….

Yrs schooling4,181

(1,606) †…. ….

R2 0.253

Significance 0.017 … …

(+ Other equation test-statistics)Note: Standard errors in brackets. * – significant at the 1% level,

† – significant at the 5% level, ‡ – significant at the 10% level

Page 6: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

6

Residual analysis• The t-test and F-test theoretically only work if the assumptions

about the error term are met.E(ε) = 0; Variance (ε) = σ2 is constant for each x;Values of ε are independent of each other. ε is normally distributed.

• If these assumptions are not met:– Must correct how our model is constructed.– Or, must come up with a new estimator other than OLS and

work on correcting the problems –> Econ 324 and up.• Can’t see true ε’s –> must look at our estimates:

– Estimated residuals are ei = yi – ŷi.

– Best way: plot them versus xi or ŷi using Excel or another program.

Page 7: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

x e17 4914.40712 -21180.112 30819.8811 -2299915 -11223.415 -13223.419 4052.21615 -2223.420 9871.12116 -25404.518 3233.31211 15500.9814 27457.6912 -3680.1214.5 -41132.913.5 19548.2415 28276.613 1138.78910 7682.07512.5 -17770.715 -8223.412.3 14565.56

Residuals (e) for years of schooling (x) and wages and salaries regression

xy 181,4493,13ˆ

ye

yye ˆ

Page 8: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Last slide

• Example of regression of wages and salaries on years of schooling. Appears to satisfy assumptions, although it may have heteroskedasticity. That is, variance of residuals may not be equal for all values of x.

Page 9: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Regression of alcohol consumption on income, provinces of Canada, 1986.

Residuals. ei = predicted – actual alcohol consumption

x (income)

Appears to have a reasonable scatter of residuals, with no obvious violation of assumptions.

Page 10: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Consumption function. Example of serial or auto correlation.

Page 11: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

11

Solutions

• Tests and results are suspect.• Violations of assumptions may affect some

estimates more than others.• Solutions– May mean we have missing explanatory variables

or wrong equation format.– May mean that ε does not meet assumptions.– Use different estimators.

• Examine in detail in courses in Econometrics.

Page 12: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Transformations

• Relationship may not be linear. There are many different possibilities here. Two examples are provided in other documents:– Population growth – exponential growth.– Earnings and age – parabolic relationship.

Page 13: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

13

Some Cautions

• If we conclude β1 ≠ 0 –> doesn’t imply x causes y.

• Could still be random relationships.• Need some theoretical argument too.

• If we conclude β1 is statistically different than 0 –> doesn’t mean a linear relationship exists for sure.– Must watch out for non-linear relationships.

Page 14: Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008

Next day

• Begin multiple regression (ASW, Ch. 13).• Assignment 6 has now been posted.

Remember that this assignment is optional.