21
© Andy Field 2005 Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

Embed Size (px)

Citation preview

Page 1: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS

CHAPTER 5

REGRESSION

Page 2: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS

Figure 4.1

Page 3: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

Please find out the mean, variance, standard deviation of the two variables.

Then, calculate the covariance, and the r and R square.

Discovering Statistics Using SPSS

Page 4: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

We also talked about partial correlation.

• Do you remember how to use SPSS to calculate this and how to interpret this?

Discovering Statistics Using SPSS

Page 5: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

Moving beyond Correlation

• Correlation is useful to tell us the relationship about two variables, but it tells us nothing about the predictive model to our data and use that model to predict values of the Dependent variable from one or more independent variables.

Discovering Statistics Using SPSS

Page 6: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

The method of least squares

• We need to find a “model” that has the least “variances” and best fit the data.

• It means we need to find a straight line to “describe” our data.

Discovering Statistics Using SPSS

Page 7: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

A straight line…1. The slope (or gradient) of the line; and

2. The point at which the line crosses the vertical axis of the graph (known as the intercept of the line).

Discovering Statistics Using SPSS

Page 8: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS

Figure 7.1

Page 9: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS

Figure 7.2 – A Regression Line: a line that minimizes the sum of squared differences.

Page 10: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS

Figure 5.3 - Goodness-of-fit: how “fit” is the line?

SSm = SSr - SSt

Page 11: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

SSm• If the value of the SSm is larger, then the

regression model is very different from using the mean to predict the dependent variable.

• If the value of the SSm is small, then using the regression model is little better than using the mean as the model.

Discovering Statistics Using SPSS

Page 12: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

R square• It represents the amount of variance in the

outcome explianed by the SSm relative to how much variation was to explain by the SSt (mean).

• Thus, R square = SSm/SSt

Discovering Statistics Using SPSS

Page 13: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

F ratio• Is a measure of how much the model has

improved the prediction of the outcome compared to the level of inaccuracy of the model.

• A good model should have a large F-ratio.

Discovering Statistics Using SPSS

Page 14: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

Class exercise – weekly records.

• R square = .335, which tells us that advertising expenditure can account for 33.5% of the variation in record sales.

• ANOVA test = F ratio = 99.587

• Beta = the change in the outcome associated with a unit change in the predictor = if our independent variable is increased by 1 unit, the our model predicts that 0.096 extra records will be sold.

• T-test = tests the null hypothesis that the value of beta is 0: therefore, if it is significant we accept the hypothesis that the beta value is significantly different from zero and that the predictor variable contributes significantly to our ability to estimate values of the outcome.

Discovering Statistics Using SPSS

Page 15: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS

Figure 5.4

Page 16: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

MULTIPLE REGRESSION

Discovering Statistics Using SPSS

Page 17: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

• A logical extension of the simple regression model to situations in which there are several independent variables.

• We talked about regression LINE in a simple regression model, now we are talking about a regression PLANE

Discovering Statistics Using SPSS

Page 18: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS

Figure 5.6

Page 19: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005

Methods of regression• Hierarchial (Blockwise Entry): based on

early research findings

• Forced Entry: all enter at once but based on previous research

• Stepwise methods: exploratory

Discovering Statistics Using SPSS

Page 20: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS

Figure 7.7 Outliers – Check Cook’s distance

Page 21: © Andy Field 2005Discovering Statistics Using SPSS CHAPTER 5 REGRESSION

© Andy Field 2005Discovering Statistics Using SPSS

Figure 7.9