29
Chapter Fourteen 14-1

Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Embed Size (px)

Citation preview

Page 1: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Chapter Fourteen

14-1

Page 2: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Statistical Analysis Procedures

Statistical procedures that simultaneously analyze multiple measurements on each individual or object under study.

14-2Key Terms & Definitions

Page 3: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Statistical Analysis

14-3Key Terms & Definitions

Page 4: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Correlation Analysis

Bivariate Techniques:

• Statistical methods of analyzing the relationship between variables.

Independent Variable:

• Variable believed to affect the value of the dependent variable.

14-4Key Terms & Definitions

Page 5: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Dependent Variable:

• Variable expected to be explained or caused by the independent variable.

Regression Analysis:

• The analysis of the strength of the linear relationship between variables when one is considered the independent variable and the other is the dependent variable.

Correlation Analysis

14-5Key Terms & Definitions

Page 6: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Correlation Analysis

14-6Key Terms & Definitions

Page 7: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

The Strength of Association:

The coefficient of determination (R2): the percentage of the total variation in the dependent variable explained by the independent variable.

Pearson Correlation:

Analysis of the degree to which changes in one variable are associated with changes in another for use with metric data.

Measures of AssociationThe Concepts

14-7Key Terms & Definitions

Page 8: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Multiple Regression Key Concepts

Coefficient of Determination:

• Measured changes in the dependent and independent variables.

Regression Coefficients:

• Effect of the independent variable on the dependent variable.

Dummy Variables:

• Nominally scaled variables included in regression analysis.

A procedure for predicting the level or magnitude of a (metric) dependent variable based on the levels of multiple independent variables.

14-8Key Terms & Definitions

Page 9: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Regression Analysis

14-9Key Terms & Definitions

Page 10: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Regression Analysis

14-10Key Terms & Definitions

Page 11: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Uses of Regression Analysis

14-11Key Terms & Definitions

Page 12: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Coefficient of Determination:

Measure of the percentage of the variation in the dependent variable explained by variations in the independent variables.

Regression Coefficients:

Estimates of the effect of individual independent variables on the dependent variable.

Dummy Variables:

In regression analysis, a way of representing two-group or dichotomous, nominally scaled independent variables by coding one group as 0 and the other as 1.

14-12Key Terms & Definitions

Regression Analysis Measurement Applications

Page 13: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Potential Use and Interpretation Problems

Collinearity:

Correlation of independent variables with each other, which can bias estimates of regression coefficients.

Causation:

Inference that a change in one variable is responsible for (caused) an observed change in another variable.

Scaling of Coefficients:

A method of directly comparing the magnitudes of the regression coefficients of independent variables by scaling them in the same units or by standardizing the data.

14-13Key Terms & Definitions

Page 14: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Standardization Process

14-14Key Terms & Definitions

Page 15: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Cluster AnalysisThe general term for statistical procedures that classify objects, or people, into some number of mutually exclusive and exhaustive groups on the basis of two or more classification variables.

14-14Key Terms & Definitions

Consumers who frequently eat out but seldom eat at fast-food restaurants.

People who frequently eat out and also frequently eat at fast-food restaurants.

People who do not frequently eat out or frequently eat at fast-food restaurants.

Page 16: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Cluster AnalysisClustering people according to how frequently and where they eat out is a way of identifying a particular consumer base. An upscale restaurant can see where its customers fall.

14-16Key Terms & Definitions

Page 17: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Factor Analysis

A procedure for simplifying data by reducing a large set of variables to a smaller set of factors of composite variables by identifying dimensions of the data.

Factor: A linear combination of variables that are correlated with each other.

14-17Key Terms & Definitions

Page 18: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Factor Scores

In factor analysis, a factor score is calculated on each factor for each subject in the data set. For example, in a factor analysis with two factors, the following equations might be used to determine factor scores:

14-18Key Terms & Definitions

Page 19: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Factor Loading

Correlation between factor scores and the original variables.

Other Key Issues:

• Naming Factors• Number of factors to Retain

14-19Key Terms & Definitions

Page 20: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Factor Loading

Naming Factors

This is a somewhat subjective step. Usually a certain consistency exists among the variables that load highly on a given factor. For example, it is not surprising to see ratings “smooth ride” and “quiet ride” on the same factor.

Number of Factors to Retain

How many factors o you retain? A general rule of thumb is to stop

factoring when additional factors no longer make sense.

14-20Key Terms & Definitions

Page 21: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Conjoint AnalysisConjoint analysis could be used by a manufacturer of golf balls to determine the three most important features of a

golf ball and to see which ball meets the most needs of both consumer and manufacturer.

14-21Key Terms & Definitions

Average driving distance• 10 yards more than the golfer’s average• Same as the golfer's average• 10 yards less than the golfer's average

Average ball life• 54 holes• 36 holes• 18 holes

Price Per Ball• $2.00• $2.50• $3.00

Page 22: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Conjoint AnalysisConjoint analysis could be used by a manufacturer of golf balls to determine the three most important features of a

golf ball and to see which ball meets the most needs of both consumer and manufacturer.

14-22Key Terms & Definitions

For potential purchasers, the ideal golf ball has these characteristics• Average driving distance – 10 yards above average• Average ball life – 54 holes• Price $2.00

For the manufacturer, which is based on cost, the ideal golf ball has these characteristics:• Average driving distance – 10 yards below average• Average ball life – 18 holes• Price - $3.00

Page 23: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Conjoint Analysis

Multivariate procedure used to quantify the value consumers associate with different levels of

product/service attributes or features.

14-23Key Terms & Definitions

Page 24: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Conjoint Analysis – Estimating Utilities

Utilities: The relative value of attribute levels determined through conjoint analysis.

14-24Key Terms & Definitions

Page 25: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Conjoint Analysis – Estimating Utilities

Utilities: The relative value of attribute levels determined through conjoint analysis.

14-25Key Terms & Definitions

Page 26: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

After some pre-processing, the actual data

mining process includes four steps:

1. Clustering

2. Classification

3. Modeling

4. Application

Data Mining Process

Key Terms & Definitions14-26

Page 27: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Step One ClusteringDiscovering groups and structures in the data based on selected sets of variables.

Step Two ClassificationApplying the structure identified in the cluster analysis to another subset of data.

Data Mining Process

Key Terms & Definitions14-27

Page 28: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

Step Three ModelingUse regression to model the relationship or predict subgroup membership. Seeking high predictive accuracy.

Step Four ApplicationPutting your actionable data to work.

Data Mining Process

Key Terms & Definitions14-28

Page 29: Chapter Fourteen 14-1. Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or

14-29

Key Terms & Definitions

• Regression Analysis• Coefficient of Determination• Regression Coefficients• Dummy Variables• Collinearity• Causation• Scaling of Coefficients

Links and button are active when in “Slide Show Mode”Key Terms & Definitions

• Discriminant Coefficient• Cluster Analysis• Factor Analysis• Factor Scores• Factor Loading• Conjoint Analysis• Estimating Utilities