44
1 Statistical Tools for Multivariate Six Sigma Dr. Neil W. Polhemus CTO & Director of Development StatPoint, Inc. Revised talk: www.statgraphics.com\documents.htm

STATGRAPHICS Centurion XV

Embed Size (px)

Citation preview

Page 1: STATGRAPHICS Centurion XV

1

Statistical Tools for Multivariate Six Sigma

Dr. Neil W. PolhemusCTO & Director of DevelopmentStatPoint, Inc.

Revised talk: www.statgraphics.com\documents.htm

Page 2: STATGRAPHICS Centurion XV

2

The Challenge

The quality of an item or service usually depends on more than one characteristic.

When the characteristics are not independent, considering each characteristic separately can give a misleading estimate of overall performance.

Page 3: STATGRAPHICS Centurion XV

3

The Solution

Proper analysis of data from such processes requires the use of multivariate statistical techniques.

Page 4: STATGRAPHICS Centurion XV

4

Important Tools Statistical Process Control

Multivariate capability analysis Multivariate control charts

Statistical Model Building* Data Mining - dimensionality reduction DOE - multivariate optimization

* Regression and classification.

Page 5: STATGRAPHICS Centurion XV

5

Example #1

Textile fiber

Characteristic #1: tensile strength (115.0 ± 1.0)

Characteristic #2: diameter (1.05 ± 0.01)

Page 6: STATGRAPHICS Centurion XV

6

Individuals Charts

Page 7: STATGRAPHICS Centurion XV

7

Capability Analysis (each separately)

Page 8: STATGRAPHICS Centurion XV

8

Scatterplot

Page 9: STATGRAPHICS Centurion XV

9

Multivariate Normal Distribution

Page 10: STATGRAPHICS Centurion XV

10

Control Ellipse

Page 11: STATGRAPHICS Centurion XV

11

Multivariate CapabilityDetermines joint probability of being within

the specification limits on all characteristics.

Page 12: STATGRAPHICS Centurion XV

12

Mult. Capability Indices

Defined to give the

same DPM as in the

univariate case.

Page 13: STATGRAPHICS Centurion XV

13

More than 2 Variables

Page 14: STATGRAPHICS Centurion XV

14

Hotelling’s T-Squared

Measures the distance of each point from the centroid of the data (or the assumed distribution).

Page 15: STATGRAPHICS Centurion XV

15

T-Squared Chart

Page 16: STATGRAPHICS Centurion XV

16

T-Squared Decomposition

Page 17: STATGRAPHICS Centurion XV

17

Statistical Model Building Defining relationships (regression and ANOVA) Classifying items Detecting unusual events Optimizing processes

When the response variables are correlated, it is important to consider the responses together.

When the number of variables is large, the dimensionality of the problem often makes it difficult to determine the underlying relationships.

Page 18: STATGRAPHICS Centurion XV

18

Example #2

Page 19: STATGRAPHICS Centurion XV

19

Matrix Plot

Page 20: STATGRAPHICS Centurion XV

20

Multiple Regression

Page 21: STATGRAPHICS Centurion XV

21

Reduced Models

MPG City = 29.9911 - 0.0103886*Weight + 0.233751*Wheelbase (R2=73.0%)

MPG City = 64.1402 - 0.054462*Horsepower - 1.56144*Passengers - 0.374767*Width (R2=64.3%)

Page 22: STATGRAPHICS Centurion XV

22

Dimensionality Reduction

Construction of linear combinations of the variables can often provide important insights.

Principal components analysis (PCA) and principal components regression (PCR): constructs linear combinations of the predictor variables X that contain the greatest variance and then uses those to predict the responses.

Partial least squares (PLS): finds components that minimize the variance in both the X’s and the Y’s simultaneously.

Page 23: STATGRAPHICS Centurion XV

23

Principal Components Analysis

Page 24: STATGRAPHICS Centurion XV

24

Scree Plot

Page 25: STATGRAPHICS Centurion XV

25

Component Weights

C1 = 0.377*Engine Size + 0.292*Horsepower + 0.239*Passengers + 0.370*Length + 0.375*Wheelbase + 0.389*Width + 0.360*U Turn Space + 0.396*Weight

C2 = -0.205*Engine Size – 0.593*Horsepower + 0.731*Passengers + 0.043*Length + 0.260*Wheelbase – 0.042*Width – 0.026*U Turn Space – 0.030*Weight

Page 26: STATGRAPHICS Centurion XV

26

Interpretation

Page 27: STATGRAPHICS Centurion XV

27

PC Regression

Page 28: STATGRAPHICS Centurion XV

28

Contour Plot

Page 29: STATGRAPHICS Centurion XV

29

PLS Model Selection

Page 30: STATGRAPHICS Centurion XV

30

PLS Coefficients

Selecting to extract 3 components:

Page 31: STATGRAPHICS Centurion XV

31

Interpretation

Page 32: STATGRAPHICS Centurion XV

32

Neural Networks

Page 33: STATGRAPHICS Centurion XV

33

Bayesian Classifier

Page 34: STATGRAPHICS Centurion XV

34

Classification

Page 35: STATGRAPHICS Centurion XV

35

Design of Experiments

When more than one characteristic is important, finding the optimal operating conditions usually requires a tradeoff of one characteristic for another.

One approach to finding a single solution is to use desirability functions.

Page 36: STATGRAPHICS Centurion XV

36

Example #3

Myers and Montgomery (2002) describe an experiment on a chemical process (20-run central composite design):

Response variable Goal

Conversion percentage maximize

Thermal activity Maintain between 55 and 60

Input factor Low High

time 8 minutes 17 minutes

temperature 160˚ C 210˚ C

catalyst 1.5% 3.5%

Page 37: STATGRAPHICS Centurion XV

37

Optimize Conversion

Page 38: STATGRAPHICS Centurion XV

38

Optimize Activity

Page 39: STATGRAPHICS Centurion XV

39

Desirability Functions

Maximization

Page 40: STATGRAPHICS Centurion XV

40

Desirability Functions

Hit a target

Page 41: STATGRAPHICS Centurion XV

41

Combined Desirability

di = desirability of i-th response given the settings of the m experimental factors X.

D ranges from 0 (least desirable) to 1 (most desirable).

Page 42: STATGRAPHICS Centurion XV

42

Desirability ContoursMax D=0.959 at time=11.14, temperature=210.0, and catalyst = 2.20.

Page 43: STATGRAPHICS Centurion XV

43

Desirability Surface

Page 44: STATGRAPHICS Centurion XV

44

References Johnson, R.A. and Wichern, D.W. (2002). Applied Multivariate

Statistical Analysis. Upper Saddle River: Prentice Hall.Mason, R.L. and Young, J.C. (2002).

Mason and Young (2002). Multivariate Statistical Process Control with Industrial Applications. Philadelphia: SIAM.

Montgomery, D. C. (2005). Introduction to Statistical Quality Control, 5th edition. New York: John Wiley and Sons.

Myers, R. H. and Montgomery, D. C. (2002). Response Surface Methodology: Process and Product Optimization Using Designed Experiments, 2nd edition. New York: John Wiley and Sons.

Revised talk: www.statgraphics.com\documents.htm