Analysis of Variance Webinar

Embed Size (px)

Citation preview

  • 8/13/2019 Analysis of Variance Webinar

    1/44

    1

    Analysis of Variance

    Using Statgraphics Centurion

    Dr. Neil W. Polhemus

    Copyright 2011 by StatPoint Technologies, Inc.

  • 8/13/2019 Analysis of Variance Webinar

    2/44

    2

    Analysis of Variance

    Analysis of variance (ANOVA) models partition the variability of

    a response variable into components attributable to one or

    more explanatory factors.

    Factors may be:

    Categorical or quantitative

    Crossed or nested

    Fixed or random

    Fully or partially randomized

  • 8/13/2019 Analysis of Variance Webinar

    3/44

    3

    Procedures

    STATGRAPHICS has procedures for:

    Oneway ANOVA single categorical factor.

    Multifactor ANOVA multiple crossed categorical factors.

    Variance Components Analysis multiple nested categorical

    factors.

    General Linear Models any combination of factors.

  • 8/13/2019 Analysis of Variance Webinar

    4/44

    4

    Example #1 Oneway ANOVA

    Data: 1/10-th systematic sample of companies in the Fortune

    500 list for 1986 (Source: DASL Data and Story Library)

    Response: Y = profit per employee

    Factor: X = sector of economy

  • 8/13/2019 Analysis of Variance Webinar

    5/44

    5

    Data file: profits.sgd

  • 8/13/2019 Analysis of Variance Webinar

    6/44

    6

    Data Input

  • 8/13/2019 Analysis of Variance Webinar

    7/44

    7

    ANOVA Table

  • 8/13/2019 Analysis of Variance Webinar

    8/44

    8

    Means Plot

  • 8/13/2019 Analysis of Variance Webinar

    9/44

    9

    Multiple Range Tests

  • 8/13/2019 Analysis of Variance Webinar

    10/44

    10

    Analysis of Means (ANOM)

  • 8/13/2019 Analysis of Variance Webinar

    11/44

    11

    Residual Plot

  • 8/13/2019 Analysis of Variance Webinar

    12/44

    12

    Excluding Row #49

  • 8/13/2019 Analysis of Variance Webinar

    13/44

    13

    Example #2 Multifactor

    ANOVA Data: Exercise tolerance in a stress test (Applied Linear

    Statistical Models by Neter et al.)

    Response: Y = minutes until fatigue occurs on a stationary

    bicycle

    Factors: X1 = gender, X2 = percent body fat, X3 = smoking

    history

    Experimental design: 2 by 2 by 3 factorial design with 3

    subjects at each of the 12 combinations of the factors

  • 8/13/2019 Analysis of Variance Webinar

    14/44

    14

    Data file: stresstest.sgd

  • 8/13/2019 Analysis of Variance Webinar

    15/44

    15

    Data Input

  • 8/13/2019 Analysis of Variance Webinar

    16/44

    16

    Analysis of Variance Table

  • 8/13/2019 Analysis of Variance Webinar

    17/44

    17

    Graphical ANOVA

  • 8/13/2019 Analysis of Variance Webinar

    18/44

    18

    Interaction Plot

  • 8/13/2019 Analysis of Variance Webinar

    19/44

    Example #3: Analysis of

    Covariance Tests whether certain factors have an effect on the response

    after removing the effect of one or more quantitative factors.

    Response: Y = profit per employee

    Factor: X = sector of economy

    Covariate: LOG(sales)

    19

  • 8/13/2019 Analysis of Variance Webinar

    20/44

    20

    X-Y Plot

  • 8/13/2019 Analysis of Variance Webinar

    21/44

    21

    Data Input

  • 8/13/2019 Analysis of Variance Webinar

    22/44

    22

    Means Plot

    Commun

    ication

    Energy

    Fina

    nce

    HiTe

    ch

    Manufacturin

    g

    Medical

    Other

    Retail

    Tran

    sportatio

    n

    Means and 95.0 Percent Tukey HSD Intervals

    Sector

    -10

    0

    10

    20

    30

    40

    Profits/Employ

    ees

  • 8/13/2019 Analysis of Variance Webinar

    23/44

    23

    Example #4 Variance

    Components Study Data: Pigment paste example (Statistics for Experimenters by

    Box, Hunter and Hunter)

    Response: Y = moisture contents

    Factors: X1 = batch, X2 = sample within batch, X3 = test within

    sample

    Experimental design: 15 by 2 by 2 hierarchical design

  • 8/13/2019 Analysis of Variance Webinar

    24/44

    24

    Data file: pigment.sgd

  • 8/13/2019 Analysis of Variance Webinar

    25/44

    25

    Data Input

  • 8/13/2019 Analysis of Variance Webinar

    26/44

    26

    Variance Components Plot

  • 8/13/2019 Analysis of Variance Webinar

    27/44

    27

    Analysis of Variance

  • 8/13/2019 Analysis of Variance Webinar

    28/44

    28

    Example #5 Split-Plot Design

    Data: Corrosion resistance example (Statistics for

    Experimenters by Box, Hunter and Hunter)

    Response: Y = corrosion resistance of steel bars

    Factors: X1 = furnace temperature, X2 = coating

    Experimental design: Since furnace temperature is hard to

    change, it will be randomized over a larger experimental unitthan coating.

  • 8/13/2019 Analysis of Variance Webinar

    29/44

    29

    Experimental Design

    Each of the 6 runs of the furnace is a whole plot.Temperature is a whole plot factor and is randomized across the runs.

    Each position in the furnace is a subplot.

    Coating is a subplot factor that is randomized across the positions.

  • 8/13/2019 Analysis of Variance Webinar

    30/44

    30

    Data file: steelbars.sgd

  • 8/13/2019 Analysis of Variance Webinar

    31/44

    31

    Data Input - GLM

  • 8/13/2019 Analysis of Variance Webinar

    32/44

    32

    Model Specification

  • 8/13/2019 Analysis of Variance Webinar

    33/44

    33

    Analysis of Variance

  • 8/13/2019 Analysis of Variance Webinar

    34/44

    34

    F-tests and Error Components

  • 8/13/2019 Analysis of Variance Webinar

    35/44

    35

    Interaction Plot

  • 8/13/2019 Analysis of Variance Webinar

    36/44

    36

    Example #6 Repeated

    Measures Design

    Each of 3 drugs was given to 8 different patients. Their heart rate wasmeasured at 4 distinct times.

    There are 2 experimental units: subject to which a particular drug is

    assigned, and time period in which measurements are taken.

    Source: Analysis of Messy Data by Milliken and Johnson.

    Subject Drug T1 T2 T3 T4

    1 AX23 72 86 81 77

    2 AX23 78 83 88 81

    9 BWW9 85 86 80 8410 BWW9 82 86 80 84

    17 CONTROL 69 73 72 74

    18 CONTROL 66 62 67 73

  • 8/13/2019 Analysis of Variance Webinar

    37/44

    37

    Data file: heartrate.sgd

  • 8/13/2019 Analysis of Variance Webinar

    38/44

    38

    Data input

  • 8/13/2019 Analysis of Variance Webinar

    39/44

    39

    Model Specification

  • 8/13/2019 Analysis of Variance Webinar

    40/44

    40

    Analysis of Variance

  • 8/13/2019 Analysis of Variance Webinar

    41/44

    41

    Comparison to Control

  • 8/13/2019 Analysis of Variance Webinar

    42/44

    42

    User-Defined Contrast

  • 8/13/2019 Analysis of Variance Webinar

    43/44

    43

    Interaction Plot

  • 8/13/2019 Analysis of Variance Webinar

    44/44

    References

    Box, G. E. P., Hunter, W. G. and Hunter, J. S. (2005). Statistics

    for Experimenters: An Introduction to Design, Data Analysis,

    and Model Building, 2nd edition. New York: John Wiley and

    Sons.

    DASL Data and Story Library. (lib.stat.cmu.edu/DASL) Milliken, G. A. and Johnson, D. E. (1992). Analysis of Messy

    Data - Volume 1: Designed Experiments, reprint edition. New

    York: Van Nostrand Reinhold.

    Neter, J., Kutner, M.H., Wasserman, W., and Nachtscheim, C.J.

    (1996). Applied Linear Statistical Models, 4th edition.

    Homewood, Illinois: Irwin.

    44