27
Phenotypic multivariate analysis

Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

  • View
    217

  • Download
    2

Embed Size (px)

Citation preview

Page 1: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Phenotypic multivariate analysis

Page 2: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Last 2 days…….

1

P

A AC C E E

1/.5

ac ecae

P

Page 3: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

This lecture

P

AC E

ace

P

F

f1

P P

f2 f3

Page 4: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

This lecture

P

F

f5

P

F

f1

P

F

f2

P

F

f3

P

F

f4

C

c1 c3 c4c5

c2

Page 5: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Data analysis in non-experimental designs using latent constructs

Principal Components Analysis Triangular Decomposition (Cholesky) Exploratory Factor Analysis Confirmatory Factor Analysis Structural Equation Models

Page 6: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Principal Components Analysis

SPSS, SAS Is used to reduce a large set of correlated

observed variables (xi) to (a smaller number of) uncorrelated (orthogonal) components (yi)

yi is a linear function of xi Transformation of the data, not a model

Page 7: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

PCA path diagram

D

P

S = observed covariances = P D P’

x1 x2 x3 x4 x5

y1 y2 y3 y4 y5

Page 8: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

PCA equations

Covariance matrix qSq = qPq qDq qPq’ = P# P # ’ P = orthogonal matrix of eigenvectors D = diagonal matrix with eigenvalues P’P = I and P# = P D Criteria for number of factors Kaiser criterion, scree plot, %var Important: models not identified!

x1 x2 x3 x4 x5

y1 y2 y3 y4 y5

Page 9: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Correlations: satisfaction, n=100

Var 1

work

Var 2

work

Var 3

work

Var 4

home

Var 5

home

Var 6

home

Var 1 1

Var 2 .65 1

Var 3 .65 .73 1

Var 4 .14 .14 .16 1

Var 5 .15 .18 .24 .66 1

Var 6 .14 .24 .25 .59 .73 1

Page 10: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

++++ ++

00

0

0

0

0++

++++

work home

Var 1 Var 2 Var 3 Var 4 Var 5 Var 6

Page 11: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Triangular decomposition (Cholesky)

x1 x2 x3 x4 x5

y1 y2 y3 y4 y5

1 operationalization of all PCA outcomes

Model is just identified and saturated (df=0)

1 1 1 1 1

Page 12: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Triangular decomposition

S = Q * Q’ ( = P# * P# ‘)

5Q5 = f11 0 0 0 0f21 f22 0 0 0f31 f32 f33 0 0f41 f42 f43 f44 0f51 f52 f53 f54 f55

Q is a lower matrix This is not a model! This is a transformation of

the observed matrix S. Fully determinate!

Page 13: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Matrix algebra, Cholesky

3Q3 = f11 0 0

f21 f22 0f31 f32 f33

Calculate Q * Q’

Var x1: f11*f11Var x2: f21*f21+f22*f22Cov x1,x3: f31*f11Cov x2,x3: f31*f21+f32*f22

Page 14: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Exploratory Factor Analysis

Account for covariances among observed variables in terms of a smaller number of latent, common factors

Includes error components for each variable x = L * f + u x = observed variables f = latent factors u = unique factors L = matrix of factor loadings

Page 15: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

EFA path diagram

C

L

U

Page 16: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

EFA equations

S = L * C * L’ + U * U’ S = observed covariance matrix L = factor loadings C = correlations between factors U = diagonal matrix of errors

Correlations between latent factors are allowed

Page 17: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Exploratory factor analysis

No prior assumption on number of factors All variables load on all latent factors Factors are either all correlated or all

uncorrelated Unique factors are uncorrelated Underidentification

Page 18: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Confirmatory factor analysis

A model is constructed in advance The model has a specific number of factors Variables do not have to load on all factors Measurement errors may correlate Latent factors may be correlated

Page 19: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

CFA

An initial model (i.e., a matrix of factor loadings) may be specified, because:

its elements have been obtained from a previous analysis in another sample

its elements are described by a theoretical process

Page 20: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

CFA equations

x = L * f + u x = observed variables, f = latent factors u = unique factors, L = factor loadings S = L * C * L’ + U * U’ S = observed covariance matrix L = factor loadings C = correlations between factors U = diagonal matrix of errors

Page 21: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Structural equations models

The factor model x = L * f + u is sometimes refered to as the measurement model

The relations between latent factors can also be modelled

This is done in the covariance structure model, or the structural equations model

Higher order factor models

Page 22: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Structural Model

X1

e1

X2

e2

X3

e3

X4

e4

X5

e5

X6 X7

e7

f1

f2

f3

Page 23: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Practice!

Problem behavior in children (CBCL at age 3) 7 syndromes (aggression, oppositional,

withdrawn/depressed, anxious, overactive, sleep and somatic problems

Syndromes are correlated

Datafile: cbcl1all.cov

Page 24: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Observed correlations (2683 subj.)

Opp w/d agg anx act sleep Withdrawn .41 Aggression .63 .35 Anxious .45 .47 .27 Overactive .53 .34 .52 .29 Sleep .32 .24 .28 .26 .23 Somatic .21 .22 .18 .17 .15 .23

Page 25: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Cholesky: How many underlying factors?– S = Q * Q’, Q is 7x7 lower– Fact7.mx

What is the fit of a 1 factor model?– S = F * F’ + U, F = 7x1 full, U = 7x7 diagonal– Fact1.mx

Page 26: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

What is the fit of a 2 factor model?– Same model,but F = 7x2 full with loading of aggression fixed– Fact2.mx

Achenbach suggests 2 factors: externalizing and internalizing: what is the evidence for this model?

– Same model, F = 7x2 full, internalizing factor and externalizing factor

– Fact2a.mx

Page 27: Phenotypic multivariate analysis. Last 2 days……. 1 P A A C C E E 1/.5 a cecae P

Can the 2 factor model be improved by adding a 3rd general problem factor or by having a correlation between the 2 factors?

– Same model, F = 7x3 full with general factor, internalizing factor and externalizing factor, Fact3.mx

– S = F * C * F’ + U, F = 7x2 full matrix, C = stand 2x2 matrix (correlation), U = 7x7 diagonal matrix, Fact2b.mx