4. Introduction to Local Estimation - sample(ECOLOGY)Feb 04, 2016 · 4.1 Comparison. Traditional...

4. Introduction to Local Estimation

Overview

1. Traditional vs. piecewise SEM

2. Tests of directed separation

3. Introduction to piecewiseSEM

4.1 Traditional vs. Piecewise SEM

4.1 Comparison. Traditional vs. piecewise SEM

Variance-covariance Piecewise

Single (global) variance-covariance matrix estimated

Multiple (local) variance-covariance matrices estimated (one for each endogenous variable)

Simultaneous solution (computationally intensive)

Multiple solutions (modularized)

Fit to normal distribution Incorporates various distributions (Poisson, Gamma, etc.)

Assumes independence Can model non-independence (blocked, temporal, spatial, etc.)

Latent & composite variables No latent or composite variables (yet*)

True correlated errors Partial correlations (so far*)

Non-recursive (feedbacks) Only for recursive

Multi-group models Can estimate random components, but no formal χ2 test

4.1 Comparison. Traditional vs. piecewise SEM

Traditional SEM Piecewise SEM

4.1 Comparison. Graphs to equations

y1 = γ11x1 + ζ1

y2 = γ12x1 + b12y1 + ζ2

11 b12

4.1 Comparison. Break this model up

11y1x1

2 Simple Linear Regressions

4.1 Comparison. Break this model up

y2 y2 y3

1 Simple, 1 Multiple Regression

ACTIVITY

• Take your SEM from Lesson 3

• Break it up into the component models

4.2 Tests of Direction Separation

4.2 Directed Separation. Model fit

Does the model fit the data?

Does the model represent the data well??

Are we missing important information???

4.2 Directed Separation. Model fit

Did we get the topology right or are there unrecognized significant relationships?

4.2 Directed Separation. D-sep

• Concept from Graph Theory

• Two nodes are d-separated if they are conditionally independent e.g., the effect of x on y3 is zero given the influences of y1

and y2

4.2 Directed Separation. D-sep

Test of direction separation:

1. Identify all independence claims

2. Evaluate each independence claim

3. Summarize information across all claims

4.2 Directed Separation. Independence claims

1. Identify all independence claims

Basis set = the smallest possible set of independence claims from a graph

1. x ⏊ y3 | (y1, y2)

2. y1 ⏊ y2 | (x)

4.2 Directed Separation. Independence claims

The d-separation criterion for any pair of variables involves:

1. Controlling for common ancestors that could generate correlations between the pair

2. Controlling for causal connections through multi-link directed pathways via parents

3. Not controlling for common descendent variables.

4.2 Directed Separation. Deriving the basis set

What is the basis set?

1. mass ⏊ dia | (canopy)

2. mass ⏊ # | (canopy)

3. mass ⏊ %| (canopy)

4. dia ⏊ # | (canopy)

5. dia ⏊ % | (canopy)

6. % ⏊ # | (canopy)

Dia ~ mass + canopy

4.2 Directed Separation. Deriving the basis set

What is the basis set?

1. canopy ⏊ % | (#)

2. canopy⏊ mass | (dia)

3. dia ⏊ # | (canopy)

4. dia ⏊ % | (canopy, #)

5. mass ⏊ #| (dia, canopy)

6. mass ⏊ % | (dia, #)

4.2 Directed Separation. A note

• Basis set generally excludes relationships among exogenous variables

1. x1 ⏊ y2 | (y1)2. x2 ⏊ y2 | (y1)3. x1 ⏊ x2

• Unclear as to the direction of the relationship

• Unclear whether variables could even be causally linked• E.g., Ocean basin and Latitude

• Distributional assumptions, etc. not defined

1. x1 ⏊ y2 | (y1)2. x2 ⏊ y2 | (y1)3. x1 ⏊ x2

• Basis set generally excludes non-linear components (polynomials)

1. x ⏊ y2 | (y1)2. x2 ⏊ y2 | (y1)

• Basis set generally excludes non-linear components (interactions)

1. x1 ⏊ y2 | (y1)2. x2 ⏊ y2 | (y1)3. x1 * x2 ⏊ y2 | (y1)

4.2 Directed Separation. Fisher’s C

• Summarize independence claims across basis set:

C = -2*∑ln(pi)

pi = the p-values of all tests of conditional independence

• p can come from many statistics

• C has a c2-square distribution with 2k degrees of freedom

• k = # of elements of the basis set.

4.2 Directed Separation. Fisher’s C

What is p < 0.05?

• You are likely missing some associations

• You reject this model

• The way forward: adding links or different model structure? (look at d-sep tests)

• To re-iterate, p ≥ 0.05 is GOOD

4.2 Directed Separation. Model selection

• Fisher’s C can be used to construct model AIC:

AIC = C + 2K

• K = # of likelihood parameters estimated (not to be confused with k)

• Can be extended to small sample size:

AICc = C + 2K(n / (n – K – 1))

4.2 Directed Separation. Sample size

• Since there is no global vcov matrix, the t-rule does not apply

• Shipley suggests need only enough d.f. to fit each component model

• Or, d-rule (Grace et al 2015):• d = # of samples / # of pathways• d ≥ 5

• More is always better!

ACTIVITY

• Take your SEM from Lesson 3

• Derive the basis set

4.3 Introduction to piecewiseSEM

4_Local_Estimation.R

4.3 piecewiseSEM.

piecewiseSEM: Piecewise structural equation modeling in R for ecology, evolution, and systematics

install.packages(“devtools”)

library(devtools)

install_github(“jslefche/piecewiseSEM@2.0”)

library(piecewiseSEM)

Mediation in Analysis of Post-Fire Recovery of Plant Communities in California Shrublands

Five year study of wildfires in Southern California in 1993. 90 plots (20 x 50m)

4.3 piecewiseSEM. Keeley example

Distance Richness

Hetero

Abiotic1. Create list of

structured equations

2. Conduct d-septests (evaluate fit)

3. Extract coefficients

4.3 piecewiseSEM. Store list of equations

keeley <- read.csv(“keeley.csv”)

keeley.sem <- psem(

lm(abiotic ~ distance, data = keeley),

lm(hetero ~ distance, data = keeley),

lm(rich ~ abiotic + hetero, data = keeley),

data = keeley)

distance rich

hetero

abiotic

4.3 piecewiseSEM. D-sep tests

# Get the basis set

basisSet(keeley.sem)

# Conduct d-sep tests

dTable <- dSep(keeley.sem)

Independ.Claim Estimate Std.Error DF Crit.Value P.Value

1 rich ~ distance + ... 0.640431807 0.156457462 NA 4.093329 9.564005e-05 ***

2 hetero ~ abiotic + ... 0.002229248 0.001676649 NA 1.329585 1.871306e-01

distance rich

hetero

abiotic

4.3 piecewiseSEM. Fisher’s C

# Get Fisher’s C

fisherC(dTable)

Fisher.C df P.Value

1 21.862 4 0

distance rich

hetero

abiotic

4.3 piecewiseSEM. Re-assess fit

distance rich

hetero

abiotic

# Add significant path

keeley.sem2 <-

update(keeley.sem, rich ~ abiotic + hetero + distance)

dSep(keeley.sem2)

1 hetero ~ abiotic + ... 0.002229248 0.001676649 NA 1.329585 0.1871306

fisherC(keeley.sem2)

Fisher.C df P.Value

1 3.352 2 0.187

4.3 piecewiseSEM. Get coefficients

distance rich

hetero

abiotic

# Get coefficients

coefs(keeley.sem2)

Response Predictor Estimate Std.Error DF Crit.Value P.Value Std.Estimate

1 abiotic distance 0.3998 0.0823 NA 4.8562 0.0000 0.4597 ***

2 hetero distance 0.0045 0.0013 NA 3.4593 0.0008 0.3460 ***

3 rich abiotic 0.5233 0.1756 NA 2.9793 0.0038 0.2660 **

4 rich hetero 33.4010 11.1187 NA 3.0040 0.0035 0.2539 **

5 rich distance 0.6404 0.1565 NA 4.0933 0.0001 0.3743 ***

4.3 piecewiseSEM. Get R2

distance rich

hetero

abiotic

# Get R-squared

rsquared(keeley.sem2)

Response family link R.squared

1 abiotic gaussian identity 0.2113455

2 hetero gaussian identity 0.1197074

3 rich gaussian identity 0.4700472

4.3 piecewiseSEM. Summary

summary(keeley.sem2)

Structural Equation Model of keeley.sem2

abiotic ~ distance

hetero ~ distance

rich ~ abiotic + hetero + distance

AIC BIC

25.352 52.85

Tests of directed separation:

hetero ~ abiotic + ... 0.0022 0.0017 87 1.3296 0.1871

Coefficients:

abiotic distance 0.3998 0.0823 NA 4.8562 0.0000 0.4597 ***

hetero distance 0.0045 0.0013 NA 3.4593 0.0008 0.3460 ***

rich abiotic 0.5233 0.1756 NA 2.9793 0.0038 0.2660 **

rich hetero 33.4010 11.1187 NA 3.0040 0.0035 0.2539 **

rich distance 0.6404 0.1565 NA 4.0933 0.0001 0.3743 ***

Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘’ 1

Goodness-of-fit:

Global model: Fisher's C = 3.352 with P-value = 0.187 and on 2 degrees of freedom

Individual R-squared:

Response Rsquared

abiotic 0.21

hetero 0.12

rich 0.47

distance rich

hetero

abiotic

R2 = 0.21

R2 = 0.12

R2 = 0.47

0.35 0.26

4.3 piecewiseSEM. Correlated errors

distance rich

hetero

abiotic

keeley.sem3 = psem(

lm(abiotic ~ distance, data = keeley),

lm(rich ~ distance + hetero, data = keeley),

rich %~~% abiotic

4.3 piecewiseSEM. Correlated errors

distance rich

hetero

abiotic

R2 = 0.21

R2 = 0.12

R2 = 0.42

0.35 0.29

4.3 piecewiseSEM. AIC comparisons

distance rich

hetero

abiotic

distance rich

hetero

abiotic

distance rich

hetero

abiotic

distance rich

hetero

abiotic0 0

abiotic ~ 1

4.3 piecewiseSEM. Fit new model

keeley.sem4 <- psem(

lm(rich ~ distance + hetero, data = keeley),

abiotic ~ 1

distance rich

hetero

abiotic

distance rich

hetero

abiotic

distance rich

hetero

abiotic

AIC = 25.3

AIC = 28.5

4.3 piecewiseSEM. Partial regression coefficient

Isolate the independent effect of distance on richness:

1. Regress abiotic and hetero against richness (removing distance)

2. Regress distance against abiotic and hetero (remove rich)

3. Regression residuals of 1 against 2 (having removed effects of abiotic and hetero from both)

4.3 piecewiseSEM. Partial regressions

# Get partial residuals

partialResid(rich ~ distance, modelList2)

4.3 piecewiseSEM. Partial regressions

• Useful for displaying trends, particularly with complex models where bivariate correlations are messy

• Can be used for any multiple regression (single model or list)

• Not applicable to simple regression (Y ~ X)

4.4 A Warning

4.4 Directed Separation. A warning

• Intermediate non-normal endogenous variables pose a challenge

• If normal, significance values are reciprocal

𝑃2,1 == 𝑃1,2

𝑃2,1 ≠ 𝑃1,2

• If non-normal, significance values are not reciprocal because of transformation via link function

log(y1)

log(y2)

# Create fake data

set.seed(1)

data <- data.frame(

x = runif(100),

y1 = runif(100),

y2 = rpois(100, 1),

y3 = runif(100)

# Store in SEM list

modelList <- psem(

lm(y1 ~ x, data),

glm(y2 ~ x, "poisson", data),

lm(y3 ~ y1 + y2, data),

data )

summary(modelList)

Error:

Non-linearities detected in the basis set where P-values are not

symmetrical.

This can bias the outcome of the tests of directed separation.

Offending independence claims:

y2 <- y1 *OR* y2 -> y1

Option 1: Specify directionality using argument 'direction = c()'.

Option 2: Remove path from the basis set by specifying as a correlated

error using '%~~%'.

Option 3: Use argument 'conserve = TRUE' to compute both tests, and

return the most conservative P-value.

dSep(modelList, conserve = T)

1 y3 ~ x + ... -0.2006020 0.1088444 NA -1.8430170 0.06841223

2 y2 ~ y1 + ... -0.2826168 0.3994236 NA -0.7075617 0.47921747

summary(mody1.y2)$coefficients[2, 4]

0.5152512

0.4792175

dSep(modelList, direction = c("y2 <- y1"))

1 y3 ~ x + ... -0.20060205 0.10884438 NA -1.8430170 0.06841223

2 y1 ~ y2 + ... -0.01818537 0.02784565 NA -0.6530776 0.51525122

0.5152512

0.4792175

modelList2 <- update(modelList, y2 %~~% y1)

summary(modelList2)

Tests of directed separation:

y3 ~ x + ... -0.2006 0.1088 NA -1.843 0.0684

Coefficients:

y1 x 0.0173 0.1026 NA 0.1686 0.8664 0.0170

y2 x 0.5232 0.4170 NA 1.2547 0.2096 0.7548

y3 y1 0.0042 0.1079 NA 0.0388 0.9691 0.0039

y3 y2 0.0297 0.0295 NA 1.0079 0.3160 0.1020

~~y2 ~~y1 -0.0768 NA 97 -0.7589 0.7751 -0.0768

4. Introduction to Local Estimation - sample(ECOLOGY)Feb 04, 2016 · 4.1 Comparison. Traditional...

Documents

e lecture 4c covariance and correlationcs.brown.edu/courses/cs100/lectures/lecture4c.pdf · 2019-10-15 · Dispersion: variance, standard deviation, covariance, correlation ... A

X Y. Variance Covariance Correlation Scatter plot

Analysis of covariance - NYUjsimonof/classes/2301/pdf/ancova.pdf · Analysis of covariance Analysis of variance (ANOVA) models are restrictive in that they allow only categori-

VICReg: Variance-Invariance-Covariance Regularization for

Analysis of Variance and Covariance: How to Choose and ......Analysis of Variance and Covariance: How to Choose and Construct Models for the Life Sciences Analysis of variance (ANOVA)

Analysis of Variance and Covariance

Part 1 Analysis of Covariance: ANCOVA. Analysis of Covariance ANCOVA Like an analysis of variance in which one or more variables (called covariates) have

anova — Analysis of variance and covariance - Stata · PDF fileIf you are interested in MANOVA or MANCOVA, see ... anova— Analysis of variance and covariance 5. regress, baselevels

Christopher Dougherty EC220 - Introduction to econometrics (review chapter) Slideshow: covariance, covariance and variance rules, and correlation Original

PBG 650 Advanced Plant Breeding Module 6: Quantitative Genetics – Environmental variance – Heritability – Covariance among relatives

Essays on Financial Econometrics: Variance and Covariance

Estimation of Error Variance-Covariance Parameters Using ...2019/10/18 · Research Article Estimation of Error Variance-Covariance Parameters Using Multivariate Geographically Weighted

1. VARIANCE- & COVARIANCE-BASED SEM 2. TESTING FOR …faculty.weatherhead.case.edu/jxs16/docs/SEM_AMA... · sem overview 1. variance- & covariance-based sem 2. testing for common

Variance and covariance M contains the mean Sums of squares General additive models

Variance and covariance Sums of squares General linear models

Genetic variance–covariance matrices: a critique of the evolutionary

Estimation of Variance and Covariance Components C. R ...hz68/619/Henderson1953.pdf · Estimation of Variance and Covariance Components C. R. Henderson Biometrics, Vol. 9, No. 2

An introduction to the GOCE error variance covariance

Modelling heterogeneous variance-covariance …eprints.ncrm.ac.uk/3549/1/hetvarcov.pdfModelling heterogeneous variance-covariance components in two-level multilevel models with application

Accuracy of GNSS Observations from Three Real-Time ... · PRS, thereby producing a delta ECEF vector and its variance and covariance values (i.e., a 3 by 3 variance-covariance matrix)