Upload
meryl-fisher
View
217
Download
4
Embed Size (px)
Citation preview
Bayesian factor and structural equation Bayesian factor and structural equation models in spatial applications. models in spatial applications.
Specification, identification and model Specification, identification and model assessment, with case study assessment, with case study
illustrationsillustrations
Peter Congdon, Queen Mary Peter Congdon, Queen Mary University of LondonUniversity of London
Dept of Geography & Centre for Dept of Geography & Centre for StatisticsStatistics
OutlineOutline Background: Bayesian approaches to LV Background: Bayesian approaches to LV
models, advantages & disadvantagesmodels, advantages & disadvantages Computational options including Computational options including
WINBUGSWINBUGS Wider application contexts of Bayesian LV Wider application contexts of Bayesian LV
& SEM models& SEM models Spatial Priors; Common Spatial FactorsSpatial Priors; Common Spatial Factors
Outline (continued)Outline (continued)
Different sorts of spatial factor model Different sorts of spatial factor model (depending on form of manifest variables) (depending on form of manifest variables) and possible identification issuesand possible identification issues
Assessing models, model fit & model Assessing models, model fit & model choice. Possible variable/model choice choice. Possible variable/model choice approachesapproaches
Case studiesCase studies
Case StudiesCase Studies Social capital & mental health, multilevel Social capital & mental health, multilevel
model using Health Survey for England model using Health Survey for England (HSE)(HSE)
Multilevel model for joint prevalence of Multilevel model for joint prevalence of obesity & diabetes, BRFSS respondents obesity & diabetes, BRFSS respondents nested within US counties & states (CDC nested within US counties & states (CDC Behavioral Risk Factor Surveillance Behavioral Risk Factor Surveillance System)System)
Suicide & self-harm, ecological study for Suicide & self-harm, ecological study for small areas (wards) in Eastern Englandsmall areas (wards) in Eastern England
BackgroundBackground SEM and factor models originate in (& still SEM and factor models originate in (& still
most widely used) in psychological, most widely used) in psychological, educational & behavioural applications. educational & behavioural applications.
Recent Bayesian applications to Recent Bayesian applications to psychological & education testing data psychological & education testing data include SEM (e.g. Lee & Song, 2003), LCA, include SEM (e.g. Lee & Song, 2003), LCA, item analysis, and factor analysis per se (e.g. item analysis, and factor analysis per se (e.g. Aitkin & Aitkin, 2005; Press & Shigemasu, Aitkin & Aitkin, 2005; Press & Shigemasu, 1998). 1998).
Also some work on automated Bayesian Also some work on automated Bayesian model choice in normal linear factor modelmodel choice in normal linear factor model
Advantages of Bayesian ApproachAdvantages of Bayesian Approach
Straightforward to depart from standard Straightforward to depart from standard assumptions often built into classical assumptions often built into classical estimation methods (e.g. factor scores estimation methods (e.g. factor scores multivariate normal & independent over multivariate normal & independent over subjects)subjects)
Advantage in generalizations such as Advantage in generalizations such as nonlinear factor effects, multiplicative factor nonlinear factor effects, multiplicative factor schemesschemes
Advantages of Bayesian Approach Advantages of Bayesian Approach (continued)(continued)
Random effect models (of which Random effect models (of which factor/SEM models are subclass) can be factor/SEM models are subclass) can be fitted without relying on numerical methods fitted without relying on numerical methods to integrate out random effectsto integrate out random effects
Potential for Bayesian model choice Potential for Bayesian model choice procedures (e.g. stochastic search procedures (e.g. stochastic search variable selection) in factor/SEM modelsvariable selection) in factor/SEM models
Disadvantages of Bayesian ApproachDisadvantages of Bayesian Approach
Identification issues (re “naming” of Identification issues (re “naming” of factors): can have label switching for latent factors): can have label switching for latent constructs during MCMC updating if there constructs during MCMC updating if there aren’t constraints to ensure consistent aren’t constraints to ensure consistent labelling.labelling.
Slow convergence of model parameters or Slow convergence of model parameters or global model fit measures (e.g. DIC and global model fit measures (e.g. DIC and effective parameter estimate) in large effective parameter estimate) in large latent variable applications (e.g. 1000 or latent variable applications (e.g. 1000 or 10000 subjects)10000 subjects)
Disadvantages of Bayesian ApproachDisadvantages of Bayesian Approach
Formal Bayes model assessment Formal Bayes model assessment (marginal likelihoods/Bayes factors) (marginal likelihoods/Bayes factors) difficult for large realistic applicationsdifficult for large realistic applications
Sensitivity to priors on hyperparameters Sensitivity to priors on hyperparameters (e.g. priors for factor covariance matrix)(e.g. priors for factor covariance matrix)
Bayesian approach may need sensible Bayesian approach may need sensible priors when applied to factor models priors when applied to factor models (“diffuseness“ not necessarily suitable)(“diffuseness“ not necessarily suitable)
Bayesian ComputingBayesian Computing Many Bayesian applications to SEM and Many Bayesian applications to SEM and
factor analysis facilitated by WINBUGS factor analysis facilitated by WINBUGS package. package.
See Congdon (Applied Bayesian See Congdon (Applied Bayesian Modelling, 2003); Lee (Structural Equation Modelling, 2003); Lee (Structural Equation Modeling: a Bayesian Approach, 2007)Modeling: a Bayesian Approach, 2007)
Alternative is R…more programming Alternative is R…more programming involvedinvolved
BayesX can’t model common factorsBayesX can’t model common factors
WINBUGSWINBUGS
Despite acronym, WINBUGS employs Despite acronym, WINBUGS employs Metropolis-Hastings updating where Metropolis-Hastings updating where necessary as well as Gibbs samplingnecessary as well as Gibbs sampling
Program code is essentially a description Program code is essentially a description of the priors & likelihood, but can monitor of the priors & likelihood, but can monitor model-related quantities of interestmodel-related quantities of interest
Computing Illustration: a Normal Computing Illustration: a Normal SEMSEM
Wheaton Study: 3 latent variables, each Wheaton Study: 3 latent variables, each measured by two indicators. Alienation67 measured by two indicators. Alienation67 measured by anomia67 (1967 anomia scale) measured by anomia67 (1967 anomia scale) and powles67 (1967 powerlessness scale). and powles67 (1967 powerlessness scale).
Alienation71 is measured in same way, but Alienation71 is measured in same way, but using 1971 scales. using 1971 scales.
Third latent variable, SES (socio-economic Third latent variable, SES (socio-economic status) measured by years of schooling and status) measured by years of schooling and Duncan's Socioeconomic Index, both in 1967. Duncan's Socioeconomic Index, both in 1967.
Structural model relates alienation in 1971 Structural model relates alienation in 1971 (F(F22) to alienation in 1967 (F) to alienation in 1967 (F11) and SES (G)) and SES (G)
FF2i2i = βF = βF1i1i + + 22GGii+u+u2i2i
FF1i1i = = GGii + u + u1i1i
Measurement model for alienationMeasurement model for alienation
yyjiji==j j ++jjFF1i1i j=1,2j=1,2
yyjiji==j j ++jjFF2i2i j=3,4j=3,4
Measurement model for SES Measurement model for SES
xxjiji==j j ++jjGGii j=1,2j=1,2
WINBUGS for Wheaton studyWINBUGS for Wheaton studymodel { for (i in 1:n) { # structural model
F2[i] ~ dnorm(mu.F2[i],1);
mu.F2[i] <- beta* F1[i]+gam[2]*G[i]
F1[i] ~ dnorm(mu.F1[i],1);
mu.F1[i] <- gam[1]*G[i]}
# priors (normal uses inverse variance)
for (j in 1:2) {gam[j] ~ dnorm(0,0.001)}
beta ~ dnorm(0,0.001)
# measurement equations for alienation for (i in 1:n) { for (j in 1:4) { y[i,j] ~ dnorm(mu[i,j],tau[j])} mu[i,1] <- alph[1]+lam[1]* F1[i]; mu[i,2] <- alph[2]+lam[2]* F1[i] mu[i,3] <- alph[3]+lam[3]* F2[i]; mu[i,4] <- alph[4]+lam[4]* F2[i]}
# PRIORSfor (j in 1:4){ alph[j] ~ dnorm(0,0.001);
# gamma prior on precisions tau[j] ~ dgamma(1,0.001)# alternative prior starts with s.d. of residuals# sd.y[j] ~ dunif(0,100); tau[j] <- 1/(sd.y[j]*sd.y[j])
# identifiability constraint on loadings to ensure # positive alienation measure lam[j] ~ dnorm(1,1) I(0,)}
# measurement of SES (G[i])
for (i in 1:n) { G[i] ~ dnorm(0,1)
for (j in 1:2) { x[i,j] ~ dnorm(mu.x[i,j],tau.x[j])}
mu.x[i,1] <- del[1]+kappa[1]* G[i];
mu.x[i,2] <- del[2]+kappa[2]* G[i]}
for (j in 1:2) {del[j] ~ dnorm(0,0.001);
# gamma prior on precisions
tau.x[j] ~ dgamma(1,0.001)
# identifying constraint ensures +ve SES scale
kappa[j] ~ dnorm(1,1) I(0,)}}
Monitoring model related quantitiesMonitoring model related quantities
Suppose one were interested in posterior Suppose one were interested in posterior probs that Fprobs that F2i2i > F > F1i1i (alienation increasing (alienation increasing
for ifor ith th subject)subject) Add codeAdd code
for (i in 1:n) {delF[i] <- step(F2[i]-F1[i])}for (i in 1:n) {delF[i] <- step(F2[i]-F1[i])} Then posterior means of delF provide Then posterior means of delF provide
required probabilitiesrequired probabilities
Widening Applications of Latent Widening Applications of Latent Variable Methods Variable Methods
In particular: application contexts of Bayes In particular: application contexts of Bayes SEM/factor models now include ecological SEM/factor models now include ecological (area level) studies of health variations. (area level) studies of health variations.
Usually no longer valid to assume units Usually no longer valid to assume units (i.e. areas) are independent. (i.e. areas) are independent.
Instead spatial correlation in latent Instead spatial correlation in latent variable(s) (common spatial factors) over variable(s) (common spatial factors) over the areas should be consideredthe areas should be considered
Multi-Level Latent Variable ModelsMulti-Level Latent Variable Models
Latent variable methods also more widely Latent variable methods also more widely applied in multilevel health studies applied in multilevel health studies
Such models consider joint impact of Such models consider joint impact of individual levelindividual level and and area levelarea level risk factors risk factors on health statuson health status
With several outcomes (data both With several outcomes (data both multivariate & multilevel) can model area multivariate & multilevel) can model area effects using common factor(s)effects using common factor(s)
SOME SPATIAL PRIORS: SOME SPATIAL PRIORS: THE BASIS FOR COMMON THE BASIS FOR COMMON
SPATIAL FACTORSSPATIAL FACTORS
Priors incorporating spatial structure: Priors incorporating spatial structure: basis for common spatial factorsbasis for common spatial factors
May be specified over continuous space May be specified over continuous space (geostatistical models often used for (geostatistical models often used for “kriging”)“kriging”)
OR for discrete sets of areas with irregular OR for discrete sets of areas with irregular boundaries (“lattices” or “polygons”)boundaries (“lattices” or “polygons”)
Major classes: Major classes: Simultaneous Autoregressive (SAR) or Simultaneous Autoregressive (SAR) or
Conditional Autoregressive (CAR) priorsConditional Autoregressive (CAR) priors
Spatial PriorsSpatial Priors
My focus: CAR priors for “lattices” (e.g. My focus: CAR priors for “lattices” (e.g. administrative areas)administrative areas)
These are priors for “structured” effects These are priors for “structured” effects (where labels of area units are important) (where labels of area units are important) as opposed to unstructured effects as opposed to unstructured effects (unaffected or exchangeable over different (unaffected or exchangeable over different labelling scheme for areas)labelling scheme for areas)
Substantive BasisSubstantive Basis Generally taken to represent Generally taken to represent
unmeasured area level risk factors for unmeasured area level risk factors for health that vary relatively smoothly health that vary relatively smoothly over space (regardless of arbitrary over space (regardless of arbitrary administrative boundaries that may administrative boundaries that may define units of analysis)define units of analysis)
Substantive grounding: increased Substantive grounding: increased recognition of genuine spatial effects recognition of genuine spatial effects on health (“contextual” effects)on health (“contextual” effects)
DIFFERENT TYPES OF DIFFERENT TYPES OF COMMON SPATIAL FACTORCOMMON SPATIAL FACTOR
(A) Manifest health variables(A) Manifest health variables
Manifest variables are health outcomes Manifest variables are health outcomes yyij ij (areas i, variable j)(areas i, variable j)
Common residual factor sCommon residual factor sii, expresses , expresses
spatial clustering recurring over several spatial clustering recurring over several outcomes joutcomes j
Interpretable as index of common Interpretable as index of common health risks over outcomeshealth risks over outcomes
Example: Wang & Wall 2003Example: Wang & Wall 2003
(B) Census Indicator Confirmatory (B) Census Indicator Confirmatory Model.Model.
Common Spatial Socioeconomic Factor or Common Spatial Socioeconomic Factor or Factors (deprivation, rurality, etc) based Factors (deprivation, rurality, etc) based on relevant indicators Zon relevant indicators Zikik (k=1,..,K) such as (k=1,..,K) such as
unemployment, low income etc. unemployment, low income etc. Often census indicators form bulk of Often census indicators form bulk of
manifest variablesmanifest variables Example: Hogan & Tchernis JASA 2004Example: Hogan & Tchernis JASA 2004
(C) Two Classes of Manifest (C) Two Classes of Manifest VariableVariable
Common factor(s) used to explain Common factor(s) used to explain variations in observed Y variables (health variations in observed Y variables (health outcomes). outcomes).
But factors mainly measured by But factors mainly measured by socioeconomic indicators Z (e.g. census socioeconomic indicators Z (e.g. census data)data)
Example: my Eastern region suicide studyExample: my Eastern region suicide study Partly confirmatory, partly exploratoryPartly confirmatory, partly exploratory
MANIFEST VARIABLES: MANIFEST VARIABLES: AREA HEALTH VARIABLESAREA HEALTH VARIABLES
(A) Shared Spatial Residual Effects(A) Shared Spatial Residual Effects Unobserved area effects common to Unobserved area effects common to
several health outcomes modelled by several health outcomes modelled by shared spatial effectshared spatial effect
Typical scenario: area counts yTypical scenario: area counts y ijij for for
areas i and outcomes j. Poisson or areas i and outcomes j. Poisson or binomial likelihoodbinomial likelihood
Types of EventTypes of Event
May be deaths, hospitalizations, incidence May be deaths, hospitalizations, incidence counts for different cancer types, counts for different cancer types, prevalence counts, etc prevalence counts, etc
Expected events (offset) EExpected events (offset) Eijij based on based on
standard age rates applied to area standard age rates applied to area populations: ypopulations: yijij ~ Poisson(E ~ Poisson(Eijijijij) )
Can also have populations at risk: yCan also have populations at risk: y ijij ~ ~
Poisson(NPoisson(Niiijij) or y) or yijij ~ Bin(N ~ Bin(Nii,,ijij) )
Multivariate Spatial EffectsMultivariate Spatial Effects One option for such data: no reductionOne option for such data: no reduction Multivariate residual effectsMultivariate residual effects
log(log(ijij)=)=jj+s+sijij
(or log((or log(ijij)=)=jj++jjxxii+s+sijij))
For sFor sijij could use multivariate version of could use multivariate version of
conditional autoregressive prior conditional autoregressive prior
Multivariate Spatial EffectsMultivariate Spatial Effects
Multivariate normal CAR Prior is example Multivariate normal CAR Prior is example of Markov Random Field (Rue & Held, of Markov Random Field (Rue & Held, 2005). 2005).
Easily applied in WINBUGS using mv.car Easily applied in WINBUGS using mv.car prior.prior.
May fit well but proliferation of parameters May fit well but proliferation of parameters (more parameters than data points)(more parameters than data points)
Alternative : common spatial factorAlternative : common spatial factor
log(log(ijij)=)=jj++jjssii
Parsimonious and provides interpretable Parsimonious and provides interpretable summary measure of health risksummary measure of health risk
ssii is univariate CAR (or some other prior is univariate CAR (or some other prior
with spatial dependence) with spatial dependence) Correlation between outcomes within areas Correlation between outcomes within areas
modelled via loadings modelled via loadings jj. .
Identification: Location & ScaleIdentification: Location & Scale
Need Need iissii=0 for location identification. =0 for location identification.
Centre effects at each MCMC iteration.Centre effects at each MCMC iteration. Scale identifiability:
EITHER set var(s)=1 and all j are free loadings (fixed scale),
OR leave var(s) unknown and constrain a loading, e.g. 1=1.0 (anchoring constraint)
Labelling Problems in Repeated Labelling Problems in Repeated SamplingSampling
Even in simple model, labelling may be an issue.
Consider fixed variance identification option, var(s)=1, loadings all unknown.
Suppose diffuse priors are taken on loadings in
log(log(ijij)=)=jj++jjssii
without directional constraint.
Labelling Problems (continued)Labelling Problems (continued)
Then can have:
a) j all positive combined with ssii acting as acting as
positive measure of health risk (higher spositive measure of health risk (higher s ii in in
areas with higher cancer rates) areas with higher cancer rates)
OR OR
b) b) j all negative combined with s all negative combined with sii acting acting
as negative measure of health risk (sas negative measure of health risk (s ii
higher in areas with lower cancer rates)higher in areas with lower cancer rates)
Identifying constraints for Identifying constraints for consistent labellingconsistent labelling
For unambiguous labelling advisable to constrain one or more j to be positive (e.g. truncated normal or gamma prior)
Note that anchoring constraint with var(s) unknown, and preset loading (e.g. 1=1.0), may be intrinsically better identified – steers remaining unknown coefficients to consistent labelling
Loadings and LabellingsLoadings and Labellings May not be sufficient just to rely on May not be sufficient just to rely on
constraining one loading (e.g. assume +ve) constraining one loading (e.g. assume +ve) to ensure consistent labellingto ensure consistent labelling
Sometimes said that constraining direction Sometimes said that constraining direction on one loading ensures consistent on one loading ensures consistent identification…identification…
What if indicator chosen for constrained What if indicator chosen for constrained loading (e.g. loading (e.g. iiii> 0) is poor measure for > 0) is poor measure for
constructconstruct
Loadings and LabellingsLoadings and Labellings
If twenty indicators are measuring a If twenty indicators are measuring a construct, the 19 unconstrained loadings construct, the 19 unconstrained loadings may “fit” a different label (e.g. deprivation) may “fit” a different label (e.g. deprivation) to that implied by the remaining to that implied by the remaining constrained loading (e.g. affluence)constrained loading (e.g. affluence)
Personal View: Much depends on suitable Personal View: Much depends on suitable selection of manifest indicators and which selection of manifest indicators and which (and how many, maybe >1 ) are chosen to (and how many, maybe >1 ) are chosen to have constrained loadingshave constrained loadings
WINBUGS Code for manifest WINBUGS Code for manifest variable scenario Avariable scenario A
Extensions of Spatial Common FactorsExtensions of Spatial Common Factors Product schemes. Consider health Product schemes. Consider health
outcomes arranged by area i and age x. outcomes arranged by area i and age x. Populations at risk NPopulations at risk N ixix
yyixix ~ Poisson(N ~ Poisson(Nixixixix))
log(log(ixix)=)=xx++xxssii
x x show which age groups are most show which age groups are most sensitive to spatial variations in risk sensitive to spatial variations in risk represented by srepresented by sii
Variation on Lee-Carter (JASA 1992) Variation on Lee-Carter (JASA 1992) mortality forecasting modelmortality forecasting model
Random Effect LoadingsRandom Effect Loadings x x potentially random, rather than potentially random, rather than
fixed effects. fixed effects. Identified using sum to 1 or averaging Identified using sum to 1 or averaging
to 1 constraint, e.g. to 1 constraint, e.g. x x multinomial, or multinomial, or
xx~Gamma(h,h)~Gamma(h,h)
Nonlinear effects of common factorNonlinear effects of common factor One possibility: just take powers of sOne possibility: just take powers of s ii, ,
e.g.e.g. log(log(ijij)=)=jj++jjssii++jjss22
ii
Or: spline for nonlinear effects in Or: spline for nonlinear effects in common factor score scommon factor score sii. .
e.g. under fixed variance var(s)=1 e.g. under fixed variance var(s)=1 option, locate knots option, locate knots kk at selected at selected quantiles on cumulative standard quantiles on cumulative standard normal.normal.
Linear SplineLinear Spline
Then linear splineThen linear spline
log(log(ijij)=)=jj++jjssii++kkbbjkjk(s(sii- - kk))++
bbjkjk might be random effects, but might be random effects, but
raises identification issues…?raises identification issues…?
INDICATOR BASED INDICATOR BASED SPATIAL CONSTRUCTSSPATIAL CONSTRUCTS
(B) Indicator Based Spatial Constructs(B) Indicator Based Spatial Constructs
Many studies use latent constructs to Many studies use latent constructs to analyze population health variations. analyze population health variations.
Such constructs (e.g. deprivation) not Such constructs (e.g. deprivation) not directly observeddirectly observed
Instead derived from a collection of relevant Instead derived from a collection of relevant indicator variables that are observed, using indicator variables that are observed, using multivariate techniques or other “composite multivariate techniques or other “composite variable” methodsvariable” methods
Many health outcomes show “deprivation Many health outcomes show “deprivation gradient”gradient”
Latent Constructs in Population Latent Constructs in Population HealthHealth
Example: Townsend deprivation score Example: Townsend deprivation score based on summing standardized census based on summing standardized census area values for 4 input variables (sum of “z area values for 4 input variables (sum of “z scores”)scores”)
% unemployed, % with no car, % % unemployed, % with no car, % households overcrowded, % households not households overcrowded, % households not owner occupiers owner occupiers
Other area constructsOther area constructs
Other examples of latent constructs Other examples of latent constructs relevant to area health variations: relevant to area health variations: rurality/urbanicity, social fragmentationrurality/urbanicity, social fragmentation
Social fragmentation scores used to Social fragmentation scores used to analyze variations in area suicide rates analyze variations in area suicide rates and psychiatric hospitalization ratesand psychiatric hospitalization rates
Confirmatory Indicator Based ModelConfirmatory Indicator Based Model Confirmatory model: indicators k=1,..,K are Confirmatory model: indicators k=1,..,K are
established proxies for latent constructestablished proxies for latent construct e.g. area unemployment rates, welfare e.g. area unemployment rates, welfare
recipients, social housing rates as recipients, social housing rates as indicators of area deprivationindicators of area deprivation
Census rates rCensus rates rikik=z=zikik/D/Dikik where z where zikik are counts are counts
(e.g. unemployed), D(e.g. unemployed), Dikik are relevant are relevant
denominators (e.g. econ active denominators (e.g. econ active populations). populations).
One option for confirmatory modelOne option for confirmatory model
Use Gaussian approximation to binomial Use Gaussian approximation to binomial (Hogan & Tchernis JASA 2004) with (Hogan & Tchernis JASA 2004) with variance stabilizing transformation: Rvariance stabilizing transformation: Rikik==rrikik, ,
var(Rvar(Rikik)=)=kk/D/Dikik. .
→ → normal measurement equationsnormal measurement equations
RRikik ~N( ~N( kkkkFFii, , kk/D/Dik)ik)
where Fwhere Fii scores follow spatial CAR prior scores follow spatial CAR prior
Or use relevant Exponential Family links in Or use relevant Exponential Family links in deriving common spatial factorderiving common spatial factor
P(zP(zikik||ikik) = exp([z) = exp([zikikikik-b(-b(ikik)]/)]/+c(z+c(zikik, , ))))
e.g. ze.g. zikik binomial with populations N binomial with populations Nii, z, zikik ~ ~
Bin(NBin(Nii,,ikik))
Logit link, plus overdispersion effects wLogit link, plus overdispersion effects w ikik
logit(logit(ikik)= )= kkkkFFii+w+wikik
wwik ik : normal and uncorrelated over : normal and uncorrelated over
indicators k.indicators k.
For other indicators transform to For other indicators transform to normalitynormality
For intrinsic proportions (e.g. proportion of For intrinsic proportions (e.g. proportion of area that is green space as indicator of area that is green space as indicator of rurality) take logit transform to rurality) take logit transform to approximate normalityapproximate normality
for population density take log transformfor population density take log transform etcetc
TWO CLASSES OF MANIFEST TWO CLASSES OF MANIFEST VARIABLEVARIABLE
(C) Spatial Factors in Model with 2 (C) Spatial Factors in Model with 2 classes of manifest variableclasses of manifest variable
Health Outcomes YHealth Outcomes Yijij (j=1,…,J); e.g. (j=1,…,J); e.g.
mortality or incidence countsmortality or incidence counts Social Indicators ZSocial Indicators Zikik (k=1,..k); e.g. census (k=1,..k); e.g. census
rates of unemploymentrates of unemployment Typical Scenario: multiple common spatial Typical Scenario: multiple common spatial
factors (Ffactors (F1i1i,..,F,..,FQiQi) primarily measured by Z ) primarily measured by Z
variables (indicators established as variables (indicators established as relevant). relevant).
2 class model2 class model
But Factors F also act to potentially But Factors F also act to potentially explain area variations in health outcomes explain area variations in health outcomes Y. Y.
Z to F links confirmatory, Y to F links Z to F links confirmatory, Y to F links exploratoryexploratory
ExampleExample Four Poisson health outcomes Y1-Y4, Eight Four Poisson health outcomes Y1-Y4, Eight
indicators: Z1-Z4 measure F1; Z5-Z8 measure indicators: Z1-Z4 measure F1; Z5-Z8 measure F2 ; both F1 and F2 F2 ; both F1 and F2 maymay explain Y explain Y
YYijij ~ Po(E ~ Po(Eijijijij))
log(log(ijij)=)=jj++j1j1FF1i1i++j2j2FF2i2i
ZZikik ~ EF( ~ EF(ikik))
g(g(i1i1)= )= 11FF1i1i+w+wi1i1
………… g(g(i5i5)= )= 55FF2i2i+w+wi5i5
………………
MODEL CHOICEMODEL CHOICE
Formal Choice or NotFormal Choice or Not Formal Bayes model criteria (e.g. marginal Formal Bayes model criteria (e.g. marginal
likelihood/Bayes factor) difficult to derive; likelihood/Bayes factor) difficult to derive; also change with priorsalso change with priors
Popular alternative (AIC analogue): Popular alternative (AIC analogue): Deviance Information Criterion (DIC). Deviance Information Criterion (DIC).
Average deviance Dev.bar + effective Average deviance Dev.bar + effective parameter count dparameter count dee
DIC=Dev.bar+ dDIC=Dev.bar+ dee
Model Fit in Realistic ApplicationsModel Fit in Realistic Applications
Multilevel applications to health survey Multilevel applications to health survey data may involve thousands of subjects data may involve thousands of subjects (e.g. HSE study). (e.g. HSE study).
Ecological applications may involve Ecological applications may involve hundreds of small areas (Eastern region hundreds of small areas (Eastern region suicide study)suicide study)
Model Fit in Realistic ApplicationsModel Fit in Realistic Applications
Convergence of DIC and dConvergence of DIC and dee typically slow typically slow
in models with many random effects (such in models with many random effects (such as factor scores)as factor scores)
Slow convergence also applies to other Slow convergence also applies to other measures of fit, e.g. Monte Carlo measures of fit, e.g. Monte Carlo estimates of conditional predictive estimates of conditional predictive ordinatesordinates
Model selection alternatives…Model selection alternatives…
Model Choice using Variable Selection Model Choice using Variable Selection
Model selection potentially for both Model selection potentially for both loadings and factor variance/covariance loadings and factor variance/covariance structure.structure.
Don’t necessarily apply selection for all Don’t necessarily apply selection for all elements in any particular application (e.g. elements in any particular application (e.g. depending whether exploratory or depending whether exploratory or confirmatory)confirmatory)
Apply to selected aspects of spatial SEM Apply to selected aspects of spatial SEM models, e.g. loadings only or correlations models, e.g. loadings only or correlations between factors onlybetween factors only
Selection in 2 manifest variable Selection in 2 manifest variable SEMSEM
Spatial factor models with 2 types of Spatial factor models with 2 types of manifest variable (health outcomes Ymanifest variable (health outcomes Yjj + +
socioeconomic indices Zsocioeconomic indices Zkk))
Apply selection to loadings Apply selection to loadings jqjq linking Y linking Yjj to to
FFq q (exploratory part of model)(exploratory part of model)
But don’t apply selection to Z on F But don’t apply selection to Z on F loadings (confirmatory sub-model based loadings (confirmatory sub-model based on extensive prior knowledge)on extensive prior knowledge)
Mixture Priors for Selecting LoadingsMixture Priors for Selecting Loadings
Random Effects SelectionRandom Effects Selection
Selection procedures for random effects Selection procedures for random effects and/or their variance/covariance structureand/or their variance/covariance structure
e.g. Cai and Dunson (2008), Tüchler & e.g. Cai and Dunson (2008), Tüchler & Frühwirth-Schnatter (2008)Frühwirth-Schnatter (2008)
These extend to factor and SEM models These extend to factor and SEM models as factors are shared random effectsas factors are shared random effects
RE Selection: Multivariate Spatial RE Selection: Multivariate Spatial PriorPrior
Q>1 for shared common spatial Q>1 for shared common spatial factorsfactors
Within area covariance matrix in Within area covariance matrix in MCAR prior denoted MCAR prior denoted FF
Cholesky Decomposition of Covariance Matrix Cholesky Decomposition of Covariance Matrix FF
Selection on variances and/or covariancesSelection on variances and/or covariances
Suppose investigator sure about number Suppose investigator sure about number of factors (confirmatory model based on of factors (confirmatory model based on substantial evidence) substantial evidence)
BUT not sure whether correlations BUT not sure whether correlations between factors are needed between factors are needed
Selection can be applied to relevant Selection can be applied to relevant parameters in decomposition of parameters in decomposition of FF →→ mixture prior selection on mixture prior selection on qrqr parameters parameters to decide whether correlations needed to decide whether correlations needed
CASE STUDIESCASE STUDIES
Social capital and mental health, Social capital and mental health, multilevel model using Health Survey multilevel model using Health Survey for England (HSE)for England (HSE)
Multilevel model, joint prevalence of Multilevel model, joint prevalence of obesity & diabetes, BRFSS subjects obesity & diabetes, BRFSS subjects nested within US counties & statesnested within US counties & states
Suicide & self-harm, ecological (area) Suicide & self-harm, ecological (area) study for wards in Eastern Englandstudy for wards in Eastern England
Case Study 1, Mental Health & Social Case Study 1, Mental Health & Social Capital, Health Survey for EnglandCapital, Health Survey for England
Y is observed mental health status (binary). Y is observed mental health status (binary). Y=1 if subject’s GHQ12 score is 4 or more, Y = 0 otherwise.
Pr(Y=1) related to known socioeconomic risk Pr(Y=1) related to known socioeconomic risk factors X at individual subject levelfactors X at individual subject level
Pr(Y=1) also related to known indicators of Pr(Y=1) also related to known indicators of geographic geographic contextcontext, G (e.g. micro-area , G (e.g. micro-area deprivation quintile, region of residence, urban-deprivation quintile, region of residence, urban-rural residence). Micro-areas (32K in England) rural residence). Micro-areas (32K in England) called Super Output Areas called Super Output Areas
Latent RisksLatent Risks
Finally Pr(Y=1) also related to latent Finally Pr(Y=1) also related to latent subject level risks, {Fsubject level risks, {F1i1i,F,F2i2i,...,F,...,FQiQi}}
Examples: social capital, perceived stress. Examples: social capital, perceived stress. Structural model: Y~f(Y|X,G,F,Structural model: Y~f(Y|X,G,F,))
Health Outcome Sub-ModelHealth Outcome Sub-Model Regression involves 9065 adult subjects.
Yi~ Bin(1,i) .
Use log-link (→relative risk interpretation). Q=1 for single latent risk factor (social
capital) log(i)=βXi+γGi+Fi =β₀+β1,gend[i]
+β2,age[i]+β3,eth[i]+β4,oph[i]+β5,own[i]
+β6,noqual[i]+1,reg[i]+2,dep[i]+3,urb[i]+Fi
Multiple Indicators for Social Multiple Indicators for Social CapitalCapital
Social capital measured by a battery of K Social capital measured by a battery of K survey `items' (e.g. questions about survey `items' (e.g. questions about neighbourhood perceptions, organisational neighbourhood perceptions, organisational memberships etc), {Z₁,...,Zmemberships etc), {Z₁,...,ZKK}}
Z~g(Z|F,Z~g(Z|F,) ) e.g. with binary questions, link probability e.g. with binary questions, link probability
of positive response of positive response kk=Pr(Z=Pr(Zkk=1) to latent =1) to latent construct via construct via
logit(logit(kk)=)=kk++kkFF
Indicators of Social CapitalIndicators of Social Capital Social Support Score (Z1)Social Support Score (Z1) 5 binary items (Z2-Z6) relate to
neighbourhood perceptions (e.g. can people be trusted?; do people try to be helpful?; this area is a place I enjoy living in; etc)
Final item (Z7) relates to membership of organisations or groups.
Multiple Causes of Social CapitalMultiple Causes of Social Capital Social capital varies by demographic Social capital varies by demographic
groups and geographic context (urban groups and geographic context (urban status, region, small area deprivation status, region, small area deprivation category, etc). category, etc).
So have multiple causes of F as well as So have multiple causes of F as well as multiple indicatorsmultiple indicators
F ~ h(F|X*,G*, F ~ h(F|X*,G*, φ)φ) X* and G* are individual and contextual X* and G* are individual and contextual
variables relevant to “causing” social variables relevant to “causing” social capital variationscapital variations
Multiple Cause Sub-ModelMultiple Cause Sub-Model
FFii~N(μ~N(μii,1) μ,1) μii=φ=φ1,gend[i]1,gend[i]+φ+φ2,eth[i]2,eth[i]+φ+φ3,noqual[i]3,noqual[i]
+φ+φ4,urb[i]4,urb[i]+φ+φ5,reg[i]5,reg[i]
+φ+φ6,dep[i]6,dep[i]..
φ: fixed effects parameters with reference φ: fixed effects parameters with reference category (zero coeff) for identificationcategory (zero coeff) for identification
Only small number of regions in HSE Only small number of regions in HSE If had finer spatial detail could take area φ If had finer spatial detail could take area φ
effects spatially random (but weak effects spatially random (but weak identification…?)identification…?)
Effect of F on YEffect of F on Y
Social capital has significant effect in reducing Social capital has significant effect in reducing the chances of psychiatric caseness. the chances of psychiatric caseness.
The effect of social capital apparent in relative The effect of social capital apparent in relative risk 0.35 of psychiatric morbidity for high capital risk 0.35 of psychiatric morbidity for high capital individuals (with score F=+1) as compared to individuals (with score F=+1) as compared to low capital individuals (with F=-1). low capital individuals (with F=-1).
Obtained as exp(-0.525)/exp(0.525)Obtained as exp(-0.525)/exp(0.525) = -0.525 is coefficient for social capital effect.= -0.525 is coefficient for social capital effect.
Geographic Context: Micro-area Deprivation Gradient Geographic Context: Micro-area Deprivation Gradient from Multiple Cause Modelfrom Multiple Cause Model
Case Study 2: Diabetes & Obesity Case Study 2: Diabetes & Obesity in USin US
Data from 2007 Behavioral Risk Factor Data from 2007 Behavioral Risk Factor Surveillance System (BRFSS)Surveillance System (BRFSS)
Multinomial outcome (J=6 categories) Multinomial outcome (J=6 categories) defined by diabetic status and weight defined by diabetic status and weight category (obese, overweight, normal). category (obese, overweight, normal).
Multinomial CategoriesMultinomial Categories
Reference category are subjects with Reference category are subjects with neither condition. All other categories are neither condition. All other categories are “ill” relative to reference category“ill” relative to reference category
Multilevel multicategory regressionMultilevel multicategory regression
Regression includes:Regression includes:o subject level risk factors (age, ethnicity, subject level risk factors (age, ethnicity,
gender, education), gender, education), o known geographic effects (e.g. county known geographic effects (e.g. county
poverty), poverty), o county and state random effects to model county and state random effects to model
unknown geographic influences (e.g. unknown geographic influences (e.g. unknown environmental exposures).unknown environmental exposures).
Regression & LikelihoodRegression & Likelihood
Model FormModel Form
Model includes Model includes knownknown subject risk factors subject risk factors and contextual variables (e.g. county and contextual variables (e.g. county poverty)poverty)
UnknownUnknown contextual risks: assume county contextual risks: assume county and state latent effects, shared over and state latent effects, shared over categories j=1,..,J-1. categories j=1,..,J-1.
Illustrates nested latent spatial effectsIllustrates nested latent spatial effects
County & State EffectsCounty & State Effects
Take county effects vTake county effects vcc (c=1,..,3142) to be (c=1,..,3142) to be
spatially correlated CARspatially correlated CAR But uBut uss (state effects, s=1,..,51) taken to be (state effects, s=1,..,51) taken to be
unstructured. unstructured. Avoids confounding of two spatially Avoids confounding of two spatially
structured effectsstructured effects
Regression Terms for j=1,..J-1Regression Terms for j=1,..J-1
Case Study 3, Suicide & Self Harm: Case Study 3, Suicide & Self Harm: Eastern Region Wards in EnglandEastern Region Wards in England
Two classes of manifest variablesTwo classes of manifest variables YY11-Y-Y44: suicide totals in small areas: suicide totals in small areas
ZZ11-Z-Z1414: Fourteen small area social : Fourteen small area social
indicators indicators Q=3 latent constructs (FQ=3 latent constructs (F11 fragmentation, F fragmentation, F22
deprivation, Fdeprivation, F33 urbanicity). Converse of F urbanicity). Converse of F33
is “rurality”. Common spatial factors. is “rurality”. Common spatial factors.
Local Authority Map: Eastern EnglandLocal Authority Map: Eastern England
Geographic FrameworkGeographic Framework
N=1118 small areas (called wards, N=1118 small areas (called wards, subdivisions of local authorities). subdivisions of local authorities).
Small area focus beneficial: people with Small area focus beneficial: people with similar socio-demographic characteristics similar socio-demographic characteristics tend to cluster in relatively small areas, so tend to cluster in relatively small areas, so greater homogeneity in risk factors related greater homogeneity in risk factors related to social statusto social status
On other hand, health events may be On other hand, health events may be rare…rare…
Confirmatory Sub-ModelConfirmatory Sub-Model
Confirmatory Z-on-F modelConfirmatory Z-on-F model Each indicator ZEach indicator Zkk loads only on one loads only on one
construct Fconstruct Fqq..
Most indicators binomial. A few taken as Most indicators binomial. A few taken as normal after transformation. Mostly 2001 normal after transformation. Mostly 2001 Census, a few non-census (service Census, a few non-census (service access score, proportion greenspace)access score, proportion greenspace)
Exponential Family Model for Exponential Family Model for modelling Z-on-F effectsmodelling Z-on-F effects
For indicator kFor indicator k1,..,14, G1,..,14, Gkk 1,2,3 denotes 1,2,3 denotes
which construct it loads on. which construct it loads on. Regression with link g allows for Regression with link g allows for
overdispersion via “unique” w effectsoverdispersion via “unique” w effects
g(g(ikik)= )= kkk,Gk,GkkF[GF[Gkk,i]+w,i]+wikik
Expected Direction of Expected Direction of Confirmatory Model LoadingsConfirmatory Model Loadings
Health Outcome Sub-Model (Y-on-Health Outcome Sub-Model (Y-on-F effects)F effects)
Model for Y-on-F effectsModel for Y-on-F effects
YYijij ~ Po(E ~ Po(Eijijijij) j=1,..,4) j=1,..,4
log(log(ijij)=)=jj++j1j1FF1i1i++j2j2FF2i2i++j3j3FF3i3i+u+uijij
Coefficient selection on Coefficient selection on jq jq using relatively using relatively
informative priors under “retain” option informative priors under “retain” option when Jwhen Jjqjq=1.=1. Using diffuse priors means null Using diffuse priors means null
model tends to be selectedmodel tends to be selected
Redundant CoefficientsRedundant Coefficients
Some coefficients (e.g. urbanicity on male Some coefficients (e.g. urbanicity on male and female suicide, deprivation on female and female suicide, deprivation on female suicide) not retained under model suicide) not retained under model selectionselection
Four coefficients in the Y-on-F model were Four coefficients in the Y-on-F model were set to zero in at least some MCMC set to zero in at least some MCMC iterations iterations → averaging over 2→ averaging over 244 Y-on-F Y-on-F modelsmodels
Future Directions in Spatial Factor Future Directions in Spatial Factor ModellingModelling
Extend model selection to interactions between Extend model selection to interactions between factors, nonlinear effects etcfactors, nonlinear effects etc
In England, model area socioeconomic structure In England, model area socioeconomic structure (and maybe some health outcomes) at (and maybe some health outcomes) at “neighbourhood” level (32000 “Super Output “neighbourhood” level (32000 “Super Output Areas” with mean population 1500).Areas” with mean population 1500).
In US, similar scope for modelling SES structure In US, similar scope for modelling SES structure in relation to health events for Zip Code in relation to health events for Zip Code Tabulation Areas or ZCTAs (around 31K across Tabulation Areas or ZCTAs (around 31K across US, on average about 10K population)US, on average about 10K population)
More generallyMore generally
Bayesian software options for latent Bayesian software options for latent variable and SEM applications more variable and SEM applications more widely availablewidely available
Potentialities of WINBUGS in this context Potentialities of WINBUGS in this context not always appreciatednot always appreciated
Scope for dedicated Bayesian factor Scope for dedicated Bayesian factor analysis packageanalysis package