44
BIO656--Multilevel Models 1 Term 4, 2006 PART 8 PART 8 Two Stage & Joint Models Two Stage & Joint Models

Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

  • View
    222

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 1Term 4, 2006

PART 8PART 8

Two Stage & Joint ModelsTwo Stage & Joint Models

Page 2: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 2Term 4, 2006

SEERMED DATA SEERMED DATA

End of Life Colorectal Cancer Costs

Motivation:

Expenditure$0 $500,000

Page 3: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 3Term 4, 2006

DataData

Patient – Physician

Death

Terminal-Phase Costs

12 mos

Cancer Diagnosis

Professional Health-Care Services

HMO Hospice FFS

Private Ins.Medicare

Medicare Payments

Claims

RejectedAllowed Co-PayDeductibles

Factors: Need-based Enabling Predisposing

Page 4: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 4Term 4, 2006

DataData

Patient – Physician

Death

Terminal-Phase Costs

12 mos

3 mos

Cancer Diagnosis

Medicare Payments

Page 5: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 5Term 4, 2006

SEERMED DATA SEERMED DATA

End of Life Colorectal Cancer Costs

Motivation:

Expenditure$0 $500,000

Page 6: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 6Term 4, 2006

A “Normal” DistributionA “Normal” Distribution

Den

sity

Y

Page 7: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 7Term 4, 2006

A Complex DistributionA Complex Distribution

Den

sity

Y

Page 8: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 8Term 4, 2006

Complex Distributions Complex Distributions Mixtures of Simple Distributions Mixtures of Simple Distributions

Den

sity

Y

Mixtures-of-Experts Models (MEM)

Finite Mixture Models (FMM)

McLachlan, Peel. (2001), FMMJacobs, Jordan. (1991), MEM, Neural Comp

Page 9: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 9Term 4, 2006

A simple, two-part mixtureA simple, two-part mixture

$0 $+ 1. P(Y>0)

E(Y+)

2. E(Y|Y>0)

Page 10: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 10Term 4, 2006

A Two-Part Model:A Two-Part Model:(Intensity & Size)(Intensity & Size)

IS – logit/lognormal

1. logit{ Pr(Yi>0) } = x

2. i.) log10(Yi+) = x + i

ii.) i ~ N(0,2)

0. “Tobit” model: Tobin (1958)1. Selection (hurdle) models: (Amemiya 1984; Heckman 1976) 2. Zero-inflated models (Lambert 1992; Green 1994)3. Two-part models (Manning 1981; Mullahy 1998)

Page 11: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 11Term 4, 2006

Another Two-Part Model:Another Two-Part Model:(Intensity & Size)(Intensity & Size)

IS – Probit/log-Gamma

1. -1{ Pr(Yi>0) } = x

2. i.) log10{E( Yi+)} = x

ii.) Yi+ ~ (,)

Page 12: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 12Term 4, 2006

A Two-Part Model:A Two-Part Model:The Intensity-Size GLMThe Intensity-Size GLM

h1 binary data link function

h2 continuous data link function

f exponential family w/ dispersion

IS – GLM

Page 13: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 13Term 4, 2006

0 +

Multiple Levels 1Multiple Levels 1

Page 14: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 14Term 4, 2006

MonthlyMonthly SEERMED Data SEERMED Data

Month 10

Month 11

Month 12

10

12

10

11

12

+

+

11 +

Page 15: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 15Term 4, 2006

HMREM1HMREM1

TimeX

f12

0 +

g2

g1

f10

0 +

g2

g1

f11

0 +

g2

g1

Month 12

Month 10

Month 11

a

a

b

b

Multiple Levels 2Multiple Levels 2

X

X

X

X

X

Page 16: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 16Term 4, 2006

1. Intensity: logit( i

) = x2. Size:

a) i = x

b) Yi+ ~ f ( i

, )

A 2-Part ModelA 2-Part Model

Page 17: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 17Term 4, 2006

1. Intensity: logit( i

c ) = x + zai

2. Size:

a) ic = x + zbi

b) Yi+c ~ f ( i

c, )

ui = ~ N , = ai 0 aa

bi 0 ba bb

A Longitudinal 2-Part ModelA Longitudinal 2-Part Model

1. Olsen, Schafer, (2001)

2. Tooze, Grunwald, Jones, (2002)

3. Yau, Lee, Ng, (2002)

3. Random Effects:

Page 18: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 18Term 4, 2006

Data Analysis: 3 General StepsData Analysis: 3 General Steps

1. Exploration

2. Model Fitting and Estimation

3. Diagnostics

and the greatest of these is…

Page 19: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 19Term 4, 2006

3 33

2

2

2

1 1

1

0

10 11 12

01

23

45

Month

log

10C

ost

1

Uncooked Spaghettis PlotUncooked Spaghetti Plot

Page 20: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 20Term 4, 2006

MonthlyMonthly SEERMED Data SEERMED Data

Month 10

Month 11

Month 12

10

12

10

11

12

+

+

11 +

Page 21: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 21Term 4, 2006

Figure 5: Seermed log10 month 1 & 2Figure 5: Seermed log10 month 1 & 2

Expenditure 10

Expenditure 11

Density

Month 10 & Month 11 log10(Costs)

0

5

0

5

Bivariate Point Mass

Univariate Continuous Distbs.

Bivariate Continuous

Distb.

Page 22: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 22Term 4, 20060 1 2 3 4 5

01

23

45

D2 0.36

Rho 0.56

OR 12.9

D1 0.77

13% 7%

10% 70%

SEERMED Costs: Months 10 & 11

log10y10 1

log

10y

11

1

PRISM plot: Month 10 & 11 SEERMED Costs

Paired

Response

Intensity

Size

Mixture

plot

aa

bb

ba

Page 23: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 23Term 4, 2006

x

Den

sity

logy10

0

0

0.36 0.56

12.9 0.77

0

0

0.04 0.34

8.65 0.54

13%7%

10% 70%

01

23

45

6

x

Den

sity

logy11

0

0

0.25 0.51

15.12 0.83

8%5%

15% 72%

0 1 2 3 4 5 6

9%4%

10% 77%

x

Den

sity

logy12

0 1 2 3 4 5 6

PRISM Matrix: Months 10-12

Page 24: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 24Term 4, 2006

SEERMED MREMSEERMED MREM

1. Intensity:

h1( ic ) = 0 + 1Obs + 2Male + 3Obs*Male + ai

2. Size:

a) h2( ic ) = 0 + 1Obs + 2Male + bi

b) Yi+c ~ f ( i

c, )

3. Random Effects:

Size: Lognormal, Gamma

Intensity: Probit, Logistic

ui = ~ N , = ai 0 a

bi 0 ba b

2

2

Page 25: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 25Term 4, 2006

EstimationEstimation

Li()

Likelihood:

Whoa.

• PQL, MCEM, MCMC, …

• Adaptive Quadrature – Newton-Raphson

Zeger, Karim (1991); Davidian, Giltinan, (1993); Pinheiro, Bates (1995);

Mcculloch (1997); Booth et al. (2001); Rabe-Hesketh, et al. (2004)

Non-Linear Mixed Model (NLMM)But:

Page 26: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 26Term 4, 2006

Estimation: SASEstimation: SAS

proc nlmixed data=SEERMED;

parms / data=parms_start;

*- 1) logistic: logit{Pr( Y>0 | a )} = Xalpha + a = “eta0” -*;

eta0 = alpha0_c + alpha1_c*obs + alpha2_c*male + alpha3_c*obsmale + a;

pi_c = exp(eta0) / (1+exp(eta0));

*- 2) log-normal: E( log(Y) | Y>0, b ) = XB + b = “eta1” -*;

eta1 = beta0_c + beta1_c*obs + beta2_c*male + b;

*- log-likelihood -*;

pi=CONSTANT('PI');

if y=0 then ll1 = 0;

else ll1=-.5*log(2*pi*sigma**2)-.5*((log10y-eta1)/sigma)**2;

ll = (1-Gpos)*log(1-pi_c) + Gpos*log(pi_c) + Gpos*(ll1);

model y ~ GENERAL(ll);

RANDOM a b ~ NORMAL([0,0],[tau_aa, tau_ba, tau_bb]) SUBJECT=id;

run;

Page 27: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 27Term 4, 2006

Estimation: SAS (better)Estimation: SAS (better) proc nlmixed data=sanfran qpoints=10;

parms / data=parms_start;

*-logit-*;

eta0 = alpha0_c + alpha1_c*obs + alpha2_c*male + alpha3_c*obsmale + a;

expeta = exp(eta0);

pi_c = expeta / (1+expeta);

tau_aa = exp(logtau_a)**2;

*-lognormal-*;

eta1 = beta0_c + beta1_c*obs + beta2_c*male + b;

phi = 10**(log10phi); *std dev of log10(Y+1)|b;

tau_bb = (10**(log10tau_b))**2;

*- RE Var -*;

rho_ba = (exp(2*zrho_ba) - 1) / (exp(2*zrho_ba) + 1);

tau_ba = rho_ba*(tau_aa*tau_bb)**.5;

*- log-likelihood -*;

pi=CONSTANT('PI');

if y=0 then ll1 = 0; else ll1=-.5*log(2*pi*phi**2)-.5*((log10y-eta1)/phi)**2;

ll = (1-Gpos)*log(1-pi_c) + Gpos*log(pi_c) + Gpos*(ll1);

model y ~ GENERAL(ll);

RANDOM a b ~ NORMAL([0,0],[tau_aa, tau_ba, tau_bb]) SUBJECT=id;

ods output ParameterEstimates = parms_new;

run;

Page 28: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 28Term 4, 2006

SEERMED MREM Results 1SEERMED MREM Results 1

Page 29: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 29Term 4, 2006

Profile ll (alpha3)Profile ll (alpha3)

Intensity model Obs*Male interaction term (3)

Sca

led

Pro

file

Like

lihoo

dMREM Profile Likelihood Plots for 3

Logit-Lognormal

Probit*-Lognormal

Probit*-Gamma

c

c

Logit-Gamma LR 6

Page 30: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 30Term 4, 2006

SEERMED MREM Results 2SEERMED MREM Results 2

Page 31: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 31Term 4, 20060 1 2 3 4 5

01

23

45

D2 0.36

Rho 0.56

OR 12.9

D1 0.77

13% 7%

10% 70%

SEERMED Costs: Months 10 & 11

log10y10 1

log

10y

11

1

PRISM plot: Month 10 & 11 SEERMED Costs

Paired

Response

Intensity

Size

Mixture

plot

aa

bb

ba

Page 32: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 32Term 4, 2006

SEERMED SEERMED MREMMREM Results 2 Results 2

But do these models fit?…

Page 33: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 33Term 4, 2006

02000

4000

6000

8000

10000

12000

Y10+

Y11+

Y12+

G10

G11

G12

L10

L11

L12

Y10+

Y11+

Y12+

G10

G11

G12

L10

L11

L12

10

11

12

P10

P11

P12

10

11

12

P10

P11

P12

10 11 12 10 11 12

0.0

0.2

0.4

0.6

0.8

1.0Female Male

Inte

nsity:

Pr(

Y>

0)

Siz

e:

E(Y

|Y>

0)

Month

Data vs. MREM Models

Obs: , YExp: P, L,G

Page 34: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 34Term 4, 2006

Diagnostic PRISM Matrix: lognormal IS-GLMM Residuals

Observed

Expected

Ob

serv

ed

Expected

QQ Plot1

12%6%

13% 70%

9%4%

15% 71%

13%7%

10% 70%

01

23

45

6

Ob

serv

ed

Expected

QQ Plot2

8%5%

10% 77%

8%5%

15% 72%

0 1 2 3 4 5 6

9%4%

10% 77%

Ob

serv

ed

Expected

QQ Plot3

0 1 2 3 4 5 6

Page 35: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 35Term 4, 2006

Diagnostic PRISM Matrix: lognormal IS-GLMM Residuals

Observed

Expected

x

De

nsi

ty

Res10

12%6%

13% 70%

9%4%

15% 71%

13%7%

10% 70%

01

23

45

6

x

De

nsi

ty

Res11

8%5%

10% 77%

8%5%

15% 72%

0 1 2 3 4 5 6

9%4%

10% 77%

x

De

nsi

ty

Res12

0 1 2 3 4 5 6

Page 36: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 36Term 4, 2006

MEMMEM

MREMMREM

Review & Related WorkReview & Related Work

0 1 2

+

HMREMHMREM HMMMM

1. Simple Combinations of Simple Models

2. Complex (Multi-Level) Data:Many Models & Many Pictures

Ideas

Page 37: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 37Term 4, 2006

Data vs. HMREM Models

02000

4000

6000

8000

10000

12000

Y10+

Y11+

Y12+

G10

G11

G12

L10

L11

L12

H10

H11

H12

Y10+

Y11+

Y12+

G10

G11

G12

L10

L11

L12

H10

H11

H12

10

11

12

l10

l11

l12

10

11

12

l10

l11

l12

10 11 12 10 11 12

0.0

0.2

0.4

0.6

0.8

1.0Female Male

Inte

nsity:

Pr(

Y>

0)

Siz

e:

E(Y

|Y>

0)

Month

Data vs. HMMMM Models

Page 38: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 38Term 4, 2006

Review & Related WorkReview & Related Work

• These ideas are not just for Zero-Inflated Data

• Latent Variables are useful for “connecting” things

Page 39: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 39Term 4, 2006

Opportunistic Infection & IDUOpportunistic Infection & IDU

Day in Study6 months prior to 1st interview

Opportunistic Infection

Interview: Reported Drug Use

Interview: Reported No Drug Use

Always Users

Intermittent Users

Never Users

Each Line Represents 1 subject’s time in the study

Page 40: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 40Term 4, 2006

But what about Possible But what about Possible Informative Missingness?Informative Missingness?

OIDrug Use

Death / Dropout

Page 41: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 41Term 4, 2006

Jointly Analyze Survival & OIsJointly Analyze Survival & OIs

1) logistic model:

logit{ Pr(OIij | ai) } = 0 + 1SUij +

2SUij*HCuseij + 3AUij + 4Periodj + ai

2) Survival Model:

log{ (t) } = 0 + 1SUij + 2AUij + ai

3) Latent Effects:

ai ~ N(0,)

 

Guo & Carlin (2004)

Page 42: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 42Term 4, 2006

Warning!Warning!

• But “Buyer Beware”

-- Model Assumptions

-- Identifiability

-- Model Fit

-- Marginalize & Check whenever possible

• MLMs require even more due-diligence than usual

Page 43: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 43Term 4, 2006

ReferencesReferences• Mixture Models:

– McLachlan, G. J. and Peel, D. (2001), Finite mixture models, John Wiley & Sons.

– Jacobs, R. A. and Jordan, M. I. (1991), “Adaptive mixtures of local experts. Neural Computation,” Neural Computation, 3, 79–87.

• Two-Part Models:– Tobin, J. (1958), “Estimation of Relationships for Limited Dependent Variables,”

Econometrica, 25, 24–36. – Amemiya, T. (1984), “Tobit models: A survey,” Journal of Econometrics, 24, 3–61.– Heckman, J. (1976), “The common structure of statistical models of truncation, sample

selection, and limited dependent variables, and a sample estimator for such models,” The Annals of Economic Development and Social Measurement, 5, 475–592.

– Lambert, D. (1992), “Zero-inflated Poisson regression, with an application to defects in manufacturing,” Technometrics, 34, 1–14.

– Green, W. (1994), “Accounting for excess zeros and sample selection in Poisson and negative binomial regression models,” Working Paper EC-94-10, Department of Economics, New York University

– Manning, W., Newhouse, J., Orr, L., Duan, N., Keeler, E., Leibowitz, A., Marquis, M., and Phelps, C. (1981), “A two-part model of the demand for medical care: Preliminary results from the health insurance experiment,” in Health, Economics, and Health Economics, eds. van der Gaag, J. and Perlman, M., pp. 103–104.

– Mullahy, J. (1998), “Much ado about two: reconsidering retransformation and the two part model in health economics,” Journal of Health Economics, 17, 247–281.

Page 44: Term 4, 2006BIO656--Multilevel Models 1 PART 8 Two Stage & Joint Models

BIO656--Multilevel Models 44Term 4, 2006

• Longitudinal 2-part models– Olsen, M. K. and Schafer, J. L. (2001), “A two-part random-effects model for semicontinuous

longitudinal data,” Journal of the American Statistical Association, 96, 730–745.– Tooze, J. A., Gunward, G. K., and Jones, R. H. (2002), “Analysis of repeated measures

data with clumping at zero,” Statistical Methods in Medical Research, 11, 341–355.– Yau, K. K. W., Lee, A. H., and Ng, A. S. K. (2002), “A zero-augmented gamma mixed model

for longitudinal data with many zeros,” The Australian and New Zealand Journal of Statistics 44, 177–183.

• Estimation:– Zeger, S. L. and Karim, M. R. (1991), “Generalized linear models with random effects: A

Gibbs sampling approach,” Journal of the American Statistical Association, 86, 79–86.– Davidian, M. and Giltinan, D. M. (1993), “Some general estimation methods for nonlinear

mixed-effects models,” Journal of Biopharmaceutical Statistics, 3, 23–55.– Pinheiro, J. C. and Bates, D. M. (1995), “Approximations to the log-likelihood function in the

nonlinear mixed-effects model,” Journal of Computational and Graphical Statistics,4, 12–35.– McCulloch, C. E. (1997), “Maximum likelihood algorithms for generalized linear mixed

models,” Journal of the American Statistical Association, 92, 162–170.– Booth, J. G., Hobert, J. P., and Jank, W. (2001), “A survey of Monte Carlo algorithms for

maximizing the likelihood of a two-stage hierarchical model,” Statistical Modelling: An International Journal, 1, 333–349.

– Rabe-Hesketh, S., Skrondal, A., and Pickles, A. (2004), “Maximum likelihood estimation of limited and discrete variable models with nested random effects,” Journal of Econometrics, in press.

• Other:– Guo, X. and Carlin, B.P. (2004), ``Separate and Joint Modeling of Longitudinal and

Event Time Data Using Standard Computer Packages," The American Statistician, 58 16--24.

ReferencesReferences