67
Modelling Longitudinal Data • Survival Analysis. • Event History. • Recurrent Events. • A Final Point – and link to Multilevel Models (perhaps).

Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Embed Size (px)

Citation preview

Page 1: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Modelling Longitudinal Data

• Survival Analysis.

• Event History.

• Recurrent Events.

• A Final Point – and link to Multilevel Models (perhaps).

Page 2: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Yi 1 = ’Xi1 + i1

Vector of explanatory variables and estimates

Independent identifiably distributed error

Outcome 1 for individual i

Page 3: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Yi 2 = ’Xi2 + i2

Vector of explanatory variables and estimates

Independent identifiably distributed error

Outcome 2 for individual i

THE SAME AGAIN AT TIME 2

Page 4: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Yi 1 = ’Xi1 + i1 Yi 2 = ’Xi2 + i2

Considered together conventional regression analysis in NOT appropriate

Page 5: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Yi 2 - Yi 1 = ’(Xi2-Xi1) + (i2 - i1)

Change in Score

Here the ’ is simply a regression on the difference or change in scores.

Page 6: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

As social scientists we are often substantively interested in whether a specific event has occurred.

Page 7: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Survival Data – Time to an event

In the medical area…

• Duration from treatment to death.

• Time to return of pain after taking a pain killer.

Page 8: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Survival Data – Time to an event

Social Sciences…

• Duration of unemployment.

• Duration of time on a training scheme.

• Duration of housing tenure.

• Duration of marriage.

• Time to conception.

Page 9: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Consider a binary outcome or two-state event

0 = Event has not occurred

1 = Event has occurred

Page 10: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Start of Study End of Study

0 1

0

0

1

1

t1 t2 t3

Page 11: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

These durations are a continuous Y so why can’t we use standard

regression techniques?

Page 12: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Start of Study End of Study

0 1

0

0

1

1

1

0

CENSORED OBSERVATIONS

0

Page 13: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Start of Study End of Study

1

B

CENSORED OBSERVATIONS

A

Page 14: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

These durations are a continuous Y so why can’t we use standard

regression techniques?

What should be the value of Y for person A and person B at the end of our study (when we fit the model)?

Page 15: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Cox Regression

is a method for modelling time-to-event data in the presence of censored cases.

•Explanatory variables in your model (continuous and categorical).

•Estimated coefficients for each of the covariates.

•Handles the censored cases correctly.

Page 16: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Start of Study End of Study

0 1

0

0

1

1

1

0

CENSORED OBSERVATIONS

0

UNEMPLOYMENT AND RETURNING TO WORK STUDY

0 = Unemployed; 1 = Returned to work

Page 17: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Y variable =

duration with censored observations

X1

X3

X2

A Statistical Model

Page 18: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Y variable =

duration with censored observations

Previous Occupation

Educational Qualifications

A Statistical Model

Length of Work experience

A continuous covariate

Page 19: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

More complex event history analysis

Page 20: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Start of Study

End of Study

0

t1 t2 t3

0 = Unemployed; 1 = Returned to work

1 1

UNEMPLOYMENT AND RETURNING TO WORK STUDY

0

Page 21: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Start of Study

End of Study

0

t1

0 = Unemployed; 1 = Returned to work

UNEMPLOYMENT AND RETURNING TO WORK STUDY

Spell or Episode

Page 22: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Start of Study

End of Study

0

t1

0 = Unemployed; 1 = Returned to work

1

UNEMPLOYMENT AND RETURNING TO WORK STUDY

Transition = movement from one state to another

Page 23: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Recurrent Events Analysis

Page 24: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

The structure of many large-scale studies results in survey data being collected at a number of discrete occasions. In this situation, rather than being continuous, time lends itself to be conceptualized as a sequence of discrete events. Furthermore, social scientists are often substantively interested in whether a specific event has occurred. Taken together, these two issues appeal to the adoption of a discrete-time or event history approach.

Page 25: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Recurrent events are merely outcomes that can take place on a number of occasions. A simple example is unemployment measured month by month. In any given month an individual can either be employed or unemployed. If we had data for a calendar year we would have twelve discrete outcome measures (i.e. one for each month).

Page 26: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Social scientists now routinely employ statistical models for the analysis of discrete data, most notably logistic and log-linear models, in a wide variety of substantive areas. I believe that the adoption of a recurrent events approach is appealing because it is a logical extension of these models.

Page 27: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Willet and Singer (1995) conclude that discrete-time methods are generally considered to be simpler and more comprehensible, however, mastery of discrete-time methods facilitates a transition to continuous-time approaches should that be required.

Willet, J. and Singer, J. (1995) Investigating Onset, Cessation, Relapse, and Recovery: Using Discrete-Time Survival Analysis to Examine the Occurrence and Timing of Critical Events. In J. Gottman (ed) The Analysis of Change (Hove: Lawrence Erlbaum Associates).

Page 28: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

STATISTICAL ANALYSIS FOR BINARY RECURRENT

EVENTS (SABRE)

• Fits appropriate models for recurrent events.

• It is like GLIM.

• It can be downloaded free.

Page 29: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

www.cas.lancs.ac.uk/software

Page 30: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Consider a binary outcome or two-state event

0 = Event has not occurred

1 = Event has occurred

In the cross-sectional situation we are used to modelling this with logistic regression.

Page 31: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

0 = Unemployed; 1 = Returned to work

UNEMPLOYMENT AND

RETURNING TO WORK STUDY –

A study for six months

Page 32: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Months1 2 3 4 5 6

obs 0 0 0 0 0 0

Constantly unemployed

Page 33: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Months1 2 3 4 5 6

obs 1 1 1 1 1 1

Constantly employed

Page 34: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Months1 2 3 4 5 6

obs 1 0 0 0 0 0

Employed in month 1 then unemployed

Page 35: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Months1 2 3 4 5 6

obs 0 0 0 0 0 1

Unemployed but gets a job in month six

Page 36: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Months1 2 3 4 5 6

obs 0 1 0 1 1 0obs 0 0 1 0 1 1obs 0 1 1 0 0 1obs 1 0 0 0 1 0

Mixed employment patterns

Page 37: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Here we have a binary outcome – so could we simply use logistic regression to model it?

Months1 2 3 4 5 6

obs 0 0 0 0 0 0

Page 38: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Yes and No!

Page 39: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

SABRE fits two models that are appropriate to this analysis.

Model 1 = Pooled Cross-Sectional Logit Model

Page 40: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

)'exp(1

)]'[exp()(

it

it

x

yxL

it

itB

POOLED CROSS-SECTIONAL

LOGIT MODEL

x it is a vector of explanatory variables and is a vector of

parameter estimates .

Page 41: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

)'exp(1

)]'[exp()(

it

it

x

yxL

it

itB

POOLED CROSS-SECTIONAL

LOGIT MODEL

In conventional logistic regression models, where each observation is assumed to be independent, a logistic link function is used, the contribution to the likelihood by the ith case and the t th event is given by the equation above.

Page 42: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

This approach can be regarded as a naïve solution to our data analysis problem.

Page 43: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

We need to consider a number of issues….

Page 44: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

MonthsY1 Y2

obs 0 0

Pickle’s tip - In repeated measured analysis

we would require something like a ‘paired’ t test

rather than an ‘independent’ t test because we

can assume that Y1 and Y2 are related.

Page 45: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

SABRE fits two models that are appropriate to this analysis.

Model 2 = Random Effects Model

(or logistic mixture model)

Page 46: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Repeated measures data violate an important assumption of conventional regression models.

The responses of an individual at different points in time will not be independent of each other.

This problem has been overcome by the inclusion of an additional, individual-specific error term.

Page 47: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

df(

)'exp(1

)]'[exp()(

1

it

iti

x

yxL

itT

t

itB

Page 48: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

The random effects model extends the pooled cross-sectional model to include a case-specific random error term to account for residual heterogeneity.

For a sequence of outcomes for the ith case, the basic random effects model has the integrated (or marginal likelihood) given by the equation.

Page 49: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Davies and Pickles (1985) have demonstrated that the failure to explicitly model the effects of residual heterogeneity may cause severe bias in parameter estimates. Using longitudinal data the effects of omitted explanatory variables can be overtly accounted for within the statistical model. This greatly improves the accuracy of the estimated effects of the explanatory variables

Page 50: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)
Page 51: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Movers and Stayers

When considering data on recurrent events there will be individuals for whom there will be zero

(or very low) probabilities of change in outcome from one event to the next. These individuals

are termed as ‘stayers’.

Page 52: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Months1 2 3 4 5 6

obs 0 0 0 0 0 0

This person is a stayer!

Page 53: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Months1 2 3 4 5 6

obs 1 1 1 1 1 1

So is this person.

Page 54: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

An awareness of the issue of ‘stayers’ is important for technical reasons. A limitation of a parametric modelling approach is that the tail behaviour of the normal distribution is inconsistent with ‘stayers’ and they will tend to be underestimated (see Spilerman 1972).

Spilerman, S. (1972) ‘Extensions of the Mover-Stayer Model’, American Journal of Sociology, 78, pp.599-626.

Page 55: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Recurrent events may be analysed using other software but SABRE is specifically designed to handle stayers and this feature increases SABRE’s flexibility in representing residual heterogeneity (Barry, Francis, Davies, and Stott 1998).

Barry, J., Francis, B., Davies, R.B. and Stott,D. (1998) SABRE Users Guidehttp://www.cas.lancs.ac.uk/software/sabre3.1/sabreuse.html

Page 56: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Past BehaviourCurrent

Behaviour

STATE DEPENDENCE

Page 57: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

UnemployedEmployed

Employed

Young People Aged 19

MAYAPRIL

Different Probabilities of Employment

Page 58: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

This is called a MARKOV model

A Markov model helps to control for a previous outcome (or behaviour).

Page 59: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

1tyγ'β -itx

ACCOUNTS FOR PREVIOUS

OUTCOME (yt-1)

Page 60: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

UnemployedExplanatory

Variables Employed

EmployedExplanatory

Variables

The Model Provides TWO sets of estimates

MAY

APRIL

Page 61: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

This is a ‘two-state’ MARKOV model

But we can make it more complicated.

Page 62: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

MonthsY1 Y2

obs 0 0

First Order Markov Model

Page 63: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

MonthsY1 Y2 Y3

obs 0 0 0

Second Order Markov Model

Page 64: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

FINAL POINT – A THOUGHT!

Page 65: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Months1 2 3 4 5 6

obs 0 1 0 1 1 0obs 0 0 1 0 1 1obs 0 1 1 0 0 1obs 1 0 0 0 1 0

Mixed employment patterns

Page 66: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

a b c d e f

1 2 3 4 1 2 1 2 3 1 2 3 1 2 1 2 3 1 2

g

Observations Months

Individuals

Hierarchical or Multilevel Data Structure

Page 67: Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps)

Is the recurrent events model simply a multilevel model fitted

at the single level?

A controversial point!

More later…..