28
Flexible parametric alternatives to the Cox model Paul Lambert 1,2 , Patrick Royston 3 1 Department of Health Sciences, University of Leicester, UK 2 Medical Epidemiology & Biostatistics, Karolinska Institutet, Stockholm, Sweden 3 MRC Clinical Trials Unit, London [email protected] 11 September 2009 Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 1 / 27

Flexible parametric alternatives to the Cox model

  • Upload
    vananh

  • View
    227

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Flexible parametric alternatives to the Cox model

Flexible parametric alternatives to the Cox model

Paul Lambert1,2, Patrick Royston3

1Department of Health Sciences, University of Leicester, UK2Medical Epidemiology & Biostatistics, Karolinska Institutet, Stockholm, Sweden

3MRC Clinical Trials Unit, London

[email protected]

11 September 2009

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 1 / 27

Page 2: Flexible parametric alternatives to the Cox model

Motivation

�The success of Cox regression has perhaps had the unintendedside-e¤ect that practitioners too seldom invest e¤orts in studying thebaseline hazard. . .

. . . a parametric version [of the Cox model], ... if found to beadequate, would lead to more precise estimation of survivalprobabilities and ... concurrently contribute to a better understandingof the phenomenon under study. � (Hjort 1992)

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 2 / 27

Page 3: Flexible parametric alternatives to the Cox model

Motivation

�The success of Cox regression has perhaps had the unintendedside-e¤ect that practitioners too seldom invest e¤orts in studying thebaseline hazard. . .

. . . a parametric version [of the Cox model], ... if found to beadequate, would lead to more precise estimation of survivalprobabilities and ... concurrently contribute to a better understandingof the phenomenon under study. � (Hjort 1992)

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 2 / 27

Page 4: Flexible parametric alternatives to the Cox model

Sir David Roxbee Cox (b. 15 July 1924)Taken at IBC, 2008

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 3 / 27

Page 5: Flexible parametric alternatives to the Cox model

Problems with the Cox model

Discards information on the survival distribution

Di¢ cult to visualize the hazard function

Problem of non-proportional hazards (non-PH)� modelling to dealwith non-PH can be complex

May want a completely speci�ed probability model

e.g. for prediction, simulation or model validation

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 4 / 27

Page 6: Flexible parametric alternatives to the Cox model

Quote from Sir David Cox (Reid, 1994)

Reid �What do you think of the cottage industry that�s grown uparound [the Cox model]?�

Cox �In the light of further results one knows since, I think Iwould normally want to tackle the problem parametrically.. . . I�m not keen on non-parametric formulations normally.�

Reid �So if you had a set of censored survival data today, youmight rather �t a parametric model, even though there was afeeling among the medical statisticians that that wasn�t quiteright.�

Cox �That�s right, but since then various people have shown thatthe answers are very insensitive to the parametricformulation of the underlying distribution. And if you wantto do things like predict the outcome for a particular patient,it�s much more convenient to do that parametrically.�

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 5 / 27

Page 7: Flexible parametric alternatives to the Cox model

Outline (part 1� Patrick)

Motivation for �exible parametric proportional hazards models

Very brief introduction to Royston-Parmar models

Generalization of �exible parametric PH models

Smoothing baseline distribution functions with splines

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 6 / 27

Page 8: Flexible parametric alternatives to the Cox model

Outline (part 2� Paul)

The stpm2 command

Why we need �exible models

Applications of stpm2

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 7 / 27

Page 9: Flexible parametric alternatives to the Cox model

Flexible parametric proportional hazards models I

Start with about the simplest survival distribution: exponential

S (t) = exp (�λt)

Transform to log cumulative hazard scale:

lnH (t) = ln [� lnS (t)]= ln (λt) = lnλ+ ln t

We see that lnH (t) is a linear function of ln t with slope 1

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 8 / 27

Page 10: Flexible parametric alternatives to the Cox model

Flexible parametric proportional hazards models II

Generalize lnH (t) to a linear function of ln t with slope γ:

lnH (t) = lnλ+ γ ln t

This is a Weibull distribution. Now add covariates, x:

lnH (tjx) = lnλ+ γ ln t + xβ

Further generalize the ln t part to allow greater �exibility:

lnH (tjx) = lnλ+ s (ln t) + xβ

Here, s (ln t) is a suitable smooth function of ln t

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 9 / 27

Page 11: Flexible parametric alternatives to the Cox model

Flexible parametric proportional hazards models III

We actually use a restricted cubic spline function for s (ln t)

more coming on splines shortly

Let�s write the model more generally as�log cumul. hazard = log baseline cumul. hazard + covariate e¤ect�:

lnH (tjx) = lnH0 (t) + xβ

The baseline log cumulative hazard is a smooth function of ln t:

lnH0 (t) = lnH (t; x = 0)= lnλ+ s (ln t)

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 10 / 27

Page 12: Flexible parametric alternatives to the Cox model

Why model on the log cumulative hazard scale? I

We are used to the log hazard scale� why change to the logcumulative hazard scale?

Under proportional hazards, covariate e¤ects are proportional on thelog cumulative hazard scale AND on the log hazard scale:

lnH (tjx) = lnH0 (t) + xβ

) H (tjx) = H0 (t) exp (xβ)

) h (tjx) = h0 (t) exp (xβ)

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 11 / 27

Page 13: Flexible parametric alternatives to the Cox model

Why model on the log cumulative hazard scale? II

The log cumulative hazard as a function of log time is often simple

in the Weibull model, it is a straight line

It�s easier to capture the shape of simple functions than complicatedones

e.g. the log hazard function is usually more complicated than the logcumulative hazardyou can too easily get �wiggly��tted hazard functions

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 12 / 27

Page 14: Flexible parametric alternatives to the Cox model

Royston-Parmar modelsA further generalization!

We can generalize the model for the log cumulative hazard functionby going back a few steps

No details given here� see Royston & Parmar (2002)

The most important other class of Royston-Parmar models isproportional cumulative odds models:

O (tjx) = O0 (t) exp (xβ)

whereO (t) = [1� S (t)] /S (t)

is the (cumulative) odds of an event occurring before time tProportional odds models for survival data were �rst suggested byBennett (1983)They sometimes give a better �t than proportional hazards modelsBut we won�t be discussing them much today

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 13 / 27

Page 15: Flexible parametric alternatives to the Cox model

Examples of survival, cumulative hazard and cumulativeodds functions in real data

The Rotterdam breast cancer dataset:

N = 2982 patients with primary breast cancer

1,477 events for relapse-free survival

Time is years since surgery to remove the primary tumour

Data restricted to �rst 10 years of follow-up

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 14 / 27

Page 16: Flexible parametric alternatives to the Cox model

Survival function (Kaplan-Meier)

sts gen s = s

0.2

5.5

.75

1S(

t)

0 2 4 6 8 10Years from surgery

Survival function

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 15 / 27

Page 17: Flexible parametric alternatives to the Cox model

Cumulative hazard function (Nelson-Aalen)

sts gen H = na

0.2

.4.6

.8H

(t)

0 2 4 6 8 10Years from surgery

Cumulative hazard function

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 16 / 27

Page 18: Flexible parametric alternatives to the Cox model

Log cumulative hazard function

gen lnH = ln(H)

­8­6

­4­2

0ln

 H(t)

.1 .2 .5 1 2 5 10Years from surgery (log scale)

Log cumulative hazard function

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 17 / 27

Page 19: Flexible parametric alternatives to the Cox model

Log cumulative hazard and odds functions

gen lnO = ln(1 - s) / s

­8­6

­4­2

0ln

 H(t)

 / lo

g O

(t)

.1 .2 .5 1 2 5 10Years from surgery (log scale)

ln H(t)ln O(t)

Log cumulative hazard and odds functions

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 18 / 27

Page 20: Flexible parametric alternatives to the Cox model

Spline functions: A brief description

Splines are �exible mathematical functions de�ned by piecewisepolynomials joined at points on the x axis known as knots

Regression splines are particularly useful, because

they can be incorporated into any regression model which has a linearpredictorthey are relatively simple but still �exible enough for most practical data

Cubic regression splines can be constrained in di¤erent ways toimprove their smoothness

e.g. the �tted function is forced to have continuous 0th, �rst andsecond derivatives at the knots

The following graph shows how smoothness improves as the curveand its derivatives are more and more constrained

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 19 / 27

Page 21: Flexible parametric alternatives to the Cox model

25

50

100

150

200E

xces

s M

orta

lity

Rat

e (1

000

py’s

)

0 1 2 3 4 5Years from Diagnosis

No Constraints

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 20 / 27

Page 22: Flexible parametric alternatives to the Cox model

25

50

100

150

200E

xces

s M

orta

lity

Rat

e (1

000

py’s

)

0 1 2 3 4 5Years from Diagnosis

Forced to Join at Knots

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 21 / 27

Page 23: Flexible parametric alternatives to the Cox model

25

50

100

150

200E

xces

s M

orta

lity

Rat

e (1

000

py’s

)

0 1 2 3 4 5Years from Diagnosis

Continuous 1st Derivatives

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 22 / 27

Page 24: Flexible parametric alternatives to the Cox model

25

50

100

150

200E

xces

s M

orta

lity

Rat

e (1

000

py’s

)

0 1 2 3 4 5Years from Diagnosis

Continuous 2nd Derivatives

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 23 / 27

Page 25: Flexible parametric alternatives to the Cox model

Restricted cubic splines I

A further useful constraint is to force the �tted function to be linearin the tails

i.e. beyond the lowest and highest knots� known as boundary knots

This stabilizes the function in the two regions which usually have thesparsest data

These splines are known as �restricted�(or �natural�) cubicsplines� RCS

We use RCS in Royston-Parmar models to smooth the baselinedistribution function

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 24 / 27

Page 26: Flexible parametric alternatives to the Cox model

Restricted cubic splines II

An RCS with K interior knots and 2 boundary knots has K + 1 d.f.and K + 1 basis functions:

s (x) = γ0 + γ1z1 + γ2z2 + γK+1zK+1

where z1 equals x and the other z�s are simple functions of the knotsand x

If K = 0 the spline simpli�es to the linear function γ0 + γ1x

no boundary knots, of course

Will shortly return to the question of how many knots to choose inthe Stata demonstration

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 25 / 27

Page 27: Flexible parametric alternatives to the Cox model

But how do I do any of this in Stata?

Let�s visit Stata now and see how to �t some simple �exible proportionalhazards models to the Rotterdam breast cancer data

We use Paul�s powerful stpm2 program to do the model-�tting, and itspredict function to get the outputs we need.

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 26 / 27

Page 28: Flexible parametric alternatives to the Cox model

Notes on stpm2

stpm2 can �t a wide variety of types of model� Paul will demonstratesome of them shortly

Particularly strong are its prediction facilities:

Survival, cumulative hazard, hazard, cumulative odds, ...All with pointwise con�dence intervals if desired (ci option)Can handle important derived quantities such as hazard ratios, hazarddi¤erences and survival di¤erences

Now hand over to Paul to continue.

Patrick Royston (MRC CTU) Flexible parametric survival models 11 September 2009 27 / 27