59
Interim analysis and treatment selection in time-to-event randomized trials in rare diseases on a long-term horizon Mohamed Amine BAYAR , , Gw ´ ena¨ el LE TEUFF , , Marie-C ´ ecile LE DELEY ,, , Stefan MICHIELS , November , - Statistical methods and designs in clinical oncology - Symposium of the ONCOSTAT team Service de Biostatistique et d’Epid ´ emiologie, Gustave Roussy, Villejuif, France CESP - UPS - UVSQ, INSERM, Universit´ e Paris-Saclay, Villejuif, France Unit ´ e de M ´ ethodologie et Biostatistique, Centre Oscar Lambret, Lille, France

[width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Interim analysis and treatment selection

in time-to-event randomized trials

in rare diseases on a long-term horizon

Mohamed Amine BAYAR1,2, Gwenael LE TEUFF1,2,Marie-Cecile LE DELEY1,2,3, Stefan MICHIELS1,2

November 9, 2017 - Statistical methods and designs in clinicaloncology - Symposium of the ONCOSTAT team1Service de Biostatistique et d’Epidemiologie, Gustave Roussy, Villejuif, France2CESP - UPS - UVSQ, INSERM, Universite Paris-Saclay, Villejuif, France3Unite de Methodologie et Biostatistique, Centre Oscar Lambret, Lille, France

1

Page 2: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Context

• Most cancers in children are rare• ≈ 20% of cancers in adults are rare• Precision medicine⇒ Many common cancers in

adults become a set of rare cancers

⇒ Scarce resources for clinical research

Large randomized clinical trials (RCT) with standardone-sided 2.5% α-level and 80% power for a reasonablee�ect size o�en no longer feasible (Parmar et al., 2016)

2

Page 3: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Previous work (Bayar et al., 2016)

• Consider a trial as part of a series of two-arm RCTsrather than in isolation

• Assess bene�ts and risks on a long period• Search for the best compromise between evidencecriteria and sample size to achieve the greatesttherapeutic gain

Conclusion:

Performing a series of small trials with relaxedα-levels leads, on average, to larger survival bene�tsover a long research horizon compared with largertrials with a typical 2.5% one-sided α-level

3

Page 4: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Previous work (Bayar et al., 2016)

• Consider a trial as part of a series of two-arm RCTsrather than in isolation

• Assess bene�ts and risks on a long period• Search for the best compromise between evidencecriteria and sample size to achieve the greatesttherapeutic gain

Conclusion:

Performing a series of small trials with relaxedα-levels leads, on average, to larger survival bene�tsover a long research horizon compared with largertrials with a typical 2.5% one-sided α-level

3

Page 5: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Current objective

1. Design each trial within the series as a two-arm RCTwith an interim analysis (IA)

2. Design each trial within the series as a two-stagethree-arm RCT with treatment selection at interim

3. Compare the performance of the two previous seriesdesigns between them, and with other moretraditional designs

4

Page 6: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation framework

Basic simulation Model

• Succession of K two-arm RCTs over 15 years• Experimental arm E vs. control arm C• Time-to-event endpoint - One-sided log-rank Test• Treatment selected a�er each trial becomes thecontrol of the next trial

• Number of patients for each trial within the seriescomputed with the current baseline selected fromthe previous trial (Kim and Tsiatis, 1990)

5

Page 7: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation framework

Assumptions

• Uniform accrual• Exponential distribution of survival times (λCk,λEk), for

each trial k, k ∈ [1,K] and K depends on the courseof the series

• No patient lost to follow-up (FU)• Fixed FU time

6

Page 8: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameters

Characteristics of the underlying disease• Accrual rate: 50, 100, or 200 patients/year• Hazard rate of the control arm of the �rst trial of the

series λC1Survival λC1 Follow-up

median survival of 6 months 2 log(2) 6 monthsmedian survival of 1 year log(2) 1 yearmedian survival of 2 years log(2)

2 2 years2-year survival rate of 75% log(4)−log(3)

2 2 years

Trials within the same series are designed to achievethe same power (80% or 90%) for the same expectedHR of 0.5, 0.6 or 0.75 7

Page 9: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameters

Characteristics of the underlying disease• Accrual rate: 50, 100, or 200 patients/year• Hazard rate of the control arm of the �rst trial of the

series λC1Survival λC1 Follow-up

median survival of 6 months 2 log(2) 6 monthsmedian survival of 1 year log(2) 1 yearmedian survival of 2 years log(2)

2 2 years2-year survival rate of 75% log(4)−log(3)

2 2 years

Trials within the same series are designed to achievethe same power (80% or 90%) for the same expectedHR of 0.5, 0.6 or 0.75 7

Page 10: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameters

Characteristics of the underlying disease• Accrual rate: 50, 100, or 200 patients/year• Hazard rate of the control arm of the �rst trial of the

series λC1Survival λC1 Follow-up

median survival of 6 months 2 log(2) 6 monthsmedian survival of 1 year log(2) 1 yearmedian survival of 2 years log(2)

2 2 years2-year survival rate of 75% log(4)−log(3)

2 2 years

Trials within the same series are designed to achievethe same power (80% or 90%) for the same expectedHR of 0.5, 0.6 or 0.75 7

Page 11: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameters

Characteristics of the underlying disease• Accrual rate: 50, 100, or 200 patients/year• Hazard rate of the control arm of the �rst trial of the

series λC1Survival λC1 Follow-up

median survival of 6 months 2 log(2) 6 monthsmedian survival of 1 year log(2) 1 yearmedian survival of 2 years log(2)

2 2 years2-year survival rate of 75% log(4)−log(3)

2 2 years

Trials within the same series are designed to achievethe same power (80% or 90%) for the same expectedHR of 0.5, 0.6 or 0.75 7

Page 12: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameters

Hypotheses of how treatments improve over time:future treatment e�ects

Relative characterizationHazard Ratio

E[HR] P[HR 6 0.5]

0.925 0.02

0.950 0.02

0.950 0.01

1.000 0.01

• Historical distributionderived from themeta-analysis of 698RCTs on > 200 000patients (Djulbegovicet al., 2012)

• Other distributions ±optimistic orpessimistic

8

Page 13: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameters

Hypotheses of how treatments improve over time:future treatment e�ects

Relative characterizationHazard Ratio

E[HR] P[HR 6 0.5]

0.925 0.02

0.950 0.02

0.950 0.01

1.000 0.01

• Historical distributionderived from themeta-analysis of 698RCTs on > 200 000patients (Djulbegovicet al., 2012)

• Other distributions ±optimistic orpessimistic

8

Page 14: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameters

Hypotheses of how treatments improve over time:future treatment e�ects

Relative characterizationHazard Ratio

E[HR] P[HR 6 0.5]

0.925 0.02

0.950 0.02

0.950 0.01

1.000 0.01

Absolute characterizationHazard Rate λE(t)Hazard rate of the control arm of the �rsttrial of the series λC1 = log(2)

8

Page 15: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameters

288 possible combinations of simulation parameters

• 3 accrual rates• 4 baseline survivals• 4 hypotheses of how treatments improve over time• 2 powers to be achieved for 3 expected HRs

9

Page 16: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

Hazard rate ofthe control armof the �rst trial

of the series

λE1

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

λE1λC1

λE2

λC3

λC2λC2

×

λE3 λC4

λE3λC3

×

λE4

λC4λC4

× =λC4λC1

10

Page 17: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

λE1

Distributionof future treatment

e�ects

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

λE1λC1

λE2

λC3

λC2λC2

×

λE3 λC4

λE3λC3

×

λE4

λC4λC4

× =λC4λC1

10

Page 18: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

λE1

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

λE1λC1

λE2

λC3

λC2λC2

×

λE3 λC4

λE3λC3

×

λE4

λC4λC4

× =λC4λC1

10

Page 19: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

λE1

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

λE1λC1

λE2

λC3

λC2λC2

×

λE3 λC4

λE3λC3

×

λE4

λC4λC4

× =λC4λC1

10

Page 20: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

λE1

Distributionof future treatment

e�ects

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

λE1λC1

λE2

λC3

λC2λC2

×

λE3 λC4

λE3λC3

×

λE4

λC4λC4

× =λC4λC1

10

Page 21: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

λE1

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

λE1λC1

λE2

λC3

λC2λC2

×

λE3 λC4

λE3λC3

×

λE4

λC4λC4

× =λC4λC1

10

Page 22: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

λE1

Distributionof future treatment

e�ects

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

λE1λC1

λE2

λC3

λC2λC2

×

λE3 λC4

λE3λC3

×

λE4

λC4λC4

× =λC4λC1

10

Page 23: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

λE1

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

λE1λC1

λE2

λC3

λC2λC2

×

λE3 λC4

λE3λC3

×

λE4

λC4λC4

× =λC4λC1

10

Page 24: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

λE1

Distributionof future treatment

e�ects

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

λE1λC1

λE2

λC3

λC2λC2

×

λE3 λC4

λE3λC3

×

λE4

λC4λC4

× =λC4λC1

10

Page 25: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Illustration of one possible series

HRseries

λC1

λE1

yxyxyxyxyxyx

yxyxyxyxyxyx

λC2

�λE1λC1

λE2

λC3

�λC2

�λC2

×

λE3 λC4

�λE3

�λC3

×

λE4

λC4

�λC4

× =λC4λC1

10

Page 26: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Performance metrics

At the end of the 15-year research period

Total survival bene�t1

HRseries− 1 =

λControlFirst trialλSelectedLast trial

− 1

ExampleFor a series of RCTs,

At baseline, median survival = 12 months⇒ λControlFirst trial = 0.69A�er 15 year, λSelectedLast trial = 0.46⇒ median survival = 18 months

HRseries = 0.67⇔ Total survival bene�t = 50%

11

Page 27: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Performance metrics

10 000 repetitions of the 15-year research periodExpected total survival bene�t (Gain)

E[λControlFirst trialλSelectedLast trial

− 1]

Probability of a detrimental e�ect (Risk)Probability that the event rate associated with thetreatment selected at the end of the 15 years is worsethan the baseline event rate

P[λSelectedLast trial > λControlFirst trial

]12

Page 28: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Series of two-arm RCTs with an interim analysis (IA)

IA performed when 1/2 of the required events areexpected to be attained

• Wieand stopping rule for futility (Wieand et al., 1994)HR > 1⇒ stop the trial for futility

• OBF β−spending stopping rule for futility (O’Brienand Fleming, 1979)

• OBF α−spending stopping rule for e�cacy• Combining the latter two

13

Page 29: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Series of two-stage three-arm RCTs with treatmentselection at interim (Posch et al., 2005)

At the �rst stage2 experimental treatments J1 = {1, 2} are compared tothe control and the best is selected for the second stage

At the second stageSelected treatment compared to the control, combiningdata from both stages at the multiple level α

• Closed testing procedure for multiple testing• Simes test for intersection hypotheses• Weighted inverse normal combination function for

stagewise p-values combination14

Page 30: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

STAGE 1 DATAn1 patients per group

Test statistics Zj,1P-values pj,1j ∈ {1, 2}

All pj,1 >threshold

Stop the trial

Select ’best’(j∗)

STAGE 2 DATA2 treatment comparison

(j∗ vs control)n2 patients per group

Test statistics Zj∗P-values pj∗

Testing strategyAssume that j∗ = 1

Stage 2 nominal αH1 can be rejected if:C(p1,1, p1,2) < α

C(p12,1, p12,2) < α

Decision making and testing strategy for the two-stageadaptive treatment selection design, adapted fromDmitrienko et al. (2009) 15

Page 31: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

STAGE 1 DATAn1 patients per group

Test statistics Zj,1P-values pj,1j ∈ {1, 2}

All pj,1 >threshold

Stop the trial

Select ’best’(j∗)

STAGE 2 DATA2 treatment comparison

(j∗ vs control)n2 patients per group

Test statistics Zj∗P-values pj∗

Testing strategyAssume that j∗ = 1

Stage 2 nominal αH1 can be rejected if:C(p1,1, p1,2) < α

C(p12,1, p12,2) < α

Decision making and testing strategy for the two-stageadaptive treatment selection design, adapted fromDmitrienko et al. (2009) 15

Page 32: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

STAGE 1 DATAn1 patients per group

Test statistics Zj,1P-values pj,1j ∈ {1, 2}

All pj,1 >threshold

Stop the trial

Select ’best’(j∗)

STAGE 2 DATA2 treatment comparison

(j∗ vs control)n2 patients per group

Test statistics Zj∗P-values pj∗

Testing strategyAssume that j∗ = 1

Stage 2 nominal αH1 can be rejected if:C(p1,1, p1,2) < α

C(p12,1, p12,2) < α

Decision making and testing strategy for the two-stageadaptive treatment selection design, adapted fromDmitrienko et al. (2009) 15

Page 33: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

STAGE 1 DATAn1 patients per group

Test statistics Zj,1P-values pj,1j ∈ {1, 2}

All pj,1 >threshold

Stop the trial

Select ’best’(j∗)

STAGE 2 DATA2 treatment comparison

(j∗ vs control)n2 patients per group

Test statistics Zj∗P-values pj∗

Testing strategyAssume that j∗ = 1

Stage 2 nominal αH1 can be rejected if:C(p1,1, p1,2) < α

C(p12,1, p12,2) < α

Decision making and testing strategy for the two-stageadaptive treatment selection design, adapted fromDmitrienko et al. (2009) 15

Page 34: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Design parameters

Series of 2-arm trials withinterim analysis

Interim analysis• No interim analysis• Wieand stopping rule for futility• OBF β−spending stopping rule for futility• OBF α−spending stopping rule for e�cacy• Combining the latter two

One-sided α-level:0.025, 0.05, 0.1, 0.2

Series of 3-arm trials withselection at interim

First stage threshold:0.05, 0.1, 0.15, 0.2

Second stage α-level:0.025, 0.05, 0.1, 0.2

16

Page 35: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results

Accrual rate = 50 patients/yearBaseline median survival = 1 year

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

17

Page 36: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Inclusion of an interim analysis

Accrual rate = 50 patients/yearBaseline median survival = 1 year

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

18

Page 37: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Inclusion of an interim analysis

Accrual rate = 50 patients/yearDi�erent baseline hazards rates

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

Median survival of 6 months Median survival of 1 year Median survival of 2 years 2-year survival rate of 75%

19

Page 38: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Inclusion of an interim analysis

Accrual rate = 50 patients/yearBaseline median survival = 1 year

Di�erent distributions of treatment e�ects90% power for an expectedHR of 0.6

More optimistic Historical More pessimistic Very pessimistic

20

Page 39: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Designs comparison

Accrual rate = 50 patients/yearBaseline median survival = 1 year

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

Series of 2-arm RCTs with interim analysis Series of 3-arm RCTs with selection at interim

21

Page 40: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Optimal designs

Argmax E[λControlFirst trialλSelectedLast trial

− 1]

subject to P[λSelectedLast trial > λControlFirst trial

]< 1.0%

Accrual rate = 50 patients/yearBaseline median survival = 1 year

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

Optimal designTraditional

designSeries of 2-arm RCTs

with no interim analysisSeries of 2-arm RCTswith interim analysis

Series of 3-arm RCTswith selection at interim

α-level 0.025 0.1 0.1 0.1Interim analysis None None Wieand threshold = 0.2

Number of trial 3.0 4.0 4.7 4.3Gain 27.0% 36.1% 39.6% 42.7%Risk 0.33% 1.03% 0.91% 0.87%

22

Page 41: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Optimal designs

Argmax E[λControlFirst trialλSelectedLast trial

− 1]

subject to P[λSelectedLast trial > λControlFirst trial

]< 1.0%

Accrual rate = 50 patients/yearBaseline median survival = 1 year

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

Optimal designTraditional

designSeries of 2-arm RCTs

with no interim analysisSeries of 2-arm RCTswith interim analysis

Series of 3-arm RCTswith selection at interim

α-level 0.025 0.1 0.1 0.1Interim analysis None None Wieand threshold = 0.2

Number of trial 3.0 4.0 4.7 4.3Gain 27.0% 36.1% 39.6% 42.7%Risk 0.33% 1.03% 0.91% 0.87%

22

Page 42: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Optimal designs

Argmax E[λControlFirst trialλSelectedLast trial

− 1]

subject to P[λSelectedLast trial > λControlFirst trial

]< 1.0%

Accrual rate = 50 patients/yearBaseline median survival = 1 year

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

Optimal designTraditional

designSeries of 2-arm RCTs

with no interim analysisSeries of 2-arm RCTswith interim analysis

Series of 3-arm RCTswith selection at interim

α-level 0.025 0.1 0.1 0.1Interim analysis None None Wieand threshold = 0.2

Number of trial 3.0 4.0 4.7 4.3Gain 27.0% 36.1% 39.6% 42.7%Risk 0.33% 1.03% 0.91% 0.87%

22

Page 43: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Optimal designs

Argmax E[λControlFirst trialλSelectedLast trial

− 1]

subject to P[λSelectedLast trial > λControlFirst trial

]< 1.0%

Accrual rate = 50 patients/yearBaseline median survival = 1 year

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

Optimal designTraditional

designSeries of 2-arm RCTs

with no interim analysisSeries of 2-arm RCTswith interim analysis

Series of 3-arm RCTswith selection at interim

α-level 0.025 0.1 0.1 0.1Interim analysis None None Wieand threshold = 0.2

Number of trial 3.0 4.0 4.7 4.3Gain 27.0% 36.1% 39.6% 42.7%Risk 0.33% 1.03% 0.91% 0.87%

22

Page 44: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Optimal designs

Argmax E[λControlFirst trialλSelectedLast trial

− 1]

subject to P[λSelectedLast trial > λControlFirst trial

]< 1.0%

Accrual rate = 50 patients/yearBaseline median survival = 1 year

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

Optimal designTraditional

designSeries of 2-arm RCTs

with no interim analysisSeries of 2-arm RCTswith interim analysis

Series of 3-arm RCTswith selection at interim

α-level 0.025 0.1 0.1 0.1Interim analysis None None Wieand threshold = 0.2

Number of trial 3.0 4.0 4.7 4.3Gain 27.0% 36.1% 39.6% 42.7%Risk 0.33% 1.03% 0.91% 0.87%

22

Page 45: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Optimal designs

For the 288 possible combinations of simulation parametersSeries of 2-arm RCTs with interim analysis

One-sided α-level

Interim Analysis 0.025 0.05 0.1 0.2

No interim analysis 0.0% 0.3% 1.4% 1.0% 2.8%Wieand stopping rule for futility 5.6% 4.5% 4.5% 25.8% 40.4%OBF β−spending stopping rule for futility 5.2% 5.9% 2.1% 2.4% 15.7%OBF α−spending stopping rule for e�cacy 0.3% 2.1% 0.7% 1.0% 4.2%Combining the latter two 5.2% 11.5% 4.5% 15.7% 36.9%

16.4% 24.4% 13.2% 46.0% Total

Series of 3-arm RCTs with selection at interimSecond stage α-level

First stage threshold 0.025 0.05 0.1 0.2

0.05 0.4% 0.0% 0.7% 0.7% 1.8%0.1 3.7% 0.7% 1.1% 0.4% 5.9%0.15 10.7% 8.1% 5.1% 2.2% 26.1%0.2 4.4% 6.6% 12.9% 42.3% 66.2%

19.1% 15.4% 19.9% 45.6% Total23

Page 46: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Optimal designs

Comparison of the performance of optimal designs for the 288possible combinations of simulation parameters

For the same number of trials, a series of 3-arm RCTs test twicemore experimental treatments than a series of 2-arm RCTs

24

Page 47: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Conclusion

Even when including interim analysis or two-stagedesign with treatment selection at interim, we stillrecommend to relax α-level

Our recommendation is only valid when considering aseries of trials run over a relatively long researchhorizon and when the supply of new treatments is large

25

Page 48: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

References

Bayar, M. A., Le Teu�, G., Michiels, S., Sargent, D. J., and Le Deley, M.-C. (2016). Newinsights into the evaluation of randomized controlled trials for rare diseases overa long-term research horizon: a simulation study. Statistics in medicine,35(19):3245–3258.

Djulbegovic, B., Kumar, A., Glasziou, P. P., Perera, R., Reljic, T., Dent, L., Ra�ery, J.,Johansen, M., Di Tanna, G. L., Miladinovic, B., et al. (2012). New treatmentscompared to established treatments in randomized trials. The Cochrane Library.

Dmitrienko, A., Tamhane, A. C., and Bretz, F. (2009). Multiple testing problems inpharmaceutical statistics. CRC Press.

Kim, K. and Tsiatis, A. A. (1990). Study duration for clinical trials with survivalresponse and early stopping rule. Biometrics, pages 81–92.

O’Brien, P. C. and Fleming, T. R. (1979). A multiple testing procedure for clinical trials.Biometrics, pages 549–556.

Parmar, M. K., Sydes, M. R., and Morris, T. P. (2016). How do you design randomisedtrials for smaller populations? a framework. BMC medicine, 14(1):183.

Posch, M., Koenig, F., Branson, M., Brannath, W., Dunger-Baldauf, C., and Bauer, P.(2005). Testing and estimation in �exible group sequential designs with adaptivetreatment selection. Statistics in medicine, 24(24):3697–3714.

Wieand, S., Schroeder, G., and O’Fallon, J. R. (1994). Stopping when the experimentalregimen does not appear to help. Statistics in medicine, 13(13-14):1453–1458.

26

Page 49: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Q & A

Please, send any additional questions or comments to:[email protected]

Page 50: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameter

Hypotheses of how treatments improve over time :future treatment e�ects

Relative characterizationHazard Ratio

E[HR] P[HR 6 0.5]

0.925 0.020.950 0.020.950 0.011.000 0.01

Absolute characterizationHazard Rate

λE(t) ∼ logN (µ(t), σ2)

µ(t) = a× t+ b

E[λE(t)] = eµ(t)+ 12σ

2

SD[λE(t)] = eµ(t)+ 12σ

2√eσ2 − 1

Page 51: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Simulation parameter

Hypotheses of how treatments improve over time :future treatment e�ects

Relative characterizationHazard Ratio

E[HR] P[HR 6 0.5]

0.925 0.020.950 0.020.950 0.011.000 0.01

Absolute characterizationHazard Rate

Time (Years)

E[λE (t)]

0 5 10

E[HR]E[HR]

Page 52: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Stage 1 Stage 2

H1

p1,1

H2

p2,1

H1 ∩ H2

p12,1 = minj∈{1,2}

2j p(j),1

Simes test

Select1 treatment

e.g., H1

H1

p1,2

H1 ∩ H2

p12,2 = p1,2

Figure 1: P-value de�nitions of the closed testing procedureusing the Simes test for intersection hypotheses, adaptedfrom Dmitrienko et al. (2009)

Page 53: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Stage 1 Stage 2

H1

p1,1

H2

p2,1

H1 ∩ H2

p12,1 = minj∈{1,2}

2j p(j),1

Simes test

Select1 treatment

e.g., H1

H1

p1,2

H1 ∩ H2

p12,2 = p1,2

Figure 1: P-value de�nitions of the closed testing procedureusing the Simes test for intersection hypotheses, adaptedfrom Dmitrienko et al. (2009)

Page 54: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Stage 1 Stage 2

H1

p1,1

H2

p2,1

H1 ∩ H2

p12,1 = minj∈{1,2}

2j p(j),1

Simes test

Select1 treatment

e.g., H1

H1

p1,2

H1 ∩ H2

p12,2 = p1,2

Figure 1: P-value de�nitions of the closed testing procedureusing the Simes test for intersection hypotheses, adaptedfrom Dmitrienko et al. (2009)

Page 55: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Stage 1 Stage 2

H1

p1,1

H2

p2,1

H1 ∩ H2

p12,1 = minj∈{1,2}

2j p(j),1

Simes test

Select1 treatment

e.g., H1

H1

p1,2

H1 ∩ H2

p12,2 = p1,2

Figure 1: P-value de�nitions of the closed testing procedureusing the Simes test for intersection hypotheses, adaptedfrom Dmitrienko et al. (2009)

Page 56: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather

Results - Sensitivity analysis

Accrual rate = 50, 100, 200 patients/yearBaseline median survival = 1 year

Historical distribution of treatment e�ects90% power for an expectedHR of 0.6

Accrual rate = 50 patients/year Accrual rate = 100 patients/year Accrual rate = 200 patients/year

Page 57: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather
Page 58: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather
Page 59: [width=0.2]logogustaveroussyrvb [width=0.2]logocesp2 ... › data › pages › ...Previous work (Bayar et al., 2016) • Consider a trial as part of a series of two-arm RCTs rather