Network Meta-analysis, Indirect Comparisons€¦ · Tony Ades with thanks to Debbi Caldwell, Sofia Dias, Guobing Lu, Nicky Welton Academic Unit of Primary Health Care, Department

GMDS Jahrestagung

Essen, September 7-10, 2009

Network Meta-analysis, Indirect Comparisons:

are they so controversial ?

Tony Ades

with thanks to

Debbi Caldwell, Sofia Dias, Guobing Lu, Nicky Welton

Academic Unit of Primary Health Care,

Department of Community Based Medicine

Network meta-analysis

1. What is it ?

2. Is it complicated ?

3. Does it make difficult assumptions ?

4. Is it biased ?

Pair-wise meta-analysis

One or more trials comparing treatments B and C

Inferences about relative treatment effect dBC

B C

Indirect Comparisons

One or more trials comparing treatments A and B

One or more trials comparing treatments A and C

Inferences about relative treatment effect dBC ????

.

A

C

B

Indirect Comparisons

One or more trials comparing treatments A and B

One or more trials comparing treatments A and C

Simultaneous Inference about relative treatment effects

dAB, dAC, dBC

A

C

B

Indirect Comparisons: K treatments

Simultaneous inference on all K(K-1)/2 pair-wise contrasts

dAB, dAC, dAD dAE, dBC, dBD dBE dCD dCE , dDE

A

C

B

D

EA

B D

C

E

Mixed Treatment Comparisons = Multiple

treatment comparisons = Network Meta-analysis

A

C

B

D

E

Any connected

network:

simultaneous

inference on all

K(K-1)/2 contrasts.

Combines “direct” and

“indirect” evidence on

each contrast.

Loops: possibility of

“inconsistency”.

Example: Smoking Cessation

(Hasselblad, 1998)

• 24 trials, 4 treatments, and 50 data points (two 3-arm trials, the remaining 2-arm trials)

• Treatments:

A: No Contact

B: Self-Help

C: Individual Counselling

D: Group Counselling

Smoking Cessation Data Structure

1 trial

1 trial

2 trials

15 trials

1 trial

1 trial

1 trial

2 trials

A C D

B C D

A B

A C

A D

B C

B D

C D

MTC model for 4 treatments

Binomial data, treatments A,B,C,D

Data informing all 6 contrasts, AB, AC, AD, BC, BD, CD

Take A as the reference treatment

Make dAB, dAC, dAD the basic parameters, to be estimated.

Remaining contrasts are functional parameters:

dBC= dAC – dAB

dBD= dAD – dAB ”Consistency equations”

dCD= dAD - dAC

MTC model for 4 treatments

Binomial data, treatments A,B,C,D

Data informing all 6 contrasts, AB, AC, AD, BC, BD, CD

Take A as the reference treatment

Make dAB, dAC, dAD the basic parameters, to be estimated.

Remaining contrasts are functional parameters:

dBC= dAC – dAB

dBD= dAD – dAB ”Consistency equations”

dCD= dAD - dAC

A B C D

Is it complicated ?

Model for pair-wise meta-analysis. Treatments k, Trials j, comparing treatments A and B:

, ,2

,

, , ,2

Link Function: Logit( ) ( )

Random Effect: ~ ( , )

Likelihood: ~ Binomial( , )

Priors: , , ~ ( , )

j k j j AB

j AB AB

j k j k j k

j AB

p I k A

N d

r p n

d Dist

treatment effect

Is it complicated ?

Model for MTC. Trials j, treatments k in {A,B,C,..…..S }: comparing treatments b and k:

, , ,2

, ,

, , ,2

Link Function: Logit( ) ( )

Random Effect: ~ ( , )

Likelihood: ~ Binomial( , )

Priors: , , ....... , ~ ( , )

j k j j b k

j b k Ak Ab

j k j k j k

j AB AC AS

p I k b

N d d

r p n

d d d Dist

K-1 treatment effects: “basic” parameters

Same code for MTC and pair-wise meta-

analysis

A single program code can estimate

• Pair-wise meta-analysis

• Indirect comparisons

• Mixed Treatment Comparisons

= Network Meta-analysis

• Multi-arm trials

• Any combination of the above

Does MTC require difficult assumptions ?

Pair-wise meta-analysis assumes that the trial specific treatment effects are “exchangeable”

MTC assumes that the trial specific treatment effects

are “exchangeable” .

…. meaning that, if ALL the trials had included ALL the treatments, then each trial would have estimated the same, exchangeable effects.

MTC is based on a “missing at random” assumption

Consistency not an additional assumption: but follows from exchangeability

,j AB

, , ,, ,j AB j AC j AD

Is it biased ?

“Between-trial [ ie Indirect] comparisons are

unreliable. Patient populations may differ in

their responsiveness to treatment. Therefore

an apparently more effective treatment may

have been tested in a more responsive

population”Cranney, Guyatt et al. Endocr Rev 2002,

23; 570-8. Summary of meta-analyses of

therapies for postmenopausal osteoporosis

Is it biased ?

“Placebo controlled trials lacking an active control give little useful information about comparative effectiveness. Such information cannot reliably be obtained from cross-study comparisons, as the conditions of the studies may have been quite different”

International Council of Harmonisation E10 2.7.1.4

IS it biased ?

“Indirect comparisons are observational studies

across trials, and may suffer the biases of

observational studies, for example confounding”

Cochrane Handbook for systematic reviews of interventions 4.2.4.

Cochrane Library Issue 2

… But Victor, Egger, and Moher have each claimed

that pair-wise M-A is “observational”.

Are indirect estimates vulnerable to bias ?

If is an unbiased estimate of dAC, and

is an unbiased estimate of dAB, then

must be an unbiased estimate of dBC

“Indirect” can only be biased if “direct” is biased !

ˆ ˆ ˆIndirect Direct Direct

BC AC ABd d d

ˆDirect

ACd

ˆDirect

ABd

ˆ Indirect

BCd

“…. But suppose the AC and the AB trials were

on different patient populations? “

If we want a pair-wise meta-analysis to provide an

estimate of dBC in a patient group X, then we

should include only trials on group X.

If we want an indirect comparison to provide an

estimate of dBC in a patient group X, then we

should include only trials on group X.

“…. But suppose there are UNRECOGNISED

treatment modifiers in the AC trials, and not in the AB

trials ? “

Good question!

Heterogeneity is very common in pair-wise meta-analysis, and indicates the presence of often unrecognised relative effect modifiers (UREMs)

At the same time, the concern with indirect comparisons is that AB and AC trials might differ with respect to the presence UREMs.

Perhaps the heterogeneity issue in MA and the ‟bias‟ issue in IC are the same problem ?

We live in a world with UREMs !

… that is why we discover “heterogeneity” in

meta-analysis so often

Two kinds of UREM

(a) „Fixed‟ bias due to a trial-level treatment-by-covariate interaction: Some trials estimate , others h

… external validity problem

(b) „Random bias‟ associated with markers for low quality. Trials without such markers estimate .

With, they estimate b, b ~ N(B,2B)

… internal validity problem

„thought experiments‟

Suppose there is an Unrecognised Relative Effect Modifier

(treatment-by-trial interaction) with effect h and it is

present in a proportion p of infinitely large trials:

Or suppose there was a random UREM b ~ N(B,2B)

What estimate would we expect from a meta-analysis, or an

indirect comparison. How might the estimate deviate from

the target parameter ?

With „fixed‟ UREMs, what IS the

target parameter ?

UREM effect size: h

Treatment effects : no UREM , with UREM

UREM present in a proportion of trials: p

Target parameter *: !!!!!

* The pooled average.

Between trials Variance

(1 ) ( ) p p h ph

2(1 )p p h

h

Thought experiment (1):

Pair-wise Meta-analysis, N=1A meta-analysis with N=1 trials: UREM=h, with prob. p = 0.5

Target parameter, the pooled effect = +h /2

Long-run consideration of estimates from N=1

• 50% of the time it estimates

• 50% of the time it estimates + h

• Expected absolute bias = h / 2

• Probability that the trial estimates pooled effect target = 0

In a world with UREMs, a meta-analysis with N=1

(ie an RCT) is ALWAYS BIASED

Pair-wise Meta-analysis, N=2

A meta-analysis with N=2 trial: UREM=h, with prob. p = 0.5

Target parameter = +h /2

Four possible outcomes:

Prob Trial 1 Trial 2 Expected Bias:

MA - Target

Expected

Abs Bias

1 .25 ( h/2) h/2

2 .25 h ( h/2) ( h/2) 0

3 .25 h ( h/2) ( h/2) 0

4 .25 h h h ( h/2) h/2

Expected Absolute Bias h/4

Pair-wise Meta-analysis, N=3

A meta-analysis with N=3 trial: UREM=h, with prob. p = 0.5

Target parameter = +h /2

Eight possible outcomes:

Prob 3 trials Expected Bias:

MA - Target

Expected

Abs Bias

1 .125 , , , ( h/2) h/2

2 .375 , , h ( h/3) ( h/2) h/6

3 .375 , h, h ( 2h/3) ( h/2) h/6

4 .125 h, h, h h ( h/2) h/2

Expected Absolute Bias h/4

Pair-wise Meta-analysis Absolute Bias, Pooled Effect Target

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0 5 10 15 20 25 30 35 40

Number of Trials

Ex

pe

cte

d A

bs

olu

te B

ias

, in

Un

its

of

Bia

s

p =0.5

Pair-wise Meta-analysis Absolute Bias, Pooled Effect Target

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0 5 10 15 20 25 30 35 40

Number of Trials

Ex

pe

cte

d A

bs

olu

te B

ias

, U

nit

s o

f B

ias

p=0.5

p=0.4, 0.6

p=0.3, 0.7

p=0.2, 0.8

p=0.1, 0.9

Indirect comparisons – How might UREM work ?

Suppose :

1. A is „placebo‟, B and C active treatments in same

class.

2. A UREM acts on B and C equally, but not A.

Therefore UREM acts on dAB and dAC, but not dBC;

In BC trials it acts the same on each arm

Indirect Comps – 3rd comparator “similar”

We have trials of A vs C, B vs C, and wish to make inferences

about A vs B: dAB = dAC – dBC

UREM h present with probability 0.5, Target = dAB+h/2

In the absence of UREM,

AB trials estimate dAB, IC estimates dAC-dBC = dAB

In the presence of UREM,

AB trials estimate dAB +h, IC estimates dAC+h – dBC = dAB+h

Now consider IC syntheses with N AC and N BC trials ….

One AC trial & One BC trial, UREM=h, p=0.5

Four possible outcomes: Pooled effect Target: dAB+h/2

Prob AC trial BC trial Indirect

Comparison

Equal to Abs Bias

1 0.25 No UREM No UREM dAC-dBC dAB h / 2

2 0.25 No UREM UREM dAC-dBC dAB h / 2

3 0.25 UREM No UREM dAC+ h -dBC dAB + h h / 2

4 0.25 UREM UREM dAC+ h -dBC dAB + h h / 2

Expected absolute bias h / 2

Pairwise meta-analysis and Indirect Comparisons. p =0.5

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0 10 20 30 40

Number of Trials per comparison

Exp

ecte

d A

bso

lute

Bia

s,

un

its o

f h

Pair-wise MA

AND IC, 3rd comparator „similar‟ dAB = dAC – dBC

IC, 3rd comparator „different‟

dBC = dAC – dAB

Random effect modifiers.

Random bias has been associated with indicators of lower study quality (Schultz 1995)

Welton, Sterne: JRSS(A), 2009. Trials with lower quality indicators estimate: dAB + b, b ~ Normal(B,2)

What is known about this?

• Higher between-trials variation “lower quality” RCTs

• B depends on the meta-analysis.

• On average ….equivalent to an OR= 1.6

• .

• Over-estimates in favour of „newer‟ treatment

B 0.50.25

Random bias in MA, N=2, mean bias B, p=0.5

Prob Trial 1 Trial MA result Mean, var

of random

bias in MA

Exp Abs

Bias*

1 0.25 No UREM No UREM d, d 0, 0 0

2 0.25 No UREM UREM d, d+b B/2, 2/4 0.257 *

3 0.25 UREM No UREM d +b, d B/2, 2/4 0.257 *

4 0.25 UREM UREM d+b, d+b B, 2/2 0.502 *

Expected absolute bias 0.254

* By numerical integration

How might quality-related random bias work in

MA and IC ?

1. Bias operates in favour of “newest”, in „vulnerable‟ trials.

2. Assume treatments in chronological order A, B, C

3. Indirect comparison: infer BC effect from AC and AB trials.

“lower quality” AB trials are biased to favour B,

“lower quality” AC trials favour C.

dBC= dAC+bj – (dAB+bk), bj, bk ~ N(B, 2)

MEAN Biases CANCEL, but Random elements do not

… Song (J Clin Epi 2008) argues that IC may be LESS biased than direct comparisons

Pair-wise meta-analysis and Indirect Comparisons, Quality-

Related Random Bias with p=0.5, =0.25

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0 5 10 15 20 25 30 35 40

Number of Trials

Exp

ecte

d A

bso

lute

Bia

s

Pair-wise MA

Indirect comparisons dBC=dAC-dAB

bj~N(B,2)

B=0.5

B=0.25B=0

B=0.5

B=0.25

B=0

But HEY! …. That‟s when biases SUBTRACT:

….. suppose they ADD !!

1. Bias operates in favour of “newest”, in „vulnerable‟

trials.

2. But now: infer AC effect from AB and BC trials.

“lower quality” AB trials are biased to favour B,

“lower quality” BC trials favour C.

dAC = dAB+bj + (dBC +bk), bj, bk ~ N(B, 2)

MEAN Biases Now ADD.

Quality-related Random Bias: Pairwise meta-analyis and Indirect

Comparisons based on ADDITION. p=0.5, =0.25

0

0.1

0.2

0.3

0.4

0.5

0.6

0 5 10 15 20 25 30 35 40

Number of Trials

Exp

ecte

d A

bso

lute

Bia

s

Indirect comparisons dAC=dAB+ dBC

B=0.5

B=0.25

B=0

Pair-wise MAB=0.5

B=0.25

B=0

bj~N(B,2)

Findings

1. „Error of generalisation‟ (bias ?) in pair-wise MA under UREM qualitatively same as „bias‟ in IC.

Findings


2. Quantitatively, also comparable. With FIXED UREMs, bias in IC either identical to Pair-wise MA, or a little worse.

Findings


2. Quantitatively, also comparable. With FIXED UREMs, bias in IC either identical to Pair-wise MA, or a little worse.

3. Random “quality-related” UREMs: IC can be better than MA when subtracting effects, but far worse when adding !

Findings

4. Think of single trials as MA‟s with N=1.

When we use RCTs to make predictions about the future,

in a world with UREMs, where it is hard to duplicate RCT

conditions, RCTs are effectively „observational‟ …

…. (but still far better than non-randomised studies!)

Findings






5. If heterogeneity in MA, never say “we need one (or more) big trial(s)”. They won‟t help !

Findings






5. If heterogeneity in MA, never say “we need one (or more) big trial(s)”. They won‟t help !

6. Similarly, if you are worried ICs are biased because of UREMs, don‟t say “we need a head-to-head trial”. It will be biased / un-interpretable / un-generalisable, too.

In conclusion

Pair-wise meta-analysis is a special case of

network meta-analysis, with K=2 treatments.

Every time we ask a new question about

network meta-analysis, we learn something

new about pair-wise meta-analysis

Documents

Network Meta-analysis, Indirect Comparisons€¦ · Tony Ades with thanks to Debbi Caldwell, Sofia Dias, Guobing Lu, Nicky Welton Academic Unit of Primary Health Care, Department