28
Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Embed Size (px)

Citation preview

Page 1: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Statistical Analysisof IC50s

Nick Andrews, Statistics Unit CFI, HPA

Page 2: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Outline

• Curve fitting to estimate IC50

• Determining cut-offs for outliers (high IC50)

• Monitoring trends in IC50

Page 3: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Curve fitting

The data in each run consists of measuring the RFU of a sample against the antiviral at different concentrations (ten 4-fold dilutions) as well as with no-antiviral (virus-control)

Estimate IC50: the concentration at which the rfu is 50% of that from the virus control.

Page 4: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Sources of variation in IC50

• Curve fitting

• Within a run (2 – replicates)

• Between runs (Reference samples)

Page 5: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Point to Point (Excel)

Oseltamivir- Plate 1

0

5000

10000

15000

20000

25000

30000

35000

40000

45000

40

00

10

00

25

0

62

.50

15

.62

50

3.9

06

3

0.9

76

6

0.2

44

1

0.0

61

0

0.0

15

3

VC

Oseltamivir Conc (nM)

RF

U

292R 292K 119V A/Lisbon/22/2007

Page 6: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Smooth Curves (S-shaped Graph-pad)

Page 7: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Comparison-curve fitting

Sample P-P S-shape Ratio (Hi/Low)292R 0.69 0.69 1.00292K >4000 8551 -119V 40.3 40.7 1.01A/Lisbon22/22070.59 0.56 1.05

A/Latvia/685/2007 0.68 0.69 1.01A/Denmark/1/2007 0.43 0.41 1.05A/Denmark/2/2007 0.54 0.51 1.06A/Denmark/3/2007 0.54 0.48 1.12

Page 8: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Comments (Curve fitting)

• For most samples curve will make no more than about 5% difference to results.

• Must be clear exactly how the curve fitting and calculation of IC50 works – e.g. is IC50 based on 50% of OD of virus control or 50% of fitted upper asymptote.

• Need to ensure problems in curve fitting are flagged.

Page 9: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Between replicate variation (Point to point)

Sample Rep1 Rep2 Ratio(High/Low)

292R 0.75 0.63 1.19292K 3769 >4000 -119V 42.0 38.6 1.09A/Lisbon22/22070.57 0.61 1.07A/Latvia/685/2007 0.70 0.66 1.06A/Denmark/1/2007 0.40 0.46 1.15A/Denmark/2/2007 0.58 0.49 1.18A/Denmark/3/2007 0.54 0.53 1.02

Page 10: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Comments on replicates

• Between replicates in a run variation is greater than curve fitting with most results within about 15%-20%.

• Using replicates and taking the average reduces this effect.

• Large differences between replicates should be flagged (e.g. >30%).

Page 11: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Between run variation

Page 12: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Comparison runs (Reference Virus)

Sample Run1 Run2 Ratio(High/Low)

274H Osel 0.42 0.36 1.17274H Zan 0.17 0.22 1.29274Y Osel 300 303 1.01274Y Zan 0.31 0.37 1.19119V Osel 43.7 34.0 1.29199V Zan 1.41 1.32 1.07DB Osel 920 1316 1.43DB Zan 239 351 1.47

Page 13: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Comments

• Between runs variation is higher still at about 40%+. Hence retest those with high values and use controls in runs to identify problems in a run.

Page 14: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Determining Cut-offs

• The aim is to pick up outliers with high IC50 results as they may indicate resistant strains.

• Need a method to determine the ‘normal range’ in the absence of outliers.

• Various statistical methods may be used – the important thing is to ensure the outliers do not unduly affect the cut-off.

Page 15: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Methods used

• Need sufficient data points for initial estimate (about 30-50). These can be refined later with more data.

• Advisable to transform data to approximate normality (log-transform).

• Obtain a robust estimation of the Standard deviation (1.48*Median absolute deviation or using 0.75*Inter-quartile range).

• Use median + say 1.65, 3 SD.

Page 16: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Example: Initial 30 results

H3N2 Zanamivir 2006/7

0.10

1.00

10.00

0 30 60 90 120 150 180

Virus

IC50

Median=0.47 Robust SD (log-scale) = 0.17

Cut1 =1.07 Cut2=1.81

Page 17: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Example: Final results 2006/07

H3N2 Zanamivir 2006/7

0.10

1.00

10.00

0 30 60 90 120 150 180Virus

IC50

Median=0.48 Robust SD (log-scale) = 0.12

Cut1 =0.93 Cut2=1.35

Page 18: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Do we need to log-transform?

• Most results from dilution assays produce ‘geometric results’ so likely to be sensible

• Sometimes doesn’t seem to matter but sometimes data are skewed. (e.g. lower quartile much closer to median than upper quartile) – important to transform as robust methods assume data are normal once outliers are removed.

Page 19: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Non-log scale

H1N1 Zanamivir 2006/7

0.00

0.50

1.00

1.50

2.00

2.50

3.00

0 20 40 60 80 100

Virus

IC5

0

1.6

Page 20: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Log-scale (y-axis)H1N1 Zanamivir 2006/7

0.10

1.00

10.00

0 20 40 60 80 100

Virus

IC5

0

2.7

Page 21: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Alternative presentation – Box and Whisker Plot

Uses median + 1.5 * IQR (+2SD)Could be adjusted to match the scatter plots

3.0

H1N1 isolates 2004-2005 UK

1.4

2.4

2.1

1.8

2.3

1.95

0.0

0.5

1.0

1.5

2.0

2.5

3.0

Oseltamivir Zanamivir

IC

50 (nM

)

Page 22: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Monitoring ICD50 (outliers) over time• First look at the data (scatter plots/box-whisker)

• Calculate statistics for different time periods– Median– %Outliers– Geometric means with 95% CI with outliers removed– Kruskal Wallis test for medians– Calculate Geometeric Means with 95% CI and t-test/

anova (outliers removed) – regression for time trend (outliers removed).

Page 23: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Monitoring ICD50 (outliers) over time

• Perform statistical tests– Kruskal Wallis test for medians

– Chi-squared or Fishers exact test for % outliers

– Anova or t-tests for comparing means (oultiers removed). Alternatively look for non-overlapping 95% CI which is a conservative method (approx p<0.007)

Page 24: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Graphical Presentation – Box and Whisker

H1N1 isolates 2004-2007 EU+ UK: Oseltamivir

0.0

0.5

1.0

1.5

2.0

2.5

3.0

3.5

4.0

4.5

5.0

H1N1 EU+UK 04-05 H1N1 EU+UK 05-06 H1N1 EU+UK 06-07

IC50 (

nM

)

Page 25: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Scatter Plot

H1N1 Zanamivir 2005-2007

0.10

1.00

10.00

IC50

2005/06 2006/07

3SD

1.65 SD

Median

Page 26: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Scatter plotH3N2 Zanamivir 2005-2007

0.10

1.00

10.00

100.00

IC50

2005/06 2006/07

3SD

1.65 SD

Median

Page 27: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Statistics H3N2 Zanamivir NIMR

Year Samples >3SD (%) median Geomean (95% CI)2005/06 88 1 (1.1%) 0.67 0.69 (0.62-0.76)2006/07 173 4 (2.3%) 0.59 0.58 (0.56-0.61)

• P-value comparing %>3SD = 0.67• P-value comparing medians = 0.0002 – significant although

actual difference small.• 95% CI for Geometric means do not overlap• Note of caution: Changes may occur within a year so this

comparison maybe too simplistic. For example there appears to be some evidence of a change within 2006/07

Page 28: Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA

Summary

• Good use of statistical methods can help interpret the IC50 results and ensure assay results are reliable.

• Many appropriate methods already in place but more could be incorporated