25
EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT ENHANCEMENTS R. Burke Squires

EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT ENHANCEMENTS

R. Burke Squires

Page 2: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Pandemic H1N1 2009 Origin?

•  April / May 2009 –  Cases of an Influenza-like Illness (ILI) occurred in California,

Texas and Mexico •  New strain of influenza was found to be the cause.

–  Pandemic H1N1 2009 influenza virus - 1st pandemic strain of the 21st Century

•  Like many, we also wondered what the origin of the virus and its segments were.

•  Our analysis –  Different conclusions –  More accurate description of virus lineage

Page 3: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Original Analysis

¨  Reference strain ¤ A/California/04/2009

¨  BLAST ¤ Each segment against all flu sequences in IRD ¤ Return top 1000 hits (June 2009)

¨  Graph ¤ Nucleotide differences vs. isolation year differences

Page 4: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Nucleotide Diff. vs Isolation Year Diff. – Seg 5 (NP)

0

50

100

150

200

250

300

0 20 40 60 80 100

Nuc

leot

ide

Dif

fere

nce

(Nor

mal

ized

)

Isolation Year Difference (from 2009)

A  

C  

B  

Page 5: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

The Pattern Repeats

Segment  7  (M)  

Segment  8  (NS)  Segment  4  (HA)  

Segment  6  (NA)  

Page 6: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Characterizing Groups: Seg 5 (NP)

Group “A” Group “B”

Page 7: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Evolutionary Trajectory

Evolutionary Trajectory (E.T.)

Similar , Distantly Related

Page 8: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Evolutionary Trajectory Slopes vs. Mutation Rate

Segment E. T. Slope S.U.R. Slope Mutation Rate

PB2 6.8 24.9 4.3

PB1 7.6 26.9

PA 5.9 23.2

HA 5.5 28.8 5.7

NP 2.9 18.2 3.6

NA 3.8 23.1 3.2

M 1.3 5.6 1.5

NS 2.0 12.5 1.6

Page 9: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Recent Enhancements

¨  Dynamic BLAST of reference sequence ¨  Alignment of BLAST hits ¨  Distance matrix scoring of aligned sequences ¨  Estimation of genetic distance ¨  Selection of ancestral sequences

Page 10: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Dynamic BLASTing

¨  1000 BLAST Results – overkill ¨  BLAST reference sequence with increasing 50 hit

results ¤ Use default BLAST options ¤ BLAST against all influenza sequences isolated before

2009 ¨  Extract the oldest year of isolation from hits ¨  Stop when the oldest year of isolation does not

change between iterations ¨  Using CDS sequences; not full length sequences

Page 11: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Alignment of BLAST Hits

¨  Align all of the BLAST hits using MAFFT ¨  Order alignment by year of isolation

Page 12: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Distance Matrix

¨  Calculate distance matrix aligned sequences ¤ Utilizing Criscuolo &

Michel codon model ¤ DNAdistree

1 2 3 4 5 6 7 8 9

1 X X X X X X X X Y

2 X X X X X X X X X

3 X X X X X X X X X

4 X X X X X X X X X

5 X X X X X X X X X

6 X X X X X X X X X

7 X X X X X X X X X

8 X X X X X X X X X

9 Y X X X X X X X X

Criscuolo, A., & Michel, C. J. (2009). Phylogenetic Inference with Weighted Codon Evolutionary Distances. Journal of Molecular Evolution, 68(4), 377–392. doi:10.1007/s00239-009-9212-y

Page 13: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Genetic Distance Estimation

¨  Find lowest slope of genetic distance / isolation year difference when compared to reference sequence (2009)

¨  Use that genetic distance as a cut off, eliminating all sequences with greater genetic distance

¨  Use remaining sequences to estimate slope with exponential curve fit

Page 14: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Estimation of Genetic Distance / Year

Isolation Year Differences

Gen

etic

Dist

ance

Page 15: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Estimation of Genetic Distance / Year

Isolation Year Differences

Gen

etic

Dist

ance

Page 16: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Estimation of Genetic Distance / Year

Isolation Year Differences

Gen

etic

Dist

ance

Page 17: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Estimation of Genetic Distance / Year

Isolation Year Differences

Gen

etic

Dist

ance

Page 18: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Estimation of Genetic Distance / Year

Isolation Year Differences

Gen

etic

Dist

ance

Page 19: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Genetic Distance Per Year - Actual

y = 0.0125x0.5184 R² = 0.87804

0

0.02

0.04

0.06

0.08

0.1

0.12

0.14

0 20 40 60 80 100

Gen

etic

Dis

tanc

e (C

odon

mod

el)

Isolation Year Differences

Page 20: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Eliminate Non-ancestral Sequences

¨  Use genetic distance per year curve fit to eliminate sequences in distance matrix

1 2 3 4 5 6 7 8 9

1 X X X X X Y

2 X X X

3 X X X X X X X

4 X X X X X

5 X X

6 X X X

7 X X X X

8

9 Y X X X X

Page 21: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Determine Ancestral Sequences

¨  Sum number of sequences in a row of matrix

¨  Examine distribution of sums to determine ancestral sequences

¨  Fill-in gaps with hypothetical ancestor for each nucleotide change

1 2 3 4 5 6 7 8 9

1 X X X X X Y

2 X X X

3 X X X X X X X

4 X X X X X

5 X X

6 X X X

7 X X X X

8

9 Y X X X X

Page 22: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Final Analysis

¨  Unfortunately, we will have to wait for the publication

¨  Prospectively linked 2009 H1N1 Pandemic sequence to 1918 “Spanish Flu” pandemic

Page 23: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Summary

¨  Developed the Evolutionary Trajectory Analysis ¨  Data-driven approach to ancestry ¨  Ancestry = similarity component + time component

Page 24: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

Acknowledgements

¨  JCVI / Influenza Research Database (IRD) ¤ Richard H Scheuermann ¤ Brett Picket

¨  UT Southwestern / IRD ¤ Jyothi Noronha ¤ Victoria Hunt

¨  Erasmus MC / SMU ¤ Elizabeth McClellan My thanks to JCVI / IRD for travel support

Page 25: EVOLUTIONARY TRAJECTORY ANALYSIS: RECENT …

R. Burke Squires NIAID, Bioinformatics & Computational Biosciences Branch (BCBB) [email protected]

Q & A