58
“Sequencing cancer genomes” Phil Stephens Wellcome Trust Sanger Institute

“Sequencing cancer genomes” - Illumina

  • Upload
    others

  • View
    8

  • Download
    0

Embed Size (px)

Citation preview

Page 1: “Sequencing cancer genomes” - Illumina

“Sequencing cancer genomes”

Phil Stephens

Wellcome Trust Sanger Institute

Page 2: “Sequencing cancer genomes” - Illumina

in situ cancer

Invasive cancer

Metastatic cancer

Pre-cancerous

lesion

Genomic instability

Genetic Instability

~10 ‘Driver mutations’

1000s ‘Passenger Mutations’

ExposureUV light

Tobacco smoke

Breast, Lung, Colorectal cancers etc.....

Page 3: “Sequencing cancer genomes” - Illumina

Base Substitutions Insertions deletions

Copy number changes Rearrangements

Cancer genome harbour multiple classes of mutation

Page 4: “Sequencing cancer genomes” - Illumina

Five years of data per day vs capillary sequencing

Detect all classes of mutation in a single experiment

Screen the entire genome for mutations

Screen targeted regions: Coding exons

Illumina Genome Analyser

Sequencing by synthesis

Page 5: “Sequencing cancer genomes” - Illumina

• 33,345 somatic substitutions

• 88% sensitivity

• 97% specificity

• 31 Somatic rearrangements

• 41 Copy number changes

• 36 Small insertions & deletions

Comprehensive catalogue of somatic mutations

from a malignant melanoma genome

Page 6: “Sequencing cancer genomes” - Illumina

Substitution mutation spectrum

Total

Mutations

Early

Mutations

Late

Mutations

C>T common due to UV light exposure

C>T 82%

C>T 53%

P <0.0001C>A 2%

C>A 19%

Page 7: “Sequencing cancer genomes” - Illumina

Somatic mutations in genic footprints

Expected = 40%

Observed = 29%

Distribution of somatic substitutions

Page 8: “Sequencing cancer genomes” - Illumina

Stalled RNA

polymerase

Bulky adduct on

transcribed strand

mRNA

Transcription-coupled repair

Page 9: “Sequencing cancer genomes” - Illumina

Strand bias:- Somatic C>T substitutions

T

U4000

3000

2000

1000

0

Num

ber

of

muta

tion

s

P <0.0001

Page 10: “Sequencing cancer genomes” - Illumina

Strand bias:- ALL somatic substitutions

T

U4000

3000

2000

1000

0

Num

ber

of

muta

tion

s

P <0.0001

Page 11: “Sequencing cancer genomes” - Illumina

Transcription coupled repair only accounts

for only a third of the deficit of mutations

over protein coding gene footprints

Page 12: “Sequencing cancer genomes” - Illumina

Novel „expression linked‟ repair mechanismM

uta

tion p

revale

nce (

per

Mb)

30

20

10

0

Expression levels (log2)

Effect of strand of mutation: p<0.0001

Effect of expression level: p<0.0001

C>T non-transcribed

strandC>T transcribed

strand

4 5 6 7 8 9

Page 13: “Sequencing cancer genomes” - Illumina

Novel „expression linked‟ repair mechanismM

uta

tion p

revale

nce (

per

Mb)

30

20

10

0

Expression levels (log2)

Effect of strand of mutation: p<0.0001

Effect of expression level: p<0.0001

C>T non-transcribed

strandC>T transcribed

strand

4 5 6 7 8 9

Page 14: “Sequencing cancer genomes” - Illumina

Feedback for our first cancer genome

Page 15: “Sequencing cancer genomes” - Illumina

Objections to sequencing cancer genomes:

1) “Cancer is too heterogeneous for sequencing to uncover

therapeutically relevant mutations”.

2) “All of the major cancer genes/pathways have already

been discovered”

3) “Sequencing large numbers of cancer genomes is not the

best use of the technology, and will have very limited

clinical use”

4) “Sequencing is unlikely to provide insights into

metastasis, the dominant cause of mortality in cancer”

Page 16: “Sequencing cancer genomes” - Illumina

Objections to sequencing cancer genomes:

1) “Cancer is too heterogeneous for sequencing to uncover

therapeutically relevant mutations”.

2) “All of the major cancer genes/pathways have already

been discovered”

3) “Sequencing large numbers of cancer genomes is not the

best use of the technology, and will have very limited

clinical use”

4) “Sequencing is unlikely to provide insights into

metastasis, the dominant cause of mortality in cancer”

Page 17: “Sequencing cancer genomes” - Illumina

BRAF is an oncogene 50% of Melanoma

T1799A V600E

BRAF is not important Melanoma

BRAF is not a good drug target

Melanoma is too complex to treat effectively

Page 18: “Sequencing cancer genomes” - Illumina

Selective inhibitor of BRAF

(Plexxicon 4032)

Before 15 days after

Courtesy of Dr Grant McArthur

Page 19: “Sequencing cancer genomes” - Illumina

Similar results are being seen with:

PARP inhibitors BRCA1/BRCA2 null breast cancer

EGFR inhibitors EGFR mutated lung cancer

ABL inhibitors BCR/ABL positive CML

Page 20: “Sequencing cancer genomes” - Illumina

Objections to sequencing cancer genomes:

1) “Cancer is too heterogeneous for sequencing to uncover

therapeutically relevant mutations”.

2) “All of the major cancer genes/pathways have already

been discovered”

3) “Sequencing large numbers of cancer genomes is not the

best use of the technology, and will have very limited

clinical use”

4) “Sequencing is unlikely to provide insights into

metastasis, the dominant cause of mortality in cancer”

Page 21: “Sequencing cancer genomes” - Illumina

Investigate all protein coding exons/miRNA’s simultaneously

● Protein coding genes 21,416

● Micro RNAs 1664

Can process large numbers of samples

High sensitivity in primary tumours (normal contamination)

Cancer Exome Sequencing

Human exome

Page 22: “Sequencing cancer genomes” - Illumina

18 samples from a common epithelial tumour

9/18 had a truncating mutation in a novel TSG

136/388 follow up tumours had truncating mutations

in same gene

Potentially „druggable‟ using a cheap, approved,

existing cancer drug

Cancer Exome Sequencing

Page 23: “Sequencing cancer genomes” - Illumina

3 x ER Negative Breast cancers

25 x ERPositive Breast cancers

Cancer Exome Sequencing

Page 24: “Sequencing cancer genomes” - Illumina

Numbers of somatic substitutions per sample

ER negative

Mean 75, (range 29-100)

Page 25: “Sequencing cancer genomes” - Illumina

Numbers of somatic substitutions per sample

ER positive

Mean 19.7, (range 1-45)

Page 26: “Sequencing cancer genomes” - Illumina

Somatic subs in known breast cancer genes

PD3995a AKT1 E17K

PD3995a NF1 G-1T

PD3994a PIK3CA N345K

PD3989a PIK3CA E545K

PD3856a PIK3CA H1047R

PD3857a PIK3CA H1047R

PD3888a PIK3CA H1047R

PD3983a PIK3CA H1047R

PD3985a PIK3CA H1047R

PD3992a PIK3CA H1047R

PD3996a PTEN Y27D

PD3991a TP53 G245S

PD4002a TP53 H179Y

PD3987a TP53 Y220C

PD3986a TP53 G+1A

PD3985a TP53 R306X

Page 27: “Sequencing cancer genomes” - Illumina

PD3849a CDH1 V193X

PD3984a MAP2K4 V151X

PD3992a MAP2K4 I81X

PD3989a PTEN L370X

PD3995a GATA3 N352X

PD3988a GATA3 N352X

PD4004a GATA3 Read through

Somatic indels in known breast cancer genes

Page 28: “Sequencing cancer genomes” - Illumina

638 somatic substitutions

38 small INDELS

Page 29: “Sequencing cancer genomes” - Illumina

Mutations in a „novel pathway‟

in >60% of ER+ breast cancer

Page 30: “Sequencing cancer genomes” - Illumina

Several potential new susceptibility alleles are

being further evaluated

CHEK2 BRCA1

W411X 4 bp deletion

Exome sequencing & susceptibility alleles

Page 31: “Sequencing cancer genomes” - Illumina

Objections to sequencing cancer genomes:

1) “Cancer is too heterogeneous for sequencing to uncover

therapeutically relevant mutations”.

2) “All of the major cancer genes/pathways have already

been discovered”

3) “Sequencing large numbers of cancer genomes is not the

best use of the technology, and will have very limited

clinical use”

4) “Sequencing is unlikely to provide insights into

metastasis, the dominant cause of mortality in cancer”

Page 32: “Sequencing cancer genomes” - Illumina

Rearrangement characterisation

Page 33: “Sequencing cancer genomes” - Illumina

73 Chromosomes, 37 structural abnormalities

Breast cancer karyotype

Page 34: “Sequencing cancer genomes” - Illumina

238 somatic structural variants Patterns of variation

Breast cancer HCC38 (Triple neg)

150

100

50

0

Page 35: “Sequencing cancer genomes” - Illumina

What are these structural variants doing?

Page 36: “Sequencing cancer genomes” - Illumina

T N

Chromosome 3 position (Mb)

48 48.2 48.4 48.6 48.8 49 49.2 49.4

Co

py n

um

be

r

6

5

4

3

2

1

0

T N

FISH confirmation of

tandem duplicationSLC26A6/PRKAR2A in frame fusion gene

5’ UTR 3’ UTR

SLC26A6 PRKAR2A

(Exons 1-17) (Exons 4-11)

600

200

400

bps

164 kb tandem duplication Genomic PCR

Page 37: “Sequencing cancer genomes” - Illumina

RT-PCR

MGLADASGPRDTQALLSATQAMDLRRRDYHMERPLLNQEHLEELGRWGSAPRTHQWRTWLQCSRARAYALLLQHLPVLVWLPRYPVRDWLLGDLLSGLSVAIMQLPQGLAYALLAGLPPVFGLYSSFYPVFIYFLFGTSRHISVGTFAVMSVMVGSVTESLAPQALNDSMINETARDAARVQVASTLSVLVGLFQVGLGLIHFGFVVTYLSEPLVRGYTTAAAVQVFVSQLKYVFGLHLSSHSGPLSLIYTVLEVCWKLPQSKVGTVVTAAVAGVVLVVVKLLNDKLQQQLPMPIPGELLTLIGATGISYGMGLKHRFEVDVVGNIPAGLVPPVAPNTQLFSKLVGSAFTIAVVGFAIAISLGKIFALRHGYRVDSNQELVALGLSNLIGGIFQCFPVSCSMSRSLVQESTGGNSQVAGAISSLFILLIIVKLGELFHDLPKAVLAAIIIVNLKGMLRQLSDMRSLWKANRADLLIWLVTFTATILLNLDLGLVVAVIFSLLLVVVRTQMPHYSVLGQVPDTDIYRDVAEYSEAKEVRGVKVFRSSATVYFANAEFYSDALKQRCGVDVDFLISQKKKLLKKQEQLKLKQLQKEEKLRKQAASPKGASVSINVNTSLEDMRSNNVEDCKMVIHPKTDEQRCRLQEACKDILLFKNLDQEQLSQVLDAMFERIVKADEHVIDQGDDGDNFYVIERGTYDILVTKDNQTRSVGQYDNRGSFGELALMYNTPRAATIVATSEGSLWGLDRVTFRRIIVKNNAKKRKMFESFIESVPLLKSLEVSERMKIVDVIGEKIYKDGERIITQGEKADSFYIIESGEVSILIRSRTKSNKDGGNQEVEIARCHKGQYFGELALVTNKPRAASAYAVGDVKCLVMDVQAFERLLGPCMDIMKRNISHYEEQLVKMFGSSVDLGNLGQStop

Predicted 914 amino acid fusion protein

T N

600

200

400

bps

Page 38: “Sequencing cancer genomes” - Illumina

Five expressed in frame fusion genes

2 generated by tandem duplications

3 generated by large inversions

Page 39: “Sequencing cancer genomes” - Illumina
Page 40: “Sequencing cancer genomes” - Illumina

Potential applications in healthcare

Page 41: “Sequencing cancer genomes” - Illumina

Selective inhibitor of BRAF V600E

(Plexxicon 4032)

Before 15 days after

Courtesy of Dr Grant McArthur

Page 42: “Sequencing cancer genomes” - Illumina

Personalised Haematology

Time (months)

Risk stratification

Therapy durationPrediction of

relapse

Tu

mo

ur

ce

lls

Accurate ascertainment of tumour burden in

response to chemotherapy

Page 43: “Sequencing cancer genomes” - Illumina

~two hundred tumours and identified at least

one somatically acquired rearrangement in

sample (range 1-245)

Solid tumour rearrangement screens

Page 44: “Sequencing cancer genomes” - Illumina

Identify tumour-specific rearrangements

from cancer DNA

Design & test quantitative assay for

rearrangements

Extract DNA from plasma

(10-20mL blood sample)

Quantify tumour DNA

2 weeks

~£2,500

1 week

£250

½ day

£50

½ day

£50

Work flow

Page 45: “Sequencing cancer genomes” - Illumina

Detecting 1 copy of tumour genome

Page 46: “Sequencing cancer genomes” - Illumina

Serial measurements

Page 47: “Sequencing cancer genomes” - Illumina

Potential healthcare applications

• Monitoring tumour response to therapy in real-time

– Reduce toxicity, prevent drug wastage

• Identifying disease relapse before clinically evident

– Pre-emptive therapy

• Choosing intensity of adjuvant therapy based on risk

stratification

• Surrogate marker of cell kill in early phase clinical trials

Page 48: “Sequencing cancer genomes” - Illumina

Objections to sequencing cancer genomes:

1) “Cancer is too heterogeneous for sequencing to uncover

therapeutically relevant mutations”.

2) “All of the major cancer genes/pathways have already

been discovered”

3) “Sequencing large numbers of cancer genomes is not the

best use of the technology, and will have very limited

clinical use”

4) “Sequencing is unlikely to provide insights into

metastasis, the dominant cause of mortality in cancer”

Page 49: “Sequencing cancer genomes” - Illumina

Dynamics of metastasis in pancreatic cancer

Patient Patient Patient

1 5 9

PCR genotype multiple

metastases

PCR genotype multiple

metastases

PCR genotype multiple

metastases

Page 50: “Sequencing cancer genomes” - Illumina

Ind

ex

No

rmal

1 2 3 4 5 Wate

r

Shared in all metastases

Partially shared

Unique to index metastasis

Dynamics of metastasis in pancreatic cancer

Page 51: “Sequencing cancer genomes” - Illumina

Inter-individual differences

0

5

10

15

20

25

30

35

40

45

50

Private

Partially shared

Omnipresent

Page 52: “Sequencing cancer genomes” - Illumina

Phylogenetic relationships between

metastases within same patient

Rearrangements 1-56

Index

4

5

1

2

3

Primary

Germline

Page 53: “Sequencing cancer genomes” - Illumina

Phylogenetic tree

Page 54: “Sequencing cancer genomes” - Illumina

Evidence for convergent evolution

Page 55: “Sequencing cancer genomes” - Illumina

Organ-specific signature

Page 56: “Sequencing cancer genomes” - Illumina

Dynamics of genomic instability and metastasis

Primary tumour

Capacity to initiate

metastasis

Secondary metastasis

Tertiary metastasis

Molecular time

Page 57: “Sequencing cancer genomes” - Illumina

Conclusions

1) “Although cancer is heterogeneous, sequencing is an

effective tool to uncover therapeutically relevant

mutations”.

2) “There are major cancer genes and pathways yet to be

discovered”

3) “Sequencing large numbers of cancer genomes has

massive clinical utility and will transform the way that

cancer is treated and patients are managed”

4) Sequencing can generate ‘dramatic insights’ into

modes and mechanisms of metastasis

Page 58: “Sequencing cancer genomes” - Illumina

Dave McBride Mike Quail

Meng-lay lin Dan Turner

Patrick Tarpey Harold Swerdlow

Stuart Mclaren

Keiran Raine David Bentley

Dave Jones Keira Cheetham

Ignacio Varela Mark Ross

Laura Mudie

Lucy Stebbings

Catherine Leroy

Jon Teague