40
Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow Benson, MD, PhD; Xing Li, PhD; Aoy Tomita-Mitchell, PhD; Timothy J. Nelson, MD, PhD; James S. Tweddell, MD; Michael E. Mitchell, MD September 2013 Research Update September 2013 Research Update

Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Embed Size (px)

Citation preview

Page 1: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Transcriptome Profiling of Human Cardiac

Tissues in Hypoplastic Left Heart SyndromeKarl D. Stamm, MS

Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow Benson, MD, PhD; Xing Li, PhD; Aoy Tomita-Mitchell, PhD; Timothy J. Nelson, MD, PhD; James S. Tweddell, MD; Michael E. Mitchell, MDSeptember 2013 Research UpdateSeptember 2013 Research Update

Page 2: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Overview• Medical Research• Trouble with humans• Rare diseases are common in a large enough population

• Next-Generation Sequencing Tech• Illumina HiSeq methodology• Differential expression

• Further Mining• Principle components analyses• Gene profiles and the self-organizing-map

Page 3: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Trouble with Humans• Small sample sizes• Low statistical power• High interpersonal variability• Ethnic backgrounds imply metabolic differences

• Phenocopy• Multiple distinct diseases showing identical presentation• Confounds clustering or association studies• Ruins Case/Control study power

• PHI – Private/Protected Health Information• Data security is paramount• Cross-disciplinary collaborations are limited• DNA is theoretically but not practically identifiable

Page 4: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Congenital Heart Defect Incidence• Down Syndrome 1:700 live births• 50-60% have some structural heart defect

• 22qD Syndrome 1:4000 live births• 75-90% have some structural heart defect

• ‘Healthy’ 99:100 live births• 0.8% have some structural heart defect

Proportion Explained:

Page 5: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

C.H.D. in particularHypoplastic Left Heart Syndrome

All sequencing costs for this study provided by

1 in 40 CHD cases are HLHS2.5 : 10000 of all births•Complex developmental disorder•100% fatal before the invention of the Norwood Procedure 1981•No multigenerational pedigrees•Spontaneous mutation: immune to detection by genetic linkage

Page 6: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow
Page 7: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Generate Reads – Illumina Tech

10 to 500 million short reads are generated in pairs, 2x50 to 2x100 bp each.http://seqanswers.com/forums/showthread.php?t=21

Page 8: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Align Reads to Reference

• Which one? • NCBI #37.3 has 3.1 billion bases across 190 contiguous scaffolds• UCSC hg19 has 3.2 billion bases across 163 contiguous scaffolds• Haploid reference contains disease alleles and chimeric sequence

like an A+B+O blood type.

Image of patches modifying the CHR17 reference from 2011 according to Ensemblhttp://www.ensembl.info/blog/2011/05/20/accessing-non-reference-sequences-in-human/

Page 9: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Millions of Variants• The 1000 Genomes project found 38 million SNPs, 1.4 million

short insertions or deletions, and more than 14 thousand larger deletions

• The NHLBI Exome Sequencing Project targeted 22MBases across 2,440 individuals and found 563,700 variants, 82% of which were novel. They averaged 200 novel, coding mutations per person.

• We find about 150-300 thousand SNVs in an exome, 10% of which are nonsynonymous

• SAMTOOLS is the software of choice for variant calling relative to your reference genome.

Page 10: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

• CCG/Proline -> CTG/Leucine • HOPX is a gene known to regulate heart development!

Page 11: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

• Very common mutation

Page 12: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

RNA-Seq vs. Whole Genome

1. Extract and purify mRNA by polyadenylation

2. Convert spliced mRNA to DNA fragments

3. Run standard genome sequencing on the product

4. Result: Expression level dependent sequence coverage

Image found athttp://www.pacificu.edu/optometry/ce/courses/20591/armdpg3.cfm

Page 13: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

RNA-Seq Reconstructs Transcripts

From the CuffLinks paper, Trapnell et al.http://www.nature.com/nbt/journal/v28/n5/abs/nbt.1621.htmlNature Biotechnology Volume: 28, Pages: 511–515 Year published: (2010)

Page 14: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

IGV – aligned reads viewer

Page 15: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

CoverageBED

BEDTOOLS : a flexible suite of utilities for comparing genomic features. http://code.google.com/p/bedtools/

Simple arbitrary feature read depth counting.-Count by gene, exon, whatever

Page 16: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Example of bad alignment

Page 17: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Variance and mean linked by local regression - for robust parameter estimation.

Page 18: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

• Negative Binomial

• Models count as ‘binomial successes until a set number of failures’ which better fits the RNA-Seq fragment generation (limited reagent)

• Allows/captures the ‘overdispersion’ seen in RNA-Seq experiments.

Page 19: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Scale the totals for compatible means

Page 20: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Mean-Variance Connection

Page 21: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Detection in Low Values

Page 22: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow
Page 23: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow
Page 24: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow
Page 25: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow
Page 26: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Per-gene mean by difference ratio

Page 27: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow
Page 28: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

DESeq• Starting from 18,000 Rsids minus 1200 NA• 1000 entries p<0.05

Page 29: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Theme

• Big lists• Noisy data• Complex correlation• Heterogeneous background

Page 30: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Precious Tissue Samples

• Collecting tissue during surgery is an extra burden placed on overloaded surgical teams.

• Samples must be processed carefully to avoid degradation of sensitive molecules.

• Many steps and costs prior to gene sequencing.

• Collaborators have provided 35 patients’ atrial septal tissues.

• Still no ethical source of healthy control.

Page 31: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow
Page 32: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

• Hope to see separation between red/notred or solid/notsolid points• Lack of discrimination in major variation dimensions • Implying uncontrolled heterogeneity dominates

Therefore, more difference person to person than between subtypes

Page 33: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Top25 Consistent Genes

• Anyone know what it means when Adducin2 and HomeoboxA4 are overexpressed? Is it significant that a dehydrogenase is under-expressed?

Page 34: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Group Profiles at Selected Dimensions

Page 35: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Self-Organizing Map

• Kohonen 1990 • Halfway between neural networks and k-means (horrible oversimplification)

• Enforced grid layout and local neighborhood similarity• Data points (here 25-dimensional vectors) lay out in natural organization

Page 36: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Stochastic - Iteration

Page 37: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Pairwise Similarity• Co-clustering frequency determines sample similarity• Sub-clusters are identified organically

Page 38: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Results• Lists of genes differential across conditions• Many conditions, uncertain homogeneity• List cutoff subjective• No healthy control group

• We can mine these lists for pathways or biological processes• Resulting in more lists of more complex results

Page 39: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Transcriptome Project Future Work• A few more samples are coming… Can we build a classifier?• Predict non-measured variables? Signatures of immune

response point towards treatment targets.• Predict compensatory effects? Samples are taken just days

after birth, but 8 months after the heart started beating.

• How else we could look at this rich, unique dataset?

Page 40: Transcriptome Profiling of Human Cardiac Tissues in Hypoplastic Left Heart Syndrome Karl D. Stamm, MS Donna K. Mahnke, MS; Mary A. Goetsch, MS; D. Woodrow

Thanks for listening