
Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Talk by Jonathan Eisen for MBL Molecular Evolution course in 2004


Page 1: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Page 2: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Phylogenomics and the Axis of Evol

or How I Came to Love a Tree

Jonathan A. Eisen

The Institute for Genomic Research

Page 3: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Topics of Discussion

• Introduction to evolution

• Introduction to phylogenomics

• Phylogenomic examples

– Species evolution

– Uncultured organisms

– Functional predictions

– Gene duplication

– Genome rearrangements

Page 4: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Eisen Genome Projects

Extremophiles, DNA repair models

Deinococcus radiodurans

Haloferax volcanii

Tetrahymena thermophila

Novel phylogenetic groups Tree of Life

Endosymbionts: Wolbachia, Baumannia, chemosynthetic symbionts, Prochloron

Evolution of C1 metabolism: Carboxydothermus, Methylococcus, Chlorobium, Acidithiobacillus

Page 5: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Analysis of Complete Genomes

• Identification/prediction of genes

• Characterization of gene features

• Characterization of genome features

• Prediction of gene function

• Prediction of pathways

• Integration with known biological data

• Comparative genomics

Page 6: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Comparative Genomics

• Comparison of genomes between species

• Identify differences

– SNPs, indels

– Rearrangements

– Presence/absence of genes, pathways, features

• Correlating with phenotypic differences

• Can be used to improve on every step in genome analysis

Page 7: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

DNA Islands in O157:H7 vs. K12:Gene Loss or Transfer?

Eisen (2001)

Page 8: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Group B Streptococcus CGH

Tettelin et al., 2002

Page 9: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Segmental Duplications in Arabidopsis

AGI, 2000

Page 10: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

“Nothing in biology makes sense except in the light of evolution.”

T. Dobzhansky (1973)

Page 11: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Evolutionary Perspective and Comparative Biology

• Comparative biology is the analysis of differences and similarities between species.

• An evolutionary perspective is useful in such studies because this allows one to focus not just on the levels and degrees of similarity or difference but on how and why similarities and differences came to be.

Page 12: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Phylogenomics

• Genome sequences contain a record of the evolution of a species and all its genes

• Evolutionary analysis is the key to interpreting genome sequences and making the most use out of them

• There is a feedback loop between evolutionary and genome analysis such that they should be done together.

Page 13: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Page 14: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Why Completeness is Important

• Improves characterization of genome features

• Better comparative genomics

• Presence/absence is less subjective

• Missing sequence might be important (e.g., centromere)

• Allows researchers to focus on biology, not sequencing

• Facilitates large scale correlation studies

• Controls for contamination

Page 15: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Phylogenomics I:

Major Evolutionary Transitions

Page 16: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Eukaryotes vs. Prokaryotes

• Analysis of S. pombe genome by Wood et al. 2002

• Compared the predicted proteomes of all completed genomes of eukaryotes to those of prokaryotes

• Asked: “Are there genes found in all eukaryotes with no obvious homologs in any prokaryote?”
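The question asked on this slide is, at its core, a set query over a presence/absence table. The sketch below shows one way to phrase it, assuming homolog detection has already been done elsewhere. The family names, genome lists, and the helper `specific_to` are all hypothetical and are not taken from Wood et al.

```python
# Toy presence/absence table: gene family -> genomes with a detectable homolog.
# Every family name and assignment below is invented for illustration only.
PRESENCE = {
    "family_1": {"S. pombe", "S. cerevisiae", "Human", "Arabidopsis"},
    "family_2": {"S. pombe", "S. cerevisiae", "Human", "Arabidopsis"},
    "family_3": {"S. pombe", "S. cerevisiae", "Human", "Arabidopsis", "E. coli"},
    "family_4": {"S. pombe", "Human"},
    "family_5": {"E. coli", "M. jannaschii"},
}
EUKARYOTES = {"S. pombe", "S. cerevisiae", "Human", "Arabidopsis"}
PROKARYOTES = {"E. coli", "M. jannaschii"}


def specific_to(presence, ingroup, outgroup):
    """Return families found in every ingroup genome and in no outgroup genome."""
    return sorted(
        family
        for family, genomes in presence.items()
        if ingroup <= genomes and not (genomes & outgroup)
    )


eukaryote_specific = specific_to(PRESENCE, EUKARYOTES, PROKARYOTES)
print(eukaryote_specific)
```

The same helper would answer the multi-cellular vs. single-cellular question by swapping the ingroup and outgroup sets. The answer is only as good as the homolog calls and the completeness of the genomes behind the table.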

Page 17: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Evolutionary Scenario

[Figure: tree with Bacteria, Archaea, and Eukaryotes. Eukaryotic tips: Giardia, Trichomonas, Naegleria, Trypanosoma, Euglena, Plasmodium, Tetrahymena, Phytophthora, Arabidopsis, Chlamydomonas, Dictyostelium, Humans, Fly, Worm, Encephalitozoon, S. cerevisiae, S. pombe]

Page 18: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Eukaryotic Specific Genes

• >200 genes found including:

– Cytoskeleton components: tubulin, ankyrin, myosin

– Protein degradation: ubiquitin, proteases

– Chromatin and DNA packaging

• Of the 200, many had no known function: these could encode novel eukaryote-wide processes

Page 19: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Multi- vs. Single-Cellular Eukaryotes

• Further analysis of S. pombe genome

• Compared multi-cellular vs. single-cellular eukaryotes (animals and plants vs. yeast)

• “Are there genes in all multi-cellular and not in any single-cellular?”

• Found only 3

• Concluded that the genetic basis of multi-cellularity was likely to be gene regulation and not invention of new genes

Page 20: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Multiple Origins of Multicellularity

Page 21: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Phylogenomics II:

Uncultured Organisms and Phylogenetic Anchors

Page 22: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Selecting Genome Projects

• Economic importance

• Relevance to human disease

• Biochemical or physiological novelty

• Ecological importance

• Phylogenetic position

Page 23: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Selecting Genome Projects

• If all else is (roughly) equal, select the most experimentally tractable organisms

– Deinococcus radiodurans

– Chlorobium tepidum

– Tetrahymena thermophila

• Genome sequences are powerful tools for launching experimental studies for those organisms

• However, not all important organisms work nicely in the lab

Page 24: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

rRNA and Uncultured Microbes

Eisen et al. 1992

Page 25: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Beyond rRNA I:Symbiont Genome Sequencing

[Figure: image labeled “shotgun” and “sequence”; credit: Warner Brothers, Inc.]

Page 26: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Wu et al., 2004

Wolbachia pipientis wMel

Page 27: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Wolbachia Mobile/Repetitive DNA

Repeat Class | Size (Median) | Copies | Protein motifs/families | IS Family | Possible Terminal Inverted Repeat Sequence
1 | 1512 | 3 | Transposase | IS4 | 5’ ATACGCGTCAAGTTAAG 3’
2 | 360 | 12 | - | New | 5’ GGCTTTGTTGCAT CGCTA 3’
3 | 858 | 9 | Transposase | IS492/IS110 | 5’ GGCTTTGTTGCAT 3’
4 | 1404.5 | 4 | Conserved hypothetical, phage terminase | New | 5’ ATACCGCGAWTSAWTCGCGGTAT 3’
5 | 1212 | 15 | Transposase | IS3 | 5’ TGACCTTACCCAGAAAAAGTGGAGAGAAAG 3’
6 | 948 | 13 | Transposase | IS5 | 5’ AGAGGTTGTCCGGAAACAAGTAAA 3’
7 | 2405.5 | 8 | RT/maturase | - |
8 | 468 | 45 | - | - |
9 | 817 | 3 | Conserved hypothetical, transposase | ISBt12 |
10 | 238 | 2 | ExoD | - |
11 | 225 | 2 | RT/maturase | - |
12 | 1263 | 4 | Transposase | ??? |
13 | 572.5 | 2 | Transposase | ??? | None detected
14 | 433 | 2 | Ankyrin | - |
15 | 201 | 2 | - | - |
16 | 1400 | 6 | RT/maturase | - |
17 | 721 | 2 | Transposase | IS630 |
18 | 1191.5 | 2 | EF-Tu | - |
19 | 230 | 2 | Hypothetical | - |

Wu et al., 2004

Page 28: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Wu et al., 2004

Page 29: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Selection Apparently Inefficient in wMel

• Likely not due to higher mutation rate

– Full suite of DNA repair genes

• Likely not due to low amounts of homologous recombination

– RecA present

– Population studies suggest homologous recombination occurs

• Wolbachia has multiple types of bottlenecks

– Maternal transmission like obligate mutualists

– Infectious sweeps of cytoplasmic incompatibility like pathogens

Wu et al., 2004

Page 30: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

• Sap feeding insects

Glassy-winged Sharpshooter

• Carriers of Xylella fastidiosa that causes Pierce’s disease of grapevines

• There are >20000 sharpshooter species, within which intracellular symbiotic bacteria are widespread

Baumannia cicadellinicola: 1° symbionts of the Glassy-winged Sharpshooter

Page 31: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Baumannia Genome Completed

[Figure: circular genome map with coordinate ticks at 1, 100,000, 200,000, 300,000, 400,000, 500,000, and 600,000]

Page 32: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

“Whole genome” tree of insect endosymbionts

Page 33: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

DNA repair genes

Gene (role) | Bu_APS | Bu_Bp | Bu_Sg | Wi_gl | Ca_bl | gebc
recG & ruvABC (recombination repair) | - | - | - | - | - | recG & ruvABC
methyl-directed DNA repair (correct T-G) | mutSL | mutSLH | mutSL | - | - | mutSLH
8-oxo-dGTP prevention (mutT, mutY or mutM) | mutTY | mutY | mutTY | - | mutTY | mutTM
uvrD (or homolog rep) | rep | rep | rep | uvrD | - | uvrD
recA | - | - | - | recA | - | recA
phrB (UV pyrimidine dimer) | phrB | phrB | - | phrB | - | -
recBCD | recBCD | recBCD | recBCD | recBCD | recBCD | recBCD
mutL (mismatch repair) | mutL | mutL | mutL | - | - | mutL
recJ (rec-based excision, methyl-directed repair) | - | - | - | recJ | - | recJ
transcription-repair coupling factor (mfd) | mfd | mfd | - | mfd | - | -
uracil-DNA glycosylase (remove U from DNA) | ung | ung | - | ung | ung | ung
site-specific DNA inversion stimulation (fis) | fis | - | fis | - | - | -
DNA recombination protein rmuC (inversion) | - | - | - | - | rmuc | rmuc

Page 34: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

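Essential amino acid biosynthetic pathways

[Figure: pathway diagram; each line below lists substrate, enzymes in order, and product]

Glutamate -> ArgA, ArgB, ArgC, ArgD, ArgE -> Ornithine -> AarAB, ArgF, ArgG, ArgH -> Arg

Pyruvate -> IlvHI, IlvC, IlvD, IlvE -> Val

Threonine -> IlvA -> Alpha-Ketobutyrate; Pyruvate + Alpha-Ketobutyrate -> Ile

PEP + Erythrose 4-phosphate -> AroH, AroB, AroD, AroE, AroK, AroA, AroC -> Chorismate

Chorismate -> PheA, HisC -> Phe

Chorismate -> TrpEG, TrpD, TrpC, TrpAB -> Trp

Aspartate -> ThrA, Asd, ThrA -> Homoserine -> ThrB, ThrC -> Thr

Homoserine -> MetB, MetC, MetE -> Met

DapA, DapB, DapD, DapC, DapE, DapF, LysA -> Lys

PRPP + ATP -> HisG, HisI, HisA, HisHF, HisB, HisC, HisB, HisD -> His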

Page 35: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Sequences from Another Symbiont

9359 clones that are not included in the final assembly

Run_TA

7152 assemblies (400 have been assembled)

Size range | Count
<1kb | 6996
1kb-2kb | 125
2kb-3kb | 18
3kb-4kb | 6
4kb-5kb | 3
5kb-6kb | 2
6kb-7kb | 1
7kb-8kb | 1

150 Bacteroides/Chlorobi (njtree/blast)

Page 36: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Aspartate -> ThrA, Asd -> Homoserine -> ThrB, ThrC -> Thr

Homoserine -> MetB, MetC, MetE -> Met

DapA, DapB, DapD, DapC, DapE, DapF, LysA -> Lys

Page 37: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Page 38: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Beyond rRNA II: Metagenomics

• Isolate, by filtration, all microbes in a sample

• Extract total DNA in very large pieces

• Clone those pieces as BACs into E. coli to obtain enough DNA

• Identify which BAC contains phylogenetic marker of interest

• Sequence the BACs like a bacterial genome.

Sample -> Filter, concentrate -> Extract DNA -> Clone into BACs -> Sequence -> Gene list

Page 39: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Using a rRNA anchor allowed the

identification of a new form of phototrophy:

Proteorhodopsin

Beja et al. 2000

Page 40: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Beja et al., 2001

Page 41: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Phylogenetic Anchors

Beja et al., 2000

Page 42: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Puf Operons vs. Cultured Species

Beja et al., 2002

Page 43: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Beyond rRNA III:Shotgun Environmental Sequencing

[Figure: image labeled “shotgun” and “sequence”; credit: Warner Brothers, Inc.]

Page 44: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Sargasso Sea

• High microbial diversity

• Most of the abundant rRNA phylotypes have never been cultured

• Physiological processes of microbes largely unknown

• Well studied in terms of oceanographic parameters

Page 45: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

rRNA as a Phylogenetic Anchor

Venter et al., 2004

Page 46: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Shotgun Sequencing Allows Use of Alternative Anchors (e.g., RecA)

Venter et al., 2004

Page 47: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Shotgun Sequencing Detects More Diversity than PCR-methods

Venter et al., 2004

Page 48: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Sargasso Phylotypes

[Figure: bar chart. X axis: Major Phylogenetic Group (Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Epsilonproteobacteria, Deltaproteobacteria, Cyanobacteria, Firmicutes, Actinobacteria, Chlorobi, CFB, Chloroflexi, Spirochaetes, Fusobacteria, Deinococcus-Thermus, Euryarchaeota, Crenarchaeota). Y axis: Weighted % of Clones, 0 to 0.5. One series per marker gene: EFG, EFTu, HSP70, RecA, RpoB, rRNA]

Venter et al., 2004

Page 49: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Phylogenomics and Species Evolution II: Biased Sample of Genomes

• Of 40 bacterial phyla most genome sequences come from only 3 groups

Hugenholtz 2002

Page 50: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

# of Bacterial Phyla Sequenced

[Figure: chart of Total # of Bacterial Phyla with a Genome Sequenced (y axis, 0 to 25) by Year (x axis, 1995 to 2005)]

Page 51: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Page 52: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Solution: Selecting Genome Projects to Increase Phylogenetic Diversity

[Figure: tree of bacterial phyla, scale bar 0.1. Phyla shown: Acidobacteria, Bacteroides, Fibrobacteres, Gemmimonas, Verrucomicrobia, Planctomycetes, Chloroflexi, Proteobacteria, Chlorobi, Firmicutes, Fusobacteria, Actinobacteria, Cyanobacteria, Chlamydia, Spirochaetes, Deinococcus-Thermus, Aquificae, Thermotogae, TM6, OS-K, Termite Group, OP8, Marine Group A, WS3, OP9, NKB19, OP3, OP10, TM7, OP1, OP11, Nitrospira, Synergistes, Deferribacteres, Thermodesulfobacteria, Chrysiogenetes, Thermomicrobia, Dictyoglomus, Coprothermobacter. Legend: This project; Published; In progress; Uncultured lineage]

Tree based on Hugenholtz (2002) with some modifications.

Page 53: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Re-evaluating the Bacterial Tree

Page 54: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Functional Diversity of Proteorhodopsins?

Venter et al., 2004

Page 55: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Phylogenomics III:

Prediction of Function(s)

Page 56: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Predicting Function

• Identification of motifs

– Short regions of sequence similarity that are indicative of general activity

– e.g., ATP binding

• Homology/similarity based methods

– Gene sequence is searched against a database of other sequences

– If significantly similar genes are found, their functional information is used

• Problem

– Genes frequently have similarity to hundreds of motifs and multiple genes, not all with the same function

Page 57: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGRTIGRTIGRTIGRTIGR

Page 58: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Blast Search of H. pylori “MutS”

Sequences producing significant alignments | Score (bits) | E Value
sp|P73625|MUTS_SYNY3 DNA MISMATCH REPAIR PROTEIN | 117 | 3e-25
sp|P74926|MUTS_THEMA DNA MISMATCH REPAIR PROTEIN | 69 | 1e-10
sp|P44834|MUTS_HAEIN DNA MISMATCH REPAIR PROTEIN | 64 | 3e-09
sp|P10339|MUTS_SALTY DNA MISMATCH REPAIR PROTEIN | 62 | 2e-08
sp|O66652|MUTS_AQUAE DNA MISMATCH REPAIR PROTEIN | 57 | 4e-07
sp|P23909|MUTS_ECOLI DNA MISMATCH REPAIR PROTEIN | 57 | 4e-07

• Blast search pulls up Syn. sp MutS#2 with a much higher score (and much lower E value) than other MutS homologs

Eisen, 1997

Page 59: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Page 60: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Phylogenetic Tree of MutS Family

[Figure: phylogenetic tree of MutS homologs. Tips are labeled with abbreviated species names, including Aquae, Trepa, Fly, Xenla, Rat, Mouse, Human, Yeast, Neucr, Arath, Borbu, Strpy, Bacsu, Synsp, Ecoli, Neigo, Thema, Theaq, Deira, Chltr, Spombe, Celeg, Metth, Helpy, and mSaco; several species appear more than once]

Page 61: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

MutS Subfamilies

[Figure: the MutS family tree with clades labeled by subfamily: MutS1, MutS2, MSH1, MSH2, MSH3, MSH4, MSH5, MSH6]

Page 62: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

MutS Subfamilies

• MutS1 Bacterial MMR

• MSH1 Euk - mitochondrial MMR

• MSH2 Euk - all MMR in nucleus

• MSH3 Euk - loop MMR in nucleus

• MSH6 Euk - base:base MMR in nucleus

• MutS2 Bacterial - function unknown

• MSH4 Euk - meiotic crossing-over

• MSH5 Euk - meiotic crossing-over

Page 63: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Overlaying Functions onto Tree

[Figure: the MutS family tree with subfamily labels (MutS1, MutS2, MSH1, MSH2, MSH3, MSH4, MSH5, MSH6) and known functions overlaid on the tips]

Page 64: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Functional Prediction Using Tree

[Figure: the MutS family tree with a predicted function assigned to each subfamily clade]

MutS1 - Bacterial Mismatch and Loop Repair

MutS2 - Unknown Functions

MSH1 - Mitochondrial Repair

MSH2 - Eukaryotic Nuclear Mismatch and Loop Repair

MSH3 - Nuclear Repair of Loops

MSH6 - Nuclear Repair of Mismatches

MSH4 - Meiotic Crossing Over

MSH5 - Meiotic Crossing Over
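The idea behind the last few slides (place the gene on a tree, then read the function off the clade it falls into) can be sketched in a few lines. This is a minimal illustration, not the method used in Eisen (1997): the tree topology, the sequence names, and the function labels below are toy values chosen to echo the MutS1/MutS2 example, and `predict_function` is a hypothetical helper.

```python
def leaves(node):
    """List the leaf names under a node; a leaf is a plain string."""
    if isinstance(node, str):
        return [node]
    return [name for child in node for name in leaves(child)]


def predict_function(tree, query, known):
    """Walk from the root toward the query and keep the annotations seen in the
    smallest clade that holds the query plus at least one annotated sequence.
    Return the function only if those annotations all agree, else None."""
    best = None
    node = tree
    while not isinstance(node, str):
        members = leaves(node)
        functions = {known[m] for m in members if m in known and m != query}
        if functions:
            best = functions
        # Descend into the child clade that contains the query.
        node = next(child for child in node if query in leaves(child))
    if best is not None and len(best) == 1:
        return next(iter(best))
    return None


# Toy gene tree as nested tuples (assumed topology, for illustration only).
TREE = (
    ("Ecoli_MutS1", "Bacsu_MutS1"),
    ("Helpy_MutS", ("Bacsu_MutS2", "Synsp_MutS2")),
)
KNOWN = {
    "Ecoli_MutS1": "mismatch repair",
    "Bacsu_MutS1": "mismatch repair",
    "Bacsu_MutS2": "function unknown (MutS2 subfamily)",
    "Synsp_MutS2": "function unknown (MutS2 subfamily)",
}

prediction = predict_function(TREE, "Helpy_MutS", KNOWN)
print(prediction)
```

In this toy tree the query sits inside the MutS2 clade, so the sketch declines to call it a mismatch repair protein even though its database hits are all annotated that way. Returning `None` on disagreement is a deliberately conservative design choice.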

Page 65: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Table 3. Presence of MutS Homologs in Complete Genome Sequences

Species | # of MutS Homologs | Which Subfamilies? | MutL Homologs
Bacteria
Escherichia coli K12 | 1 | MutS1 | 1
Haemophilus influenzae Rd KW20 | 1 | MutS1 | 1
Neisseria gonorrhoeae | 1 | MutS1 | 1
Helicobacter pylori 26695 | 1 | MutS2 | -
Mycoplasma genitalium G-37 | - | - | -
Mycoplasma pneumoniae M129 | - | - | -
Bacillus subtilis 168 | 2 | MutS1, MutS2 | 1
Streptococcus pyogenes | 2 | MutS1, MutS2 | 1
Mycobacterium tuberculosis | - | - | -
Synechocystis sp. PCC6803 | 2 | MutS1, MutS2 | 1
Treponema pallidum Nichols | 1 | MutS1 | 1
Borrelia burgdorferi B31 | 2 | MutS1, MutS2 | 1
Aquifex aeolicus | 2 | MutS1, MutS2 | 1
Deinococcus radiodurans R1 | 2 | MutS1, MutS2 | 1
Archaea
Archaeoglobus fulgidus VC-16, DSM4304 | - | - | -
Methanococcus jannaschii DSM 2661 | - | - | -
Methanobacterium thermoautotrophicum ΔH | 1 | MutS2 | -
Eukaryotes
Saccharomyces cerevisiae | 6 | MSH1-6 | 3+
Homo sapiens | 5 | MSH2-6 | 3+

Page 66: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Page 67: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

rRNA and Uncultured Microbes

Eisen et al., 1992

Page 68: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Novel RNA Polymerase in A. thaliana

[Figure: phylogenetic tree of RNA polymerase subunit homologs (RpoB and its archaeal, eukaryotic, viral, and plastid relatives) with bootstrap support values on the branches. Clade labels: Archaeal, IV, II, III, I, Viral, Bacterial - RpoB, Plastid - RpoBs]

Page 69: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

DNA Ligase Tree

[Figure: phylogenetic tree of DNA ligase homologs from eukaryotes, poxviruses and other viruses, archaea, and a few bacteria. Clade labels: Ligase IV, Viral ligases, Ligase I, Archaeal Ligase]

Page 70: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Problems with Similarity Based Functional Prediction

• Prone to database error propagation.

• Cannot identify orthologous groups reliably.

• Perform poorly in cases of evolutionary rate variation and non-hierarchical trees (similarity will not reflect evolutionary relationships in these cases)

• May be misled by modular proteins or large insertion/deletion events.

• Are not set up to deal with expanding data sets.
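The rate-variation point in the list above can be made concrete with a toy tree. In the sketch below, every branch length and sequence name is invented: the query's true sister has evolved quickly, so the most similar sequence (the "best hit") is a slowly evolving outgroup instead. Distances are tree path lengths, which stand in for what a similarity search would roughly measure.

```python
# Each sequence is described by the branches on its path to the root of a toy tree.
# Branch names and lengths are hypothetical; the sister lineage is given a long branch.
PATHS = {
    "query":       {"tip_query": 1.0, "stem_left": 1.0},
    "fast_sister": {"tip_sister": 8.0, "stem_left": 1.0},
    "outgroup_1":  {"tip_out1": 1.0, "stem_right": 1.0},
    "outgroup_2":  {"tip_out2": 1.0, "stem_right": 1.0},
}


def tree_distance(a, b):
    """Sum the lengths of branches that lie on exactly one of the two root paths."""
    pa, pb = PATHS[a], PATHS[b]
    only_a = sum(length for branch, length in pa.items() if branch not in pb)
    only_b = sum(length for branch, length in pb.items() if branch not in pa)
    return only_a + only_b


def best_hit(query):
    """The most similar sequence: smallest distance, ties broken by name."""
    others = sorted(name for name in PATHS if name != query)
    return min(others, key=lambda name: tree_distance(query, name))


def true_sister(query):
    """The closest relative: the sequence sharing the most root-path branches."""
    others = sorted(name for name in PATHS if name != query)
    return max(others, key=lambda name: len(PATHS[query].keys() & PATHS[name].keys()))


print(best_hit("query"), true_sister("query"))
```

Here the best hit and the true sister disagree, which is the situation in which transferring the annotation of the top hit can mislead.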

Page 71: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Evolutionary Rate Variation

[Figure: tree with six tips, labeled 2, 3, 1, 4, 5, 6]

Page 72: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Rate Variation and Duplication

[Figure: gene tree for Species 1, Species 2, and Species 3 in which a duplication gives rise to two gene copies, A (1A, 2A, 3A) and B (1B, 2B, 3B)]

Page 73: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Alkylation Repair Genes

[Figure: domain architecture of alkylation repair proteins. Domains: AlkA Domain (O6-Me-G glycosylase), Ogt Domain (O6-Me-G alkyltransferase), Ada Domain (transcription regulator). Proteins shown: Ada E. coli, Ada H. infl, Ogt E. coli, Ogt H. infl, Ogt Gram+, Ogt D. radio, AlkA Gram+, AlkA E. coli, MGMT Euks]

Page 74: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Clustering vs. Neighbor-joining

[Figure: two trees of the same MutS family sequences; panel labels: UPGMA, Neighbor-Joining]

Tip order, first tree: MutS2.Syns, MutS2.Bacs, MutS2.Help, MutS2.Deir, Mutsl.Mett, MSH4.Celeg, MSH4.Yeast, MSH4.human, mMutS.Saco, MSH3.yeast, C23C11.Spo, MSH1.Yeast, MSH3.Human, REP1.Mouse, GTBP.Mouse, GTBP.Human, MSH6.Yeast, MSH5.Human, MSH5.Celeg, MSH5.Yeast, MSH2.Human, MSH2.Mouse, MSH2.Yeast, MutS.Ecoli, MutS.Synsp, MutS.Deira, MutS.Bacsu

Tip order, second tree: MutS.Ecoli, MutS.Synsp, MutS.Bacsu, MutS.Deira, MSH2.Human, MSH2.Mouse, MSH2.Yeast, MSH3.Human, REP1.Mouse, GTBP.Mouse, GTBP.Human, MSH6.Yeast, C23C11.Spo, MSH1.Yeast, MSH3.yeast, MSH4.Celeg, MSH4.human, MSH5.Celeg, MSH5.Yeast, mMutS.Saco, MSH5.Human, MSH4.Yeast, MutS2.Syns, MutS2.Bacs, MutS2.Deir, MutS2.Help, Mutsl.Mett
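To see why the two methods on this slide can disagree, here is a compact sketch of both on a four-sequence toy matrix. The distances are path lengths on an assumed tree in which slowA pairs with fastB and slowC pairs with fastD, with the two "fast" tips on long branches. The names, the matrix, and the helper functions are all hypothetical, and the implementations are stripped to the essentials (no branch lengths are returned).

```python
from itertools import combinations


def pair(a, b):
    return frozenset((a, b))


def upgma(dist):
    """Repeatedly merge the two clusters with the smallest average distance."""
    dist = dict(dist)
    nodes = set().union(*dist)
    size = {n: 1 for n in nodes}
    while len(nodes) > 1:
        a, b = min(combinations(sorted(nodes, key=str), 2), key=lambda p: dist[pair(*p)])
        new = (a, b)
        size[new] = size[a] + size[b]
        for c in nodes - {a, b}:
            dist[pair(new, c)] = (size[a] * dist[pair(a, c)] + size[b] * dist[pair(b, c)]) / size[new]
        nodes = (nodes - {a, b}) | {new}
    return nodes.pop()


def neighbor_joining(dist):
    """Join the pair minimising the NJ Q criterion until three nodes remain;
    the result is an unrooted tree written as a top-level trifurcation."""
    dist = dict(dist)
    nodes = set().union(*dist)
    while len(nodes) > 3:
        n = len(nodes)
        total = {i: sum(dist[pair(i, k)] for k in nodes if k != i) for i in nodes}
        a, b = min(
            combinations(sorted(nodes, key=str), 2),
            key=lambda p: (n - 2) * dist[pair(*p)] - total[p[0]] - total[p[1]],
        )
        new = (a, b)
        for c in nodes - {a, b}:
            dist[pair(new, c)] = 0.5 * (dist[pair(a, c)] + dist[pair(b, c)] - dist[pair(a, b)])
        nodes = (nodes - {a, b}) | {new}
    return tuple(sorted(nodes, key=str))


def clade_sets(tree):
    """Collect the leaf set of every internal node of a nested-tuple tree."""
    found = set()

    def walk(node):
        if isinstance(node, str):
            return frozenset([node])
        members = frozenset().union(*(walk(child) for child in node))
        found.add(members)
        return members

    walk(tree)
    return found


# Path-length distances on the assumed tree: tips slowA=1, fastB=4, slowC=1, fastD=4,
# internal branch=1, true pairs (slowA, fastB) and (slowC, fastD).
DIST = {
    pair("slowA", "fastB"): 5, pair("slowA", "slowC"): 3, pair("slowA", "fastD"): 6,
    pair("fastB", "slowC"): 6, pair("fastB", "fastD"): 9, pair("slowC", "fastD"): 5,
}

upgma_clades = clade_sets(upgma(DIST))
nj_clades = clade_sets(neighbor_joining(DIST))
print(pair("slowA", "slowC") in upgma_clades, pair("slowA", "fastB") in nj_clades)
```

On this matrix UPGMA groups the two slowly evolving sequences because they are the most similar, while neighbor-joining, which corrects for each sequence's overall divergence, recovers the assumed pairing.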

Page 75: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Types of Molecular Homology

• Homologs: genes that are descended from a common ancestor (e.g., all globins)

• Orthologs: homologs that have diverged after speciation events (e.g., human and chimp β-globins)

• Paralogs: homologs that have diverged after gene duplication events (e.g., α- and β-globin).

• Xenologs: homologs that have diverged after lateral transfer events

• Positional homology: common ancestry of specific amino acid or nucleotide positions in different genes
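The ortholog/paralog definitions above translate directly into a rule on a gene tree whose internal nodes are already labeled as speciation or duplication events: find the node where two genes last shared an ancestor and read its label. The sketch below assumes such labels are given (inferring them is a separate problem), uses a toy globin tree, and does not handle lateral transfer; the node encoding and the function names are hypothetical.

```python
def leaf_names(node):
    """Return the set of gene names under a node; a leaf is a plain string."""
    if isinstance(node, str):
        return {node}
    _event, left, right = node
    return leaf_names(left) | leaf_names(right)


def relationship(node, gene_1, gene_2):
    """Classify two genes by the event at their most recent common ancestor."""
    if isinstance(node, str) or not {gene_1, gene_2} <= leaf_names(node) or gene_1 == gene_2:
        raise ValueError("need two different genes that are both in the tree")
    event, left, right = node
    for child in (left, right):
        if {gene_1, gene_2} <= leaf_names(child):
            return relationship(child, gene_1, gene_2)
    # The two genes sit in different children, so this node is their last common ancestor.
    return "orthologs" if event == "speciation" else "paralogs"


# Toy gene tree: an ancient duplication produced alpha and beta globin,
# and each copy was later split by the human/chimp speciation.
GLOBINS = (
    "duplication",
    ("speciation", "human_alpha", "chimp_alpha"),
    ("speciation", "human_beta", "chimp_beta"),
)

print(relationship(GLOBINS, "human_beta", "chimp_beta"))
print(relationship(GLOBINS, "human_alpha", "human_beta"))
```

Note that human alpha and chimp beta also come out as paralogs, even though they are in different species: what matters is the event at the common ancestor, not where the genes now reside.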

Page 76: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Deinococcus radiodurans

Page 77: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

DNA Repair Genes in D. radiodurans Complete Genome

Process | Genes in D. radiodurans
Nucleotide Excision Repair | UvrABCD, UvrA2
Base Excision Repair | AlkA, Ung, Ung2, GT, MutM, MutY-Nths, MPG
AP Endonuclease | Xth
Mismatch Excision Repair | MutS, MutL
Recombination: Initiation | RecFJNRQ, SbcCD, RecD
Recombination: Recombinase | RecA
Recombination: Migration and resolution | RuvABC, RecG
Replication | PolA, PolC, PolX, phage Pol
Ligation | DnlJ
dNTP pools, cleanup | MutTs, RRase
Other | LexA, RadA, HepA, UVDE, MutS2

Page 78: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Problem:

The list of DNA repair gene homologs in the D. radiodurans genome is not significantly different from that of other bacterial genomes of similar size

Page 79: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

~40 Phyla of Bacteria

[Figure: tree of bacterial phyla, scale bar 0.1. Phyla shown: Acidobacteria, Bacteroides, Fibrobacteres, Gemmimonas, Verrucomicrobia, Planctomycetes, Chloroflexi, Proteobacteria, Chlorobi, Firmicutes, Fusobacteria, Actinobacteria, Cyanobacteria, Chlamydia, Spirochaetes, Deinococcus-Thermus, Aquificae, Thermotogae, TM6, OS-K, Termite Group, OP8, Marine Group A, WS3, OP9, NKB19, OP3, OP10, TM7, OP1, OP11, Nitrospira, Synergistes, Deferribacteres, Thermodesulfobacteria, Chrysiogenetes, Thermomicrobia, Dictyoglomus, Coprothermobacter]

Tree based on Hugenholtz (2002) with some modifications.

Page 80: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Most DNA metabolism studies in two Phyla

[Figure: the same tree of ~40 bacterial phyla, based on Hugenholtz (2002) with some modifications]

Page 81: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGRTIGRTIGRTIGR

Deinococcus is very distant from well studied groups

[Figure: the same tree of ~40 bacterial phyla, based on Hugenholtz (2002) with some modifications]

Page 82: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

[Figure: species tree with DNA repair gene events mapped onto its branches. Taxa: Ecoli, Haein, Neigo, Helpy, Bacsu, Strpy, Mycge, Mycpn, Borbu, Trepa, Synsp (BACTERIA); Metjn, Arcfu, Metth (ARCHAEA); Human, Yeast (EUKARYOTES). Branch annotations are lists of repair gene names carrying a "+", "-", "d", or "t" prefix (for example +UvrABCD+Mfd, -PhrII-RuvC, dRecA/SMS, tRad25); one set of branch annotations is marked "from mitochondria". The placement of individual annotations on the tree is not recoverable from this extraction.]

Gain and Loss of Repair Genes

Eisen and Hanawalt, 1999

Page 83: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Non-Homology Prediction: Phylogenetic Profiles

• Step 1: Search all genes in organisms of interest against all other genomes

• Ask: yes or no, is each gene found in each of the other species?

• Cluster genes by distribution patterns (profiles)
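The three steps above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used for the slides: the gene names, species names, and hit table are made up, the homology search itself (step 1) is assumed to have already been run, and genes are grouped only when their profiles are exactly identical. The `hamming` helper is included as one possible way to compare profiles that are similar but not identical.

```python
from collections import defaultdict

def build_profiles(hits, genomes):
    """Turn search results into presence/absence profiles.

    hits: dict mapping each gene to the set of genomes in which a
          homolog was detected (output of the search step, assumed given).
    genomes: ordered list of genomes; fixes the column order of a profile.
    """
    return {
        gene: tuple(1 if genome in found else 0 for genome in genomes)
        for gene, found in hits.items()
    }

def cluster_by_profile(profiles):
    """Group genes that share exactly the same presence/absence profile."""
    clusters = defaultdict(list)
    for gene, profile in profiles.items():
        clusters[profile].append(gene)
    return {profile: sorted(genes) for profile, genes in clusters.items()}

def hamming(profile_a, profile_b):
    """Number of genomes in which two profiles disagree."""
    return sum(a != b for a, b in zip(profile_a, profile_b))

# Hypothetical toy data: four genes searched against four genomes.
genomes = ["sp1", "sp2", "sp3", "sp4"]
hits = {
    "geneA": {"sp1", "sp3"},
    "geneB": {"sp1", "sp3"},
    "geneC": {"sp1", "sp2", "sp3", "sp4"},
    "geneD": {"sp2"},
}

profiles = build_profiles(hits, genomes)
clusters = cluster_by_profile(profiles)

for profile, genes in clusters.items():
    print(profile, genes)
```

In this toy table, geneA and geneB share the profile (1, 0, 1, 0) and so fall into the same cluster, which is the kind of pattern that suggests a shared function without relying on sequence similarity between the two genes. Real analyses typically cluster on a distance between profiles rather than requiring exact matches.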

Page 84: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Phylogenetic Profile - E. coli Flagellar Genes

fhiA, fliM, fliP, fliG, flgG, fliF, flgI, flhA, flhB, gcpE

Wu and Eisen, unpublished

Page 85: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Chlorobium tepidum Strain TLS

C. tepidum mat in highly sulfidic “Travelodge Stream”, Rotorua, New Zealand

(from Castenholz and Pierson, 1995)

Phase-contrast photomicrograph of a 48-hour culture and electron micrograph of a thin cell section (from Wahlund et al., 1991)

Page 86: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Phylogenetic Profile - C. tepidum Chlorophyll Synthesis

Wu and Eisen, unpublished

5002_cobalamin biosynthesis protein CbiG/precorrin-4 C11-methyltransferase
3939_precorrin-3B C17-methyltransferase/precorrin-8X methylmutase cbiJH
882_cobyric acid synthase cbiP
3160_dsrN protein dsrN
862_cobyrinic acid a,c-diamide synthase cbiA-1
4010_cobN protein, putative
2641_magnesium-protoporphyrin methyltransferase bchH-3
1498_magnesium-protoporphyrin methyltransferase bchH-1
4003_cobN protein, putative
2636_magnesium-protoporphyrin methyltransferase bchH-2
4008_magnesium-chelatase, subunit I chlI-2
4007_magnesium-chelatase, subunit D/I family
1504_magnesium-chelatase, subunit I chlI-1

Page 87: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

C. tepidum encodes a novel RbcL

[Figure: RbcL phylogenetic tree; scale bar 0.005 changes. Labels listed first in the extraction: Chlorobium tepidum ORF02314, Pseudomonas hydrogenothermophi, Archaeoglobus fulgidus gi2648975, Bacillus subtilis gi2633730, Methanococcus jannaschii gi2129276, Pyrococcus abyssi gi|5458634, Pyrococcus horikoshii gi|3257353, Pyrococcus karaensis gi3769302, Archaeoglobus fulgidus gi2648911. The remaining labels are several hundred RbcL sequences from bacteria, cyanobacteria, algae, mosses, and land plants, run together without separators in the extraction.]
DePamphilis 90 3Nothofagus alessandriiIva frutescensBaccharis halimifoliaHelianthus annuusBorrichia frutescensPentanopsis fragransConostomium quadrangulareKohautia caespitosaBouvardia glaberrimaSynaptantha tillaeaceaOldenlandia corymbosaPentodon pentandrusParapentas silvaticaCarphalea glaucescensPentanisia longitubaPlacopoda virgataHedyotis fruticosaSpermacoce tenuiorRichardia pilosaAmaranthus tricolorTheligonum cynocrambeHydrophyllum virginianumEriodictyon californicumBorago officinalis336406Eccremocarpus scaberLavandula angustifoliaSatureja hortensisHorminum pyrenaicumHyssopus officinalisDracocephalum grandiflorumLamium albumPedicularis foliosaHemimeris sabulosaNuxia sp. Hedberg 4731Peltoboykinia tellimoidesNotholirion thomsonianumMilligania densifloraCorydalis nobilisDarlingtonia californicaPlatanus occidentalisAntidaphne viscoideaDrosera regiaTiarella polyphyllaChrysosplenium albumDiervilla sessilifoliaLeptarrhena pyrolifoliaScoliopus bigeloviiTrochodendron aralioidesAsteropyrum cavalerieiLilium japonicumNomocharis pardanthinaAlania endlicheriFritillaria agrestisDecaisnea fargesiiNepeta tuberosaTeucrium chamaedrysTeline canariensisLaburnum anagyroidesArgyrocytisus battandieriChamaecytisus austriacusChamaespartium sagittaleAdenocarpus complicatusCalicotome villosaCytisus arboreusCytisophyllum sessilifoliumPetteria ramentaceaSpartocytisus supranubiusPolygala amaraSinapis albaRogiera suffrutescensHelianthemum grandiflorumOtholobium sericeumCistus revoliiErythrina cristagalliCajanus cajanBituminaria bituminosaNapoleonaea vogeliiGuarea glabraSpartium junceumLupinus albescensErinacea anthyllisLotononis galpinii

I

Eisen et al., 2002

Page 88: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


PG Profile of C. tepidum RbcL suggests link with Sulfur Metabolism

From Eisen et al., 2002; see also Hanson and Tabita, 2001

CT1893 sulfhydrogenase, delta subunit hydD
CT1681 ABC transporter, permease protein
CT2206 polysaccharide efflux transporter, putative
CT1271 glycosyl transferase
CT1965 conserved hypothetical protein
CT2256 geranylgeranyl hydrogenase bchP
CT0011 deoxyhypusine synthase, putative
CT1772 ribulose bisphosphate carboxylase, large subunit rbcL
CT1894 sulfhydrogenase, alpha subunit hydA
CT0472 conserved hypothetical protein
CT0274 carbon-nitrogen hydrolase family protein
CT1891 sulfhydrogenase, beta subunit hydB-1
CT1892 sulfhydrogenase, gamma subunit hydG-1
CT1250 sulfhydrogenase, gamma subunit hydG-2
CT1249 sulfhydrogenase, beta subunit hydB-2
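The idea behind a phylogenetic (PG) profile list like the one above can be sketched in a few lines of Python. This is only an illustration: the genome names and the presence/absence values are invented, and Hamming distance is an assumed similarity measure (the slides do not say which measure was used).

```python
# Hypothetical presence/absence profiles: 1 = a homolog is found in that genome.
# Genome order and all 0/1 values are invented for illustration.
GENOMES = ["g1", "g2", "g3", "g4", "g5", "g6"]

PROFILES = {
    "rbcL": (1, 0, 1, 0, 0, 1),
    "hydA": (1, 0, 1, 0, 0, 1),
    "hydB": (1, 0, 1, 0, 1, 1),
    "recA": (1, 1, 1, 1, 1, 1),
}


def hamming(a, b):
    """Number of genomes in which two profiles disagree."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same genomes")
    return sum(x != y for x, y in zip(a, b))


def nearest_profiles(query, profiles, top=3):
    """Rank all other genes by profile distance to the query gene."""
    q = profiles[query]
    ranked = sorted(
        (hamming(q, p), name) for name, p in profiles.items() if name != query
    )
    return ranked[:top]
```

Genes that rank close to the query share its distribution across genomes, which is the kind of signal that would link RbcL to the sulfhydrogenase subunits. A gene present everywhere (like the made-up "recA" row) carries little information.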

Page 89: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Carboxydothermus hydrogenoformans

• Isolated in Yellowstone
• Thermophile (grows at 80°C)
• Anaerobic
• Grows on CO (carbon monoxide)
• Produces hydrogen gas
• Low GC Gram-positive species
• Many Archaeal-like genes

Page 90: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Page 91: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


PG Profiling Works Better Using Orthology

Page 92: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Phylogenomics IV:

Duplications

Page 93: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Why Duplications Are Useful to Identify

• Allows division into orthologs and paralogs

• Improves functional predictions

• Helps identify mechanisms of duplication

• Can be used to study mutation processes in different parts of a genome

• Lineage-specific duplications may be indicative of species-specific adaptations
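The first bullet, dividing genes into orthologs and paralogs once duplications are identified, can be illustrated with a small sketch. It assumes a rooted binary gene tree written as nested tuples, leaf labels of the hypothetical form "species|gene", and the simple "species overlap" heuristic (a node whose two sides share a species is treated as a duplication). None of these choices come from the slides.

```python
def species_of(leaf):
    """Leaf labels are assumed to look like 'species|gene_id'."""
    return leaf.split("|")[0]


def leaves(tree):
    """All leaf labels under a node of a nested-tuple binary tree."""
    if isinstance(tree, str):
        return [tree]
    left, right = tree
    return leaves(left) + leaves(right)


def classify_pairs(tree, pairs=None):
    """Label every pair of leaves as 'ortholog' or 'paralog'.

    A pair is labeled by the node where the two leaves join: if the two
    child subtrees of that node share a species, the node is treated as a
    duplication (paralogs); otherwise as a speciation (orthologs).
    """
    if pairs is None:
        pairs = {}
    if isinstance(tree, str):
        return pairs
    left, right = tree
    l_leaves, r_leaves = leaves(left), leaves(right)
    shared = {species_of(x) for x in l_leaves} & {species_of(x) for x in r_leaves}
    kind = "paralog" if shared else "ortholog"
    for a in l_leaves:
        for b in r_leaves:
            pairs[frozenset((a, b))] = kind
    classify_pairs(left, pairs)
    classify_pairs(right, pairs)
    return pairs


# Toy tree: a duplication before the species split gives two ortholog pairs.
TOY_TREE = (("Ecoli|a1", "Vcho|a1"), ("Ecoli|a2", "Vcho|a2"))
TOY_PAIRS = classify_pairs(TOY_TREE)
```

In the toy tree, `Ecoli|a1` and `Vcho|a1` come out as orthologs, while `Ecoli|a1` and `Vcho|a2` come out as paralogs even though they are from different species.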

Page 94: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


C. pneumoniae - All Paralogs

Page 95: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


C. pneumoniae Lineage-Specific Paralogs

Page 96: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


E. coli Paralogs - All

Page 97: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


E. coli Paralogs - Top

Page 98: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


E. coli Paralogs - Recent

Page 99: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Expansion of MCP Family in V. cholerae

[Figure: neighbor-joining (NJ) tree of MCP homologs from E. coli, B. subtilis, Synechocystis sp., H. pylori (two strains), C. jejuni, A. fulgidus, T. maritima, T. pallidum, B. burgdorferi, P. abyssi, P. horikoshii, D. radiodurans, M. tuberculosis, and V. cholerae (VC and VCA loci). Individual leaf labels omitted.]

Heidelberg et al. (2000)

Page 100: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


B. anthracis lineage specific duplications

ORF04205 molybdopterin biosynthesis protein MoeA (moeA)
ORF05907 molybdopterin biosynthesis protein MoeA (moeA)
ORF02636 molybdopterin biosynthesis protein MoeA (moeA)
ORF04204 molybdopterin biosynthesis protein MoeB, putative
ORF05908 molybdopterin biosynthesis protein MoeB, putative
ORF02634 molybdopterin biosynthesis protein MoeB, putative
ORF05904 molybdopterin converting factor, subunit 1 (moaD)
ORF02639 molybdopterin converting factor, subunit 1 (moaD)
ORF04206 molybdopterin converting factor, subunit 2 (moaE)
ORF05905 molybdopterin converting factor, subunit 2 (moaE)
ORF02638 molybdopterin converting factor, subunit 2 (moaE)

Page 101: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


S. aureus Lineage Specific Duplications

ORF02715 4-diphosphocytidyl-2C-methyl-D-erythritol synthase, putative
ORF02712 alcohol dehydrogenase, zinc-containing
ORF00701 alpha-hemolysin precursor (2X)
ORF00717 antibacterial protein
ORF02597 capsular polysaccharide biosynthesis proteins CapABC (2X)
ORF00804 cell wall hydrolase (3X)
ORF00657 cell wall surface anchor family protein (2X)
ORF00358 clumping factor (2X)
ORF01758 deoxyribose-phosphate aldolase (deoC)
ORF02579 purine nucleoside phosphorylase (deoD)
ORF01031 drug transporter, putative
ORF00805 endopeptidase resistance gene (eprH)
ORF00706 exotoxin 1,3,4,5, unknown (2X)
ORF02184 fibronectin (2X)
ORF00097 glycosyl transferase, group 1 family protein (3X)
ORF02086 IgG-binding protein (2X)
ORF02431 integrase/recombinase, core domain family (3X)

Analysis done with S. Gill

Page 102: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

S. aureus Lineage Specific Duplications

Conserved hypothetical protein: ORF00137, ORF00138, ORF00139, ORF00140, ORF00141, ORF00142, ORF00143, ORF00144, ORF00145, ORF00146, ORF00148, ORF00667, ORF01251, ORF02160, ORF02166, ORF02170, ORF02171, ORF02507, ORF02745, ORF02760, ORF02762, ORF02763, ORF02766, ORF02768, ORF02769, ORF02770, ORF02771, ORF02772, ORF02773, ORF02774, ORF02896, ORF02974

ORF02711 conserved hypothetical protein UPF0007
ORF02614 conserved hypothetical protein, authentic frameshift

Hypothetical protein: ORF00286, ORF00338, ORF00361, ORF00412, ORF00415, ORF00614, ORF00697, ORF00703, ORF00705, ORF00875, ORF00876, ORF00877, ORF00879, ORF00888, ORF00889, ORF01024, ORF01041, ORF01089, ORF01091, ORF01092, ORF01093, ORF01095, ORF01446, ORF01462, ORF01918, ORF02099, ORF02102, ORF02158, ORF02159, ORF02172, ORF02430, ORF02434, ORF02530, ORF02531, ORF02532, ORF02533, ORF02534

Analysis done with S. Gill

Page 103: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Lineage Specific Duplications in Wolbachia wMel

Annotation (repeated entries collapsed to one line each)

ankyrin repeat domain protein
conserved domain protein
conserved hypothetical protein
conserved hypothetical protein, FRAMESHIFT
conserved hypothetical protein, POINT MUTATION
conserved hypothetical protein, degenerate
conserved hypothetical protein, interruption-C
conserved hypothetical protein, truncated
conserved hypothetical protein, truncation
DNA mismatch repair protein MutL (mutL)
DNA repair protein RadC, putative
DNA repair protein RadC, putative, truncation
DNA repair protein RadC, truncation
DnaJ domain protein
exopolysaccharide synthesis protein ExoD-related protein
HNH endonuclease family protein
hypothetical protein
major facilitator family transporter
membrane protein, putative
MutL family protein
Na+/H+ antiporter family protein
Na+/H+ antiporter, putative
permease, putative
portal protein, FRAMESHIFT
prophage LambdaW1, DNA methylase
prophage LambdaW1, terminase large subunit, putative
prophage LambdaW2, ankyrin repeat domain protein
prophage LambdaW2, baseplate assembly protein J, putative
prophage LambdaW2, baseplate assembly protein V, putative, FRAMESHIFT
prophage LambdaW2, baseplate assembly protein W, putative
prophage LambdaW2, minor tail protein Z, putative, FRAMESHIFT
prophage LambdaW2, site-specific recombinase, resolvase family
prophage LambdaW4, ankyrin repeat domain protein
prophage LambdaW4, DNA methylase
prophage LambdaW4, portal protein, FRAMESHIFT
prophage LambdaW4, terminase large subunit, putative
prophage LambdaW5, ankyrin repeat domain protein
prophage LambdaW5, baseplate assembly protein J, putative, FRAMESHIFT
prophage LambdaW5, baseplate assembly protein V, putative
prophage LambdaW5, baseplate assembly protein W, putative
prophage LambdaW5, minor tail protein Z, putative, degenerate, FRAMESHIFT
prophage LambdaW5, site-specific recombinase, resolvase family
regulatory protein RepA, putative
reverse transcriptase, putative
sodium/alanine symporter family protein
TenA/THI-4 family protein
transcriptional regulator
transcriptional regulator, putative
translation elongation factor Tu (tuf)
transposase, degenerate
transposase, IS4 family
transposase, IS5 family, interruption-N
transposase, IS5 family, truncation
transposase, putative, degenerate
type IV secretion system protein VirB4, putative
UDP-N-acetylglucosamine pyrophosphorylase-related protein

Page 104: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


MutL Duplication in Wolbachia wMel

ORF01096 DNA mismatch repair protein MutL (mutL)
ORF00446 MutL family protein

Page 105: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


MutL Duplication in Wolbachia wMel

Page 106: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

[Figure: phylogenetic tree of DsrB and DsrC sequences, with bootstrap values, showing a duplication of DsrBC. Taxa: Archaeoglobus fulgidus (2650205, 2650206), Archaeoglobus profundus (3892197, 389219), Desulfovibrio vulgaris (902747, 902748, 1591554), Desulfotomaculum thermocisternum (1, 2), Chlorobium tepidum (CT0852, CT0853, CT2248, CT2249), Allochromatium vinosum (2576393, 2576394), Pyrobaculum islandicum (2459568, 2459569), and Methanobacterium thermoautotrophicum coF420 (2621331).]

Recent Duplication of DsrBC in C. tepidum

Page 107: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

[Figure: phylogenetic tree (scale bar 0.1) of UV endonuclease homologs from Schizosaccharomyces pombe, Neurospora crassa, Clostridium perfringens, Bacillus subtilis, Bacillus cereus, Bacillus halodurans, Deinococcus radiodurans, and Nostoc sp. PCC 7120, plus two copies labeled "B BACAN 01914 UV endonuclease" and "B BACAN 01459 UV endonuclease".]

Older Duplication of UVDE

Page 108: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


X-files

Eisen et al. 2000. Genome Biology 1(6): 11.1-11.9

Also see Tillier and Collins. 2000. Nature Genetics 26(2):195-7 and Suyama and Bork. 2001. Trends in Genetics 17: 10-13.

Page 109: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Phylogenomics V:

Genome Dynamics

Page 110: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


V. cholerae vs. E. coli All

Page 111: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


V. cholerae vs. E. coli Best

Page 112: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


V. cholerae vs. E. coli: Top

Page 113: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


V. cholerae vs. E. coli: Top, Rotated

Page 114: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Duplication and Gene Loss Model

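[Figure: schematic of the duplication and gene loss model. An ancestral gene order (A B C D E F) is duplicated to give a second copy (A’ B’ C’ D’ E’ F’); the E. coli and V. cholerae lineages then lose different members of the duplicated set, so each retains a different mix of primed and unprimed genes.]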

Page 115: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Symmetric Inversion Model

[Figure: schematic of the symmetric inversion model. The common ancestor of A and B has genes numbered 1 to 32. Lineage A (stages A1, A2, A3) and lineage B (stages B1, B2, B3) each undergo inversions around the origin or around the terminus (*), so that successive stages differ by blocks of genes flipped across the origin-terminus axis.]
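A small simulation shows why inversions that are symmetric around the origin or terminus produce an X-shaped dot plot. This is a sketch under stated assumptions, not the analysis behind the slides: the chromosome is a circular list with an even number of genes, the origin sits at the boundary before index 0, the terminus at the boundary before index n/2, and every inversion is exactly symmetric.

```python
import random


def invert_around(genome, center, half_width):
    """Reverse the circular segment of 2*half_width genes that is centered
    on the boundary just before index `center`."""
    n = len(genome)
    idx = [(center - half_width + i) % n for i in range(2 * half_width)]
    segment = [genome[i] for i in idx]
    out = list(genome)
    for i, gene in zip(idx, reversed(segment)):
        out[i] = gene
    return out


def simulate(n_genes=1000, n_inversions=20, seed=0):
    """Apply random symmetric inversions to an ancestral order 0..n_genes-1."""
    rng = random.Random(seed)
    genome = list(range(n_genes))
    for _ in range(n_inversions):
        center = rng.choice([0, n_genes // 2])      # origin or terminus
        half = rng.randint(1, n_genes // 2 - 1)     # keeps the segment shorter than the genome
        genome = invert_around(genome, center, half)
    return genome


def dot_plot_points(genome):
    """(position in ancestor, position in descendant) for every gene."""
    return [(gene, pos) for pos, gene in enumerate(genome)]


POINTS = dot_plot_points(simulate())
```

Each symmetric inversion sends a gene at position p to position n-1-p, so every point lies either on the diagonal or on the anti-diagonal. Plotting `POINTS` as a scatter plot gives the X pattern. Real genomes also experience gene loss, insertions, and asymmetric events, which this sketch ignores.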

Page 116: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


V. cholerae vs. E. coli: Top

Page 117: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

C. trachomatis vs C. pneumoniae

[Figure: dot plot comparing C. trachomatis MoPn with C. pneumoniae AR39; the origin and terminus are marked.]

Page 118: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


M. leprae vs. M. tuberculosis

Page 119: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

B. subtilis vs. S. aureus

[Figure: scatter plot; axes run from 0 to 3000 and from 2632200 to 2636700.]

analysis w/ S. Gill

Page 120: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

P. putida vs. P. aeruginosa Orthologs

[Figure: scatter plot; axes run from 0 to 8000 and from 9945700 to 9951700.]

analysis w/ K. Nelson

Page 121: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

StrpB vs. StrpA All

[Figure: scatter plot; axes run from 0 to 2500 and from 13621300 to 13623100.]

Page 122: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

StrpB vs. StrpA: Orthologs

[Figure: scatter plot; axes run from 0 to 2500 and from 13621300 to 13623100.]

Page 123: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Why Are Inversions Symmetrical Around the Origin?

• Genetic studies in Salmonella and E. coli suggest that there may be strong selection against other inversions

• See:
– Mahan, Segall, Schmid, and Roth
– Liu and Sanderson
– Rebollo, Francois, and Louarn

Page 124: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

TIGR

Other people

Mom and Dad
H. Ochman
W. Martin
F. Robb
J. Battista
E. Orias
D. Bryant
S. O’Neill
M. Eisen
N. Moran
R. Myers
C. M. Cavanaugh
P. Hanawalt
NSF
J. Heidelberg
T. Read
N. Ward
M-I Benito
J. C. Venter
C. Fraser
S. Salzberg
O. White
I. Paulsen

$$$

ONR
DOE
NIH
H. Tettelin

Eisen Group

Martin Wu
Dongying Wu
James Sakwa
Jonathan Badger

Page 125: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Vertical Inheritance

From C. Darwin, On the Origin of Species, via W. F. Doolittle

Page 126: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Lateral Transfer

Page 127: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Why Gene Transfers Are Useful to Identify

• Laterally transferred genes frequently involved in environmental adaptations and/or pathogenicity

• Identification of vectors of gene transfer (e.g., transposons, integrons, phage)

• Identify species associations in the environment (e.g., Thermotoga and Archaea, Nelson et al.)

• Identify organelle-derived genes in eukaryotic genomes

• Important for understanding of evolution

Page 128: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Examples of Horizontal Transfers

• Antibiotic resistance genes on plasmids

• Toxin resistance genes on plasmids

• Insertion sequences

• Agrobacterium Ti plasmid

• Virus and phage gene acquisition and transfer

• Organelle to nucleus transfers

Page 129: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Steps in Lateral Gene Transfer (LGT)

[Figure: tree of lineages A, B, C, and D with the steps of a transfer numbered]

1
2 Transfer
3-5 Integration, selection, spread
6 Amelioration (gene acquires host features)

Page 130: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Inference of Gene Transfer Involves Identifying Unusual Genes

• Unusual distribution patterns

• High sequence similarity to supposedly distantly related species

• Unusual nucleotide composition

• Unusual patterns of evolutionary relatedness (gene vs. species)
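One of the criteria above, unusual nucleotide composition, is easy to sketch. The example below flags genes whose GC content is far from the genome-wide mean. The gene names, sequences, and the z-score cutoff of 2.0 are all hypothetical, and real analyses typically use richer statistics (codon usage, oligonucleotide frequencies) than GC content alone.

```python
from statistics import mean, pstdev


def gc_content(seq):
    """Fraction of G and C among the unambiguous bases of a sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    acgt = gc + seq.count("A") + seq.count("T")
    return gc / acgt if acgt else 0.0


def flag_atypical(genes, z_cutoff=2.0):
    """Names of genes whose GC content lies more than z_cutoff standard
    deviations from the mean GC content of all genes supplied."""
    values = {name: gc_content(seq) for name, seq in genes.items()}
    mu = mean(values.values())
    sd = pstdev(values.values())
    if sd == 0:
        return []
    return sorted(name for name, v in values.items() if abs(v - mu) / sd > z_cutoff)


# Invented example: nine genes at 50% GC and one GC-rich "island" gene.
TOY_GENES = {f"gene{i}": "ATGC" * 5 for i in range(9)}
TOY_GENES["island"] = "GGCC" * 5
```

A flagged gene is only a candidate. Atypical composition can also reflect selection or rate variation rather than transfer, so it should be checked against the other criteria.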

Page 131: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Alternative Explanations for Unusual Genes

• Gene loss
• Evolutionary rate variation
• Strong selection
– Purifying selection
– Directional selection
– Convergence
• Mistaken orthology assignment
• BLAST problems
• Contamination

Page 132: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

“Hundreds of human genes appear likely to have resulted from horizontal transfer from bacteria at some point in the vertebrate lineage.”

Page 133: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


IHGSC 2001

• Claim:
– Lateral transfer from bacteria to vertebrates

• Evidence:
– Genes match bacteria but not non-vertebrate eukaryotes
– Or, genes have stronger match to bacteria than to non-vertebrates
– A set of ~120 of these genes are found in many bacterial species

Page 134: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Unusual Distribution Pattern

+ +

Page 135: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Implied Model for BVTs

[Figure: tree of Invertebrates, Vertebrates, Protists, and Bacteria with steps 1, 2, and 3-6 marked.]

Page 136: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Gene Loss?

Inverts Vertebrates Protists Bacteria

Page 137: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Rate Variation?

Inverts

Vertebrates

Protists

Bacteria

Page 138: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Unusual Distribution - Incomplete Data

[Figure: tree with gene presence (+) in two lineages and unknown status (+/-) in two others; annotation: "Gene present in ancestor".]

Page 139: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Number of pBVTs Depends on # of Genomes Analyzed

[Figure: bar chart. Axis label: Number of protein sets; categories 1, 2, 3, 4, 5, Other; value axis from 0 to 1800; legend: Fruit fly, C. elegans, Arabidopsis, Yeast, Parasites.]

Salzberg et al. 2001
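The dependence of the candidate count on the number of genomes examined can be shown with a toy filter. Everything here is invented for illustration (gene names, hit sets, genome labels); the function simply keeps vertebrate genes that have a bacterial match and no match in any non-vertebrate proteome examined so far.

```python
def candidate_bvts(vertebrate_genes, bacterial_hits, nonvert_hits_by_genome, genomes):
    """Genes with a bacterial match and no match in any of the listed
    non-vertebrate genomes."""
    seen_elsewhere = set()
    for genome in genomes:
        seen_elsewhere |= nonvert_hits_by_genome[genome]
    return {
        gene
        for gene in vertebrate_genes
        if gene in bacterial_hits and gene not in seen_elsewhere
    }


# Invented data: which vertebrate genes have a match in each proteome.
VERTEBRATE_GENES = {"v1", "v2", "v3", "v4", "v5", "v6"}
BACTERIAL_HITS = {"v1", "v2", "v3", "v4"}
NONVERT_HITS = {
    "fly": {"v1", "v5"},
    "worm": {"v2"},
    "yeast": {"v1", "v6"},
    "plant": {"v3"},
}

ORDER = ["fly", "worm", "yeast", "plant"]
COUNTS = [
    len(candidate_bvts(VERTEBRATE_GENES, BACTERIAL_HITS, NONVERT_HITS, ORDER[:k]))
    for k in range(len(ORDER) + 1)
]
```

With these made-up sets the candidate list shrinks from four genes to one as proteomes are added. Absence of a match in a small sample of genomes is weak evidence for transfer.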

Page 140: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

Trees Don’t Support Transfer

[Figure: phylogenetic tree (scale bar 0.2) with clades labeled Bacteria, Vertebrates, and Virus and groups I, II, and III. Vertebrate sequences: Homo sapiens (including HAS1), Mus musculus (including HAS1), Rattus norvegicus, Bos taurus, Gallus gallus, Xenopus laevis, Danio rerio. Virus: Paramecium bursaria Chlorella virus 1. Bacterial sequences: Bradyrhizobium sp. SNU001, Rhizobium leguminosarum, Rhizobium sp. (including NodC), Rhizobium loti, Rhizobium tropici, Rhizobium meliloti, Rhizobium galegae, Mesorhizobium sp. 7653R, Sinorhizobium meliloti, Azorhizobium caulinodans, Stigmatella aurantiaca, Streptomyces coelicolor, Streptococcus uberis, Streptococcus equisimilis, Streptococcus pyogenes HASA, Streptococcus pneumoniae.]

Salzberg et al. 2001

Page 141: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Trees Don’t Support Transfer II

Page 142: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Alternative explanations

• Gene loss from non-vertebrate eukaryotes
• Rapid divergence in non-vertebrate eukaryotes
• Some non-vertebrate genomes are incomplete
• Bad annotation/gene finding
• Contamination
• BLAST ≠ evolution

Page 143: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Organellar to Nuclear Transfers in A. thaliana

Page 144: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

A. thaliana Nuclear Proteins: Best Matches to Complete Genomes

[Figure: bar chart of the number of best matches (value axis 0 to 4000) per complete genome. Genome codes along the category axis: CHLTE, PORGI, BACSU, MCYTU, BBUR, TREPA, CHLPN, ECOLI, NEIME, RICPR, CAUCR, HELPY, SYNSP, AQUAE, DEIRA, THEMA, AERPE, ARCFU, METJA, METTH, PYRAB, CELEG, YEAST, DROME; groups labeled B, A, and E.]
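A tally of best matches per genome, as in the chart on this slide, can be computed from a table of search hits. The sketch below assumes hits are already available as (query, genome code, score) tuples; the protein names and scores are invented, and only the genome codes are taken from the slide.

```python
from collections import Counter


def best_matches(hits):
    """Count, for each genome, how many queries have their top-scoring hit there.

    `hits` is an iterable of (query, genome_code, score) tuples; ties keep
    the first hit seen.
    """
    best = {}
    for query, genome, score in hits:
        if query not in best or score > best[query][1]:
            best[query] = (genome, score)
    return Counter(genome for genome, _ in best.values())


# Invented hits for three hypothetical A. thaliana proteins.
TOY_HITS = [
    ("At_p1", "SYNSP", 310.0),
    ("At_p1", "ECOLI", 120.0),
    ("At_p2", "ECOLI", 95.0),
    ("At_p2", "SYNSP", 240.0),
    ("At_p3", "ECOLI", 180.0),
    ("At_p3", "YEAST", 60.0),
]
TALLY = best_matches(TOY_HITS)
```

A best match is a similarity result, not a phylogenetic one, so a tally like this is a starting point for building trees rather than evidence of origin by itself.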

Page 145: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Best Matches vs. Prokaryotes

Page 146: Talk on Phylogenomics for MBL Molecular Evolution Course 2004


Best Matches vs. Prokaryotes

Page 147: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

A. thaliana T1E2.8 is a Chloroplast Derived HSP60

[Figure: phylogenetic tree of HSP60 homologs with ARATH-T1E2.8 marked by asterisks and groups labeled Eukarya, Archaea, Bacteria, and Cyano/Cpst. Leaf labels are genome codes, including HAEIN, VIBCH, RICPR, YEAST, CHLPN, CHLTR, AQUAE, CAMJE, HELPY, BBUR, TREPA, THEMA, BACSU, DEIRA, MCYTU, SYNSP, ODONT CPST, MYCGE, MYCPN, ARCFU, METJA, PYRHO, METTH, and CELEG.]

Page 148: Talk on Phylogenomics for MBL Molecular Evolution Course 2004

rRNA vs. Whole Genome Trees

[Figure: trees (scale bar 0.05 changes) with Archaea, Bacteria, and Eukarya labeled. Taxa: Mycobacterium tuberculosis, Bacillus subtilis, Synechocystis sp., Caenorhabditis elegans, Drosophila melanogaster, Saccharomyces cerevisiae, Methanobacterium thermoautotrophicum, Archaeoglobus fulgidus, Pyrococcus horikoshii, Methanococcus jannaschii, Aeropyrum pernix, Aquifex aeolicus, Thermotoga maritima, Deinococcus radiodurans, Treponema pallidum, Borrelia burgdorferi, Helicobacter pylori, Campylobacter jejuni, Neisseria meningitidis, Escherichia coli, Vibrio cholerae, Haemophilus influenzae, Rickettsia prowazekii, Mycoplasma pneumoniae, Mycoplasma genitalium, Chlamydia trachomatis, Chlamydia pneumoniae.]