83
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

  • View
    216

  • Download
    0

Embed Size (px)

Citation preview

Page 1: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

BIOE 109Summer 2009

Lecture 4- Part IIPhylogenetic Inference

Page 2: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

What is phylogeny?

Page 3: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

What is phylogeny?A

B

(a) None(b) Both are phylogenetic trees(c) Only A is phylogenetic tree(d) Only B is phylogenetic tree

Page 4: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogeny: evolutionary history of a group of speciesor a gene.

What is phylogeny?

Page 5: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogeny: evolutionary history of a group of speciesor a gene

Phylogenetic tree: graphical summary of theevolutionary history

What is phylogeny?

Page 6: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference
Page 7: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogeny describes:-1. Pattern and/or timing of events that occurred as species

diversified.

2. Sequence in which lineages appeared

3. Which organisms are more closely or distantly related.

Page 8: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference 

Two points to keep in mind:

Page 9: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference 

Two points to keep in mind:

1. Phylogenetic trees are hypotheses-how reliable?

 

Page 10: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference 

Two points to keep in mind:

1. Phylogenetic trees are hypotheses 2. Gene trees are not the same as species trees

Page 11: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference 

Two points to keep in mind:

1. Phylogenetic trees are hypotheses 2. Gene trees are not the same as species trees

• a species tree depicts the evolutionary history of a group of species.

Page 12: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference 

Two points to keep in mind:

1. Phylogenetic trees are hypotheses 2. Gene trees are not the same as species trees

• a species tree depicts the evolutionary history of a group of species.

• a gene tree depicts the evolutionary history of a specific locus.

Page 13: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Conflict between gene trees and species trees

Page 14: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Conflict between gene trees and species trees

Page 15: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference  

• phylogenetic trees are built from “characters”.   

Page 16: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference  

• phylogenetic trees are built from “characters”.  • characters can be morphological, behavioral, physiological, or molecular.  

Page 17: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference  

• phylogenetic trees are built from “characters”.  • characters can be morphological, behavioral, physiological, or molecular.  • there are two important assumptions about the characters used to build trees:   

Page 18: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference  

• phylogenetic trees are built from “characters”.  • characters can be morphological, behavioral, physiological, or molecular.  • there are two important assumptions about the characters used to build trees:  1. they are independent. 

Page 19: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Phylogenetic Inference  

• phylogenetic trees are built from “characters”.  • characters can be morphological, behavioral, physiological, or molecular.  • there are two important assumptions about characters used to build trees:  1. they are independent. 2. they are homologous. 

Page 20: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

What is a homologous character?

Page 21: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

What is a homologous character?

• a homologous character is shared by two species because it was inherited from a common ancestor.

Page 22: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

What is a homologous character?

• a homologous character is shared by two species because it was inherited from a common ancestor.

• a character possessed by two species but was not present in their recent ancestors, it is said to exhibit “homoplasy”.

Page 23: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Types of homoplasy:   

Page 24: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Types of homoplasy:   

1. Convergent evolution 

Example: evolution of eyes, flight.  

Page 25: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Examples of convergent evolution

Page 26: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Types of homoplasy:   

1. Convergent evolution 

Example: evolution of eyes, flight.  

2. Parallel evolution 

Example: drug resistance in HIV.  

Page 27: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

What is the difference between convergent and parallel evolution?

   

Page 28: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

What is the difference between convergent and parallel evolution?

   

Convergent Parallel 

  

Page 29: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

What is the difference between convergent and parallel evolution?

   

Convergent Parallel 

Species compared: distantly closely related related

  

Page 30: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

What is the difference between convergent and parallel evolution?

   

Convergent Parallel 

Species compared: distantly closely related related

Trait produced by: different genes/ same genes/ developmental developmental pathways pathways

  

Page 31: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Types of homoplasy:   

1. Convergent evolution 

Example: evolution of eyes, flight.  

2. Parallel evolution 

Example: lactose tolerance in human adults  

3. Evolutionary reversals 

Example: back mutations at the DNA sequence level (C A C).

Page 32: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evolutionary reversals are common in DNA sequences

Page 33: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Our objective is to identify monophyletic groups

Page 34: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Our objective is to identify monophyletic groups

A monophyletic group is derived from a single ancestral species and includes all descendants (e.g., mammals).

Page 35: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Three monophyletic groups:

Page 36: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Two mistakes are possible:

1. A paraphyletic group is derived from a single ancestral species but does not include all descendants.

Page 37: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Reptiles are paraphyletic

Page 38: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Two mistakes are possible:

1. A paraphyletic group is derived from a single ancestral species but does not include all descendants (e.g., reptiles).

2. A polyphyletic group fails to include the most recent common ancestor.

Page 39: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

“Warm blooded animals” is a polyphyletic group

Page 40: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics  

1. Phenetics (Distance methods) 

Page 41: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics  

1. Phenetics (Distance methods) 

Objectives:

1. Tree should reflect overall degree of similarity.

 

Page 42: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics  

1. Phenetics (Distance methods) 

Objectives:

1. Tree should reflect overall degree of similarity.

 2. Tree should be based on as many characters as possible. 

Page 43: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics  

1. Phenetics (Distance methods) 

Objectives:

1. Tree should reflect overall degree of similarity.

 2. Tree should be based on as many characters as possible. 

3. Tree should minimize the distance among taxa.

Page 44: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Examples of distance trees-HIV strains

Discrete character data is converted into a distance value

Page 45: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Distance tree—HIV strains

- Captures overall degree of similarity- Branch lengths are important

-Drawbacks: (a) loss of information about which traits have

changed.(b) have to correct for multiple substitutions at

the same site.(c) the tree may not reflect “true” phylogenetic

relationship

Page 46: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics 

 2. Cladistics 

Page 47: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics 

 2. Cladistics 

Objectives:  

1. Tree should reflect the true phylogeny. 

Page 48: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics 

 2. Cladistics 

Objectives:  

1. Tree should reflect the true phylogeny. 

2. Tree should use characters that are shared (among two or more taxa) and derived (from some inferred or known ancestral state). 

Page 49: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics 

 2. Cladistics 

Objectives:  

1. Tree should reflect the true phylogeny. 

2. Tree should use characters that are shared (among two or more taxa) and derived (from some inferred or known ancestral state). 

• shared and derived characters are called synapomorphies.  

Page 50: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics 

 2. Cladistics 

Objectives:  

1. Tree should reflect the true phylogeny. 

2. Tree should use characters that are shared (among two or more taxa) and derived (from some inferred or known ancestral state). 

• shared and derived characters are called synapomorphies.  

3. Ancestral state of characters inferred from an outgroup that roots the tree.

Page 51: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Contending schools of systematics 

 2. Cladistics 

Objectives:  

1. Tree should reflect the true phylogeny. 

2. Tree should use characters that are shared (among two or more taxa) and derived (from some inferred or known ancestral state). 

• shared and derived characters are called synapomorphies.  

3. Ancestral state of characters inferred from an outgroup that roots the tree.

• an outgroup is ideally picked from the fossil record.

Page 52: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Example of a cladogram

Page 53: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do distance trees differ from cladograms?

Page 54: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do distance trees differ from cladograms?

  

Distance trees Cladograms 

Page 55: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do distance trees differ from cladograms?

  

Distance trees Cladograms  

Characters used as many as synapomorphies possible only

Page 56: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do distance trees differ from cladograms?

  

Distance trees Cladograms  

Characters used as many as synapomorphies possible only

 Monophyly not required absolute

requirement 

Page 57: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do distance trees differ from cladograms?

  

Distance trees Cladograms  

Characters used as many as synapomorphies possible only

 Monophyly not required absolute

requirement Emphasis branch lengths branch-splitting

Page 58: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do distance trees differ from cladograms?

  

Distance trees Cladograms  

Characters used as many as synapomorphies possible only

 Monophyly not required absolute

requirement Emphasis branch lengths branch-splitting

Outgroup not required absolute requirement

Page 59: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

No. of Taxa No. of possible trees

4 3

Page 60: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

No. of Taxa No. of possible trees

4 3 5 15

Page 61: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

No. of Taxa No. of possible trees

4 3 5 15 6 105

Page 62: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

No. of Taxa No. of possible trees

4 3 5 15 6 105 7 945

Page 63: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

No. of Taxa No. of possible trees

4 3 5 15 6 105 7 94510 2 x 106

Page 64: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

No. of Taxa No. of possible trees

4 3 5 15 6 105 7 94510 2 x 106

11 34 x 106

Page 65: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

No. of Taxa No. of possible trees

4 3 5 15 6 105 7 94510 2 x 106

11 34 x 106

50 3 x 1074

Page 66: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

Page 67: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

A. Maximum parsimony: the “best” tree is that which minimizes the number of evolutionary steps (changes among characters). 

Page 68: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree? 

A. Maximum parsimony: the “best” tree is that which minimizes the number of evolutionary steps (changes among characters). 

-the simplest explanation is preferred over more complicated ones. 

Page 69: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Examples of convergent evolution

Page 70: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Independent gain of camera eye requires two changes

Page 71: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evolution and loss of camera eye requires six changes

Page 72: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree?  

B. Maximum likelihood: the “best” tree is that which maximizes the likelihood of producing the observed data. 

Page 73: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree?  

B. Maximum likelihood: the “best” tree is that which maximizes the likelihood of producing the observed data. 

- likelihood scores are estimated from a specific model of base substitution and a specific tree.

Page 74: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

How do we select the “best” tree?

C. Bootstrapping

Page 75: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evaluating tree support by bootstrapping

Species 1 A A C G C C T… GSpecies 2 A T C G C C T… GSpecies 3 A T T G A C C… GSpecies 4 A T T G A C C… G

Page 76: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evaluating tree support by bootstrapping

Species 1 A A C G C C T… GSpecies 2 A T C G C C T… GSpecies 3 A T T G A C C… GSpecies 4 A T T G A C C… G

Species 1

Species 2

Species 3

Species 4

Page 77: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evaluating tree support by bootstrapping

Species 1 A A C G C C T… GSpecies 2 A T C G C C T… GSpecies 3 A T T G A C C… GSpecies 4 A T T G A C C… G

Step 1. Randomly select a base to represent position 1

Page 78: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evaluating tree support by bootstrapping

Species 1 A A C G C C T… GSpecies 2 A T C G C C T… GSpecies 3 A T T G A C C… GSpecies 4 A T T G A C C… G

Step 1. Randomly select a base to represent position 1

Species 1 T Species 2 TSpecies 3 C Species 4 C

Page 79: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evaluating tree support by bootstrapping

Species 1 A A C G C C T… GSpecies 2 A T C G C C T… GSpecies 3 A T T G A C C… GSpecies 4 A T T G A C C… G

Step 2. Randomly select a base to represent position 2

Species 1 T GSpecies 2 T GSpecies 3 C GSpecies 4 C G

Page 80: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evaluating tree support by bootstrapping

Step 3. Generate complete data set (sampling with replacement).

Page 81: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evaluating tree support by bootstrapping

Step 3. Generate complete data set (sampling with replacement).

Step 4. Build tree and record if groupings match original tree.

Page 82: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Evaluating tree support by bootstrapping

Step 3. Generate complete data set (sampling with replacement).

Step 4. Build tree and record if groupings match original tree.

Step 5. Repeat 1,000 times.

Page 83: BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference

Species 1

Species 2

Species 3

Species 4

Evaluating tree support by bootstrapping

98

92