62
Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time Thu-Hien To Michel Habib LIAFA and University Paris Diderot - Paris 7 Combinatorial Pattern Matching – June 22-24 2009 liafa

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Level-k Phylogenetic Networks are Constructablefrom a Dense Triplet Set in Polynomial Time

Thu-Hien To Michel Habib

LIAFA and University Paris Diderot - Paris 7

Combinatorial Pattern Matching – June 22-24 2009

liafa

Page 2: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

1 Introduction

2 Notations

3 General Algorithm to Construct Networks from a Triplet Set

4 Analyse the Algorithm

5 Conclusion

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 3: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Introduction

Phylogenetic: infer plausible evolutionary histories frombiological data of currently living species

Biological data: sequences, distance matrix, triplets, quartets,subtrees, characters etc

Considered problem: infer phylogenetic networks from a tripletset

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 4: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Introduction

Phylogenetic: infer plausible evolutionary histories frombiological data of currently living species

Biological data: sequences, distance matrix, triplets, quartets,subtrees, characters etc

Considered problem: infer phylogenetic networks from a tripletset

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 5: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Introduction

Phylogenetic: infer plausible evolutionary histories frombiological data of currently living species

Biological data: sequences, distance matrix, triplets, quartets,subtrees, characters etc

Considered problem: infer phylogenetic networks from a tripletset

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 6: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Combining triplets

Phylogenetic tree: a binary rooted tree whose each leaf is labeledby a species

Triplet: a phylogenetic tree on 3 species

⇒ O(|T |.|n|) algorithm, with n = |L|, by Aho, Sagiv, Szymanski,and Ullman ’81

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 7: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Combining triplets

Phylogenetic tree: a binary rooted tree whose each leaf is labeledby a speciesTriplet: a phylogenetic tree on 3 species

⇒ O(|T |.|n|) algorithm, with n = |L|, by Aho, Sagiv, Szymanski,and Ullman ’81

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 8: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Combining triplets

Phylogenetic tree: a binary rooted tree whose each leaf is labeledby a speciesTriplet: a phylogenetic tree on 3 species

⇒ O(|T |.|n|) algorithm, with n = |L|, by Aho, Sagiv, Szymanski,and Ullman ’81

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 9: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Combining triplets

Phylogenetic tree: a binary rooted tree whose each leaf is labeledby a speciesTriplet: a phylogenetic tree on 3 species

⇒ O(|T |.|n|) algorithm, with n = |L|, by Aho, Sagiv, Szymanski,and Ullman ’81

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 10: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Combining triplets

Phylogenetic tree: a binary rooted tree whose each leaf is labeledby a speciesTriplet: a phylogenetic tree on 3 species

⇒ O(|T |.|n|) algorithm, with n = |L|, by Aho, Sagiv, Szymanski,and Ullman ’81

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 11: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Conflicting triplets

Problem

Input: a triplet set T on a leaf set LOutput: a phylogenetic network consistent with T

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 12: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Conflicting triplets

Problem

Input: a triplet set T on a leaf set LOutput: a phylogenetic network consistent with T

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 13: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Conflicting triplets

Problem

Input: a triplet set T on a leaf set LOutput: a phylogenetic network consistent with T

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 14: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Related works - Main result

If T is arbitrary: NP-complete for all levels > 0Jansson, Nguyen, Sung ’06Iersel, Keijsper, Kelk, Stougie ’08Iersel, Kelk, Mnich ’09

If T is dense, i.e there is at least one triplet in T on eachthree leaves, then:- O(n3) algorithm for level-1 networks - Jansson, Nguyen,Sung ’04,’06- and O(n8) algorithm for level-2 network - Iersel, Keijsper,Kelk, Stougie ’08

Main result presented here: If T is dense, for any fixedinteger k, it is possible to construct a level-k network inpolynomial time

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 15: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Related works - Main result

If T is arbitrary: NP-complete for all levels > 0Jansson, Nguyen, Sung ’06Iersel, Keijsper, Kelk, Stougie ’08Iersel, Kelk, Mnich ’09

If T is dense, i.e there is at least one triplet in T on eachthree leaves, then:- O(n3) algorithm for level-1 networks - Jansson, Nguyen,Sung ’04,’06- and O(n8) algorithm for level-2 network - Iersel, Keijsper,Kelk, Stougie ’08

Main result presented here: If T is dense, for any fixedinteger k, it is possible to construct a level-k network inpolynomial time

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 16: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Related works - Main result

If T is arbitrary: NP-complete for all levels > 0Jansson, Nguyen, Sung ’06Iersel, Keijsper, Kelk, Stougie ’08Iersel, Kelk, Mnich ’09

If T is dense, i.e there is at least one triplet in T on eachthree leaves, then:- O(n3) algorithm for level-1 networks - Jansson, Nguyen,Sung ’04,’06- and O(n8) algorithm for level-2 network - Iersel, Keijsper,Kelk, Stougie ’08

Main result presented here: If T is dense, for any fixedinteger k, it is possible to construct a level-k network inpolynomial time

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 17: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Phylogenetic network

Hybrid vertices model:- Recombination- Hybridization- Horizontal genetransfer- Ambiguity

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 18: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Consistency of a triplet with a network

∃ 2 vertices u 6= v andpairwise internallyvertex-disjoint paths:u x , u v , v y ,v z x |yz is consistent with the network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 19: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Consistency of a triplet with a network

∃ 2 vertices u 6= v andpairwise internallyvertex-disjoint paths:u x , u v , v y ,v z z |xy is also consistent with the network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 20: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Level of a Phylogenetic network

First defined by Choy, Jansson,Sadakane, Sung ’05- Take the underlying undi-rected graph U(N)- Take the biconnected compo-nents of U(N)- Level k ⇔ each biconnectedcomponent has at most khybrid vertices

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 21: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Level of a Phylogenetic network

First defined by Choy, Jansson,Sadakane, Sung ’05- Take the underlying undi-rected graph U(N)- Take the biconnected compo-nents of U(N)- Level k ⇔ each biconnectedcomponent has at most khybrid vertices

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 22: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Level of a Phylogenetic network

First defined by Choy, Jansson,Sadakane, Sung ’05- Take the underlying undi-rected graph U(N)- Take the biconnectedcomponents of U(N)- Level k ⇔ each biconnectedcomponent has at most khybrid vertices

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 23: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Level of a Phylogenetic network

First defined by Choy, Jansson,Sadakane, Sung ’05- Take the underlying undi-rected graph U(N)- Take the biconnected compo-nents of U(N)- Level k ⇔ each bicon-nected component has atmost k hybrid vertices

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 24: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Decompose a Phylogenetic network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 25: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Decompose a Phylogenetic network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 26: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Decompose a Phylogenetic network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 27: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Decompose a Phylogenetic network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 28: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Cut-arc

Cut-arc: arc whose removaldisconnects the networkHighest cut-arc: cut-arcconnects with the highestbiconnected component

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 29: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Simple network

By replacing each sub-network below ahighest cut-arc by a leaf, we obtain asimple network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 30: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Reconstruct Phylogenetic networks from a Triplet Set

first proposed by Jansson,Sung ’04- Partition the leaf setP = {P1, P2, . . . ,Pm}- Construct recursively anetwork consistent withT |Pi on each part Pi

- Construct a simple net-work Ns consistent withT ∇P- Replace each leaf of Ns

by the known correspondingsub-network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 31: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Reconstruct Phylogenetic networks from a Triplet Set

first proposed by Jansson,Sung - 2004- Partition the leaf set P ={P1, P2, . . . ,Pm}- Construct recursively anetwork consistent withT |Pi on each part Pi

- Construct a simple networkNs consistent with T ∇P- Replace each leaf of Ns

by the known correspondingsub-network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 32: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Reconstruct Phylogenetic networks from a Triplet Set

T ∇P = {Pi |PjPk such thati , j , k are pairwise distinctand ∃x ∈ Pi , y ∈ Pj , z ∈ Pk

for which x |yz ∈ T }

first proposed by Jansson,Sung - 2004- Partition the leaf setP = {P1, P2, . . . ,Pm}- Construct recursively anetwork consistent withT |Pi on each part Pi

- Construct a simplenetwork Ns consistentwith T ∇P- Replace each leaf of Ns

by the known correspondingsub-network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 33: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Reconstruct Phylogenetic networks from a Triplet Set

first proposed by Jansson,Sung - 2004- Partition the leaf setP = {P1, P2, . . . ,Pm}- Construct recursively anetwork consistent withT |Pi on each part Pi

- Construct a simple net-work Ns consistent withT ∇P- Replace each leaf of Ns

by the known correspond-ing sub-network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 34: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Problems

Two problems to solve:

Construct a simple network consistent with T ∇PExist an O(nk+1) algorithm to find all simple networksconsistent with a dense triplet set - Iersel,Kelk ’08

Find all possible partitions P of the leaf set

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 35: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Problems

Two problems to solve:

Construct a simple network consistent with T ∇PExist an O(nk+1) algorithm to find all simple networksconsistent with a dense triplet set - Iersel,Kelk ’08

Find all possible partitions P of the leaf set

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 36: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Properties of a Partition of the Leaf Set

∀x /∈ Pi and y , z ∈ Pi , theonly triplet on three leavesx , y , z , if there is any, is x |yz

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 37: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

SN-set

SN-set

first defined by Jansson, Sung ’04Let A ⊆ L. A is a SN-set, or Simple Network set,if either:- it is a singleton- or the whole L- or ∀x ∈ L\A, y , z ∈ A, the only triplet on{x , y , z} in T , if there is any, is x |yz .

Remark: Each part of the partition is a SN-set

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 38: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

SN-tree

If T is dense, the collection of all SN-sets is laminar - Jansson,Nguyen, Sung ’06

⇒ is tree-representable

- each node of the SN-tree represents a SN-set- the number of non-singleton SN-sets is O(n) with n = |L|- SN-tree can be computed from T in O(n3) -Jansson,Nguyen,Sung ’06

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 39: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

SN-tree

If T is dense, the collection of all SN-sets is laminar - Jansson,Nguyen, Sung ’06⇒ is tree-representable

- each node of the SN-tree represents a SN-set- the number of non-singleton SN-sets is O(n) with n = |L|- SN-tree can be computed from T in O(n3) -Jansson,Nguyen,Sung ’06

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 40: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Characterize a partition by split SN-sets

Remark: A partition ⇔ a set of split SN-sets

split SN-set: each child of a split SN-set is a part of the partition.

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 41: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

A SN-set split in a network

A = a1 ∪ a2 ∪ a3 issplit in this network

Fact: a SN-set is split in a network iff each of itschildren is hung below a different highest cut-arcof this network

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 42: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

A function from a split SN-set to a set of hybrid vertices

3cm8cm

Define a function f from a split SN-set to a setof hybrid vertices- Let A = a1 ∪ · · · ∪ am be a split SN-set. Each ai

is hung below a highest cut-arc (ui , vi )- H is the set of hybrid vertices of the highestbiconnected component ⇒ |H| ≤ k

f (A) = {h ∈ H|∃i so that ui h and the pathfrom ui to h does not contain any internal hybridvertex}

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 43: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

A function from a split SN-set to a set of hybrid vertices

3cm8cm

Define a function f from a split SN-set to a setof hybrid vertices- Let A = a1 ∪ · · · ∪ am be a split SN-set. Each ai

is hung below a highest cut-arc (ui , vi )- H is the set of hybrid vertices of the highestbiconnected component ⇒ |H| ≤ k

f (A) = {h ∈ H|∃i so that ui h and the pathfrom ui to h does not contain any internal hybridvertex}

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 44: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Bound the number of split SN-sets in a level-k network

Lemma 1

Let N be any network consistent with T , then(i) f (A) 6= ∅ for any SN-set A split in N(ii) ∀h ∈ H, there are at most three SN-sets split in N, whoseimage by f contains h

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 45: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Bound the number of split SN-sets in a level-k network

Lemma 1

Let N be any network consistent with T , then(i) f (A) 6= ∅ for any SN-set A split in N(ii) ∀h ∈ H, there are at most three SN-sets split in N, whoseimage by f contains h

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 46: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Bound the number of split SN-sets in a level-k network

Lemma 1

Let N be any network consistent with T , then(i) f (A) 6= ∅ for any SN-set A split in N(ii) ∀h ∈ H, there are at most three SN-sets split in N, whoseimage by f contains h

Lemma 2

Let T be a dense triplet set. For any level-k network Nconsistent with T , there are at most 3k SN-sets of T whichare split in N

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 47: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Theorem

Given a dense triplet set T , and a fixed positive integer k, it ispossible to construct a level-k network consistent with T , if oneexists, in O(|T |k+1n3k+1) time.

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 48: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Proof of complexity

Similar to those of level-1 and level-2 networks, we construct oneach SN-set A, in small-big order, a sub-network consistent withT |A

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 49: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Proof of complexity

Similar to those of level-1 and level-2 networks, we construct oneach SN-set A, in small-big order, a sub-network consistent withT |A

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 50: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Proof of complexity

Similar to those of level-1 and level-2 networks, we construct oneach SN-set A, in small-big order, a sub-network consistent withT |A

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 51: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Proof of complexity

Similar to those of level-1 and level-2 networks, we construct oneach SN-set A, in small-big order, a sub-network consistent withT |A

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 52: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Proof of complexity

Similar to those of level-1 and level-2 networks, we construct oneach SN-set A, in small-big order, a sub-network consistent withT |A

There are O(n) non-singleton SN-sets⇒ O(n) constructions of a network on a SN-set by knowing asub-network on each smaller SN-set

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 53: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Proof of complexity

For each of the O(n) constructions:- Constructing a simple level-k network: O(|T |k+1) - Iersel,Kelk ’08- The number of partitions = the number of sets of split SN-sets :O(n3k)

⇒ Total: O(|T |k+1n3k+1)

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 54: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Proof of complexity

For each of the O(n) constructions:- Constructing a simple level-k network: O(|T |k+1) - Iersel,Kelk ’08- The number of partitions = the number of sets of split SN-sets :O(n3k)⇒ Total: O(|T |k+1n3k+1)

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 55: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

The network with the minimum number of recombinations

Theorem

Given a dense triplet set T , and a fixed positive integer k, it ispossible to construct a level-k network consistent with T with theminimum number of recombinations, if one exists, inO(|T |k+1n3k+1) time.

Idea for the proof: If N is a level-k network with the minimumnumber of recombinations, then each sub-network below a highestcut-arc of N is also the one with the minimum number ofrecombinations

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 56: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

The network with the minimum number of recombinations

Theorem

Given a dense triplet set T , and a fixed positive integer k, it ispossible to construct a level-k network consistent with T with theminimum number of recombinations, if one exists, inO(|T |k+1n3k+1) time.

Idea for the proof: If N is a level-k network with the minimumnumber of recombinations, then each sub-network below a highestcut-arc of N is also the one with the minimum number ofrecombinations

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 57: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Conclusion

Constructing a level-k network consistent with a dense tripletset with the minimum number of recombinations ispolynomial with any fixed k

Better bound? better algorithm?

Relax the condition of density on the triplet set?

Open problem: the network with the minimum level consistentwith a dense triplet set? NP-complete?

Thank you

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 58: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Conclusion

Constructing a level-k network consistent with a dense tripletset with the minimum number of recombinations ispolynomial with any fixed k

Better bound? better algorithm?

Relax the condition of density on the triplet set?

Open problem: the network with the minimum level consistentwith a dense triplet set? NP-complete?

Thank you

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 59: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Conclusion

Constructing a level-k network consistent with a dense tripletset with the minimum number of recombinations ispolynomial with any fixed k

Better bound? better algorithm?

Relax the condition of density on the triplet set?

Open problem: the network with the minimum level consistentwith a dense triplet set? NP-complete?

Thank you

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 60: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Conclusion

Constructing a level-k network consistent with a dense tripletset with the minimum number of recombinations ispolynomial with any fixed k

Better bound? better algorithm?

Relax the condition of density on the triplet set?

Open problem: the network with the minimum level consistentwith a dense triplet set? NP-complete?

Thank you

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 61: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Conclusion

Constructing a level-k network consistent with a dense tripletset with the minimum number of recombinations ispolynomial with any fixed k

Better bound? better algorithm?

Relax the condition of density on the triplet set?

Open problem: the network with the minimum level consistentwith a dense triplet set? NP-complete?

Thank you

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Page 62: Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time

Introduction Notations General Algorithm to Construct Networks from a Triplet Set Analyse the Algorithm Conclusion

Conclusion

Constructing a level-k network consistent with a dense tripletset with the minimum number of recombinations ispolynomial with any fixed k

Better bound? better algorithm?

Relax the condition of density on the triplet set?

Open problem: the network with the minimum level consistentwith a dense triplet set? NP-complete?

Thank you

Thu-Hien To LIAFA and University Paris Diderot - Paris 7

Level-k Phylogenetic Networks are Constructable from a Dense Triplet Set in Polynomial Time