37
MITOGENOMICS MITOGENOMICS AND MOLECULAR AND MOLECULAR PHYLOGENETICS OF FISH BASED ON PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine Biology of Far Eastern Branch of Russian Academy of Sciences, Vladivostok 690059, Russia; Far Eastern Federal University, Vladivostok 690095, Russia; e-mail: [email protected]

MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Embed Size (px)

Citation preview

Page 1: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

MITOGENOMICS MITOGENOMICS AND MOLECULAR AND MOLECULAR PHYLOGENETICS OF FISH BASED ON PHYLOGENETICS OF FISH BASED ON

COMPLETE MITOCHONDRIAL GENOME COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATASEQUENCES DATA

Y.Ph. KartavtsevA.V. Zhirmunsky Institute of Marine Biology of Far Eastern

Branch of Russian Academy of Sciences, Vladivostok 690059, Russia;

Far Eastern Federal University, Vladivostok 690095, Russia;

e-mail: [email protected]

Page 2: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

1. Comparative Mitogenomics: Introduction.1. Comparative Mitogenomics: Introduction.

2. The Review of Literature Data on 2. The Review of Literature Data on pp- Distances. - Distances.

3. 3. Molecular Phylogenetics & DNA Barcoding.Molecular Phylogenetics & DNA Barcoding.

MAIN GOALSMAIN GOALS

Page 3: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

•Mitochondrion DNA (mtDNA) is a ring molecule of 16-18 kilo-base pairs (kbp) in length. As literature data show, mtDNA of all fishes has similar organization (Lee et al., 2001; Kim et al., 2004; Kim et al., 2005; Nagase et al., 2005; Nohara et al., 2005) and small differences among all vertebrate animals, including men (Anderson et al., 1981; Bibb et al., 1981; Wallace, 1992; Kogelnik et al., 2005). •The complete content of whole mitochondrial genome (mitogenome) includes: control region (CR or D loop), where the site of initiation of replication and promoters are located, big (16S) and small (12S) rRNA subunits, 22 tRNA and 13 polypeptide genes, total N=37.

1. COMPARATIVE MITOGENOMICS:1. COMPARATIVE MITOGENOMICS: INTRODUCTION (1)INTRODUCTION (1)

Page 4: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Fig. 1.1. a. Summary for the ANOVA two variable analysis: the plot of frequency distribution for the four nucleotides in torrent catfish, Liobagrus obesus. The difference between ND6 and 12 other protein genes nucleotide composition is illustrated. The total frequencies are shown for all 3 nucleotide positions in codons. Significance is shown on the top of the graph

b. Plot of the average percentage at four nucleotides of flatfish species, order Pleuronectiformes (Groups 1-2) in comparison with representatives of Perciformes (3). Original data were taken from Table 2 for the Cyt-b gene at all three nucleotide positions. On the top results of one-factor MANOVA are given. Groups 1 to 3: 1 – Species from this study; 2 – Species taken from GenBank; 3 – GenBank data on Perciformes (From: Kartavtsev et al., 2007a, 2007b, Gene and Marine Biology).

1. COMPARATIVE MITOGENOMICS: INTRODUCTION (2)1. COMPARATIVE MITOGENOMICS: INTRODUCTION (2)

a b

Page 5: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Fig. 1.2. Fig. 1.2. The The control region (CR, control region (CR, 814 bp after 814 bp after alignment) and alignment) and conserved conserved sequences for the sequences for the bullhead torrent bullhead torrent catfish, catfish, Liobagrus Liobagrus obesusobesus in in comparison with comparison with three other catfish three other catfish speciesspecies.. Aligned Aligned sequence features sequence features and conserved and conserved blocks are shown: blocks are shown: TAS TAS –– strikeover strikeover letters, TATA-box letters, TATA-box –– underlined letters, underlined letters, CSB1 CSB1 –– gray box, gray box, CSB2 CSB2 –– frame, and frame, and SCB3 SCB3 –– double double frame (From: frame (From: Kartavtsev et al., Kartavtsev et al., 2007a). 2007a).

1. COMPARATIVE 1. COMPARATIVE MITOGENOMICS: MITOGENOMICS: INTRODUCTION (3)INTRODUCTION (3)

Page 6: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Table 1.1. Genes of Mitogenome obtained in bullhead torrent catfish, Liobagrus obesus (sample) (From: Kartavtsev et al., 2007a).

Sequence Codon Gene

от до Size (bp)

Start Stop Note +

tRNAPhe 1 70 70 12S rRNA 71 1026 956 tRNAVal 1027 1098 72 16S rRNA 1099 2764 1666 tRNALeu

(UUR) 2765 2839 75

ND1 2840 3810 971 ATG TA- Ins 5 bp, GCGGG tRNAIle 3816 3887 72 Ov 1 bp, T tRNAGln 3887 3957 71 tRNAMet 3957 4025 69 ND2 4026 5070 1045 ATG T-- tRNATrp 5071 5140 70 Ins 2 bp, TA tRNAAla 5143 5211 69 Ins 1 bp, C tRNAAsn 5213 5285 73 Ins 29 bp,

CTTTCCCCGCC GCCCTTAAAAGGCGGGGA

tRNACys 5315 5381 67 +) Ins – Insertion, Ov – Overlap.

1. COMPARATIVE MITOGENOMICS: INTRODUCTION (4)1. COMPARATIVE MITOGENOMICS: INTRODUCTION (4)

Page 7: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

WHAT IS MAIN OUTCOMEWHAT IS MAIN OUTCOME

Mitogenome in vertebrates is very conservative (~16 KB Mitogenome in vertebrates is very conservative (~16 KB long). In invertebrates it is much longer, up to 27-300 KB.long). In invertebrates it is much longer, up to 27-300 KB.

Most conservative is gene order.Most conservative is gene order. Still, there are flexible sections.Still, there are flexible sections. Unknown are nuclear vs. mitogenome exchange and Unknown are nuclear vs. mitogenome exchange and

interaction.interaction. Many genes that known to be mitochondrial (Many genes that known to be mitochondrial (Idh*, Mdh*Idh*, Mdh* etc.) etc.)

are absent in mtDNA.are absent in mtDNA.Contrary to that, some mtDNA genes (Contrary to that, some mtDNA genes (Cyt-b, Co-cCyt-b, Co-c) are active ) are active in nuclear background.in nuclear background.

There are mutations in mtDNA that create human deceases.There are mutations in mtDNA that create human deceases. Some mobile elements might be present in mitogenome. IS Some mobile elements might be present in mitogenome. IS

sections detected.sections detected.

1. COMPARATIVE MITOGENOMICS: INTRODUCTION (5)1. COMPARATIVE MITOGENOMICS: INTRODUCTION (5)

Page 8: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

PHYLOGENETICS & DNA BARCODING PHYLOGENETICS & DNA BARCODING MARKERSMARKERS

Usually in phylogenetic research single gene sequences are usedUsually in phylogenetic research single gene sequences are used for for both mtDNA and nuclear genome. However, recently more and more both mtDNA and nuclear genome. However, recently more and more frequent are become complete mitogenome usage. frequent are become complete mitogenome usage.

Most popular in phylogenetics are sequences ofMost popular in phylogenetics are sequences of cytochrome cytochrome bb ((Cyt-bCyt-b) ) and cytochromeand cytochrome oxidaseoxidase 1 ( 1 (CCо-1о-1)) genes genes, , which used for taxa which used for taxa comparison at the species - familycomparison at the species - family level level ((Johns,Johns, Avise Avise, 1998; , 1998; Hebert et Hebert et alal., 2004; ., 2004; Kartavtsev, LeeKartavtsev, Lee, 2006, 2006; Kartavtsev, 2009; Kartavtsev, 2009). ). Many sequences Many sequences that bringing the phylogenetic signal obtainedthat bringing the phylogenetic signal obtained for different taxafor different taxa at at genegene 16 16S rRNA as wellS rRNA as well. .

Sequences ofSequences of separate genes can have different phseparate genes can have different phylogeneticylogenetic signal signal because of differences in substitution ratesbecause of differences in substitution rates.. This is also true for This is also true for different sections of genes.different sections of genes. Also, under comparison of higher taxa Also, under comparison of higher taxa there may be effects of homoplasy. When numerous taxa available there may be effects of homoplasy. When numerous taxa available there are problemsthere are problems of insufficient information capacity of sequences to of insufficient information capacity of sequences to cover big species diversity and adequate taxa representation is quite cover big species diversity and adequate taxa representation is quite important important ((Hilish et al., 1996Hilish et al., 1996). ). Nevertheless, for the species Nevertheless, for the species identificationidentification, , excluding rare casesexcluding rare cases, , fine results are availablefine results are available even with even with the usage of short target sequencesthe usage of short target sequences, , like like Со-1Со-1,, with with 656500 bpbp..

Page 9: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

SpacersSpacers[[ITS-1, 2ITS-1, 2]]

mtDNAmtDNA

nDNAnDNA,, rDNArDNA

Most substantiated statistically resultsMost substantiated statistically results

Statistically significant resultsStatistically significant results

SpeciesSpecies GenusGenus FamilyFamily OrderOrder ClassClass PhylumPhylum

Applicability of Different DNA Applicability of Different DNA TypesTypes in Phylogenetics and in Phylogenetics and TaxonomyTaxonomy

Page 10: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

33..1.1. DIVERSITY AT DNA MARKERS WITHIN DIVERSITY AT DNA MARKERS WITHIN SPECIES AND IN TAXA OF DIFFERENT RANKSPECIES AND IN TAXA OF DIFFERENT RANK. .

AN ANALYSIS OF EMPIRICAL DATAAN ANALYSIS OF EMPIRICAL DATA

2. THE REVIEW OF LITERATURE DATA 2. THE REVIEW OF LITERATURE DATA ON ON PP- DISTANCES- DISTANCES

Page 11: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

2.1. USING 2.1. USING PP-DISTANCES. SUMMARY-DISTANCES. SUMMARY

• To estimate the actual number of substitutions among sequences X and Y it is necessary to introduce a certain mathematical model.

• At least 8 major models (Nei, Kumar, 2000; Felsenstein, 2004) and 56 in

total are referred in sources nowadays (Posada, 2005;

http://darwin.uvigo.es/software/modeltest.html ).

• Among most simple and known are Jukes, Cantor (1968; JC) and Kimura (1980) two parametric (K2P) models. The late is default in some packages (e.g. PAUP). These models consequently suggest the equality of all kinds of substitutions and non equality for transitions (α) and transversions (β).

• Titles of some other models: Equal-input, Tamura, HKY (Hasigawa-Kishino-Yano), Tamura-Nei (TrN), General time reversible (GTR),

Unrestricted.

Page 12: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

• In the K2P model equilibrium frequencies of 4 nucleotides are 0.25. However, the algorithms suggested for calculations (expected p^ and its variance) here and in Jukes-Cantor model are applicable irrelevant to frequency deviations (Rzhetsky, Nei, 1995). Thus, both models are suitable for wider range of conditions, where real parameters stay unknown. • Be unconfused we should remember that in Kimura’s model ratio of transitions to transversions is R = α / 2β, however many authors and many software using different proportion - k = α / β.

• In our estimates (Kartavtsev, Lee, 2006) most authors using K2P (29%) and many using simple p^ or such measures as HKY, TrN etc. To choose an appropriate model there is a popular program MODELTEST (Posada, Grandal, 1998). Very useful info on model properties and their applicability over wide range of specific data sets may be find in literature (Nei, Kumar, 2000; Hall, 2001; Sanderson, Shaffer, 2002; Felsenstein, 2004).

2.2. USING 2.2. USING PP-DISTANCES. SUMMARY-DISTANCES. SUMMARY

Page 13: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

2.3. 2.3. PP-DISTANCE. SUMMARY-DISTANCE. SUMMARY

• p-Distance and others. p-distance is a fraction of different nucleotides in their total number for a pair of sequences.

• Numerical simulations showed that when p-distances are small, <20%, then any model give similar values of divergence (Fig. 1.1).

• Because of heterogeneity of substitution rates along sequences and

different parts of genes an important correction of p-distance is gamma-correction (e.g. Nei, Kumar, 2000; Felsenstein, 2004).

Fig. 2.1. Estimates of the number of nucleotide substitutions obtained by different distances measures when actual numbers follows TrN-model (From Nei, Kumar, 2000).

Page 14: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Intraspecies diversityIntraspecies diversity

• In our data base for hundred intraspecies р-distances averages comprise at Cyt-b and Co-1: M=1.55± 0.56% and M=0.55± 0.19%, correspondingly (Kartavtsev, Lee, 2006).• Most important thing that I like to stress here is that for many species a stable, geographically restricted intraspecies gatherings have been detected. They are marked by mtDNA genes and obviously there are isolated intraspecies phylogroups existing for many generations, which as real as real are the local stocks that defined by other biological methods. Number of such examples is summarized by Avise & Wolker (1999) and many also presented in our reviews (Kartavtsev, Lee, 2006, Kartavtsev, 2009). Among others may be mentioned bottle-nose dolphin, Tursiops truncatus (Dowlin, Brown, 1993), Canadian gees, Branta canadiensis (Van Wagner, Baker, 1990), fishes, Fundulus heteroclitus and Stizostedion vitreum (Gonzales-Willasefior, Powers, 1990, Billington, Strange, 1990) etc. (Stepien, Faber, 1998).

There are many and variable estimates based on different markers. For instance, two copepod species obtained nucleotide diversity (π) dependent on latitude at rRNA gene of mtDNA. Subarctic species Calanus finmarchicus, π=0.37%, SD = 0.26, was less variable, than temperate water, Nanocalanus minor, π=0.50%, SD = 0.32 (Bucklin, Wiebe, 1998). If focus on Cyt-b and Co-1 sequence diversity, К2Р value at Со-1 at sequence some 600 bp was estimated for 107 intraspecies groups of different species for five families of baterfly (Lepidoptera: Arctidae, Geometridae, Noctuidae, Notodontidae, Sphingidae), as small (Hebert et al., 2002). For average values the variation is within limit: 0.17 – 0.36%. Our recalculation gives average for these groups: К2Р = 0.25 ± 0.04%.

Page 15: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

pp-DISTANCES IN GROUPS OF COMPARISON,-DISTANCES IN GROUPS OF COMPARISON,Review 2Review 2

Fig. 2.2. Plot of distribution of weighted mean p-distances among five groups of Plot of distribution of weighted mean p-distances among five groups of comparison at Cytcomparison at Cyt--bb andand CoCo-1 -1 genes and 21,000 animal speciesgenes and 21,000 animal species.. Groups here: 1. Groups here: 1. Intra-Intra-species, among individuals of the same species; 2. Intra-sibling species + semispecies + species, among individuals of the same species; 2. Intra-sibling species + semispecies + subspecies, 3. Intra-genus, among species of the same genera; 4. Intra-family, among subspecies, 3. Intra-genus, among species of the same genera; 4. Intra-family, among genera of the same family , 5. Intra-oreder, genera of the same family , 5. Intra-oreder, families of the same orderfamilies of the same order (From Kartavtsev, (From Kartavtsev, 2011a, Marine Genomics).2011a, Marine Genomics).

Thus, data available suggest that in general a phyletic evolution prevail in animal world, and so far, the Geographic speciation events (Type 1a) most common in nature.

Do data presented assume that speciation is always follows the Type 1a mode? I guess, no. Few examples below let to support this answer.

Page 16: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

DISTANCE VS TAXA SPLITTING

Mean ±0,95 Conf. Interval 0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8 0,9 1,0 1,1 1,2 1,3

p-dis-tr

-800

-600

-400

-200

0

200

400

600

800

1000

PV

-co

mp

Has punctuation an impact in species origin on molecular level? • Avise, Ayala, 1976; Kartavtsev et al., 1980; current – No.• Pagel et al., 2006 – Yes.

Num

ber

of S

plitt

ings

Transformed p-distance

rs = 0.22, p < 0.05

Fig. 2.3. Plot of p-distance on number of splittings at Cyt-b sequence data for catfishes and flatfishes

Page 17: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

WHAT IS MAIN OUTCOMEWHAT IS MAIN OUTCOME

Distance measure alone is not satisfactory Distance measure alone is not satisfactory descriptor and they should be standartized. descriptor and they should be standartized. Data on intraspecies diversity (heterozygosity) at Data on intraspecies diversity (heterozygosity) at structural genes are necessary.structural genes are necessary.Measures of regulatory genome changes should Measures of regulatory genome changes should be necessary to describe transformative modes be necessary to describe transformative modes of speciation.of speciation.Other descriptors of genomic change are Other descriptors of genomic change are required (e.g. chromoseme number, NF, required (e.g. chromoseme number, NF, transcription rate etc.).transcription rate etc.).

Page 18: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

3. 3. MOLECULAR PHYLOGENETICS MOLECULAR PHYLOGENETICS & DNA BARCODING& DNA BARCODING

Page 19: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Identification + Taxonomy

Historically # 1:Biochemical Systematics.DNA Barcoding

Phylogenetics + Taxonomy

Actually # 1:Basic Value.Dating of Splitting.

4. MOLECULAR PHYLOGENETICS & DNA 4. MOLECULAR PHYLOGENETICS & DNA BARCODINGBARCODING

Page 20: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Fig. 3.1.Fig. 3.1. Maximum likelihood tree of the unambiguous aligned 14,293 nucleotide Maximum likelihood tree of the unambiguous aligned 14,293 nucleotide positions from 31 species/subspecies of Leuciscinae and 9 outgroup species based positions from 31 species/subspecies of Leuciscinae and 9 outgroup species based on heuristic search with the GTR+I+G model. on heuristic search with the GTR+I+G model. 1st codon, 2nd codon, 3rd codon, tRNAs, 1st codon, 2nd codon, 3rd codon, tRNAs, rRNAs were separately analyzed; i.e., partitions were introduced in program settings. Numbers rRNAs were separately analyzed; i.e., partitions were introduced in program settings. Numbers beside internal branches indicate bootstrap probabilities from 1,000 replicates.beside internal branches indicate bootstrap probabilities from 1,000 replicates. The scales in the The scales in the left bottom corners indicate relative branch lengths left bottom corners indicate relative branch lengths (From: Imoto et al., 2013, Gene).(From: Imoto et al., 2013, Gene).

RESULTS (1)RESULTS (1)

Page 21: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Fig. 3.2. Fig. 3.2. Posterior distribution Posterior distribution of divergence times as estimated among Leuciscinae of divergence times as estimated among Leuciscinae species and related species. species and related species. Horizontal bars represent the estimated 95% credibility intervals of Horizontal bars represent the estimated 95% credibility intervals of

divergence times divergence times (From: Imoto et al., 2013, Gene).(From: Imoto et al., 2013, Gene).

RESULTS (2)RESULTS (2)

Page 22: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Fig. 3.3.Fig. 3.3. Rooted consensus (50%) BA-tree, which shows molecular phylogenetic Rooted consensus (50%) BA-tree, which shows molecular phylogenetic interrelationships for 25 mitogenome sequences of 13 structural genes. interrelationships for 25 mitogenome sequences of 13 structural genes. In the nodes In the nodes the posterior BA probabilities for n=10the posterior BA probabilities for n=1066 simulated generations and bootstrap support for ML simulated generations and bootstrap support for ML (n=500), MP and NJ (n=1,000 for both) are shown (BA/ML/MP/NJ).(n=500), MP and NJ (n=1,000 for both) are shown (BA/ML/MP/NJ). BA and ML trees are based BA and ML trees are based on on GTR+I+G model. GTR+I+G model. All structural genes used including ND6, no partitions introduced (From: All structural genes used including ND6, no partitions introduced (From: Kartavtsev et al., 2013, in press)Kartavtsev et al., 2013, in press)..

RESULTS (3)RESULTS (3)

Page 23: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Fig. 3.4.Fig. 3.4. Combined phylogram of Combined phylogram of Co-1 Co-1 andand Cyt-b Cyt-b genes for 59 sequences of our genes for 59 sequences of our and GenBank representatives of and GenBank representatives of GlyptothoraxGlyptothorax catfish jointly with 3 outgroup catfish jointly with 3 outgroup specimens built by BA algorithm. specimens built by BA algorithm. In the nodes the support levels (Probabilities, %) In the nodes the support levels (Probabilities, %) are shown in order: BA, ML, MP, and NJ. Abbreviation 100 x 4 denotes: 100,100,100,100 are shown in order: BA, ML, MP, and NJ. Abbreviation 100 x 4 denotes: 100,100,100,100 and 100 x 3 denotes: 100,100,100 and 100 x 3 denotes: 100,100,100 (From: Singh et al., 2013, submitted).(From: Singh et al., 2013, submitted).

RESULTS (4)RESULTS (4)

Page 24: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Fig. 3.5.Fig. 3.5. Tanglegram built for Tanglegram built for Co-1 Co-1 vs.vs. Cyt-b Cyt-b genes as previously for 59 sequences of our and genes as previously for 59 sequences of our and GenBank representatives of GenBank representatives of GlyptothoraxGlyptothorax catfish jointly with 3 outgroup specimens in two catfish jointly with 3 outgroup specimens in two approaches depicted at A and B. approaches depicted at A and B. At A the consensus tree made by LSA algorithm from 2 single At A the consensus tree made by LSA algorithm from 2 single gene trees is represented.gene trees is represented. At B the consensus tree built by BA algorithm from 2 partitions At B the consensus tree built by BA algorithm from 2 partitions representing sequences of representing sequences of Co-1 Co-1 plusplus Cyt-b Cyt-b is visualized. Lines show common tips defined by the is visualized. Lines show common tips defined by the heuristic algorithm in PP Dendroscope. Theuristic algorithm in PP Dendroscope. The lowest stable ancestor (LSA) technique is used for technique is used for building consensus tree in A building consensus tree in A (From: Singh et al., 2013, submitted).(From: Singh et al., 2013, submitted).

RESULTS (4)RESULTS (4)A B

Page 25: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

THE INTERNATIONAL iBOL PROJECTTHE INTERNATIONAL iBOL PROJECT

•The iBOL&CBOL are main global initiatives. The Fish-BOL, its part, has over 5400 species barcoded by Co-1 from more than 30,000 specimens what makes it unique. P. Hebert and B. Hanner are preparing a $150M grant application for Genome Canada only for 2008. Other nations’ funds in iBOL are also big in some countries and unions: USA, EU.

In this year there will be held 5th world-wide international conference (November 2013, China) and many regional meeting were performed.

Page 26: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

INTERNATIONAL PROJECT, BARCODING OF LIFE, iBOL (1)

• iBOL, CBOL, Fish-BOL. DataBase is 88444 species and 1039413 DNA-barcodes at Co-1 gene (BOLD, 16.11.2010).

• Main Aim. The Whole Biodiversity Description. 1 million 700 are known now (without bacteria). Supposed number is 10 M, i.e. majority are still unknown.

Page 27: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

INTERNATIONAL PROJECT, BARCODING OF LIFE, iBOL (2)

Should be investigated: 29112

Remain for

barcoding:

22572

Barcoded: 6540

1 2

Co-Chair: P. Hebert и B. Ward

Page 28: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

AREA OF INVESTIGATION: START UP

Executing Chair: Youn-Ho Lee Co-Chairs: Yuri Kartavtsev T. Shao, Shunpin He,Keiichi Matsuura, Youn-Ho Lee

РАБОЧАЯ ГРУППА ПО СЕВЕРОВОСТОЧНОЙ АЗИИРАБОЧАЯ ГРУППА ПО СЕВЕРОВОСТОЧНОЙ АЗИИNorth East Asian Regional Working GroupNorth East Asian Regional Working Group

Afr

ica

Au

stra

liaE

uro

pe

Ind

iaM

eso

Am

eri

caN

ort

h A

me

rica

NE

Asi

aO

cea

nia

&A

nta

rc

So

uth

Am

eri

ca

SE

Asi

a

BarcodedSpecies No0

2000400060008000

1000012000

Barcoding Progress in Regions

Barcoded

Remaining

Species No

Should be investigated (4,5,18,61): 9483

Remainfor Barcoding:

9096

Barcoded 387

1 2

Page 29: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

OUR OWN RESULTS ON FISHOUR OWN RESULTS ON FISH

Some objectsSome objects: (: (11)) Opisthocentrus ocellatusOpisthocentrus ocellatus, , (2)(2) Ammodytes hexapterus 306a (Ammodytidae)(Ammodytidae), (3) , (3) Mioxocephalus brandtiiMioxocephalus brandtii, , ((44) ) Hemitripterus villosus Hemitripterus villosus (Cottidae)(Cottidae) (Images(Images:: 1-2 1-2 - - O. Rutenko, 4O. Rutenko, 4 - - A. A.

Sokolovsky)Sokolovsky)..

1

3 4

2

Page 30: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

SAMPLE OF DATABASESAMPLE OF DATABASE: : M. brandtiM. brandti

Page 31: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

SAMPLE OF DATABASESAMPLE OF DATABASE: : M. brandtiM. brandti

Page 32: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

FISH-BOL. RUSSIA DEVELOPMENTFISH-BOL. RUSSIA DEVELOPMENT

Page 33: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

THANKS FOR ATTENTIONTHANKS FOR ATTENTION!!

GRANT SUPPORT: GRANT SUPPORT: FEB RAS FEB RAS 12-I-OBN-07, 12-II-СО-06-017, and KPFI 12-06-002.

Page 34: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

SPECIATION MODES (SM): POPULATION GENETIC SPECIATION MODES (SM): POPULATION GENETIC VIEWVIEW

• ABSENCE OF QUANTITATIVE THEORY OF SPECIATION (QTS)We concluded that the speciation theory in evolutionary genetics is absent in exact scientific meaning, which expects the ability to predict future by the theory. In this case this is to predict species origin, or at least discriminate among several speciation modes on the basis of some quantitative parameters or their empirical estimates. Attempts made in this direction (Avise, Wollenberg, 1997, Templeton, 1998) do not fit the above criteria. That is why we attempted to step in the discrimination of the speciation modes on the basis of main population genetic measurements available in literature, and that may be laid in the frame of a genetic speciation concept. BASEMENT FOR THE QTSAs a basis for the set of evolutionary genetic concepts we used verbal descriptions of speciation modes made by Templeton (1981). As a result the classification scheme for 7 different modes of speciation was created (Fig. 3.3). This approach leads to quite simple experimental scheme that permits: (i) to arrange further investigation of speciation in different groups of organisms, and (ii) to derive analytical relations for each speciation mode (Fig. 3.4). EMPIRICAL QTS TESTING The scheme was tested for Cyprinids (Kartavtsev et al., 2002) and explains well our own earlier data on salmons (Kartavtsev, Mamontov, 1983, Kartavtsev et al., 1983). Certainly, both the testing of the scheme presented, and its theoretic background must be further developed.

Page 35: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Fig. 3.6.Fig. 3.6. SPECIATION MODES (SM): POPULATION SPECIATION MODES (SM): POPULATION GENETIC VIEWGENETIC VIEW ((From:From: KartavtsevKartavtsev, , 2011a, Marine Genomics2011a, Marine Genomics))

Page 36: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

Fig. 3.4. ANALITICAL DESCRIPTION OF SEVEN TYPES OF SPECIATION MODES

Fig. 11. Analytic representation of seven speciation mode (From Kartavtsev, 2009 with modifications). D1–D3, divergent speciation modes; T1–T4, transformative (transilience) speciation modes. Descriptors: D, genetic distances for structural gene; DT, pT: in putative parental taxon; DS, pS: among conspecific demes; DD, pD: among subspecies or sibling species; HD, D: mean heterozygosity/diversity in putative daughter population; HP, P: mean heterozygosity/diversity in putative parental population; EP: divergence at regulatory genes in putative parental taxon; ED: divergence at regulatory genes in putative daughter taxon; TM+: test for modification (positive); TM–: test for modification (negative).

1 (S) {(DT > DS) (ED = EP) (HD = HP) (pT > pS) (D = P) TM-} (D1)

2 (S) {(DT > DS) (ED = EP) (HD = HP) (pT > pS) (D = P) TM-} (D2)

3 (S) {(DT = DS) (ED = EP) (HD = HP) (pT = pS) (D <= P) TM-} (D3)

4 (S) {(DT > DS) (ED = EP) (HD = HP) (pT > pS) (D = P) TM-} (T1)

5 (S) {(DT > DS) (ED = EP) (HD = HP) (pT > pS) (D = P) TM-} (T2)

6 (S) {(DT > DS) (ED = EP) (HD = HP) (pT > pS) (D = P) TM-} (T3)

7 (S) {(DT > DS) (ED = EP) (HD = HP) (pT > pS) (D = P) TM-} (T4)

Page 37: MITOGENOMICS AND MOLECULAR PHYLOGENETICS OF FISH BASED ON COMPLETE MITOCHONDRIAL GENOME SEQUENCES DATA Y.Ph. Kartavtsev A.V. Zhirmunsky Institute of Marine

SummarySummary• Algorithms of nucleotide diversity estimates and other measures of genetic divergence for the

two genes Cyt-b (cytochrome b) and Co-1 (cytochrome oxidase 1) are analyzed. Based on the theory and algorithms of distance estimates on DNA sequences, as well as on the observed distance values retrieved from literature, it is recommended for realistic tree building to use a specific nucleotide substitution model from at least 56 available. Using a database of p-distances and similar measures gathered from published sources and GenBank sequences, genetic divergence of populations (1) and taxa of different rank, such as subspecies, semispecies or/and sibling species (2), species within a genus (3), species from different genera within a family (4), and species from separate families within an order (5) have been compared.

• Distance data for 20,731 species reveal various and increasing levels of genetic divergence of the sequences of the two genes, Cyt-b and Co-1 (cytochrome b, and cytochrome c oxidase subunit 1), in the five groups compared. Mean unweighted scores of p-distances for five groups are: Cyt-b (1) 1.38±0.30, (2) 5.10±0.91, (3) 10.31±0.93, (4) 17.86±1.36, (5) 26.36±3.88 and Co-1 (1) 0.89±0.16, (2) 3.78±1.18, (3) 11.06±0.53, (4) 16.60±0.69, (5) 20.57±0.40. The estimates show good correspondence with other analyses.

• Differences in divergence between the genes themselves at the five hierarchical levels were also found. This conforms to the ample evidence showing different and nonuniform evolution rates of these and other genes and their various regions. The results of the analysis of the nucleotide divergence within species and higher taxa of animals are (1) in a good agreement with previous results and showed the stability of a general trend, and (2) suggest that a phyletic evolution in animals prevails at the molecular level, and speciation mainly corresponds to the geographic mode (type D1) that well fails in a frame of the Biological Species Concept (BSC). The prevalence of the D1 speciation mode and the BSC does not mean that other modes are absent and concepts are invalid. It is shown how we can recognize speciation modes formally with the operational genetic criteria.