84
Examining patterns of genetic variation in Canadian marine molluscs through DNA barcodes by Kara Layton A Thesis presented to The University of Guelph In partial fulfilment of requirements for the degree of Master of Science in Integrative Biology Guelph, Ontario, Canada © Kara Layton, January, 2012

Examining patterns of genetic ... - University of Guelph

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Examining patterns of genetic ... - University of Guelph

Examining patterns of genetic variation in Canadian marine molluscs through

DNA barcodes

by

Kara Layton

A Thesis

presented to

The University of Guelph

In partial fulfilment of requirements

for the degree of

Master of Science

in

Integrative Biology

Guelph, Ontario, Canada

© Kara Layton, January, 2012

Page 2: Examining patterns of genetic ... - University of Guelph

ABSTRACT

Examining patterns of genetic variation in Canadian marine molluscs through

DNA barcodes

Kara Layton Advisor:

University of Guelph, 2013 Professor P.D.N Hebert

In this thesis I investigate patterns of sequence variation at the COI gene in Canadian marine

molluscs. The research presented begins the construction of a DNA barcode reference library for this

phylum, presenting records for nearly 25% of the Canadian fauna. This work confirms that the COI gene

region is an effective tool for delineating species of marine molluscs and for revealing overlooked

species. This study also discovered a link between GC content and sequence divergence between

congeneric species. I also provide a detailed analysis of population structure in two bivalves with similar

larval development and dispersal potential, exploring how Canada’s extensive glacial history has shaped

genetic structure. Both bivalve species show evidence for cryptic taxa and particularly high genetic

diversity in populations from the northeast Pacific. These results have implications for the utility of DNA

barcoding both for documenting biodiversity and broadening our understanding of biogeographic patterns

in Holarctic species.

Page 3: Examining patterns of genetic ... - University of Guelph

iii

Acknowledgements

Firstly, I would like to thank my advisor Dr. Paul Hebert for providing endless guidance and

support during my program and for greatly improving my research. You always encouraged my

participation in field collections and conferences, allowing many opportunities to connect with colleagues

and present my research to the scientific community. I am also incredibly grateful for the invaluable

feedback and input I received from my committee members, Dr. Elizabeth Boulding and Dr. André

Martel. Your enthusiasm for this project always kept me motivated and I thank you for teaching me all

about the wonderful world of malacology.

I would like to thank the Natural Sciences and Engineering Research Council of Canada

(NSERC), the International Barcode of Life project and Genome Canada through the Ontario Genomics

Institute for funding this research. I am also grateful to Aboriginal Affairs and Northern Development

Canada for providing me with a Northern Scientific Training Program grant that aided in field collections

in Churchill, Manitoba. Lastly, I thank the Canadian Centre for DNA Barcoding (CCDB) for help in

sequence acquisition.

Research teams at the Kasitsna Bay Lab, the Churchill Northern Studies Centre and the

Huntsman Marine Science Centre provided logistical support for field collections and I am extremely

grateful for their help. I would also like to extend my gratitude to Sarah Hardy, Katrin Iken, Suzanne

Dufour, Barry McDonald, Robert Frank, Nicholas Jeffrey and Paolo Pierrossi for aid in specimen

collections.

A sincere thank you to all graduate students in the Hebert, Adamowicz, Smith, Crease, Hajibabaei

and Gregory labs for providing input into my project and contributing greatly to my happiness during this

program. I especially want to thank Christy Carr for her advice, patience and compassion- you truly are a

fantastic scientist. Finally, I need to thank my close friends and family, particularly my parents and Ryan,

who not only provided unconditional love but have shaped the person I am today.

Page 4: Examining patterns of genetic ... - University of Guelph

iv

Table of Contents

Abstract……………………………………………………………………………………………………ii

Acknowledgements……………………………………………………………………………………….iii

List of Tables……………………………………………………………………………………………...vi

List of Figures……………………………………………………………………………………………vii

General Introduction………………………………………………………………………………...........1

Chapter 1: Patterns of DNA barcode variation in Canadian marine molluscs………………….........3

Abstract…………………………………………………………………………………………….........3

Introduction……………………………………………………………………………………………...4

Methods…………………………………………………………………………………………….........5

Specimen collection and data scrutiny………………………………………………………………..5

DNA extraction, amplification and sequencing……………………………………………………….5

Data analysis…………………………………………………………………………………………..6

Results…………………………………………………………………………………………………..7

Sequence recovery…………………………………………………………………………….............7

COI variation in molluscs……………………………………………………………………………..7

Variation in nucleotide composition…………………………………………………………………..8

Distribution of indels…………………………………………………………………………………..8

Discussion………………………………………………………………………………………………8

Sequencing success in Mollusca………………………………………………………………………8

Patterns of sequence variation…………………………………………………………………….......9

Insertions and deletions in COI……………………………………………………………………...10

Patterns of nucleotide composition…………………………………………………………………..11

Conclusions…………………………………………………………………………………………..12

Chapter 2: Geographic patterns of mtCOI diversity in two species of

Canadian marine bivalves……………………………………………………………………………23

Abstract………………………………………………………………………………………………..23

Introduction……………………………………………………………………………………………24

Methods………………………………………………………………………………………………..25

Specimen collection…………………………………………………………………………………..25

DNA extraction, amplification and sequencing……………………………………………………...26

Data analysis…………………………………………………………………………………………26

Results…………………………………………………………………………………………………27

Sequence recovery and haplotype diversity………………………………………………………….27

Page 5: Examining patterns of genetic ... - University of Guelph

v

Patterns of genetic diversity…………………………………………………………………………28

Population structure…………………………………………………………………………………29

Discussion……………………………………………………………………………………………..30

Comparing diversity and structure in two bivalves with planktotrophic larval development……....30

Implications for glacial refugia in the northeast Pacific……………………………………………30

Evidence of sibling species…………………………………………………………………………..31

Conclusions…………………………………………………………………………………………..32

General Conclusions……………………………………………………………………………………..44

Summary of findings…………………………………………………………………………………44

The application of a DNA barcode library for Canadian marine molluscs………………………....44

Gene flow in marine populations…………………………………………………………………….45

Conclusions and implications for conservation……………………………………………………...46

Literature Cited………………………………………………………………………………………….47

Appendix A: Specimen Preservation…...……………………………………………………………....56

Appendix B: Species Identifications…………………………………………………………………....57

Appendix C: Chapter 1 Supplementary Material……………………………………………………..61

Appendix D: Chapter 2 Supplementary Material……………………………………………………..65

Appendix E: R Code……………………………………………………………………………………..74

Page 6: Examining patterns of genetic ... - University of Guelph

vi

List of Tables

Chapter 1 ………………………………………………………………………………………………...13

Table 1.1. Parameter settings for each OTU algorithm………………….………...………………….13

Table 1.2. Percent success in recovery of a COI sequence …………....……………………………..13

Table 1.3. The number of COI sequences and BINs, intraspecific and nearest neighbour

distances and mean GC content for each of 33 orders………………………………………………...14

Table 1.4. Mean intraspecific divergence, number of genetic clusters, number of

individuals sampled and locality information for each potential cryptic species complex

in this study……………………………………………………………………………………………15

Table 1.5. Intra and interspecific distances (K2P) for taxonomic groups examined by

DNA barcoding in previous literature…………………………………………………...……………15

Chapter 2…………………………………………………………………………………………………33

Table 2.1. Genetic diversity in populations of the bivalve species, Hiatella arctica and

Macoma balthica……………………………………………………………………………………..33

Table 2.2. Overall genetic structure measured by AMOVA for Hiatella arctica…...………………..34

Table 2.3. Overall genetic structure measured by AMOVA for Macoma balthica balthica…...…….34

Table 2.4. Overall genetic structure measured by AMOVA for ATL Macoma balthica…..………...35

Table 2.5. FST for populations of Hiatella arctica, Macoma balthica balthica and

ATL Macoma balthica……………………………………………………………………………….36

Appendix B……………………………………………………………………………………………….57

Table B.1. References for species identifications………………...…………………………………58

Appendix C………………………..…………………………………………………………………...…61

Table C.1. List of COI primers used for molecular techniques in Chapter 1…………..……………61

Table C.2. List of GenBank specimens used for analysis in Chapter 1……………………………...61

Appendix D……………………………………………………………………………….………………65

Table D.1. Detailed collection information for all 172 Hiatella arctica specimens…………………65

Table D.2. Detailed collection information for all 196 Macoma balthica specimens.........................67

Page 7: Examining patterns of genetic ... - University of Guelph

vii

List of Figures

Chapter 1…………………………………………………………………………………………………16

Figure 1.1. Sampling locations and the number of specimens examined in this study……...…….…16

Figure 1.2. Rarefaction curves for the five classes of Canadian marine mollusc

represented in this study……………...……………………………………………………………….17

Figure 1.3. Mean intraspecific divergences (K2P) and nearest neighbour distances

for all specimens in this study………………...………………………………………………………18

Figure 1.4. Maximum and mean intraspecific divergences plotted against the number

of individuals analyzed for 157 species……………………………………………………………….18

Figure 1.5. Box plots comparing mean nearest neighbour distance with the

number of species sampled from each genus with ≥ 2 representative species………………………..19

Figure 1.6. Neighbour-joining trees (K2P), with locality information, for 9 cryptic

species complexes in this study……………………………………………………………………….20

Figure 1.7. Mean nearest neighbour distance (K2P) plotted against mean GC

content for the 33 orders of Mollusca represented in this study………………………………………21

Figure 1.8. Secondary structure of COI marked with insertions and deletions for

gastropods and bivalves……………………………………………………………………………….22

Chapter 2…………………………………………………………………………………………………37

Figure 2.1. Collection sites for H. arctica and M. balthica…………………………………………..37

Figure 2.2. Intraspecific sequence divergence (K2P) for H. arctica and M. balthica……………......38

Figure 2.3. Neighbour-joining tree based on K2P distances for H. arctica. The top scale

shows estimated divergence time in millions of years while the bottom scale bar shows

sequence divergence…………………………………………………………………………………..39

Figure 2.4. Neighbour-joining tree based on K2P distances for M. balthica. Subspecies names

suggested by Väinölä (2003) are provided. The top scale shows estimated divergence time in millions

of years while the bottom scale bar shows sequence divergence........................................................40

Figure 2.5. Median-joining haplotype networks for H. arctica and M. balthica constructed

with maximum parsimony…………………………………………………………………………….41

Figure 2.6. FST values (Slatkin’s linearized) plotted against geographic distance for

populations within the main lineage of H. arctica…………………………………………………….42

Figure 2.7. FST values (Slatkin’s linearized) plotted against geographic distance for

populations of M. balthica balthica…………………………………………………………………...42

Figure 2.8. FST values (Slatkin’s linearized) plotted against geographic distance for

populations of ATL M. balthica………………………………………………………………………43

Page 8: Examining patterns of genetic ... - University of Guelph

viii

Appendix B……………………………………………………………………………………………….57

Figure B.1. Hinge dentition in bivalves………………………………………………………...…….59

Figure B.2. View of girdle scales on Tonicella marmorea and Tonicella rubra…………......……...60

Appendix C………………………………………………………………………………………………61

Figure C.1. Neighbour-joining tree (K2P) for all barcoded specimens……………………………...62

Appendix D……………………………………………………………………………………………….65

Figure D.1. Calculations for DXY, Tajima’s D, FST and Bray Curtis index……………….………….70

Figure D.2. Bray Curtis similarity values between populations of Hiatella arctica…………………71

Figure D.3. Bray Curtis similarity values between populations of Macoma balthica…….…………72

Figure D.4. Images of hinge dentition in each of the four cryptic lineages of Hiatella arctica……..73

Page 9: Examining patterns of genetic ... - University of Guelph

1

General Introduction

The discovery and quantification of marine biodiversity is crucial for evaluating ecosystem

structure and interactions, factors of growing importance in the face of climate change (Archambault et al.

2010). Species discovery also forms a basis for conservation and restoration of marine biodiversity

(Snelgrove 2010). Traditional approaches to species discrimination have relied solely on morphological

traits and often resulted in cryptic species going undetected (Hebert et al. 2003). In addition, the

identification of specimens was often impossible because of life stage diversity or damage to diagnostic

traits (Hebert et al. 2003, Carr et al. 2010, Radulovici et al. 2010). These facts make clear the need to

integrate molecular data into species delineation techniques. The use of the cytochrome c oxidase subunit

1 (COI) gene as a genetic barcode for animal taxa has created a standardized approach for species

delineation (Hebert et al. 2003). DNA barcoding is based on the observation that sequence divergences

among species are generally much greater than those within species (Hebert et al. 2003). DNA barcoding

studies typically employ distance-based methods to calculate intra and interspecific divergences, gaining

insight into sequence variation both within species and among congeners (Zou et al. 2011). Prior studies

have shown that a sequence divergence threshold of 2% is effective in delineating many animal species

(Hebert et al. 2003, Witt et al. 2006, Kerr et al. 2009). Moreover, such analysis often highlights cases of

cryptic diversity, particularly in marine species with broad ranges (Knowlton 2000, Carr et al. 2010,

Bucklin et al. 2011). The frequent discovery of distinct genetic clusters in past investigation on species

with Holarctic distributions suggests that genetic studies are a critical element if one wishes to accurately

quantify marine biodiversity (Knowlton 2000, Carr et al. 2010, Bucklin et al. 2011).The extension of

knowledge on biodiversity in Canada’s marine ecosystems is critical because of stressors originating from

climate change, eutrophication, acidification, overfishing and pollution (Archambault et al. 2010). In

addition, human-mediated dispersal, linked to increased vessel traffic, has resulted in invasive species

replacing the natural marine fauna (Carlton 1999, Bax et al. 2003, Archambault et al. 2010). These

invasive species not only threaten environmental functions, but hurt the economy through impacts on

commercially important fish and invertebrate species.

My thesis extends knowledge of Canadian species diversity in the most diverse marine phylum,

the Mollusca (Bouchet et al. 2002). This phylum is not only speciose, but ubiquitous as its member

species occupy diverse habitats and posses varied life history strategies. This study not only uses COI for

delineating species, but also integrates these results with traditional taxonomic methods to provide a

multi-dimensional approach to species identification. Exploring genetic variation in marine molluscs

provides insight into diversity patterns in Canadian oceans and will aid future conservation strategies.

Page 10: Examining patterns of genetic ... - University of Guelph

2

Chapter 1 reports progress in the assembly of a barcode reference library for Canada’s marine

molluscs. This chapter explores patterns of sequence variation, both within and between species, and also

provides a detailed analysis of variation in GC content across molluscan orders. In addition, it probes taxa

with deep intraspecific divergences with a view towards the revelation of potential cryptic complexes

while also providing comparative analyses of species identification techniques. Chapter 2 involves a

focused examination of sequence variation and population structure in two bivalve species with

planktonic larval development and a Holarctic distribution. This study was motivated by recent evidence

for deep sequence divergence and phylogeographic partitioning in other marine species which were once

thought to possess broad ranges (Knowlton 2000, Carr et al. 2010, Bucklin et al. 2011). Furthermore, it

has been shown that patterns of population structure can differ in species with similar life history

strategies, a result that likely reflects their differential responses to the glacial cycles that have impacted

Canada’s coasts for much of the last two million years (Bernatchez & Wilson 1998). In fact, repeated

glaciations have played a key role in shaping contemporary patterns of species distribution and population

divergence in Canadian waters (Bernatchez & Wilson 1998, Wares & Cunningham 2001).

This thesis demonstrates that COI is a useful tool for analyzing patterns of genetic variation in

marine molluscs, on local and regional scales. It is also useful for highlighting cases of deep intraspecific

divergence that may signal cryptic species. In this study, COI was successful in clarifying patterns of

intraspecific variation in two bivalve species, providing evidence for unrecognized species in both taxa,

and identifying locations of potential glacial refugia. This work highlights the need for greater effort at

species documentation, not only to further understanding of marine biodiversity, but also to aid

conservation and the implementation of marine protected areas.

Page 11: Examining patterns of genetic ... - University of Guelph

3

Chapter 1

Patterns of DNA Barcode Variation in Canadian Marine Molluscs

Abstract

A 648 base pair segment of the cytochrome c oxidase subunit 1 gene has proven useful for the

identification and discovery of species in many animal lineages. This study begins the assembly of a

comprehensive barcode reference library for Canadian marine molluscs, examining patterns of sequence

variation in 234 morphospecies, about 25% of the marine mollusc fauna in Canada. These taxa showed a

mean intraspecific sequence divergence of 0.51%, while congenerics showed a mean divergence of

13.5%. Nine cases of deep (>2%) intraspecific divergence were detected, suggesting possible overlooked

species. Structural variation was detected in the barcode region with indels in 38 species, most (71%) in

bivalves. GC content varied from 32 – 43% and there was a significant positive correlation between GC

content and nearest neighbour distance.

Page 12: Examining patterns of genetic ... - University of Guelph

4

Introduction

DNA barcoding employs sequence diversity in a 648 base pair region of the cytochrome c

oxidase subunit 1 (COI) gene to distinguish species (Hebert et al. 2003, Kerr et al. 2009, Carr et al. 2010).

Past work has shown that sequence divergences are generally much greater between than within species

(Hebert et al. 2003). Because of this fact, DNA barcoding aids both the identification of known species

and the discovery of overlooked taxa (Witt et al. 2006). The latter application has revealed that the

incidence of sibling species is often high enough to lead to serious inaccuracies in estimates of

biodiversity (Knowlton 2000, Carr et al. 2010). In light of this, it is increasingly recognized that

molecular approaches need to be incorporated into biodiversity surveys. DNA barcoding is a particularly

useful tool for groups with high diversity, especially those which have seen little taxonomic attention.

Although marine molluscs have been the subject of considerable research, the number of species in

Canadian waters remains uncertain with estimates ranging from 700 to 1200. In part, this uncertainty

reflects taxonomic problems linked to the fact that molluscs are the most diverse phylum of marine life

with more than 50,000 described species (Bouchet 2006). In addition, molluscs exhibit complex larval

stages, frequent cryptic taxa and substantial phenotypic plasticity, all factors that impede morphological

approaches to species identification (Drent et al. 2004, Marko & Moran 2009). In fact, the differing

juvenile stages of molluscs have sometimes been treated as distinct species generating inaccurate

estimates of biodiversity and geographic ranges (Johnson et al. 2008). The extreme phenotypic plasticity

in shell morphology which is common in molluscs poses additional problems for taxonomy (Johnson et

al. 2008, Zu et al. 2011). Because traditional morphological approaches to identification confront so many

challenges in molluscs, it is imperative to integrate molecular diagnostics into this process.

Several prior studies have validated the efficacy of DNA barcoding in the discrimination of

mollusc species, but most of this work has targeted a particular order or family. For instance, Meyer and

Paulay (2005) presented a detailed study of barcode diversity in cowries (Family: Cypraeidae),

demonstrating the general effectiveness of the approach, but showing that a fixed sequence threshold

could not be used for species diagnosis. They concluded that DNA barcoding was a powerful aid for

mollusc identification when paired with strong taxonomic validation and comprehensive sampling. More

recent studies have extended these results by establishing the value of DNA barcoding in resolving

cryptic species complexes in the molluscan families Muricidae, Thyasiriidae, Yoldiidae, Nuculidae and

Lepetodrilidae (Mikkelsen et al. 2007, Johnson et al. 2008, Zou et al. 2012). While Zou et al. (2011)

found that distance-based analyses were less effective than those based on characters, the former

approach successfully delineated 40 Chinese neogastropod species.

Despite the demonstrated utility of DNA barcoding in molluscs, no study has aimed to assemble a

comprehensive barcode registry for a large geographic region. The present investigation addresses this

Page 13: Examining patterns of genetic ... - University of Guelph

5

deficit, beginning the construction of a DNA barcode reference library for Canadian marine molluscs. It

compares intra and interspecific divergences among 33 molluscan orders with prior values, and also

examines the utility of different approaches for the designation of OTUs (Operational Taxonomic Units)

based on the analysis of sequence variation at COI. This study also investigates variation in nucleotide

composition among molluscs and how this property impacts levels of genetic divergence. Finally,

insertions and deletions in the COI region are analyzed for all molluscs examined in this study.

Methods

Specimen collection and data scrutiny

A total of 2471 specimens were collected from 2007 to 2012 at sites across Canada (Figure 1.1).

Figure 1.2 provides rarefaction curves for each class as a measure of sampling efficiency. Specimen

details, sequences and trace files are available on BOLD (www.boldsystems.org, Ratnasingham & Hebert

2007), while the specimens are held at the Biodiversity Institute of Ontario. Typically five specimens per

species were collected from intertidal or subtidal habitats using plankton nets, small dredges and SCUBA

diving, but samples from the Beaufort Sea were collected from deep subtidal soft-bottom habitats using

an Agassiz trawl. Specimens were immediately fixed in 90-100% ethanol, with regular replacement of

ethanol to prevent its dilution. During fixation, the opercula of gastropods and the shells of bivalves were

separated to ensure preservation of internal tissues. After each collecting trip, specimens were placed in

fresh 95% ethanol and stored at -20C. When possible, specimens were identified to a species level with

name usage following the World Register of Marine Species (WoRMS). Approximately 3% of the

barcoded specimens could not be identified to a species-level because they were immature, but they were

assigned to a genus and an interim species.

DNA extraction, amplification and sequencing

Doubly uniparental inheritance (DUI) has been found in some bivalve groups and is characterized

by the transmission of a maternal and paternal mitochondrial lineage through eggs and sperm,

respectively (Ghiselli et al. 2012). While DUI can cause deep divergences between male and female

conspecifics avoiding gonadal tissue during sampling can resolve this problem given that male somatic

tissue is still dominated by the female genome (Passamonti & Ghiselli 2009, Zouros 2012). In turn, DNA

extracts were prepared from a small sample of muscle tissue from each specimen. Tissue samples were

placed in cetyltrimethylammonium bromide (CTAB) lysis buffer solution with proteinase K and

incubated for 12 hours at 56°C. DNA was then extracted using a manual glass fibre plate method

(Ivanova et al. 2008). After incubation, the DNA was eluted with 40 µl of ddH2O. After re-suspension, 2

µl of each DNA extract was placed into a well into another additional plate and 18 µl of ddH2O was

added to dilute salts or mucopolysaccharides that might inhibit PCR. Three primer sets were employed to

maximize amplicon recovery (dgLCO1490/dgHCO2198, LCO1490_t1/HCO2198_t1 and

Page 14: Examining patterns of genetic ... - University of Guelph

6

BivF4_t1/BivR1_t1). The primer set that generated an amplicon for a particular specimen, and the primer

sequences are available on BOLD. A primer cocktail (C_LepFolF/C_LepFolR) was used in a second

round of PCR for specimens that failed to amplify in the first round of PCR. Each well was filled with 2

µl of diluted DNA and the following reagents were added to total a 12.5 µl PCR reaction: 6.25 µl 10%

trehalose, 2 µl ddH20, 1.25 µl 10× PCR buffer, 0.625 µl MgCl2 (50 mM), 0.125 µl of each forward and

reverse primer (10 µM), 0.0625 µl dNTP (10 mM) and 0.06 µl Platinum Taq polymerase. The

thermocycling regime consisted of one cycle of 1 min at 94°C, 40 cycles of 40 s at 94°C, 40 s at 52°C,

and 1 min at 72°C, and finally 5 min at 72°C. E-Gels (Invitrogen) were used to screen for amplification

success and all positive reactions were bidirectionally sequenced using BigDye v3.1 on an ABI 3730xl

DNA Analyzer (Applied Biosystems). Sequences were manually edited using CodonCode Aligner and

aligned both by eye in MEGA5 and through the BOLD aligner algorithm (CodonCode Corporation,

Tamura et al. 2011). MEGA5 was also used to assess the prevalence and location of insertions and

deletions (indels) (Tamura et al. 2011). Sequences containing more than 1% ambiguities, stop codons,

double peaks or that were shorter than 220 bp were removed from further analysis. Sequencing success

was assessed for this study and a Pearson’s Chi-Squared test was used to determine whether significant

differences existed between sequence recovery in each class.

Data analysis

A Kimura-2-parameter (K2P) distance model was employed in MEGA5 to construct a neighbour-

joining (NJ) tree which served as a preliminary basis for species recognition (Kimura 1980, Tamura et al.

2011). Genetic distances, including intra and interspecific divergence along with nearest-neighbour

distance, were calculated with the K2P distance model (Kimura 1980) and overall data were compared

using the ‘Distance Summary’ and ‘Barcode Gap Analysis’ tools on BOLD (Ratnasingham & Hebert

2007). Maximum intraspecific divergence was plotted against nearest neighbour (NN) distance to

determine how often NN distances surpassed intraspecific divergences, ultimately indicating the presence

of a barcode gap. In addition, the ‘Sequence Composition’ tool on BOLD was used to examine variation

in GC content among species in each order (Ratnasingham & Hebert 2007). Species numbers were

determined by two approaches: i) morphology, through the examination of shell characters and soft tissue,

and ii) through the analysis of sequence divergence patterns at COI to ascertain the number of sequence

clusters present. The morphological approach involved the use of the national mollusc collection

deposited at the Canadian Museum of Nature in Gatineau, Québec. The latter approach employed four

algorithms designed for this purpose - Barcode Index Number (BIN) (Ratnasingham and Hebert 2013),

Automated Barcode Gap Discovery (ABGD) (Pulliandre et al. 2011), Clustering 16S rRNA for OTU

Prediction (CROP) (Hao et al. 2011) and jMOTU (Jones et al. 2011). The BIN algorithm only analyzed

Page 15: Examining patterns of genetic ... - University of Guelph

7

sequences greater than 500 bp in length while the other three algorithms examined all sequences greater

than 400 bp. Parameter settings for each OTU algorithm can be found in Table 1.1.

Various packages in Revolution R were used to analyze levels of sequence variation. The Picante

and VEGAN packages were used to generate rarefaction curves to assess sampling effort for each class of

mollusc (Dixon 2003, Kembel et al. 2010). These software packages were also used to perform linear

regressions to determine if the number of individuals sampled within a species impacted values of

intraspecific divergence (Dixon 2003, Kembel et al. 2010). P-values less than 0.05 were considered

significant. Finally, these packages were used to generate a box plot for comparing mean nearest

neighbour distance between genera with two or more species sampled (Dixon 2003, Kembel et al. 2010).

For box plot analysis, significance was determined from an analysis of variance (ANOVA). A chi-square

test of homogeneity was used to determine whether nucleotide frequency was homogeneous among

classes. Species with intraspecific divergences greater than 2% were treated as potential cryptic

complexes. Genetic variation, both within and between species, as well as the number of individuals and

genetic clusters, was analyzed for each cryptic species. Lastly, the boot and Hmisc packages were used to

test whether mean nearest neighbour distance was correlated with mean GC content in molluscan orders

(Harrell Miscellaneous 2012).

Results

Sequence recovery

Among the 2471 marine mollusc specimens analyzed, 1334 COI sequences were recovered from

234 morphospecies. The LCO1490_t1 and HCO2198_t1 primer set, along with a 1:10 dilution of DNA

and an annealing temperature of 52°C for amplification, generated the highest success in sequence

recovery. Success rates were significantly different among chitons, bivalves and gastropods (p <.0001)

(Table 1.2). Sequences ranged in length from 223 to 658 bp, but 88% were greater than 600bp. Table 1.3

displays the number of sequences and species examined in the 5 classes and 33 orders represented in this

study, along with measures of intra and interspecific divergence and GC content. The overall mean

values for intra and interspecific divergence were 0.51% and 13.5% respectively. Values of maximum

intraspecific divergence ranged from 0 to 30.58%.

COI variation in marine molluscs

Morphological study indicated the presence of 234 species; 77 were represented by a single

specimen while the other 157 species had an average of 8 specimens (range 2-67). All but one of these

species had one or more sequence records >400 bp in length. A barcode gap was present for the majority

of species in this study with exceptional cases likely reflecting cryptic complexes (Fig 1.3). Algorithms

for OTU determination generated estimates of 242 (BIN), 255(jMOTU), 270(CROP) and 271(ABGD).

However, 72 specimens representing 27 morphologically identified species lack BIN assignments because

Page 16: Examining patterns of genetic ... - University of Guelph

8

their sequence records were <500bp. The BIN, jMOTU, CROP and ABGD algorithms generated 71, 97,

100 and 114 singletons, respectively. Values for intraspecific variation, both maximum and mean, were

not significantly associated with the number of individuals analyzed (Figure 1.4; p = 0.71, p = 0.40).

Furthermore, mean nearest neighbour distances were not correlated with the number of species analyzed

from a genus (Figure 1.5; p=0.07). Lastly, nine species in this study demonstrated deep intraspecific

divergences that were greater than 2%. Table 1.4 provides values of mean intraspecific divergence, the

number of individuals and clusters and locality information detailing the distribution of each cryptic

species across Canadian oceans. Figure 1.6 provides neighbour-joining trees (K2P) for each of the nine

cases where deep intraspecific divergence was detected.

Variation in nucleotide composition

Mean GC content averaged 37.1% for all species (range 32-43%). A chi-square test of homogeneity

demonstrated that nucleotide frequencies were not identical among species in each of the five molluscan

classes (p <0.001). Mean nearest neighbour distances appeared positively correlated with mean GC content,

with a correlation coefficient of 0.51, suggesting that as GC content increases across orders so does the

genetic distance between congeneric species (Figure 1.7; p=0.002).

Distribution of indels

Indels were only detected in two of the five classes, Bivalvia and Gastropoda, but they occurred

in nearly half (47%) of the bivalve species versus just 9% of the gastropods. Indels were detected in 27

bivalve species involving representatives of 12 families, and in 11 species of gastropods from 4 families.

Indels were conserved in 7 of the 10 bivalve families, but varied between genera in 2 families and

between species in 1 family. Indels were conserved in 3 of the 4 gastropod families but varied between

genera in the Lottidae. In bivalves, 1 deletion was observed in Cyclocardia borealis, Mactromeris

polynyma, Saxidomus gigantea and in all six Tellinidae species while 2-3 deletions occurred in

Delectopecten greenlandicus, Astarte montagui and both Crassostrea species. Moreover, 1 insertion

occurred in all species of Myidae, Mytilidae, Arcidae and Glycymerididae, while three insertions

occurred in all species of Thyasiridae. Finally, 1 insertion and 3 deletions occurred in Astarte borealis

while 2 insertions and 3 deletions occurred in Cyclocardia crassidens. In gastropods, all five Lottia

species had one insertion and all Pyramidellidae (Boonea cf. bisuturalis, Odostomia sp.1, Odostomia sp.

2) and Onchidiidae (Onchidella borealis and Onchidella cf. carpenteri) species had one deletion, while

Limacina helicina had four deletions. Indels ranged in length from 3 to 18 nucleotides in bivalves and

from 3 to 12 nucleotides in gastropods. All indels were in multiples of 3 nucleotides, suggesting they did

not derive from pseudogenes. Indels were mapped onto the secondary structure of COI and often

appeared in proximity to external loops (Figure 1.8).

Page 17: Examining patterns of genetic ... - University of Guelph

9

Discussion

Sequencing success in Mollusca

Most of the sequences recovered in this study were greater than 600 bp, but some were as short as

250 bp, reflecting the use of internal primers. This study employed multiple rounds of PCR, tested different

primer cocktails and modified PCR regimes to minimize contamination and pseudogene recovery. These

optimization studies revealed that a 1:10 dilution of DNA, coupled with tailed Folmer primers and a PCR

regime with an annealing temperature of 52°C, generated the highest success. This protocol usually

produced single amplicons that lacked evidence of pseudogene amplification. Mucopolysaccharides are

present in many mollusc groups and because they are not always removed during DNA extraction, they can

interfere with DNA polymerase, reducing PCR amplification success. Because only half of the specimens

generated an amplicon, significant challenges in barcode recovery remain. No barcode sequences were

recovered from six species (Anomia simplex, Astarte crenata, Astarte moerchi, Littorina scutulata,

Notoacmea testudinalis and Argopecten irradians). Sequencing success significantly differed between

classes, with polyplacophorans delivering the highest success (82.8%) and bivalves the lowest success

(32.9%). Future work should focus on the development of primer sets that target specific lineages to

generate greater success in sequence recovery. Despite the challenges with some mollusc groups, this study

still presents sequence records for nearly 25% of the Canadian marine mollusc fauna.

Patterns of sequence variation

Considering all 156 morphospecies represented by two or more records, Canadian marine molluscs

showed a mean intraspecific divergence of 0.51%, a value lower than that reported for echinoderms (0.62%)

but higher than in polychaetes (0.38%), marine fishes (0.39%) and decapods (0.46%)(Ward et al. 2005,

Costa et al. 2007, Ward et al. 2008, Carr et al. 2010). The relatively high intraspecific divergence in

Canadian molluscs likely reflects, at least in part, the impact of overlooked species. For instance, a mean

intraspecific divergence of 16.8% was detected for Tachyrhynchus erosus in this study, while Sun et al.

(2011) found a far lower mean intraspecific divergence (0.44%) for species in this order (Caenogastropoda).

When the nine cases of deep sequence divergence detected in this study were excluded, mean intraspecific

divergence dropped to 0.42%. In any case, there was little overlap between intra and interspecific

divergence (Figure 1.3), suggesting that DNA barcoding is very effective in delineating marine mollusc

species.

Levels of sequence variation in this study differed greatly from some other recent studies (Table

1.5). While interspecific divergences were consistently high among taxa, some cases of high intraspecific

divergence have been reported (Table 1.5). This difference may reflect the fact that many prior studies

focused on a single genus or family while this study involved a phylum-wide analysis, potentially allowing

for greater coverage of sequence variation in the former. However, the high intraspecific values in the

Page 18: Examining patterns of genetic ... - University of Guelph

10

literature may also reflect misidentifications which would inflate values of within species divergence.

However, the present study does confirm the particularly high interspecific variation in the Vetigastropoda

(Table 1.5)(Meyer & Paulay 2005). Members of this order are a diverse group of gastropods that inhabit

diverse marine environments from the shallow intertidal to deep-sea hydrothermal vents. Their success in

invading such a variety of habitats may be facilitated by a high evolutionary rate that is reflected in high

levels of sequence divergence between congeneric taxa (Colgan et al. 1999).

This study revealed nine taxa in which intraspecific divergences were greater than 2% (Table 1.4;

Figure 1.6). Prior work has established that deep mtDNA divergences do really exist in some mollusc

species, such as the land snail, Cepaea nemoralis, where distances reach 12.9% (Thomaz et al. 1996).

However, in most other cases, deep divergences, especially those involving allopatric lineages, are now

thought to represent different species. For example, the deep divergences in populations of the bipolar

pteropods Limacina helicina and Clione limacina are now viewed as evidence for separate species in the

Arctic and Antarctic (van der Spoel & Dadon 1999, Hunt et al. 2010, Jennings et al. 2010). This study

extends the earlier conclusion, suggesting the possible presence of two Clione species in the Arctic Ocean

although their divergences are far lower than the major Antarctic/Arctic lineages (Table 1.4; Figure 1.6).

Glaciation in Canada has played a key role in shaping the genetic structure of contemporary populations and

may be responsible for the segregation and differentiation of lineages on opposing coasts (Hewitt 1996,

Bernatchez & Wilson 1998, Wares & Cunningham 2001, Maggs et al. 2008, Dapporto 2009). For example,

the deeply divergent lineages of Hiatella arctica, Macoma balthica and Tachyrhynchus erosus detected in

this study may represent sibling species whose origin is linked to isolation in glacial refugia (Table 1.4;

Figure 1.6, Layton unpublished). Moreover, the two distinct clusters of Mya truncata and Mya arenaria

found in this study may include one of three cryptic species of Mya recorded from the Arctic Ocean

(Peterson 1999). In any case, deep intraspecific divergences often flag overlooked species in a lineage (Witt

et al. 2006). For instance, DNA barcoding unveiled five cryptic species complexes in the Leptodrilidae, a

family of limpets inhabiting deep-sea hydrothermal vents (Johnson et al. 2008). Similarly, COI analysis

unveiled that the cold-seep bivalve species, Acesta bullisi, should be separated into two species (Järnegren et

al. 2007). Not only have these discoveries been made in molluscs, but DNA barcoding has revealed

overlooked diversity in numerous marine taxa, including fishes of Pacific Canada and asteroid species of

southeast Australia (Naughton & O’Hara 2009, Steinke et al. 2009a).These results highlight the need for

integrating molecular approaches into species identifications. Future work should focus on evaluating

population structure in these potential cryptic cases and include a more detailed examination of genetic

variants.

Insertions and deletions in COI

Page 19: Examining patterns of genetic ... - University of Guelph

11

This study has demonstrated that indels are considerably more prevalent in bivalves than in other

molluscan classes. Interestingly, high rates of nucleotide substitution occur in lineages containing indels

(Tian et al. 2008). For instance, a high mutational load was observed in oysters (Crassostrea) and mussels

(Mytilus), genera for which three deletions and one insertion were detected in this study (Figure 9)

(Hedgecock et al. 2004, An and Lee 2012). In fact, Vetsigian and Goldenfeld (2005) found that indels

stimulate diversification fronts in the genome and over a short time can facilitate sequence divergence

between populations. Moreover, the discovery of a single amino acid deletion in the Heterostropha and

Pulmonata groups corroborates findings from Grande et al. (2004) which suggest this deletion may either be

due to convergence, as a result of a length constraint, or several deletions arising in some pulmonates and

the Heterstropha (Figure 9). Most indels in bivalves and gastropods occurred in close proximity to the first

loop, a pattern that Remigio and Hebert (2003) also observed in Heterobranchia and Patellogastropoda

(Figure 9). Furthermore, the presence of a single amino acid insertion in the Lottidae (Patellogastropoda)

and four amino acid deletions in Limacina helicina may be correlated to accelerated rates of substitution in

these groups (Remigio and Hebert 2003). Four bivalve orders demonstrated a single insertion event at

position 37 (Figure 9). This insertion likely arose independently in each order as it was not possessed by all

descendents of the monophyletic group (Plazzi et al. 2011). Interestingly, Mikkelsen et al. (2007) discovered

that Thyasira specimens harboured 3 to 4 additional codons in the COI gene, a pattern corroborated by our

finding of a three amino acid insertion in all Thyasiridae specimens. Together with prior work, the present

study has established that insertions and deletions in the COI gene are relatively common in molluscs,

suggesting that future work should aim to determine the functional relevance of this sequence variation as

well as its implications for rates of molecular evolution. Finally, indel patterns are useful for inferring

phylogenetic relationships, a particularly important application for a phylum which often faces taxonomic

scrutiny.

Patterns of nucleotide composition

After examining the frequency of each nucleotide (A, T, G and C) in all sequences, and grouping

these sequences by class, heterogeneity was apparent in nucleotide frequencies between each class.

Differences in nucleotide composition can provide insight into ratios of nonsynonymous to synonymous

substitutions and may also be linked to extrinsic factors (Albu et al. 2008). For instance, Dixon et al. (1992)

found a positive correlation between rDNA melting temperature and environmental temperature in

hydrothermal-vent polychaetes. Dixon et al. (1992) interpreted this finding to mean that organisms living in

more extreme conditions will have a higher GC content. While Wu et al. (2012) recognize that

environmental factors may impact GC content, they suggest that GC content is likely governed by more

intrinsic controls, particularly mutator genes. We also discovered a moderate, positive correlation between

GC content and sequence divergence between congeneric taxa, although unequal sampling between orders

Page 20: Examining patterns of genetic ... - University of Guelph

12

may have contributed to this finding. Future work should aim to remove sampling bias and incorporate

phylogenetic analyses to further probe this relationship. Examining GC content in a lineage can also provide

insight into whole-genome patterns. For instance, Clare et al. (2008) found that sequence composition

patterns in COI are likely reflective of patterns in the entire mitochondrial genome, potentially having

implications for rates of mitochondrial evolution. Moreover, Wu et al. (2012) found that increased genome

size in bacteria is based on an increase in GC content of the genome. In light of these findings, future work

should focus on comparing GC content between taxa for which ecological and genomic data are available.

Conclusions

There is a pressing need to gain a more detailed understanding of Canadian biodiversity. Although

barcode records are available for some groups, Canadian marine molluscs have, in the past, been the subject

of very little attention with regards to molecular taxonomy. The present study begins to address this gap,

providing barcode coverage for 25% of the Canadian fauna and establishing the effectiveness of this

approach in delineating species of marine molluscs. Because 30% of the species in this study were

represented by a single specimen, future studies should extend sample sizes for such cases to confirm that

they do not contain cryptic species. This study has revealed that DNA barcoding is not only useful for

documenting biodiversity, but also for unveiling patterns of genetic variation and sequence composition

across a broad taxonomic group on a large geographic scale.

Page 21: Examining patterns of genetic ... - University of Guelph

13

Table 1.1. Parameter settings for each OTU algorithm.

OTU Algorithm Parameter Settings

jMOTU 2% threshold = 13 bp differences

low BLAST identity filter = 99

sequence alignment overlap = 60% of min. sequence length

CROP l 0.3 -u 0.5 (1%)

l 0.6 -u 1 (2%)

s (3%)

l 1.2 -u 2 (4%)

g (5%)

ABGD Pmin = 0.0006

Pmax = 0.17

Table 1.2. Percent success in recovery of a COI sequence for specimens (n=1964) in three classes of

Mollusca.

Class Specimens Processed Sequencing Success

Bivalvia 586 32.9%

Gastropoda 1256 42.5%

Polyplacophora 122 82.8%

Page 22: Examining patterns of genetic ... - University of Guelph

14

Table 1.3. The number of COI sequences and BINs, intraspecific and nearest neighbour distances and

mean GC content for each of 33 orders. *denotes no BIN(s) assigned.

Class Order Sequences BINs

mean intra (%

K2P) (SE)

mean inter (%

K2P) (SE)

mean GC

(%) (SE)

Bivalvia Arcoida 5 2 0.26 (0.03) 0 (0) 37.3 (0.7)

Carditoida 4 2 0.11 (0.03) 58.2 (0.07) 35.0 (0.8)

Euheterodonta 42 10 3.5 (0.04) 19.8 (0.03) 36.3 (0.3)

Myoida 2 0* 0.47 0 (0) 40.5

Mytiloida 147 8 0.39 (0) 17.5 (0) 37.5 (0.1)

Nuculanoida 38 8 0.20 (0.002) 31.8 (0.9) 39.4 (0.5)

Ostreoida 21 3 0.29 (0.009) 25.7 (0.008) 39.0 (0.3)

Pectinoida 10 1 0.07 (0.004) 0 (0) 42.6 (0.6)

Pholadomyoida 3 2 0 (0) 0 (0) 38.5 (0.2)

Veneroida 112 20 0.4 (0.001) 19.3 (0.004) 35.5 (0.3)

Cephalopoda Decapodiformes 12 2 0.09 (0.002) 8.0 (0.2) 35.0 (0.2)

Myopsida 3 1 0.06 (0.03) 0 (0) 39.2 (0.3)

Octopoda 2 2 0 (0) 0 (0) 32.5

Oegopsida 22 2 0.65 (0.003) 0 (0) 37.8 (0.1)

Gastropoda Archaeogastropoda 55 11 0.12 (0.002) 21.0 (0.02) 39.4 (0.3)

Caenogastropoda 7 3 16.8 (0.7) 0 (0) 39.9 (3.2)

Cephalaspidea 28 6 0.65 (0.007) 11.8 (0.04) 31.9 (0.3)

Docoglossa 10 1 0.15 (0.004) 0 (0) 41.8 (0.09)

Gymnosomata 17 2 2.2 (0.01) 0 (0) 37.6 (0.1)

Heterostropha 4 3 0.31 (0) 21.5 (0) 35.8 (1.0)

Littorinimorpha 223 23 0.43 (0.01) 9.6 (0.06) 37.4 (0.1)

Mesogastropoda 1 1 0 (0) 0 (0) 39.1

Neogastropoda 162 40 0.23 (0.001) 8.4 (0.001) 35.3 (0.1)

Nudibranchia 107 29 0.63 (0.1) 20.2(0.3) 37.2 (0.2)

Opisthobranchia 12 4 0.24 (0.009) 0 (0) 34.0 (1.3)

Patellogastropoda 52 6 0.17 (0.001) 23.9 (0.004) 38.4 (0.6)

Pulmonata 27 3 0.57 (0.003) 11.1 (0.01) 34.6 (0.03)

Sorbeoconcha 19 4 0.45 (0.009) 19.3 (0.03) 38.2 (1.5)

Vetigastropoda 17 6 0.11 (0.004) 39.0 (0.4) 39.9 (0.8)

Polyplacophora Chitonida 115 15 0.54 (0) 15.4 (0.003) 36.6 (0.2)

Neoloricata 29 7 1.1 (0.007) 16.8 (0.007) 36.7 (0.3)

Scaphopoda Dentaliida 20 11 7.2 (0.8) 14.6 (0.2) 32.3 (0.5)

Gadilida 6 4 0.22 (0.1) 0 (0) 35.8 (0.9)

Page 23: Examining patterns of genetic ... - University of Guelph

15

Table 1.4. Mean intraspecific divergence (% K2P), number of genetic clusters, number of individuals

sampled and locality information for each potential cryptic species complex in this study. *(Layton

unpublished)

Table 1.5. Intra and interspecific distances (% K2P) for taxonomic groups examined by DNA barcoding in

previous literature. These values are compared to values obtained from similar taxa investigated in this

study.

Species N Mean Intra Clusters Cluster Locality

Clione limacina 17 2.15 2 2 Arctic

Mya arenaria 13 2.45 2 1 Pacific/Atlantic, 1 Pacific

Cryptonatica affinis 8 2.47 2 2 Arctic

Hiatella arctica* 172 3.9 4 1 Tri-oceanic, 2 Pacific, 1 Atlantic

Mya truncata 6 4.41 2 2 Arctic

Macoma balthica* 163 7.7 3 1 Pacific/Arctic, 2 NW Atlantic

Thyasira gouldi 7 10.04 2 2 Atlantic

Tachyrhynchus erosus 7 16.81 3 2 Arctic, 1 Atlantic

Triopha catalinae 3 17.99 2 2 Pacific

Group

This Study:

Mean intra

Mean inter

Literature:

Mean intra

Mean inter

Citation

Muricidae 0.12 6.5-11.4 0.4 6.7-25.2 Zou et al. 2012

Littorinimorpha1

0.43 9.6 0.81 - Meyer & Paulay

2005

Turbinidae 0.07 - 0.18 - Meyer & Paulay

2005

Lottidae 0.17 23.9 0.25 - Meyer & Paulay

2005

Vetigastropoda2

0-0.34 38.3-39.7 0.10-1.34 3.01-31.25 Johnson et al.

2008

Neogastropoda 0.23 8.4 0.64 8.1 Zou et al. 2011 1 Cypraeidae used in literature but no Canadian representatives of this family 2 Lepetodrilidae used in literature but no representatives collected in this study

Page 24: Examining patterns of genetic ... - University of Guelph

16

Figure 1.1. Sampling locations and the number of specimens examined in this study. Sequences obtained

from GenBank are not included as they lack locality information.

Page 25: Examining patterns of genetic ... - University of Guelph

17

Figure 1.2. Rarefaction curves for the five classes of Canadian marine mollusc represented in this study.

Plot A provides curves for Bivalvia and Gastropoda while plot B provides curves for Cephalopoda,

Polyplacophora and Scaphopoda.

A)

B)

Page 26: Examining patterns of genetic ... - University of Guelph

18

Figure 1.3. Maximum intraspecific divergence plotted against nearest neighbour distance for all species in

this study. All data points falling above the 1:1 line indicate a barcode gap is present for these species.

Figure 1.4. Maximum and mean intraspecific divergences (% K2P) plotted against the number of

individuals analyzed for 157 species. The regression between sample size and both maximum and mean

divergences were insignificant (p = 0.71, p = 0.4, respectively).

Page 27: Examining patterns of genetic ... - University of Guelph

19

Figure 1.5. Box plots comparing mean nearest neighbour distance (% K2P) with the number of species

sampled from each genus with ≥ 2 representative species (N=43). The ANOVA was insignificant (p =

0.069). Four morphospecies were excluded because they lack genus-level identification.

Page 28: Examining patterns of genetic ... - University of Guelph

20

Figure 1.6. Neighbour-joining trees (K2P), with locality information, for 9 cryptic species complexes in

this study. NJ trees are coloured blue for bivalves and red for gastropods and triangles represent

compressed clades, with sample size provided in brackets. *(Layton unpublished).

A) Mya truncata B) Cryptonatica affinis

C) Mya arenaria D) Clione limacina

E) Tachyrhynchus erosus F) Thyasira gouldi

G) Triopha catalinae

H) Macoma balthica * I) Hiatella arctica *

Page 29: Examining patterns of genetic ... - University of Guelph

21

Figure 1.7. Mean nearest neighbour distance (% K2P) plotted against mean GC content (%) for the 33

orders of Mollusca represented in this study.

Page 30: Examining patterns of genetic ... - University of Guelph

22

Figure 1.8. Secondary structure of COI marked with insertions and deletions for A) gastropods and B)

bivalves. Insertions are marked with a blue + sign and deletions are marked with a red x.

B)

A)

Page 31: Examining patterns of genetic ... - University of Guelph

23

Chapter 2

Geographic Patterns of mtCOI Diversity in Two Species of Canadian Marine Bivalves

Abstract

Variation in modes of larval development has strong impacts on dispersal potential and gene flow

among populations of marine invertebrates. However, Pleistocene glaciations have also played an

important role in shaping population structure in benthic taxa in the northern hemisphere, even those with

planktotrophic larvae. This study examines patterns of COI sequence divergence in two bivalve species,

Hiatella arctica and Macoma balthica, which share a similar mode of larval development

(planktotrophic) and a vast distribution in the Nearctic. This study reveals that both species possess high

genetic diversity in the northeast Pacific, but H. arctica has less phylogeographic structure and more

sequence variation across its range than M. balthica. Three North American lineages of M. balthica were

detected, corroborating a recent taxonomic revision. This study also provides the first evidence that H.

arctica may include four species. Ecological differences between these species have likely played a role

in their differing biogeographical patterns.

Page 32: Examining patterns of genetic ... - University of Guelph

24

Introduction

Life history attributes, vicariance events and environmental tolerances all play a role in

determining where species occur (Reid 1990, Hewitt 2000). In fact, contemporary patterns of population

structure in the northern hemisphere can only be understood by considering dispersal capacity and past

glacial events and how these factors impacted population isolation and differentiation (Wares &

Cunningham 2001). Patterns of population structure in Canadian marine benthic invertebrates are

particularly interesting (Meehan 1985) because many species rely on planktonic larvae for long-range

dispersal (Meehan 1985, Lee & Boulding 2009). Some larvae spend weeks in the plankton where passive

dispersal by oceanic currents can cause enough gene flow to ensure near panmixis across the species

range (Jablonski 1986, Bradbury et al. 2008, Keever et al. 2009, Lee & Boulding 2009). Although

planktotrophic species generally demonstrate less regional genetic divergence than congeneric species

lacking a planktonic stage (Jablonski 1986, Bradbury et al. 2008, Keever et al. 2009, Lee & Boulding

2009), substantial population structure can still occur in some species with planktonic larvae. For

instance, Macoma balthica shows clear genetic divergence between populations in the northwest and

northeast Atlantic (Väinölä 2003, Nikula et al. 2007). Cases such as this indicate that spatial homogeneity

does not inevitably accompany planktonic larval stages.

Glacial periods had a dramatic impact on Canadian marine environments. During the last

(Wisconsin) glacial maxima, global sea levels declined by up to 170 metres and the Arctic Ocean was

covered by persistent ice. The Atlantic coast of Canada was heavily impacted by the Laurentide ice sheet,

but the Cordilleran ice sheet on the west coast produced lesser impacts (Bernatchez & Wilson 1998,

Rohling et al. 1998, Hewitt 2000, Mandryk et al. 2001, Marko 2004). The recurrent opening and closing

of the Bering Strait linked to glacial cycles acted as a secondary factor in causing intermittent isolation

and exchange of species between the Pacific and Atlantic Oceans (Vermeij 1991, Taylor & Dodson 1994,

Dodson et al. 2007). While range expansions following the last deglaciation are responsible for current

distributions, prior episodes of glacial activity undoubtedly shaped both levels and patterns of genetic

variation (Bernatchez & Wilson 1998). During each glacial advance, species retreated to refugia and then

expanded their range during the subsequent interglacial (Vermeij et al. 1990, Hewitt 2000). For example,

studies of mitochondrial DNA diversity in two marine species with planktonic larvae - the fish, Mallotus

villosus, and the sea urchin, Strongylocentrotus droebachiensis, demonstrated their persistence in glacial

refugia in the northwest Atlantic and northeast Pacific (Addison & Hart 2005, Dodson et al. 2007).

Because each glacial advance tended to fragment species distributions in a consistent way, populations

were effectively separated for prolonged periods, setting the stage for their differentiation (Hewitt 1996,

Maggs et al. 2008, Dapporto 2009). As species expanded their range during interglacials, divergent

lineages re-established contact at sites along Canada’s coasts (Harper & Hart 2007). Despite their shared

Page 33: Examining patterns of genetic ... - University of Guelph

25

exposure to these environmental changes, species with similar life history attributes, but differing

ecological preferences, have undoubtedly responded differently to glacial cycles (Bernatchez & Wilson

1998). Thus, glacial cycles have significantly impacted contemporary genetic structure and these patterns

vary with dispersal potential and ecological tolerance.

Past studies have provided insights into how glacial cycles have shaped contemporary patterns of

genetic variation in some marine taxa, but little work has been directed toward molluscs. The scope for

investigation is vast with regards to marine molluscs as nearly 1000 species occur across Canada’s three

oceans. However, of all the marine molluscs occurring in Canada, fewer than 25 have distributions

spanning all three oceans. Species in this subset are ideal candidates for investigations of population

structure and this study targets two wide-ranging bivalves, Hiatella arctica and Macoma balthica. No

prior study has examined the genetic structure of H. arctica, a species with a Holarctic distribution

(Alison & Marincovich 1982, Schneider & Kaim 2012) and a long planktonic larval phase that lasts

several weeks (Lebour 1938). Adults of Hiatella display considerable phenotypic plasticity, its shell

growth being impacted differentially depending on the substrate or microhabitat in which the bivalve

grows, making identifications in this genus particularly difficult (Alison & Marincovich 1982). The

second species examined in this study, M. balthica, also has a Holarctic distribution (Gofas 2012) and a

long-lived planktonic larval stage that extends for 1-2 months, allowing for extensive offspring dispersal

along coasts. This species has seen prior genetic analysis which revealed considerable differentiation

across its range. In fact, Väinölä (2003) concluded that M. balthica includes two subspecies - M. balthica

balthica from the northeast Pacific and Baltic Sea and M. balthica rubra from Europe.

This study had the primary goal of comparing the extent and patterns of genetic structure in

Canadian populations of H. arctica and M. balthica, although European populations of M. balthica were

also considered. If component populations of these two species were repeatedly isolated into separate

glacial refugia, then this should be reflected in their current population structure and possibly by the

occurrence of two or more lineages at sites of secondary contact. Alternatively, if certain populations

were extirpated during glaciations, then patterns of reduced genetic diversity should reflect localized

extinction and subsequent postglacial recolonization.

Methods

Specimen collection

From 4-60 specimens of M. balthica and H. arctica were collected at various sites in the Pacific

Ocean (British Columbia, Alaska), the Arctic Ocean (Manitoba) and the Atlantic Ocean (New Brunswick,

Nova Scotia, Prince Edward Island, Newfoundland) between 2007 and 2011 (Figure 2.1). Specimens

were obtained from rock crevices, algal mats, holdfasts and soft bottoms in the intertidal and from

subtidal habitats using otter trawls, dredges and SCUBA diving. Specimens were transferred into 95%

Page 34: Examining patterns of genetic ... - University of Guelph

26

ethanol immediately after collection to ensure tissue preservation. Morphological identification of all

specimens was confirmed by Dr. André L. Martel at the Canadian Museum of Nature, while subspecies

designations for M. balthica follow Väinölä (2003).

DNA extraction, amplification and sequencing

Doubly uniparental inheritance (DUI), characterized by the transmission of a maternal and

paternal mitochondrial lineage through eggs and sperm, can cause deep divergences between male and

female conspecifics (Passamonti & Ghiselli 2009, Ghiselli et al. 2012). Despite the presence of this

genetic phenomenon in some mollusc groups, somatic tissue is dominated by the female genome and thus

sampling only this tissue type will avoid problems posed by DUI (Zouros 2012). In turn, DNA extracts

were prepared from a small sample of adductor muscle tissue from each specimen. Tissue samples were

placed in cetyltrimethylammonium bromide (CTAB) lysis buffer solution with proteinase K. The samples

were then incubated for 12 hours at 56°C before a manual glass fibre plate method was used for DNA

extraction (Ivanova et al. 2008). Following incubation, the DNA was eluted with 40 µl of ddH2O. After

re-suspension, 2 µl of each DNA extract was placed into a well in a separate plate with 18 µl ddH2O to

ensure the dilution of salts or mucopolysaccharides that might inhibit PCR. Species-specific primer sets

were used: HiaF1/HiaR1: AAGTTGTAATCATCGAGATATTGG and

TAGACTTCTGGGTGCCCGAAAAACCA for H. arctica and MMacF1/LepR1:

CTTTTATTAGCTGCACCTGATAT and TAAACTTCTGGATGTCCAAAAAATCA for M. balthica.

Each well was filled with 2 µl of diluted DNA and the following reagents to generate a 12.5 µl PCR

reaction mix: 6.25 µl 10% trehalose, 2 µl ddH20, 1.25 µl 10× PCR buffer, 0.625 µl MgCl2 (50 mM),

0.125 µl of each forward and reverse primer (10 µM), 0.0625 µl dNTP (10 mM) and 0.06 µl Platinum

Taq polymerase. The thermocycling regime consisted of one cycle of 1 min at 94°C, 40 cycles of 40 s at

94°C, 40 s at 52°C, and 1 min at 72°C, and finally 5 min at 72°C. An E-GelH 96 (Invitrogen) was used to

check 3 µL of each PCR product and reactions that generated an amplicon were bidirectionally sequenced

using BigDye v3.1 on an ABI 3730xl DNA Analyzer (Applied Biosystems). Sequences were edited

manually using CodonCode (CodonCode Corporation) and were aligned by eye in MEGA5 (Tamura et al.

2011). The COI gene was amplified from 172 and 196 specimens of H. arctica and M. balthica,

respectively.

Data analysis

Neighbour-joining (NJ) trees were constructed in MEGA5 using the Kimura-2-parameter (K2P)

distance model and 1000 bootstrap replicates (Kimura 1980, Saitou & Nei 1987, Tamura et al. 2011). The

H. arctica and M. balthica NJ trees were rooted with Mya arenaria and Macoma inquinata as outgroups,

respectively. Each sequence cluster showing more than 2% divergence from other clusters was labelled.

Clustering patterns in each NJ tree were compared with those in the corresponding median-joining

Page 35: Examining patterns of genetic ... - University of Guelph

27

haplotype network. These networks are based on maximum parsimony and were constructed in Network

4.6.1 (fluxus-engineering.com, Bandelt et al. 1999). Haplotype networks were then recreated in TCS 1.21

to identify ancestral haplotypes (Clement et al. 2000). Divergence times were estimated in MEGA5

assuming a substitution rate of 2% per million years (Hellberg & Vacquier 1999, Marko 2002, Donald et

al. 2005, Tamura et al. 2011).

Arlequin 3.5 (Excoffier & Lischer 2010) was employed to examine patterns of genetic structure

in each species. The haplotype and nucleotide diversity for each population was calculated, and Tajima’s

D test of neutrality with 10,000 simulated samples was used to infer the nature of selection pressures (Nei

1987, Tajima 1989). The number of haplotypes in each species was determined and the proportion of

unique haplotypes was quantified to ascertain which site possessed the highest genetic exclusivity. An

analysis of molecular variance (AMOVA) was conducted with a K2P distance model and significance

was tested with 1000 permutations (Kimura 1980). The AMOVA results were used to determine whether

the majority of genetic variation in each species existed within or between populations as a measure of

population differentiation (Excoffier & Lischer 2010). Fixation indices (FST) were estimated with a K2P

distance model and significance was tested with 100 permutations to determine the partitioning of

variance (Weir & Cockerham 1984). An FST value of 1 indicates that two populations share no haplotypes

while a value of 0 indicates all haplotypes are shared and have similar frequencies (Weir & Cockerham

1984). Slatkin’s linearized FST (Slatkin 1993) was subsequently plotted against geographic distance to

determine whether genetic variation among populations reflected long-term historical divergence or

geographic distance (Slatkin 1993, Kyle & Boulding 2000, Marko 2004, Keever et al. 2009). In order to

test the significance of isolation by distance, a Mantel test with 1000 permutations was conducted in

Arlequin 3.5 (p-values < 0.05 were treated as significant).

Results

Sequence recovery and haplotype diversity

The 196 COI sequences from M. balthica ranged in length from 377 - 655 bp (mean = 429 bp),

while all 172 sequences from H. arctica were 572 bp. All sequences contained less than 1% ambiguous

bases and none possessed stop codons or double peaks. The variable length of the M. balthica sequences

reflected the use of internal primers to recover sequences from some specimens with degraded DNA. For

the analysis of population structure, sequences from H. arctica were trimmed to 572 bp, while sequences

of the Pacific/Arctic M. balthica balthica and the northwest Atlantic (ATL) M. balthica were trimmed to

430 bp and 378 bp, respectively. The highest intraspecific divergences (23%) were observed in H.

arctica, while intraspecific divergences in the M. balthica complex peaked at 12.5% (Figure 2.2). NJ trees

indicated the presence of four lineages in both species, but one of the lineages of M. balthica was only

present in Europe (Figure 2.3 & 2.4).

Page 36: Examining patterns of genetic ... - University of Guelph

28

Most sequences (155 of 172) and haplotypes (63 of 75) of H. arctica fell into a dominant cluster

which was collected at all sites, although the Alaska population had the greatest number of unique

haplotypes (27). All 63 members of this clade showed low divergence - just 1 to 3 mutational steps

(Figure 2.3; Figure 2.5A). The remaining 12 haplotypes included representatives of three probable cryptic

species, two in the Pacific (Cook Inlet, Alaska and Barkley Sound, British Columbia) and a third in New

Brunswick (Figure 2.3; Figure 2.5A). TCS suggested that a northwest Atlantic haplotype was ancestral to

the dominant cluster with a projected migration route into Hudson Bay (Churchill, MB) and then the

northeast Pacific (Alaska, British Columbia)(Figure 2.5A). Subsequent analysis of intraspecific variation

in H. arctica only examined specimens belonging to the dominant clade.

M. balthica also showed high intraspecific divergences, but this variation showed a clear

geographic pattern with up to 12.5% divergence between M. balthica balthica and ATL M. balthica

populations (Figure 2.4). In total, 42 haplotypes of M. balthica were observed with the greatest number

(14) in Europe. This high diversity likely reflects the analysis of larger sample sizes in Europe although

only one representative of each European haplotype was submitted to GenBank (Luttikhuizen et al. 2003,

Nikula et al. 2007, Becquet et al. 2012). These sequences were excluded from analysis in Arlequin

because they would have distorted estimates of haplotype diversity. Interestingly, the northeast Pacific

population grouped with both the Churchill and Baltic Sea populations, while the other European

sequences formed a distinct cluster (M. balthica rubra) that appeared more similar to M. balthica balthica

than to ATL M. balthica (Figure 2.5B). Populations from the northwest Atlantic only included members

of the ATL M. balthica cluster, except those from Newfoundland which included some representatives of

the M. balthica balthica cluster (Figure 2.5B). In fact, the median-joining network suggested that an

ancestral haplotype for the M. balthica balthica cluster was present in Newfoundland (Figure 2.5B).

Because the divergence between clusters 1 and 2 of M. balthica was only slightly greater than 3%, they

were considered as a single taxon for analysis in Arlequin (Figure 2.5B). Regional divergence was

obvious in the ATL M. balthica and M. balthica rubra lineages while M. balthica balthica was more

broadly distributed, occurring in the northeast Pacific, Arctic and Baltic Sea (Figure 2.4; Figure 2.5B).

This divergence is corroborated by the high number of mutational steps that separate each cluster of M.

balthica (Figure 2.5B).

The sequence divergences of the three cryptic groups (2, 3, 4) of H. arctica suggest that they

diverged from a common ancestor 3, 4.5 and 5.5 million years ago, respectively (Figure 2.3). The four

clusters of M. balthica all date to more than 1 million years ago, excluding the split between M. balthica

balthica and the other lineage in Newfoundland which dates at around 900K years (Figure 2.4).

Patterns of genetic diversity

Page 37: Examining patterns of genetic ... - University of Guelph

29

Table 2.1 reports, for H. arctica and M. balthica, the number of unique haplotypes, nucleotide

diversity, haplotype diversity and Tajima’s D index for each population. Populations at Churchill,

Manitoba showed the lowest variation for both M. balthica and H. arctica. Populations of H. arctica from

New Brunswick and British Columbia possessed the highest values for nucleotide and gene diversity

respectively, while Newfoundland had the greatest gene diversity for both M. balthica clades. The high

genetic diversity for Newfoundland may be an artefact of the small sample size (N=5, N=2) for both M.

balthica lineages. Despite their exposure to intensive glaciations, populations of both H. arctica and M.

balthica from New Brunswick were very diverse. Nucleotide and haplotype diversity did not significantly

differ between species. Lastly, Tajima’s D values were negative for all populations except those from

Newfoundland, suggesting the presence of many low frequency haplotypes at these sites. Again, the

slightly positive Tajima’s D values for both Newfoundland populations may simply be a result of

undersampling at this location.

Population structure

There was evidence for population subdivision in M. balthica, but not in H. arctica where most of

the genetic variation resided within populations (Table 2.2A & B). This result for H. arctica was

unaffected by grouping populations from each coast. By contrast, most genetic variation in M. balthica

balthica existed among the coasts with deep divergence between populations in the Atlantic and Pacific

(Table 2.3A & B). The opposite was true for ATL M. balthica as most of its variation existed within

populations (Table 2.4). Fixation indices provided further insight into the partitioning of variation in H.

arctica and M. balthica. In H. arctica, populations from Nova Scotia and New Brunswick showed little

divergence (FST = 0.03), while those from British Columbia and New Brunswick were considerably more

divergent (FST = 0.28) (Table 2.5A). Populations of M. balthica balthica from Alaska and British

Columbia were the most similar (FST = 0.02), while those from Churchill and Newfoundland showed high

divergence (FST = 0.93) (Table 2.5B), a surprising result given the proximity of Hudson Bay and the

Atlantic coast. The ATL lineage of M. balthica also showed unique genetic structure with a high FST (FST

= 0.23) between the Newfoundland and Prince Edward Island populations, although they were only

separated by the narrow Northumberland and Cabot Straits (Table 2.5C). When Slatkin’s linearized FST

values were plotted against geographic distance, only H. arctica demonstrated evidence of isolation by

distance (Figure 2.6) with a strong, positive correlation (R2 = 0.83) and a Mantel test confirmed its

significance (p= 0.011). By contrast, M. balthica balthica demonstrated no evidence for isolation by

distance (Figures 2.7), while ATL M. balthica (Figure 2.8) demonstrated a negative relationship (R2 =

0.78), but Mantel tests indicated that neither value was significant. Only three populations were used to

examine evidence of isolation by distance in the ATL M. balthica, suggesting future work should aim to

gather additional data.

Page 38: Examining patterns of genetic ... - University of Guelph

30

Discussion

Comparing diversity and structure in two bivalves with planktotrophic larval development

While a widespread lineage was present in both bivalve species, the M. balthica complex also

showed evidence of regional divergence. Divergence between species with shared life history strategies

has been demonstrated in other molluscs including the direct developing gastropods Nucella lamellosa

and Nucella ostrina (Marko 2004). H. arctica showed less regional variation across Canada than M.

balthica, although one lineage of the latter species occurs in both Canadian waters and in the Baltic Sea.

The presence of widespread lineages in H. arctica and M. balthica suggests both species have high gene

flow among their populations (Keever et al. 2009, Lee & Boulding 2009). The spatial homogeneity

demonstrated by the main H. arctica lineage may be the result of adults burrowing in kelp holdfasts and

attaching to ship hulls, providing a secondary dispersal mechanism (Helmuth et al. 1994). H. arctica and

M. balthica differ radically in some life history strategies, including habitat and feeding behaviours

(Newell 1965, Ali 1970, Hines & Comtois 1985). M. balthica lives in soft sediment and occupies

shallower habitats and thus may have been more susceptible to local extinctions during glacial maxima,

similar to the intertidal dogwhelk Nucella lapillus (Dorjes et al. 1986, Colson & Hughes 2004. Patterns of

population fragmentation during glacial periods are often also reflected in measures of genetic diversity.

For instance, reduced genetic diversity has been demonstrated in populations of marine fishes severely

impacted by glaciation (Bernatchez & Wilson 1998) as well as in Mya arenaria, the softshell clam

(Strasser & Barber 2009). However, some taxa which have undergone repeated extinctions during

glaciations possess high diversity (Bernatchez & Wilson 1998, Strasser & Barber 2009). For example,

high genetic diversity in the dogwhelk, Nucella lapillus, has been linked to rapid expansion following a

severe population bottleneck (Colson & Hughes 2004). A similar process may explain the high diversity

in populations of both H. arctica and M. balthica from New Brunswick despite their likely exposure to

severe glacial conditions (Briggs 1970, Wares & Cunningham 2001). By contrast, the low genetic

diversity of populations of both species at Churchill may reflect both the fact that Hudson Bay only

formed 8000 years ago (Ashworth 1996) and its isolation from glacial refugia. Conversely, Alaskan

populations of both species were diverse perhaps reflecting their foundation through the admixture of

lineages from two or more refugia (Kelly et al. 2006, Sakaguchi et al. 2011).

Implications for glacial refugia in the northeast Pacific

The admixture of previously isolated lineages is thought to explain high levels of genetic

diversity in certain populations of the longnose dace (Rhinichthys cataractae) and those of the bluestriped

snapper (Lutjanus kasmira) (Girard & Angers 2006, Gaither et al. 2010). Because genetic diversity can be

elevated in both zones of admixture and at sites that served as glacial refugia, it is difficult to determine

which process explains high diversity in any particular situation (Petit et al. 2003, Kelly et al. 2006,

Page 39: Examining patterns of genetic ... - University of Guelph

31

Sakaguchi et al. 2011). Populations of both bivalves examined in this study were diverse in the northeast

Pacific, a pattern also seen in the direct developing gastropod N. lamellosa and the brooding sea

cucumber Cucumaria pseudocurata (Arndt & Smith 2002, Marko 2004). Although the northeast Pacific

coast was glaciated, the extent of glaciation along the Pacific was much less severe than in the northwest

Atlantic (Mandryk et al. 2001, Marko 2004). As well, there may have been several glacial refugia in the

North Pacific despite literature suggesting that populations were displaced to the south (Warner et

al.1982, Hewitt 2000, Mandryk et al. 2001, Marko 2004). Populations in proximity to refugia often

possess many unique haplotypes, while haplotypes in admixture zones combine variants found in two or

more refugia (Provan & Bennett 2008). The presence of many unique haplotypes in Alaskan population

of Nucella lamellosa was invoked as evidence for a northern refugium (Marko 2004). H. arctica and M.

balthica reinforce this pattern as 78% and 92% of the haplotypes in their Alaskan population were unique.

While most of their haplotypes are unique to this area, the presence of some shared haplotypes suggests

possible admixture. The potential for postglacial admixture in Alaska has important implications given

that founder populations may be more fit than parental lineages in these environments, ultimately

accelerating rates of evolution (Wares et al. 2005, Kelly et al. 2006).

Evidence of sibling species

Populations isolated into separate glacial refugia inevitably diverge, leading in time to speciation

(Dodson et al. 2007, Maggs et al. 2008, Dapporto 2009). M. balthica shows evidence of incipient

speciation as indicated by the recognition of two subspecies (Väinölä 2003) with largely allopatric ranges.

The detection of both M. balthica balthica and ATL M. balthica lineages in Newfoundland suggests that

this area is a zone of secondary contact between Pacific and Arctic haplotypes. It also suggests that this

region may be an ideal location for more detailed genetic analysis to determine if these lineages show

incipient or complete reproductive isolation. It is possible that populations near Newfoundland

experienced localized extinction and recolonization, a process reported for some taxa in the northwest

Atlantic (Briggs 1970, Wares & Cunningham 2001). In any case, the presence of more than 8% sequence

divergence between members of these two clusters suggests they may warrant recognition as sibling

species. H. arctica also shows evidence for sibling species with the discovery of four genetically

divergent lineages across Canada, despite the limited evidence for regional differentiation in the most

abundant of these groups. Additional work is required to ascertain if the divergence at COI is

accompanied by divergence at nuclear loci. These findings suggest that several glacial refugia occurred

on the Pacific and Atlantic coasts of Canada and that these refugia fostered speciation. Coyer et al. (2011)

discovered two separate glacial refugia for macroalgae (Fucus distichus) in the Newfoundland area alone.

Moreover, Nikula et al. (2007) suggest that multiple refugia existed for M. balthica, likely contributing to

the deep divergences observed in this complex. In all, the genetic structure of species with planktonic

Page 40: Examining patterns of genetic ... - University of Guelph

32

larvae has not only been influenced by dispersal potential but has also been significantly shaped by

Canada’s extensive glacial history (Bernatchez & Wilson 1998, Dodson et al. 2007).

Conclusions

This study constitutes the first investigation of population structure in Hiatella arctica and further

contributes to our understanding of population structure in Macoma balthica. The present study has

shown that while some lineages of M. balthica and H. arctica demonstrate spatial homogeneity across

Canada, there is also evidence of genetic subdivision, suggesting that population structure varies in

species with similar life history strategies. Because the taxonomic status of the three lineages in the M.

balthica complex is uncertain, future work should incorporate additional genes and studies of

reproductive compatibility to determine their status. Little phylogeographic structure was present in the

main lineage of H. arctica, but the detection of three other deeply divergent lineages suggests the

presence of overlooked sibling species. Whether there are morphological differences, at the larval or early

juvenile stages, among these divergent H. arctica lineages remains to be determined and certainly

warrants further taxonomic research. In addition, this study has revealed that the northeast Pacific is a

zone of high diversity, suggesting it served as a glacial refugium or that it is a zone of secondary contact.

Future work should compare intertidal and subtidal populations to rule out the possibility of depth related

differentiation as well as include a survey of key environments, such as the Chukchi and Labrador Seas,

which remain unsampled. Lastly, future work should aim to calibrate evolutionary rates in each genus as

well as gain an understanding of genetic diversity in populations in Asian waters.

Page 41: Examining patterns of genetic ... - University of Guelph

33

Table 2.1. Genetic diversity in populations of the bivalve species, H. arctica and M. balthica, as measured

by number of haplotypes, haplotype diversity (h), nucleotide diversity (π) and Tajima’s D.

Population N Haplotypes Unique H h π Tajima’s D

NS 60 24 17 0.88 0.0071 -1.35

AK 43 23 18 0.94 0.0075 -1.40

MB 19 7 3 0.71 0.0051 -0.82

NB 25 18 13 0.93 0.0078 -1.73

BC 8 7 5 0.96 0.0060 -0.06

Population N Haplotypes Unique H h π Tajima’s D

AK 38 12 11 0.69 0.0045 -1.48

MB 19 4 3 0.66 0.0019 -0.20

BC 4 3 2 0.83 0.0035 -0.75

NFLD1 5 4 4 0.90 0.0056 0

Population N Haplotypes Unique H h π Tajima’s D

PEI 33 9 6 0.74 0.0053 -1.04

NB 51 12 9 0.83 0.0090 0.01

NFLD2 2 2 0 1 0.016 0

C) ATL M. balthica

B) M. balthica balthica

A) H. arctica

Page 42: Examining patterns of genetic ... - University of Guelph

34

Table 2.2. Overall genetic structure measured by AMOVA (analysis of molecular variance) for H. arctica

both A) grouped by coastal populations and B) with no grouping. P-values < 0.05 were treated as

significant.

Variation df Sum of Squares Variance

Components

% of Variation P-value

Among Coasts 2 36.9 0.30 12.7 0.06

Among Populations w/in

Coasts

2 8.4 0.09 3.8 0.03

Within Populations 150 301.0 2.0 83.6 0.00

Total 154 346.3 2.40

Table 2.3. Overall genetic structure measured by AMOVA for M. balthica balthica A) grouped by coastal

populations and B) with no grouping. P-values < 0.05 were treated as significant.

Variation df Sum of Squares Variance

Components

% of Variation P-value

Among Populations 4 45.3 0.33 14.1 0.00

Within Populations 150 301.0 2.0 85.9

Total 154 346.3 2.33

Variation df Sum of Squares Variance

Components

% of Variation P-value

Among Coasts 2 72.0 2.1 70.9 0.34

Among Populations w/in

Coasts

1 1.1 0.04 1.4 0.19

Within Populations 62 50.3 0.81 27.8 0.00

Total 65 123.4 2.95

Variation df Sum of Squares Variance

Components

% of Variation P-value

Among Populations 3 73.1 1.9 69.6 0.00

Within Populations 62 50.3 0.81 30.4

Total 65 123.4 2.71

A)

B)

A)

B)

Page 43: Examining patterns of genetic ... - University of Guelph

35

Table 2.4. Overall genetic structure measured by AMOVA for ATL M. balthica with no grouping. P-

values < 0.05 were treated as significant.

Variation df Sum of Squares Variance

Components

% of Variation P-value

Among Populations 2 7.0 0.10 6.2 0.07

Within Populations 83 120.3 1.4 93.9

Total 85 127.3

Page 44: Examining patterns of genetic ... - University of Guelph

36

Table 2.5. FST for populations of A) H. arctica, B) M. balthica balthica and C) ATL M. balthica. P-values

< 0.05 are marked with an asterisk.

Nova Scotia New

Brunswick

Alaska B.C. Churchill

Nova Scotia 0

New Brunswick 0.03* 0

Alaska 0.16* 0.22* 0

British Columbia 0.25* 0.28* 0.08 0

Churchill 0.05* 0.06* 0.15* 0.25* 0

Alaska B.C. Churchill Newfoundland

Alaska 0

British Columbia 0.02 0

Churchill 0.09* 0.14* 0

Newfoundland 0.88* 0.88* 0.93* 0

P.E.I. New Brunswick Newfoundland

Prince Edward Island (P.E.I.) 0

New Brunswick 0.07* 0

Newfoundland 0.23 -0.19 0

A)

B)

C)

Page 45: Examining patterns of genetic ... - University of Guelph

37

Figure 2.1. Collection sites for H. arctica and M. balthica. The sample size for each species at a site is

shown in the pie.

Page 46: Examining patterns of genetic ... - University of Guelph

38

Figure 2.2. Intraspecific sequence divergence (K2P) for A) H. arctica and B) M. balthica.

B)

A)

Page 47: Examining patterns of genetic ... - University of Guelph

39

Figure 2.3. Neighbour-joining tree based on K2P distances for H. arctica. The top scale shows estimated

divergence times in millions of years, while the bottom scale bar shows % sequence divergence (K2P).

Bootstrap probabilities are shown on the NJ tree and red triangles represent compressed clades, with

sample size provided in brackets.

2

3

4

1

Page 48: Examining patterns of genetic ... - University of Guelph

40

Figure 2.4. Neighbour-joining tree based on K2P distances for M. balthica. Subspecies names suggested by Väinölä (2003) are provided. The top

scale shows estimated divergence times in millions of years, while the bottom scale bar shows sequence divergence (% K2P). Bootstrap

probabilities are shown on the NJ tree and blue triangles represent compressed clades, with sample size provided in brackets.

M. balthica complex

1: M. balthica balthica

2

3: M. balthica rubra

4

Page 49: Examining patterns of genetic ... - University of Guelph

41

Figure 2.5. Median-joining haplotype networks for A) H. arctica and B) M. balthica constructed in

Network 4.6.1 with maximum parsimony. All mutational steps are equal to 1 unless shown by numeral.

Presumptive ancestral haplotypes are marked with a white star. The size of circles in each network varies

with the number of sequences belonging to each haplotype. M. balthica rubra sequences are included.

1

3

4

2

A)

3

1

2

4 B)

Page 50: Examining patterns of genetic ... - University of Guelph

42

Figure 2.6. FST values (Slatkin’s linearized) plotted against geographic distance (km) for populations

within the main lineage of H. arctica to examine the extent of isolation by distance. Results from a

Mantel test for significance are provided on the plot.

Figure 2.7. FST values (Slatkin’s linearized) plotted against geographic distance (km) for populations of

M. balthica balthica to examine the extent of isolation by distance. Results from a Mantel test for

significance are provided on the plot.

Page 51: Examining patterns of genetic ... - University of Guelph

43

Figure 2.8. FST values (Slatkin’s linearized) plotted against geographic distance (km) for populations of

ATL M. balthica to examine the extent of isolation by distance. Results from a Mantel test for

significance are provided on the plot.

Page 52: Examining patterns of genetic ... - University of Guelph

44

General Conclusions

Summary of findings

My thesis has explored patterns of sequence variation in the COI gene in Canadian marine

molluscs both at the phylum and species level. The results of this work suggest that intraspecific variation

at COI may be higher in molluscs than in other marine phyla, and that patterns of sequence divergence

vary among species with similar life history strategies. My results expand our knowledge of species

diversity and population structure in molluscs and provide crucial insight for conservation strategies,

particularly for translocations. Chapter 1 makes an important contribution toward the construction of a

comprehensive barcode library for Canadian marine molluscs, generating records for nearly 25% of the

known malacological fauna of Canada. This work provides novel insight into how sequence divergence

varies among molluscs, and also reveals a correlation between nearest neighbour distance and GC

content. While my work has begun to fill the gap in barcode coverage for Canadian molluscs, many

species still lack data. My investigations in Chapter 1 revealed deep intraspecific divergences in 9 species,

motivating the two case studies in Chapter 2. In this chapter, I show that population structure can differ

greatly in species with similar larval development and dispersal potential, suggesting that vicariance

events have provoked variable outcomes in genetic divergence patterns. Certainly prior glaciations have

fragmented the populations of both species, provoking genetic subdivision and the formation of species

complexes. Given that patterns of population fragmentation vary between species, future research needs

to examine how habitat preferences and larval resilience have impacted the contemporary distribution of

species.

The application of a DNA barcode library for Canadian marine molluscs

DNA barcoding has proven effective in delineating species boundaries across many animal

groups, but my work is the first attempt to examine its performance in a large number of Canadian marine

molluscs. My investigations revealed 9 cases of deep intraspecific divergence (>2%) that likely represent

cryptic species, potentially representing new Canadian records. Webb et al. (2012) similarly discovered

that apparent cases of high intraspecific divergence in North American Ephemeroptera were largely due

to overlooked species complexes that showed large divergence between their haplotype clusters. Such

overlooked taxa inflate mean intraspecific divergences and highlight the importance of integrating

molecular and morphological approaches to advance our understanding of species boundaries. In light of

this, it is imperative that future barcode studies include detailed taxonomic work to resolve

misidentifications, cases of synonymy, and overlooked species that otherwise generate a misleading sense

of sequence divergence.

Although my study surveyed many species, future work should aim to fill gaps in species

coverage, especially by sampling deep sea habitats where many species await discovery (Archambault et

Page 53: Examining patterns of genetic ... - University of Guelph

45

al. 2010). It would also be valuable to compare patterns of genetic variation in marine molluscs to those in

their freshwater and terrestrial counterparts to gain insight into how patterns of sequence differentiation

vary across these habitats. For instance, the majority of marine species in the tropical Pacific were found

to be more widespread and to show less differentiation than terrestrial species (Paulay and Meyer 2002).

Future work should extend this analysis by comparing patterns of divergence in molluscs from the Arctic,

temperate zone and tropics to better understand how species diversity, as well as rates of speciation and

diversification, varies among these environments. This research would help to provide the data needed to

verify that lower extinction rates are responsible for the greater species richness evident in tropical

settings (Schemske 2012).

Gene flow in marine populations

While gene flow is undoubtedly greater in species with planktonic larvae, recent studies have

indicated that panmixis is not inevitable. In fact, spatial heterogeneity was noted in a planktotrophic

barnacle (Semibalanus balanoides), while the direct developing gastropod (Nucella ostrina) showed

spatial homogeneity (Holm & Bourget 1994, Marko 2004). These results suggest that while dispersal

potential is important, other factors also impact population structure. While my study assessed population

structure in two species with planktotrophic larval development, only a few studies have compared

patterns between species with different modes of larval development. Lee and Boulding (2009) found that

population structure in littorinid snails from the northeast Pacific differed both within and between larval

types. Given this complexity, future work should aim for a broad analysis of population structure in

planktotrophic, lecithotrophic and direct developing species. Furthermore, while planktonic larvae

facilitate gene flow among populations, gene flow may vary if larvae differ in their tolerance to

environmental conditions. As well, some species show delayed metamorphosis when exposed to low

temperatures, so gene flow can be impacted by environmental temperatures (Pechenik 1980). In light of

such complexities, a solid understanding of the physiological tolerances of planktonic larvae and of the

duration of the larval stage would bring new insight into population differentiation.

If dispersal potential were the only factor determining population structure then differing

phylogeographic patterns would always be apparent between species with planktotrophic larval

development and those with direct development. The fact that they are not suggests that vicariance events

play a key role. Multiple glacial cycles have impacted Canada’s coasts and while this study utilizes

coalescent methods to examine the impact of glaciation on population structure, fossil data would provide

crucial insight into both historical distributions and the location of glacial refugia. Lineages showing

marked sequence divergence were discovered in about 7% of the species examined in this study. My

work emphasizes the value of coalescent approaches for population genetics because of the difficulty of

discriminating cryptic species through the fossil record alone.

Page 54: Examining patterns of genetic ... - University of Guelph

46

Conclusions and implications for conservation

Protecting marine biodiversity is a challenging and difficult task, but one which is critical given

the rate at which i) marine ecosystems are being impacted by human activities and ii) species are

becoming extinct. My work has begun the construction of a DNA barcode library for Canadian marine

molluscs that not only provides taxonomic assignments, but also locality information. This study is a

particularly important contribution to the DNA barcoding literature because it highlights the efficacy of

COI for teasing apart patterns of genetic variation on both a broad and local scale. In addition, the

documentation of genetic variation in this phylum lends itself to comparative studies with other marine

phyla. Such research is crucial to better understand speciation in marine environments and to broaden

knowledge of the factors promoting diversification in marine phyla. Discovering cryptic diversity in the

marine realm, particularly in species with a Holarctic distribution, has implications for conservation

efforts that target both species diversity and genetic diversity. For instance, identifying distinct genetic

clusters in species thought to have broad distributions can impact translocation efforts which aim to

replenish locally extirpated populations. These findings can also significantly impact bivalve mariculture.

In all, this thesis broadens our understanding of genetic variation in Canadian marine molluscs, aiding

future conservation efforts.

Page 55: Examining patterns of genetic ... - University of Guelph

47

Literature Cited

Addison, J.A. & Hart, M.W. (2005). Colonization, dispersal, and hybridization influence phylogeography

of North Atlantic sea urchins (Strongylocentrotus droebachiensis). Evolution, 59, 532-543.

Albu, M., Min, X.J., Hickey, D. & Golding, B. (2008). Uncorrected nucleotide bias in mtDNA can mimic

the effects of positive Darwinian selection. Molecular Biology and Evolution, 25, 2521-2524.

Ali, R.M. (1970). The influence of suspension density and temperature on the filtration rate of Hiatella

arctica. Marine Biology, 6, 291-302.

Alison, R.C. & Marincovich, L.J. (1982). A late Oligocene or earliest Miocene molluscan fauna from

Sitkinak Island, Alaska. In: Jurassic (Oxfordian and late Callovian) Ammonites from the Western

Interior Region of the United States (ed Imlay, R.W.). Geological Survey professional paper,

Washington, 1232, 115.

An, H.S. & Lee, J.W. (2012). Development of microsatellite markers for the Korean mussel, Mytilus

coruscus (Mytilidae) using next-generation sequencing. Journal of Molecular Science, 13, 10583-

10593.

Archambault, P., Snelgrove, P.V.R., Fisher, J.A.D., Gagnon, J.M., Garbary, D.J., et al. (2010). From sea

to sea: Canada’s three oceans of biodiversity. PLoS ONE, 5, e12182.

Arndt, A. & Smith, M.J. (2002). Genetic diversity and population structure in two species of sea

cucumber: differing patterns according to mode of development. Molecular Ecology, 7, 1053-

1064.

Ashworth, A. (1996). The response of arctic Carabidae (Coleoptera) to climate change based on the fossil

record of the Quaternary. Annales Zoologici Fennici, 33, 125-131.

Bandelt, H-J., Forster, P. & Röhl, A. (1999). Median-joining networks for inferring intraspecific

phylogenies. Molecular Biology and Evolution, 16, 37-48.

Bax, N., Williamson, A., Aguero, M., Gonzalez, E. & Geeves, W. (2003). Marine invasive alien species:

a threat to global biodiversity. Marine Policy, 27, 313-323.

Becquet, V., Simon-Bouhet, B., Pante, E., Hummel, H. & Garcia, P. (2012). Glacial refugium versus

range limit: Conservation genetics of Macoma balthica, a key species in the Bay of Biscay

(France). Journal of Experimental Marine Biology and Ecology, 432-433, 73-82.

Bernatchez, L. & Wilson, C.C. (1998). Comparative phylogeography of Nearctic and Palearctic fishes.

Molecular Ecology, 7, 431-452.

Bouchet, P., Lozouet, P., Maestrati, P. & Heros, V. (2002). Assessing the magnitude of species richness

in tropical marine environments: exceptionally high numbers of molluscs at a New Caledonia

site. Biological Journal of the Linnean Society, 75, 421-436.

Bouchet, P. (2006). The magnitude of marine biodiversity. In: The exploration of marine biodiversity:

scientific and technological challenges (ed. Duarte, C.M.). Fundacion BBVA, Bilbao, Spain, 31-

64.

Page 56: Examining patterns of genetic ... - University of Guelph

48

Bradbury, I.R., Laurel, B., Snelgrove, P.V.R., Bentzen, P. & Campana, S.E. (2008). Global patterns in

marine dispersal estimates: the influence of geography, taxonomic category and life history.

Proceedings of the Royal Society of London: Biological Series, 275, 1803–1809.

Briggs, J.C. (1970). A faunal history of the North Atlantic Ocean. Systematic Zoology, 19, 19–34.

Bucklin, A., Steinke, D. & Blanco-Bercial, L. (2011). DNA barcoding of marine metazoa. Annual

Review of Marine Science, 3, 471-508.

Campillo, S., Serra, M., Carmona, M.J. & Gomez, A. (2011). Widespread secondary contact and new

glacial refugia in the halophilic rotifer Brachionus plicatilis in the Iberian Peninsula. PLoS ONE,

6, e20986.

Canestrelli, D., Sacco, F. & Nascetti, G. (2011). On glacial refugia, genetic diversity, and

microevolutionary processes: deep phylogeographical structure in the endemic newt Lissotriton

italicus. Biological Journal of the Linnaean Society, 105, 42-55.

Carlton, J.T. (1999). The scale and ecological consequences of biological invasions in the world’s oceans.

In: Invasive species and biodiversity management (eds Sandlund, O.T., Schei, P.J. & Viken, A.).

Kluwer Academic Publishers, Dordrecht, 195-212.

Carr, C., Hardy, S.M., Brown, T.M., Macdonald, T.A. & Hebert, P.D.N. (2010). A tri-oceanic

perspective: DNA barcoding reveals geographic structure and cryptic diversity in Canadian

polychaetes. PLoS ONE, 6, e22232.

Clare, E.L., Kerr, K.C.R., von Königslöw, T.E., Wilson, J.J. & Hebert, P.D.N. (2008). Diagnosing

mitochondrial DNA diversity: applications of a sentinel gene approach. Journal of Molecular

Evolution, 66, 362:367.

Clement, M., Posada, D. & Crandall, K. (2000). TCS: a computer program to estimate gene genealogies.

Molecular Ecology, 9, 1657-1660.

Colgan, D.J., Ponder, W.F. & Eggler, P.E. (1999). Gastropod evolutionary rates and phylogenetic

relationships assessed using partial 28S rDNA and histone H3 sequences. Zoologica Scripta, 29,

29-63.

Colson, I. & Hughes, R.N. (2004). Rapid recovery of genetic diversity of dogwhelk (Nucella lapillus L.)

populations after local extinction and recolonization contradicts predictions from life-history

characteristics. Molecular Ecology, 13, 2223-2233.

Costa, F.O., deWaard, J.R., Boutillier, J., Ratnasingham, S., Dooh, R.T., Hajibabaei, M. & Hebert, P.D.N.

(2007). Biological identifications through DNA barcodes: the case of the Crustacea. Canadian

Journal of Fisheries and Aquatic Sciences, 64, 272-295.

Coyer, J.A., Hoarau, G., Van Schaik, J., Luijckx, P. & Olsen, J.L. (2011). Trans-Pacific and trans-Arctic

pathways of the intertidal macroalga Fucus distichus L. reveal multiple glacial refugia and

colonizations from the North Pacific to the North Atlantic. Journal of Biogeography, 38, 756-

771.

Dapporto, L. (2009). Speciation in Mediterranean refugia and post-glacial expansion of Zerynthia

polyxena (Lepidoptera, Papilionidae). Journal of Zoological Systematics and Evolutionary

Research, 48, 229-237.

Page 57: Examining patterns of genetic ... - University of Guelph

49

Dixon, D.R., Simpson-White, R. & Dixon, L.R.J. (1992). Evidence of thermal stability of ribosomal DNA

sequences in hydrothermal vent organisms. Journal of the Marine Biological Association of the

United Kingdom, 72, 519-527.

Dixon, P. (2003). VEGAN, a package of R functions for community ecology. Journal of Vegetation

Science, 14, 927–930.

Dodson, J.J., Tremblay, S., Colombani, F., Carscadden, J.E. & Lecomte, F. (2007). Trans-Arctic

dispersals and the evolution of a circumpolar marine fish species complex, the capelin (Mallotus

villosus). Molecular Ecology, 16, 5030-5043.

Donald, K.M., Kennedy, M. & Spencer, H.G. (2005). Cladogenesis as the result of long-distance rafting

events in South Pacific topshells (Gastropoda, Trochidae). Evolution, 59, 1701-1711.

Dorjes, J., Michaelis, H. & Rhode, B. (1986). Long-term studies of macrozoobenthos in intertidal and

shallow subtidal habitats near the island of Norderney (East Frisian Coast, Germany).

Hydrobiologia, 142, 217-232.

Drent, J., Luttikhuizen, P.C. & Piersma, T. (2004). Morphological dynamics in the foraging apparatus of

a deposit feeding marine bivalve: phenotypic plasticity and heritable effects. Functional Ecology,

18, 349-356.

Excoffier, L. & Lischer, H.E.L. (2010). Arlequin suite ver 3.5: a new series of programs to perform

population genetics analyses under Linux and Windows. Molecular Ecology Resources, 10, 564-

567.

Gaither, M.R., Bowen, B.W., Toonen, R.J., Planes, S., Messmer, V., Earle, J. & Robertson, D.R. (2010).

Genetic consequences of introducing allopatric lineages of bluestriped snapper (Lutjanus

kasmira) to Hawaii. Molecular Ecology, 19, 1107-1121.

Ghiselli, F., Milani, L., Chang, P.L., Hedgecock, D., Davis, J.P., Nuzhdin, S.V. & Passamonti, M. (2012).

De Novo assembly of the Manila clam Ruditapes philippinarum transcriptome provides new

insights into expression bias, mitochondrial doubly uniparental inheritance and sex determination.

Molecular Biology and Evolution, 29, 771-786.

Girard, P. & Angers, B. (2006). The impact of postglacial marine invasions on the genetic diversity of an

obligate freshwater fish, the longnose dace (Rhinichthys cataractae), on the Quebec peninsula.

Canadian Journal of Fisheries and Aquatic Sciences, 63, 1429-1438.

Gofas, S. (2012). Macoma balthica (Linnaeus, 1758). Accessed through: World Register of Marine

Species at http:// marinespecies.org/aphia.php?p=taxdetails&id=141579.

Grande, C., Templado, J., Cervera, J.L. & Zardoya, R. (2004). Molecular phylogeny of Euthyneura

(Mollusca: Gastropoda). Molecular Biology and Evolution, 21, 303-313.

Hao, X., Jiang, R. & Chen, T. (2011). Clustering 16S rRNA for OTU prediction: a method of

unsupervised Bayesian clustering. Bioinformatics, 27, 611-618.

Harrell, F.E. & Miscellaneous (2012). Hmisc: Harrell miscellaneous. R package version 3.9-3.

Page 58: Examining patterns of genetic ... - University of Guelph

50

Harper, F.M. & Hart, M.W. (2007). Morphological and phylogenetic evidence for hybridization and

introgression in a sea star secondary contact zone. Invertebrate Biology, 126, 373-384.

Hebert, P.D.N., Cywinska, A., Ball, S.L. & deWaard, J.R. (2003). Biological identifications through

DNA barcodes. Proceedings of the Royal Society of London Series B: Biological Sciences, 270,

313-321.

Hebert, P.D.N., Penton, E.H., Burns, J.M., Janzen, D.H., Hallwachs, W. (2004). Ten species in one: DNA

barcoding reveals cryptic species in the neotropical skipper butterfly Astraptes fulgerator.

Proceedings of the National Academy of Sciences of the United States of America, 101,

14812-14817.

Hedgecock, D., Li, G., Hubert, S., Bucklin, K. & Ribes, V. (2004). Widespread null alleles and poor

cross-species amplification of microsatellite DNA loci cloned from the Pacific oyster

(Crassostrea gigas). Journal of Shellfish Research, 23, 379-385.

Hellberg, M.E. & Vacquier, V.D. (1999). Rapid evolution of fertilization selectivity and lysin cDNA

sequences in teguline gastropods. Molecular Biology and Evolution, 16, 839-848.

Helmuth, B., Veit, R.R. & Holberton, R. (1994). Long distance dispersal of subantarctic brooding bivalve

(Gaimardia trapesina) by kelp rafting. Marine Biology, 120, 421-426.

Hewitt, G.M. (1996). Some genetic consequences of ice ages, and their role in divergence and speciation.

Biological Journal of the Linnaean Society, 58, 247–276.

Hewitt, G. (2000). The genetic legacy of the Quaternary ice ages. Nature, 405, 907-913.

Hines, A.H. & Comtois, K.L. (1985). Vertical distribution of infauna in sediments of a subestuary of

central Chesapeake Bay. Estuaries, 8, 296-304.

Holm, E.R. & Bourget, E. (1994). Selection and population genetic structure of the barnacle Semibalanus

balanoides in the northwest Atlantic and Gulf of St. Lawrence. Marine Ecological Progress

Series, 113, 247-256.

Hunt, B., Strugnell, J., Bednarsek, N., Linse, K., Nelson, R.J., Pakhomov, E., Seibel, B., Steinke, D. &

Würzberg, L. (2010). Poles apart: The “bipolar” pteropod species Limacina helicina is genetically

distinct between the Arctic and Antarctic Oceans. PLoS ONE, 5, e9835.

Ingolfsson, A. (1992). The origin of the rocky shore fauna of Iceland and the Canadian Maritimes.

Journal of Biogeography, 19, 705-712.

Ivanova, N.V., Fazekas, A.J. & Hebert, P.D.N. (2008). Semi-automated, membrane-based protocol for

DNA isolation from plants. Plant Molecular Biology Reporter, 26, 186-198.

Jablonski, D. (1986). Larval ecology and macroevolution in marine invertebrates. Bulletin of Marine

Science, 39, 565-587.

Järnegren, J., Schander, C., Sneli, J.A., Rønningen, V. & Young, C. (2007). Four genes, morphology and

ecology: distinguishing a new species of Acesta (Mollusca; Bivalvia) from the Gulf of Mexico.

Marine Biology, 152, 43-55.

Page 59: Examining patterns of genetic ... - University of Guelph

51

Jennings, R.M., Bucklin, A., Ossenbrügger, H. & Hopcroft, R.R. (2010). Species diversity of planktonic

gastropods (Pteropoda and Heteropoda) from six ocean regions based on DNA barcode analysis. Deep-Sea Research Part II: Topical Studies in Oceanography, 57, 2199-2210.

Johnson, S.B., Waren, A. & Vrijenhoek, R.C. (2008). DNA barcoding of Lepetodrilus limpets reveals

cryptic species. Journal of Shellfish Research, 27, 43-51.

Jones, M., Ghoorad, A. & Blaxter, M. (2011). jMOTU and Taxonerator: turning DNA barcode sequences

into annotated operational taxonomic units. PLoS ONE, 6, e19359.

Keever, C.C., Sunday, J., Puritz, J.B., Addison, J.A., Toonen, R.J., Grosberg, R.K. & Hart, M.W. (2009).

Discordant distribution of populations and genetic variation in a sea star with high dispersal

potential. Evolution, 63, 3214-3227.

Kelly, D.W., Muirhead, J.R., Heath, D.D. & MacIsaac, H.J. (2006). Contrasting patterns in genetic

diversity following multiple invasions of fresh and brackish waters. Molecular Ecology, 15,

3641-3653.

Kembel, S.W., Cowan, P.D., Helmus, M.R., Cornwell, W.K., Morlon, H., Ackerly, D.D., Blomberg, S.P.

& Webb, C.O. (2010). Picante: R tools for integrating phylogenies and ecology. Bioinformatics,

26, 1463-1464.

Kerr, K.C.R., Lijtmaer, D.A., Barreira, A.S., Hebert, P.D.N. & Tubaro, P.L. (2009). Probing evolutionary

patterns in neotropical birds through DNA barcodes. PLoS ONE, 4, e4379.

Kimura, M. (1980). A simple method for estimating evolutionary rates of base substitutions through

comparative studies of nucleotide sequences. Journal of Molecular Evolution, 16, 111-120.

Knowlton, N. (2000). Molecular genetic analyses of species boundaries in the sea. Hydrobiologia, 420,

73–90.

Kyle, C.J. & Boulding, E.G. (2000). Comparative population genetic structure of marine gastropods

(Littorina spp.) with and without pelagic larval dispersal. Marine Biology, 137, 835-845.

Lebour, M.V. (1938). Notes on the breeding of some lamellibranchs from Plymouth and their larvae.

Journal of the Marine Biological Association of the United Kingdom, 23, 119-144.

Lee, H.J. & Boulding, E.G. (2009). Spatial and temporal population structure of four northeastern Pacific

littorinid gastropods: the effect of mode of larval development on variation at one mitochondrial

and two nuclear DNA markers. Molecular Ecology, 18, 2165-2184.

Luttikhuizen, P.C., Drent, J. & Baker, A.J. (2003). Disjunct distribution of highly diverged mitochondrial

lineage clade and population subdivision in a marine bivalve with pelagic larval dispersal.

Molecular Ecology, 12, 2215–2229.

Maggs, C.A., Castilho, R., Foltz, D., Henzler, C., Jolly, M.T., Kelly, J., Olsen, J., Perez, K.E., Stam, W.,

Väinolä, R., Viard, F. & Wares, J. (2008). Evaluating signatures of glacial refugia for North

Atlantic benthic marine taxa. Ecology, 89, S108-S122.

Page 60: Examining patterns of genetic ... - University of Guelph

52

Mandryk, C.A.S., Josenhans, H., Fedje, D.W. & Mathewes, R.W. (2001). Late Quaternary

paleoenvironments of Northwestern North America: implications for inland versus coastal

migration routes. Quaternary Science Review, 20, 301-314.

Marko, P.B. (2002). Fossil calibration of molecular clocks and the divergence times of geminate species

pairs separated by the Isthmus of Panama. Molecular Biology and Evolution, 19, 2005-2021.

Marko, P.B. (2004). ‘What’s larvae got to do with it?’ Disparate patterns of post-glacial population

structure in two benthic marine gastropods with identical dispersal potential. Molecular Ecology,

13, 597-611.

Marko, P.B. & Moran, A.L. (2009). Out of sight, out of mind: high cryptic diversity obscures the

identities and histories of geminate species in the marine bivalve subgenus Acar. Journal of

Biogeography, 36, 1861-1880.

Meehan, B.W. (1985). Genetic comparison of Macoma balthica (Bivalvia, Telinidae) from the eastern

and western North Atlantic Ocean. Marine Ecological Progress Series, 22, 69-76.

Meyer, C. (2003). Molecular systematics of cowries (Gastropoda: Cypraeidae) and diversification

patterns in the tropics. Biological Journal of the Linnean Society, 79, 401-459.

Meyer, C.P. & Paulay, G. (2005). DNA barcoding: error rates based on comprehensive sampling. PLoS

Biology, 3, e422.

Mikkelsen, N.T., Schander, C. & Willassen, E. (2007). Local scale DNA barcoding of bivalves

(Mollusca): a case study. Zoologica Scripta, 36, 455-463.

Naughton, K.M. & O’Hara, T.D. (2009). A new brooding species of the biscuit star Tosia

(Echinodermata: Asteroidea: Goniasteridae), distinguished by molecular, morphological and

larval characters. Invertebrate Systematics, 23, 348-366.

Nei, M. (1987). Molecular Evolutionary Genetics. Columbia University Press, New York.

Newell, R.C. (1965). The role of detritus in the nutrition of two marine deposit feeders, the prosobranch

Hydrobia ulvae and the bivalve Macoma balthica. Proceedings of the Royal Zoological Society of

London, 144, 25- 45.

Nikula, R., Strelkov, P. & Väinölä, R. (2007). Diversity and trans-arctic invasion history of mitochondrial

lineages in the North Atlantic Macoma balthica complex (Bivalvia: Tellinidae). Evolution, 61,

928-941.

Passamonti, M. & Ghiselli, F. (2009). Doubly uniparental inheritance: two mitochondrial genomes, one

precious model for organelle DNA inheritance and evolution. DNA and Cell Biology, 28, 79-89.

Paulay, G. & Meyer, C.M. (2002). Diversification in the tropical Pacific: comparisons between marine

and terrestrial systems and the importance of founder speciation. Integrative and Comparative

Biology, 42, 922-934.

Pechenik, J.A. (1980). Growth and energy balance during the larval lives of three prosobranch

gastropods. Journal of Experimental Marine Biology and Ecology, 44, 1-28.

Page 61: Examining patterns of genetic ... - University of Guelph

53

Peterson, G.H. (1999). Five recent Mya species, including three new species and their fossil connections.

Polar Biology, 22, 322-328.

Petit, R.J., Aguinagalde, I., de Beaulieu, J-L., Bittkau, C., Brewer, S., et al. (2003). Glacial refugia:

hotspots but not melting pots of genetic diversity. Science, 300, 1563-1565.

Plazzi, F., Ceregato, A., Taviani, M. & Passamonti, M. (2011). A molecular phylogeny of bivalve

mollusks: ancient radiations and divergences as revealed by mitochondrial genes. PLoS ONE, 6,

e27147.

Puillandre, N., Lambert, A., Brouillet, S. & Achaz, G. (2011). ABGD, Automated Barcode Gap

Discovery for primary species delineation. Molecular Ecology, 21, 1864-1877.

R Development Core Team (2008). R: A language and environment for statistical computing. R

Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL

http://www.R-project.org.

Radulovici, A.E., Archambault, P. & Dufresne, F. (2010). DNA barcodes for marine biodiversity: moving

fast forward? Diversity, 2, 450-472.

Ramel, C. (1998). Biodiversity and intraspecific genetic variation. Pure and Applied Chemistry, 70, 2079-

2084.

Ratnasingham, S. & Hebert, P.D.N. (2007). BOLD: The Barcode of Life Data System

(www.barcodinglife.org). Molecular Ecology Notes, 7, 355-364.

Reid, D.G. (1990). Trans-Arctic migration and speciation induced by climate change: The biogeography

of Littorina (Mollusca: Gastropoda). Bulletin of Marine Science, 47, 35-49.

Remigio, E.A. & Hebert, P.D.N. (2003). Testing the utility of partial COI sequences for phylogenetic

estimates of gastropod relationships. Molecular Phylogenetics and Evolution, 29, 641-647.

Rohling, R.J., Fenton, M., Jorissen, F.J., Bertrand, P., Ganssen, G., Caulet, J.P. (1998). Magnitudes of

sea-level lowstands of the past 500,000 years. Nature, 394, 162–165.

Saitou, N. & Nei, M. (1987). The neighbor-joining method: a new method for reconstructing phylogenetic

trees. Molecular Biology and Evolution, 4, 406-425.

Sakaguchi, S., Takeuchi, Y., Yamasaki, M., Sakurai, S. & Isagi, Y. (2011). Lineage admixture during

postglacial range expansion is responsible for the increased gene diversity of Kalopanax

septemlobus in a recently colonised territory. Heredity, 107, 338-348.

Schemske, D.W. (2009). Biotic interactions and speciation in the tropics. In: Speciation and patterns of

diversity (eds Butlin, R.K., Bridle, J.R. & Schulter, D.). Cambridge University Press, British

Ecological Society, 219-239.

Schneider, S. & Kaim, A. (2012). Early ontogeny of middle Jurassic hiatellids from a wood-fall

association: implications for phylogeny and paleoecology of Hiatellidae. Journal of Molluscan

Studies, 78, 119-127.

Page 62: Examining patterns of genetic ... - University of Guelph

54

Slatkin, M. (1993). Isolation by distance in equilibrium and non-equilibrium populations. Evolution, 47,

264-279.

Snelgrove, P.V.R. (1999). Getting to the bottom of marine biodiversity: sedimentary habitats. Bioscience,

49, 129-138.

Snelgrove, P.V.R. (2010). Discoveries of Census of Marine Life: making ocean life count. Cambridge

University Press.

Steinke, D., Zemlak, T.S., Boutillier, J.A. & Hebert, P.D.N. (2009a). DNA barcoding Pacific Canada’s

fishes. Marine Biology, 156, 2641-2647.

Sun, Y., Li, Q., Kong, L. & Zheng, X. (2011). DNA barcoding of Caenogastropoda along coast of China

based on the COI gene. Molecular Ecology Resources, 12, 209-218.

Tajima, F. (1989). Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Genetics, 123, 585-595.

Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M. & Kumar, S. (2011). MEGA5: molecular

evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum

parsimony methods. Molecular Biology and Evolution, 28, 2731-2739.

Taylor, E.B. & Dodson, J.J. (1994). A molecular analysis of relationships and biogeography within a

species complex of Holarctic fish (genus Osmerus). Molecular Ecology, 3, 235–248.

Thomaz, D., Guiller, A. & Clarke, B. (1996). Extreme divergence of mitochondrial DNA within species

of pulmonate land snails. Proceedings of the Royal Society B: Biological Sciences, 263, 363-368.

Tian, D., Wang, Q., Zhang, P., Araki, H., Yang, S., Kreitman, M., Nagylaki, T., Hudson, R., Bergelson, J.

& Chen, J.Q. (2008). Single-nucleotide mutation rate increases close to insertions/deletions in

eukaryotes. Nature, 455, 105–108.

Újvári, B., Madsen, T., Kotenko, T., Olsson, M., Shine, R. & Wittzell, H. (2002). Low genetic diversity

threatens imminent extinction for the Hungarian meadow viper (Vipera ursinii rakosiensis).

Biological Conservation, 105, 127-130.

van der Spoel, S. & Dadon, J.R. (1999). Pteropoda. In: South Atlantic Zooplankton (ed. Boltovskoy, D.)

Bachhuys Publishers, Leiden Netherlands, 649–706.

Väinölä, R. (2003). Repeated trans-Arctic invasions in littoral bivalves: molecular zoogeography of the

Macoma balthica complex. Marine Biology, 143, 935-946.

Vermeij, G. (1991). Anatomy of an invasion: the trans-Arctic interchange. Paleobiology, 17, 281–307.

Vetsigian, K. & Goldenfeld, N. (2005). Global divergence of microbial genome sequences mediated by

propagating fronts. Proceedings of the National Academy of Sciences of the United States of

America, 102, 7332-7337.

Ward, R.D., Zemlak, T.S., Innes, B.H., Last, P.R. & Hebert, P.D.N. (2005). DNA barcoding Australia’s

fish species. Philosophical Transactions of the Royal Society B: Biological Sciences, 360, 1847–

1857.

Page 63: Examining patterns of genetic ... - University of Guelph

55

Ward, R.D., Holmes, B.H. & O’Hara, T.D. (2008). DNA barcoding discriminates echinoderm species.

Molecular Ecology Resources, 8, 1202-1211.

Wares, J.P. & Cunningham, C.W. (2001). Phylogeography and historical ecology of the North Atlantic

intertidal. Evolution, 55, 2455-2469.

Wares, J.P., Hughes, A.R. & Grosberg, R.K. (2005). Mechanisms that drive evolutionary change: insights

from species introductions and invasions. In: Species Invasions: Insights into Ecology, Evolution

and Biogeography (eds Sax, D.F., Stachowicz, J.J. & Gaines, S.D.). Sinauer Associates,

Sunderland, Massachusetts, 229-257.

Warner, B.G., Mathewes, R.W. & Clague, J.J. (1982). Ice-free conditions on the Queen Charlotte Islands,

British Columbia, at the height of the late Wisconsin glaciation. Science, 218, 675-677.

Webb, T. III & Bartlein, P.J. (1992). Global change during the last 3 million years: climatic controls and

biotic responses. Annual Review of Ecology, Evolution and Systematics, 23, 141–173.

Webb, J.M., Jacobus, L.M., Funk, D.H., Zhou, X., Kondratieff, B., Geraci, C.J., DeWalt, R.E., Baird,

D.J., Richard, B., Phillips, I. & Hebert, P.D.N. (2012). A DNA barcode library for North

American Ephemeroptera: progress and prospects. PLoS ONE, 7, e38063.

Weir, B.S. & Cockerham, C.C. (1984). Estimating F-statistics for the analysis of population structure.

Evolution, 38, 1358-1370.

Witt, J.D.S., Threloff, D.L. & Hebert, P.D.N. (2006). DNA barcoding reveals extraordinary cryptic

diversity in an amphipod genus: implications for desert spring conservation. Molecular Ecology,

15, 3073-3082.

Wu, H., Zhang, Z., Hu, S. & Yu, J. (2012). On the molecular mechanism of GC content variation among

eubacterial genomes. Biology Direct, 7, 1-16.

Zou, S., Li, Q., Kong, L., Yu, H. & Zheng, X. (2011). Comparing the usefulness of distance, monophyly

and character-based DNA barcoding methods in species identification: a case study of

Neogastropoda. PLoS ONE, 6, e26619.

Zou, S,. Li, Q. & Kong, L. (2012). Multigene barcoding and phylogeny of geographically widespread

muricids (Gastropoda: Neogastropoda) along the coast of China. Marine Biotechnology, 14, 21-

34.

Zouros, E. (2012). Biparental inheritance through uniparental transmission: the Doubly Uniparental

Inheritance (DUI) of Mitochondrial DNA. Evolutionary Biology, 1-31.

Page 64: Examining patterns of genetic ... - University of Guelph

56

Appendices

Appendix A

Specimen Preservation

Most specimens were fixed in the field in 95% ethanol however specimens from southeast Alaska

were fixed in 100% ethanol due to available resources at the University of Fairbanks. Upon initial

fixation, the operculum of snails was carefully pulled back to prevent DNA degradation. Similarly, a

small, lateral incision was made along the shells of bivalves to ensure that interior tissues would be

properly preserved. Ethanol was replaced in the field once a day for the first 3 days and up to 3 additional

times when resources were available. Once arriving back at the lab, specimens were stored in a -20°C

freezer or a fridge, depending on available space. Ethanol was refreshed immediately upon returning to

the lab, even prior to sorting lots into individual specimens.

Five to ten specimens per species were chosen for processing and were assigned unique sample

names. These specimens were imaged with a Canon EOS 30D/50D camera and hydrated with several

drops of ethanol during imaging. All images, along with specimen information, can be found on the

Barcode of Life Database (BOLD). During sub-sampling, tissue was typically removed from the foot in

gastropods and chitons and from the adductor muscle or mantel in bivalves. After tissue was removed

from each specimen for molecular analysis, ethanol was refreshed in all specimens prior to being placed

back in cold storage. Interestingly, we saw high sequencing success in specimens from British Columbia

that were fixed in dry ice and stored in a -80°C freezer, suggesting these are the best preservation

techniques for marine molluscs.

Page 65: Examining patterns of genetic ... - University of Guelph

57

Appendix B

Species Identifications

All specimens were assigned interim species names in the field through the references outlined in

Table B.1. After processing specimens, I traveled to the Canadian Museum of Nature to verify species

identifications for all barcode clusters recovered in Chapter 1. I worked alongside Dr. André Martel and

used the references outlined in Table B.1 to assign species-level identifications to each cluster.

Nudibranchs and microsnails were hydrated in 70% ethanol and identified under a microscope while

larger, shelled gastropods were identified dry. Shelled gastropods were identified by the number and

shape of radial ridges and striaea along their shell as well as by the operculum material, aperature teeth

and general whorl shape. Bivalves were often identified through shell morphology alone so tissue was

removed to view muscle scars, the pallial sinus and hinge dentition, all diagnostic features used for

identifying bivalves (Figure B.1). Chitons were hydrated in 70% ethanol under a microscope to view

girdle scales and spines, key features used for identification in this class. Species identifications are

particularly difficult in Tonicella, a diverse genus of chiton. Two commonly misidentified species, T.

marmorea and T. rubra, were distinguished through girdle scale patterns (Figure B.2). In T. rubra, girdle

scales are 2 to 3 times wider than in T. marmorea, the latter also possesses a much lighter girdle colour.

Page 66: Examining patterns of genetic ... - University of Guelph

58

Table B.1. References for species identifications made in the field and at the Canadian Museum of

Nature.

Place of Identification Reference: Title Reference: Author

Field Sites National Audubon Society Regional Guide to

Atlantic and Gulf Coast: A Personal Journey

Amos, S.H. (1985)

The Marine Molluscs of Arctic Canada:

National Museums of Canada

Macpherson, E. (1971)

The Larousse Guide to Shells of the World Oliver, A.P.H. (1980)

Seashells of the Northeast Coast from Cape

Hatteras to Newfoundland

Gordon, J. & Weeks, T.E.

(1982)

Intertidal bivalves: a guide to the common

marine bivalves of Alaska

Foster, N.R. (1991)

National Audubon Society Field Guide to

North American Seashore Creatures

Meinkoth. N.A. (1981)

Shells & Shellfish of the Pacific Northwest:

A Field Guide

Harbo, R.M. (2009)

Canadian Museum of Nature Peterson Field Guides: Shells of the Atlantic

& Gulf Coasts & the West Indies

Abbott, R.T., Morris, P.A. &

Peterson, R.T. (1995)

Seashore Life of the Northern Pacific Coast:

An Illustrated Guide to Northern California,

Oregon, Washington, and British Columbia

Kozloff, E.N. (1983)

Between Pacific Tides Ricketts, E.F., Calvin, J. &

Hedgpeth, J.W. (1992)

American Seashells: The Marine Mollusca of

the Atlantic and Pacific Coasts of North

America

Abbott, R.T. (1974)

Peterson Field Guides: A Field Guide to the

Atlantic Seashore

Gosner, K.L. & Peterson,

R.T. (1999)

A New Species of the Genus Macoma

(Pelecypoda) from West American Coastal

Waters, with comments on Macoma calcarea

Dunnill, R.M. & Coan, E.V.

(1968)

Marine Bivalve Molluscs of the Canadian

Central and Eastern Arctic: Faunal

Composition & Zoogeography

Lubinksy, I. (1980)

Catalogue of the Marine Invertebrates of the

Estuary and Gulf of Saint Lawrence

Brunel, P., Bosse, L. &

Lamarche, G. (1998)

Seashells of North America: A Guide to Field

Identification

Abbott, R.T. (2001)

Bivalve Seashells of Western North America:

Marine Bivalve Molluscs from Arctic Alaska

to Baja California

Coan, E.V., Scott, P.V. &

Bernard, F.R. (2000)

Page 67: Examining patterns of genetic ... - University of Guelph

59

Figure B.1. Hinge dentition in bivalves, a common character used for species identifications. A) Primitive

taxodont teeth displayed in a Nucula specimen and B) heterodont teeth present near the umbone in

Macoma balthica.

A)

B)

Page 68: Examining patterns of genetic ... - University of Guelph

60

Figure B.2. View of girdle scales on A) Tonicella marmorea and B) Tonicella rubra, two cryptic species

of chiton in Canadian oceans. Scale patterns are diagnostic features for identification in this genus.

A)

B)

Page 69: Examining patterns of genetic ... - University of Guelph

61

Appendix C

Chapter 1 Supplementary Material

Table C.1. List of COI primers used for molecular techniques in Chapter 1. * indicates those primers that

were tested but recovered little to no sequences.

Primer Name

(F/R)

Nucleotide Sequence (5’ to 3’) Reference

LCO1490_t1/

HCO2198_t1

TGTAAAACGACGGCCAGTGGTCAACAAATCATAAAGATATTGG/

CAGGAAACAGCTATGACTAAACTTCAGGGTGACCAAAAAATCA Floyd

dgLCO-1490/

dgHCO-2198

GGTCAACAAATCATAAAGAYATYGG/

TAAACTTCAGGGTGACCAAARAAYCA Meyer 2003

BivF4_t1/BivR1_t1 TGTAAAACGACGGCCAGTGKTCWACWAATCATAARGATATTGG/ CAGGAAACAGCTATGACTAMACCTCWGGRTGVCCRAARAACCA

Prosser

unpublished

C_LepFolF/

C_LepFolR

ATTCAACCAATCATAAAGATATTGGGGTCAACAAATCATAAAGATATTGG/

TAAACTTCTGGATGTCCAAAAAATCATAAACTTCAGGGTGACCAAAAAATCA Ivanova

LepF1/LepR1* ATTCAACCAATCATAAAGATATTGG/ TAAACTTCTGGATGTCCAAAAAATCA

Hebert et al. 2004

FishF2/FishR2* TCGACTAATCATAAAGATATCGGCAC/

ACTTCAGGGTGACCGAAGAATCAGAA Ward et al. 2005

C_GasF1_t1*/

GasR1_t1

TGTAAAACGACGGCCAGTTTTCAACAAACCATAARGATATTGGTGTAAAACG

ACGGCCAGTATTCTACAAACCACAAAGACATCGGTGTAAAACGACGGCCAGTTTTCWACWAATCATAAAGATATTGG/

CAGGAAACAGCTATGACACTTCWGGRTGHCCRAARAATCARAA

Prosser

unpublished

Table C.2. List of GenBank specimens used for analysis in Chapter 1.

Process ID Sample ID Class Species

GBML0009-06

GBML0013-06

GBML0014-06

GBML0015-06

GBML0016-06

GBML0017-06

GBML0018-06

GBML0019-06

GBML0020-06

GBML0021-06

GBML0022-06

GBML0024-06

GBML0025-06

GBML0026-06

GBML0027-06

GBML0028-06

GBML0029-06

GBML0030-06

GBML0031-06

GBML0032-06

GBML0033-06

GBML0035-06

GBML0062-06

GBML0064-06

GBML0065-06

AB084110

AF120639

AF120640

AY260813

AY260814

AY260815

AY260816

AY260817

AY260818

AY260821

AY260822

AY260824

AY260825

AY260826

AY260827

AY260828

AY260829

AY260830

AY260831

AY260832

AY260833

AY342055

DQ093531

NC_005840

NC_006162

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Scaphopoda

Episiphon yamakawai

Dentalium pilsbryi

Rhabdus rectius

Antalis antillarum

Antalis antillarum

Antalis dentalis

Antalis entalis

Antalis entalis

Antalis entalis

Dentalium pilsbryi

Antalis sp. PR-2003

Fissidentalium candidum

Graptacme eborea

Rhabdus rectius

Rhabdus rectius

Entalina tetragona

Gadila aberrans

Gadila aberrans

Polyschides carolinensis

Pulsellum salishorum

Pulsellum salishorum

Siphonodentalium lobatum

Antalis inaequicostata

Siphonodentalium lobatum

Graptacme eborea

Page 70: Examining patterns of genetic ... - University of Guelph

62

Page 71: Examining patterns of genetic ... - University of Guelph

63

Page 72: Examining patterns of genetic ... - University of Guelph

64

Figure C.1. Neighbour-joining tree (K2P) for all barcoded specimens (1334).

Page 73: Examining patterns of genetic ... - University of Guelph

65

Appendix D

Chapter 2 Supplementary Material

Table D.1. Detailed collection information for all 172 Hiatella arctica specimens in this study.

Sample ID Country Province Region Sector Site Descrip. Lat Long Depth

11BIOAK-0043

11BIOAK-0046

11BIOAK-0047

11BIOAK-0048

11BIOAK-0049

11BIOAK-0050

11BIOAK-0051

11BIOAK-0052

11BIOAK-0053

11BIOAK-0054

11BIOAK-0115

11BIOAK-0116

11BIOAK-0117

11BIOAK-0125

11BIOAK-0126

11BIOAK-0127

11BIOAK-0128

11BIOAK-0135

11BIOAK-0150

11BIOAK-0151

11BIOAK-0152

11BIOAK-0153

11BIOAK-0154

11BIOAK-0188

11BIOAK-0198

11BIOAK-0254

11BIOAK-0255

11BIOAK-0337

11BIOAK-0338

11BIOAK-0606

11BIOAK-0622

11BIOAK-0636

11BIOAK-0637

11BIOAK-0638

11BIOAK-0639

11BIOAK-0640

11BIOAK-0641

11BIOAK-0642

11BIOAK-0643

11BIOAK-0644

11BIOAK-0645

11BIOAK-0646

11BIOAK-0647

11BIOAK-0705

11BIOAK-0706

11BIOAK-0707

11BIOAK-0708

11BIOAK-0710

11BIOAK-0714

11BIOAK-0715

10BCMOL-00374

11POPHI-0007

11POPHI-0008

11POPHI-0009

11POPHI-0010

11POPHI-0011

11POPHI-0012

11POPHI-0013

11POPHI-0014

11POPHI-0015

11POPHI-0016

10PROBE-105770.1

10PROBE-105780.1

10PROBE-105790.1

10PROBE-105900.1

10PROBE-105910.1

10PROBE-105920.1

10PROBE-106050

10PROBE-106060

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

BC

BC

BC

BC

BC

BC

BC

BC

BC

BC

BC

MB

MB

MB

MB

MB

MB

MB

MB

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Bamfield

Edward King Is.

Scott's Bay

Scott's Bay

Scott's Bay

Scott's Bay

Scott's Bay

Prasiola Point

Prasiola Point

Prasiola Point

Scott's Bay

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Kasitsna Bay Lab

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Kachemak Bay

Kachemak Bay

Kachemak Bay

Outside Beach

Outside Beach

Outside Beach

Outside Beach

Kasitsna Bay Lab

Kasitsna Bay Lab

Kasitsna Bay Lab

Kasitsna Bay Lab

Kasitsna Bay Lab

Kasitsna Bay Lab

Little Tutka

Jakalof Bay

China Poot

China Poot

Little Tutka

Little Tutka

Camel Rock

Outside Beach

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Kasitsna Bay Lab

Kasitsna Bay Lab

Kasitsna Bay Lab

Kasitsna Bay Lab

Kasitsna Bay Lab

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Little Tutka

Churchill River

Churchill River

Churchill River

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

kelp trawl

kelp trawl

kelp trawl

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

dock side

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

dock side

kelp trawl

kelp trawl

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

dredge

dredge

dredge

intertidal

intertidal

intertidal

intertidal

intertidal

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.64

59.64

59.64

59.46

59.46

59.46

59.46

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.56

59.56

59.47

59.47

59.44

59.46

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

58.79

58.79

58.79

58.77

58.77

58.77

58.77

58.77

-151.55

-151.49

-151.49

-151.49

-151.49

-151.49

-151.49

-151.49

-151.49

-151.49

-151.37

-151.37

-151.37

-151.71

-151.71

-151.71

-151.71

-151.55

-151.55

-151.55

-151.55

-151.55

-151.55

-151.49

-151.54

-151.25

-151.25

-151.49

-151.49

-151.72

-151.71

-151.49

-151.49

-151.49

-151.49

-151.49

-151.49

-151.49

-151.55

-151.55

-151.55

-151.55

-151.55

-151.49

-151.49

-151.49

-151.49

-151.49

-151.49

-151.49

-94.21

-94.21

-94.21

-93.87

-93.87

-93.87

-93.87

-93.87

0

0

0

0

0

0

0

0

0

0

15

15

15

0

0

0

0

0

0

0

0

0

0

0

0

9

9

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

15

15

15

0

0

Page 74: Examining patterns of genetic ... - University of Guelph

66

10PROBE-106070

10PROBE-106080

10PROBE-106090

10PROBE-106100

10PROBE-106110

10PROBE-106120

10PROBE-106130

10PROBE-106140

10PROBE-106150

10PROBE-106160

10PROBE-106170

10NBMOL-10008.1

10NBMOL-10009.1

10NBMOL-10010.1

10NBMOL-10011.1

11BFMOL-0017

11BFMOL-0018

11BFMOL-0019

11BFMOL-0020

11BFMOL-0021

11BFMOL-0022

11BFMOL-0023

11BFMOL-0024

11BFMOL-0025

11BFMOL-0026

11BFMOL-0027

11BFMOL-0040

11BFMOL-0041

11BFMOL-0042

11BFMOL-0043

11BFMOL-0044

11BFMOL-0045

11BFMOL-0068

11BFMOL-0069

11BFMOL-0070

11BFMOL-0087

11BFMOL-0132

11BFMOL-0157

11BFMOL-0158

11BFMOL-0159

11BFMOL-0160

11BFMOL-0161

11BFMOL-0297

11ECMOL-0364

11ECMOL-0365

11ECMOL-0366

11ECMOL-0367

11ECMOL-0368

11ECMOL-0369

11ECMOL-0370

11ECMOL-0371

11ECMOL-0372

11ECMOL-0373

11ECMOL-0374

11ECMOL-0375

11ECMOL-0376

11ECMOL-0377

11ECMOL-0378

11ECMOL-0379

11ECMOL-0380

11ECMOL-0381

11ECMOL-0382

11ECMOL-0383

11ECMOL-0384

11ECMOL-0385

11ECMOL-0386

11ECMOL-0387

11ECMOL-0388

11ECMOL-0389

11ECMOL-0390

11ECMOL-0391

11ECMOL-0392

11ECMOL-0393

11ECMOL-0394

11ECMOL-0395

11ECMOL-0396

11ECMOL-0397

11ECMOL-0398

11ECMOL-0399

11ECMOL-0400

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Passamaquoddy

Passamaquoddy

Passamaquoddy

Passamaquoddy

Spruce Island

Spruce Island

Spruce Island

Spruce Island

Spruce Island

Spruce Island

Spruce Island

Spruce Island

Spruce Island

Spruce Island

Spruce Island

The Wolves

The Wolves

The Wolves

The Wolves

The Wolves

The Wolves

Navy Island

Navy Island

Navy Island

Western Passage

Western Passage

Casco Bay Island

Casco Bay Island

Casco Bay Island

Casco Bay Island

Casco Bay Island

Indian Point

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

intertidal

intertidal

intertidal

intertidal

intertidal

intertidal

intertidal

intertidal

intertidal

intertidal

intertidal

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

boat dive

rocky intertidal

rocky intertidal

rocky intertidal

trawl

trawl

boat dive

boat dive

boat dive

boat dive

boat dive

rocky intertidal

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

45.08

45.08

45.08

45.08

44.97

44.97

44.97

44.97

44.97

44.97

44.97

44.97

44.97

44.97

44.97

44.95

44.95

44.95

44.95

44.95

44.95

45.06

45.06

45.06

44.95

44.95

44.96

44.96

44.96

44.96

44.96

45.07

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

-93.87

-93.87

-93.87

-93.87

-93.87

-93.87

-93.87

-93.87

-93.87

-93.87

-93.87

-67.08

-67.08

-67.08

-67.08

-66.91

-66.91

-66.91

-66.91

-66.91

-66.91

-66.91

-66.91

-66.91

-66.91

-66.91

-66.73

-66.73

-66.73

-66.73

-66.73

-66.73

-67.06

-67.06

-67.06

-67.02

-67.02

-66.93

-66.93

-66.93

-66.93

-66.93

-67.04

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

0

0

0

0

0

0

0

0

0

0

0

20

20

20

20

20

20

20

20

20

20

20

9

9

9

9

9

9

0

0

0

9

9

9

9

9

9

9

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

Page 75: Examining patterns of genetic ... - University of Guelph

67

11ECMOL-0401

11ECMOL-0402

11ECMOL-0403

11ECMOL-0404

11ECMOL-0405

11ECMOL-0406

11ECMOL-0407

11ECMOL-0408

11ECMOL-0409

11ECMOL-0410

11ECMOL-0411

11ECMOL-0412

11ECMOL-0413

11ECMOL-0414

11ECMOL-0415

11ECMOL-0416

11ECMOL-0417

11ECMOL-0418

11ECMOL-0419

11ECMOL-0420

11ECMOL-0421

11ECMOL-0422

11ECMOL-0423

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

NS

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

Bedford Basin

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

44.69

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

-63.64

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

Table D.2. Detailed collection information for all 196 Macoma balthica specimens in this study.

* denotes GenBank specimens.

Sample ID Country Province Region Sector Site Descrip. Lat Long Depth

11BIOAK-0439

11BIOAK-0424

11BIOAK-0330

11BIOAK-0328

11BIOAK-0326

11BIOAK-0248

11BIOAK-0245

11BIOAK-0244

11BIOAK-0243

11BIOAK-0416

11BIOAK-0455

11BIOAK-0454

11BIOAK-0453

11BIOAK-0452

11BIOAK-0451

11BIOAK-0450

11BIOAK-0449

11BIOAK-0448

11BIOAK-0447

11BIOAK-0446

11BIOAK-0445

11BIOAK-0444

11BIOAK-0443

11BIOAK-0442

11BIOAK-0441

11BIOAK-0440

11BIOAK-0438

11BIOAK-0437

11BIOAK-0436

11BIOAK-0435

11BIOAK-0434

11BIOAK-0433

11BIOAK-0432

11BIOAK-0431

11BIOAK-0430

11BIOAK-0429

11BIOAK-0428

11BIOAK-0427

11BIOAK-0426

11BIOAK-0425

11BIOAK-0423

11BIOAK-0422

11BIOAK-0421

11BIOAK-0420

11BIOAK-0419

11BIOAK-0418

11BIOAK-0417

10BCMOL-00310

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

USA

Canada

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

AK

BC

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Cook Inlet

Haida Gwaii

Jakalof Bay

Jakalof Bay

Kasitsna Bay

Kasitsna Bay

Kasitsna Bay

China Poot

China Poot

China Poot

China Poot

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

Jakalof Bay

mud flats

mud flats

rocky intertidal

rocky intertidal

rocky intertidal

gravel mud flat

gravel mud flat

gravel mud flat

gravel mud flat

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

mud flats

59.47

59.47

59.47

59.47

59.47

59.57

59.57

59.57

59.57

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

59.47

-151.54

-151.54

-151.55

-151.55

-151.55

-151.30

-151.30

-151.30

-151.30

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

-151.54

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

Page 76: Examining patterns of genetic ... - University of Guelph

68

10BCMOL-00312

10BCMOL-00314

10BCMOL-00315

10PROBE-106310

10PROBE-106300

10PROBE-106290

10PROBE-106280

10PROBE-106270

10PROBE-106260

10PROBE-106250

10PROBE-106240

10PROBE-106230

10PROBE-106220

10PROBE-106210

10PROBE-106200

10PROBE-105590.1

10PROBE-105600.1

10PROBE-105620.1

10PROBE-105710.1

10PROBE-106360

10PROBE-106340

10PROBE-106330

11BFMOL-0277

11BFMOL-0274

11BFMOL-0220

11BFMOL-0279

11BFMOL-0278

11BFMOL-0276

11BFMOL-0275

11BFMOL-0273

11BFMOL-0271

11BFMOL-0270

11BFMOL-0269

11BFMOL-0268

11BFMOL-0266

11BFMOL-0265

11BFMOL-0264

11BFMOL-0263

11BFMOL-0262

11BFMOL-0261

11BFMOL-0260

11BFMOL-0259

11BFMOL-0258

11BFMOL-0257

11BFMOL-0256

11BFMOL-0255

11BFMOL-0254

11BFMOL-0253

11BFMOL-0252

11BFMOL-0251

11BFMOL-0250

11BFMOL-0248

11BFMOL-0247

11BFMOL-0245

11BFMOL-0244

11BFMOL-0243

11BFMOL-0242

11BFMOL-0241

11BFMOL-0240

11BFMOL-0239

11BFMOL-0238

11BFMOL-0236

11BFMOL-0235

11BFMOL-0234

11BFMOL-0233

11BFMOL-0231

11BFMOL-0228

11BFMOL-0227

11BFMOL-0225

11BFMOL-0224

11BFMOL-0223

11BFMOL-0222

11BFMOL-0221

11MMMOL-00050

11MMMOL-00046

11MMMOL-00045

11MMMOL-00044

11MMMOL-00043

11MMMOL-00042

11MMMOL-00031

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

BC

BC

BC

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NB

NFLD

NFLD

NFLD

NFLD

NFLD

NFLD

NFLD

Haida Gwaii

Haida Gwaii

Haida Gwaii

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

Churchill

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

St. Andrews

Bonne Bay

Bonne Bay

Bonne Bay

Bonne Bay

Bonne Bay

Bonne Bay

Bonne Bay

Churchill River

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Bird Cove

Churchill River

Churchill River

Churchill River

Churchill River

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

Indian Point

dredge

intertidal

intertidal

intertidal

intertidal

intertidal

intertidal

intertidal

mud flats

mud flats

intertidal

intertidal

mud flats

mud flats

mud flats

dredge

dredge

dredge

dredge

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

rocky intertidal

St. Paul's

St. Paul's

DB lagoon

DB lagoon

DB lagoon

DB estuary

DB estuary

58.79

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.77

58.79

58.79

58.79

58.79

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

45.07

-94.21

-93.87

-93.87

-93.87

-93.87

-93.87

-93.87

-93.87

-93.84

-93.84

-93.87

-93.87

-93.84

-93.84

-93.84

-94.21

-94.21

-94.21

-94.21

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

-67.04

15

0

0

0

0

0

0

0

0

0

0

0

15

15

15

15

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

Page 77: Examining patterns of genetic ... - University of Guelph

69

11ECMOL-0037

11ECMOL-0032

11ECMOL-0029

11ECMOL-0028

11ECMOL-0027

11ECMOL-0020

11ECMOL-0017

11ECMOL-0015

11ECMOL-0013

11ECMOL-0011

11ECMOL-0010

11ECMOL-0009

11ECMOL-0008

11ECMOL-0007

11ECMOL-0006

11ECMOL-0005

11ECMOL-0004

11ECMOL-0003

11ECMOL-0001

11ECMOL-0035

11ECMOL-0034

11ECMOL-0036

11ECMOL-0038

11ECMOL-0033

11ECMOL-0026

11ECMOL-0024

11ECMOL-0023

11ECMOL-0022

11ECMOL-0021

11ECMOL-0019

11ECMOL-0018

11ECMOL-0016

11ECMOL-0014

11ECMOL-0012

11ECMOL-0002

*HM756189.1

*HM756187.1

*HM756185.1

*HM756183.1

*HM756181.1

*HM756179.1

*HM756177.1

*HM756175.1

*HM756173.1

*HM756171.1

*HM756188.1

*HM756186.1

*HM756182.1

*HM756180.1

*HM756178.1

*HM756176.1

*HM756174.1

*HM756172.1

*HM756170.1

*AF443220.1

*AF443222.1

*AF443219.1

*AF443221.1

*AY162261.1

*AY162263.1

*AY162260.1

*AY162262.1

*EF044127.1

*EF044125.1

*EF044126.1

*AF443216.1

*AF443218.1

*AF443217.1 *EF044130.1

*EF044132.1

*EF044134.1

*EF044131.1

*EF044129.1

*EF044133.1

*EF044135.1

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

Canada

France

France

France

France

France

France

France

France

France

France

France

France

France

France

France

France

France

France

France

Europe

Europe

Europe

Europe

Europe

Europe

Europe

Europe

Arctic Ocean

Arctic Ocean

Arctic Ocean

Europe

Europe

Europe

Europe

Europe

Europe

Europe

Europe

Europe

Europe

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

PEI

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Wadden Sea

Wadden Sea

Wadden Sea

Wadden Sea

Baltic Sea

Baltic Sea

Baltic Sea

Baltic Sea

Chukchi Sea

Chukchi Sea

Chukchi Sea

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Pinette

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Bay of Biscay

Wadden Sea

Wadden Sea

Wadden Sea

Wadden Sea

Baltic Sea

Baltic Sea

Baltic Sea

Baltic Sea

Chukchi Sea

Chukchi Sea

Chukchi Sea

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

red mud

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

46.05

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

-62.91

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

Page 78: Examining patterns of genetic ... - University of Guelph

70

BCD (i,j) = (

Figure D.1. Calculations for a) Tajima’s D, b) FST and c) Bray Curtis index.

A)

B)

C)

FST = between - within / between

π = mean pairwise difference

S = number of segregating sites

i,j = objects

k = index of variable

y = variable

n = total number of variables

d = distance matrix

Page 79: Examining patterns of genetic ... - University of Guelph

71

Figure D.2. Bray Curtis similarity values between populations of Hiatella arctica A) including potential

cryptic species and B) excluding potential cryptic species (main lineage with shared haplotypes). Values

closer to 1 indicate that two populations are more similar.

A)

B)

Page 80: Examining patterns of genetic ... - University of Guelph

72

Figure D.3. Bray Curtis values of similarity between populations of Macoma balthica based on identified

clusters/provisional species in the NJ tree. Values closer to 1 indicate that two populations are more

similar.

Page 81: Examining patterns of genetic ... - University of Guelph

73

Figure D.4. Hinge dentition in each of the four cryptic lineages of Hiatella arctica. A) the main lineage

has two similar denticles on the left valve (LV) and one denticle on the right valve (RV), B) the New

Brunswick cryptic lineage has two very dissimilar denticles on the LV and one denticle on the RV C) the

first northeast Pacific cryptic lineage has one very prominent tubercle in the LV as well as an elongated

phalange, with a definite denticle in the RV and lastly D) the second northeast Pacific cryptic lineage

lacks any defined dentition. Denticles in the RV are marked with blue arrows and denticles in the LV are

marked with red arrows. An initial examination suggests that shell morphology may be different in each

genetic cluster, but future work will aim to focus on morphometrics and determine whether these traits

show phylogenetic signal.

A) Main Lineage B) New Brunswick Cryptic

C) NE Pacific Cryptic 1 D) NE Pacific Cryptic 2

LV

RV

RV

LV

LV

RV RV

LV

Page 82: Examining patterns of genetic ... - University of Guelph

74

Appendix E

R Code

Chapter 1

Pearson’s Chi-Square for Sequencing Success

chi <- read.csv("chi Dec 5.csv")

chi

chisq.test(chi)

Testing for a correlation between mean GC content and sequence divergence between congenerics

order<-read.csv("GC vs NN R input Nov 28.csv", header=T, sep=",")

order

plot(order,xlab="Mean GC Content (%)",ylab="Mean Nearest Neighbour Distance (K2P)",pch=20,

col="darkgoldenrod2")

abline(lm(mean.NN ~ mean.GC, data= order),col= "red")

legend("topleft",legend=c("Correlation coefficient= 0.51","p= 0.002*"),col="black",box.col="black")

library(boot)

order1<-corr(order)

order1

# Correlations with significance levels

library(Hmisc)

rcorr(order$mean.GC, order$mean.NN, type="pearson")

Constructing rarefaction curves for Bivalvia and Gastropoda

Biv<-read.csv("BivAccumInput Nov 27.csv")

library(picante)

BivMatrix<-sample2matrix(Biv)

BivAccum<-specaccum(BivMatrix, "rarefaction")

plot(BivAccum, xlab="", ylab="", col="cyan4", xlim=c(0,800), ylim=c(0,150), ann=FALSE, ci=0,

lwd=2)

title(main="", col.main=FALSE, sub="", col.sub=FALSE, xlab="Sample", ylab="Species",

col.lab="black")

Gas<-read.csv("GasAccumInput Nov 27.csv")

library(picante)

GasMatrix<-sample2matrix(Gas)

GasAccum<-specaccum(GasMatrix, "rarefaction")

plot(GasAccum, add=T,xlab="", ylab="", col="darkgoldenrod2", xlim=c(0,800), ylim=c(0,150),

ann=FALSE, ci=0, lwd=2)

title(main="", col.main=FALSE, sub="", col.sub=FALSE, xlab="Sample", ylab="Species",

col.lab="black")

legend(5,150, c("Bivalvia","Gastropoda"), lty=c(1,1,1), lwd=c(2,2,2,2,2),

col=c("cyan4","darkgoldenrod2"), box.lwd=0, box.col="white")

Constructing rarefaction curves for Cephalopoda, Polyplacophora and Scaphopoda

Ceph<-read.csv("CephAccumInput Nov 27.csv")

library(picante)

Page 83: Examining patterns of genetic ... - University of Guelph

75

CephMatrix<-sample2matrix(Ceph)

CephAccum<-specaccum(CephMatrix, "rarefaction")

plot(CephAccum, xlab="", ylab="", col="cornflowerblue", xlim=c(0,150), ylim=c(0,25), ann=FALSE,

ci=0, lwd=2)

title(main="", col.main=FALSE, sub="", col.sub=FALSE, xlab="Sample", ylab="Species",

col.lab="black")

Poly<-read.csv("PolyAccumInput Nov 27.csv")

library(picante)

PolyMatrix<-sample2matrix(Poly)

PolyAccum<-specaccum(PolyMatrix, "rarefaction")

plot(PolyAccum, add=T, xlab="", ylab="", col="mediumvioletred", xlim=c(0,150), ylim=c(0,25), ann=

FALSE, ci=0, lwd=2)

title(main="", col.main=FALSE, sub="", col.sub=FALSE, xlab="Sample", ylab="Species",

col.lab="black")

Scaph<-read.csv("ScaphAccumInput Nov 27.csv")

library(picante)

ScaphMatrix<-sample2matrix(Scaph)

ScaphAccum<-specaccum(ScaphMatrix, "rarefaction")

plot(ScaphAccum, add=T,xlab="", ylab="", col="black", xlim=c(0,150), ylim=c(0,25), ann=FALSE,

ci=0, lwd=2)

title(main="", col.main=FALSE, sub="", col.sub=FALSE, xlab="Sample", ylab="Species",

col.lab="black")

legend(5,25, c("Cephalopoda","Polyplacophora","Scaphopoda"), lty=c(1,1,1), lwd=c(2,2,2,2,2),

col=c("cornflowerblue","mediumvioletred","black"), box.lwd=0, box.col="white")

Barcode gap histogram

mean<-read.csv("mean intra R input Nov 27.csv", header=TRUE, sep=",")

mean

hist(mean$Mean.Intra, xaxt='n', col=rgb(0,0,1,0.5), xlab="Sequence Divergence (%),main="",

ylim=c(0,150), xlim=c(0,50), breaks= seq(0,50, by=2), axes = TRUE, plot = TRUE, labels = FALSE)

axis(side= 1, at = seq(0, 50, by=2))

NN<-read.csv("NN R input Nov 27.csv", header=TRUE, sep=",")

NN

hist(NN$NN, add=TRUE, col=rgb(1,0,0,0.5), breaks= seq(0,50, by=2))

legend(30, 130, c("Mean Intra-specific", "Nearest Neighbour"), cex=1.0, bty="n", c(col=rgb(0,0,1,0.5),

col=rgb(1,0,0,0.5)), box.lwd=0, box.col="white")

ANOVA for testing whether mean nearest neighbour distance is equal between genera with ≥ 2 species

genus<-read.csv("NNgenus vs species Nov 27.csv", header=T, sep=",")

genus

boxplot(genus$NN ~ genus$Species, ylab="Mean NN Distance (%)", xlab="Number of Species

Sampled")

legend("topright",legend=c("p-value = 0.069"), box.col="black")

genus.aov<-aov(genus$NN ~ genus$Species)

summary(genus.aov)

Linear regression modeling the relationship between intraspecific divergence and number of individuals

MaxIndi<-read.csv("max intra vs indi Nov 27.csv", header=T, sep=",")

Page 84: Examining patterns of genetic ... - University of Guelph

76

MaxIndi

plot(MaxIndi$Max.Intra ~ MaxIndi$X..Individuals, xlim=c(0,70), ylim=c(0,35), col="mediumvioletred",

xlab="Number of Individuals", ylab="Intra-specific Divergence (%)", bty = "n", pch=20, cex=1.5)

Max1<-lm(MaxIndi$Max.Intra ~ MaxIndi$X..Individuals)

summary(Max1)

abline(Max1, col="mediumvioletred")

MeanIndi<-read.csv("mean intra vs indi Nov 27.csv", header=T, sep=",")

MeanIndi

par(new=TRUE)

plot(MeanIndi$Mean.Intra ~ MeanIndi$X..Individuals, xlim=c(0,70), ylim=c(0,35), axes=FALSE,

col="darkgoldenrod2", xlab="Number of Individuals", ylab="Intra-specific Divergence (%)", bty = "n",

pch=20, cex=1.5)

Mean1<-lm(MeanIndi$Mean.Intra ~ MeanIndi$X..Individuals)

summary(Mean1)

abline(Mean1, col="darkgoldenrod2")

legend(50,35, legend=c("Maximum", "Mean"), box.col="white", text.col=c("black"), pch=c(20,20),

col=c("mediumvioletred","darkgoldenrod2"))

text(62,4,c("R2=-0.006"),col=c("black"), cex=0.75)

text(62,1,c("R2=-0.002"),col=c("black"), cex=0.75)

Chapter 2

Paired t-test for determining whether mean haplotype and nucleotide diversity are equal between species

pop<-read.csv("diversity input POP GEN.csv", header=TRUE, sep=",")

pop

boxplot(pop$Haplotype ~ pop$Species, ylab="Haplotype Diversity", xlab="Species")

t.test(Haplotype~Species, data=pop)

nuc<-read.csv("nucleotide input POP GEN.csv", header=TRUE, sep=",")

nuc

boxplot(nuc$Nucleotide ~ nuc$Species, ylab="Nucleotide Diversity", xlab="Species")

t.test(Nucleotide~Species, data=nuc)

Histogram displaying the range of intraspecific divergence in Hiatella arctica and Macoma balthica

HiDiv <- read.csv("Hia divergences Nov 24.csv", header=TRUE, sep=",")

hist(HiDiv$Divergence, xlab="Sequence Divergence (%)", ylab="Frequency", main="", xlim=c(0,25),

ylim=c(0,13000), col="red")

MacDiv <- read.csv("Mac divergences Nov 24.csv", header=TRUE, sep=",")

hist(MacDiv$Divergence, xlab="Sequence Divergence (%)", ylab="Frequency", main="", xlim=c(0,15),

ylim=c(0,6000),col="blue")