6
The genome of Bacillus coahuilensis reveals adaptations essential for survival in the relic of an ancient marine environment Luis David Alcaraz*, Gabriela Olmedo*, Germa ´ n Bonilla , Rene ´ Cerritos , Gustavo Herna ´ ndez , Alfredo Cruz , Enrique Ramı ´rez § , Catherine Putonti , Beatriz Jime ´ nez* , Eva Martı ´nez*, Varinia Lo ´ pez*, Jacqueline L. Arvizu*, Francisco Ayala*, Francisco Razo*, Juan Caballero*, Janet Siefert , Luis Eguiarte , Jean-Philippe Vielle* , Octavio Martı ´nez* , Valeria Souza , Alfredo Herrera-Estrella* , and Luis Herrera-Estrella* ** Laboratorio Nacional de Geno ´mica para la Biodiversidad (Langebio), *Departamento de Ingenierı´a Gene ´ tica and § Departamentode Biotecnologı´a, Cinvestav, Campus Guanajuato, AP 629 Irapuato, Guanajuato CP36500, Me ´ xico; Departamento de Ecologı´a Evolutiva, Instituto de Ecologı´a, Universidad Nacional Auto ´ noma de Me ´ xico, CU, AP 70-275 Coyoaca ´ n 04510 Me ´ xico D.F., Me ´ xico; Department of Statistics, Rice University, P.O. Box 1892, MS-138, Houston, TX 77251-1892; and Departments of Computer Science and Biology and Biochemistry, 4800 Cullen Boulevard, University of Houston, Houston, TX 77204-5001 Contributed by Luis Herrera-Estrella, January 31, 2008 (sent for review December 10, 2007) The Cuatro Cie ´ negas Basin (CCB) in the central part of the Chihua- han desert (Coahuila, Mexico) hosts a wide diversity of microor- ganisms contained within springs thought to be geomorphological relics of an ancient sea. A major question remaining to be answered is whether bacteria from CCB are ancient marine bacteria that adapted to an oligotrophic system poor in NaCl, rich in sulfates, and with extremely low phosphorus levels (<0.3 M). Here, we report the complete genome sequence of Bacillus coahuilensis, a sporu- lating bacterium isolated from the water column of a desiccation lagoon in CCB. At 3.35 Megabases this is the smallest genome sequenced to date of a Bacillus species and provides insights into the origin, evolution, and adaptation of B. coahuilensis to the CCB environment. We propose that the size and complexity of the B. coahuilensis genome reflects the adaptation of an ancient marine bacterium to a novel environment, providing support to a ‘‘marine isolation origin hypothesis’’ that is consistent with the geology of CCB. This genomic adaptation includes the acquisition through horizontal gene transfer of genes involved in phosphorous utili- zation efficiency and adaptation to high-light environments. The B. coahuilensis genome sequence also revealed important ecological features of the bacterial community in CCB and offers opportuni- ties for a unique glimpse of a microbe-dominated world last seen in the Precambrian. evolution genomic adaptation horizontal gene transfer phosphorus stress sulfolipids T he Cuatro Cie ´negas Basin (CCB) is located in a valley 740 m above sea level in the state of Coahuila, Mexico, that measures 30 km by 40 km and is surrounded by high mountains (3,000 m) (Fig. 1). CCB is an enclosed evaporitic basin that receives 150 mm of annual precipitation. Despite the dry climate of the valley, the CCB harbors an extensive system of springs, streams, and pools (1). The CCB ecosystem is not only characterized by a high endemism of macrooganisms and biodiversity of microorganisms (1, 2), but also by extremely oligotrophic waters that are unable to sustain algal growth, making microbial mats the base of the food web (3). In particular, phosphorus (P) levels in CCB appear to be rather low, because they were below the level of detection of several methods used (0.3 M) and the extremely high biomass C:P and N:P ratios (100 by moles) previously reported for CCB stromatolites (3, 4). Unlike the present sea, the Churince spring water is poor in NaCl and carbonates, but it is rich in sulfates, magnesium, and calcium (4). Characterization of the microbiological diversity by sequencing 16S rRNA genes revealed that nearly half of the phylotypes from the CCB were closely related to bacteria from marine environments (2). Bacillus coahuilensis is a free-living, spore-forming bacteria isolated from the water column of a shallow desiccation lagoon in the Churince system at CCB (4) (Fig. 1 A and B). A molecular phylogenetic analysis of 16S rRNA sequences indicates that B. coahuilensis is closely related to other marine Bacillus spp. (4), in agreement with the theory of an ancient marine origin of these ponds. We sequenced the genome of B. coahuilensis to gain insight on the origin and genomic adaptation of this bacterium to the extremely P-limited oligotrophic environment of the Churince pond in CCB. Results General Genome Features. Sequencing of the B. coahuilensis genome was accomplished by using a hybrid strategy of high-coverage pyrosequencing (29) and a low-coverage Sanger sequencing (6) (supporting information (SI) Table S1). The assembled genome size is 3.35 Megabases (Mb), making it the smallest genome reported to date for Bacillus spp. (Fig. 1C). Its genome is composed of a predicted 3,640 coding genes (5) and has a GC content of 37.5% (Table S2), with an average 1,100 orthologues shared with all 14 Bacillus genomes currently available (including Bacillus sp. NRRL B-14911, a recently sequenced marine Bacillus from the Gulf of Mexico) and 905 unique B. coahuilensis genes. By using a phylogenomic approach (6), we reconstructed a phylogeny through a concatenated alignment of 20 universally distributed genes from the Clusters of Orthologous Groups (COGs) that are considered largely unsusceptible to horizontal gene transfer (6), and that are present in the 14 Bacillus spp. genomes (Table S3). This analysis revealed that B. coahuilensis is basal to the other 14 Bacillus spp. genomes and more related to the marine strain NRRL B-14911 than any other Bacillus spp. Additionally, the short phy- logenetic branch length suggests that the B. coahuilensis genome is associated with a low substitution rate compared with the other genomes, despite its small size (Fig. 1C). An extensive 16S analysis shows that B. coahuilensis clearly forms a unique group, but also supports a phylogenetic relation to marine isolates (Fig. S1). However, at the genome level we found no signals that would suggest the presence of bacteria closely related to B. coahuilensis in the Global Ocean Sampling database (7) (data not shown). Taken Author contributions: G.O., C.P., A.H.-E., and L.H.-E. designed research; L.D.A., G.H., A.C., E.R., B.J., E.M., V.L., J.L.A., F.A., and F.R. performed research; L.D.A., G.O., G.B., R.C., A.C., C.P., J.C., J.S., L.E., J.-P.V., V.S., O.M., and L.H.-E. analyzed data; and L.D.A., G.O., and L.H.-E. wrote the paper. The authors declare no conflict of interest. Data deposition: The sequence from this Whole Genome Shotgun project has been depos- ited in the DDBJ/EMBL/GenBank database (accession no. ABFU00000000). The version described in this article is the first version (accession no. ABFU01000000). **To whom correspondence should be addressed. E-mail: [email protected]. This article contains supporting information online at www.pnas.org/cgi/content/full/ 0800981105/DCSupplemental. © 2008 by The National Academy of Sciences of the USA www.pnas.orgcgidoi10.1073pnas.0800981105 PNAS April 15, 2008 vol. 105 no. 15 5803–5808 EVOLUTION Downloaded by guest on March 12, 2020

The genome of Bacillus coahuilensisreveals adaptations ... · The genome of Bacillus coahuilensisreveals adaptations essential for survival in the relic of an ancient marine environment

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: The genome of Bacillus coahuilensisreveals adaptations ... · The genome of Bacillus coahuilensisreveals adaptations essential for survival in the relic of an ancient marine environment

The genome of Bacillus coahuilensis revealsadaptations essential for survival in therelic of an ancient marine environmentLuis David Alcaraz*, Gabriela Olmedo*, German Bonilla†, Rene Cerritos†, Gustavo Hernandez‡, Alfredo Cruz‡,Enrique Ramırez§, Catherine Putonti¶, Beatriz Jimenez*‡, Eva Martınez*, Varinia Lopez*, Jacqueline L. Arvizu*,Francisco Ayala*, Francisco Razo*, Juan Caballero*, Janet Siefert�, Luis Eguiarte†, Jean-Philippe Vielle*‡,Octavio Martınez*‡, Valeria Souza†, Alfredo Herrera-Estrella*‡, and Luis Herrera-Estrella*‡**

‡Laboratorio Nacional de Genomica para la Biodiversidad (Langebio), *Departamento de Ingenierıa Genetica and §Departamento de Biotecnologıa,Cinvestav, Campus Guanajuato, AP 629 Irapuato, Guanajuato CP36500, Mexico; †Departamento de Ecologıa Evolutiva, Instituto de Ecologıa, UniversidadNacional Autonoma de Mexico, CU, AP 70-275 Coyoacan 04510 Mexico D.F., Mexico; �Department of Statistics, Rice University, P.O. Box 1892, MS-138, Houston, TX77251-1892; and ¶Departments of Computer Science and Biology and Biochemistry, 4800 Cullen Boulevard, University of Houston, Houston, TX 77204-5001

Contributed by Luis Herrera-Estrella, January 31, 2008 (sent for review December 10, 2007)

The Cuatro Cienegas Basin (CCB) in the central part of the Chihua-han desert (Coahuila, Mexico) hosts a wide diversity of microor-ganisms contained within springs thought to be geomorphologicalrelics of an ancient sea. A major question remaining to be answeredis whether bacteria from CCB are ancient marine bacteria thatadapted to an oligotrophic system poor in NaCl, rich in sulfates, andwith extremely low phosphorus levels (<0.3 �M). Here, we reportthe complete genome sequence of Bacillus coahuilensis, a sporu-lating bacterium isolated from the water column of a desiccationlagoon in CCB. At 3.35 Megabases this is the smallest genomesequenced to date of a Bacillus species and provides insights intothe origin, evolution, and adaptation of B. coahuilensis to the CCBenvironment. We propose that the size and complexity of the B.coahuilensis genome reflects the adaptation of an ancient marinebacterium to a novel environment, providing support to a ‘‘marineisolation origin hypothesis’’ that is consistent with the geology ofCCB. This genomic adaptation includes the acquisition throughhorizontal gene transfer of genes involved in phosphorous utili-zation efficiency and adaptation to high-light environments. The B.coahuilensis genome sequence also revealed important ecologicalfeatures of the bacterial community in CCB and offers opportuni-ties for a unique glimpse of a microbe-dominated world last seenin the Precambrian.

evolution � genomic adaptation � horizontal gene transfer �phosphorus stress � sulfolipids

The Cuatro Cienegas Basin (CCB) is located in a valley �740 mabove sea level in the state of Coahuila, Mexico, that measures

�30 km by 40 km and is surrounded by high mountains (�3,000 m)(Fig. 1). CCB is an enclosed evaporitic basin that receives �150 mmof annual precipitation. Despite the dry climate of the valley, theCCB harbors an extensive system of springs, streams, and pools (1).The CCB ecosystem is not only characterized by a high endemismof macrooganisms and biodiversity of microorganisms (1, 2), butalso by extremely oligotrophic waters that are unable to sustain algalgrowth, making microbial mats the base of the food web (3). Inparticular, phosphorus (P) levels in CCB appear to be rather low,because they were below the level of detection of several methodsused (0.3 �M) and the extremely high biomass C:P and N:P ratios(�100 by moles) previously reported for CCB stromatolites (3, 4).Unlike the present sea, the Churince spring water is poor in NaCland carbonates, but it is rich in sulfates, magnesium, and calcium(4). Characterization of the microbiological diversity by sequencing16S rRNA genes revealed that nearly half of the phylotypes fromthe CCB were closely related to bacteria from marine environments(2). Bacillus coahuilensis is a free-living, spore-forming bacteriaisolated from the water column of a shallow desiccation lagoon inthe Churince system at CCB (4) (Fig. 1 A and B). A molecular

phylogenetic analysis of 16S rRNA sequences indicates that B.coahuilensis is closely related to other marine Bacillus spp. (4), inagreement with the theory of an ancient marine origin of theseponds. We sequenced the genome of B. coahuilensis to gain insighton the origin and genomic adaptation of this bacterium to theextremely P-limited oligotrophic environment of the Churince pondin CCB.

ResultsGeneral Genome Features. Sequencing of the B. coahuilensis genomewas accomplished by using a hybrid strategy of high-coveragepyrosequencing (29�) and a low-coverage Sanger sequencing (6�)(supporting information (SI) Table S1). The assembled genomesize is 3.35 Megabases (Mb), making it the smallest genomereported to date for Bacillus spp. (Fig. 1C). Its genome is composedof a predicted 3,640 coding genes (5) and has a GC content of 37.5%(Table S2), with an average 1,100 orthologues shared with all 14Bacillus genomes currently available (including Bacillus sp. NRRLB-14911, a recently sequenced marine Bacillus from the Gulf ofMexico) and 905 unique B. coahuilensis genes.

By using a phylogenomic approach (6), we reconstructed aphylogeny through a concatenated alignment of 20 universallydistributed genes from the Clusters of Orthologous Groups (COGs)that are considered largely unsusceptible to horizontal gene transfer(6), and that are present in the 14 Bacillus spp. genomes (Table S3).This analysis revealed that B. coahuilensis is basal to the other 14Bacillus spp. genomes and more related to the marine strain NRRLB-14911 than any other Bacillus spp. Additionally, the short phy-logenetic branch length suggests that the B. coahuilensis genome isassociated with a low substitution rate compared with the othergenomes, despite its small size (Fig. 1C). An extensive 16S analysisshows that B. coahuilensis clearly forms a unique group, but alsosupports a phylogenetic relation to marine isolates (Fig. S1).However, at the genome level we found no signals that wouldsuggest the presence of bacteria closely related to B. coahuilensis inthe Global Ocean Sampling database (7) (data not shown). Taken

Author contributions: G.O., C.P., A.H.-E., and L.H.-E. designed research; L.D.A., G.H., A.C.,E.R., B.J., E.M., V.L., J.L.A., F.A., and F.R. performed research; L.D.A., G.O., G.B., R.C., A.C.,C.P., J.C., J.S., L.E., J.-P.V., V.S., O.M., and L.H.-E. analyzed data; and L.D.A., G.O., and L.H.-E.wrote the paper.

The authors declare no conflict of interest.

Data deposition: The sequence from this Whole Genome Shotgun project has been depos-ited in the DDBJ/EMBL/GenBank database (accession no. ABFU00000000). The versiondescribed in this article is the first version (accession no. ABFU01000000).

**To whom correspondence should be addressed. E-mail: [email protected].

This article contains supporting information online at www.pnas.org/cgi/content/full/0800981105/DCSupplemental.

© 2008 by The National Academy of Sciences of the USA

www.pnas.org�cgi�doi�10.1073�pnas.0800981105 PNAS � April 15, 2008 � vol. 105 � no. 15 � 5803–5808

EVO

LUTI

ON

Dow

nloa

ded

by g

uest

on

Mar

ch 1

2, 2

020

Page 2: The genome of Bacillus coahuilensisreveals adaptations ... · The genome of Bacillus coahuilensisreveals adaptations essential for survival in the relic of an ancient marine environment

together, these results suggest that B. coahuilensis is ancestral to theother sequenced Bacillus spp. and that the presence of this speciesin the CCB did not originate from a recent migration through sporedisbursal from present marine habitats.

Lipid Profile Adaptation to a Low-Phosphorous Environment. Mem-brane phospholipids constitute �30% of the total phosphate inmost organisms. In plants and cyanobacteria subjected to phospho-rous deprivation, phospholipids can be replaced by non-P lipids(such as sulfo- and galactolipids) to maintain membrane function-ality and integrity, and release P to sustain other P-requiring cellularprocesses (8). Interestingly, genes encoding sulfoquinovose syn-thase (sqd1) and glycosyltransferase (sqdX), the two key enzymes inthe synthesis of sulfolipids, are present in B. coahuilensis. Becausethe sulfoquinovose synthesis operon is absent in all other knownBacillus spp. genomes, this finding suggests that the adaptation ofB. coahuilensis to the extremely low P concentration of CCBincludes the acquisition of these genes through horizontal genetransfer (HGT). The B. coahuilensis genes are closely related tocyanobacterial sqd1 and sqdX (Fig. 2A and Fig. S2), and the operonarrangement is identical to that in Synechococcus sp. PC7942, where

these genes participate in the synthesis of sulfolipids (9) (Fig. 2B).The structural prediction of the B. coahuilensis Sqd1 protein isremarkably similar to the crystallized structure of Arabidopsisthaliana SQD (1.2 Å; Fig. 2C); in particular, residues important forthe interactions with NAD� and UDP-ribose are conserved in B.coahuilensis. Reverse transcription PCR (RT-PCR) experimentsrevealed that, in contrast to plants and cyanobacteria, sqd1 isconstitutively expressed and not induced by P limitation (Fig. 2D).Thin layer chromatography revealed a spot in B. coahuilensis lipidextracts with migration similar to that of pure sulfolipids, compa-rable to that present in A. thaliana and in a cyanobacteria spp.(isolated from CCB) but absent in lipid preparations from Bacillussp. NRRL B-91411 (Fig. 2E). Mass spectrometry analysis con-firmed the presence of sulfolipids in B. coahuilensis (Fig. 2F and Fig.S3). The remarkable acquisition of constitutively expressed genesallowing B. coahuilensis to replace membrane phospholipids withsulfolipids is in agreement with genomic adaptations to extremephosphate limitation.

Light Sensing in a High-Radiation Environment. The presence ofgenes encoding Bacteriorhodopsin (BR) in B. coahuilensis is rem-iniscent of the abundance of BR genes in marine environmentalsamples (7), suggesting an additional adaptation of marine bacteria.The phylogeny of B. coahuilensis sensory BR (BSR) showed that itsclosest orthologue is the Anabaena sp. PCC7120 rhodopsin (ASR)(Fig. 3A). ASR was the first eubacterial SR identified (10) andsuggested to function as a photosensory receptor. The structuralprediction of BSR and its comparison with crystallized ASR and SRof Natronomonas pharaonis (11, 12) indicates that all three proteinscontain seven conserved transmembrane helices and a Lys residuethat binds retinal (Fig. 3B). Alignment of ASR and BSR shows thatboth have a Pro residue instead of the Asp (BSR203 and ASR206)residue present in all other microbial rhodopsins (10). Evidence forHGT of rhodopsins has recently been obtained from whole-genomesequencing and metagenomic projects, and is now thought to be afrequent event in marine bacteria from the photic zone and extremesaline environments (13–15). The retinal chromophore of rhodop-sin is synthesized as a cleavage product of carotenoids; thus, thecombination of carotenoid synthesis and rhodopsin genes has beensuggested to be sufficient for rhodopsin function (14). The genomeof B. coahuilensis also contains genes encoding crtB (phytoenesynthase) and crtICA2 (phytoene dehydrogenases) that could beinvolved in retinal biosynthesis (Fig. S4). RT-PCR experimentsshowed that the expression of bsr is constitutive and not light-dependent (Fig. 3C). The high-radiation exposure prevalent inshallow waters of CCB could explain the selection pressure respon-sible for the maintenance and constitutive expression of the bsrgene.

Nucleotide composition analysis (16) identified numerousgenomic islands containing genes likely to be acquired by HGT;several have been annotated as HGT genes in B. subtilis and B.halodurans (Fig. S5). Interestingly, our nucleotide compositionanalysis did not identify bsr and sqd1 as HGT genes. We alsodetermined the Codon adaptation index (CAI) (Fig. S6) for bsr andsqd1 which is of 0.719 and 0.735, respectively, indicating that thesegenes have been present in B. coahuilensis long enough to undergoamelioration to the average CAI (0.714).

Nitrogen Cycle Strategies and Feeding Capabilities. To gain insightinto the nutritional requirements and metabolic capabilities of B.coahuilensis within the CCB microbial community, we analyzedgenes involved in ABC transport systems, one of the largestparalogous families present in the genome of this bacterium.Hidden Markov model (HMM) profiles were built for each ABCimporter family (17) and searched against 11 Bacillus spp. genomes.The analysis retrieved a total of 1,038 import systems. The two mostabundant families among all Bacillus spp. were metal (MET) andosmoprotectant (OTCN) importers (Fig. S6), which seems to be a

N 26°54'

N 26°50'

N 26°46'

W 102°20' W 102°12' W 102°04' W 102°56'

N 26°58'

e

d

ar

rei

S

so

cr

aM

na

S

BCC

N

S

W E

B. anthracis Sterne

B. anthracis AMES

B. thuringiensis konkukian

B. cereus ATCC 14579

B. halodurans

B. clausii

O. iheyensis

B. licheniformis 14580

B. licheniformis

B. subtilis

G. kaustophilus

B. sp. NRRL B-14911

B. coahuilensis

Listeria monocytogenes

Listeria inocua

100

100

100

100

96

92

97

81

60

71

54

96

0.05

Megabases

3535353535434435464643524537

GC %

B. cereus E33L

56

50

48

46

45

43

37

BA

C

0 1 2 3 4 5 6

Fig. 1. Marine origin of B. coahuilensis, isolated from a pond in the Chihua-huan desert. (A) Sierra de San Marcos is a prominent mountain system in themiddle of CCB where �400 ponds sustain most of the biodiversity. Thegeomorphological origin of CCB has been recently reviewed (2). (B) Churincesystem (shown with a red triangle in A) consists of a springhead that feeds a2-km-long stream with an intermediate lagoon terminating at a large shallowdesiccation lagoon. (C) Phylogenomic reconstruction. Maximum likelihoodphylogenomic reconstruction by using Tree-Puzzle (29) was carried out with20 universally conserved COGs from the sequenced Bacillus spp. and closelyrelated species (Table S3). Maximum likelihood bootstrap percentage supportvalues are only indicated for major nodes; numbers in red represent tentativetimescale, in million years, calculated with the method proposed by Battistuzziet al. (30). Genome size (represented as bars) and GC content of each Bacillusspp. genome is shown to the right.

5804 � www.pnas.org�cgi�doi�10.1073�pnas.0800981105 Alcaraz et al.

Dow

nloa

ded

by g

uest

on

Mar

ch 1

2, 2

020

Page 3: The genome of Bacillus coahuilensisreveals adaptations ... · The genome of Bacillus coahuilensisreveals adaptations essential for survival in the relic of an ancient marine environment

characteristic for the group. B. coahuilensis returned 63 importsystems, thus being one of the genomes with a less absolute numberof import systems (Table S4), below the expected values for thegroup. Nonetheless, the best represented families in B. coahuilensisare also MET and OTCN. B. coahuilensis possesses a very highproportion of iron-siderophore (ISVH) importers relative to ge-nome size, a feature shared with the water-column marine bacilliOceanobacillus iheyensis, and Bacillus sp. NRRL B-14911. In addi-tion, an operon coding for the ferric-enterobactin synthesis andtransport system fepBCD is shared with O. iheyensis. It also encodesan Iron(III)-dicitrate and ferric enterobactin transporters, suggest-ing that marine bacilli and B. coahuilensis are actively scavenging foriron.

The proportion of polar amino acid and opine (PAO), as well asD-L-Methionine (DLM) importers relative to the total number oftransporters and genome size is greater in B. coahuilensis than inany other Bacillus. It also has one of the lowest ratios of oligopeptide(OPN) importers, suggesting a specialization for the preferential

acquisition of single amino acids over oligopeptides. Experimentalresults show that B. coahuilensis has an absolute requirement for 8aa (four polar and three hydrophobic) and a partial requirement foranother 5 aa (three hydrophobic and two polar), confirming thatthis bacterium depends on amino acid import (Table S4 and Fig.S6). This overrepresentation of single-amino acid importers isshared with the recently sequenced water-living beta-proteobacte-ria Minibacterium massiliensis (18), suggesting that this featuremight be common in the reduced genomes of aquatic free-livingbacteria.

HMM profiles were built for 86 genes involved in the N2 cycle(data not shown) and were searched against all sequenced Bacillusspp. genomes. This revealed that B. coahuilensis has the lowestnumber of N2 cycle enzymes of all sequenced Bacillus spp., lackingan inorganic N pathway and most of the urea cycle and ureadegradation pathways (Fig. 4). It only possesses the canonicaltransaminases, an ammonia transporter and a D-Ala transporter,thus not being capable of performing the entire N cycle on its own.

Tyr 200/182

His 201/183

Thr 163/145Lys 204/186

B. coahuilensis SQD1 model

Arabidopsis thaliana SQD1 crystal structure

UDP-glucoseNAD

SO4

C

BBacillus coahuilensis

Synechoccocus sp.

1 2 3 4 5 6

MGDG

SQDGDGDG

1 2 3 4

D

E

A

Oriza sativa

Synechococcus sp. WH 8102100

95

45

52

100

100

78

99

45

59

100

100

60

79

100

99

99

98

56

10098

97

43

32

0.1

FirmicutesCyanobacteriaActinobacteriaArchaeaNon cultivablePlants and Algae

sqd1 sqdX egsAppiB

sqd1 sqdX

F

Nostoc sp. PCC 7120Anabaena variabilis ATCC 29413Nostoc punctiforme PCC 73102

Thermosynechococcus elongatus BP-1Synechocystis sp. PCC 6803

Crocosphaera watsonii WH 8501Trichodesmium erythraeum IMS101Synechococcus sp. JA-3-3Ba

Chlamydomonas reinhardtiiMedicago trunculata

Spinacia oleraceaChainA crystal structure Sqd1 A. thalianaArabidopsis thaliana

Bacillus coahuilensisRubrobacter xylanophilus

Syntrophobacter fumaroxidans MPOBThermoplasma volcanium GSS1

T. acidophilum

Prochlorococcus marinus NA TL2P. marinus pastoris

P. marinus MIT 9312

Mycobacterium sp. JLSgi|43443278gi|43177758gi|44133788gi|43837657gi|43443861gi|43553861gi|43089039gi|43851695gi|43200131

Haloarcula marismortui ATCC 43049

Synechococcus sp. JA-3-3Ab

ChainA crystal structure Sqd1 A. thaliana

P. marinus MIT 9211

9-Me-14:0

15-Me-16:0

100

%

C2 569.19

C1 597.23

A270.98

B 329.01

0 250 500 750 900m/z

-H

C1

B

C2

-H

SO2Na2+

598.23

838.5 841.44

840.45

839.46

Fig. 2. Acquisition of sulfoquinovose synthesis capabilities through HGT, an adaptation of B. coahuilensis to a phosphorus-limiting environment. (A)Neighbor-joining phylogenetic reconstruction of SQD1 (31). (B) The operon structure of the sqd1 and sqdX resembles that of the Synecchococcus genes (9).Flanking genes are ppi (peptidyl-prolyl cis-trans isomerase B) and egsA (glycerol-1-phosphate dehydrogenase) (not to scale). (C) Modeling (32) to the A. thalianaSQD (PDB ID code 1I24) protein shows conservation of the NAD and UDP-glucose-binding residues. Diagrammic representation was done by using PyMol(http://www.pymol.org). (D) Expression of the sqd1 gene. RT-PCR was carried out from RNA of B. coahuilensis grown under different phosphate concentrations(lanes 1 to 5, RT-PCR products obtained from cells cultured with 0.001, 0.005, 0.05, 0.5, and 5 mM phosphate, respectively; lane 6, control without reversetranscriptase). (E) TLC analysis reveals the presence of a probable SQDG band in B. coahuilensis, A. thaliana, Cyanobacteria sp. (lanes 1 to 3) but absent in Bacillussp. NRRL B-14911 (lane 4). (F) Mass spectroscopic analysis confirmed the identity of the sulfoquinovoside (Fig. S3).

Alcaraz et al. PNAS � April 15, 2008 � vol. 105 � no. 15 � 5805

EVO

LUTI

ON

Dow

nloa

ded

by g

uest

on

Mar

ch 1

2, 2

020

Page 4: The genome of Bacillus coahuilensisreveals adaptations ... · The genome of Bacillus coahuilensisreveals adaptations essential for survival in the relic of an ancient marine environment

B. coahuilensis codes for rocF, an arginase mainly found in Firmi-cutes, but not for any other urea-cycle enzyme. All sequencedBacillus spp. code for a complete urea cycle (argFGH and rocF),with the notable exception of O. iheyensis. Moreover, growth of B.coahuilensis is impaired, but not completely arrested, when grownwithout arginine, and this amino acid is one of those transported bythe overrepresented import family PAO. B. coahuilensis also lacksthe other gene that could provide a source of arginine fromcitrulline, arcA. This suggests that arginine is an important aminoacid in this bacterium, and that it is most likely synthesizing arginineby employing an unknown pathway. It was reported for example,that Pseudomonas spp. synthesizes arginine by an arginine racemaseand a D-arginine dehydrogenase (19).

In summary, B. coahuilensis exhibits numerous auxotrophies andhigh dependence on the N2 cycle, which is mainly carried out in theCCB ponds by cyanobacteria, and is therefore also likely to behighly dependent on the microbial community within this environ-ment. This is in contrast with the genome from the marine strainNRRL B-14911, which exhibits partial requirement for only 2 aa

and appears well suited to perform most of its N2 cycle on its own(Fig. 4 and Table S5).

Genome Size Evolution and Essential Genes. The basal position of B.coahuilensis (Fig. 1C) suggests that it is ancestral to the othersequenced Bacillus spp. and related species. Although most of theessential genes (20) of B. subtilis are conserved in B. coahuilensis,many of those with redundant functions are absent (Table S6, TableS7, and Table S8). Genes for endospore formation are conserved,but the number of genes encoding components of the spore coat aresignificantly underrepresented (Table S6), although this does notcompromise formation of heat-resistant spores. We also carried outan analyses of the largest paralogous gene families within B.coahuilensis compared with those in B. subtilis (21), as a means ofidentifying important functions that are maintained in a smallgenome. Although B. subtilis has 13 paralogous gene families of �5members involved in secondary metabolites biosynthesis, transport,and catabolism, these are almost completely absent in B. coahuilen-sis (COG Q category, 1 gene family of �5 members). Signaltransduction, in contrast, seems to be overrepresented in B.coahuilensis (COG T category, 57 gene families of �5 members)compared with the same category in B. subtilis (1 gene family of �5members) suggesting that environmental monitoring is key tosurvival in the CCB environment (Fig. S7). Analysis of all knownB. subtilis genes involved in the synthesis of cell wall and membranecomponents reveals that B. coahuilensis also lacks genes necessaryfor the synthesis of polyphosphate-rich teichoic and polyanionteichuronic acids, useful cell wall phosphorus storage compounds.Because teichoic acid synthesis genes are not only present in severalBacillus species, but also in Listeria spp. which are used as anoutgroup for comparing the Bacillus genus (Table S7), it is likelythat B. coahuilensis lost the capacity to produce teichoic acidbecause of the P-limited environment of CCB. Of the 45 genes that

A Uncultured rhodopsins

Candidatus Pelagibacter ubique Bop

Uncultured Bop

Uncultured Bop

Exiguobacterium sibiricum Bop

Pyrocistis lunula rhodopsin

Salinibacter ruber Xop

Gleobacter violaceus Sop1

Chlamydomonas reinhardtii Sop2

C. reinhardtii Sop1

Neurospora crassa Sop

Nostoc sp. PCC 7120 Sop

Bacillus coahuilensis Sop

Salinibacter ruber Sop1

Salinibacter ruber Sop2

Halorubrum xinjiangense Bop

Rubrobacter xylanophilus Bop

Salinobacter ruber Hop

Kineococcus radiotolerans Bop

Haloarcula marismortui Sop1

Natronomonas pharaonis Sop1

Haloarcula marismortui Sop1

Haloquadratum walsbyi Bop1

Haloarcula marismortui Bop precursor

Haloquadratum walsbyi Bop2

Haloarcula marismortui Bop

Natronomonas pharaoinis Hop

Haloquadratum walsbyi Hop

Haloarcula marismortui Hop

Natronomonas pharaonis Sop1 (non-functional)

Haloarcula marismortui Bop related

Salinibacter ruber bop related

57

46

100

100

100

67

100

99

77

38

72

89

47

52

47

36

12

98

85

94

58

37

48

46

24

32

37

27

20

16

84

0.2

Proton-Pumprhodopsins

Halorhodopsins

Sensoryrhodopsins

BSR 64 TTIYYARYIDWVISTPLLLAALALTAMFGGKKNASR 66 QIAHYARYIDWMVTTPLLLLSLSWTAMQFIKKD

* ** ***

BSR 156 TRLARHYTRVAIYLSVLWVCYPTAWLLGPSGLGASR 159 SELANLYDKLVTYFTVLWIGYPIVWIIGPSGFG

*** *

ASR 192 WINQTIDTFLFCLLPFFSKVGFSFLDLHGLRNLBSR 189 LAQELTEVLVFIILPIFSKVGFSIVDLHGLRKL

* *

Helix C

Helix F

Helix G

SRII 66 RTVFVPRYIDWILTTPLIVYFLGLLAGLDSREF

SRII 154 SGIKSLYVRLRNLTVVLWAIYPFIWLLGPPGVA

SRII 187 LLTPTVDVALIVYLDLVTKVGFGFIALDAAATR

Helix AHelix B

Helix C

Helix D

Helix E

Helix F

Helix G Phe 210/213

Leu 202/205

Lys 207/210

Pro 203/206

B. coahuilensis SOP model

Nostoc sp. PCC 7120 SOP crystal structure

Retinal

C 1 2 3 4

B

Fig. 3. B. coahuilensis contains a sensory bacteriorhodopsin possibly ac-quired from cyanobacteria through an ancient HGT event. (A) Neighbor-joining tree showing the phylogenetic diversity of rhodopsins (31). (B) Bsrpossesses all of the residues involved in retinal binding and was modeled (32)to the predicted structures of Anabaena and Natronomonas pharaonis. Align-ment of segments of SR from B. coahuilensis (BSR, BM4401574), Anabaena sp.PCC7120 (ASR, PDB ID code 1XIO), and Natronomonas pharaonis DSM2160(SRII, PDB ID code 1JGJ) show conservation of residues in the retinal-bindingpocket (marked with asterisks) except for a Pro residue (positions BSR203 andASR206), which is an Asp residue in all other microbial rhodopsins (bluerectangle; alignment adapted from ref. 10). Diagrammatic representationwas done by using PyMol (http://www.pymol.org). (C) bsr is expressed in B.coahuilensis grown either under white light or in the dark. RT-PCR was carriedout by using RNA obtained from bacteria grown under dark or white-lightconditions (lanes 1 and 2, respectively). Lanes 3 and 4 are controls withoutreverse transcriptase.

B. anthracis str. AmesB. cereus E33LB. thuringiensis konkukian

B. licheniformis ATCC14580

B. subtilis str. 168

B. clausii KSM-16

B. halodurans C-125

O. iheyensis HTE831

G. kaustophilus HTA426

B. sp. NRRL B14911

B. coahuilensis

lA

rg

rA

Dp

sA

Ar

aC

Bg

aD

Ac

ra.

Ah

dG

hn l

GA

vlI

EL

ty

Cp

bP

Cg

rA

Fg

rA

Gg

rA

He

rC

As

ca

b.

Ah

dG

um

Hp

rt

NC

Tp

rC

Tp

rF

TyrB g

RI

tu

HH

pa

NA h

Ci

cd

LA i

tih

Cn

ct

NA

co

RF i

hC

Al l

AC

cy

CA

tlG

B ra

NI

As

aN

ta

H ih

Cl

ga

NZ

ra

NGr

aN

H ra

NJ

nlG

Bri

NA

ih

CD

cr

AA o

ce.

Ah

dG

lH

cp

er

UA

er

UB

er

UC

cr

ACr

oN

Br

oN

Z

Fig. 4. B. coahuilensis lacks many of the genes coding for enzymes involvedin nitrogen metabolism. Hidden Markov models for the bacterial enzymesinvolved in N2 metabolism were constructed to detect these genes in all of thesequenced Bacillus spp. Bars in different colors denote the presence of a genepredicted to code for a given enzyme.

5806 � www.pnas.org�cgi�doi�10.1073�pnas.0800981105 Alcaraz et al.

Dow

nloa

ded

by g

uest

on

Mar

ch 1

2, 2

020

Page 5: The genome of Bacillus coahuilensisreveals adaptations ... · The genome of Bacillus coahuilensisreveals adaptations essential for survival in the relic of an ancient marine environment

constitute the PhoP/PhoR regulon in B. subtilis, only 24 are presentin B. coahuilensis (Table S8). B. coahuilensis lacks the pit geneencoding a low-affinity P transporter that is conserved in Listeriaspp. and many Bacillus spp. (except for B. halodurans, B. clausi, andO. iheyensis) suggesting that pit was also lost in B. coahuilensisbecause it provided no advantage in the CCB low-P environment.Interestingly, the pit gene is also absent in the marine cyanobacterialgenomes (22). Overall, several genomic characteristics seem to bethe result of evolutionary mechanisms that caused a reduction ingenome size and selected against redundant genes and genes thatprovided little or no advantage for survival in the oligotrophicenvironment of the CCB pools. However, given the basal positionof B. coahuilensis in the phylogenomic analysis, it is also possiblethat some of the genes it is lacking, but which are present in otherBacillus spp., may represent gains of bacilli species that evolvedlater.

DiscussionP limitation seems to be an important driving force for theadaptations observed in the B. coahuilensis genome. This bacteriumhas the capability of synthesizing membrane sulfolipids, apparentlyacquired by HGT from a cyanobacteria. Sulfolipid SQDG is presentin all higher plants, mosses, ferns, and algae, but it has also beenreported in nonphotosynthetic organisms, predominantly in cya-nobacteria (Fig. 3A). Discovery of this capability in the bacilli isespecially interesting given the context of an environment withextremely limiting P. HGT of sqd1 and sqdX seems to be frequentin cyanobacteria and, because it involves transfer of only a coupleof genes, it seems relatively simple for a bacteria to appropriate thismechanism to cope with P scarceness. Finally, genome-size reduc-tion in an ecosystem with very low P availability, such as the CCBpools, could increase the fitness of a bacterium with a reduced Pdemand for nucleotide synthesis.

The predicted rhodopsin sequence of B. coahuilensis is phylo-genically closer to sequences from cyanobacteria than to thosereported in other bacteria and archeobacteria. Because retinal,which is derived from carotenoids, is probably readily available inmany of the pigmented CCB bacteria, the acquisition of rhodopsingenes from cyanobacteria could easily lead to a new functionaladaptation. B. coahuilensis genome encodes a couple of photolyasesthat repair UV-induced DNA lesions and could explain, along withthe bsr gene, the adaptation of B. coahuilensis to the high-lightfluency in the shallow and clear waters of the CCB ponds.

The incidence of both rhodopsin and sulfolipid biosynthesisgenes in environmental samples is spatially restricted. We searchedthe CAMERA database (23) for homologue genes to the B.coahuilensis’ bsr and sqd1 and found only four sample points withinthe Global Ocean Sampling (GOS) having both genes represented:GS027, GS031, GS031, and GS034, which are located in shallowwaters of the Galapagos Islands. Two important common featuresbetween CCB ponds and the Galapagos Islands are a high-radiationenvironment in shallow waters and the lack of P. The fact that sqd1and bsr coexist in two distinct geographically distant locations withcommon environmental characteristics is suggestive of commonadaptation strategies. P starvation seems to be a general constraintin marine environments, as shown in ref. 7.

Analysis of paralogous gene families revealed a dramatic limi-tation in genes for secondary metabolism in B. coahuilensis possiblyas a result of both genome-size reduction and adaptation to aunique niche (24, 25). The large number of auxotrophies, thelimitation in N2 cycle genes (Fig. 4), and overrepresented signaltransduction genes suggest that this bacterium is a specialist withinthe CCB water systems, dependent and tightly bound to the primaryproducers and prepared for sensing and responding to the extremeseasonal environmental conditions (2). This is in contrast to pan-demic and generalist bacteria like B. subtilis with its arsenal ofsecondary metabolism genes.

Our findings show that HGT played a key role in the adaptationsof B. coahuilensis. We are currently exploring whether transposons(of which there are �20 in B. coahuilensis), phages [known to beabundant in ponds of the CCB (26)], and natural competence couldprovide the mechanisms driving changes in the B. coahuilensisgenome as well as in the other bacteria in the community. We haveisolated numerous cyanobacteria in the Churince pond (27).

The strategy of sequencing a single bacterial isolate to obtaininformation on the adaptations of bacteria living in a highlyoligotrophic environment was highly fruitful because it led to a clearidentification of the phylogenetic affiliation of some ecologicalfunctions. A metagenome approach might have obscured thephylogenetic association of genes of cyanobacterial origin to firmi-cutes and would not have allowed us to observe the importantnitrogen cycle and amino acid synthesis limitations that make thisbacterium dependent on the microbial community. The smallgenome size, constraints in secondary metabolism, and overrepre-sentation of signal transduction genes are also features that couldnot have been deduced from a metagenomic approach. Efforts tostudy the metagenomics of the CCB ponds should be made to helpus understand how these ancient water ecosystems in the middle ofthe desert are self-sufficient in their biogeochemical cycles.

Our results provide evidence that, because of the specific genomedynamics, its ancestry, and the local adaptive response, B.coahuilensis is most likely the result of adaptation of an ancientmarine bacterium to a novel environment. Results that are inagreement with the geology/ontology model that predicts a marineenvironment for this region in the early Jurassic followed by therising of the continent, the formation of the CCB valley, and itsisolation by the surrounding Sierras in the Cretaceous period �70million years ago (2). B. coahuilensis is likely a primitive bacterialcomponent of a complex community that included Archaea andCyanobacteria that provided genomic fodder for gene transfer andthe implementation of innovative and necessary strategies forsurvival in an evolving ecology.

Materials and MethodsStrains. B. coahuilensis was provided by V.S (4) and NRRL B-14911 by J.S. Themedia used for growth are described in SI Text.

Genome Sequencing and Annotation. B. coahuilensis was sequenced by a hybridSanger/454 approach. The entire genome sequence was obtained from a com-bination of 16,698 end sequences (providing 6-fold coverage) from a pUC18genomic shotgun library (2–5 kb), by using dye terminator chemistry on auto-mated DNA sequencers (ABI3700; Applied Biosystems) and 454 technology withseven runs at a 29-fold coverage. Predicted protein-encoding genes were man-ually refined (see SI Text) and automatically annotated by using the BASyssystem (5).

Prediction of Horizontal Transfer. The Similarity Plot (S-plot) application was used(16) to identify windows that contained regions of unusual compositional prop-erties (RUCPs) within B. coahuilensis genome (see SI Text).

RT-PCR. SemiquantitativeRT-PCRswerecarriedoutbyusingSuperScriptOneStepRT-PCR with Platinum Taq (Invitrogen Life Technologies) (see SI Text) from RNAisolated from strains grown in modified marine medium supplemented withphosphate. For light/dark experiments, strain was grown on Petri dishes withmarine medium grown at 37°C either under white or blue light or in the dark.

Lipid Extraction and Analysis. Lipids from Arabidopsis, Cyanobacteria spp., andB. coahuilensis were extracted (details are available on request) and analyzed byusing the TLC technique as described in ref. 28. For lipid footprint analysis,individual lipids were isolated from TLC plates, and duplicates of each lipid spotwere analyzed by MALDI-TOF MS technology (see SI Text).

ACKNOWLEDGMENTS. We thank Laura Espinosa Azuar and Antonio Cruz intechnical assistance at Instituto de Ecologıa, Universidad Nacional Autonomade Mexico; Dr. Michael Travisano (University of Minnesota) and June Simpson(Cinvestav Campus Guanajuato) for their comments on the manuscript. Thiswork was supported in part by Secretarıa de Agricultura, Ganaderıa, Desar-rollo Rural, Pesca y Alimentacion (SAGARPA) Zea-2006 and Consejo Nacional

Alcaraz et al. PNAS � April 15, 2008 � vol. 105 � no. 15 � 5807

EVO

LUTI

ON

Dow

nloa

ded

by g

uest

on

Mar

ch 1

2, 2

020

Page 6: The genome of Bacillus coahuilensisreveals adaptations ... · The genome of Bacillus coahuilensisreveals adaptations essential for survival in the relic of an ancient marine environment

de Ciencia y Tecnologıa (CONACyT)-Secretaria de Education Publica 43979grants and Howard Hughes Medical Institute Grant 55005946 (to L.H.-E.);CONACyT IN Grants 223105 and 44673Q (to V.S. and L.E.); CONACyT-Secretaria

de Medio Ambiente y Recursos Naturales Grant 2002-C01-0237 (to V.S.);Exobiology Grant NG04GJ12G (to J.S.); and graduate scholarships fromCONACyT (to L.D.A., R.C., and J.C.).

1. Minckley WL (1969) Environments of the Bolson of Cuatro Cienegas, Coahuila, Mexico,with Special Reference to the Aquatic Biota.University of Texas: El Paso Science Series(Texas Western Press, El Paso, TX).

2. Souza V, et al. (2006) An endangered oasis of aquatic microbial biodiversity in theChihuahuan desert. Proc Natl Acad Sci USA 103:6565–6570.

3. Elser JJ, et al. (2005) Effects of phosphorus enrichment and grazing snails on modernstromatolitic microbial communities. Freshwater Biol 50:1808–1825.

4. Cerritos R, et al. (2008) Bacillus coahuilensis sp. nov. a new moderately halophilicspecies from different pozas in the Cuatro Cienegas Valley in Coahuila, Mexico. Int JSyst Evol Microbiol 58:919–923.

5. Van Domselaar GH, et al. (2005) BASys: A web server for automated bacterial genomeannotation. Nucleic Acids Res 33:W455–W459.

6. Ciccarelli FD, et al. (2006) Toward automatic reconstruction of a highly resolved tree oflife. Science 311:1283–1287.

7. Rusch DB, et al. (2007) The Sorcerer II Global Ocean Sampling Expedition: NorthwestAtlantic through Eastern Tropical Pacific. PLoS Biol 5:e77.

8. Dormann P, Benning C (2002) Galactolipids rule in seed plants. Trends Plants Sci7:112–118.

9. Benning C (1998) Biosynthesis and function of the sulfolipid sulfoquinovosyl diacyl-glycerol. Annu Rev Plant Physiol Plant Mol Biol 49:53–75.

10. Jung KH, Trivedi VD, Spudich JL (2003) Demonstration of a sensory rhodopsin ineubacteria. Mol Microbiol 47:1513–1522.

11. Vogeley L, et al. (2004) Anabaena sensory rhodopsin: A photochromic color sensor at20 A. Science 306:1390–1393.

12. Luecke H, Schobert B, Lanyi JK, Spudich EN, Spudich JL (2001) Crystal structure ofsensory rhodopsin II at 2.4 angstroms: Insights into color tuning and transducerinteraction. Science 293:1499–1503.

13. Mongodin EF, et al. (2005) The genome of Salinibacter ruber: Convergence and geneexchange among hyperhalophilic bacteria and archaea. Proc Natl Acad Sci USA102:18147–18152.

14. Frigaard NU, Martinez A, Mincer TJ, DeLong EF (2006) Proteorhodopsin lateral genetransfer between marine planktonic Bacteria and Archaea. Nature 439:847–850.

15. Venter JC, et al. (2004) Environmental genome shotgun sequencing of the SargassoSea. Science 304:66–74.

16. Putonti C, et al. (2006) A computational tool for the genomic identification of regionsof unusual compositional properties and its utilization in the detection of horizontallytransferred sequences. Mol Biol Evol 23:1863–1868.

17. Bouige P, Laurent D, Piloyan L, Dassa E (2002) Phylogenetic and functional classificationof ATP-binding cassette (ABC) systems. Curr Protein Pept Sci 3:541–559.

18. Audic S, et al. (2007) Genome analysis of Minibacterium massiliensis highlights theconvergent evolution of water-living bacteria. PLoS Genet 3:e138.

19. Jann A, Matsumoto H, Haas D (1988) The fourth arginine catabolic pathway ofPseudomonas aeruginosa. J Gen Microbiol 134:1043–1053.

20. Kobayashi K, et al. (2003) Essential Bacillus subtilis genes. Proc Natl Acad Sci USA100:4678–4683.

21. Pushker R, Mira A, Rodriguez-Valera F (2004) Comparative genomics of gene-familysize in closely related bacteria. Genome Biol 5:R27.

22. Su Z, et al. (2003) Computational inference of regulatory pathways in microbes: Anapplication to phosphorus assimilation pathways in Synechococcus sp. WH8102. Ge-nome Inform 14:3–13.

23. Seshadri R, Kravitz SA, Smarr L, Gilna P, Frazier M (2007) CAMERA: A communityresource for metagenomics. PLoS Biol 5:e75.

24. Mira A, Pushker, R (2005) The silencing of pseudogenes. Mol Biol Evol 22:2135–2138.25. Mira A, Pushker R, Rodriguez-Valera F (2006) The Neolithic revolution of bacterial

genomes. Trends Microbiol 14:200–206.26. Desnues C, et al. (2008) Biodiversity and biogeography of phages in modern stroma-

tolites and thrombolites. Nature 452:340–343.27. Falcon LI, Cerritos R, Eguiarte LE, Souza, V (2007) Nitrogen fixation in microbial mat and

stromatolite communities from Cuatro Cienegas. Mexico Microb Ecol 54(2):363–373.28. Hartel H, Dormann P, Benning C (2000) DGD1-independent biosynthesis of extraplas-

tidic galactolipids after phosphate deprivation in Arabidopsis. Proc Natl Acad Sci USA97:10649–10654.

29. Schmidt HA, Strimmer K, Vingron M, von Haeseler A (2002) TREE-PUZZLE: Maximumlikelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics18:502–504.

30. Battistuzzi FU, Feijao A, Hedges SB (2004) A genomic timescale of prokaryote evolu-tion: Insights into the origin of methanogenesis, phototrophy, and the colonization ofland BMC. Evol Biol 4:44.

31. Kumar S, Tamura K (2004) Nei M MEGA3: Integrated software for molecular evolu-tionary genetics analysis and sequence alignment. Brief Bioinform 5:150–163.

32. Schwede T, Kopp J, Guex N, Peitsch MC (2003) SWISS-MODEL: An automated proteinhomology-modeling server. Nucleic Acids Res 31:3381–3385.

5808 � www.pnas.org�cgi�doi�10.1073�pnas.0800981105 Alcaraz et al.

Dow

nloa

ded

by g

uest

on

Mar

ch 1

2, 2

020