Methylotrophy describes the ability of organisms to grow on reduced organic compounds without carbon-carbon bonds. The genomes of two pink-pigmented facultative methylotrophic bacteria of the Alpha-proteobacterial genus Methylobacterium, the reference species Methylobacterium extorquens strain AM1 and the dichloromethane-degrading strain DM4, were compared.
The 6.88 Mb genome of strain AM1 comprises a 5.51 Mb chromosome, a 1.26 Mb megaplasmid and three plasmids, while the 6.12 Mb genome of strain DM4 features a 5.94 Mb chromosome and two plasmids. The chromosomes are highly syntenic and share a large majority of genes, while plasmids are mostly strain-specific, with the exception of a 130 kb region of the strain AM1 megaplasmid which is syntenic to a chromosomal region of strain DM4. Both genomes contain large sets of insertion elements, many of them strain-specific, suggesting an important potential for genomic plasticity. Most of the genomic determinants associated with methylotrophy are nearly identical, with two exceptions that illustrate the metabolic and genomic versatility of Methylobacterium. A 126 kb dichloromethane utilization (dcm) gene cluster is essential for the ability of strain DM4 to use DCM as the sole carbon and energy source for growth and is unique to strain DM4. The methylamine utilization (mau) gene cluster is only found in strain AM1, indicating that strain DM4 employs an alternative system for growth with methylamine. The dcm and mau clusters represent two of the chromosomal genomic islands (AM1: 28; DM4: 17) that were defined. The mau cluster is flanked by mobile elements, but the dcm cluster disrupts a gene annotated as chelatase and for which we propose the name “island integration determinant” (iid).
These two genome sequences provide a platform for intra- and interspecies genomic comparisons in the genus Methylobacterium, and for investigations of the adaptive mechanisms which allow bacterial lineages to acquire methylotrophic lifestyles.
Citation:Vuilleumier S, Chistoserdova L, Lee M-C, Bringel F, Lajus A, et al. (2009) Methylobacterium Genome Sequences: A Reference Blueprint to Investigate Microbial Metabolism of C1 Compounds from Natural and Industrial Sources. PLoS ONE 4(5): e5584. doi:10.1371/journal.pone.0005584
Editor: Niyaz Ahmed, University of Hyderabad, India
Received: January 24, 2009; Accepted: March 30, 2009; Published: May 18, 2009
Copyright: © 2009 Vuilleumier et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding:SV was supported by a CNRS ATIP career development award, a CNRS USA mobility grant and a Génoscope sequencing grant, and also by the Integrated Computational Genomics Resources of the Swiss Institute of Bioinformatics (RITA-CT-2006-026204) together with MP. Annotation was supported by a grant from MRT/ANR PFTV 2007, MicroScope project. CP, EH and EM were supported by PhD grants from the Government of Luxembourg, the French Ministry of Research and Région Alsace, respectively. RP was supported by ETH (ETH-25 08-2). ML, DGR, and CJM were supported by NSF (IOB-0612591), the Clarke-Cooke Fund, and the Harvard University Microbial Sciences Initiative. LC and MEL were supported by NIH (GM 58933). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Pink-pigmented facultative methylotrophs of the genus Methylobacterium are ubiquitous in soil, air and water environments . The common trait of all Methylobacterium species is the ability to grow on one or several reduced one carbon (C1) compounds other than methane, most prominently methanol, which is a major volatile organic compound emitted by vegetation . Accordingly, strains of Methylobacterium are often found in association with plants, either involved in bona fide symbioses as endophytes, or as epiphytes on leaf surfaces –. The potential of strains from this genus to provide biotechnological products of high added value has attracted sustained scientific attention , .
Of all Methylobacterium strains, M. extorquens strain AM1 (formerly Pseudomonas AM1, Methylobacterium sp. AM1) is the best studied, and has served as a model organism for over four decades. It was first isolated in 1960 in Oxford, England, as an airborne contaminant growing on methylamine . It was then used as a workhorse to characterize the serine cycle for assimilation of the C1-unit of methylene tetrahydrofolate, a central intermediate in methylotrophic metabolism, and more recently the ethylmalonyl-CoA pathway for glyoxylate regeneration , – (Fig. 1). Enzymatic systems for oxidation of both methanol ,  and methylamine , which involve the use of specific cofactors pyrroloquinoline quinone (PQQ) and tryptophan tryptophylquinone (TTQ), respectively , were characterized in strain AM1. Bacterial tetrahydromethanopterin (H4MPT)-dependent enzymes, now known to occur in most methylotrophs ,  but originally thought to be unique to archaeal methanogens  were also first demonstrated in this strain. In Methylobacterium, the H4MPT-dependent pathway has been shown to play a major role in both energy generation and protecting cells from formaldehyde poisoning . As to the analogous tetrahydrofolate (H4F)-linked pathway that involves two enzymes encoded by mtdA and fch, also first discovered in this organism , , its major role in assimilatory metabolism was recently identified in supplying C1 units into the serine cycle ,  (Fig. 1).
Full lines, H4MPT-dependent pathway, H4F-dependent pathway, serine cycle and ethylmalonyl-CoA pathway for glyoxylate regeneration; broken line, tricarboxylic acid cycle reactions (2-oxoglutarate dehydrogenase activity (and a dissimilatory TCA cycle) are not essential for methylotrophic growth , ). Key pathway outputs  used for carbon assimilation (biomass production) are shown in bold italics. Genes involved in the serine cycle, the TCA cycle and in the ethylmalonyl-CoA pathway are indicated. Genes given on the same line and not separated by hyphens are closely associated on the chromosome. Genes and their arrangement on the chromosome are strongly conserved in strains AM1 and DM4 (see Suppl. Table S1), except the mau cluster for methylamine utilization and the dcm cluster for dichloromethane utilization which are unique to strain AM1 and strain DM4, respectively.
Draft genome data for M. extorquens AM1 have been available since 2003  and have enabled transcriptomic and proteomic approaches (see e.g. , ), as well as metabolomic studies (see e.g. , , ). Combined with the large complement of genetic tools developed for Methylobacterium (see e.g. , ), this has established M. extorquens AM1 as a model for systems level investigations.
Methylobacterium strain DM4 has been isolated from industrial wastewater sludge in Switzerland, as part of efforts to characterize microorganisms able to degrade the organohalogenated pollutant dichloromethane (DCM) . Unlike methanol and methylamine, which are mainly produced naturally, DCM is better known as a synthetic compound , . Rated as potentially carcinogenic for humans and the most highly produced chlorinated organic compound (http://www.eurochlor.org/solvents), DCM is highly volatile (b.p. 38°C) and water-soluble, making it a widespread contaminant in the environment . Aerobic methylotrophic bacteria capable of using DCM as the sole source of carbon and energy  express high levels of DCM dehalogenase, which transforms DCM into formaldehyde and two molecules of HCl . The genotoxic effects of DCM in both mammals  and bacteria  are due to a short-lived intermediate in the enzymatic transformation of DCM to formaldehyde . Growth with DCM, as the main trait that distinguishes strain DM4 from M. extorquens AM1, has led to its classification as a separate Methylobacterium species . Basing on 16S rRNA gene sequence and DNA-DNA relatedness, it was recently proposed that strain DM4 should be reclassified as M. extorquens .
The primary objective of this work was to define a fully assembled and annotated reference genomic blueprint for Methylobacterium, to assist future experimental investigations of methylotrophic metabolism by global approaches. We report here complete genomic sequences of strains AM1 and DM4, and describe the genomic make-up and potential for genomic plasticity that underlies the extensive capacity of Methylobacterium for physiological adaption to methylotrophic lifestyles. Availability of complete genomic sequences of the two strains also provides the opportunity to define the conserved complements of genes associated with methylotrophy, and to investigate the differences between the two strains associated with strain-specific adaptations.
Results and Discussion
The genome of M. extorquens AM1 totals 6.88 Mb and consists of five replicons: a chromosome of 5.51 Mbp (Acc. No. CP001510), a megaplasmid of 1.26 Mbp (CP001511), and three plasmids (25 kb (CP001512), 38 kb (CP001513), and 44 kb (CP001514)), with an average GC content of 68.5% (Table 1, Fig. 2). The genome of strain DM4 is somewhat smaller (6.12 Mb), and features only three replicons: a chromosome of 5.94 Mbp (Acc. No. FP103042) and two plasmids (141 kb (FP103043) and 38 kb (FP103044)), with an average GC content of 68.0% (Table 1, Fig. 2). Based on their sizes and the relative distribution of sequencing reads for each replicon, plasmids p1META and p2META are predicted to be present at 2–3 copies per strain AM1 genome, the replicon p3META at 1–2 copies per genome, and the megaplasmid at one copy per genome. Predicted copy numbers of 0.4–0.5 and 0.6–0.7 per genome were obtained for DM4 plasmids p1METDI and p2METDI, respectively.
Successive circles from inside to outside: GC skew; GC deviation (with values exceeding +/− 2SD indicated in red); rRNA (pink); tRNA (green); IS elements (brown); all genes coloured according to functional class (COG); methylotrophy genes (blue, see Suppl. Table S1); strain-specific genes (yellow, except genes predicted to be of foreign origin, in red); genomic islands (green, see Table 3). Plasmids are not shown to scale.
By convention, the origin of both chromosomes was set upstream of the dnaA gene, as no GC skew was observed to help in predict initiation and termination of replication . The chromosomes of the two strains are remarkably similar in both gene content and synteny (Fig. 2, Fig. 3). 85% of the M. extorquens AM1 chromosomal genes have full-length homologs (higher than 30% identity) on the chromosome of strain DM4. Of these, 89% have homologs at higher than 95% identity, underlining the orthologous nature of most genes in the two strains. Ribosomal genes (23S, 16S, and 5S) are identical in all five copies of the ribosomal operon of strains AM1 and DM4. The intergenic spacer length between 16S and 23S genes is identical in all five copies of the ribosomal operon of the same strain, but its length differs markedly between the two strains (905 nt in M. extorquens AM1, 602 nt in strain DM4). These data confirm the already mentioned recent suggestion  that strain DM4 belongs to the species M. extorquens.
The linearized replicons were aligned and visualized by Lineplot in Mage. Syntenic relationships comprising at least 8 genes are indicated by violet and blue lines for genes found on the same strandor on opposite strands, respectively. IS elements (pink), ribosomal operons (blue) and tRNAs (green) are also indicated.
The distribution of functional categories according to the COG classification (, Table 2) was as expected for a free-living proteobacterium with a versatile lifestyle and the observed genome size. COG functional class assignments are more frequent and diverse for chromosomal genes than for plasmid genes (Table 2). No significant differences in functional classes were evident between AM1 and DM4 chromosomes, except for the larger proportion of genes associated with recombination, replication and repair in strain AM1, a reflection of the larger set of IS elements in that strain (Table 1, and see below).
The Mage annotation platform  and Alien Hunter  were used to detect genes and genome regions found in one strain but not in the other. Several unique chromosomal regions, termed genomic islands and ranging from a few genes to hundreds of genes, which represent approximately 632 kb (11.5%) and 1,054 kb (17.7%) of the chromosome for strains AM1 and DM4, respectively, were defined (Table 3). With the exception of the dcm and mau gene clusters (see below), few of these islands appear to encode functions important for central metabolism or methylotrophy. One remarkable genomic island in strain AM1 (Table 3) contains a hypothetical gene of unknown function of 47.5 kb (META1_2412) which encodes a 15,831 residue-long repeat-rich polypeptide (Pfam PF00353 (hemolysin-type calcium-binding region); PF05594, (haemagglutinin, bacterial); COG3210 (large exoproteins involved in heme utilization or adhesion), and COG2931 (RTX toxins and related Ca2+−binding proteins). This gene product, if expressed, would represent one of the largest proteins known in biology .
Extra-chromosomal replicons are highly strain-specific and show little similarity in size, gene content or synteny with each other. However, an approximately 130 kb region of the AM1 megaplasmid is globally syntenic to a region of similar length in the chromosome of strain DM4 (Fig. 3). Plasmids encode mostly proteins of currently unknown function (Table 2) or proteins associated with plasmid-related functions. Exceptions include a cation efflux system on plasmid p1META1 (p1META1_0021/p1META1_0022); a cluster of copper resistance genes on plasmid p2META1 (p2META1_0029/p2META1_0030); a truncated luxI gene (p1META1_0049) recently shown to be essential for the operation of two bona fide, chromosomally-located luxI genes, and encoding two acyl homoserine lactone synthases ; and UmuDC systems involved in SOS DNA repair. Unlike in strain DM4, which has two complete copies of umuDC on its chromosome (METDI0144/METDI0143 and METDI4328/METDI4329), a complete umuDC cluster in AM1 is only found on the megaplasmid (META2_0643/META2_0644), while a truncated copy of umuC is found on the chromosome (META1_4790)
Comparative genomics of aerobic methylotrophy
Methylotrophy can be envisioned in terms of the assembly of discrete metabolic modules, each responsible for a specific metabolic task, which in combination define pathways for methylotrophic metabolism, several variants of which have been well characterized , .
The Methylobacterium blueprint.
In Methylobacterium, the currently recognized methylotrophy genes and modules are found exclusively on the chromosomes of strains AM1 and DM4 (Fig. 2, Suppl. Table S1). Common genes associated with methylotrophy inventoried in Suppl. Table S1 display at least 95% identity at the protein level (99.1% average), with complete synteny between the two strains . Several methylotrophy genes are found as singletons, including several cases of genes that encode different subunits of the same enzyme (e.g. mcmAB, pccAB, see Suppl. Table S1). Nevertheless, a majority of methylotrophy genes are found in large clusters. Only two known methylotrophy gene clusters are not shared between the two strains (Suppl. Table S1, and see below): the dcm (dichloromethane degradation) gene region present only in strain DM4, and the mau gene cluster encoding methylamine dehydrogenase and accessory functions in strain AM1. One large multi-operon cluster (49.3 kb) encodes most of the serine cycle enzymes, most of the PQQ biosynthesis functions , genes for H4MPT-linked reactions and H4MPT biosynthesis, and H4F biosynthesis genes. It also contains genes encoding a homolog of methanol dehydrogenase (XoxFJG) of still unknown function, often found nearby genes involved in C1 metabolism ,  and recently suggested to be involved in formaldehyde metabolism in the photosynthetic bacterium Rhodobacter sphaeroides .
Comparison of gene sets for methylotrophy in fully sequenced genomes.
A steadily increasing number of genomes of methylotrophic microorganisms have been sequenced, assembled and annotated. We limit our comparative analysis of known genetic determinants and modules of methylotrophy (Suppl. Table S2) to completed, manually annotated and officially published methylotroph genomes (listed in Suppl. Table S3), six of which belong to the phylum Proteobacteria and one to the phylum Verrucomicrobia. Methylococcus capsulatus represents Gamma-proteobacterial methanotrophs , while Methylibium petroleiphilum , Methylobacillus flagellatus  and Methylophilales strain HTCC2181  feature two different orders within Beta-proteobacteria (Burkholderiales and Methylophilales). Silicibacter pomeroyi, although not reported to grow methylotrophically, is an Alpha-proteobacterium of the family Rhodobacteriaceae capable of degrading methylated sulfur compounds . Granulibacter bethesdensis is an emerging human pathogen of the family of Acetobacteriaceae within Alpha-proteobacteria  reported to grow on methanol . Finally, strain V4 (candidatus “Methyloacidiphilum infernorum”) represents the recently discovered group of thermophilic and acidophilic methanotrophs of the phylum Verrucomicrobia .
The mxa gene cluster encoding the classic methanol dehydrogenase is nearly identical (over 99% identity at the protein level) between strains AM1 and DM4, and very similar in the genomes of M. capsulatus, M. flagellatus, G. bethesdensis and several other proteobacterial methylotrophs . This conservation of both gene sequence and gene synteny suggests that the mxa gene cluster has most likely disseminated via lateral transfer among methylotrophs of different subclasses of Proteobacteria. This notwithstanding, no similar gene clusters are recognizable in the genomes of the other four organisms discussed here. M. petroleiphilum features a gene cluster encoding an alternative methanol dehydrogenase (Mdh2; ) with little homology to either mxaF or xoxF. The gene xoxF is found in all of the genomes discussed here except that of S. pomeroyi but, as discussed elsewhere , the Xox system is unlikely to be responsible for aerobic methanol oxidation. The genes responsible for methanol oxidation by Methylophilales HTCC2181 and strain V4 remain unknown, suggesting the existence of other, yet unidentified systems for methanol dissimilation.
The mau gene cluster encoding the canonical system for methylamine utilization was characterized for a large part in strain AM1, and the genome of M. flagellatus  contains a mau gene cluster very similar to it. The main difference is that the gene for the electron acceptor from methylamine dehydrogenase in strain AM1, amicyanin, is replaced by a gene for azurin, an analogous copper-containing electron acceptor protein in M. flagellatus. The mau cluster was not found in genomes of the other methylotrophs including strain DM4 discussed here, which were shown or assumed to grow with methylamine (Suppl. Table S2). Thus, as yet uncharacterized genetic determinants are responsible for methylamine utilization in most methylotrophs, including strain DM4 in particular.
H4MPT-dependent formaldehyde oxidation.
Tetrahydomethanopterin (H4MPT)-dependent formaldehyde oxidation is the main pathway for both energy generation and formaldehyde detoxification in M. extorquens, and therefore absolutely essential for methylotrophy in this organism , , . First defined in M. extorquens AM1, this pathway has also been described in a variety of other bacteria, including from phyla whose methylotrophic ability has not yet been demonstrated such as Planctomycetes , , . Phylogenetic analysis suggests that this pathway must be one of the most ancient in the context of methylotrophic metabolism. However, it is unessential in M. flagellatus , and is absent in some other methylotrophs. S. pomeroyi possesses an alternative glutathione-dependent (FlhA/FghA) system for oxidation of formaldehyde similar to that of P. denitrificans  and R. sphaeroides . No formaldehyde oxidation systems were identified in the genomes of Methylophilales HTCC2181 or Verrucomicrobia strain V4.
Conversion of formate to CO2.
M. extorquens strains possess four different functional formate dehydrogenases for the final step of energy generation from carbon oxidation . The other methylotrophs included in our analysis also encode one or several FDH homologs (Suppl. Table S2), but only one, FHD2, is consistently detected. These observations suggest that formate oxidation, as a transformation ubiquitous to life, does not strictly qualify as a methylotrophy-specific reaction, and may thus involve analogous  enzymatic systems.
C1 assimilation via methylene tetrahydrofolate and the serine cycle.
The serine cycle is essential for carbon assimilation in Methylobacterium and comprises reactions specific to methylotrophy as well as reactions involved in multicarbon metabolism (Fig. 1, see also , ). Genes involved in the serine cycle can be ascribed to two categories on the basis of mutational analysis : methylotrophy-specific genes (glyA, sga, hpr, gck, ppc, mtkAB and mcl), and genes which are essential under non methylotrophic growth conditions (eno and mdh). Recent evidence ,  and mutant analyses , – suggest that genes for the C1 transfer pathway linked to H4F (mtdA, fch and ftfL) are specifically involved in assimilatory metabolism in Methylobacterium. Six methylotrophy-specific serine cycle genes, along with mtdA and fch, belong to gene clusters associated with methylotrophy on the chromosomes of strains AM1 and DM4 (Fig. 4), while the three remaining genes (glyA, gck and ftfL) are not parts of methylotrophy gene clusters and are located elsewhere on the chromosome.
Sequences were retreived from Genbank and visualized using CLC Sequence Viewer 5 (www.clcbio.com). Chromosome sequence positions are indicated, as well as the percent identity at the protein level with Methylobacterium prototypes (nd: not detectable). Formate tetrahydrofolate ligase/formyl-tetrahydrofolate synthetase (ftfL, black); serine hydroxymethyltransferase (glyA, pink); serine glyxoylate aminotransferase (sga, yellow); hydroxypyruvate reductase (hprA, red); glycerate kinase (gck, purple); phosphoenolpyruvate carboxylase (ppc, orange); malyl-CoA lyase/β-methylmalyl-CoA lyase (mcl, dark green); malate thiokinase (mtkA/mtkB, light green); NAD(P)-dependent methylene-tetrahydromethanopterin/methylene-tetrahydrofolatedehydrogenase (mtdA, dark blue); methenyl tetrahydrofolate cyclohydrolase (fch, light blue); bifunctional methylene-tetrahydrofolate dehydrogenase/methenyl-tetrahydrofolate cyclohydrolase (folD, grey); transcriptional regulator (pale green); other (white); tRNA (black rectangles).
As exemplified here for M. petroleiphilum , Beta-proteobacterial methylotrophs may also employ the serine cycle for C1 carbon assimilation, . As that of the Alpha-proteobacterium S. pomeroyi, the genome of M. petroleiphilum contains a single gene cluster encoding all required functions of the serine cycle (Fig. 4). In S. pomeroyi however, the organisation of this gene cluster is quite different from that of Methylobacterium, Granulibacter bethesdensis and M. petroleiphilum (Fig. 4), and contains tandem genes for two distantly related bifunctional methylene-H4F dehydrogenase/methenyl-H4F cyclohydrolase (FolD) enzymes instead of the isofunctional mtdA/fch genes found in the other genomes discussed here. Moreover, the hpr, gck and sga genes inferred from the genomic context display only modest sequence identity with Methylobacterium prototypes (Fig. 4), further suggesting that the serine cycle in S. pomeroyi belongs to an independent evolutionary lineage.
Extending the analysis to methylotrophic organisms able to grow with methane, the Gamma-proteobacterial methanotroph M. capsulatus also harbors serine cycle gene homologs in its genome, including the mtdA/fch pair, but few of them are clustered (Fig. 4). However, the gene for one key enzyme of the serine cycle, the methylotrophy-specific phosphoenolpyruvate carboxylase gene, is missing , consistent with the extensive biochemical studies demonstrating that the main pathway for C1 assimilation in Methylococcus capsulatus is the RuMP pathway.
C1 assimilation and the ethylmalonyl-CoA pathway for glyoxylate regeneration.
The assimilation of C1 units by the serine cycle requires the regeneration of glyoxylate from acetyl-CoA. It has been a long standing puzzle how strain AM1 achieves this given that it lacks isocitrate lyase activity, the key enzyme of the classical glyoxylate regeneration pathway . Indeed, and the corresponding gene was not detected in the Methylobacterium genome. Glyoxylate regeneration via the recently elucidated ethylmalonyl-CoA pathway  has now been demonstrated in strain AM1 , and the corresponding genes have been identified , . The genomes of the other bacteria compared here present a contrasting picture in this respect. The genome of S. pomeroyi also contains a complete set of the genes for the ethylmalonyl-CoA pathway (not shown), and as in M. extorquens, these genes are not clustered on the chromosome. In M. petroleiphilum and G. bethesdensis, however, the genes for the key enzymes of the ethylmalonyl-CoA pathway  are missing (not shown), but genes thought to encode the isocitrate lyase shunt are present instead . In M. capsulatus, neither ethylmalonyl-CoA pathway nor the isocitrate lyase shunt appear to be encoded within the genome , consistent with the operation of the RuMP pathway as the predominant pathway for C1 assimilation in M. capsulatus.
Transcriptional regulation of carbon assimilation in methylotrophic metabolism.
The gene of the global serine cycle regulator in Methylobacterium (QscR, a LysR-type regulator homologous to CbbR), is essential for methylotrophic growth. It activates transcription of the clustered serine cycle genes as well as of glyA, and negatively regulates its own transcription  but it is not located in the proximity of known serine cycle genes in the genome. However, the genes of several probable regulators of unknown function are found nearby serine cycle genes in all methylotrophic bacteria including Methylobacterium discussed here (Fig. 4).
Analysis of IS elements uncovers a significant potential for genome plasticity in Methylobacterium
Methylobacterium genomes display an IS content comparable to that other microbial genomes , but with a clear differential distribution of highly diverse IS elements in AM1 and DM4 (Fig. 5, Suppl. Table S4). In AM1, 39 different IS types (defined by a 95% amino acid identity threshold), belonging to 14 IS families (defined as broad groupings of related elements in ISfinder ), were detected, compared to 42 IS types belonging to 14 IS families in DM4. Overall diversity of IS types is higher in DM4, but the total number of IS elements in AM1 is twice as high as in DM4 (Table 1). A total of 71 intact and 23 partial IS elements were detected in strain DM4, representing about 2% of the genome (Table 1). With 9 and 7 copies, respectively, ISMex15 and ISMex17 were the two most abundant IS elements in this strain. In comparison, strain AM1 featured 142 intact and 32 partial IS elements, representing 3.7% of the genome (Table 1). At 37, 16 and 23 intact copies, respectively, ISMex1, ISMex2 and ISMex3 of AM1 (with average pairwise nucleotide differences between different gene copies of only 0.01%, 0.11% and 0.03% respectively), the most abundant IS elements identified, may have undergone recent expansion. In addition, one miniature inverted-repeat transposable element (MITE), MiniMdi3, was detected in both strains (Suppl. Table S4). This element (~400 bp) is related to ISMdi3 but lacks the transposase gene. Few studies so far have identified the presence of both non-autonomous and autonomous transposable elements in the same bacterial genome (see e.g. Out of a total of 70 IS types identified in this work, only 11 IS types are shared between the two strains (Suppl. Table S4, intact IS). IS5 and IS110 are the most abundant shared IS families, each family featuring 5 to 7 different types of IS (Fig. 5). This suggests that substantial IS loss and/or acquisition has occurred during the relatively short period of time since both strains have emerged from a common ancestor.
The bar length shows the total intact IS copy number of each IS family in DM4 (right) and AM1 (left) (see Suppl. Table S4). Differently colored regions represent different replicons : blue – DM4 chromosome, cyan – DM4 plasmid p1METDI, green – DM4 plasmid p2METDI, pink – AM1 chromosome, orange – AM1 megaplasmid, dark yellow – AM1 plasmid p1META1, light yellow – AM1 plasmid p2META1, light green – AM1 plasmid p3META1. Open circles and squares represent the numbers of different types of ISs within each family in AM1 and DM4, respectively.
The distribution of IS element localization within each genome displays clear-cut, non-random features. Plasmids harbor a higher density of IS elements than the chromosomes. Over 20% of the length of the DM4 plasmid p2METDI and of the AM1 plasmids p1META and p2META encode IS elements. Similarly, IS elements comprise about 8% of the length of the AM1 megaplasmid (Table 1), a significantly higher proportion than in the chromosome (x2 test, p<0.0001). Moreover, several IS families are significantly over-represented on particular replicons. For example, all 16 copies of ISMex2, an IS element belonging to the IS481 family that is specific to strain AM1, are found on its chromosome while all 5 copies of the IS elements belonging to the IS110 family are on the megaplasmid. In contrast, 13 out of the 14 copies of IS elements of this group in DM4 are located on the chromosome. For some IS elements, however, a more homogeneous distribution was noted. For example, the Tn3 family element ISMex22 unique to strain AM1 is found in one copy per replicon. Transposition immunity was described for this type of IS element , suggesting the occurrence of transposition saturation in this case.
The observed non-random IS density across replicons may be due to one or more of three potential causes: (1) biased transposition rates by different IS types across replicons, such as local hopping or plasmid specificity; (2) biased selective effects of transposition events, such as over-representation in regions with high density of genes with little or no selective value, such as plasmids or IS elements themselves; or (3) insufficient time for reaching equilibrium, e.g. for IS elements acquired via recent plasmid-mediated transmission. A second pattern in the distribution of IS locations was noted within each replicon. There is an over-representation of IS elements by 7-fold and 39-fold in chromosomal regions unique to AM1 and DM4, respectively, relative to the regions shared between the two strains (x2 test, p<0.0001; also see Fig. 2). These could represent regions with fewer essential genes and therefore relaxed selection against DNA insertions. Alternatively, they could have been IS-rich regions dating back to the common ancestor of these two strains. This would have then led to increased rates of deletion between two co-directional copies of the same IS element, causing such IS-rich regions to be lost more frequently.
IS elements linked to methylotrophy.
The two strain-specific methylotrophy regions containing mau (in AM1) and dcm (in DM4) gene clusters (Table 3) are closely associated with IS elements. In strain DM4, genes dcmR and dcmA are embedded within several overlapping IS elements (, Table 3 and see below, Fig. 6). In strain AM1, the mau cluster (12 kb) lies between 2 copies of ISMex15 (~30 kb), as part of a larger (approx. 66 kb) gene cluster unique to this strain (Table 3). This suggests that such methylotrophy-associated gene clusters may be prone to lateral gene transfer and/or deletion. Indeed, it has been shown recently that the presence of the mau gene cluster is variable in closely related environmental strains of Methylotenera, a betaproteobacterial methylotroph . This phenomenon may be involved in the emergence of new ecotypes of methylotrophs.
All functional annotations are putative except for the DCM dehalogenase gene and its upstream regulator (bold). Highlighted are genes for putative enzymes (red), regulators (orange) transporters (yellow), proteins involved in DNA modification (blue), transposases (cyan), proteins involved in plasmid functions (green), and gene fragments (grey), with hypothetical and conserved hypothetical proteins left in white. The interrupted chelatase-likegene (hashed) defined here as “island integration determinant” flanks the 126 kb dcm island.
The genomic island for DCM utilization: a new type of mobility determinant?
Unlike methylamine and methanol which are produced naturally in large amounts , , DCM is produced naturally at low levels only , and presumably occurs at significant concentrations in the environment due to industrial production. The dcm genomic island unique to strain DM4 with the dcmA gene encoding DCM dehalogenase required for growth of Methylobacterium with DCM is located on the chromosome (Table 3, Fig. 6), just 20 genes downstream of the large conserved 49 kb methylotrophy gene cluster (Fig. 2, Suppl. Table S1). This 126 kb DNA region, of markedly different GC content (60.5%) from the genome average, was most likely acquired by horizontal transfer. The sequences upstream and downstream of the unique dcm region are in complete synteny between the genomes of strains DM4 and AM1. The integration point of the dcm region features the 5′-end and 3′-end remains of a “chelatase-like” (COG0606, predicted ATPase with chaperone activity). Although most currently known genomic islands are located at the 3′ end of a tRNA locus, other genes serving as integration sites have been described, such as the glr (glutamate racemase) gene of the Helicobacter pylori pathogenicity island . Clues on the mode of integration of the dcm region within the Methylobacterium chromosomal framework were obtained by a more detailed analysis. The first CDS within the dcm region encodes a putative recombinase. Arrangements of non-overlapping 5′ and 3′ fragments of such a “chelatase” gene bordering an internal DNA fragment beginning with a recombinase gene are also evident in three other published complete genomes (Table 4). Additional DNA motifs associated with such structures include 5–20 bp direct repeats and palindromic sequences located immediately up- and downstream of the 5′- and 3′-fragments of the disrupted gene, respectively (Table 4). DNA sequences encoding “chelatase” homologs are often apparent pseudogenes, partial sequences, or sequences containing one or several internal stop codons, suggesting that such sequences may have experienced insertion and subsequent excision of DNA fragments. It is tempting to speculate that such sequences represent novel determinants of genome plasticity, and we propose the term “island integration determinant” (iid) to describe them.
The dcm region features only few genes that can be associated with confidence with methylotrophic metabolism (Fig. 6). The majority of the genes within this region (74/128, 58%) are hypothetical or conserved hypothetical proteins (compared to the chromosomal average of 41.1% and plasmid average of 43.8% for such proteins, Table 1). Several genes of the dcm region are interrupted by IS elements (e.g. a glutathione S-transferase METDI2660/2663), or are present in truncated form (e.g. a DNA helicase METDI2648). Many CDS seem associated with DNA modification, stability and mobility. Moreover, 7 IS elements were identified in this region, with 4 in close proximity to dcmA . The structural elements of a bona fide repABC plasmid , i.e. a canonical repABC operon encoding plasmid replication and maintenance function with its counter-transcribed small RNA in divergent orientation upstream of repC, and a palindromic 16 nt sequence GTTCTCAGCTGAGAAC fitting the par binding site consensus sequence  upstream of repA, were also found within the dcm region. The 8 kb region centered around repABC displays extensive synteny with several rhizobial plasmids and with several regions on the chromosome of Nitrobacter hamburgensis X14 . This suggests that part or all of the dcm region may have once existed as an extrachromosomal element and contributed to the spread of the metabolic capacity to degrade DCM in the environment. Nevertheless, introduction of the dcmA gene into strain AM1, with expression of active DCM dehalogenase at high levels, failed to enable growth on DCM . Thus, specific adaptations are required beyond the presence of DCM dehalogenase to enable Methylobacterium to grow with this compound . Additional genetic determinants needed for growth with DCM remain to be discovered, and the availability of genomic sequences will facilitate experimental efforts towards identifying them.
The assembled and complete genome sequences of two strains representing the pink-pigmented facultative methylotrophs of the genus Methylobacterium reveal extensive genome-wide homology and gene synteny. Genomic determinants of methylotrophy are almost identical between the two strains, with the exception of the methylamine utilization cluster unique to strain AM1 and of the DCM utilization cluster unique to strain DM4. Still, the two strains differ in genome size and number of replicons, and feature a set of strain-specific genes, mostly of unknown function. The large number and extensive diversity of IS elements in Methylobacterium genomes, along with the often clustered organization of genes for utilization of C1 compounds, , suggests that genome rearrangements and horizontal gene transfer most often associated with IS elements, represent key mechanisms of Methylobacterium evolution relating to growth-supporting nutrients and environmental conditions.The co-linearity of the two genomes and the absence of substantial large-scale sequence rearrangements are all the more striking in this context, and may indicate that purifying selection sets strong constraints against major alterations of the genome structure in Methylobacterium, despite the long laboratory history of the two strains, usually grown with different carbon sources (methanol for strain AM1 and DCM for strain DM4). These two genome sequences thus afford a refined picture of the potential of Methylobacterium for physiological flexibility and adaptation to specific environmental constraints within a conserved genomic framework, and provide the basis for renewed, systems level experimental investigations.
Materials and Methods
Sequencing, assembly, and validation of the genome of M. extorquens AM1
Sequence data were obtained by whole genome shotgun sequencing as previously described . BigDye terminator chemistry and capillary DNA sequencers (model 3700, Applied Biosystems) were used. Randomly picked blunt end-cloned small insert pUC19 vector-based plasmids (average ~3 kb insert size) were sequenced at both ends using universal forward and reverse sequencing primers, according to standard protocols established at the University of Washington Genome Center. In addition, a large insert fosmid library was constructed from Sau3A partial-restricted genomic DNA cloned in BamH1 digested pFOS1 vector. About 1,920 randomly picked fosmid clones were end-sequenced and the data pooled with the small insert shotgun sequence data. Sequence data were assembled and visualized using Phred/Phrap/Consed software (www.phrap.com). The sequence quality and assembly was improved by carrying out several rounds of experiments designed by the Autofinish tool in Consed . Manual finishing was carried out that involved (a) use of specialized sequencing chemistries to sequence difficult regions; (b) PCR amplification and sequencing of specific targeted regions; (c) transposon mutagenesis of over 110 small insert clones followed by sequencing to fix misassembled or difficult to assemble regions; and (d) shotgun sequencing of the 58 targeted fosmid clones to fix long-range misassemblies in the assembled genome. The consensus sequences from transposon mutagenized small insert clones, and the shotgun sequenced fosmid clones were used as backbones in the main genome assembly to resolve misassembled regions. The final strain AM1 genome assembly contained a total of 132942 sequence reads, as well as the backbones from 58 fosmids and over 110 transposon mutagenized small insert clones, and was validated by two independent methods. The gross-scale long-range validity of the genome assembly was established by pulse-field-gel-electrophoresis, with complete agreement between the virtual and experimentally determined fingerprint patterns of the final assembled genome, either by single restriction enzyme digestion with PmeI or SwaI or by double digestion with a mixture of PmeI and SwaI restriction enzymes (data not shown). For kb scale validation of the genome assembly, fingerprint data were generated from 1673 of the paired-end-sequenced fosmid clones by digesting with three independent restriction enzymes, FspI, NcoI and SphI. The fosmid paired-end-sequence and experimentally derived fingerprint data were used for assembly validation by comparison with the virtual fingerprint patterns from the assembled genome using the SeqTile software tools developed for this purpose at UWGC . The fosmid paired-end-reads anchored the clone to a unique position in the genome, while the fingerprint data were used to compare experimentally derived fingerprints with the sequence derived virtual patterns. A complete correspondence between the virtual and experimentally derived fingerprint pattern of the genome in the three restriction enzyme domains of FspI, NcoI and SphI was observed, thus validating the genome assembly.
Genome sequencing, assembly and validation of the genome of strain DM4
The complete sequence of the genome of strain DM4 was obtained using three different libraries. Genomic DNA was fragmented by mechanical shearing, and 3 kb (A) and 10 kb (B) inserts were cloned, respectively, into plasmid vectors pNAV (a pcDNA2.1 (Invitrogen) derivative) and pCNS (a pSU18 derivative). In addition, a large insert BAC library (25 kb inserts, C) was constructed from Sau3A partially digested total DNA by cloning into pBeloBAC11. Plasmid DNAs were purified and end-sequenced (79200 (A), 27648 (B), 13056 (C) paired match end-reads, respectively) using dye-terminator chemistry on ABI3730 sequencers. Assembly was realized as described  with Phred/Phrap/Consed software package (www.phrap.com). An additional 2170 sequences from selected clones were used in the finishing phase of assembly.
Genome annotation and bioinformatic analysis
Coding sequences were predicted using the AMIGene (Annotation of Microbial Genomes) software  and then submitted to automatic functional annotation using the set of tools listed in . Putative orthology relationships between the two genomes were defined by gene pairs satisfying either the Bidirectional Best Hit criterion  or an alignment threshold (at least 40% sequence identity over at least 80% of the length of the smallest protein). These relationships were subsequently used to search for conserved gene clusters (synteny groups) among several bacterial genomes using an algorithm based on an exact graph-theoretical approach . This method allowed for multiple correspondences between genes, detection of paralogy relationships, gene fusions, and chromosomal rearrangements (inversion, insertion/deletion). The ‘gap’ parameter, representing the maximum number of consecutive genes that are not involved in a synteny group, was set to five.
Manual validation of automatic annotations was performed in a relational database (MethylobacScope, https://www.genoscope.cns.fr/agc/mage/wwwpkgdb/Login/log.php?pid=26) using the MaGe web interface , which allows graphic visualization of the annotations enhanced by a synchronized representation of synteny groups in other genomes chosen for comparison. Genomes were checked for the presence of genes without homologs in the parent genome using thresholds of 80% sequence identity threshold at the protein level and 80% of the length of the shorter homolog (minLrap 0.8). Chromosomal genes of potentially foreign origin were detected using Alien Hunter . Potential genomic islands were searched for with the RGP (Region of Genomic Plasticity) tool of the Mage web-based interface  based on synteny breaks between compared genomes, and then checked the predicted regions manually. Only regions larger than 8 kb are reported here.
IS annotations were done by in-house computational tools (Robinson, Lee, Marx, unpublished) that incorporated IScan , followed by manual validation based on ISfinder . IS elements were given names of type “ISMex3”, with “Mex” (for M. extorquens) and “Mdi” (for Methylobacterium degrading dichloromethane) indicating strains AM1 or DM4, respectively. The same type name was used for both strains for IS elements with >95% identity in protein sequence. An intact copy was defined as a sequence whose length was at least 99% of the length of the longest copy detected, and a partial IS was defined as a >500 bp fragment with >80% DNA identity to an intact copy.
Methylotrophy genes in M. extorquens AM1 and DM4
(0.07 MB DOC)
Methylotrophy enzymes and pathways deduced from complete genomic sequences of methylotrophs
(0.04 MB DOC)
Methylotrophic bacteria with published genome sequences included in comparative analyses
(0.04 MB DOC)
Characteristics of IS elements in Methylobacterium extorquens
(0.05 MB DOC)
Elizabeth Skovran, Sandro Roselli, Romain Lang and David Lalaouna are thanked for participation in the annotation work.
Analyzed the data: SV LC MCL FB AL YZ BG SC RP DR. Contributed reagents/materials/analysis tools: SC WG MP DGR DR GS DV. Wrote the paper: SV LC MCL FB BG MP RP CJM JAV CM. Designed the initial project: SV LC MVO JW MEL. Contributed to the manual expert annotation of the two strains: SV LC MCL FB BG CG EHourcade EM TN CP RP. Comparative genomic analyses: SV LC MCL FB RP DR. Designed Figure 1: SV LC RP JAV. Designed Figure 2: ZR. Designed Figure 3: GS. Designed Figure 4: SV LC. Designed Figure 5: MCL. Designed Figure 6: SV. Secured funds to support the study: SV JW CM MEL. Shotgun sequencing and assembly: YZ VB JC CD RL SM CS ZW RK. Managed informatics resources of the project: CD EHaugen MP GS DV.
- 1. Lidstrom ME (2006) Aerobic methylotrophic prokaryotes. In: Dworkin M,Falkow S,Rosenberg E,Schleifer K-H,Stackebrandt E, editors. The Prokaryotes,Vol 2: Ecophysiology and Biochemistry. New York: Springer-Verlag. pp. 618–634.
- 2. Galbally IE,Kirstine W (2002) The production of methanol by flowering plants and the global cycle of methanol. J Atmos Chem 43: 195–229.
- 3. Jourand P,Giraud E,Bena G,Sy A,Willems A,et al. (2004) Methylobacterium nodulans sp. nov., for a group of aerobic, facultatively methylotrophic, legume root-nodule-forming and nitrogen-fixing bacteria. Int J Syst Evol Microbiol 54: 2269–2273.
- 4. Lidstrom ME,Chistoserdova L (2002) Plants in the pink: Cytokinin production by Methylobacterium. J Bacteriol 184: 1818–1818.
- 5. Sy A,Timmers ACJ,Knief C,Vorholt JA (2005) Methylotrophic metabolism is advantageous for Methylobacterium extorquens during colonization of Medicago truncatula under competitive conditions. Appl Environ Microbiol 71: 7245–7252.
- 6. Van Aken B,Yoon JM,Schnoor JL (2004) Biodegradation of nitro-substituted explosives 2,4,6-trinitrotoluene, hexahydro-1,3,5-trinitro-1,3,5-triazine, an octahydro-1,3,5,7-tetranitro-1,3,5-tetrazocineby a phytosymbiotic Methylobacterium sp. associated with poplar tissues (Populus deltoides×nigra DN34). Appl Environ Microbiol 70: 508–517.
- 7. Abanda-Nkpwatt D,Musch M,Tschiersch J,Boettner M,Schwab W (2006) Molecular interaction between Methylobacterium extorquens and seedlings: growth promotion, methanol consumption, and localization of the methanol emission site. J Exp Bot 57: 4025–4032.
- 8. Anthony C (1982) The Biochemistry of Methylotrophs. London: Academic Press.
- 9. Schrader J,Schilling M,Holtmann D,Sell D,Villela Filho M,et al. (2009) Methanol-based industrial biotechnology: current status and future perspectives of methylotrophic bacteria. Trends Biotechnol 27: 107–115.
- 10. Peel D,Quayle JR (1961) Microbial growth on C1 compounds. I. Isolation and characterization of Pseudomonas AM1. Biochem J 81: 465–469.
- 11. Chistoserdova L,Chen SW,Lapidus A,Lidstrom ME (2003) Methylotrophy in Methylobacterium extorquens AM1 from a genomic point of view. J Bacteriol 185: 2980–2987.
- 12. Erb TJ,Berg IA,Brecht V,Muller M,Fuchs G,et al. (2007) Synthesis of C-5-dicarboxylic acids from C-2-units involving crotonyl-CoA carboxylase/reductase: The ethylmalonyl-CoA pathway. Proc Natl Acad Sci U S A 104: 10631–10636.
- 13. Peyraud R,Kiefer P,Christen P,Massou S,Portais J-C,et al. (2009) Demonstration of the ethylmalonyl-CoA pathway using 13C metabolomics. Proc Natl Acad Sci USA 106: 4846–4851.
- 14. Afolabi PR,Mohammed F,Amaratunga K,Majekodunmi O,Dales SL,et al. (2001) Site-directed mutagenesis and X-ray crystallography of the PQQ-containing quinoprotein methanol dehydrogenase and its electron acceptor, cytochrome cL. Biochemistry 40: 9799–9809.
- 15. Williams PA,Coates L,Mohammed F,Gill R,Erskine PT,et al. (2005) The atomic resolution structure of methanol dehydrogenase from Methylobacterium extorquens. Acta Crystallogr Sect D-Biol Crystallogr 61: 75–79.
- 16. Chistoserdov AY,Chistoserdova LV,McIntire WS,Lidstrom ME (1994) Genetic organization of the mau gene cluster in Methylobacterium extorquens AM1: complete nucleotide sequence and generation and characteristics of mau mutants. J Bacteriol 176: 4052–4065.
- 17. Davidson VL (2001) Pyrroloquinoline quinone (PQQ) from methanol dehydrogenase and tryptophan tryptophylquinone (TTQ) from methylamine dehydrogenase. Adv Protein Chem 58: 95–140.
- 18. Chistoserdova L,Jenkins C,Kalyuzhnaya MG,Marx CJ,Lapidus A,et al. (2004) The enigmatic Planctomycetes may hold a key to the origins of methanogenesis and methylotrophy. Mol Biol Evol 21: 1234–1241.
- 19. Vorholt JA,Chistoserdova L,Stolyar SM,Thauer RK,Lidstrom ME (1999) Distribution of tetrahydromethanopterin-dependent enzymes in methylotrophic bacteria and phylogeny of methenyl tetrahydromethanopterin cyclohydrolases. J Bacteriol 181: 5750–5757.
- 20. Chistoserdova L,Vorholt JA,Thauer RK,Lidstrom ME (1998) C-1 transfer enzymes and coenzymes linking methylotrophic bacteria and methanogenic Archaea. Science 281: 99–102.
- 21. Vorholt JA,Marx CJ,Lidstrom ME,Thauer RK (2000) Novel formaldehyde-activating enzyme in Methylobacterium extorquens AM1 required for growth on methanol. J Bacteriol 182: 6645–6650.
- 22. Vorholt JA,Chistoserdova L,Lidstrom ME,Thauer RK (1998) The NADP-dependent methylene tetrahydromethanopterin dehydrogenase in Methylobacterium extorquens AM1. J Bacteriol 180: 5351–5356.
- 23. Pomper BK,Vorholt JA,Chistoserdova L,Lidstrom ME,Thauer RK (1999) A methenyl tetrahydromethanopterin cyclohydrolase and a methenyl tetrahydrofolate cyclohydrolase in Methylobacterium extorquens AM1. Eur J Biochem 261: 475–480.
- 24. Marx CJ,Van Dien SJ,Lidstrom ME (2005) Flux analysis uncovers key role of functional redundancy in formaldehyde metabolism. PLoS Biol 3: 244–253.
- 25. Crowther GJ,Kosály G,Lidstrom ME (2008) Formate as the main branch point for methylotrophic metabolism in Methylobacterium extorquens AM1. J Bacteriol 190: 5057–5062.
- 26. Okubo Y,Skovran E,Guo XF,Sivam D,Lidstrom ME (2007) Implementation of microarrays for Methylobacterium extorquens AM1. OMICS 11: 325–340.
- 27. Bosch G,Skovran E,Xia Q,Wang T,Taub F,et al. (2008) Comprehensive proteomics of Methylobacterium extorquens AM1 metabolism under single carbon and nonmethylotrophic conditions. Proteomics 8: 3494–3505.
- 28. Guo XF,Lidstrom ME (2008) Metabolite profiling analysis of Methylobacterium extorquens AM1 by comprehensive two-dimensional gas chromatography coupled with time-of-flight mass spectrometry. Biotechnol Bioeng 99: 929–940.
- 29. Kiefer P,Portais J-C,Vorholt JA (2008) Quantitative metabolome analysis using liquid chromatography-high-resolution mass spectrometry. Anal Biochem 382: 94–100.
- 30. Marx C (2008) Development of a broad-host-range sacB-based vector for unmarked allelic exchange. BMC Research Notes 1: 1.
- 31. Marx CJ,Lidstrom ME (2001) Development of improved versatile broad-host-range vectors for use in methylotrophs and other Gram-negative bacteria. Microbiology 147: 2065–2075.
- 32. Gälli R,Leisinger T (1985) Specialized bacterial strains for the removal of dichloromethane from industrial waste. Conservation and Recycling 8: 91–100.
- 33. Khalil MAK,Moore RM,Harper DB,Lobert JM,Erickson DJ,et al. (1999) Natural emissions of chlorine-containing gases: Reactive Chlorine Emissions Inventory. J Geophys Res-Atmos 104: 8333–8346.
- 34. McCulloch A,Aucott ML,Graedel TE,Kleiman G,Midgley PM,et al. (1999) Industrial emissions of trichloroethene, tetrachloroethene, and dichloromethane: Reactive Chlorine Emissions Inventory. J Geophys Res-Atmos 104: 8417–8427.
- 35. Keith LH,Telliard WA (1979) Priority pollutants I - a perspective view. Environ Sci Technol 13: 416–423.
- 36. Vuilleumier S (2002) Coping with a halogenated one-carbon diet: aerobic dichloromethane-mineralising bacteria. In: Reineke W,Agathos S, editors. Biotechnology for the environment, Focus on Biotechnology Series. Dordrecht: Kluwer Academic Publishers. pp. 105–131.
- 37. Vuilleumier S,Ivoš N,Dean M,Leisinger T (2001) Sequence variation in dichloromethane dehalogenases/glutathione S-transferases. Microbiology 147: 611–619.
- 38. Starr TB,Matanoski G,Anders MW,Andersen ME (2006) Workshop overview: Reassessment of the cancer risk of dichloromethane in humans. Toxicological Sciences 91: 20–28.
- 39. Gisi D,Leisinger T,Vuilleumier S (1999) Enzyme-mediated dichloromethane toxicity and mutagenicity of bacterial and mammalian dichloromethane-active glutathione S-transferases. Arch Toxicol 73: 71–79.
- 40. Kayser MF,Vuilleumier S (2001) Dehalogenation of dichloromethane by dichloromethane dehalogenase/glutathione S-transferase leads to the formation of DNA adducts. J Bacteriol 183: 5209–5212.
- 41. Doronina NV,Trotsenko YA,Tourova TP,Kuznetsov BB,Leisinger T (2000) Methylopila helvetica sp. nov. and Methylobacterium dichloromethanicum sp. nov. - Novel aerobic facultatively methylotrophic bacteria utilizing dichloromethane. Syst Appl Microbiol 23: 210–218.
- 42. Kato Y,Asahara M,Arai D,Goto K,Yokota A (2005) Reclassification of Methylobacterium chloromethanicum and Methylobacterium dichloromethanicum as later subjective synonyms of Methylobacterium extorquens and of Methylobacterium lusitanum as a later subjective synonym of Methylobacterium rhodesianum. J Gen Appl Microbiol 51: 287–299.
- 43. Necsulea A,Lobry JR (2007) A new method for assessing the effect of replication on DNA base composition asymmetry. Mol Biol Evol 24: 2169–2179.
- 44. Tatusov RL,Fedorova ND,Jackson JD,Jacobs AR,Kiryutin B,et al. (2003) The COG database: an updated version includes eukaryotes. BMC Bioinformatics 4: 41.
- 45. Vallenet D,Labarre L,Rouy Z,Barbe V,Bocs S,et al. (2006) MaGe: a microbial genome annotation system supported by synteny results. Nucleic Acids Res 34: 53–65.
- 46. Vernikos GS,Parkhill J (2006) Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands. Nucleic Acids Res 22: 2196–2203.
- 47. Fukuda N,Granzier HL (2005) Titin/connectin-based modulation of the Frank-Starling mechanism of the heart. J Muscle Res Cell Motil 26: 319–323.
- 48. Nieto-Peñalver CG,Cantet F,Morin D,Haras D,Vorholt JA (2006) A plasmid-borne truncated luxI homolog controls quorum-sensing systems and extracellular carbohydrate production in Methylobacterium extorquens AM1. J Bacteriol 188: 7321–7324.
- 49. Chistoserdova L,Kalyuzhnaya MG,Lidstrom ME (2005) C1-transfer modules: from genomics to ecology. ASM News 71: 521–528.
- 50. Chistoserdova L,Lidstrom ME (1997) Molecular and mutational analysis of a DNA region separating two methylotrophy gene clusters in Methylobacterium extorquens AM1. Microbiology 143: 1729–1736.
- 51. Denef VJ,Patrauchan MA,Florizone C,Park J,Tsoi TV,et al. (2005) Growth substrate- and phase-specific expression of biphenyl, benzoate, and C-1 metabolic pathways in Burkholderia xenovorans LB400. J Bacteriol 187: 7996–8005.
- 52. Wilson SM,Gleisten MP,Donohue TJ (2008) Identification of proteins involved in formaldehyde metabolism by Rhodobacter sphaeroides. Microbiology 154: 296–305.
- 53. Ward N,Larsen O,Sakwa J,Bruseth L,Khouri H,et al. (2004) Genomic insights into methanotrophy: The complete genome sequence of Methylococcus capsulatu (Bath). PLoS Biol 2: 1616–1628.
- 54. Kane SR,Chakicherla AY,Chain PSG,Schmidt R,Shin MW,et al. (2007) Whole-genome analysis of the methyl tert-butyl ether-degrading beta-proteobacterium Methylibium petroleiphilum PM1. J Bacteriol 189: 1931–1945.
- 55. Chistoserdova L,Lapidus A,Han C,Goodwin L,Saunders L,et al. (2007) Genome of Methylobacillus flagellatus, molecular basis for obligate methylotrophy, and polyphyletic origin of methylotrophy. J Bacteriol 189: 4020–4027.
- 56. Giovannoni SJ,Hayakawa DH,Tripp HJ,Stingl U,Givan SA,et al. (2008) The small genome of an abundant coastal ocean methylotroph. Environmental Microbiol 10: 1771–1782.
- 57. Moran MA,Buchan A,Gonzalez JM,Heidelberg JF,Whitman WB,et al. (2004) Genome sequence of Silicibacter pomeroyi reveals adaptations to the marine environment. Nature 432: 910–913.
- 58. Greenberg DE,Porcella SF,Zelazny AM,Virtaneva K,Sturdevant DE,et al. (2007) Genome sequence analysis of the emerging human pathogenic acetic acid bacterium Granulibacter bethesdensis. J Bacteriol 189: 8727–8736.
- 59. Greenberg DE,Porcella SF,Stock F,Wong A,Conville PS,et al. (2006) Granulibacter bethesdensis gen. nov., sp. nov., a distinctive pathogenic acetic acid bacterium in the family Acetobacteraceae. Int J Syst Evol Microbiol 56: 2609–2616.
- 60. Hou S,Makarova KS,Saw JH,Senin P,Ly BV,et al. (2008) Complete genome sequence of the extremely acidophilic methanotroph isolate V4, “Methylacidiphilum infernorum”, a representative of the bacterial phylum Verrucomicrobia. Biol Direct 3: 26.
- 61. Kalyuzhnaya MG,Hristova KR,Lidstrom ME,Chistoserdova L (2008) Characterization of a novel methanol dehydrogenase in representatives of Burkholderiales: Implications for environmental detection of methylotrophy and evidence for convergent evolution. J Bacteriol 190: 3817–3823.
- 62. Chistoserdova L,Vorholt J,Thauer R,Lidstrom M (1998) C1 transfer enzymes and coenzymes linking methylotrophic bacteria and methanogenic Archaea. Science 281: 99–102.
- 63. Marx CJ,Chistoserdova L,Lidstrom ME (2003) Formaldehyde-detoxifying role of the tetrahydromethanopterin-linked pathway in Methylobacterium extorquens AM1. J Bacteriol 185: 7160–7168.
- 64. Bauer M,Lombardot T,Teeling H,Ward NL,Amann R,et al. (2004) Archaea-like genes for C-1-transfer enzymes in Planctomycetes: Phylogenetic implications of their unexpected presence in this phylum. J Mol Evol 59: 571–586.
- 65. Marx CJ,Miller JA,Chistoserdova L,Lidstrom ME (2004) Multiple formaldehyde oxidation/detoxification pathways in Burkholderia fungorum LB400. J Bacteriol 186: 2173–2178.
- 66. Chistoserdova L,Gomelsky L,Vorholt JA,Gomelsky M,Tsygankov YD,et al. (2000) Analysis of two formaldehyde oxidation pathways in Methylobacillus flagellatus KT, a ribulose monophosphate cycle methylotroph. Microbiology 146: 233–238.
- 67. Ras J,van Ophem PW,Reijnders WN,van Spanning RJ,Duine JA,et al. (1995) Isolation, sequencing, and mutagenesis of the gene encoding NAD- and glutathione-dependent formaldehyde dehydrogenase (GD-FALDH) from Paracoccus denitrificans, in which GD-FALDH is essential for methylotrophic growth. J Bacteriol 177: 247–251.
- 68. Chistoserdova L,Crowther GJ,Vorholt JA,Skovran E,Portais JC,et al. (2007) Identification of a fourth formate dehydrogenase in Methylobacterium extorquens AM1 and confirmation of the essential role of formate oxidation in methylotrophy. J Bacteriol 189: 9076–9081.
- 69. Galperin MY,Walker DR,Koonin EV (1998) Analogous enzymes: independent inventions in enzyme evolution. Genome Res 8: 779–790.
- 70. Chistoserdova LV,Lidstrom ME (1994) Genetics of the serine cycle in Methylobacterium extorquens AM1: identification, sequence, and mutation of three new genes involved in C1 assimilation, orf4, mtkA, and mtkB. J Bacteriol 176: 7398–7404.
- 71. Marx CJ,O'Brien BN,Breezee J,Lidstrom ME (2003) Novel methylotrophy genes of Methylobacterium extorquens AM1 identified by using transposon mutagenesis including a putative dihydromethanopterin reductase. J Bacteriol 185: 669–673.
- 72. Marx CJ,Lidstrom ME (2004) Development of an insertional expression vector system for Methylobacterium extorquens AM1 and generation of null mutants lacking mtdA and/or fch. Microbiology 150: 9–19.
- 73. Marx CJ,Laukel M,Vorholt JA,Lidstrom ME (2003) Purification of the formate-tetrahydrofolate ligase from Methylobacterium extorquens AM1 and demonstration of its requirement for methylotrophic growth. J Bacteriol 185: 7169–7175.
- 74. Kalyuzhnaya MG,De Marco P,Bowerman S,Pacheco CC,Lara JC,et al. (2006) Methyloversatilis universalis gen. nov., sp nov., a novel taxon within the Betaproteobacteria represented by three methylotrophic isolates. Int J Syst Evol Microbiol 56: 2517–2522.
- 75. Kalyuzhnaya MG,Lidstrom ME (2005) QscR-mediated transcriptional activation of serine cycle genes in Methylobacterium extorquens AM1. J Bacteriol 187: 7511–7517.
- 76. Siguier P,Filée J,Chandler M (2006) Insertion sequences in prokaryotic genomes. Curr Op Microbiol 9: 526–531.
- 77. Siguier P,Perochon J,Lestrade L,Mahillon J,Chandler M (2006) ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res 34: D32–D36.
- 78. Chandler M,Mahillon J (2002) Insertion sequences revisited. In: Craig NL,Craigie R,Gellert M,Lambowitz AM, editors. Mobile DNA II. Washington DC: ASM Press. pp. 305–366.
- 79. Schmid-Appert M,Zoller K,Traber H,Vuilleumier S,Leisinger T (1997) Association of newly discovered IS elements with the dichloromethane utilization genes of methylotrophic bacteria. Microbiology 143: 2557–2567.
- 80. Kalyuzhnaya MG,Lapidus A,Ivanova N,Copeland AC,McHardy AC,et al. (2008) High-resolution metagenomics targets specific functional types in complex microbial communities. Nat Biotech 26: 1029–1034.
- 81. Neff JC,Holland EA,Dentener FJ,McDowell WH,K.M. R (2002) The origin, composition and rates of organic nitrogen deposition: a missing piece of the nitrogen cycle? Biogeochemistry 57/58: 99–136.
- 82. Censini S,Lange C,Xiang ZY,Crabtree JE,Ghiara P,et al. (1996) cag, a pathogenicity island of Helicobacter pylori, encodes type I-specific and disease-associated virulence factors. Proc Natl Acad Sci USA 93: 14648–14653.
- 83. Cevallos MA,Cervantes-Rivera R,Gutierrez-Rios RM (2008) The repABC plasmid family. Plasmid 60: 19–37.
- 84. Starkenburg SR,Larimer FW,Stein LY,Klotz MG,Chain PSG,et al. (2008) Complete genome sequence of Nitrobacter hamburgensis X14 and comparative genomic analysis of species within the genus Nitrobacter. Appl Environ Microbiol 74: 2852–2863.
- 85. Kayser MF,Ucurum Z,Vuilleumier S (2002) Dichloromethane metabolism and C1 utilization genes in Methylobacterium strains. Microbiology 148: 1915–1922.
- 86. Rohmer L,Fong C,Abmayr S,Wasnick M,Freeman TJL,et al. (2007) Comparison of Francisella tularensis genomes reveals evolutionary events associated with the emergence of human pathogenic strains. Genome Biol 8: R102.
- 87. Gordon D,Desmarais C,Green P (2001) Automated finishing with Autofinish. Genome Res 11: 614–625.
- 88. Vallenet D,Nordmann P,Barbe V,Poirel L,Mangenot S,et al. (2008) Comparative analysis of Acinetobacters: Three genomes for three lifestyles. PLoS ONE 3: e1805.
- 89. Bocs S,Cruveiller S,Vallenet D,Nuel G,Medigue C (2003) AMIGene: Annotation of MIcrobial genes. Nucleic Acids Res 31: 3723–3726.
- 90. Overbeek R,Fonstein M,D'Souza M,Pusch GD,Maltsev N (1999) The use of gene clusters to infer functional coupling. Proc Natl Acad Sci USA 96: 2896–2901.
- 91. Boyer F,Morgat A,Labarre L,Pothier J,Viari A (2005) Syntons, metabolons and interactons: an exact graph-theoretical approach for exploring neighbourhood between genomic and functional data. Bioinformatics 21: 4209–4215.
- 92. Wagner A,Lewis C,Bichsel M (2007) A survey of bacterial insertion sequences using IScan. Nucleic Acids Res 35: 5284–5293.
- 93. Taylor IJ,Anthony C (1976) A biochemical basis for obligate methylotrophy: properties of a mutant of Pseudomonas AM1 lacking 2-oxoglutarate dehydrogenase. J Gen Microbiol 93: 259–265.
- 94. van Dien SJ,Okubo Y,Hough MT,Korotkova N,Taitano T,et al. (2003) Reconstruction of C-3 and C-4 metabolism in Methylobacterium extorquens AM1 using transposon mutagenesis. Microbiology 149: 601–609.