Abstract
Staphylococcus aureus is recognized as one of the major human pathogens and is by far one of the most common nosocomial organisms. The genetic basis for the emergence of highly epidemic strains remains mysterious. Studying the microevolution of the different clones of S. aureus is essential for identifying the forces driving pathogen emergence and spread. The aim of the present study was to determine the genetic changes characterizing a lineage belonging to the South German clone (ST228) that spread over ten years in a tertiary care hospital in Switzerland. For this reason, we compared the whole genome of eight isolates recovered between 2001 and 2008 at the Lausanne hospital. The genetic comparison of these isolates revealed that their genomes are extremely closely related. Yet, a few more important genetic changes, such as the replacement of a plasmid, the loss of large fragments of DNA, or the insertion of transposases, were observed. These transfers of mobile genetic elements shaped the evolution of the ST228 lineage that spread within the Lausanne hospital. Nevertheless, although the strains analyzed differed in their dynamics, we have not been able to link a particular genetic element with spreading success. Finally, the present study showed that new sequencing technologies improve considerably the quality and quantity of information obtained for a single strain; but this information is still difficult to interpret and important investments are required for the technology to become accessible for routine investigations.
Figures
Citation: Vogel V, Falquet L, Calderon-Copete SP, Basset P, Blanc DS (2012) Short Term Evolution of a Highly Transmissible Methicillin-Resistant Staphylococcus aureus Clone (ST228) in a Tertiary Care Hospital. PLoS ONE 7(6): e38969. https://doi.org/10.1371/journal.pone.0038969
Editor: Marco Salemi, University of Florida, United States of America
Received: December 23, 2011; Accepted: May 15, 2012; Published: June 18, 2012
Copyright: © 2012 Vogel et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by a grant #3100A0-113346 of the Swiss National Foundation for Research. The SIB Swiss Institute of Bioinformatics is an independent non-profit foundation recognised of public utility. More details on the SIB are available at http://www.isb-sib.ch/aboutsib/organisation.html. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Staphylococcus aureus is recognized as one of the major human pathogen and is by far one of the most common nosocomial organisms. This bacterium can cause a variety of infections ranging from superficial skin infections to severe life-threatening conditions such as bacteraemia [1], endocarditis, pneumonia and toxic shock syndrome. S. aureus has amazing capacities to survive and adapt in hostile environments, as illustrated by the emergence of S. aureus strains resistant to almost all classes of antimicrobials [2]. In particular, methicillin-resistant Staphylococcus aureus (MRSA) represents a major issue both in health care settings (HA-MRSA) and in the community (CA-MRSA), and numerous MRSA outbreaks have been reported worldwide. Epidemiological studies revealed that MRSA infections were mostly caused by single clones that spread rapidly in the hospital settings or recently in the community (epidemic MRSA). The reason why a single MRSA clone can be common and widespread in a large geographic area while other clones are restricted to few sporadic isolates remains mysterious. However, the dissemination of particular clones in a specific environment in favor of other strains [3]–[5] or the replacement of clones in a single environment [6], [7] suggests a genetic basis for epidemicity. Thus, the speed at which genetic changes accumulate as well as the frequency at which entire genes, or complex mobile genetic elements are gained and lost are crucial parameters for understanding the mechanisms by which epidemicity, pathogenicity, virulence, or antibiotic resistance might be acquired.
The S. aureus population consists of about ten major lineages associated to humans and many other minor lineages [8]. Genome comparisons performed so far indicated that isolates from different lineages differ at hundreds of genes [9]–[11] while isolates from the same lineage have remarkably conserved genomes despite wide geographic, temporal and presumably selective diversity [12]–[14]. For example, only 285 points mutations among approximately 2500 core genes distinguish two S. aureus genomes belonging to the same lineage [12]. Similarly, comparative whole genome sequencing of ten USA300 isolates collected nationwide in the USA showed a limited number of single nucleotide polymorphisms and regions of differences among these isolates [15]. More recently, Harris et al. [16] used high-throughput sequencing to compare the genome of 63 isolates of a predominant clonal lineage (ST239) common in mainland Asia, South America and parts of Eastern Europe. They also found a relatively low number of variable sites (around 4300) in the core genome and this variation was sufficient to infer the evolutionary history of the lineage. A complementary analysis of these data using Markov spatial models demonstrated that bacterial genomes could indeed contain sufficient evolutionary information to elucidate the temporal and spatial dynamics of transmission of this lineage [17]. These studies did not only provide new clues about the geographical origin, international spread and local epidemiology of this clonal lineage; it also highlighted the potential of whole-genome sequencing technologies to infer the local epidemiology and short term evolution of pathogens.
The ST228-MRSA-I, also called the South German clone or Italian clone [18], is prevalent in several central European countries including Germany [19], Italy [18], Hungary [20], Slovenia [21], Austria [21] and Switzerland [7]. In Switzerland, it was first detected at the end of the 90’s [7]. Since then, it was regularly observed at the Geneva and Lausanne tertiary care hospitals but rarely found in other places in Switzerland. Interestingly, the importance of this clone varies not only geographically but also temporarily. Short after its arrival at the tertiary care hospital of Lausanne, the ST228 clone was responsible for two important outbreaks (Fig. 1B). Four years later, the prevalence of this clone diminishes drastically and it became rare compared to other lineages. Variants of this clone were regularly observed but never spread such extensively until 2008. During the last trimester of 2008, the unusual dissemination of a strain belonging to ST228 was observed. This strain affected more than five hundred patients in less than one year and a half, whereas other clones did not disseminate as extensively although they were in the same setting. The surveillance program established at the Lausanne hospital permitted to follow the dynamic of this clone over ten years providing the ideal material for studying the local microevolution of an epidemic MRSA clone (Fig. 1).
Molecular typing was performed by Pulsed Field Gel Electrophoresis (PFGE) until 2005 and then by Double Locus Sequence Typing (DLST; [51]). A. Trimestral incidence of patients with MRSA between 1999 and 2009. The four main lineages of MRSA encountered at the hospital are labeled by colors; the ST228 in red, the ST105 in violet, the ST45 in green and the ST8 in yellow. The other lineages, less common at the hospital, are in black. B. Trimestral incidence of patients with MRSA of clone ST228 at the University hospital of Lausanne between 1999 and 2009. A subset of MRSA detected between 2008 and 2009 were randomly selected for PFGE typing. The four most common pulsotypes (Da, Db,Dc and Dd) were labeled in red-like colors. C. PFGE profile of the eight isolates selected for the genome comparison. Among these eight isolates, we selected the four most common pulsotypes described in B. The eight pulsotypes were bordered by a marker (M) corresponding to the restriction of the reference strain NCTC 8325.
The aim of the present study was to determine the genetic changes characterizing the ST228 lineage that spread over ten years in a single hospital in order to understand its short-term evolution as well as its local dynamic. To determine the genetic changes that occurred during the spread of this lineage, we compared the whole genome of eight isolates recovered at the Lausanne tertiary care hospital between 2001 and 2008. In addition, we used these data to test the utility of next generation sequencing for very localized epidemiology. The comparison of these eight genomes revealed that they evolved mainly through point mutations accompanied by few transfers of mobile genetic elements (MGEs). None of these changes was characteristic of the most successful isolates suggesting that there is probably not a single genetic factor explaining the particular spreading success of some of the ST228 variants found at the hospital. Finally, the present study confirmed that whole genome comparison improves considerably the genetic characterization of a lineage at a very local epidemiological level but requires a broader knowledge on the evolution of the linage to draw reliable interpretation of the results for epidemiological tracking.
Materials and Methods
Selection of Isolates for Genome Sequencing
Eight different isolates of Staphylococcus aureus belonging to ST228 (spa type t041) collected between 2001 and 2008 at the tertiary care hospital of Lausanne were selected for genomes comparison (Table 1). During this period, about 550 ST228 isolates were identified at the Lausanne hospital (Fig. 1A). Pulsed-field gel electrophoresis (PFGE) typing of a subset of these 550 isolates (n = 320) identified four predominant PFGE profiles, with an addition of about 20 non-predominant profiles (Fig. 1B). The eight isolates were chosen among these 550 isolates in order to (i) represent the best overview of the evolutionary history of the clone by selecting isolates over the eight years period, (ii) account for the greatest genetic variability by choosing strains having different PFGE profiles, and finally (iii) we selected isolates presenting various dynamics to check if there are genetic changes associated with the predominant variants. According to these criterions, we selected one isolate of each of the four major pulsotypes (Fig. 1B) plus four other isolates that have been observed only sporadically. For each of the predominant pulsotypes, we selected one isolate among those that were first recovered at the hospital during the outbreaks; two isolates were collected in 2001 (Da and Db respectively; Fig. 1B and 1C) and two in 2008 (Dc and Dd; Fig. 1B and 1C). The remaining four sporadic isolates were selected during the non-outbreak period; three isolates were chosen in 2006 and one in 2008 just before the beginning of the outbreak of 2008 (Table 1).
Genome Sequencing
Total genomic DNA was isolated from the eight isolates (Table 1) using the phenol chloroform extraction method. Genome sequencing was performed with the Illumina sequencing technology (Genome Analyzer IIx) at the Genomic Technologies Facility of the University of Lausanne. A paired-end library with approximately 600 bp insert was constructed from 5 µg of genomic DNA and 36 bp were obtained on both ends following manufacturer’s instructions (Illumina, San Diego, CA). Each isolate was run on a single lane generating from five to eight million reads. In these conditions, the theoretical coverage based on the published genome size of S. aureus (c. a. 2.8×106) varied from 128× to 205×.
Assembly
The quality of the data obtained from the sequencing was verified using FastQC [http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc/]. Most of the reads were of excellent quality; reads of insufficient quality or contaminant sequences (less than 1%) were removed using locally developed scripts (available upon request).
To have an overview of the genome of the eight isolates, we preformed a first assembly by mapping the reads of each isolate against a reference for which the genome was known using Bowtie [22]. The strain N315 was used as reference for the genome assembly because it was the most closely related strain for which the whole genome was known [23]. These assemblies revealed that a few elements of the strain N315 were missing in the eight isolates and that, the eight isolates harbored elements not existing in N315. To improve the quality of the assembly, to identify other differences such as genome rearrangement and to determine the nature of the non-mapped elements, we developed a new procedure combining different approaches.
To reconstruct the genomes of our eight isolates, we used a combination of de novo assembly (i.e., assembling reads together so that they form a new, previously unknown sequence) and a mapping approach (i.e, assembling reads or contigs against a reference). The strain N315 was again used as reference for the genome assembly [23]. First, the reads of one isolate (I12001) were assembled with SOAPdenovo [24]. Then, the resulting contigs were mapped to the genome sequence of the strain N315 (accession number BA000018) with the Smith-Waterman algorithm implemented in BWA-SW [25]. Finally, the gaps were closed by combining various assemblies manually in Consed [26]. With this approach, we constructed a first genome that was further used as reference to assemble the genomes of the seven other isolates using the same method. All genomes were corrected with PrInSeS-G [27], a software tool that detects and assembles sequence variants (1 base pair (bp) to ∼10 kilobases (kb)) from single- or paired-end short reads. The eight assembled genomes were annotated using an internally developed pipeline [28] and compared to each other in order to identify the elements that differ among our eight isolates. From this procedure, we identified the elements that were common to the sequenced and reference strains and those elements that were present in the reference but absent in our isolates. The reads that did not map onto the reference were assembled separately by a de novo approach using SOAPdenovo [24]. The resulting contigs constituted the elements that were present in our isolates but not in the reference. The sequences of the isolates I12001 to I82008 resulting from this assembly were deposited at the European Nucleotide Archive (ENA); the accession numbers of each isolate are given in Table 1.
Phylogeny
To infer the genetic relationships among our eight isolates we performed a phylogenetic analysis. To avoid that horizontal transfer of MGEs confound phylogenetic interpretations; we based our phylogenies on the SNPs located on the core genome defined as the elements present in the eight isolates and in the reference strain N315. We considered that these elements were likely acquired by a common ancestor while those that are missing in one or several isolates might be MGEs lost or acquired during the evolution of the selected strains. The strain N315, which was the most closely related strain for which the whole genome was known [23], was included in the analysis and was used as outgroup for rooting the phylogenetic tree. This analysis was exclusively based on a multiple sequence alignment of the core genome (see below); thus small indels (<200 bp) and the regions varying between the two lineages (SCCmec, Φ N315, SaPIn1/SaPlm1, plasmids and ST228 phages; see results section for more details) were excluded from the analysis by a masking procedure. The assembled genomes were first aligned using MUGSY [29] and then the regions of the multiple sequence alignment showing indels (insertions or deletions) were masked with “N”. At this stage the column differences (SNPs) were identified. The phylogenetic tree of this “core genome” was then constructed using RAxML using an approximation of the GTR model with a gamma correction for among site rate variations (GTRCAT) [30]. In this software, support for nodes was assessed by bootstrapping with the autoMRE option of automatic “bootstrapping” using majority-rule tree based criteria [31].
The resolution of the tree obtained was not sufficient to be confident about the genetic relationship among the eight isolates (Fig. 2A and 2B). In this analysis, large regions were removed from the alignment (e.g., the SCCmec region) because they differed between our isolates and N315. The exclusion of these elements reduced the number of informative differences which may have consequences on the resolution of the tree. To assess the impact of this last point, we also analyzed the eight isolates without N315 using the same approach as described above. Finally, to see if large-scale genomic changes followed the evolutionary history of the ST228 lineage, we mapped the MGEs (i.e., elements not present in the eight isolates) and the major differences (i.e., differences observed between the eight isolates that affected more than three base pairs, see Table 2) on the phylogenies.
A. In this analysis, the strain N315 was included and used as outgroup. B. Same analysis as in A with bootstrap values given for each node. Nodes that were under the 50% majority rule were collapsed. The tree was rooted with N315 but the distance between this strain and the other was shortened to focus on the relationships between the eight isolates. C. Phylogeny based on the SNPs of the core genome based on the comparison of the eight isolates. D. Mapping of the MGEs and major differences observed between the different isolates onto the phylogenetic tree. Plasmids are represented by circle. The light blue and green circles represent the plasmids that are closely related to the published plasmid SAP064A and pUSA03, respectively. The black trait at the left side of the plasmids represents the presence of the mercuric operon whereas those on the right represent insertions of IS elements. The purple square illustrates the presence of an element of phage of about 18’500 bp. The smaller symbols describe the genetic variations described in Table 2. Circles were used for deletions, triangles for insertions and squares for other types of modifications. The numbers within each symbol correspond to the genetic variation described in Table 2. The symbols are white when the difference is unique to one isolate and colored when it is shared between different isolates.
To understand why the resolution of the trees obtained was low (Fig. 2A, 2B and 2C), we performed two additional analyses. First, we quantified the phylogenetic signal of the two data sets (i. e., core genomes with and without N315) by using the likelihood mapping approach implemented in TREE-PUZZLE [32]. This method displays phylogenetic information contained in a sequence alignment by representing the tree-likeliness of all groups of four randomly chosen sequences (e.g., quartets) in a single graph. For each quartet, the three possible unrooted tree topologies are weighted and the posterior weights are plotted using triangular coordinates and mapped onto a triangle. The resulting distribution of points shows whether the data are suitable for a phylogenetic reconstruction or not. Hence, dots localized close to the triangle vertices represent tree-like phylogenetic signal, whereas those close to the center and on the sides represent star-like (completely unresolved) and network-like (partially unresolved) signal, respectively [33]. To facilitate the interpretation of the results, the triangle was partitioned in seven different regions (basins of attraction). Three regions correspond to fully resolved topologies (A1 A2 and A3), one region represents star-like evolution (A*) and the three last regions (A12, A13, and A23) reflect the situation where it is difficult to distinguish between two of the three trees (Fig. S1A). The occupancies of each of these regions were then presented in percent. Second, we tested for recombination. Recombination creates mosaic genomes, which violate the assumption of tree-like evolution. Therefore, a network was inferred for each data set using SplitsTree [34]. Each data set was then analyzed for the presence of recombinant sequences using the PHI test [35] with alpha = 0.001 [36].
Results
Assembly
De novo assembly and mapping procedure.
We used two methods to assemble our eight genomes. First we mapped the reads onto N315. This approach permitted to determine the elements that are in common between N315 and our isolates but gave no indication about the nature of elements present exclusively in our isolates and no idea about possible genome rearrangements. To overcome these two points, we developed a mix de novo/mapping procedure. We compared the results obtained with the two methods and observed that our mix de novo/mapping procedure gave similar results as the single mapping procedure but permitted to characterize the plasmids and identify the presence of a phage elements. For this reason, we presented and used the results of the mix de novo/mapping procedure.
After quality control of the reads and de novo assembly of the reads, the contigs obtained were mapped to the reference genome N315. The mapping procedure gave rise to a first alignment of our eight isolates (I12001 to I82008) with N315. This alignment enabled to identify the elements that are common between our isolates and the strain N315 as well as the elements that were absent in our isolates relative to N315. From this alignment, we observed two major differences between ST228 and N315. The first variation was observed in the region corresponding to the SCCmec element, the genetic island encoding for the resistance to methicillin. The strain N315 harbored the SCCmec type II while our eight isolates had the complete sets of reads corresponding to the SCCmec type I. The second difference concerned the region comprising the bacteriophage Φ N315 followed by the pathogenicity island SaPIn1/SaPlm1. Important gaps were observed between the positions 2’007’561 and 2’072’272 of the strain N315 suggesting that these two elements were missing in our eight isolates. Other minor divergences such as mutations, indels and the insertion of a transposase of approximately 1’500 bp in a non coding region of the genome were observed between our isolates and the strain N315. When the two major differences described above and the indels were not considered (by masking them), between 734 and 783 SNPs differentiated our isolates from the genome of the strain N315. Among these, 720 SNPs were common to the eight isolates.
By removing N315, we obtained a second alignment based exclusively on the eight isolates. The total length of this alignment was 2’760’547 bp. The comparison of these 2’760’547 bp revealed that our eight isolates were differentiated by a low number of single mutations plus some differences affecting more than three base pairs (Table 2). Most of these differences consisted of insertions or deletions (indels) of several base pairs (between five and 189 bp). In addition, differences (SNPs and indels) were observed over a sequence of 1200 bp of the isolate I32006. These differences were located in one uncharacterized gene with repeated elements. Without taking into account the differences of Table 2, we observed a total of 163 SNPs (single indels were not considered) with 13 to 90 SNPs differentiating pairs of isolates. Five to 61 private SNPs (i.e., SNPs that were exclusively found in one isolate) were observed for the different isolates.
De novo assembly of reads unmapped to N315.
In addition to the main alignment representing most of the genome, we found a few elements that did not belong to the chromosome or that could not be positioned on the chromosome. These elements corresponded to reads that did not map to the genome of N315. To determine the constitution of these elements, the non-mapped reads were assembled de novo. For each isolate, between 16–17 contigs larger than 1’000 bp were obtained. The annotation and analysis of these contigs indicated that one corresponded to a plasmid and that the others corresponded to phage elements. Interestingly, the presence/absence of these elements varied among isolates. The differences among our eight isolates are described below and illustrated in Figure 2.
The plasmid sequences recovered in our isolates were clearly different from the plasmid sequence of strain N315 (accession number AP003139). The comparison of our plasmid sequences with published database revealed that all our isolates except I52006 had a plasmid closely related to the published plasmid SAP064A (accession number GQ900419). Only two deletions differentiated the plasmids of I12001, I22001 and SAP064A and an additional SNP was observed for the I42006 plasmid. The four other SAP064A-like plasmids exhibited additional minor changes plus the insertion or loss of larger DNA fragments. The plasmids of isolates I32006 and I62008 were characterized by the insertion of a transposase of 1’330 bp and 1’900 bp respectively and the plasmid of isolates I72008 and I82008 by the loss of the whole mercuric operon (c.a. 7’000 bp). Finally, we found a completely different plasmid for isolate I52006. This plasmid was closely related to one of the three plasmids found in the USA300 community-acquired MRSA (pUSA03; accession number CP000258; [37]). Our plasmid and pUSA03 differed by only 13 deletions of single base pairs plus 14 SNPs and one 162 bp insertion. The complete sequences of each plasmid were deposited at ENA under the accession numbers given in Table 1.
The remaining 16 contigs obtained after de novo assembly represented about 80’000 bp and were constituted of phage elements not present in the strain N315. Interestingly, comparing these contigs with published databases [38], [39] highlighted that they were all constituted of elements found in other published phages but none corresponded exactly to previously described phages. Fifteen out of the 16 contigs were present in all isolates and represented altogether approximately 62’500 bp. The last and largest contig (18’700 bp) was absent from I32006. For technical reasons, it was not possible to determine the complete and exact constitution of these phages and to find their position within the genome. In addition, it was difficult to precisely compare the phage elements among isolates. Although similar, the contigs constituting these phages differed slightly in their size. This is likely explained by the presence of phage elements in unassembled reads or contigs not considered for analyses (i.e., <1000 bp). These unassembled reads and contigs represented c.a. 4’000–8’000 bp depending of the isolates (i.e., less than 0.03% of the genome).
Phylogeny
The phylogenetic tree including N315 clearly confirmed the close relationship among our eight isolates (Fig. 2A). Unfortunately, the resolution of this tree was not sufficient to assess the real link between the eight isolates since bootstrap values were low (Fig. 2B). The tree based exclusively on the eight isolates (Fig. 2C) had a similar topology with branches slightly better supported. Interestingly, mapping the MGEs and the major differences onto this tree (Fig 2D) indicated that all the differences shared between isolates were not monophyletic. This is particularly well exemplified by isolates I72008 and I82008 which did not cluster together on the tree although they had a large number of deletions in common. However, bootstrap supports remained low and it is unclear if these trees reflect the real evolutionary history of the lineage. Detailed analyses of the content of the sequences indicated that there is a low number of informative SNPs; only 41 informative SNPs were identified in the alignment comprising the strain N315 and 44 in the second alignment of the eight isoaltes. Nevertheless, the likelihood mapping indicated high phylogenetic signal (>90%) for the two data sets (Fig. S1B and S1C). In addition, the split decomposition (Fig. S2) and the PHI test found statistically significant evidence for recombination (P>0.01 for both data sets).
Discussion
The aim of the present study was to investigate the short-term evolution of a European clone (ST228) that spread over a ten-year period in a tertiary care hospital. To determine the genetic changes that characterized this lineage, we compared the whole genome of eight isolates collected in the hospital between 2001 and 2008.
The comparison of the eight genomes indicated that these isolates are genetically very closely related (Fig. 2A). Most variations among their genomes consist of SNPs plus a few indels of one to several base pairs (Table 2). In addition to these small variations, more important genetic changes such as the replacement of a plasmid, the loss of phage- or plasmid- elements or the insertion of transposase (Fig. 2D) shaped the genetic diversity of the ST228 lineage that spread within our hospital. Similar acquisitions/losses of mobile genetic elements (MGEs) (e.g., SCCmec or bacteriophages) have also been observed in other lineages such as ST5 or ST225 [40], [41]. These and our results indicate that the transfers of MGEs occur locally and probably much more often than previously thought. The impact of such events on the fitness, virulence or epidemicity of the strain is difficult to evaluate. However, it is highly likely that two strains sharing the same genetic background but harboring different plasmids or phages will exhibit differences in their phenotypes.
Interestingly, the survey of MRSA at the tertiary care hospital of Lausanne revealed an important dynamic of clone ST228 over time with a few variants that became dominant while others occurred only sporadically (Fig 1, [7]). Yet, no link between the genetic variations described above and the spreading success of the different variants belonging to ST228 could be established even though the entire genome was scanned. In 2001, 2003 and 2008, outbreaks associated with the clone ST228 took place at the hospital but we found no MGEs (Fig. 2D) or mutations specific of the isolates (I12001, I22001, I72008 and I82008) responsible for these outbreaks. This suggests that the sudden success of some variants is probably not explained by the acquisition or loss of a particular genetic element that is transmitted or inherited from one variant to the others. These results, however, do not mean that the variation in spreading success of the ST228 variants has no genetic basis. First, the genetic factors explaining the spreading success of MRSA might differ from one variant to another. If so, the identification of these factors is especially challenging. Searching for a genetic element common to several strains (e.g., epidemic strains) reduces considerably the number of candidate genes or elements. This is well illustrated by the recent discovery in most strains of the community acquired MRSA USA300 clone of the arginine catabolic mobile element (ACME), which is suspected to favor survival and spreading capacities [37]. In contrast, if epidemicity factors are strain specific, the number of genetic elements that need to be considered is much higher, therefore reducing considerably their possible identification. Second, variations in success of different isolates may also be explained by the acquisition of deleterious genetic changes reducing the spreading capacity of strains originally successful. This hypothesis is supported by the European-wide distribution of the clone ST228 [42] suggesting it has intrinsically a high spreading capacity. It is probable that the genetic changes that have accumulated over time imply more often a burden for the ST228 strains than an improvement of their fitness. Finally, the spreading success of different variants might also be explained by differences in environmental conditions rather than by intrinsic genetic capacities. The hospital setting is complex and multiple parameters such as the use of antimicrobial, the comorbities of the patients, infection control measures (reviewed in [43]), bed occupancy [44] or the number of nurses per patients [44] also affect the dynamic of a clone. We did not notice such changes in our hospital during the last decade. However, variations occurring over very short periods of time or localized in small areas of the hospital in association with stochastic events might create conditions that favor the spread of a strain.
Our last aim was to determine the potential of next generation technologies for very local epidemiology. These technologies are expected to answer fundamental questions about local evolution of pathogens and provide new tools to trace the spread of an isolate both locally and internationally [45]–[49]. The typing methods currently used for studying MRSA (reviewed in [45], [48], [50]) tend to lack the discrimination required for studies performed at local scales. Our study showed that whole genome sequencing provides much more information about the differences characterizing our eight isolates (Table 2, Fig. 2D) than common typing methods; but it is not clear if this additional information is really useful to answer epidemiological issues at very local scales.
Although the likelihood mapping analyses perceived a phylogenetic signal in our data, the low number of SNPs observed in our study and the probable presence of recombination did not permit to obtain reliable information about the real relationship among isolates. Most branches of the trees were not well supported and the tree topology varied depending the part of the genome included in the analyses (core genome based on the differences with N315 vs based on the eight isolates, Fig. 2B and 2C respectively) or the algorithm chosen for the alignment (data not shown). Additionally, the fact that the isolates I72008 and I82008 did not cluster together on the trees although they share several significant differences (Fig. 2D) also raised some doubts about the phylogenetic value of these trees. So far, most studies investigating the evolution of pathogens analyzed isolates recovered from the global range of distribution of the clone or species of interest [16], [17], [40], [41]. These studies highlighted geographical signatures in the relationships among isolates. However, this pattern is most likely detectable because the strains that are transferred over large geographical distances inevitably go through bottlenecks followed by rapid expansion enhancing genetic differences among geographical areas [41]. Such signature has not been detected in our study probably because we investigated the evolution of a clone within a single hospital. This local evolution is also suggested by the close relationship among our eight isolates compared to N315 (Fig. 2A). Nevertheless, to confirm the local evolution of our isolates it is crucial to compare the genomes of our isolates with the genomes of other ST228 isolates collected in different hospitals. Such analyses would allow to i) establish the global genetic diversity of the lineage, and ii) determine the within versus between hospitals genetic variations. This information is essential to establish the ratio between local evolution and transfers from other regions. Consequently, as illustrated by the study of clone ST239 [16], it is important to know the genetic background of a lineage at different evolutionary levels for a reliable interpretation of whole genome sequencing data in epidemiological tracking. Thus, understanding the fine scale epidemiology of a particular clone with new technologies requires important investments that need to be adapted for every genetic lineage. In this context, the multiplication of large scale sequencing projects will probably provide important information about many such lineages and will facilitate the use of whole genome sequencing in local epidemiological investigations.
Conclusion
The sequencing and the genome comparison of eight isolates belonging to a lineage that spread over ten years within a tertiary care hospital revealed that these genomes were extremely closely related. The local evolution of this lineage is mainly driven by point mutations accompanied by few MGE transfers. The rapid evolution of MRSA lineages and their potential phenotypic variations are probably more explained by these MGEs transfers rather than single mutations. Nevertheless, the sudden occurrence of several outbreaks of lineage ST228 at the Lausanne tertiary care hospital did not seem to be explained by a single particular genetic element. Finally, the present study showed that new sequencing technologies improve considerably the quality and quantity of information obtained for a single strain; but this information is still difficult to interpret and needs to be placed into a broader context. Important investments are still necessary for this technology to become useful in routine investigations.
Supporting Information
Figure S1.
Likelihood Mapping Analysis for the core genomes with and without the strain N315. Phylogenetic noise was calculated using likelihood mapping analysis as implemented in TREE-PUZZLE [32]. A. Partitioning of the triangle in the seven basins of attraction (see [33] for details). Three basins correspond to fully resolved topologies (A1 A2 and A3), one represents star-like evolution (A*) and the three last basins (A12, A13, and A23) reflect the situation where it is difficult to distinguish between two of the three trees. The occupancies (in percent) of the seven basins of attraction for the core genome including the reference strain N315 (B) and for the core genome based exclusively on the eight isolates of ST228 (C).
https://doi.org/10.1371/journal.pone.0038969.s001
(DOCX)
Figure S2.
Neighbor-nets for the core genomes with and without the strain N315. Neighbor-nets for the core genomes with (A) and without (B) the strain N315 were inferred using uncorrected p-distances. P-values are shown for the PHI test for recombination, where the alpha value is p = 0.001.
https://doi.org/10.1371/journal.pone.0038969.s002
(DOCX)
Acknowledgments
We are grateful to C. Choulat for technical assistance. The sequencing was performed by the Genomics Technology Facility of the University of Lausanne. The computations were performed at the Vital-IT (http://www.vital-it.ch) Center for high-performance computing of the Swiss Institute of Bioinformatics.
Author Contributions
Conceived and designed the experiments: VV PB DSB. Performed the experiments: VV LF. Analyzed the data: VV LF SPC. Contributed reagents/materials/analysis tools: DSB. Wrote the paper: VV PB.
References
- 1. Lowy FD (1998) Staphylococcus aureus infections. New England Journal of Medicine 339: 520.532
- 2. Woodford N (2005) Biological counterstrike: antibiotic resistance mechanisms of Gram-positive cocci. Clinical Microbiology and Infection 11: 2.21
- 3. Blanc DS, Petignat C, Moreillon P, Entenza JM, Eisenring M-C, et al. (1999) Unusual spread of a penicillin-susceptible methicillin-resistant Staphylococcus aureus clone in a geographic area of low incidence. Clinical Infectious Diseases 29: 1512.1518
- 4. Blanc DS, Francioli P, Le Coustumier A, Gazagne L, Lecaillon E, et al. (2001) Reemergence of gentamicin-susceptible strains of methicillin-resistant Staphylococcus aureus in France: a phylogenetic approach. Journal of Clinical Microbiology 39: 2287.2290
- 5. Wannet WJB, Spalburg E, Heck MEOC, Pluister GN, Willems RJL, et al. (2004) Widespread dissemination in the Netherlands of the epidemic Berlin methicillin-resistant Staphylococcus aureus clone with low-level resistance to oxacillin. Journal of Clinical Microbiology 42: 3077.3082
- 6. Aires de Sousa M, De Lencastre H (2004) Bridges from hospitals to the laboratory: genetic portraits of methicillin-resistant Staphylococcus aureus clones. FEMS Immunology & Medical Microbiology 40: 101.111
- 7. Blanc DS, Petignat C, Wenger A, Kuhn G, Vallet Y, et al. (2007) Changing molecular epidemiology of methicillin-resistant Staphylococcus aureus in a small geographic area over an eight-year period. Journal of Clinical Microbiology 45: 3729.3736
- 8. Enright MC, Robinson DA, Randle G, Feil EJ, Grundmann H, et al. (2002) The evolutionary history of methicillin-resistant Staphylococcus aureus (MRSA). Proceedings of the National Academy of Sciences 99: 7687.7692
- 9. Lindsay JA, Moore CE, Day NP, Peacock SJ, Witney AA, et al. (2006) Microarrays reveal that each of the ten dominant lineages of Staphylococcus aureus has a unique combination of surface-associated and regulatory genes. Journal of Bacteriology 188: 669.676
- 10. Sung JM-L, Lloyd DH, Lindsay JA (2008) Staphylococcus aureus host specificity: comparative genomics of human versus animal isolates by multi-strain microarray. Microbiology 154: 1949.1959
- 11. Fitzgerald JR, Sturdevant DE, Mackie SM, Gill SR, Musser JM (2001) Evolutionary genomics of Staphylococcus aureus: Insights into the origin of methicillin-resistant strains and the toxic shock syndrome epidemic. Proceedings of the National Academy of Sciences 98: 8821.8826
- 12. Holden MTG, Feil EJ, Lindsay JA, Peacock SJ, Day NPJ, et al. (2004) Complete genomes of two clinical Staphylococcus aureus strains: Evidence for the rapid evolution of virulence and drug resistance. Proceedings of the National Academy of Sciences of the United States of America 101: 9786.9791
- 13. Lindsay JA, Holden MTG (2004) Staphylococcus aureus: superbug, super genome? Trends in Microbiology 12: 378.385
- 14. Gill SR, Fouts DE, Archer GL, Mongodin EF, DeBoy RT, et al. (2005) Insights on evolution of virulence and resistance from the complete genome analysis of an early methicillin-resistant Staphylococcus aureus strain and a biofilm-producing methicillin-resistant Staphylococcus epidermidis strain. Journal of Bacteriology 187: 2426.2438
- 15. Kennedy AD, Otto M, Braughton KR, Whitney AR, Chen L, et al. (2008) Epidemic community-associated methicillin-resistant Staphylococcus aureus: Recent clonal expansion and diversification. Proceedings of the National Academy of Sciences 105: 1327.1332
- 16. Harris SR, Feil EJ, Holden MTG, Quail MA, Nickerson EK, et al. (2010) Evolution of MRSA during hospital transmission and intercontinental spread. Science 327: 469.474
- 17.
Gray RR, Tatem AJ, Johnson JA, Alekseyenko AV, Pybus OG (2010) Testing spatiotemporal hypothesis of bacterial evolution using Methicillin-Resistant Staphylococcus aureus ST239 genome-wide data within a Bayesian framework. Molecular Biology and Evolution. doi: 10.1093/molbev/msq319.
- 18. Mato R, Campanile F, Stefani S, Crisóstomo MI, Santagati M, et al. (2004) Clonal types and miltidrugs resistance patterns of methicillin-resistant Staphylococcus aureus (MRSA) in Italy during the 90s. Microbial Drug Resistance 10: 106.113
- 19. Witte W, Kresken M, Braulke C, Cuny C (1997) Increasing incidence and widespread dissemination of methicillin-resistant Staphylococcus aureus (MRSA) in hospitals in central Europe, with special reference to German hospitals. Clinical Microbiology and Infections 3: 412.422
- 20. Conceição T, Aires-de-Sousa M, Füzi M, Tóth Á, Pászti J, et al. (2007) Replacement of methicillin-resistant Staphylococcus aureus clones in Hungary over time: a 10-year surveillance study. Clinical Microbiology and Infection 13: 971.979
- 21. Grundmann H, Aanensen DM, van den Wijngaard CC, Spratt BG, Harmsen D, et al. (2010) Geographic distribution of Staphylococcus aureus causing invasive infections in Europe: A molecular-epidemiological analysis. PLoS Med 7: e1000215.
- 22. Langmead B, Trapnell C, Pop M, Salzberg S (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology 10: R25.
- 23. Kuroda M, Ohta T, Uchiyama I, Baba T, Yuzawa H, et al. (2001) Whole genome sequencing of meticillin-resistant Staphylococcus aureus. The Lancet 357: 1225.1240
- 24. Li R, Zhu H, Ruan J, Qian W, Fang X, et al. (2010) De novo assembly of human genomes with massively parallel short read sequencing. Genome Research 20: 265.272
- 25. Li H, Durbin R (2010) Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics 26: 589.595
- 26. Gordon D, Abajian C, Green P (1998) Consed: A graphical tool for sequence finishing. Genome Research 8: 195.202
- 27. Massouras A, Hens K, Gubelmann C, Uplekar S, Decouttere F, et al. (2010) Primer-initiated sequence synthesis to detect and assemble structural variants. Nature Methods 7: 485.486
- 28. Calderon-Copete S, Wigger G, Wunderlin C, Schmidheini T, Frey J, et al. (2009) The Mycoplasma conjunctivae genome sequencing, annotation and analysis. BMC Bioinformatics 10: S7.
- 29. Angiuoli SV, Salzberg SL (2011) Mugsy: fast multiple alignment of closely related whole genomes. Bioinformatics 27: 334.342
- 30. Stamatakis A, Hoover P, Rougemont J (2008) A rapid bootstrap algorithm for the RAxML Web servers. Systematic Biology 57: 758.771
- 31. Pattengale ND, Alipour M, Bininda-Emonds OR, Moret BM, Stamatakis A (2009) How many bootstrap replicates are necessary? Journal of Computational Biology 17: 337.354
- 32. Schmidt HA, Strimmer K, Vingron M, von Haeseler A (2002) TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18: 502.504
- 33. Strimmer K, von Haeseler A (1997) Likelihood-mapping: A simple method to visualize phylogenetic content of a sequence alignment. Proceedings of the National Academy of Sciences 94: 6815.6819
- 34. Huson DH, Bryant D (2006) Application of phylogenetic networks in evolutionary studies. Molecular Biology and Evolution 23: 254.267
- 35. Bruen TC, Philippe H, Bryant D (2006) A simple and robust statistical test for detecting the presence of recombination. Genetics 172: 2665.2681
- 36. Salemi M, Gray RR, Goodenow MM (2008) An exploratory algorithm to identify intra-host recombinant viral sequences. Molecular Phylogenetics and Evolution 49: 618.628
- 37. Diep BA, Gill SR, Chang RF, Phan TH, Chen JH, et al. (2006) Complete genome sequence of USA300, an epidemic clone of community-acquired meticillin-resistant Staphylococcus aureus. The Lancet 367: 731.739
- 38.
Magrane M, Consortium U (2011) UniProt Knowledgebase: a hub of integrated protein data. Database 2011.
- 39. Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL (2008) GenBank. Nucleic Acids Research 36: D25.D30
- 40. Nübel U, Roumagnac P, Feldkamp M, Song J-H, Ko KS, et al. (2008) Frequent emergence and limited geographic dispersal of methicillin-resistant Staphylococcus aureus. Proceedings of the National Academy of Sciences 105: 14130.14135
- 41. Nübel U, Dordel J, Kurt K, Strommenger B, Westh H, et al. (2010) A Timescale for evolution, population expansion, and spatial spread of an emerging clone of methicillin-resistant Staphylococcus aureus. PLoS Pathog 6: e1000855.
- 42. Monecke S, Coombs G, Shore AC, Coleman DC, Akpaka P, et al. (2011) A field guide to pandemic, epidemic and sporadic clones of methicillin-resistant Staphylococcus aureus. PLoS ONE 6: e17936.
- 43. Backman C, Taylor G, Sales A, Marck PB (2011) An integrative review of infection prevention and control programs for multidrug-resistant organisms in acute care hospitals: A socio-ecological perspective. American Journal of Infection Control 39: 368.378
- 44. Clements A, Halton K, Graves N, Pettitt A, Morton A, et al. (2008) Overcrowding and understaffing in modern health-care systems: key determinants in meticillin-resistant Staphylococcus aureus transmission. The Lancet Infectious Diseases 8: 427.434
- 45. Lazarevic V, Beaume M, Corvaglia A, Hernandez D, Schrenzel J, et al. (2011) Epidemiology and virulence insights from MRSA and MSSA genome analysis. Future Microbiology 6: 513.532
- 46. Jackson RW, Johnson LJ, Clarke SR, Arnold DL (2011) Bacterial pathogen evolution: breaking news. Trends in genetics : TIG 27: 32.40
- 47. Pallen MJ, Loman NJ, Penn CW (2010) High-throughput sequencing and clinical microbiology: progress, opportunities and challenges. Current Opinion in Microbiology 13: 625.631
- 48. Willems RJL, Hanage WP, Bessen DE, Feil EJ (2011) Population biology of Gram-positive pathogens: high-risk clones for dissemination of antibiotic resistance. FEMS Microbiology Reviews 35: 872.900
- 49. Aziz RK, Nizet V (2010) Pathogen microevolution in high resolution. Science Translational Medicine 2: 16ps14.
- 50. Deurenberg RH, Stobberingh EE (2008) The evolution of Staphylococcus aureus. Infection, Genetics and Evolution 8: 747.763
- 51. Kuhn G, Francioli P, Blanc DS (2007) Double-locus sequence typing using clfB and spa, a fast and simple method for epidemiological typing of methicillin-resistant Staphylococcus aureus. Journal of Clinical Microbiology 45: 54.62