Listeria monocytogenes is a model organism for cellular microbiology and host–pathogen interaction studies and an important food-borne pathogen widespread in the environment, thus representing an attractive model to study the evolution of virulence. The phylogenetic structure of L. monocytogenes was determined by sequencing internal portions of seven housekeeping genes (3,288 nucleotides) in 360 representative isolates. Fifty-eight of the 126 disclosed sequence types were grouped into seven well-demarcated clonal complexes (clones) that comprised almost 75% of clinical isolates. Each clone had a unique or dominant serotype (4b for clones 1, 2 and 4, 1/2b for clones 3 and 5, 1/2a for clone 7, and 1/2c for clone 9), with no association of clones with clinical forms of human listeriosis. Homologous recombination was extremely limited (r/m<1 for nucleotides), implying long-term genetic stability of multilocus genotypes over time. Bayesian analysis based on 438 SNPs recovered the three previously defined lineages, plus one unclassified isolate of mixed ancestry. The phylogenetic distribution of serotypes indicated that serotype 4b evolved once from 1/2b, the likely ancestral serotype of lineage I. Serotype 1/2c derived once from 1/2a, with reference strain EGDe (1/2a) likely representing an intermediate evolutionary state. In contrast to housekeeping genes, the virulence factor internalin (InlA) evolved by localized recombination resulting in a mosaic pattern, with convergent evolution indicative of natural selection towards a truncation of InlA protein. This work provides a reference evolutionary framework for future studies on L. monocytogenes epidemiology, ecology, and virulence.
Listeria monocytogenes is a pathogen transmitted through contaminated food and is responsible for severe infections, including meningitis and abortion in animals and humans. It is known that many distinct strains of this pathogen exist, and that they differ in their virulence and epidemic potential. Unfortunately, there is currently no standard definition of strains and no comprehensive overview of their evolution. To tackle these serious limitations to the control of listeriosis and to improve knowledge of how virulence evolves, we characterized a large collection of isolates with sequence-based genotyping methods. We were thus able to identify precisely the most prevalent clones of L. monocytogenes, i.e., groups of isolates that descend from a single ancestral bacterium, which can now be characterized further for diagnostic purposes and determination of their precise ecology and virulence potential. We also determined how these clones evolved from their common ancestor and the evolutionary history by which they acquired their phenotypic characteristics, such as antigenic structures. Finally, we show that some particular strains tend to lose a virulence factor that plays a crucial role in infection in humans. This is a rare example of evolution towards reduced virulence of pathogens, and the discovery of the selective forces behind this phenomenon may have important epidemiological and biological implications.
Citation: Ragon M, Wirth T, Hollandt F, Lavenir R, Lecuit M, Le Monnier A, et al. (2008) A New Perspective on Listeria monocytogenes Evolution. PLoS Pathog 4(9): e1000146. https://doi.org/10.1371/journal.ppat.1000146
Editor: Dan Dykhuizen, SUNY at Stony Brook, United States of America
Received: May 5, 2008; Accepted: August 7, 2008; Published: September 5, 2008
Copyright: © 2008 Ragon et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study received financial support from Institut Pasteur and the Institut de Veille Sanitaire (St-Maurice, France), and the GPH Program “Towards new therapeutics against low GC% Gram-positive bacteria”.
Competing interests: The authors have declared that no competing interests exist.
The opportunistic pathogen Listeria monocytogenes causes life-threatening infections in animal and in human populations at risk. This facultative intracellular bacterium is widespread in the environment and infections occur through ingestion of contaminated food ,. Although the species L. monocytogenes has long been known to be genetically diverse , with strains showing differences in their virulence potential –, detailed knowledge of strain diversity and evolution is still lacking.
Several methods have been used to differentiate L. monocytogenes strains . The Listeria serotyping scheme  based on somatic (O) and flagellar (H) antigens currently represents a common language for L. monocytogenes isolate typing and investigations into the ecological distribution, epidemiology and virulence of strains. Unfortunately, serotyping discriminates only 13 serotypes, many of which are known to represent genetically diverse groups of strains, and only four serotypes (1/2a, 1/2b, 1/2c, and 4b) cause almost all cases of listeriosis in humans . Given its higher discriminatory power, pulsed-field gel electrophoresis (PFGE) is considered accurate for epidemiological investigations and of help for surveillance and control of listeriosis ,, but fingerprint-based methods such as PFGE or ribotyping  are difficult to standardize. Hence, inter-laboratory comparisons necessitate considerable harmonization , which limits knowledge at the global scale. In addition, these widely used methods provide only limited information on the phylogenetic relationships among strains, which is a serious limitation to understand the evolution of important phenotypic traits such as virulence. Sequence-based or SNP-based approaches appear as promising tools for strain typing and phylogeny in L. monocytogenes –. Multilocus sequence typing (MLST) – can accurately define the clonal framework of bacterial species. MLST has been shown to discriminate among L. monocytogenes isolates ,,, but has not yet been applied on a large scale, and an overview of the clonal structure of L. monocytogenes is currently not available. The molecular factors that determine ecological differences among strains are also poorly understood.
One salient feature of the population structure of L. monocytogenes is the distinction of three phylogenetic lineages. Initially, two major lineages were distinguished, mainly based on multilocus enzyme electrophoresis and PFGE ,,,,, with a third lineage being subsequently recognized based on virulence gene variation, ribotyping and DNA arrays –. Lineage I includes isolates of serotypes 4b, 1/2b, 3b, 4d and 4e, whereas lineage II includes serotypes 1/2a, 1/2c, 3a and 3c. Lineage III contains serotypes 4a and 4c, as well as serotype 4b as was recently discovered . The relative virulence and contribution of the three lineages and their serotypes to food contamination and clinical burden is subject of debate , , , –. As each lineage is genetically heterogeneous, a precise delineation of L. monocytogenes clones is needed to determine which ones mostly contribute to human or animal infection ,,, and this knowledge would set a landmark for further studies on the biological characteristics of the clones and the evolution of molecular mechanisms by which they cause disease .
Several virulence genes play an important role in the virulence of L. monocytogenes strains ,. Internalin (InlA) is a surface protein that mediates the entry of L. monocytogenes into various non-phagocytic human eukaryotic cells expressing its receptor E-cadherin , and plays a key role in the crossing of the intestinal barrier, enabling the bacterium to reach the host bloodstream . Almost all isolates causing listeriosis in humans express a full-length functional InlA, whereas isolates expressing a truncated form are frequently found in food items and the environment and are associated with a lower virulence potential . Currently, the ecological factors that drive the evolution of these apparently attenuated strains are unknown. Evolution of virulence would be best understood by mapping the variation of virulence genes such as inlA, onto the phylogenetic framework of the genomes in which they are presently distributed.
The aims of this study were to provide a robust phylogenetic framework based on MLST analysis of a highly diverse isolate collection and determine (i) the population structure of L. monocytogenes; (ii) the evolutionary origin and stability of serotypes; and (iii) the patterns of variation of the virulence gene inlA with respect to the evolution of the core genome.
Materials and Methods
A total of 360 Listeria monocytogenes and four L. innocua isolates were selected from the collections of the French National Reference Centre for Listeria and the WHO Collaborative Centre for foodborne listeriosis (Table S1). These 360 L. monocytogenes isolates were subdivided in three subsets, each being included in order to address specific questions: (i) a diversity subset of 171 isolates, which included representative isolates of the distinct L. monocytogenes serotypes, atypical strains from lineage III, isolates that caused major epidemics throughout the world, strains for which the complete genome sequence is available, 75 historical strains collected from 1924 to 1966 and belonging to H.P.R. Seeliger Listeria Culture Collection (Würzburg, Germany), isolates from the environment, food or animals, and research strains from several countries used in previous studies involving the Institut Pasteur Listeria laboratory (Table S1); (ii) 126 isolates selected from maternal-fetal cases, collected prospectively and exhaustively from 1987 to 2005 (i.e., 5 to 10 epidemiologically non-related isolates randomly selected per year), and which were included to probe the temporal dynamics of clone prevalence (‘MF chronological’ subset in Table S1); and (iii) 63 isolates from year 2000, including 25 from bacteremia, 20 from central nervous system (CNS) infection, and 18 from maternal-fetal infection, which were included to investigate the possible association of specific clones with clinical forms (subset ‘Human clinical, 2000’ in Table S1).
Isolates were identified as L. monocytogenes using API Listeria strips (BioMerieux, La Balme Les Grottes, France). Identification was confirmed and subdivided into serotypes by classical serotyping , which distinguishes 13 serotypes, and multiplex PCR , which groups L. monocytogenes isolates into four major groups (IIA, IIB, IIC et IVB) corresponding to groups of serotypes (Table S1).
Multilocus Sequence Typing
The MLST scheme used to characterize Listeria strains is based on the sequence analysis of the following seven housekeeping genes: acbZ (ABC transporter), bglA (beta-glucosidase), cat (catalase), dapE (Succinyl diaminopimelate desuccinylase), dat (D-amino acid aminotransferase), ldh (lactate deshydrogenase), and lhkA (histidine kinase). This MLST scheme was adapted from the MLST system proposed by Salcedo and colleagues , with the following modifications. First, the template for gene ldh was extended from 354 to 453 nucleotides, thus improving strain discrimination. Second, gene templates were shortened because the extremities of the previous templates correspond to parts of the PCR primer sequences, thus possibly not corresponding totally to the genomic sequence of the isolates analyzed. Third, we incorporated universal sequencing tails to the PCR primers (Table 1), which allows to sequence PCR fragments of all genes using only two primers. DNA extraction was performed by the boiling method . The PCR amplification conditions were as follows: an initial cycle of 94°C for 4 min; 25 amplification cycles, each consisting of 94°C for 30 s, 52°C for 30 s (except for bglA which has an annealing temperature of 45°C), and 72°C for 2 min; and a final incubation at 72°C for 10 min. The PCR products were purified by ultrafiltration (Millipore, France) and were sequenced on both strands with Big Dye v.1.1 chemistry on an ABI3730XL sequencer (Applied BioSystems).
inlA gene sequencing
The 2,400 bp long inlA gene was sequenced from 157 isolates (Table S1) representing the clonal diversity of L. monocytogenes (see below). DNA extraction was performed with the Wizard® kit (Promega Corporation, USA). The PCR amplification conditions were as follows: an initial cycle at 94°C for 5 min; 35 amplification cycles, each consisting of 94°C for 30 s, 55.2°C for 30 s, and 72°C for 1 min 30; and a final incubation at 72°C for 10 min. We used external primers for amplification and internal primers for sequencing (Table 1), which was performed as described above.
For each MLST locus, an allele number was given to each distinct sequence variant, and a distinct sequence type (ST) number was attributed to each distinct combination of alleles at the seven genes. Numbers were initially based on highest frequency for the frequent alleles and STs, and were subsequently incremented arbitrarily. In order to define the relationships among strains at the microevolutionary level, we performed allelic profile-based comparisons using a minimum spanning tree (MST) analysis with the BioNumerics v5.10 software (Applied-Maths, Sint Maartens-Latem, Belgium). MST analysis links profiles so that the sum of the distances (number of distinct alleles between two STs) is minimized . Strains were grouped into clonal complexes (clonal families), defined as groups of profiles differing by no more than one gene from at least one other profile of the group . Accordingly, singletons were defined as STs having at least two allelic mismatches with all other STs.
Neighbor-joining tree analysis was performed using MEGA v4  or SplitsTree v4b06 . Calculations of recombination tests were performed using RDP3 . Nucleotide diversity indices were calculated using DNAsp v4 . ClonalFrame analysis  was performed with 50,000 burn-in iterations and 100,000 subsequent iterations.
To test for phylogenetic congruence among genes, one strain of all 39 STs with allelic mismatch distance >0.65 was used in order to exclude the expected congruence among genes at small evolutionary scale due to common clonal descent, as proposed previously . Neighbor-joining trees were generated using PAUP* v4  for each gene individually and for the concatenated sequence of the seven genes. For each gene, the differences in log likelihood (Δ−ln L) were computed using PAUP* between the tree for that gene and the trees constructed using the other genes, with branch lengths optimized . These differences were compared to those obtained for 200 randomly generated trees.
The relative contribution of recombination and mutation on the short term was calculated using software MultiLocus Analyzer (Brisse, unpublished) and the simplest implementation of the clonal diversification method ,. For each pair of allelic profiles that are closely related, the number of nucleotide changes between the alleles that differ is counted. A single nucleotide difference is considered to be likely caused by mutation, whereas more than one mutation in the same gene portion is considered to derive from recombination, as it is considered unlikely that two mutations would occur on the same gene while the other genes remain identical. No correction was made for single nucleotide differences possibly introduced by recombination.
We used the linkage model in Structure  to identify groups with distinct allele frequencies . This procedure assigns a probability of ancestry for each polymorphic nucleotide for a given number of groups, K, and also estimates q, the combined probability of ancestry from each of the K groups for each individual isolate. We chose three groups for this report because repeated analyses (200,000 iterations, following a burn-in period of 100,000 iterations) with K between 1 and 10 showed that the model probability increased dramatically between K = 2 and K = 3 and only slowly thereafter.
The population recombination rate was estimated by a composite-likelihood method with LDhat . LDhat employs a parametric approach, based on the neutral coalescent, to estimate the scaled parameter 2Ner where Ne is the effective population size, and r is the rate at which recombination events separate adjacent nucleotides. The crossing-over model L was used for the analysis of biallelic sites.
We also tested for presence of positively selected sites using the software omegaMap . This program applies a coalescence-based Bayesian strategy that co-estimates the rate of synonymous vs. non-synonymous substitions ω and the population recombination rate ρ, thus circumventing the high rate of false positives arising from incongruent phylogenies . The following prior distributions were used for the analyses: μ, κ and Φindel: improper inverse, ω: inverse with range 0.0001–10, ρ: inverse with range 0.01–10. The variable block model was chosen for both ω and ρ, with block sizes of 10 and 30, respectively. We created 10 subsets of 50 randomly drawn inlA sequences each, and analyzed each subset with 50,000 iterations and 10 reorderings. The first 20,000 sequences were discarded as burn-in period.
Sequences generated in this study are available at www.pasteur.fr/mlst for the seven MLST genes. inlA sequences have been deposited in GenBank/EMBL/DDBJ databases under the accession numbers FM178779 to FM178796 and FM179771 to FM179785. Alleles of the seven MLST genes were deposited under the accession numbers FM180227 to FM180445.
The majority of clinical isolates of L. monocytogenes belong to seven distinct clones
The seven gene portions, sequenced in the 360 L. monocytogenes isolates, harbored a total of 438 polymorphisms (13.3%; range 7.01%–17.7% per gene) consisting in bi-allelic (404 sites), tri-allelic (32 sites) or four-allelic (2 sites) single nucleotide polymorphisms (SNPs). The average nucleotide diversity π was 2.91%, ranging from 1.18% to 5.98% per gene (Table 2). The GC% observed in all alleles ranged from 36.5% to 43.3%, consistent with the 39% value observed across the entire L. monocytogenes EGDe genome . The 126 resulting allelic profiles (or sequence types, STs) were distributed into twenty-three clonal complexes (CC) and 22 singletons (Figure 1). Five CCs (CC2 to CC4, CC7 and CC9) consisted of a central prevalent genotype associated with several much less-frequent single locus variants (SLVs). CC1 was slightly more diverse, as its central genotype had two SLVs that themselves were associated to other variants. ST5 stood out among all singletons by its high frequency. Each of these CCs and singletons is likely to have descended from a single ancestral bacterium, i.e. corresponds to a clone. Remarkably, the seven above-mentioned CCs were well demarcated, as they differed by at least four genes out of seven among themselves (with the exception of CC2 and CC3, with three mismatches between one pair of STs) and by at least three mismatches from all other STs (Figure 1, inset A). Together, these seven clones comprised 58 (47%) STs and 245 (69%) isolates, and included 73% of the 252 recent (after 1987) clinical isolates. Five of these clones belonged to lineage I (see below) and comprised 177 of 203 (87%) isolates of this lineage. Other frequent clones were CC6, CC8 and CC101, together representing 32 (9%) additional isolates. Reference strains of large outbreaks and genome sequencing project strains were mapped on the disclosed MLST diversity (Figure 1; Table S1); for example, ST1, ST6 and ST11 include reference strains of epidemic clones I, II and III ,, respectively.
Each circle corresponds to a sequence type (ST). Grey zones surround STs that belong to the same clonal complex (CC; 24 CCs are visible in total). ST numbers are given inside the circles and are enlarged for the central genotypes that define the major CCs (e.g., ST9 defines the central genotype of CC9). The three major lineages are highlighted by polygons. Four L. innocua sequence types are also represented (black circles). The lines between STs indicate inferred phylogenetic relationships and are represented as bold, plain, discontinuous and light discontinuous depending on the number of allelic mismatches between profiles (1, 2, 3 and 4 or more, respectively); note that discontinuous links are only indicative, as alternative links with equal weight may exist. There were no common alleles between the three major lineages, L. innocua, ST161 (CLIP98) and ST157 (CLIP85); they are arbitrarily linked through ST7 by default. Circles and sectors were colored based on serotyping data according to the provided legend; in addition, rare serotypes (3a, 3c, 4d, 4e) are indicated directly on the Figure. Note that for simplicity, the serotype of strains that were serotyped by the PCR method (Table S1) was equated to the most frequent serotype of each PCR group (e.g., 1/2a for PCR group IIA). STs in which truncated forms of InlA were found are indicated by a black triangle, with the position of the premature stop codons given after letter Δ. The ST of reference genome strains is indicated. The positioning of H7858 (ECII) is based on 6 genes only, as gene dat is incomplete. Inset A. Crosslinks corresponding to one or two allelic mismatches are indicated. Note the absence of links among major clonal complexes, indicative of their neat demarcation. Circles were colored by grey levels according to the number of isolates. Inset B. Correlation between the number of allelic mismatches (number of distinct alleles between MLST profiles) and the average number of nucleotide differences at distinct alleles. Note the regular positive trend, which indicates that L. monocytogenes genotypes diverge predominantly by a mutational process . Allelic mismatch values of 7 correspond mostly to inter-lineages comparisons.
Remarkably, most isolates within a given clone had the same serotype, or a restricted set of serotypes. CC1 and CC2 were dominated by isolates of serotype 4b, and included all isolates of serotypes 4d and 4e. CC3 comprised a large proportion (19/42, 45%) of isolates of serotype 1/2b, and included all isolates of serotypes 3b, and all but one (ST75) isolates of serotype 7. These results suggest that serotypes 4d and 4e each derived at least twice from 4b ancestors, consistent with previous data ,, whereas isolates of serotypes 3b and 7 (excepted ST75) may be regarded as serotypic variants of serotype 1/2b CC3 isolates. CC4 (serotype 4b), ST5 (1/2b), CC6 (4b), CC7 (1/2a), CC101 (1/2a) and CC102 (4b) were each homogeneous with respect to serotype. Finally, CC9 included all isolates of serotype 1/2c, indicating that this serotype is genetically homogeneous. Notably, the virulent strain EGDe of serotype 1/2a also fell into CC9. EGDe only differs from ST9 (1/2c) by dapE (allele dapE-20 instead of dapE-4 in ST9), while it differs from all other 1/2a strains by several genes. CC9 also comprised the only included isolate of serotype 3c.
Among isolates from human cases of listeriosis, we sought to determine the possible association between clones and clinical sources of the isolates. To eliminate the possible effect of the temporal variation (see below), we compared L. monocytogenes isolates from a single year (year 2000) and the three major clinical presentations in humans: bacteremia (n = 25), CNS infections (n = 20), and maternal-fetal infections (n = 18). These isolates corresponded to 28 STs, distributed into 7 CCs and 13 singletons (Table S1). There was no association of particular CCs or ST with clinical presentation: the 11 STs with more than one isolate were encountered in at least two clinical sources, and isolates from prevalent CCs or STs were equally isolated from the three clinical forms (Table S1).
Possible trends in the relative prevalence of CCs over time were investigated based on 126 isolates from maternal-fetal cases of listeriosis, collected from 1987 to 2005 (Table S1). These isolates (Table S1) fell into 43 STs and were grouped into 7 CCs and 14 singletons. Four CCs (CC1 to CC4) and two singletons (ST5 and ST9) comprised more than 10 isolates. Numbers of isolates of each of these clones over the 19 year period showed distinct patterns of temporal dynamics: while CC1 (4b) and ST9 (1/2c) were sampled equally over the entire period, CC3 (1/2b-3b-7) shows a clear decrease in prevalence (16 isolates before 1995, 2 isolates after; Chi2 p<0.001). In contrast, ST5 (1/2b) was isolated only once before 1997 but 12 times in the second period (p = 0.02). Similarly, CC2 (4b) showed an apparent increase in prevalence (2 vs. 9, p = 0.034).
Homologous recombination is rare in Listeria monocytogenes
Divergence among genotypes appeared to be mainly driven by the progressive accumulation of mutations over time, as strains diverge from their common ancestor (Figure 1, inset B). Congruence among the seven individual gene phylogenies obtained for the distantly related STs  was statistically significant (p<0.005), as assessed by the likelihood method . Similarly, the short-term contribution of recombination to genotypic diversity was modest, as L. monocytogenes alleles are five times more likely to change by mutation than by recombination (r/m = 0.197). In addition, the r/m rate for nucleotides was 0.59, indicating that nucleotides are approximately twice more likely to change by mutation than by recombination. As an independent approach, the composite likelihood of r/m  on the concatenated sequence of the seven genes was 0.62 for lineage I, 0.47 for lineage II and 0 for lineage III. r/m values of some of the observed housekeeping genes exceeded 1, but lacked statistical significance (Table 3). Consistently, r/m was 0.81 as estimated using ClonalFrame .
In order to determine which lineages underwent recombination events that left a detectable footprint in extant strains, the nucleotide polymorphisms within the seven gene fragments were analyzed with structure ,, a Bayesian method that attempts to identify the ancestral sources of nucleotides. The ancestry of each isolate can be estimated as the summed probabilities of derivation from each ancestral group over all polymorphic nucleotides. structure recognized three clusters of strains within L. monocytogenes, which were largely homogeneous in terms of their ancestral sources of polymorphism (Figure 2A). However, a number of isolates are likely to have a mixed origin (Figure 2A), and this was confirmed statistically using RDP3 on the concatenated sequences (Table S2).
A) Proportions of ancestry in seven housekeeping genes of L. monocytogenes strains from three ancestral populations as inferred by the linkage model of structure. This plot shows one vertical line for each isolate in which the proportions of ancestry from the three sources are color-coded. For example, strain CLIP85 was inferred as having mixed ancestry, with approximately 75% of nucleotides originated from lineage III (blue), whereas 25% of them were inferred to have an origin in lineage II (green). A number of strains from lineage I have a small proportion of nucleotides with ancestry in lineage III, while strains of lineage II had some nucleotides from lineages I (red) or III (blue). The case of CLIP98 is particular, as it was inferred as deriving from lineage II by imports mainly from L. innocua (see text). B) Neighbor-joining phylogenetic analysis of concatenated housekeeping gene sequences, using the Tamura-Nei+G+I model. The three major L. monocytogenes lineages are recognized. Together, they included all strains except CLIP85 and CLIP98. Bootstrap support of lineages is given at corresponding branches. An asterisk (*) marks strains that were recognized by structure to contain a fraction of nucleotides imported from another ancestral population, with corresponding branches colored according to the source of the recombined nucleotides. Recombination events detected independently using with RDP3 are numbered from 1 to 7 (referring to Table S2), and the involved genes are indicated.
Phylogenetic structure of L. monocytogenes and evolutionary origin of serotypes
Because recombination events, even if they are rare, can strongly distort phylogenetic reconstruction, we took into account potential recombination events using ClonalFrame (Figure 3). The majority-rule consensus tree revealed three major branches, which could be equated to the three currently recognized L. monocytogenes lineages I, II and III, as deduced from serotyping data and inclusion of reference strains. In particular, strains with serotypes 4b and 1/2b fell into lineage I, serotypes 1/2a and 1/2c were associated with lineage II, whereas serotypes 4a and 4c belonged to lineage III. The neighbor-joining (NJ) method (Figure 2) retained the three major lineages, which were also consistent with the three major groups revealed by Structure. However, the obtained branching pattern was conspicuously distinct for those isolates that underwent recombination events. The most conspicuous example was isolate CLIP85, which was clearly associated with lineage III (Figure 3), but not in the NJ tree (Figure 2B). This difference could be attributed to horizontal transfer of lhkA from lineage II into CLIP85, as detected with high probability by ClonalFrame (Figure 3E). Likewise, strains that were inferred to have mixed ancestries (Figure 2A) were placed at the tip of relatively longer branches on the NJ tree (Figure 2B) than on the tree derived from ClonalFrame tree (Figure 3).
A) 50% consensus tree obtained after 100,000 iterations (after 50,000 burn-in) for the 130 distinct Listeria STs. L. monocytogenes appears monophyletic, with three distinct lineages and one strain (CLIP98) considered as a fourth lineage. Note the close association of CLIP85 with other lineage III strains. B) Detailed view of the inferred relationships within lineage I. Note the monophyly of serotype 4b. C) Detailed view of the inferred relationships within lineage II. Note that all strains of serotype 1/2c (CC9) are nested inside the diversity of 1/2a, and that serotype 1/2c seems to have evolved from 1/2a just after the split from strain EDGe. D) Detailed view of the inferred relationships within lineage III. E) Events on the branch that separates CLIP85 (ST157) from the rest of lineage III. Note the high number of nucleotide changes in lhkA (seventh gene), inferred by clonalFrame to correspond to a single recombination event.
One exceptional isolate, CLIP98 (serotype 1/2a) isolated from a human blood infection in Canada, was placed at the tip of a long branch, thus representing an apparent fourth lineage. Individual gene genealogies based on the neighbor-joining method also placed CLIP98 outside the three lineages, except for genes dat and lhkA, which clearly associated CLIP98 with lineage II (not shown). Close inspection of the sequence alignment showed that a large proportion of nucleotide changes that distinguished CLIP98 from lineage II strains were clustered in a small number of short segments and corresponded to nucleotide bases also observed in L. innocua strains.
The phylogenetic relationships within lineage I (Figure 3B) suggest that serotype 4b is monophyletic, since all strains of this serotype formed a unique branch. Differently, serotype 1/2b is paraphyletic, as it was encountered in two distinct branches, one of which is branching off early in the history of lineage I. Within lineage II, serotype 1/2a was paraphyletic, whereas 1/2c was monophyletic (Figure 3C). Notably, the sequenced strain EGDe (1/2a) appears to branch off just before the evolutionary change from 1/2a to 1/2c.
Evolution of inlA coding sequences: local recombination and convergence of truncated forms in defined clones
The 2,400-nt coding sequence of virulence factor InlA showed 162 (6.7%) polymorphic sites and 33 alleles undergoing a distinctive pattern of evolution (Figure 4). First, in contrast to housekeeping genes, phylogenetic analysis of inlA sequences revealed a conspicuous pattern of intragenic homologous recombination. Visual inspection of the distribution of polymorphic sites in inlA revealed a mosaic structure (Figure 4), with several regions having estimated recombination rates one order of magnitude higher (ρ≥0.03) than for baseline regions (ρ≈0.003). Thus, each L. monocytogenes inlA sequence represents a composite assembly of short sequences with distinct evolutionary history, likely the result of multiple horizontal gene transfer events. Notably, in no case did we find fully- or nearly identical inlA sequences in unrelated STs, showing that horizontal transfer of entire inlA alleles is infrequent or non-existing, and that the entire inlA coding sequence is clone-specific. The short-term and long-term impacts of localized recombination in the inlA sequence were contrasted. Over the short term, inlA sequences clearly evolved more rapidly than housekeeping genes. For example, inlA sequences within lineage II evolved more than twice as fast (c = 2.13, r = 18%) as MLST genes. In contrast, over the long term, inlA sequence divergence was restricted, as housekeeping genes were on average more divergent between the three major lineages than are inlA sequences (e.g., 4.8% and 1.3%, respectively, between lineages I and III strains). This constraint on inlA sequence divergence resulted in the lack of phylogenetic demarcation of lineages I and III (Figure S1), contrasting sharply with housekeeping genes-based phylogeny (Figures 2 and 3). Thus, import of sequence stretches from other clones accelerates diversification of clones, while homogenizing inlA sequences among distantly related strains.
A) Distribution of the polymorphisms along the 2,400 nt of gene inlA in the 33 distinct inlA alleles encountered. The scale above the graph is in amino-acids (AA). InlA functional domains are represented as distinct blocks: signal peptide (SP), alpha-domain linker, leucine-rich repeats (LRR), Ig-like, B-repeat, Pre-anchor (PA) and cell wall anchor (CWA). Vertical bars above these blocks, correspond to synonymous nucleotide polymorphisms; below these blocks, non-synonymous polymorphisms resulting in amino-acid changes. Note that the LRR domain, especially repeats 7 to 15, is highly conserved. Triangles indicate the position of premature stop codons (PMSC) observed in this study and previous reports; black triangles: PMSCs observed in clonal complex CC9; red: PMSCs in other clones; see Table 4 for details. B) Deduced amino-acid polymorphisms in InlA. Lineages in which the inlA alleles were found are indicated on the left (1, 2 or 3). Blocks of amino-acids that are identical to the sequence in reference strain EGDe (allele 8) are color-shaded. Note the mosaic pattern, with blocks of polymorphisms shared between distinct groups of alleles when scrolling along the sequence.
Second, when the entire length of inlA was considered, purifying selection against amino-acid changes appeared more relaxed than for housekeeping genes, with a Ka/Ks ratio for inlA (0.094) higher than for the seven concatenated MLST genes (0.039 for the same 157 isolates). However, the distribution of ω (the Bayesian estimate of the rate of synonymous vs. non-synonymous substitutions) along the sequence was heterogeneous, with a highly constrained LRR-region (ω≈0.04) and moderately constrained Ig-like and B-repeats regions, with peak values of 0.5. It is worth mentioning that no single stretch of the molecule displayed a significant signature of positive selection, a result that contradicts previous analyses in which recombination was not incorporated . All but one amino-acid changes found in the LRR domain were located in repeats 1 to 6 (Figure 4), suggesting a stronger constraint on repeats 7 to 15, which are more extensively involved in interactions with E-cadherin .
Truncated forms of InlA have been described and associated with reduced virulence , , , –. We found four distinct inlA alleles that had premature stop codons (PMSC) at positions 492, 539, 577 and 685 (Figures 1 and 4; Table 4). Remarkably, three of these four PMSCs occurred in isolates that belonged to CC9 (Figure 1). In addition, out of eight previously reported inlA sequences leading to truncated forms, five were identical or nearly identical (<2 SNPs) to the inlA sequences that are specific of clone CC9, strongly suggesting that they first occurred in isolates of clone CC9 as well. The remaining PMSC (at codon 492) was observed in ST121 (1/2a), and three other PMSCs from previous reports were also observed in inlA alleles unrelated to those in CC9 (Table 4).
The phylogenetic structure of L. monocytogenes was investigated to provide a framework for the evolutionary history, epidemiology and virulence of this model pathogen. We based our analysis on an update of the MLST scheme proposed by Salcedo et al. . Alternative sets of genes were used previously , –, but were either not extensively validated , biased towards high levels of nucleotide diversity between two particular lineages  or based at least in part on virulence genes ,,. Virulence-associated genes generally provide improved discrimination among strains, but may reflect ecological adaptation and selection. In contrast, housekeeping genes are considered more appropriate to obtain an unbiased view of the population structure, as their polymorphisms can be considered nearly neutral and are less subject to horizontal transfer.
Clonal structure of L. monocytogenes
A majority of L. monocytogenes isolates belonged to limited number of major clones. Although these were defined based on allelic profiles, the same groupings were obtained by MST analysis based on nucleotide sequences (not shown), with the exception of the three STs (ST35, ST15 and ST74) inferred to derive by a recombination event that changed more than two SNPs. Hence, no significant loss of information was incurred by collapsing nucleotide sequence information into allelic profiles, as expected given limited amounts of recombination. Interestingly, the major clones were almost always separated from other strains by at least three allelic mismatches, indicating ancient divergence. Hence, a relaxed criterion (e.g., two allelic mismatches) would have little impact on the assignment of L. monocytogenes isolates to particular clones. Given the neat delineation of the major clones and the fact that they account for a large proportion of clinical L. monocytogenes isolates, we propose that these genetic entities represent reference units for future studies on strain virulence, ecology or epidemiology.
For global population biology and international surveillance purposes, a definitive strain typing scheme is greatly needed . The present MLST data represent a unifying language on clone characterization in L. monocytogenes and are freely available for comparison at www.pasteur.fr/mlst. Other sequence-based strain characterization methods have been developed –,. Future determination of the relative power of MLST and these methods for discrimination among L. monocytogenes strains, as well as establishment of the correspondence among the sequence types they define, is required for optimal communication and will provide Listeria specialists with a choice of methods that may be suited for distinct purposes (e.g., fine-scale epidemiology versus long-term population biology).
Although our strain collection was not designed to address the important question of ecological or epidemiological differences among L. monocytogenes clones, we found different distributions of serotypes and clones among animal, environmental and clinical isolates that are consistent with previous reports ,; for example, serotypes 4b clones mostly included isolates from clinical or food sources, whereas they are rare among isolates from animals (4 out of 21) or the environment (none out of 9). However, further MLST studies should be performed with ecologically representative collections of isolates. Likewise, our initial temporal analysis of maternal-fetal isolates suggests the existence of temporal shifts in prevalence of clones over relatively short periods of time (19 years), but a larger longitudinal survey is needed to provide a clearer picture of the temporal dynamics of these clones. Finally, the disclosed diversity is based largely on isolates from France, and a worldwide collection may therefore reveal additional diversity. However, it is noteworthy that many reference strains from outbreaks in other countries and continents belonged to clones defined by French isolates (Figure 1). It is also important to remember that L. monocytogenes is an environmental saprophyte, which does not need to infect animals for survival and propagation. The diversity of clinical isolates may thus only represent a particular subset of the entire diversity of this species.
Evolution of serotypes
The evolutionary relationships of serotypes within the major lineages have not been previously defined, although this knowledge is particularly important for correct interpretation of serotyping data. We found that all serotype 4b strains belonged to a unique branch, consistent with early MLEE data  and recent sequence data . This result indicates that serotype 4b appeared only once during the evolution of lineage I. Hence, the apparent increased potential of serotype 4b strains to cause outbreaks may be explained by genetic characteristics that evolved ancestrally, before the diversification of serotype 4b into several clones that appear highly successful among clinical isolates. The fact that the 4b branch is nested within a larger diversity made of 1/2b strains suggests that serotype 1/2b is more ancestral than 4b, and possibly represents the ancestral serotype of lineage I. Likewise, within lineage II, paraphyletic serotype 1/2a clearly stands as the most likely ancestor of lineage II, with serotype 1/2c having evolved more recently (Figure 3C). As a consequence, our results contradict the DNA array-derived hypothesis that serotype 1/2c represents an ancestral state . Notably, strain EGDe (1/2a) appears to branch off from the ancestor of CC9 1/2c strains just before the evolutionary shift from 1/2a to 1/2c. Hence, strain EGDe may represent a genomic state close to the evolutionary link between 1/2a and 1/2c strains. While exhibiting an antigenic structure that retained ancestral characteristics, its full genomic content  is likely to be more related to that of 1/2c strains than to most other 1/2a strains, consistent with DNA array data ,.
Groups of distinct serotypes observed in isolates with the same ST or CC reflect relatively recent evolution of antigenic structures. Under our evolutionary scenario (Figures 1 and 2), serotypes 7 and 3b derived from serotype 1/2b in the branch corresponding to clone 3, whereas serotypes 4d and 4e each derived at least twice from distinct 4b ancestors, as also proposed recently . Likewise, serotype 3c may be derived from 1/2c strains (although the reverse, from 3c to 1/2c, cannot be excluded). All these evolutionary changes in serotype involve modification of the somatic antigens . In contrast, H (flagellar) antigens appear stable over evolutionary time, as we inferred a single within-lineage change of the H antigen, namely from antigen A to antigen D, corresponding to the evolution of 1/2c [antigenic formula I,II,(III):B,D] from 1/2a [I,II,(III):A,B] . Knowledge and subsequent characterization of the genetic determinants of somatic and flagellar antigenic structures , will provide molecular details on serotype evolution.
Phylogenetic structure of L. monocytogenes and recombination
The distinct phylogenetic methods used herein consistently identified three major lineages of L. monocytogenes strains, in agreement with a wide set of previous studies based on alternative markers –. The only exception was isolate CLIP98, placed at the tip of a long branch with weak relatedness to any lineage. However, we suggest that CLIP98 does not represent the first disclosed member of a fourth L. monocytogenes lineage. Instead, the fact that two genes of CLIP98 were typical of lineage II, whereas numerous polymorphisms in the other five genes were shared with L. innocua, suggests that CLIP98 could have a ‘hybrid’ genome derived from lineage II, but which has received from L. innocua donors enough recombined fragments to now appear poorly affiliated with its ancestral lineage. Because putative recombined segments are very small and often consist of only two or three SNPs, these imports were overlooked by ClonalFrame. As three nucleotides of CLIP98 (between positions cat 364–368) were also uniquely shared with lineage III strains, CLIP98 may correspond to a recombinant strain with multiple ancestries, possibly due to an increased capacity for incorporation of foreign DNA. Further genomic characterization of this isolate is needed to clarify these aspects.
L. monocytogenes lineages differed from L. innocua, their closest relative, by 11% of nucleotide positions on average. This is consistent with the monophyly of L. monocytogenes and does not support the hypothesis of a descent of L. innocua as a whole from L. monocytogenes . Amounts of diversity within the three lineages were 0.33%, 0.61% and 1.25%, respectively, consistent with data obtained by others ,. In contrast, the nucleotide divergence was 4.99% between lineages I and II, 5.3% between lineages I and III, and 7.57% between lineages II and III. Notably, there was not a single common allele among the three lineages, as well as between them and L. innocua, whereas strains within a particular lineage generally shared at least one allele. Hence, consistent with DNA hybridization data , the three major lineages correspond to clearly demarcated sequence clusters that fulfill the separateness criteria and divergence levels used in other bacterial groups to distinguish species –. However, as noted earlier , the issue of taxonomic revision of L. monocytogenes needs careful evaluation. In particular, improved sampling (especially from diversity-rich sources such as the environment) would be necessary to challenge the neat demarcation among lineages and characterize their ecology.
The rate of homologous recombination within bacterial species can differ widely from one species to another , and has a profound impact on the validity of phylogenetic analyses , on the evolutionary stability of genotypes , on biological features such as virulence , and on interpretation of typing data . Previous reports have indicated that L. monocytogenes undergoes low levels of recombination among housekeeping genes ,. Here, we quantified the relative impact of recombination and mutation at various levels of phylogenetic depth and found similar estimates with independent approaches. To our knowledge, L. monocytogenes is one of the bacterial species with the lowest rate of recombination ,,. Highly restricted levels of recombination were disclosed in all three major lineages, contrasting with a previous proposal that rates of recombination vary among lineages . Full genome sequencing  suggested that competence genes are present in L. monocytogenes EDGe (CC9), but the regulatory genes for their expression have not been identified, and a noncoding RNA (RliE) may regulate negatively the L. monocytogenes orthologs required for competence in Bacillus subtilis . Yet, evidence that L. monocytogenes retained the ability for localized recombination is clearly provided by the inlA gene encoding InlA, in agreement with previous reports ,,. For this gene, recombination events may contribute to the acquisition by the recipient strain of a selective advantage, for example by escaping the immune response while retaining the ability to interact with its receptor. Such exchange could possibly occur in the intestinal lumen, in which multiple L. monocytogenes strains may coexist. Hence, rapid diversification of inlA sequences contrasts with the inferred high stability of clonal backgrounds defined by housekeeping genes. This illustrates how the phylogenetic structure based on MLST genes provides a scaffold, which sheds light onto the evolution of individual genes exposed to selective pressures, such as virulence genes.
Currently, the ecological significance of loss of a full-length InlA is not understood, and the clonal background in which these forms evolve have not been defined precisely. Little is known about the ecology of L. monocytogenes clones, but a realistic scenario is that different clonal families might be adapted to different niches, and their occurrence as mammalian pathogens may be of limited significance for their evolutionary success in the long term. Among the four alleles of gene inlA identified among ST9 isolates, the one corresponding to the non-truncated form (inlA-8) can be inferred to be ancestral, as the three inlA alleles corresponding to truncated forms (inlA-9, inlA-10 and inlA-16) differed by only one mutation from inlA-8, whereas they differed by two mutations from each other (Table 4). In addition, inlA-8 was also found in strain EGDe (ST35), which was inferred to branch off before diversification of other CC9 members (Figure 3). It is intriguing that InlA, an important bacterial factor for host colonization, was repeatedly lost by convergent evolution in the genetically homogeneous 1/2c ST9 genotype. Such a pattern can be explained either by a relaxed selective constraint on maintaining InlA function, or by a selective advantage provided by the loss of a functional InlA protein, in the ecological niche occupied by members of ST9. Determining the natural habitat of ST9 may provide clues as to why the expression of a virulence trait may in fact turn out to be disadvantageous in particular environments.
inlA phylogeny. Neighbor-joining tree of the 33 inlA alleles (midpoint rooting). Alleles resulting in truncated forms of the protein InlA are indicated by a delta letter followed by the position of the stop codon. Isolates in which the alleles were observed are indicated in Table S1. The lineages in which the alleles were observed are indicated by colored vertical bars on the right. Note the lack of separation of lineages I and III, as opposed to the results obtained with housekeeping genes (Figures 2 and 3).
(0.25 MB EPS)
We thank L. Diancourt and C. Tran for expert technical support and A. Leclercq and C. Buchrieser for their expertise on Listeria strains.
Conceived and designed the experiments: ALM SB. Performed the experiments: MR RL. Analyzed the data: MR TW FH SB. Wrote the paper: MR TW FH ML ALM SB.
- 1. Farber JM, Peterkin PI (1991) Listeria monocytogenes, a food-borne pathogen. Microbiol Rev 55: 476–511.
- 2. Schlech WF 3rd, Lavigne PM, Bortolussi RA, Allen AC, Haldane EV, et al. (1983) Epidemic listeriosis–evidence for transmission by food. N Engl J Med 308: 203–206.
- 3. Piffaretti JC, Kressebuch H, Aeschbacher M, Bille J, Bannerman E, et al. (1989) Genetic characterization of clones of the bacterium Listeria monocytogenes causing epidemic disease. Proc Natl Acad Sci U S A 86: 3818–3822.
- 4. Malinverni R, Glauser MP, Bille J, Rocourt J (1986) Unusual clinical features of an epidemic of listeriosis associated with a particular phage type. Eur J Clin Microbiol 5: 169–171.
- 5. Jacquet C, Doumith M, Gordon JI, Martin PM, Cossart P, et al. (2004) A molecular marker for evaluating the pathogenic potential of foodborne Listeria monocytogenes. J Infect Dis 189: 2094–2100.
- 6. Pohl MA, Wiedmann M, Nightingale KK (2006) Associations among Listeria monocytogenes genotypes and distinct clinical manifestations of listeriosis in cattle. Am J Vet Res 67: 616–626.
- 7. Brosch R, Catimel B, Milon G, Buchrieser C, Vindel E, et al. (1993) Virulence heterogeneity of Listeria monocytogenes strains from various sources (food, human, animal) in immunocompetent mice and its association with typing characteristics. J Food Prot 56: 293–301.
- 8. Liu D (2006) Identification, subtyping and virulence determination of Listeria monocytogenes, an important foodborne pathogen. J Med Microbiol 55: 645–659.
- 9. Seeliger HP, Hohne K (1979) Serotyping of Listeria monocytogenes and related species. Methods Microbiol 13: 31–49.
- 10. Brosch R, Chen J, Luchansky JB (1994) Pulsed-field fingerprinting of listeriae: identification of genomic divisions for Listeria monocytogenes and their correlation with serovar. Appl Environ Microbiol 60: 2584–2592.
- 11. Kerouanton A, Brisabois A, Denoyer E, Dilasser F, Grout J, et al. (1998) Comparison of five typing methods for the epidemiological study of Listeria monocytogenes. Int J Food Microbiol 43: 61–71.
- 12. Graves LM, Swaminathan B, Reeves MW, Hunter SB, Weaver RE, et al. (1994) Comparison of ribotyping and multilocus enzyme electrophoresis for subtyping of Listeria monocytogenes isolates. J Clin Microbiol 32: 2936–2943.
- 13. Graves LM, Swaminathan B (2001) PulseNet standardized protocol for subtyping Listeria monocytogenes by macrorestriction and pulsed-field gel electrophoresis. Int J Food Microbiol 65: 55–62.
- 14. Salcedo C, Arreaza L, Alcala B, de la Fuente L, Vazquez JA (2003) Development of a multilocus sequence typing method for analysis of Listeria monocytogenes clones. J Clin Microbiol 41: 757–762.
- 15. Zhang W, Jayarao BM, Knabel SJ (2004) Multi-virulence-locus sequence typing of Listeria monocytogenes. Appl Environ Microbiol 70: 913–920.
- 16. Ducey TF, Page B, Usgaard T, Borucki MK, Pupedis K, et al. (2007) A single-nucleotide-polymorphism-based multilocus genotyping assay for subtyping lineage I isolates of Listeria monocytogenes. Appl Environ Microbiol 73: 133–147.
- 17. Chen Y, Knabel SJ (2008) Prophages in Listeria monocytogenes contain single-nucleotide polymorphisms that differentiate outbreak clones within epidemic clones. J Clin Microbiol 46: 1478–1484.
- 18. Maiden MC, Bygraves JA, Feil E, Morelli G, Russell JE, et al. (1998) Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. Proc Natl Acad Sci U S A 95: 3140–3145.
- 19. Feil EJ (2004) Small change: keeping pace with microevolution. Nat Rev Microbiol 2: 483–495.
- 20. Feil EJ, Spratt BG (2001) Recombination and the population structures of bacterial pathogens. Annu Rev Microbiol 55: 561–590.
- 21. Paciorek J, Jacquet C, Salcedo C, Doumith M, Vazquez JA, et al. (2006) Genotypes of Listeria monocytogenes strains isolated from 2000 to 2002 in Poland. Pol J Microbiol 55: 31–35.
- 22. Nightingale KK, Windham K, Wiedmann M (2005) Evolution and molecular phylogeny of Listeria monocytogenes isolated from human and animal listeriosis cases and foods. J Bacteriol 187: 5537–5551.
- 23. Rasmussen OF, Beck T, Olsen JE, Dons L, Rossen L (1991) Listeria monocytogenes isolates can be classified into two major types according to the sequence of the listeriolysin gene. Infect Immun 59: 3945–3951.
- 24. Ripabelli G, McLauchin J, Threlfall EJ (2000) Amplified fragment length polymorphism (AFLP) analysis of Listeria monocytogenes. Syst Appl Microbiol 23: 132–136.
- 25. Rasmussen OF, Skouboe P, Dons L, Rossen L, Olsen JE (1995) Listeria monocytogenes exists in at least three evolutionary lines: evidence from flagellin, invasive associated protein and listeriolysin O genes. Microbiology 141(Pt 9): 2053–2061.
- 26. Wiedmann M, Bruce JL, Keating C, Johnson AE, McDonough PL, et al. (1997) Ribotypes and virulence gene polymorphisms suggest three distinct Listeria monocytogenes lineages with differences in pathogenic potential. Infect Immun 65: 2707–2716.
- 27. Ward TJ, Gorski L, Borucki MK, Mandrell RE, Hutchins J, et al. (2004) Intraspecific phylogeny and lineage group identification based on the prfA virulence gene cluster of Listeria monocytogenes. J Bacteriol 186: 4994–5002.
- 28. Doumith M, Cazalet C, Simoes N, Frangeul L, Jacquet C, et al. (2004) New aspects regarding evolution and virulence of Listeria monocytogenes revealed by comparative genomics and DNA arrays. Infect Immun 72: 1072–1083.
- 29. Jeffers GT, Bruce JL, McDonough PL, Scarlett J, Boor KJ, et al. (2001) Comparative genetic characterization of Listeria monocytogenes isolates from human and animal listeriosis cases. Microbiology 147: 1095–1104.
- 30. Bruhn JB, Vogel BF, Gram L (2005) Bias in the Listeria monocytogenes enrichment procedure: lineage 2 strains outcompete lineage 1 strains in University of Vermont selective enrichments. Appl Environ Microbiol 71: 961–967.
- 31. Nightingale KK, Lyles K, Ayodele M, Jalan P, Nielsen R, et al. (2006) Novel method to identify source-associated phylogenetic clustering shows that Listeria monocytogenes includes niche-adapted clonal groups with distinct ecological preferences. J Clin Microbiol 44: 3742–3751.
- 32. Hong E, Doumith M, Duperrier S, Giovannacci I, Morvan A, et al. (2007) Genetic diversity of Listeria monocytogenes recovered from infected persons and pork, seafood and dairy products on retail sale in France during 2000 and 2001. Int J Food Microbiol 114: 187–194.
- 33. Kathariou S (2003) Foodborne outbreaks of listeriosis and epidemic-associated lineages of Listeria monocytogenes. In: Torrence ME, Isaacson RE, editors. Microbial food safety in animal agriculture. Ames, IA: Iowa State University Press.
- 34. Chen Y, Zhang W, Knabel SJ (2007) Multi-virulence-locus sequence typing identifies single nucleotide polymorphisms which differentiate epidemic clones and outbreak strains of Listeria monocytogenes. J Clin Microbiol 45: 835–846.
- 35. Turner KM, Feil EJ (2007) The secret life of the multilocus sequence type. Int J Antimicrob Agents 29: 129–135.
- 36. Vazquez-Boland JA, Kuhn M, Berche P, Chakraborty T, Dominguez-Bernal G, et al. (2001) Listeria pathogenesis and molecular virulence determinants. Clin Microbiol Rev 14: 584–640.
- 37. Hamon M, Bierne H, Cossart P (2006) Listeria monocytogenes: a multifaceted model. Nat Rev Microbiol 4: 423–434.
- 38. Gaillard JL, Berche P, Frehel C, Gouin E, Cossart P (1991) Entry of Listeria monocytogenes into cells is mediated by internalin, a repeat protein reminiscent of surface antigens from gram-positive cocci. Cell 65: 1127–1141.
- 39. Mengaud J, Ohayon H, Gounon P, Mege RM, Cossart P (1996) E-cadherin is the receptor for internalin, a surface protein required for entry of Listeria monocytogenes into epithelial cells. Cell 84: 923–932.
- 40. Lecuit M, Vandormael-Pournin S, Lefort J, Huerre M, Gounon P, et al. (2001) A transgenic model for listeriosis: role of internalin in crossing the intestinal barrier. Science 292: 1722–1725.
- 41. Doumith M, Buchrieser C, Glaser P, Jacquet C, Martin P (2004) Differentiation of the major Listeria monocytogenes serovars by multiplex PCR. J Clin Microbiol 42: 3819–3822.
- 42. Schouls LM, van der Heide HG, Vauterin L, Vauterin P, Mooi FR (2004) Multiple-locus variable-number tandem repeat analysis of Dutch Bordetella pertussis strains reveals rapid genetic changes with clonal expansion during the late 1990s. J Bacteriol 186: 5496–5505.
- 43. Kumar S, Tamura K, Jakobsen IB, Nei M (2001) MEGA2: molecular evolutionary genetics analysis software. Bioinformatics 17: 1244–1245.
- 44. Huson DH, Bryant D (2006) Application of phylogenetic networks in evolutionary studies. Mol Biol Evol 23: 254–267.
- 45. Martin DP, Williamson C, Posada D (2005) RDP2: recombination detection and analysis from sequence alignments. Bioinformatics 21: 260–262.
- 46. Rozas J, Rozas R (1999) DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15: 174–175.
- 47. Didelot X, Falush D (2007) Inference of bacterial microevolution using multilocus sequence data. Genetics 175: 1251–1266.
- 48. Meats E, Feil EJ, Stringer S, Cody AJ, Goldstein R, et al. (2003) Characterization of encapsulated and noncapsulated Haemophilus influenzae and determination of phylogenetic relationships by multilocus sequence typing. J Clin Microbiol 41: 1623–1636.
- 49. Swofford DL (1998) PAUP*: phylogenetic analysis using parsimony (and other methods). 4.0b10 ed. Sunderland, Massachusetts: Sinauer.
- 50. Feil EJ, Holmes EC, Bessen DE, Chan MS, Day NP, et al. (2001) Recombination within natural populations of pathogenic bacteria: short-term empirical estimates and long-term phylogenetic consequences. Proc Natl Acad Sci U S A 98: 182–187.
- 51. Feil EJ, Maiden MC, Achtman M, Spratt BG (1999) The relative contributions of recombination and mutation to the divergence of clones of Neisseria meningitidis. Mol Biol Evol 16: 1496–1502.
- 52. Guttman DS, Dykhuizen DE (1994) Clonal divergence in Escherichia coli as a result of recombination, not mutation. Science 266: 1380–1383.
- 53. Falush D, Stephens M, Pritchard JK (2003) Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164: 1567–1587.
- 54. McVean G, Awadalla P, Fearnhead P (2002) A coalescent-based method for detecting and estimating recombination from gene sequences. Genetics 160: 1231–1241.
- 55. Wilson DJ, McVean G (2006) Estimating diversifying selection and functional constraint in the presence of recombination. Genetics 172: 1411–1425.
- 56. Anisimova M, Nielsen R, Yang Z (2003) Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites. Genetics 164: 1229–1236.
- 57. Glaser P, Frangeul L, Buchrieser C, Rusniok C, Amend A, et al. (2001) Comparative genomics of Listeria species. Science 294: 849–852.
- 58. Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155: 945–959.
- 59. Orsi RH, Ripoll DR, Yeung M, Nightingale KK, Wiedmann M (2007) Recombination and positive selection contribute to evolution of Listeria monocytogenes inlA. Microbiology 153: 2666–2678.
- 60. Schubert WD, Urbanke C, Ziehm T, Beier V, Machner MP, et al. (2002) Structure of internalin, a major invasion protein of Listeria monocytogenes, in complex with its human receptor E-cadherin. Cell 111: 825–836.
- 61. Jonquieres R, Bierne H, Mengaud J, Cossart P (1998) The inlA gene of Listeria monocytogenes LO28 harbors a nonsense mutation resulting in release of internalin. Infect Immun 66: 3420–3422.
- 62. Nightingale KK, Windham K, Martin KE, Yeung M, Wiedmann M (2005) Select Listeria monocytogenes subtypes commonly found in foods carry distinct nonsense mutations in inlA, leading to expression of truncated and secreted internalin A, and are associated with a reduced invasion phenotype for human intestinal epithelial cells. Appl Environ Microbiol 71: 8764–8772.
- 63. Olier M, Pierre F, Rousseaux S, Lemaitre JP, Rousset A, et al. (2003) Expression of truncated Internalin A is involved in impaired internalization of some Listeria monocytogenes isolates carried asymptomatically by humans. Infect Immun 71: 1217–1224.
- 64. Rousseaux S, Olier M, Lemaitre JP, Piveteau P, Guzzo J (2004) Use of PCR-restriction fragment length polymorphism of inlA for rapid screening of Listeria monocytogenes strains deficient in the ability to invade Caco-2 cells. Appl Environ Microbiol 70: 2180–2185.
- 65. Borucki MK, Kim SH, Call DR, Smole SC, Pagotto F (2004) Selective discrimination of Listeria monocytogenes epidemic strains by a mixed-genome DNA microarray compared to discrimination by pulsed-field gel electrophoresis, ribotyping, and multilocus sequence typing. J Clin Microbiol 42: 5270–5276.
- 66. Meinersmann RJ, Phillips RW, Wiedmann M, Berrang ME (2004) Multilocus sequence typing of Listeria monocytogenes by use of hypervariable genes reveals clonal and recombination histories of three lineages. Appl Environ Microbiol 70: 2193–2203.
- 67. Revazishvili T, Kotetishvili M, Stine OC, Kreger AS, Morris JG Jr, et al. (2004) Comparative analysis of multilocus sequence typing and pulsed-field gel electrophoresis for characterizing Listeria monocytogenes strains isolated from environmental and clinical sources. J Clin Microbiol 42: 276–285.
- 68. Seeliger HPR, Jones D (1986) Genus Listeria. Bergey's Manual of Systematic Bacteriology. Baltimore: Williams & Wilkins. pp. 1235–1245.
- 69. Fiedler F, Seger J, Schrettenbrunner A, Seeliger HPR (1984) The biochemistry of murein and cell wall teichoic acids in the genus Listeria. Syst Appl Microbiol 5: 360–376.
- 70. Peel M, Donachie W, Shaw A (1988) Physical and antigenic heterogeneity in the flagellins of Listeria monocytogenes and L. ivanovii. J Gen Microbiol 134: 2593–2598.
- 71. Rocourt J, Grimont F, Grimont PAD, Seeliger HPR (1982) DNA relatedness among serovars of Listeria monocytogenes sensu lato. Curr Microbiol 7: 383–388.
- 72. Palys T, Nakamura LK, Cohan FM (1997) Discovery and classification of ecological diversity in the bacterial world: the role of DNA sequence data. Int J Syst Bacteriol 47: 1145–1156.
- 73. Gevers D, Cohan FM, Lawrence JG, Spratt BG, Coenye T, et al. (2005) Opinion: Re-evaluating prokaryotic species. Nat Rev Microbiol 3: 733–739.
- 74. Hanage WP, Fraser C, Spratt BG (2006) Sequences, sequence clusters and bacterial species. Philos Trans R Soc Lond B Biol Sci 361: 1917–1927.
- 75. Falush D, Torpdahl M, Didelot X, Conrad DF, Wilson DJ, et al. (2006) Mismatch induced speciation in Salmonella: model and data. Philos Trans R Soc Lond B Biol Sci 361: 2045–2053.
- 76. Smith JM, Smith NH, O'Rourke M, Spratt BG (1993) How clonal are bacteria? Proc Natl Acad Sci U S A 90: 4384–4388.
- 77. Perez-Losada M, Browne EB, Madsen A, Wirth T, Viscidi RP, et al. (2006) Population genetics of microbial pathogens estimated from multilocus sequence typing (MLST) data. Infect Genet Evol 6: 97–112.
- 78. Dykhuizen DE, Green L (1991) Recombination in Escherichia coli and the definition of biological species. J Bacteriol 173: 7257–7268.
- 79. Suerbaum S, Smith JM, Bapumia K, Morelli G, Smith NH, et al. (1998) Free recombination within Helicobacter pylori. Proc Natl Acad Sci U S A 95: 12619–12624.
- 80. Wirth T, Falush D, Lan R, Colles F, Mensa P, et al. (2006) Sex and virulence in Escherichia coli: an evolutionary perspective. Mol Microbiol 60: 1136–1151.
- 81. Feil EJ, Cooper JE, Grundmann H, Robinson DA, Enright MC, et al. (2003) How clonal is Staphylococcus aureus? J Bacteriol 185: 3307–3316.
- 82. Mandin P, Repoila F, Vergassola M, Geissmann T, Cossart P (2007) Identification of new noncoding RNAs in Listeria monocytogenes and prediction of mRNA targets. Nucleic Acids Res 35: 962–974.