Streptococcus suis is a zoonotic pathogen that infects pigs and can occasionally cause serious infections in humans. S. suis infections occur sporadically in human Europe and North America, but a recent major outbreak has been described in China with high levels of mortality. The mechanisms of S. suis pathogenesis in humans and pigs are poorly understood.
The sequencing of whole genomes of S. suis isolates provides opportunities to investigate the genetic basis of infection. Here we describe whole genome sequences of three S. suis strains from the same lineage: one from European pigs, and two from human cases from China and Vietnam. Comparative genomic analysis was used to investigate the variability of these strains. S. suis is phylogenetically distinct from other Streptococcus species for which genome sequences are currently available. Accordingly, ∼40% of the ∼2 Mb genome is unique in comparison to other Streptococcus species. Finer genomic comparisons within the species showed a high level of sequence conservation; virtually all of the genome is common to the S. suis strains. The only exceptions are three ∼90 kb regions, present in the two isolates from humans, composed of integrative conjugative elements and transposons. Carried in these regions are coding sequences associated with drug resistance. In addition, small-scale sequence variation has generated pseudogenes in putative virulence and colonization factors.
Citation: Holden MTG, Hauser H, Sanders M, Ngo TH, Cherevach I, Cronin A, et al. (2009) Rapid Evolution of Virulence and Drug Resistance in the Emerging Zoonotic Pathogen Streptococcus suis. PLoS ONE 4(7): e6072. https://doi.org/10.1371/journal.pone.0006072
Editor: Adam J. Ratner, Columbia University, United States of America
Received: March 11, 2009; Accepted: April 22, 2009; Published: July 15, 2009
Copyright: © 2009 Holden et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the Wellcome Trust. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Streptococcus suis is a Gram positive coccus that colonises pigs. While it is generally carried asymptomatically in adult pigs, it can cause severe systemic disease in piglets, manifested as a rapidly fatal sepsis associated with meningitis, polyarthritis and pneumonia. Why adult pigs carry the causative bacteria asymptomatically while piglets develop acute disease is unknown. The main site of carriage in the adult are the tonsils, but bacteria have also been isolated from the nasal cavities, the gastrointestinal tract and genital tract. The carriage rate in adult pigs can approach 100% and this organism has been a worldwide problem for the pig industry for a number of years. S. suis has also been isolated from a range of other mammalian and avian species .
S. suis is an important zoonotic agent. The first case in humans was described in Denmark in 1968 . Human infection with S. suis occurs sporadically in Europe and North-America and case reports suggest that it is almost exclusively related to occupational exposure to pigs or pork products. Incidences of human infection with S. suis are greater in S.E. Asia and China. Meningitis is the most common presentation in humans, but septicaemia and endocarditis are also seen. In areas of Vietnam, S. suis is the main cause of acute bacterial meningitis in adults  and it is the third most common cause of meningitis in Hong Kong. The incidence of S. suis infection in humans is almost certainly under-reported. Most cases have been described as being caused by serotype 2 isolates, but other serotypes can cause human disease .
Recently two outbreaks of severe acute disease in humans with high morbidity and mortality in humans have been reported in China , in both cases due to serotype 2 strains. An outbreak in 1998 killed 14 of 25 patients, and an outbreak in 2005 affected 204 people, killing 38 of them (19%), a mortality rate around two to four times that previously reported. A high proportion of people in these outbreaks had a toxic shock-like syndrome, and most of the deaths occurred in this group rather than in those suffering meningitis. Streptococcal toxic shock syndrome (STSS) was previously associated with Streptococcus pyogenes, so the emergence of STSS due to presumptive zoonotic S. suis infection is of considerable concern . One isolate from each of these outbreaks has been sequenced  and this has identified a proposed pathogenicity island (PI) that may be involved in this particular clinical manifestation of S. suis infection, although this remains speculative at this stage. The PI was identified through comparison of the Chinese strains to the unfinished, unannotated genome sequence of strain P1/7 from the Sanger Institute. In this paper we present the completed fully annotated genome sequence of strain P1/7. In addition we have sequenced the genomes of two other S. suis strains, SC84 and BM407, which are human isolates from China and Vietnam respectively. Strain SC84 is a representative of the of 2005 outbreak in China .
Materials and Methods
Bacterial strains, growth and DNA isolation
S. suis strain P1/7 was isolated from an ante-mortem blood culture from a pig dying with meningitis , and is ST1 by MLST . S. suis strain BM407 is also ST1, and was isolated from CSF from a human case of meningitis in Ho Chi Minh City, Vietnam in 2004 . S. suis strain SC84 is ST7, which is closely related to ST1, and was isolated from a case of streptococcal toxic shock-like syndrome in Sichuan Province, China in 2005 . Strain P1/7 is resistant to gentamycin, streptomycin, neomycin, nalidixic acid, and sulfamethoxazole, and sensitive to penicillin, ampicillin, cephalotin, erythromycin, tulathromycin, clarythromycin, lincomycin, clindamycin, pirlimicin, tetracycline, trimethoprim-sulfa, ciprofloxacin, and chloramphenicol. Strain BM407 is resistant to trimethoprim-sulfamethoxazole, tetracycline, erythromycin, azithromycin and chloramphenicol and susceptible to penicillin, ceftriaxone and vancomycin. Strain SC84 is resistant to tetracycline, and susceptible to penicillin, ampicillin, cefotaxime, ceftriaxone, cefepime, meropenem, levofloxacin, chloramphenicol, erythromycin, azithromycin, clindamycin, and vancomycin .
Bacteria were cultured in Todd-Hewitt-broth at 37°C for 18 h and pelleted at 10,000×g. The cells were resuspended in 30 ml of lysis solution (10 mM NaCl, 20 mM Tris HCl pH 8, 1 mM EDTA, 0.5% SDS) and incubated at 50°C overnight. Three ml of 5 M sodium perchlorate was added and incubated for 1 h at ambient temperature. After phenol chloroform extraction the DNA was precipitated with ethanol, spooled into deionised water and stored at −20°C. DNA was also extracted using a genomic DNA extraction kit (G-500, Qiagen).
Whole genome sequencing
The genome of S. suis strain P1/7 was sequenced to approximately 8-fold coverage, from pUC18 (insert size 2.8–3.3 kb), and pMAQ1b_SmaI (insert size 3.0–3.3 kb) genomic shotgun libraries using big-dye terminator chemistry on ABI3730 automated sequencers. End sequences from large insert BAC libraries in pBACehr (insert size 10–25 kb) and pBACe3.6 (insert size 12–15 kb) were used as a scaffold. All repeat regions were bridged by read-pairs or end-sequenced polymerase chain reaction (PCR) products.
S. suis SC84 and BM407 genomes were sequenced using multiple sequencing technologies to generate contiguous drafts. The bulk of the sequencing for both projects was generated using 454/Roche GS20 and Solexa sequencing platforms. The 454 sequencing of strain SC84 generated 399,145 reads with an average length of 102.7 bp, which assembled de novo into 281 non-redundant contigs using the 454/Roche Newbler assembly program. The 454 sequencing of strain BM407 generated 223,282 reads with an average length of 103.1 bp, which assembled de novo into 1939 non-redundant contigs. The Solexa sequencing of strain SC84 generated 32,016,513 reads with a length of 37 bp, and of strain BM407 generated 1,769,043 reads with a length of 37 bp which were each assembled by alignment to a reference (strain P1/7) sequence using the ssaha2  alignment program. Data from the 2 technologies, were merged to produce 39 non-redundant contigs in SC84, and 390 non-redundant contigs in BM407.
The merging process involved using the random in silico sheering of the 454 de novo assembly consensus contigs to create 500 bp consensus reads with 250 bp overlaps. This generated a fasta tiling path representing each of the 454 de novo assembly contigs. These were then assembled together with all the capillary shotgun reads using phrap2gap  to merge the two technologies
The contigs were reordered based on BLAST  alignments with strain P1/7. The gaps between these contigs were closed by directed PCR and the products sequenced with big dye terminator chemistry on ABI3730 capillary sequencers. Aligned Solexa sequencing was merged to confirm assembly and also confirm sequence in 454 only regions. For the SC84 project capillary coverage was 5×, 454 coverage was 19× and Solexa coverage was 190×, and for the BM407 project capillary coverage was 5×, 454 coverage was 11× and Solexa coverage was 20×.
The sequences and annotations of the Streptococcus suis P1/7, SC84 and BM407 genomes have been deposited in the EMBL database under accession numbers AM946016, FM252031, and FM252032 and FM252033 respectively.
The sequence was annotated using Artemis software . Initial coding sequence (CDS) predictions were performed using Orpheus , Glimmer2 , and EasyGene  software. These predictions were amalgamated, and codon usage, positional base preference methods and comparisons to the non redundant protein databases using BLAST  and FASTA  software were used to refine the predictions. The entire DNA sequence was also compared in all six potential reading frames against UniProt, using BLASTX  to identify any possible coding sequences previously missed. Protein motifs were identified using Pfam  and Prosite , transmembrane domains were identified with TMHMM , and signal sequences were identified with SignalP version 2.0 . rRNAs were identified using BLASTN  alignment to defined rRNAs from the EMBL nucleotide database; tRNAs were identified using tRNAscan-SE ; stable RNAs were identified using Rfam .
Comparison of the genome sequences was facilitated by using the Artemis Comparison Tool (ACT)  which enabled the visualization of BLASTN and TBLASTX comparisons  between the genomes. Orthologous proteins were identified as reciprocal best matches using FASTA  with subsequent manual curation. Pseudogenes had one or more mutations that would prevent correct translation; each of the inactivating mutations was subsequently checked against the original sequencing data. Small scale variation including: SNPs and insertions and deletions, were identified by using the SNP detection pipeline of MUMmer . Clonalframe v1.1 was used to identify regions of the S. suis chromosomes associated with recombination .
Streptococcus sequences used for comparative genomic analysis were: S. suis 05ZYH33 (accession number CP000407) , S. suis 98HAH33 (accession number CP000408) , S. pyogenes Manfredo (accession number AM295007) , Streptococcus equi 4047 (accession number FM204883) , Streptococcus uberis 0140J (accession number AM946015) , Streptococcus thermophilus CNRZ1066 (accession number CP000024) , Streptococcus pneumoniae TIGR4 (accession number AE005672) , Streptococcus sanguinis SK36 (accession number CP000387) , Streptococcus mutans UA159 (accession number AE014133) , Streptococcus agalactiae NEM316 (accession number AL732656) , and Streptococcus gordonii str. Challis substr. CH1 (accession number CP000725) . The sequences were also compared wth Lactococcus lactis subsp. lactis IL1403 (accession number AE005176) .
Sequences alignment were made using R-coffee  and the alignments inspected and edited using Seaview (v4.0) . Bayesian analysis was performed using Mr Bayes , . The model of sequence evolution used was the generalized time-reversible (GTR) model with gamma-distributed rate variation. The analysis was run with 2×105 generations, and the first 5×104 were discarded as burn-in. The tree drawn with FigTree (v1.2.2; http://tree.bio.ed.ac.uk/software/figtree/).
The genome of S. suis strain P1/7, a pig disease isolate
The genome of Streptococcus suis P1/7 consists of a single circular chromosome of 2,007,491 bp (Figure 1; accession number AM946016) containing 1,908 predicted protein coding sequences (CDSs; Table 1), 82 of which are pseudogenes or gene fragments (Table S1). S. suis is phylogenetically distinct among the streptococci for which genome sequences are available (Figure 2). Pairwise genomic comparisons reveal very little conservation of gene order with other species (data not shown). Orthologue comparisons against all of the other available streptococcal genomes identified that approximately 60% of the P1/7 genome is orthologous to CDSs in the genomes of other Streptococcus species (Table 2). The highest number of orthologous matches was identified in S. sanguinis (66.1%), followed by S. gordonii (65.6%).
Key for the circular diagram (outer to inner): scale (in Mb); annotated CDSs coloured according to predicted function are shown on a pair of concentric circles, representing both coding strands; Streptococcus suis orphan CDSs, purple; orthologue matches shared with the Streptococcal species, S. mutans UA159, S. gordonii Challis CH1, S. sanguinis SK36, S. pyogenes Manfredo, S. equi 4047, S. agalactiae NEM316, S. uberis 0140J, S. pneumoniae TIGR4, S. thermophilus CNRZ1066, blue; orthologue matches shared with Lactococcus lactis subsp. lactis, green; G+C% content plot; G+C deviation plot (>0% olive, <0% purple). Colour coding for P1/7 CDS functions: dark blue, pathogenicity/adaptation; black, energy metabolism; red, information transfer; dark green, surface associated; cyan, degradation of large molecules; magenta, degradation of small molecules; yellow, central/intermediary metabolism; pale green, unknown; pale blue, regulators; orange, conserved hypothetical; brown, pseudogenes; pink, phage and IS elements; grey, miscellaneous.
Unrooted majority-rule tree of Bayesian analysis of combined 16S rRNA and RNase P RNA gene datasets. The was tree built from 16S rRNA and rnpB gene sequences from the genomes of the Streptococcus species: S. suis P1/7, S. mutans UA159, S. gordonii Challis CH1, S. sanguinis SK36, S. pyogenes Manfredo, S. equi 4047, S. agalactiae NEM316, S. uberis 0140J, S. pneumoniae TIGR4, and S. thermophilus CNRZ1066. Lactococcus lactis subsp. lactis IL1403 was included as an outgroup. The numbers at the branches are posterior probabilities indicating the support for the branch. The bar indicates the number of substitutions per site (4 per 100 sites).
Analysis of the functions of the P1/7 orthologues present in other streptococcal species identified that some functional groups are more widely conserved than others (Figure 3), and that some exhibit a wide range of distribution across the species compared. Many of the core housekeeping functions such as fatty acid metabolism, chaperones, macromolecule biosynthesis, and nucleotide biosynthesis exhibit the highest conservation. For these classes, >85% of the CDSs of these classes have orthologous matches in all the other streptococci compared. In some cases, for example ribosomal proteins, a lower level of orthologous matches in a strain is due to differences in gene prediction, rather than the absence of a CDS per se. In contrast, some functional groups such as cofactor biosynthesis and amino acid biosynthesis exhibit a lower level of conservation and probably reflect the available nutrients in the niches occupied by the individual streptococcal species. Many of the more variably distributed orthologous functional groups are associated with the acquisition and uptake of nutrients, for example macromolecule and small molecule degradation, as well as responses to environmental stresses and stimuli.
Functional groups are displayed clockwise on the web graph, in decreasing range of orthologue matches for a single category.
Excluding CDSs in mobile genetic elements (MGE), 283 CDSs (14.9%) in the P1/7 genome do not have orthologous matches (have been) in any of the other Streptococcus species (Table S2). These have been designated S. suis orphans and include: a transporter cluster (SSU0039 to SSU0044), a putative endopeptidase (SSU0152), a glucosamine-6-phosphate isomerase (SSU0206), a putative lipase (SSU0304), a putative deoxyguanosinetriphosphate triphosphohydrolase (SSU0327), capsule associated CDSs (SSU0520 to SSU0529), a UDP-galactopyranose mutase (SSU0563), a MATE family transporter (SSU0573), EPS associated CDSs (SSU1115, SSU1125 and SSU1128), a putative oligopeptidase (SSU1679), and a putative choline binding protein (SSU1911) (Table S2).
Virulence genes in the P1/7 genome
The mechanisms by which S. suis cause disease in humans and pigs are poorly understood. Many putative virulence functions have been identified, however no single function is crucial for the pathogenesis of S. suis. Many putative virulence genes are only present, or expressed, in some S. suis isolates, suggesting that combinations of virulence factors may be more important than individual functions. Additionally, several of the functions may be important for colonization and carriage, rather than disease. Analysis of the S. suis genomes reveals an array of CDSs encoding potentially important functions.
Polysaccharides and capsule.
Like most other pathogens that can cause meningitis, S. suis expresses a polysaccharide capsule. P1/7 is serotype 2 and contains a locus (SSU0515 to SSU0529) for the biosynthesis of type 2 capsule , . The structure of the S. suis serotype 2 capsule is unknown, but it is composed of glucose, galactose, N-acetylglucosamine, rhamnose and sialic acid in a ratio of 1∶3∶1∶1∶1 . Although 35 serotypes of S. suis have been described, type 2 is most frequently associated with disease in pigs and humans , –. Capsule plays a role in the virulence of S. suis; type 2 capsule mutants are avirulent in murine and pig models of infections ,  The capsule itself can mediate prostaglandin E2 (PGE2) and matrix metalloproteinase 9 (MMP-9) production by human macrophages  suggesting a role in pathogenesis of disease.
The P1/7 genome also contains a cluster of genes that appears to encode the components required for the biosynthesis of a putative exopolysaccharide (SSU1111 to SSU1133; Figure S1a). Included in this are CDSs encoding enzymes for synthesis of the exopolysaccharide building blocks, for processing and for export. Analysis of the cluster suggests that it is involved in the synthesis of a rhamnose-glucose-like polysaccharide (RGP), and is similar to components of RGP biosynthesis identified in S. mutans (Figure S1b). Four loci have been identified in S. mutans that encode proteins necessary for RGP synthesis: RmlA, RmlC, RmlB, and RmlD direct the synthesis of dTDP-L-rhamnose from D-glucose-1-phosphate , ; GluA produces UDP-D-glucose , RgpG initiates RGP synthesis by transfer of N-acetylglucosamine-1-phosphate to a lipid carrier , and proteins from an operon encoding six proteins (rgpABCDEFHI) are required for the transport and assembly of the rhamnan repeat unit backbone, and glucose side chain formation , .
The putative exopolysaccharide biosynthetic cluster between CDSs SSU1111 and SSU1133, contains homologues of rmlACB and rmlD (SSU1129, SSU1130, SSU1132 and SSU1133), and rgpABCD and F (SSU1124, SSU1123, SSU1122, SSU1121 and SSU1120) (Figure S1a). The two clusters of rml and rgp homologues are separated by four CDSs encoding a putative sulphatase (SSU1125), a mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase gene fragment (SSU1126), a putative N-acetylmuramoyl-L-alanine amidase (SSU1127), and a surface-anchored protein containing an LPXTS motif (SSU1128). In addition to the homologues of the S. mutans rpg genes, there is also a flippase (SSU1119), and six glycosyl transferases (SSU1117, SSU1116, SSU1115, SSU1114, SSU1113 and SSU1111). Homologues of gluA (SSU1828) and rgpG (SSU1672) are located at two other loci on the P1/7 genome. Unlike the S. mutans RGP cluster, the P1/7 cluster does not contain homologues of rmlE, rmlH and rmlI. These genes encode proteins that are involved in glucose side chain formation , therefore it is likely that the polysaccharide produced by P1/7 has a different repeat unit structure to that produced by S. mutans.
The cluster does not appear to encode an obvious repeat unit polymerase. Downstream of the flippase is a membrane protein (SSU1118) that has weak similarity matches to proteins of unknown function. Due to the sequence diversity of repeat unit polymerases, it is not always possible to annotate them by sequence similarity. Hydrophobicity plot profile comparison has been used to identify tentatively these polymerases . A comparison of the hydrophobicity plot of SSU1118 with those of known polymerases identified a similar profile (data not shown).
RGPs constitute serospecific-antigens that form the basis of some streptococcal serotyping. These molecules are associated with modulating host immune cell interactions, and may have a role in eliciting sepsis. Streptococcal RGPs stimulate human monocytes in vivo to release inflammatory cytokines, including tumor necrosis factor alpha and interleukin-1β , induce NO synthase activity , and confer resistance to phagocytosis by human polymorphonuclear leukocytes . Elevated NO production has been observed in animal models of bacterial meningitis and also in human patients with the disease , . In a mouse model of S. suis infection, high levels of inflammatory cytokines were observed both systemically and in the central nervous system . Distinct from the type 2 capsule, it is possible that the exopolysaccharide encoded by this cluster may have an important role in virulence and may contribute to the inflammatory response during S. suis infection.
Protein secretion and secreted proteins.
In the publication by Chen et al. describing the genomes of two Chinese S. suis isolates , they presented possible virulence-associated factors or pathways in S. suis, including strain P1/7. This analysis identified thirteen components of a Type II secretion system and one component of a Type III secretion system in the genome of strain P1/7. Our analysis of the genome failed to identify components of either secretion system, although components of Sec and SRP pathways were identified. Neither Type II, nor Type III secretion systems have been identified in Gram-positive bacteria, and it is therefore very unlikely that S. suis secretes proteins via these pathways. It would appear that the description of these secretion system components in S. suis is a case of mistaken identity due to sub-optimal transitive annotation. The S. suis genome contains CDSs that have weak similarity to components of these Gram-negative protein secretion systems, but these genes in S. suis encode proteins that are involved in the transport of other molecules, or are components of functionally distinct systems. The observed similarity owes more to distant evolutionary relationships rather than conservation of function .
Suilysin (SSU1231) is a secreted thiol-activated cytolysin  produced by some strains of S. suis . This protein is part of a family of cytolysins that play a role in disease caused by several bacterial pathogens . The role that suilysin plays in S. suis disease is unclear. Immunization of pigs with suilysin provided partial protection from S. suis infection . However, while a suilysin mutant was avirulent in a mouse infection model, it was only slightly attenuated in a model of infection in pigs .
Several of the previously characterized S. suis virulence factors are cell wall-anchored proteins. These sortase-processed proteins contain a C-terminal LPXTG-type processing motif. The genome of strain P1/7 contains a large number (28) of proteins of this type: all but five contained C-terminal LPXTG motifs followed by non-polar and basic amino acid motifs. The other five putative surface proteins (SSU1128, SSU1885, SSU1186, SSU1888 and SSU1889) contained LPXTS, IPXTG, IPXTG, YPXTG and LYXTG motifs respectively. Previous studies have identified five sortase homologues in S. suis (srtA-E) by PCR screening with degenerate primers . An additional sortase CDS was identified (SSU0428) in the P1/7 genome .
In addition, the S. suis P/7 genome contains a protein (SSU0879) with an atypical N-terminal LPXTG-type motif. This protein is similar to the large surface zinc metalloproteinases of S. pneumoniae; 39.9% identical to the zinc metalloprotease precursor ZmpC , 33.7% identical to Immunoglobulin A (IgA)1 protease precursor Iga , and 28.3% identical to the zinc metalloprotease precursor ZmpB . In S. pneumoniae these proteins interact directly with the host immune system, and play an inferred role in virulence. ZmpC mutants in S. pneumoniae led to reduced mortality in an experimental mouse model of intranasal challenge and sepsis . ZmpC cleaves human matrix metalloproteinase 9 (MMP-9), thought to lead to the proteolytic activation of this matrix remodelling protein . Matrix metalloproteinases can contribute to blood–brain barrier (BBB) disruption during bacterial meningitis , . MMP-9 produced by leukocytes during meningitis is essential to enable them to migrate through the BBB. A consequence of the increase in the permeability of the BBB mediated by MMP-9, is that bacterial cells are able to migrate to the central nervous system and cause further tissue damage. S. suis capsule induces MMP-9 production in macrophages in an in vivo model : it is possible that the S. suis zinc metalloproteinase may also play a role in increasing MMP-9 activity.
IgA1 protease mediates the site-specific cleavage of human IgA1 in the hinge region . IgA is the major class of immunoglobulin in mucosal secretions of the upper respiratory tract, and functions by interfering with microbial attachment to host tissues. The activity of IgA1 protease may help pathogens to subvert the antigen specificity of the humoral immune response, and facilitate adhesion and persistence at the mucosal surface .
Analysis of extracellular proteins produced by S. suis identified two proteins, muramidase-released protein (MRP)  and extracellular protein factor (EF) , associated with disease and mortality in pigs (MRP+, EF+) . Variation in the expression of these proteins has been observed in S. suis isolates. Although MRP and EF were originally identified as markers of disease, most S. suis serotype 2 strains isolated from diseased pigs in Canada are phenotypically negative for MRP and EF . Additionally, mutation of both proteins had no measurable effect on pathogenicity . P1/7 is phenotypically positive for MRP and EF (MRP+, EF+). In the genome of P1/7, EF appears to be encoded in the first ORF of a larger mutated CDS (SSU0171). The entire CDS encodes a putative surface-anchored protein, and is split into three reading frames by a frameshift and nonsense mutations; the 5-prime reading frame encoding EF, contains an N-terminal signal sequence, and the 3-prime reading frame contains a LPXTG sortase-processing motif. The EF+ phenotype of P1/7 is probably due to the expression of the 5-prime reading frame.
Unlike MRF and EF, opacity factor (OFS) is a large surface-associated protein that may play a role in the virulence of S. suis . The N-terminus of OFS is homologous to the N-terminal regions of the fibronectin-binding protein A (FnBA) of Streptococcus dysgalactiae and the serum opacity factor of S. pyogenes. Mutational studies have suggested a role for OFS in virulence, but not colonization. However in strain P1/7 OFS (SSU1474) is mutated, containing two frameshift mutations.
Two of the LPXTG proteins were found in a cluster similar to the pilus islands identified in S. agalactiae , and comprise the main pilus subunit (SSU0427) and ancillary subunit of this pilus (SSU0425). In addition the cluster contains a signal peptidase (SSU0424) and a sortase (SSU0428). The genome contains a second pilus cluster, containing 3 sortases, srtD, srtC, srtB (SSU1881, SSU1882, SSU1883), ancillary protein 1 (SSU1885), the main pilus subunit (SSU1886), ancillary protein 2 (SSU1888) and ancillary protein 3 (SSU1889). In silico analysis of the P1/7 genome suggests that this strain does not express complete pili, as both of the clusters contain pseudogenes; SSU0425 contains a nonsense mutation, and SSU1886 and SSU1888 contain frameshift mutations. To this end it is worth noting that orthologous CDS in the other sequenced S. suis strains are also pseudogenes. The genome also contains an incomplete pilus cluster that contains a signal peptidase (SSU0450) and sortase SrtE (SSU0453), but lacks pilus subunits and ancillary subunits. Between the signal peptidase and sortase there are a putative exported protein (SSU0451) and a transposase fragment (SSU0452). It is possible that the putative exported protein represents the N-terminal region of a pilus protein that has been truncated by a deletion event that generated the partial transposase sequence. Evidence for the potential role of pili in S. suis pathogenesis has come from in vitro work looking for preferentially expressed genes in S. suis upon interaction with porcine brain microvascular endothelial cells . The signal peptidase (SSU0424) of the S. agalactiae-like cluster is upregulated, as is srtE, the sortase belonging to the incomplete pilus cluster.
Comparison of cell wall-anchored proteins among streptococcal species shows that they are one of the most variable components of the genome; 13 out of the 29 proteins in P1/7 were unique to S. suis (Table S2), the rest being intermittently distributed. Interestingly, orthologues of several of the S. suis cell wall-anchored proteins, or proteins associated with their processing, were conserved within the meningitis-associated streptococci (Table 2). Six of the cell wall-anchored proteins, and four of the sortases were found in S. agalactiae and/or S. pneumoniae, and were absent in other streptococci .
In addition to LPXTG-type cell wall-anchored proteins, the P1/7 genome also contains other surface-exposed proteins that may play an important role in modulating host-cell interactions. The P1/7 genome encodes a protein, SSU0496, which contains a Mac 1 domain (PF09028), an N-terminal signal sequence and a C-terminal transmembrane domain. The Mac 1 domain is found in a small number of streptococcal proteins that modulate the immune response. Mac, also known as IdeS, is a cysteine protease secreted by S. pyogenes , . Two allelic families of Mac have been identified in S. pyogenes; Mac I and Mac II  exhibit sequence divergence in the middle third of the proteins (∼50% amino acid identity in this region). Mac was originally named because it had limited sequence homology to the α-subunit of the leukocyte β2 integrin, Mac-1 . S. pyogenes Mac proteins block polymorphonuclear leukocyte opsonophagocytosis and inhibit the production of reactive oxygen species . The mechanisms of action of the S. pyogenes Mac allelic families differ; Mac I binds to, and cleaves, the hinge region of human IgG  whereas Mac-2 exhibits lower levels of endopeptidase activity, and does not bind IgG, but instead to the IgG receptor, Fcγ . Homologues of Mac have been identified in Streptococcus equi subsp. equi (349 amino acids), the causative agent of strangles, and the related pathogen Streptococcus equi subsp. zooepidemicus . In S. equi the Mac homologue is expressed during infection, and reduces the antiphagocytic activity of equine neutrophils .
SSU0496 is similar (∼29% sequence identity) in the N-terminal region to the S. pyogenes Mac I and II proteins. In comparison, the S. suis protein is considerably larger than the S. pyogenes Mac proteins, comprising 1141 amino acids as opposed to ∼340 amino acids. The enzymatic activities of S. pyogenes Mac proteins have been characterized, and they belong to a novel family of cysteine proteases . Protein alignment of the S. suis protein with the S. pyogenes proteins shows that the amino acids of the catalytic triad (Cys94, His262 and Asp284) are conserved. The S. suis protein however, is missing an Arg-Gly-Asp (RGD) integrin-binding motif present in the other streptococcal proteins, suggesting that while it may possess the peptidase activity of its homologues it may lack the ability to bind integrin.
Comparative genomics of streptococci that can cause meningitis
Several species of Streptococcus have been associated with meningitis in humans: S. pneumoniae is an important cause of bacterial meningitis , and S. agalactiae is important in this regard . Comparative genomic analysis was used to identify components of the S. suis genome shared with S. pneumoniae and S. agalactiae that were absent in other streptococci. Three strains of S. agalactiae, NEM316 , 2603V/R  and A909  and three strains of S. pneumoniae, R6 , TIGR4  and D39 , have been sequenced thus far. Whilst S. pneumoniae strain TIGR4 is from a case of invasive disease, and S. agalactiae strains NEM316, 2603V/R and A909 are all clinical isolates, none of the strains appears to come from a meningitis case. 71 CDSs were identified that were present in at least one of the strains of S. pneumoniae or S. agalactiae, but were absent in other Streptococcus species (Table 3). Of these 12 were found in MGEs in the P1/7 genome. The distribution of orthologue matches is varied between species and between strains; only 5 CDSs (SSU0685, SSU0686, SSU0687, SSU1050 and SSU1403) have orthologue matches in all of the strains for both species. Amongst these common CDSs are 3 CDS that form part of an operon and encode a phosphomethylpyrimidine kinase, a hydroxyethylthiazole kinase and a thiamine-phosphate pyrophosphorylase. These 3 CDSs catalyse the final steps in the biosynthesis of thiamine monophosphate from pyrimidine and thiazole molecules. The other CDSs in this operon, SSU0684, encodes a putative phosphatase which does not have any orthologue matches. Another of the CDSs common to S. suis, S. agalactiae and S. pneumoniae encodes a PadR family regulatory protein
The final common CDS encodes a hyaluronidase. Hyaluronidase has been described as a virulence factor of S. pneumoniae; the addition of hyaluronidase to an intra-nasal inoculum was necessary to cause invasive disease in an infant mouse model of meningitis . Hyaluronidase mutants do not appear to be attenuated in a murine model of meningitis after intracerebral infection , suggesting that this degradative enzyme may be important for the bacteria to cross the mucosa or the BBB during the early stages of pathogenesis. Notably in all five S. suis strains sequenced, including the human meningitis strains this CDS is present as a pseudogene. A study investigating the distribution of hyaluronidases in a diverse collection of S. suis isolates identified fewer than 30% exhibited hyaluronate lyase activity in vitro, although all the isolates tested (n. 309) contained the hyaluronidase gene (hly) . In many cases mutations were identified that disrupted the expression of hly. Notably hyaluronate lyase activity was absent in the majority of isolates associated with invasive disease (meningitis and septicaemia), whereas activity was detected more often in isolates associated with pneumonia.
Given the unknown potential to cause meningitis of the S. agalactiae and S. pneumoniae strains sequenced, variably distributed matches should not be ruled out as having important roles in causing meningitis. Some of these CDSs encode functions that are important for pathogenesis, such as the zinc metalloproteinase previously mentioned. Orthologues of this protein are present in all of the S. pneumoniae strains compared but absent in S. agalactiae (Table 3). Components of the two pilus clusters are amongst the differentially distributed matches.
Comparative genomics of two geographically distinct human disease strains SC84 and BM407
S. suis strain SC84 is a human isolate, and was isolated from the same outbreak as Chinese S. suis strain 05ZYH33 . Strains from this outbreak have shown to belong to MLST sequence type 7 . Strain P1/7 is MLST sequence type 1 and, as a single locus variant, belongs to the same clonal complex as SC84. Previous comparisons of strain P1/7 with S. suis strains 98HAH33 and 05ZYH33 showed that only 83.6% and 83.3% respectively of the P1/7 CDSs had orthologous matches. It was also reported that 4.9% of the P1/7 genome was unique in comparison to S. suis 05ZYH33 and 98HAH33 . However, according to our analysis, all of the CDSs in the strain P1/7 genome have equivalent coding regions in the S. suis 05ZYH33 and 98HAH33 sequences. The difference in the number of orthologues identified can be accounted for by differences in gene prediction. The only large scale region of difference is associated with the 89 kb putative PI that is present in the two sequenced ST7 strains , and absent in P1/7.
The genome of S. suis strain SC84 consists of a single circular chromosome of 2,095,898 bp (accession number FM252031) and containing 1,985 CDSs (Table 1), of which 87 are pseudogenes or gene fragments (Table S1). Strain BM407 was isolated from a case of human meningitis in Vietnam in 2004 and, like strain P1/7, has MLST sequence type 1. The genome of S. suis strain BM407 consists of a single circular chromosome of 2,146,229 bp (accession number FM252032) containing 2,040 CDSs (Table 1) with 108 pseudogenes or gene fragments, and a plasmid (pBM407) of 24,579 bp, (accession number FM252033) which contains 18 CDSs with 3 pseudogenes or gene fragments (Table S1).
The chromosomes of strains P1/7 and SC84 have a conserved structure and are collinear along their length (Figure 4). The BM407 chromosome is collinear, with the exception of a large inversion (nucleotide 600371 to 1416050). The rearrangement is due to recombination events between identical IS3 elements on opposite replichores. Strains P1/7 and SC84 have asymmetric GC skews (Figure 4), whereas BM407 does not. Whilst the asymmetric conformation is present in two of the three strains sequenced, the BM407 conformation is likely to be the more common form of the S. suis chromosome as it maintains the replichore symmetry, placing the origin and terminus of replication at opposite ends of the circular chromosome, a common feature of bacterial circular chromosomes . Screening a collection of Vietnamese S. suis ST1 strains  using PCR showed that the chromosome is found in alternative orientations in closely related strains, suggesting that the IS3-mediated recombination generating the chromosomal rearrangement is an ongoing process (data not shown).
Pairwise comparisons of the chromosomes of S. suis strains P1/7, SC84 and BM407 displayed using the Artemis Comparison Tool (ACT) . The sequences have been aligned from the predicted replication origins (oriC; right). The coloured bars separating sequences (red and blue) represent similarity matches identified by BLASTN analysis , with a score cut off of 100. Red lines link matches in the same orientation; blue lines link matches in the reverse orientation. The green coloured boxes on the horizontal grey lines mark up the extent of genomic islands identified on the chromosomes; pink boxes mark up the extent ICE regions.
Comparison of the three S. suis genomes reveals discrete regions of difference that distinguish these closely related strains. For strains P1/7 and SC84, the regions of difference are exclusively associated with MGEs; all of the genes identified in strain P1/7 outside MGEs have orthologues in strain SC84. Strain BM407 also contains variation associated with MGEs, but additionally there are two regions that appear to be deleted, resulting in a reduction in the number of genes on the chromosome of this strain in comparison to the other two. The first is a small region of 268 bp, flanked by 30 bp direct repeats, that encodes a putative membrane protein (in P1/7, SSU1245). The second absent region is 3.3 kb and contains 2 complete CDSs encoding subunits of an ATP-dependent exonuclease RexAB and the C-terminal region of Mrp (SSUBM407_1126; equivalent to bases 733785 to 743796 in strain P1/7). Immediately downstream of mrp is a partial IS1216 transposase. The 3-prime region of mrp (pSSUBM407_006) is found on the plasmid pBM407, along with rexAB (pSSUBM407_015 and pSSUBM407_016), the genes for the missing chromosomal ATP-dependent exonuclease. The plasmid also contains three copies of IS1216, and it is likely that the chromosomal mrp and rexAB locus was deleted by IS1216-mediated recombination.
The three strains also exhibited small-scale variation at the nucleotide level including SNPs, insertions and deletions. Of the three strains BM407 appears to be the most diverse: comparison of strain BM407 to P1/7 identified 977 SNPs, 201 insertions, and 122 deletions in the core chromosome (excluding ICE regions); and comparison to SC84 identified 1017 SNPs, 232 insertions, and 133 deletions, whereas a comparison of strain P1/7 to SC84 identified 85 SNPs, 10 insertions, and 15 deletions.
The relative levels of variation are, on the face of it, at odds with the relatedness of the strains as defined by MLST: P1/7 belongs to the same sequence type as BM407, whereas SC84 is a single locus variant of P1/7. MLST is based on variation at only seven loci in the genome however, so it should not be surprising that it does not necessarily reflect the overall diversity at the whole genome level. Analysis of the distribution and density of SNPs in the genomes suggest that recombination has generated allelic variation at certain loci in the BM407 chromosome, thereby increasing the apparent level of diversity. Eight loci in the BM407 chromosome contained a high density of SNPs and were predicted by Clonalframe to be regions that have undergone recombination (Table 4). In total the SNPs in these regions account for ∼60% of the SNPs in the comparison with P1/7 and SC84.
Several of the putatively recombined loci in BM407 encode functions or proteins that potentially have a role in virulence or survival in the host. Two of the loci encode components of transport systems, that include an ABC-type glycine betaine transport system protein (SSUBM407_1490) homologous (56.1% identity at the amino acid level) to a bile exclusion system protein BilEB from Listeria monocytogenes shown to have a role in virulence ; and a binding protein-dependent transport system (SSUBM407_1447 to SSUBM407_1449) orthologous to a system from S. mutans responsible for the uptake of melibiose, raffinose, and isomaltotriose and the metabolism of melibiose, sucrose, and isomaltosaccharides . One of the loci is composed of three CDSs that are part of the putative exopolysaccharide biosynthesis cluster previously described. Two of the loci contain cell wall anchored surface proteins, encoding a zinc carboxypeptidase (SSUBM407_0646) and a N-acetylmuramoyl-L-alanine amidase (SSUBM407_0662). A single regulatory CDS (SSUBM407_0840) is found in the loci that appear to have undergone recombination. This CDS encodes a sensor kinase which is part of a two component regulatory system operon, with the SNPs being found in the N-terminal region of the sensor kinase protein. SSUBM407_0840 is homologous to ciaH of S. pneumoniae (49.3% identity at the amino acid level) which plays a role in antibiotics resistance and transformation competence . Mutations within this regulatory protein increase resistance to the cephalosporin cefotaxime.
There are fewer SNPs in the comparisons between P1/7 and SC84 than were described in the publication by Chen et al. (2007) which compared 98HAH33, 05ZYH33 and P1/7. Interestingly, a comparison of SC84 with 05ZYH33, both strains from the same outbreak, identified 2602 SNPs, 596 insertions, and 304 deletions. This is in marked contrast to the low number of SNPs identified in the comparison between P1/7 and SC84, two strains isolated from different hosts, on different continents over 20 years apart.
Strains from the Sichuan outbreak have been found to be ST7 . Examination of the sequence at MLST loci of 05ZYH33 revealed that they do not match the expected alleles for those of ST7. In comparison to SC84, which itself belongs to ST7, SNPs were identified at two of the seven MLST loci in the 05ZYH33 sequence: the aroA locus has 29 SNPs, and gki has 2 SNPs. Although, it is possible that 05ZYH33 is actually a different sequence type, albeit a double locus variant of ST7. The predicted sequence type of 98HAH33 is more divergent still, being a triple locus variant of ST7; the aroA has 6 SNPs, cpn has 1 single base deletion, and dpr has 1 single base deletion. The most likely explanation for these observations is that the sequence data for the 05ZYH33 and 98HAH33 strains is incorrect.
Mobile genetic elements.
Four regions of the P1/7 genome were identified that had properties indicative of recent acquisition, which include 1 prophage-like region and 3 miscellaneous islands, which have been designated genomic islands. None of these genomic islands carried characterized or putative virulence factors. Equivalent genomic islands are present in the SC84 genome. BM407 contains three of the genomic islands present in P1/7, but is lacking genomic island 3 (in strain P1/7 nucleotides 835135 to 846313; Figure 4). In contrast to some streptococcal species such as S. pyogenes , the proportion of the genome comprised by MGEs is relatively small, at 1.8% of the P1/7 genome, suggesting that the pan-genome of this species is smaller than other streptococci , , and that there may be some limitations or restrictions to the amount of recent horizontal gene transfer in this species.
Previous sequencing of the two S. suis strains associated with meningitis outbreaks in China in 1998 and 2005 identified highly similar 89 kb regions present in both genomes which were designated a candidate PI . This region is absent in strain P1/7, but strain SC84 contains an almost identical region. This island has a composite structure, contains several regions that appear to be integrated MGEs and has a structure similar to integrative conjugative elements (ICE) found in other streptococci and other bacteria . Accordingly this region has been designated ICESsuSC84.
Integrative conjugative elements.
ICE are commonly found in streptococcal genomes. The most common are Tn916-type elements, which almost invariably carry the tetracycline resistance determinant tetM, often accompanied by other drug resistance genes, and Tn5252-type elements, which are larger and more variable in sequence. In many cases, Tn5252 transposons are carrying Tn916 elements; varying arrangements of the two are observed in different genomes, suggesting this association has arisen independently on a number of occasions. Analogous with the bacterial chromosome as a whole, the ICE have core and accessory components to their gene complement. The conjugative machinery appears to be relatively well conserved between the ICE, with all of the streptococcal Tn5252-like elements sharing a large recombination protein, as well as VirB4-type and VirD4-type proteins. Furthermore, a genus-wide comparison of streptococcal ICE reveals a surprising level of similarity in their complement of cargo genes. The pezAT addiction toxin system, first identified in the chromosome of S. pneumoniae, is found on elements present in S. suis, S. pneumoniae and S. agalactiae, presumably aiding fixation of the transposon following integration. Bacteriocins, either alone or as part of a gene cluster encoding the accompanying processing machinery, are present on all the sequenced S. suis and S. pneumoniae  elements. Many of the other cargo coding sequences appear to be involved in increasing the stress tolerance of the host: abi genes are common, and the ICE of S. dysgalactiae subsp. equisimilis and S. agalactiae 2603V/R encode multiple heavy metal resistance genes. Of greater clinical importance, sequenced streptococcal ICE can carry genes for resistance to tetracycline, erythromycin, chloramphenicol, trimethoprim, aminoglycosides and streptomycin.
ICESsuSC84 is integrated into the SC84 S. suis genome in the 3-prime region of the gene encoding the 50S ribosomal protein L7/L12 (SSUSC84_0891). ICESsuSC84 has extended similarity to a Tn2424 region (Figure 5) found in S. agalactiae strain NEM316 , with three regions of difference. The first of these contains a putative bacteriocin biosynthesis cluster that is disrupted by a putative integron carrying five CDSs including an aminoglycoside resistance gene (aminoglycoside 6-adenylyltansferase; SSUSC84_0863). The second contains MGE-associated CDSs, including a Tn916 insertion that carries a tetracycline resistance gene (tetM; SSUSC84_0827). The third region contains genes associated with putative lantibiotic export/resistance and an asparagine synthetase. The only component of the 89kb island that is an obvious candidate for a pathogenicity determinant is a surface-anchored protein that contains a LPXTG motif that is annotated as an agglutinin receptor (SSUSC84_0873). This protein is found in the S. agalactiae element (and other streptococci) and contains a glucan-binding domain.
The results of a TBLASTX comparison of the ICESsuSC84 from strain SC84 and S. agalactiae NEM315 Tn2424 is displayed using the Artemis Comparison Tool (ACT) .
The BM407 genome contains two regions with extended similarity to ICESsuSC84. The first of these, ICESsuBM4071, is inserted into a luciferase-like monooxygenase (SSUBM407_0455), resulting in the disruption of this CDS. The second, ICESsuBM4072, is found at the same locus as ICESsuSC84; the element is inserted into the 3-prime region of the gene encoding the 50S ribosomal protein L7/L12 (SSUBM407_0933). Although ICESsuBM4072 and ICESsuSC84 are inserted at the same attachment site it is likely that they have arisen in these two strains by independent insertion events. Not only do the two elements contain regions of difference, the conserved DNA in the ICEs is more diverse than the core chromosome DNA (the ICEs exhibit ∼95−98% identity, in contrast to ∼99−100% identity for chromosomal DNA). ICESsuBM4072 and ICESsuSC84 have conserved tyrosine family site-specific integrases (SSUBM407_1003 and SSUSC84_0807; Figure 6), which is probably the explanation for why these elements integrated at the same site in the respective chromosomes, whereas ICESsuBM4071 contains a serine family recombinase (SSUBM407_0524) that directs the insertion of this element elsewhere on the BM407 chromosome.
The results of a TBLASTX comparison of the ICESsuBM4071 and ICESsuBM4072 regions from strain BM407, and ICESsuSC84 from strain SC84 is displayed using the Artemis Comparison Tool (ACT) .
Comparison of the BM407 and SC84 ICEs shows that CDSs associated with conjugation are conserved (Figure 6). The ICEs contain regions of sequence divergence with ICESsuBM4071 appearing to be most diverse, containing a greater proportion of unique sequence, and an inverted region. All three elements contain variants of Tn916; ICESsuBM4071 and ICESsuSC84 have Tn916 with the same gene content, whilst ICESsuBM4072 contains additional genes encoding the tetracycline resistance protein TetL (SSUBM407_0984) and chloramphenicol acetyltransferase (SSUBM407_0980). In addition to the drug resistance genes associated with Tn916, ICESsuBM4072 also carries genes encoding the tetracycline resistance protein TetO (SSUBM407_0954), erythromycin ribosome methylase ErmB (SSUBM407_0952) and dihydrofolate reductase (SSUBM407_0957), in a region of difference located elsewhere on the element.
Unlike ICESsuBM4072, ICESsuBM4071 does not carry as many CDSs associated with drug resistance. Amongst its cargo are a lactose utilization operon (SSUBM407_0510 to SSUBM407_0519), including a glucokinase-like protein, a galactose-6-phosphate isomerase, a tagatose-6-phosphate kinase, a tagatose 1,6-diphosphate aldolase, a 6-phospho-beta-galactosidase, and a PTS system lactose-specific transporter, which are absent in ICESsuBM4072 and ICESsuSC84. The chromosome of BM407 (and P1/7 and SC84) already contains homologues of these CDSs at two loci (SSUBM407_0879 to SSUBM407_0889 and SSUBM407_1060). Interestingly the chromosomal copy of the tagatose-6-phosphate kinase lacC (SSUBM407_0883) is mutated in BM407, therefore the provision of an additional copy of this CDS on ICESsuSC84 complements the mutation. Complete lactose utilization operons have been found on other ICEs, including the S. agalactiae NEM315 ICE.
Contained on ICESsuSC84 is a two component regulator, salK/salR (SSUSC84_0849 and SSUSC84_0850), that has been implicated in the virulence of the Chinese isolates in an animal model . These regulatory components are part of a larger operon encoding lantibiotic biosynthesis, similar to the salivaricin cluster of Streptococcus salivarius . Neither of the ICEs in BM407 contains this cluster (Figure 6); these elements contain alternative cargo CDSs in equivalent regions on these elements. Upstream of the operon is a CDS encoding a lantibiotic precursor (SSUSC84_0866). In SC84, and also in 05ZYH33 and 98HAH33, the operon has been disrupted by the insertion of a 7.9 kb element containing 9 CDSs, one of which encodes an aminoglycoside 6-adenylyltransferase (SSUSC84_0863), into the homologue of the S. salivarius salivaricin C biosynthesis gene salM (SSUSC84_0856). The precise regulatory function of SalK/SalR remains unclear, but their operonic association with the lantibiotic cluster suggests that their primary role is associated with this function. However it is possible that regulatory components of this operon may have been adapted to other functions. S. pyogenes with a mutation in salY, a gene encoding a putative ABC transporter permease of a lantibiotic operon, was attenuated for virulence in a zebrafish invasive-disease model . Analysis of the S. pyogenes lantibiotic cluster showed that it contained a homologue of salM that was mutated, suggesting that like the lantibiotic cluster in S. suis SC84, this cluster also no longer produced a functional lantibiotic, but it was still was essential for full virulence.
Screening of 91 Vietnamese S. suis serotype 2 strains of MLST ST1 isolated from patients with meningitis, for the presence of ICESsuSC84 using PCR targeted at the insertion sites , indicated that this ICE is not absolutely required for virulence in humans as 16 (17.6%) strains lacked the element (manuscript in preparation). In addition, only 41 (45%) carried sequences homologous to the salK/salR genes, suggesting that the presence of these genes cannot be used as a marker of virulence. Similar to strains BM407 and SC84, all strains containing ICE-associated sequences were resistant to tetracycline whereas those that were negative in the screening were sensitive, indicating the common presence of ICE carrying tet genes in these isolates.
The sequencing and detailed annotation of the S. suis P1/7, SC84 and BM407 genomes provides valuable data for studying the evolutionary events that are shaping the virulence and drug resistance in this emerging zoonotic pathogen.
There is strong evidence that genetic background influences the ability of S. suis to cause zoonotic infections as human disease isolates are almost always serotype 2 and are very closely related by MLST. Of the 121 STs in the S. suis MLST data base (http://ssuis.mlst.net/), only five STs have representative isolated from humans, and four (ST1, ST6, ST7 and ST84) out of the five are all part of the same clonal complex. Two of the strains in this study (P1/7 and BM407) introduce a greater degree of diversity into the comparisons, insofar as that they are from different geographical areas, and belong to different STs.
A major feature of the previously sequenced S. suis strains' genomes hypothesised to be important for pathogenicity, is an 89 kb ICE region designated as a candidate PI . BM407 also contains an almost identical ICE with some notable differences that encompass drug resistance genes, and a two-component regulatory system associated with a bacteriocin cluster. Sequence variation in the BM407 ICE suggest that although it is present at an orthologous site, it has been independently acquired.
The comparative analysis suggests that ICEs have been the major contributor to the evolution of drug resistance in the strains of S. suis examined. Although it is not possible to pinpoint the origins of the S. suis ICEs, due to the limited number of these elements in the databases, it is apparent from the analyses that they are closely related to ICEs found in phylogenetically distant streptococci. It is worth noting that some of these species occupy the same niches as S. suis, suggesting that inter-species genetic exchange is taking place. This has implications for defining the pan-genomes of some streptococcal species, since it may not be possible to define their boundaries if the pool of MGEs they exchange is effectively communal.
Aside from the ICEs, the virulence determinant inventories of P1/7, SC84 and BM407 strains are identical. Small nucleotide differences resulting from point mutation and recombination were observed in the genome comparisons. This variation may have subtle effects on the expression and function of these genes, and as a corollary, generate differences in the virulence phenotypes. To this extent it is possible that some of the small nucleotide differences observed in the SC84 and BM407 genomes in comparison to P1/7 may influence the ability of these strains to cause disease. However, as the capacity of P1/7 to cause human disease is unknown, it is not possible to speculate on the impact of genomic differences identified in this study on host-association. Future studies intending to increase the depth of sequencing in this zoonotic lineage will help unravel the pathogenic significance of sequence variation.
Pseudogenes in the genome of S. suis strains P1/7, SC84 and BM407.
(0.15 MB DOC)
Streptococcus suis orphan CDSs in the in the genome of S. suis strain P1/7.
(0.24 MB DOC)
Rhamnose-based polysaccharide cluster of strain P1/7 A) Structure and functional organisation of the Rhamnose-based polysaccharide cluster of strain P1/7. B) Comparison of the Rhamnose-based polysaccharide cluster from S. suis P1/7 with the RGP rml and rgp gene clusters of Streptococcus mutans UA159. The comparison of the RGP clusters from the S. suis P1/7 (top) and S. mutans UA159 and (bottom) is displayed using the Artemis Comparison Tool (ACT) . The red bars separating each genome represent similarity matches identified by TBLASTX analysis .
(0.35 MB PDF)
We would like to thank the Sanger Institute's Pathogen Production Group for shotgun and finishing sequencing, and the core Informatics Group for support.
Conceived and designed the experiments: MTGH MK JAL PNW CGD AMW NC MG JDS HS BGS JX CY SDB BB CS DJM JP. Performed the experiments: MTGH HH MS THN IC AC IG KM MAQ CP ER SS TBC. Analyzed the data: MTGH HH MS NJC PI CS. Contributed reagents/materials/analysis tools: THN TBC NTHM TSD NTC JX CY. Wrote the paper: MTGH HH MS NJC JAL MG HS BGS SDB CS DJM JP.
- 1. Gottschalk M, Segura M, Xu J (2007) Streptococcus suis infections in humans: the Chinese experience and the situation in North America. Anim Health Res Rev 8: 29–45.
- 2. Perch B, Kristjansen P, Skadhauge K (1968) Group R streptococci pathogenic for man. Two cases of meningitis and one fatal case of sepsis. Acta Pathol Microbiol Scand 74: 69–76.
- 3. Mai NT, Hoa NT, Nga TV, Linh LD, Chau TT, et al. (2008) Streptococcus suis meningitis in adults in Vietnam. Clin Infect Dis 46: 659–667.
- 4. Nghia HD, Hoa NT, Linh le D, Campbell J, Diep TS, et al. (2008) Human case of Streptococcus suis serotype 16 infection. Emerg Infect Dis 14: 155–157.
- 5. Yu H, Jing H, Chen Z, Zheng H, Zhu X, et al. (2006) Human Streptococcus suis outbreak, Sichuan, China. Emerg Infect Dis 12: 914–920.
- 6. Sriskandan S, Slater JD (2006) Invasive disease and toxic shock due to zoonotic Streptococcus suis: an emerging infection in the East? PLoS Medicine 3: e187.
- 7. Chen C, Tang J, Dong W, Wang C, Feng Y, et al. (2007) A glimpse of streptococcal toxic shock syndrome from comparative genomics of S. suis 2 Chinese isolates. PLoS ONE 2: e315.
- 8. Ye C, Zhu X, Jing H, Du H, Segura M, et al. (2006) Streptococcus suis sequence type 7 outbreak, Sichuan, China. Emerg Infect Dis 12: 1203–1208.
- 9. Clifton-Hadley FA (1981) Studies of Streptococcus suis type 2 infection in pigs: University of Cambridge.
- 10. King SJ, Leigh JA, Heath PJ, Luque I, Tarradas C, et al. (2002) Development of a multilocus sequence typing scheme for the pig pathogen Streptococcus suis: identification of virulent clones and potential capsular serotype exchange. J Clin Microbiol 40: 3671–3680.
- 11. Ye C, Bai X, Zhang J, Jing H, Zheng H, et al. (2008) Spread of Streptococcus suis sequence type 7, China. Emerg Infect Dis 14: 787–791.
- 12. Ning Z, Cox AJ, Mullikin JC (2001) SSAHA: a fast search method for large DNA databases. Genome Res 11: 1725–1729.
- 13. Dear S, Durbin R, Hillier L, Marth G, Thierry-Mieg J, et al. (1998) Sequence assembly with CAFTOOLS. Genome Res 8: 260–267.
- 14. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic Local Alignment Search Tool. Journal of Molecular Biology 215: 403–410.
- 15. Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, et al. (2000) Artemis: sequence visualization and annotation. Bioinformatics 16: 944–945.
- 16. Frishman D, Mironov A, Mewes HW, Gelfand M (1998) Combining diverse evidence for gene recognition in completely sequenced bacterial genomes. Nucleic Acids Research 26: 2941–2947.
- 17. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL (1999) Improved microbial gene identification with GLIMMER. Nucleic Acids Research 27: 4636–4641.
- 18. Larsen TS, Krogh A (2003) EasyGene–a prokaryotic gene finder that ranks ORFs by statistical significance. BMC Bioinformatics 4: 21.
- 19. Pearson WR, Lipman DJ (1988) Improved Tools for Biological Sequence Comparison. Proceedings of the National Academy of Sciences of the United States of America 85: 2444–2448.
- 20. Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, et al. (2002) The Pfam Protein Families Database. Nucleic Acids Research 30: 276–280.
- 21. Falquet L, Pagni M, Bucher P, Hulo N, Sigrist CJA, et al. (2002) The PROSITE database, its status in 2002. Nucleic Acids Research 30: 235–238.
- 22. Krogh A, Larsson B, von Heijne G, Sonnhammer ELL (2001) Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. Journal of Molecular Biology 305: 567–580.
- 23. Nielsen H, Engelbrecht J, Brunak S, vonHeijne G (1997) Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Engineering 10: 1–6.
- 24. Lowe TM, Eddy SR (1997) tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Research 25: 955–964.
- 25. Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR (2003) Rfam: an RNA family database. Nucleic Acids Research 31: 439–441.
- 26. Carver TJ, Rutherford K, Berriman M, Rajandream MA, Barrell B, et al. (2005) ACT: the Artemis comparison tool. Bioinformatics 21: 3422–3423.
- 27. Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, et al. (2004) Versatile and open software for comparing large genomes. Genome Biol 5: R12.
- 28. Didelot X, Falush D (2007) Inference of bacterial microevolution using multilocus sequence data. Genetics 175: 1251–1266.
- 29. Holden MT, Scott A, Cherevach I, Chillingworth T, Churcher C, et al. (2007) Complete genome of acute rheumatic fever-associated serotype M5 Streptococcus pyogenes strain manfredo. J Bacteriol 189: 1473–1477.
- 30. Holden MT, Heather Z, Paillot R, Steward KF, Webb K, et al. (2009) Genomic evidence for the evolution of Streptococcus equi: host restriction, increased virulence, and genetic exchange with human pathogens. PLoS Pathogens 5: e1000346.
- 31. Ward PN, Holden MTG, Leigh JA, Lennard N, Bignell A, et al. (2009) Evidence for niche adaptation in the genome of the bovine pathogen Streptococcus uberis. BMC Genomics 10: 54.
- 32. Bolotin A, Quinquis B, Renault P, Sorokin A, Ehrlich SD, et al. (2004) Complete sequence and comparative genome analysis of the dairy bacterium Streptococcus thermophilus. Nat Biotechnol 22: 1554–1558.
- 33. Tettelin H, Nelson KE, Paulsen IT, Eisen JA, Read TD, et al. (2001) Complete genome sequence of a virulent isolate of Streptococcus pneumoniae. Science 293: 498–506.
- 34. Xu P, Alves JM, Kitten T, Brown A, Chen Z, et al. (2007) Genome of the opportunistic pathogen Streptococcus sanguinis. J Bacteriol 189: 3166–3175.
- 35. Ajdic D, McShan WM, McLaughlin RE, Savic G, Chang J, et al. (2002) Genome sequence of Streptococcus mutans UA159, a cariogenic dental pathogen. Proc Natl Acad Sci U S A 99: 14434–14439.
- 36. Glaser P, Rusniok C, Buchrieser C, Chevalier F, Frangeul L, et al. (2002) Genome sequence of Streptococcus agalactiae, a pathogen causing invasive neonatal disease. Mol Microbiol 45: 1499–1513.
- 37. Vickerman MM, Iobst S, Jesionowski AM, Gill SR (2007) Genome-wide transcriptional changes in Streptococcus gordonii in response to competence signaling peptide. J Bacteriol 189: 7799–7807.
- 38. Bolotin A, Wincker P, Mauger S, Jaillon O, Malarme K, et al. (2001) The complete genome sequence of the lactic acid bacterium Lactococcus lactis ssp. lactis IL1403. Genome Res 11: 731–753.
- 39. Wilm A, Higgins DG, Notredame C (2008) R-Coffee: a method for multiple alignment of non-coding RNA. Nucleic Acids Res 36: e52.
- 40. Galtier N, Gouy M, Gautier C (1996) SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogeny. Comput Appl Biosci 12: 543–548.
- 41. Huelsenbeck JP, Ronquist F, Nielsen R, Bollback JP (2001) Bayesian inference of phylogeny and its impact on evolutionary biology. Science 294: 2310–2314.
- 42. Ronquist F, Huelsenbeck JP (2003) MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19: 1572–1574.
- 43. Smith HE, Damman M, van der Velde J, Wagenaar F, Wisselink HJ, et al. (1999) Identification and characterization of the cps locus of Streptococcus suis serotype 2: the capsule protects against phagocytosis and is an important virulence factor. Infect Immun 67: 1750–1756.
- 44. Smith HE, de Vries R, van't Slot R, Smits MA (2000) The cps locus of Streptococcus suis serotype 2: genetic determinant for the synthesis of sialic acid. Microb Pathog 29: 127–134.
- 45. Elliott SD, Tai JY (1978) The type-specific polysaccharides of Streptococcus suis. J Exp Med 148: 1699–1704.
- 46. Higgins R, Gottschalk M (2001) Distribution of Streptococcus suis capsular types in 2000. Can Vet J 42: 223.
- 47. Wisselink HJ, Smith HE, Stockhofe-Zurwieden N, Peperkamp K, Vecht U (2000) Distribution of capsular types and production of muramidase-released protein (MRP) and extracellular factor (EF) of Streptococcus suis strains isolated from diseased pigs in seven European countries. Vet Microbiol 74: 237–248.
- 48. Arends JP, Zanen HC (1988) Meningitis caused by Streptococcus suis in humans. Rev Infect Dis 10: 131–137.
- 49. Charland N, Harel J, Kobisch M, Lacasse S, Gottschalk M (1998) Streptococcus suis serotype 2 mutants deficient in capsular expression. Microbiology 144(Pt 2): 325–332.
- 50. Jobin MC, Gottschalk M, Grenier D (2006) Upregulation of prostaglandin E2 and matrix metalloproteinase 9 production by human macrophage-like cells: synergistic effect of capsular material and cell wall from Streptococcus suis. Microb Pathog 40: 29–34.
- 51. Tsukioka Y, Yamashita Y, Nakano Y, Oho T, Koga T (1997) Identification of a fourth gene involved in dTDP-rhamnose synthesis in Streptococcus mutans. Journal of Bacteriology 179: 4411–4414.
- 52. Tsukioka Y, Yamashita Y, Oho T, Nakano Y, Koga T (1997) Biological function of the dTDP-rhamnose synthesis pathway in Streptococcus mutans. Journal of Bacteriology 179: 1126–1134.
- 53. Yamashita Y, Tsukioka Y, Nakano Y, Tomihisa K, Oho T, et al. (1998) Biological functions of UDP-glucose synthesis in Streptococcus mutans. Microbiology 144(Pt 5): 1235–1245.
- 54. Yamashita Y, Shibata Y, Nakano Y, Tsuda H, Kido N, et al. (1999) A novel gene required for rhamnose-glucose polysaccharide synthesis in Streptococcus mutans. Journal of Bacteriology 181: 6556–6559.
- 55. Shibata Y, Yamashita Y, Ozaki K, Nakano Y, Koga T (2002) Expression and characterization of streptococcal rgp genes required for rhamnan synthesis in Escherichia coli. Infect Immun 70: 2891–2898.
- 56. Ozaki K, Shibata Y, Yamashita Y, Nakano Y, Tsuda H, et al. (2002) A novel mechanism for glucose side-chain formation in rhamnose-glucose polysaccharide synthesis. FEBS Lett 532: 159–163.
- 57. Broadbent JR, McMahon DJ, Welker DL, Oberg CJ, Moineau S (2003) Biochemistry, genetics, and applications of exopolysaccharide production in Streptococcus thermophilus: a review. J Dairy Sci 86: 407–423.
- 58. Soell M, Lett E, Holveck F, Scholler M, Wachsmann D, et al. (1995) Activation of human monocytes by streptococcal rhamnose glucose polymers is mediated by CD14 antigen, and mannan binding protein inhibits TNF-alpha release. J Immunol 154: 851–860.
- 59. Martin V, Kleschyov AL, Klein JP, Beretz A (1997) Induction of nitric oxide production by polyosides from the cell walls of Streptococcus mutans OMZ 175, a gram-positive bacterium, in the rat aorta. Infect Immun 65: 2074–2079.
- 60. Tsuda H, Yamashita Y, Toyoshima K, Yamaguchi N, Oho T, et al. (2000) Role of serotype-specific polysaccharide in the resistance of Streptococcus mutans to phagocytosis by human polymorphonuclear leukocytes. Infect Immun 68: 644–650.
- 61. Koedel U, Bernatowicz A, Paul R, Frei K, Fontana A, et al. (1995) Experimental pneumococcal meningitis: cerebrovascular alterations, brain edema, and meningeal inflammation are linked to the production of nitric oxide. Ann Neurol 37: 313–323.
- 62. Kornelisse RF, Hoekman K, Visser JJ, Hop WC, Huijmans JG, et al. (1996) The role of nitric oxide in bacterial meningitis in children. Journal of Infectious Diseases 174: 120–126.
- 63. Dominguez-Punaro MC, Segura M, Plante MM, Lacouture S, Rivest S, et al. (2007) Streptococcus suis serotype 2, an important swine and human pathogen, induces strong systemic and cerebral inflammatory responses in a mouse model of infection. J Immunol 179: 1842–1854.
- 64. Planet PJ, Kachlany SC, DeSalle R, Figurski DH (2001) Phylogeny of genes for secretion NTPases: identification of the widespread tadA subfamily and development of a diagnostic key for gene classification. Proceedings of the National Academy of Sciences of the United States of America 98: 2503–2508.
- 65. Jacobs AA, Loeffen PL, van den Berg AJ, Storm PK (1994) Identification, purification, and characterization of a thiol-activated hemolysin (suilysin) of Streptococcus suis. Infect Immun 62: 1742–1748.
- 66. King SJ, Heath PJ, Luque I, Tarradas C, Dowson CG, et al. (2001) Distribution and genetic diversity of suilysin in Streptococcus suis isolated from different diseases of pigs and characterization of the genetic basis of suilysin absence. Infect Immun 69: 7572–7582.
- 67. Billington SJ, Jost BH, Songer JG (2000) Thiol-activated cytolysins: structure, function and role in pathogenesis. FEMS Microbiol Lett 182: 197–205.
- 68. Jacobs AA, van den Berg AJ, Loeffen PL (1996) Protection of experimentally infected pigs by suilysin, the thiol-activated haemolysin of Streptococcus suis. Vet Rec 139: 225–228.
- 69. Allen AG, Bolitho S, Lindsay H, Khan S, Bryant C, et al. (2001) Generation and characterization of a defined mutant of Streptococcus suis lacking suilysin. Infect Immun 69: 2732–2735.
- 70. Osaki M, Takamatsu D, Shimoji Y, Sekizaki T (2002) Characterization of Streptococcus suis genes encoding proteins homologous to sortase of gram-positive bacteria. Journal of Bacteriology 184: 971–982.
- 71. Fittipaldi N, Gottschalk M, Vanier G, Daigle F, Harel J (2007) Use of selective capture of transcribed sequences to identify genes preferentially expressed by Streptococcus suis upon interaction with porcine brain microvascular endothelial cells. Appl Environ Microbiol 73: 4359–4364.
- 72. Oggioni MR, Memmi G, Maggi T, Chiavolini D, Iannelli F, et al. (2003) Pneumococcal zinc metalloproteinase ZmpC cleaves human matrix metalloproteinase 9 and is a virulence factor in experimental pneumonia. Mol Microbiol 49: 795–805.
- 73. Poulsen K, Reinholdt J, Kilian M (1996) Characterization of the Streptococcus pneumoniae immunoglobulin A1 protease gene (iga) and its translation product. Infect Immun 64: 3957–3966.
- 74. Novak R, Charpentier E, Braun JS, Park E, Murti S, et al. (2000) Extracellular targeting of choline-binding proteins in Streptococcus pneumoniae by a zinc metalloprotease. Mol Microbiol 36: 366–376.
- 75. Paul R, Lorenzl S, Koedel U, Sporer B, Vogel U, et al. (1998) Matrix metalloproteinases contribute to the blood-brain barrier disruption during bacterial meningitis. Ann Neurol 44: 592–600.
- 76. Leppert D, Leib SL, Grygar C, Miller KM, Schaad UB, et al. (2000) Matrix metalloproteinase (MMP)-8 and MMP-9 in cerebrospinal fluid during bacterial meningitis: association with blood-brain barrier damage and neurological sequelae. Clin Infect Dis 31: 80–84.
- 77. Weiser JN, Bae D, Fasching C, Scamurra RW, Ratner AJ, et al. (2003) Antibody-enhanced pneumococcal adherence requires IgA1 protease. Proceedings of the National Academy of Sciences of the United States of America 100: 4215–4220.
- 78. Smith HE, Vecht U, Gielkens AL, Smits MA (1992) Cloning and nucleotide sequence of the gene encoding the 136-kilodalton surface protein (muramidase-released protein) of Streptococcus suis type 2. Infect Immun 60: 2361–2367.
- 79. Smith HE, Reek FH, Vecht U, Gielkens AL, Smits MA (1993) Repeats in an extracellular protein of weakly pathogenic strains of Streptococcus suis type 2 are absent in pathogenic strains. Infect Immun 61: 3318–3326.
- 80. Gottschalk M, Lebrun A, Wisselink H, Dubreuil JD, Smith H, et al. (1998) Production of virulence-related proteins by Canadian strains of Streptococcus suis capsular type 2. Can J Vet Res 62: 75–79.
- 81. Smith HE, Vecht U, Wisselink HJ, Stockhofe-Zurwieden N, Biermann Y, et al. (1996) Mutants of Streptococcus suis types 1 and 2 impaired in expression of muramidase-released protein and extracellular protein induce disease in newborn germfree pigs. Infect Immun 64: 4409–4412.
- 82. Baums CG, Kaim U, Fulde M, Ramachandran G, Goethe R, et al. (2006) Identification of a novel virulence determinant with serum opacification activity in Streptococcus suis. Infect Immun 74: 6154–6162.
- 83. Dramsi S, Caliot E, Bonne I, Guadagnini S, Prevost MC, et al. (2006) Assembly and role of pili in group B streptococci. Mol Microbiol 60: 1401–1413.
- 84. Takamatsu D, Nishino H, Ishiji T, Ishii J, Osaki M, et al. (2009) Genetic organization and preferential distribution of putative pilus gene clusters in Streptococcus suis. Vet Microbiol. in press.
- 85. Lei B, DeLeo FR, Hoe NP, Graham MR, Mackie SM, et al. (2001) Evasion of human innate and acquired immunity by a bacterial homolog of CD11b that inhibits opsonophagocytosis. Nat Med 7: 1298–1305.
- 86. von Pawel-Rammingen U, Johansson BP, Bjorck L (2002) IdeS, a novel streptococcal cysteine proteinase with unique specificity for immunoglobulin G. EMBO Journal 21: 1607–1615.
- 87. Lei B, DeLeo FR, Reid SD, Voyich JM, Magoun L, et al. (2002) Opsonophagocytosis-inhibiting mac protein of group a streptococcus: identification and characteristics of two genetic complexes. Infect Immun 70: 6880–6890.
- 88. Vincents B, von Pawel-Rammingen U, Bjorck L, Abrahamson M (2004) Enzymatic characterization of the streptococcal endopeptidase, IdeS, reveals that it is a cysteine protease with strict specificity for IgG cleavage due to exosite binding. Biochemistry 43: 15540–15549.
- 89. Agniswamy J, Lei B, Musser JM, Sun PD (2004) Insight of host immune evasion mediated by two variants of group A Streptococcus Mac protein. J Biol Chem 279: 52789–52796.
- 90. Lannergard J, Guss B (2006) IdeE, an IgG-endopeptidase of Streptococcus equi ssp. equi. FEMS Microbiol Lett 262: 230–235.
- 91. Timoney JF, Yang J, Liu J, Merant C (2007) IdeE reduces the bactericidal activity of equine neutrophils for Streptococcus equi. Vet Immunol Immunopathol.
- 92. Short WR, Tunkel AR (2000) Changing epidemiology of bacterial meningitis in the United States. Curr Infect Dis Rep 2: 327–331.
- 93. Baker CJ, Edwards MS (1995) Group B streptococcal infections. In: Remington J, Klein JO, editors. Infectious diseases of the fetus and newborn infant. Philadelphia: W.B. Saunders. pp. 980–1054.
- 94. Glaser P, Rusniok C, Buchrieser C, Chevalier F, Frangeul L, et al. (2002) Genome sequence of Streptococcus agalactiae, a pathogen causing invasive neonatal disease. Mol Microbiology 45: 1499–1513.
- 95. Tettelin H, Masignani V, Cieslewicz MJ, Eisen JA, Peterson S, et al. (2002) Complete genome sequence and comparative genomic analysis of an emerging human pathogen, serotype V Streptococcus agalactiae. Proceedings of the National Academy of Sciences of the United States of America 99: 12391–12396.
- 96. Tettelin H, Masignani V, Cieslewicz MJ, Donati C, Medini D, et al. (2005) Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial “pan-genome”. Proceedings of the National Academy of Sciences of the United States of America 102: 13950–13955.
- 97. Hoskins J, Alborn WE, Arnold J, Blaszczak LC, Burgett S, et al. (2001) Genome of the bacterium Streptococcus pneumoniae strain R6. Journal of Bacteriology 183: 5709–5717.
- 98. Lanie JA, Ng WL, Kazmierczak KM, Andrzejewski TM, Davidsen TM, et al. (2007) Genome sequence of Avery's virulent serotype 2 strain D39 of Streptococcus pneumoniae and comparison with that of unencapsulated laboratory strain R6. Journal of Bacteriology 189: 38–51.
- 99. Zwijnenburg PJ, van der Poll T, Florquin S, van Deventer SJ, Roord JJ, et al. (2001) Experimental pneumococcal meningitis in mice: a model of intranasal infection. Journal of Infectious Diseases 183: 1143–1146.
- 100. Wellmer A, Zysk G, Gerber J, Kunst T, Von Mering M, et al. (2002) Decreased virulence of a pneumolysin-deficient strain of Streptococcus pneumoniae in murine meningitis. Infect Immun 70: 6504–6508.
- 101. King SJ, Allen AG, Maskell DJ, Dowson CG, Whatmore AM (2004) Distribution, genetic diversity, and variable expression of the gene encoding hyaluronate lyase within the Streptococcus suis population. Journal of Bacteriology 186: 4740–4747.
- 102. Rocha EP, Danchin A, Viari A (1999) Universal replication biases in bacteria. Mol Microbiol 32: 11–16.
- 103. Sleator RD, Wemekamp-Kamphuis HH, Gahan CG, Abee T, Hill C (2005) A PrfA-regulated bile exclusion system (BilE) is a novel virulence factor in Listeria monocytogenes. Mol Microbiol 55: 1183–1195.
- 104. Russell RR, Aduse-Opoku J, Sutcliffe IC, Tao L, Ferretti JJ (1992) A binding protein-dependent transport system in Streptococcus mutans responsible for multiple sugar metabolism. J Biol Chem 267: 4631–4637.
- 105. Guenzi E, Gasc AM, Sicard MA, Hakenbeck R (1994) A two-component signal-transducing system is involved in competence and penicillin susceptibility in laboratory mutants of Streptococcus pneumoniae. Mol Microbiol 12: 505–515.
- 106. Beres SB, Musser JM (2007) Contribution of exogenous genetic elements to the group A Streptococcus metagenome. PLoS ONE 2: e800.
- 107. Hiller NL, Janto B, Hogg JS, Boissy R, Yu S, et al. (2007) Comparative genomic analyses of seventeen Streptococcus pneumoniae strains: insights into the pneumococcal supragenome. Journal of Bacteriology 189: 8186–8195.
- 108. Burrus V, Pavlovic G, Decaris B, Guedon G (2002) The ICESt1 element of Streptococcus thermophilus belongs to a large family of integrative and conjugative elements that exchange modules and change their specificity of integration. Plasmid 48: 77–97.
- 109. Croucher NJ, Walker D, Romero P, Lennard N, Paterson GK, et al. (2009) Role of conjugative elements in the evolution of the multidrug-resistant pandemic clone Streptococcus pneumoniaeSpain23F ST81. J Bacteriol 191: 1480–1489.
- 110. Li M, Wang C, Feng Y, Pan X, Cheng G, et al. (2008) SalK/SalR, a two-component signal transduction system, is essential for full virulence of highly invasive Streptococcus suis serotype 2. PLoS ONE 3: e2080.
- 111. Hyink O, Wescombe PA, Upton M, Ragland N, Burton JP, et al. (2007) Salivaricin A2 and the novel lantibiotic salivaricin B are encoded at adjacent loci on a 190-kilobase transmissible megaplasmid in the oral probiotic strain Streptococcus salivarius K12. Appl Environ Microbiol 73: 1107–1113.
- 112. Phelps HA, Neely MN (2007) SalY of the Streptococcus pyogenes lantibiotic locus is required for full virulence and intracellular survival in macrophages. Infect Immun 75: 4541–4551.