Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Serotyping of Streptococcus pneumoniae Based on Capsular Genes Polymorphisms

  • Frédéric Raymond ,

    Affiliation Centre de Recherche en Infectiologie and Département de Microbiologie et Immunologie, Faculté de Médecine, Université Laval, Québec, Québec, Canada

  • Nancy Boucher,

    Affiliation Centre de Recherche en Infectiologie and Département de Microbiologie et Immunologie, Faculté de Médecine, Université Laval, Québec, Québec, Canada

  • Robin Allary,

    Affiliation Centre de Recherche en Infectiologie and Département de Microbiologie et Immunologie, Faculté de Médecine, Université Laval, Québec, Québec, Canada

  • Lynda Robitaille,

    Affiliation Centre de Recherche en Infectiologie and Département de Microbiologie et Immunologie, Faculté de Médecine, Université Laval, Québec, Québec, Canada

  • Brigitte Lefebvre,

    Affiliation Laboratoire de Santé Publique du Québec, Institut National de Santé Publique, Ste-Anne-de-Bellevue, Québec, Canada

  • Cécile Tremblay,

    Affiliation Laboratoire de Santé Publique du Québec, Institut National de Santé Publique, Ste-Anne-de-Bellevue, Québec, Canada

  • Jacques Corbeil ,

    Contributed equally to this work with: Jacques Corbeil, Alain Gervaix

    Affiliation Centre de Recherche en Infectiologie and Département de Microbiologie et Immunologie, Faculté de Médecine, Université Laval, Québec, Québec, Canada

  • Alain Gervaix

    Contributed equally to this work with: Jacques Corbeil, Alain Gervaix

    Affiliation Department of Pediatrics, University Hospitals of Geneva, Geneva, Switzerland

Serotyping of Streptococcus pneumoniae Based on Capsular Genes Polymorphisms

  • Frédéric Raymond, 
  • Nancy Boucher, 
  • Robin Allary, 
  • Lynda Robitaille, 
  • Brigitte Lefebvre, 
  • Cécile Tremblay, 
  • Jacques Corbeil, 
  • Alain Gervaix


Streptococcus pneumoniae serotype epidemiology is essential since serotype replacement is a concern when introducing new polysaccharide-conjugate vaccines. A novel PCR-based automated microarray assay was developed to assist in the tracking of the serotypes. Autolysin, pneumolysin and eight genes located in the capsular operon were amplified using multiplex PCR. This step was followed by a tagged fluorescent primer extension step targeting serotype-specific polymorphisms. The tagged primers were then hybridized to a microarray. Results were exported to an expert system to identify capsular serotypes. The assay was validated on 166 cultured S. pneumoniae samples from 63 different serotypes as determined by the Quellung method. We show that typing only 12 polymorphisms located in the capsular operon allows the identification at the serotype level of 22 serotypes and the assignation of 24 other serotypes to a subgroup of serotypes. Overall, 126 samples (75.9%) were correctly serotyped, 14 were assigned to a member of the same serogroup, 8 rare serotypes were erroneously serotyped, and 18 gave negative serotyping results. Most of the discrepancies involved rare serotypes or serotypes that are difficult to discriminate using a DNA-based approach, for example 6A and 6B. The assay was also tested on clinical specimens including 43 cerebrospinal fluid samples from patients with meningitis and 59 nasopharyngeal aspirates from bacterial pneumonia patients. Overall, 89% of specimens positive for pneumolysin were serotyped, demonstrating that this method does not require culture to serotype clinical specimens. The assay showed no cross-reactivity for 24 relevant bacterial species found in these types of samples. The limit of detection for serotyping and S. pneumoniae detection was 100 genome equivalent per reaction. This automated assay is amenable to clinical testing and does not require any culturing of the samples. The assay will be useful for the evaluation of serotype prevalence changes after new conjugate vaccines introduction.


Streptococcus pneumoniae is a major pathogen causing a wide spectrum of infections ranging from acute otitis media and lower respiratory tract infection to severe invasive diseases, such as septicemia and meningitis. Recent estimates of childhood mortality due to S. pneumoniae range from 700,000 to 1 million deaths yearly worldwide [1].

Among the >92 capsular serotypes of S. pneumoniae identified thus far, seven (4, 6B, 9V, 14, 18C, 19F and 23F) were responsible for 65 to 85% of invasive pneumococcal diseases (IPD) in children worldwide in the year 2000 [2]. A conjugate pneumococcal vaccine containing these serotypes (PCV7, Prevenar®, Pfizer) was licensed in 2000. Introduction of this vaccine rapidly led to a dramatic decrease in the incidence of IPD from an average of 25.1 cases/100,000 persons in 1999 to 12.6 cases/100,000 persons in 2004 in the USA [3]. Large post-licensure vaccine surveillance studies have raised concerns about an increased incidence of invasive disease due to non-vaccine serotypes, especially serotypes 3, 7F, 19A, 22 and 33 [4,5], a phenomenon known as serotype replacement. A 13-valent pneumococcal vaccine including the PCV7 serotypes as well as serotypes 1, 3, 5, 6A, 7F and 19A was licensed in 2010 (PCV13, Prevnar 13®, Pfizer). This new vaccine and others (Synflorix®, GSK) have the potential to further decrease IPD rates and prevent resurgence of pre-PCV7 IPD rates. However, ongoing serotype replacement is still of concern. Consequently, serotyping strains remains of paramount importance to both assess the effectiveness of current vaccines and to closely monitor the emergence of non-vaccine strains.

S. pneumoniae serotyping was developed at the beginning of the 20th century using panels of specific anti-sera produced in animals and directed against polysaccharides of the pneumococcal capsule. These agglutination assays are complex, costly and require highly skilled personnel [6]. Traditional agglutination assays cannot be performed directly on clinical samples since growth of S. pneumoniae on culture media is required. However, culture is often negative if patients received antibiotics before sampling of blood, cerebrospinal fluid (CSF) or other biological fluids, and in children with pneumococcal pneumonia in whom blood culture tends to be negative but PCR-positive for S. pneumoniae [7,8]. In recent years, immunological assays based on ELISA or latex-bead agglutination have been shown to work directly on clinical specimens [911]. For example, Sheppard and collaborators show that a multiplex immunoassay using xMAP beads can identify 14 S. pneumoniae serotypes directly from CSF [12].

Genes coding for the bacterial capsule are part of a complex gene structure where multiple and subtle polymorphisms in the capsular operon result in different serotypes. PCR-based serotyping using primers that amplify serotype-specific sequences are considered the method of choice for non-cultivable pneumococci serotyping [13,14]. A sequential multiplex PCR approach was initially proposed by Pai and collaborators [15] to identify 29 serotypes and has later been optimized for geographical locations such as South-Saharan Africa [16], Latin America [17] and Bangladesh [18]. Siira and collaborators complemented sequential multiplex PCR reactions by Quellung testing to increase confidence for 13 serogroups of the 42 serotypes/serogroups detected by PCR [6]. Selva and collaborators proposed a method that uses 3 multiplex PCR followed by fragment analysis and automated capillary electrophoresis to identify 40 serotypes or groups of serotypes [19]. Coskun-Ari and collaborators published a single multiplex that allows the identification of the 13 serotypes included in PCV13 [20]. Ahn and collaborators were able to identify 35 serotypes using as series of 8 multiplex PCR reactions [21]. Pimenta and collaborators described a series of multiplex real-time PCR that allowed the identification of 21 serogroups or serotypes, the order of which can be optimized for different regions [22]. PCR-based serotyping of S. pneumoniae has also been coupled with reverse line blot hybridization [23], restriction fragment length polymorphism [24,25] or amplicon sequencing [26,27]. This latter method, also termed sequetyping, relies on the sequencing of capsular operon genes such as cpsB in order to serotype samples by comparing the sequence obtained to those published by Bentley and collaborators [28]. Although these methods are cost-effective and specific, some serotypes are difficult to discriminate. As an example, a single nucleotide polymorphism in the wciP gene results in two distinct capsular serotypes, namely 6A and 6B, emphasizing the inability of PCR-based methods to realistically be used to differentiate serotypes from closely related serogroups [29]. Methods using PCR fragment length [30] or pyrosequencing [29] have been proposed to solve this problem, but are specific to serogroup 6. Immunological methods remain the most reliable approaches to discriminate closely related serotypes such as 6A and 6B. Microarray-based methods have also been published. Scott and coworkers used a microarray targeting capsular operon genes to serotype pneumococcal isolates [31]. Tomita and collaborators used a targeted microarray to identify 23 serotypes [32]. Both studies used conventional glass microarray supports, which require highly skilled personnel and are not amenable to clinical laboratories or to conduct large studies.

Immunological and molecular serotyping methods often include serotypes based on their prevalence, an approach that can bias serotyping studies. Moreover, most DNA-based methods allow the identification of a limited number of serotypes or serogroups. Herein, we suggest an innovative serotyping approach that, instead of detecting one target per serotype, relies on the genotyping of 12 positions located in the capsular operon to identify pneumococcal serotypes. This assay constitutes a proof-of-concept that typing a limited number of polymorphisms could allow a fast and efficient serotyping of S. pneumoniae. To facilitate large-scale clinical studies, the assay was adapted for the AutoGenomics INFINITI system and it could be readily ported to another automated platform. A flowchart exemplifying the steps of the assay and data analysis are shown in Figure 1 and Figure 2, respectively.

Figure 1. Schematic representation of experimental procedure.

Multiplex PCR amplifies up to 12 genes and is performed outside of the INFINITI system. Amplification products are loaded in the INFINITI system at the primer extension step. Primer extension oligonucleotides include a tag sequence that will hybridize to the microarray and a specific detection sequence that allows for primer extension of S. pneumoniae genes. Fluorescent nucleotides are incorporated during primer extension. Once the reaction is complete, the INFINITI system automatically transfers the labeled products to the microarray for hybridization. Tag sequences hybridize to anti-tags located on the microarray. Microarrays are then washed, dried and loaded into the integrated confocal scanner where fluorescence is measured. The report generated by the instruments is analyzed off-line using the Pneumotyper software.

Figure 2. Algorithm for results analysis.

Samples were assessed for the presence of S. pneumoniae by detecting the pneumolysin and the autolysin genes. If S. pneumoniae was detected in the sample, the serotyping probes were analyzed in order to identify the serotype.


Strains and clinical samples

One hundred and sixty-six (166) isolates of S. pneumoniae, which had been previously serotyped using the Quellung method, were provided by the Laboratoire de santé publique du Québec (LSPQ), the Swiss National Reference Laboratory for S. pneumoniae and the Centre de recherche en Infectiologie de l’Université Laval strain collection.

Clinical samples were obtained from blood of children with pneumococcal bacteremia, from CSF of children with pneumococcal meningitis, in 2007-2009 in Cameroon [35], and from naso-pharyngeal swabs of children with pneumonia, in 2008-2010 in Switzerland. All samples were collected with the approval of the medical ethics committee of the Hôpitaux Universitaires de Genève. Written informed consent was obtained from parents.

Bacterial culture and DNA extraction

DNA was extracted from clinical samples using the Purelink viral RNA/DNA minikit (Invitrogen, Burlington, ON, Canada). Reference bacterial strains were grown in the appropriate medium until a turbidity of 0.5 McFarland was obtained. Bacteria were centrifuged and resuspended in 5X TE buffer. DNA was extracted using the BD GeneOhm™ Lysis Kit (BD GeneOhm, San Diego, CA). Purified DNA was quantified on agarose gel and diluted to a concentration of 100 ng/µl and then to a concentration of 1 ng/µl. Genomic DNA was kept at 4°C.

Primers and probes

The composition of the multiplex PCR primer mix is described in Table 1. Primer should be pooled in 1X TE to obtain final multiplex primer mix concentration as described in Table 1. Primers used for primer extension were composed of a proprietary tag sequence, specific for the INFINITI system, followed by a specific detection sequence. The tag sequence must be selected based on the system on which the assay is implemented and should be provided by manufacturers. The composition of the primer extension primer mix is listed in Table 2. However, Table 2 contains only the specific detection sequence of the primers. Each sequence must then be coupled to a tag sequence appropriate for the technology used for detection. Some primers should use the same tag sequence, as indicated by their TagID. The primer concentration in the primer extension mix is 500 nM for each primer, in 1X TE. Concentration of each primer in the primer extension solution is 50 nM.

Target geneProbeSetAmplicon lengthPrimerIDSequenceMultiplex Primer Mix (µM)Final concentration (µM)
Integral membrane regulatory protein (wzg)B, E417 ntSpneumoF004GATCGATTTGTTGGGTGGGGTAGATG10.05
Glycosyl transferaseI199 ntSpneumoF021GTTTCGTTTACAACAATTCCAAGCCGTC10.05
Tyrosine-protein kinase (wze)J161 ntSpneumoF023GAGCCTTGGGATAAGTACATCTCTGT10.05
Protein-tyrosine phosphatase (wzh)C,D,F617 ntSpneumoF010TTTGATGTAGATGAYGGTCCCAAGTCAA20.1
UDP-galactopyranose mutase (glf)M269 ntSpneumoF033GCTCATATCTTYCATACTTCTGATAAGGAGA10.05
dTDP-d-glucose -dehydrataseP176 ntSpneumoF039GGAGCTGGCTTTATCGGTTC10.05
Acetyl transferaseQ333 ntSpneumoF041AGTTTGGGCAACATTTAGTGTGATTGCT10.05
Capsular polysaccharide biosynthesis protein (wzd)N et R293 ntSpneumoF043CCTGAGTTTACTAGTATGACTCGG0.50.025
AutolysinlytA201 ntSpneumoF037CCTCAAGTYGGCGTGCAAC10.05
Pneumolysinply300 ntSpneumoF014GAGGGTAATCAGCTACCCGATGAG10.05

Table 1. Genes targeted by the assay and composition of the multiplex PCR primer mix.

Download CSV
OligoIDSequence (Without tag assignation)ProbesetTagID

Table 2. Composition of primer extension multiplex.

Download CSV

Microarray assay.

Summary of the assay procedure is shown in Figure 1. Multiplex PCR was performed in a T1plus thermocycler (Biometra, Montreal Biotech, Montreal, Canada). Multiplex PCR was performed for each sample. The amplification solution was composed of 1X Platinum Taq buffer, 0.2 µM dNTPs, 1.5 mM MgCl2, 1µl multiplex PCR primer mix, 0.5 units of Platinum Taq DNA polymerase (Invitrogen, Burlington, ON, Canada), and 2.5 µl of cDNA, in a final volume of 20 µl. Prior to PCR, 107 copies of phage M13amp18 RF1 DNA (New England Biolabs, Pickering, ON, Canada) was added to each reaction as an internal control. The PCR program consisted in the following steps: 60 s at 94°C followed by 39 cycles of 30 s at 94°C, 30 s at 55°C and 90 s at 72°C. Finally, the reaction was incubated at 72°C for 3 min. Then, 3 units of shrimp alkaline phosphatase (Clontech, Mountain View, CA), 7.5 units of exonuclease (Clontech) and 0.25 µl of 50X titanium DNA polymerase (Clontech) were added to the solution, which was incubated at 37°C for 20 min and at 94°C for 10 min. This step allows for the degradation of remaining dNTPs and PCR primers that were not used in the multiplex PCR. The subsequent steps were automated by the INFINITI analyzer (AutoGenomics Inc., Carlsbad, CA). The primer extension solution was composed of 1X Platinum Taq buffer, 2.5µM MgCl2, 156 µM d(A/G/T)TP, 10µM Cy5-dCTP, 2µl primer extension primer mix completed with water to 20µl per reaction. A volume of 20µl of primer extension solution was added to each reaction. Primer extension reaction consisted in the following steps: 60 s at 94°C followed by 39 cycles of 15 s at 94°C and 15 s at 50°C. Primer extension was done in the presence of Cy5-dCTP. Following the primer extension reaction, 80 µl of hybridization solution was added to each reaction. The total volume of 120 µl was then hybridized to a DNA microarray for 90 min at 42°C with high humidity. The tags on the extension primers hybridized to corresponding probes on the microarray. After hybridization, each chip was washed 5 times with 300 µl of 1X saline sodium citrate (SSC). Chips were dried and scanned using a confocal scanner. Microarray results have been submitted to Gene Expression Omnibus database (GSE45865).

Result analysis

The algorithm used for S. pneumoniae detection and serotyping result analysis is summarized in Figure 2. Microarray results were controlled for background and negative control. For each probe a ratio between the mean of the background corrected signal of triplicate probes and background fluorescence was calculated. Positivity of the internal control probes (ratio > 3) confirmed the test validity for this sample. Sample was deemed positive for S. pneumoniae if the ratio of either the pneumolysin (ply) or the autolysin (lytA) probes was greater than 1.5. If the sample was positive for S. pneumoniae, the serotyping probes were analyzed to provide a serotype. Twelve positions of the capsular operon were typed at the nucleotide level. For each genotyped position, probes with ratios greater than the mean ratio for the sample were considered positive. If one or more probe was positive, the probe with the highest signal to background ratio was considered positive for the position. Each sample was compared to expected results for 92 serotypes and a score was calculated for each : (concordant probes - false positive) / (expected probes for serotype + discordant probes + false negative probes). Afterwards, the score for all serotypes were compared and, if the highest score obtained was higher than 0.4, the sample was associated to this serotype. If many serotypes shared the highest score, the sample was associated to this group of serotypes. Serotype determination software "Pneumotyper" and serotype database are available on GitHub (


Design of the assay

A machine learning approach was used to identify genes and single nucleotide variations within the capsular locus that, in combination, allow for the serotyping of S. pneumoniae. Coding sequences (CDS) were extracted from 90 capsular polysaccharide biosynthetic clusters of S. pneumoniae [28] and CDS were compared in order to form subgroups. Initially, the optimal combination of CDS necessary to identify the 13 vaccine serotypes of interest was determined using the Id3 decision tree algorithm implemented in the Weka software [33]. Selected CDS were mined to find polymorphisms specific to different capsular serotypes. Selected polymorphisms were added to the decision tree optimization parameters and targets were again optimized with Id3 to identify additional serotypes. Several iterations of this process were performed to increase the number of serotypes detected by the assay using a minimal number of targets. Detection primers were designed to identify these polymorphisms, which, in combination, permit serotype identification. Figure 3 provides an example of the nucleotides typed for serotype 19A.

Figure 3. Capsular operon of S. pneumoniae serotype 19A.

Expected amplicons are shown under the sequence, with genotyped positions marked with vertical lines. The expected genotype for serotype 19A is in large characters while the other possible genotypes are shown in smaller type. Positions are in nucleotides.

Evaluation with previously serotyped strains

The assay was evaluated on a collection of 166 cultures of S. pneumoniae previously serotyped by the Quellung assay, representing 63 different serotypes. Detailed results are shown in Table 3. All strains were found to be positive for S. pneumoniae, with all 166 strains positive for the pneumolysin gene (100%) and 165 positive for autolysin (99.4%). The strain negative for autolysin was of serotype 48 and was correctly serotyped. Serotyping using the assay allowed the correct identification of 126 (75.9%) samples and the misidentification of 14 (8.4%) samples to closely related serotypes of the same serogroup. These erroneous identification were serotype 6A identified as 6B (6/14), serotype 19B (2/14) or 19C (2/14) identified as serotype 19A, serotype 6B (1/14) identified as 6A/6C, serotype 24F (2/14) identified as 7C/24B/45 and serotype 28F (1/14) identified as 28A. Only 8 samples (4.8%) had erroneous serotyping out of the expected serogroup, and none accounted for misidentified vaccinal serotypes. They were of serotype 21, 41F, 42, 44, 45, and 46. Finally, 18 samples (10.8%) were positive for pneumolysin and autolysin but could not be serotyped because their serotyping score was lower than 0.4. The serotype yielded by samples with scores lower than the threshold was investigated. Of these, 8 would have been correctly serotyped and 10 would have been discordant if called positive. The latter were of serotypes 4, 10A, 10B, 15F, 39 and 43. Negative and discordant samples were retested and yielded similar results.

Serotype as dermined by QuellungExpected resultsVaccinenGenotyping concordant with QuellungGenotyping yield appropriate serogroupGenotyping results discordant with QuellungGenotyping unable to serotype sample
6B6B7871 (6A or 6D)00
6A6A131036 (6B)01
19B19BNon-Vaccine202 (19A)00
19C19CNon-Vaccine202 (19A)00
2121Non-Vaccine2002 (12A;12B;33B;33D;46)0
24F24FNon-Vaccine202 (7C;24B;45)00
28F16A;28F*Non-Vaccine101 (28A)00
41F41FNon-Vaccine1001 (17F)0
4242Non-Vaccine1001 (33A;33C;33F;35A;35C)0
4411F;12F;44*Non-Vaccine1001 (12A;12B;12F;46)0
4545Non-Vaccine2002 (22A;22F)0
4612A;12B;12F;46*Non-Vaccine2101 (21;28A;32A;32F)0

Table 3. Serotyping of 166 cultures of S. pneumoniae previously typed by the Quellung assay.

*Serotype is assigned to a subgroup of probable serotypes
Download CSV

Cross-reactivity with other bacteria species

In order to ensure its specificity, the assay was performed on twenty-four species of bacteria that could potentially be present in a clinical specimen, including ten Streptococcus species (Table 4). All species were positive for the internal control and negative for pneumolysin, autolysin and serotyping, except for S. pseudopneumoniae. As expected, S. pseudopneumoniae (BAA-960) yielded positive results for pneumolysin, autolysin and wzd. This strain of S. pseudopneumoniae has already been shown to contain these virulence genes [34].

SpeciesStrainPneumolysin Autolysin Serotyping Internal Control
Bordetella pertusisATCC 9797---+
Corynebacter jeikeiumATCC 15978---+
Corynebacterium minutissimumATCC 23348---+
Corynebacterium pseudodiphteriticumATCC 10700---+
Corynebacterium xerosisATCC 373---+
Enteroccus faecalisATCC 19433---+
Escherichia coliATCC 43886---+
Haemophilus influenzaATCC 9006---+
Klebsiella pneumoniaeATCC 27736---+
Neisseria meningitidisATCC 13077---+
Pseudomonas aeruginosaATCC 35554---+
Rhodococcus equiATCC 6939---+
Staphylococcus aureusATCC 29213---+
Staphylococcus epidermidisATCC 35984---+
Streptoccocus mitisATCC 49456---+
Streptoccus pyogenesATCC 12384---+
Streptococcus cristatusATCC 51100---+
Streptococcus gordoniiATCC 10558---+
Streptococcus oralisATCC 10557---+
Streptococcus parasanguinisATCC 15912---+
Streptococcus pseudopneumoniaeBAA-960++-+
Streptococcus salivariusATCC 7073---+
Streptococcus sanguinisATCC 10556---+
Streptococcus vestibularisATCC 49124---+

Table 4. Cross-reactivity with common bacteria species.

Download CSV

Limit of detection for S. pneumoniae detection and serotyping

Limit of detection studies were performed on cloned pneumolysin and autolysin genes and on quantified genomic DNA of serotypes 9V and 23F. Plasmids were serially diluted and dilutions were tested with the assay. Pneumolysin could be detected at a dilution lower than 10 copies per reaction and autolysin was detected at a 100 copies dilution. Genomic DNA from laboratory strains were dosed on agarose gel and serially diluted in order to determine the sensitivity of serotyping using the assay. The 100 genome equivalent limit of detection for serotyping was consistent between all samples. Limits of detection lower than 100 and 500 genome equivalent were observed for pneumolysin and autolysin, respectively.

Clinical samples

In order to validate that clinical specimens were suitable for serotyping using the assay, the test was performed on two sets of specimens obtained from patients, a series of 43 cerebrospinal fluid (CSF) samples [35], and a series of 59 nasopharyngeal aspirates (NPA). None of these samples have been serotyped with the Quellung assay. S. pneumoniae detection and serotyping was possible for the two types of specimens, demonstrating that culture is not required for serotyping using our method.

The 43 CSF samples were collected in Cameroon from patients with meningitis [35] and were previously tested positive for pneumolysin-specific real-time PCR as described by Corless and collaborators [36]. Using our assay, 40 of these specimens were positive for pneumolysin (93.0%), 34 were positive for autolysin (79.0%) and 35 were serotyped (81.4%). Serotype distribution of the 43 CSF samples from Cameroon, serotyped using the microarray assay described herein, was published by Gervaix and collaborators [35]. . The 59 NPA were obtained in Geneva, Switzerland, from children diagnosed with community-acquired and radiographically confirmed pneumonia of unknown etiology. Of these 59 specimens, 45 were positive for pneumolysin (76.3%), 34 were positive for autolysin (57.6%) and 41 were serotyped (69.5%), as shown in Table 5. These results demonstrate that pneumococcal culture is not required to conduct serotyping using the assay described herein.

SerotypePositive NPA
9A or 9V*1
9L, 9N or 204
10B or 34*1
10C, 10F, 11A or 11D2
16F, 23B, 27, 28A, 31 or 41A*1
16F, 27, 28A, 31 or 41A*3
16F, 28A, 31 or 41A*1
19C, 22A or 22f*1
19F or 23A*1
27 or 311
S. pneumoniae detection only6

Table 5. Distribution of serotypes detected in nasopharyngeal aspirates.

*Subgroup due to limited sample concentration
Download CSV


Accurate serotype determination of Streptococcus pneumoniae is of paramount importance to assess the impacts of pneumococcal vaccines and to monitor emergent serotypes. In order to simplify S. pneumoniae serotyping, we created a test that relies on a combination of genomic properties to identify serotypes. In this study, we validated the identification of 46 serotypes, 22 of which were precisely identified and 24 assigned to a subgroup of probable serotypes (Table 3). This is done by genotyping only 12 positions in the capsular operon. Serotypes included in the 13-valent vaccine were correctly assigned to serotypes, although serotypes 6A and 6B were sometimes not resolved. Bioinformatics analyses of available capsular operon sequences suggest that several other serotypes could be identified using this assay, but they have not been validated in the current study. In such cases, and for vaccinal serotypes 6A, 6B and 19A, it would be appropriate to confirm serotype determination using another method, such as the Quellung assay. Validation could also be performed to increase the precision of serotyping when the assay provides a subgroup of serotypes or unvalidated serotypes. We also tested the assay on clinical specimens consisting in 56 nasopharyngeal aspirates from patients with pneumonia and 43 cerebrospinal fluid samples from patient with pneumococcal meningitis, demonstrating that the assay do not require culture prior to serotyping.

The assay targets 12 polymorphic positions located in genes of the capsular operon. It was designed with sufficient redundancy to allow for the occasional false negative probe so that, in most cases, missing probes will not prevent serotyping. If several variants are possible at a single position in the known S. pneumoniae population, all polymorphisms are accounted for in the design. This provides redundancy that may counterbalance missing results, making the assay more robust. This is especially useful in specimens with low sample concentration or in specimens for which capsular gene sequences may differ from reference sequences. In some cases, a precise identification of serotypes is theoretically possible, but missing signal for some polymorphisms prevent precise serotype identification. Thus, the results for these samples are a group of serotypes. This could be caused by sequence variations in target genes and could be resolved by testing the samples using another method.

The assay was validated on 166 cultures of Streptococcus pneumoniae covering 63 different serotypes. Fourteen samples were not identified as the appropriate serotype but were correctly assigned as members of the correct serogroup. Six samples of serotype 6A were identified as serotype 6B and one serotype 6B was identified as 6A/6C. It is implicit in the literature that serotypes 6A and 6B are difficult to discriminate using DNA-based assays. Indeed, the vast majority of previously published nucleic acid serotyping methods do not discriminate serotype 6A from 6B [6,1422,27,32]. Moreover, 6A pneumococci with a wzh sequence previously observed in 6B serotypes have been reported in the Netherlands, highlighting the genetic diversity of this serogroup [26]. Assay designs based on phylogeny can provide relevant information, but can prove unreliable in some cases, such as in distinguishing between serotypes 6A and 6B. Designs based on functional polymorphisms, such as functional wciP genetic differences, could be more robust and reliable for these serotypes [29]. Serotypes 19B and 19C have been misidentified as serotype 19A. This is unexpected since the profile observed for these samples is not concordant with the published sequences for these serotypes [28]. It is of note that most molecular assays do not identify serotypes 19B and 19C, and that only a handful of molecular assays have been validated with serotypes 19B or 19C [15,22,32]. Variant 19F capsular genes have also been reported, leading to false positive 19A identification using PCR [37]. This highlights one of the challenges involved in validating S. pneumoniae serotyping assays, since it is difficult to obtain several representative strains of all serotypes. Using our assay, serotypes 24F was serotyped as 24B and 28F as 28A. Similar results were obtained with sequetyping [27]. The other misidentified samples were of low prevalence serotypes. Discordant samples outside of the serogroup were of serotypes 21, 41F, 42, 44, 45. Negative serotyping was obtained for 18 samples, 8 of which would have been correctly serotyped if their score had not been under the threshold of 0.4. They include 2 samples of serotype 3, and one sample for each of the following serotypes : 6A, 6B, 13, 14, 15A and 19F. These false negatives could be due to sequence variations that affect either PCR amplification, leading to low signal for some genotyping probes, or to mutations affecting the nucleotides that are genotyped by the assay, leading to missing signal or genotyping that do not conform with the information available in the serotyping database. This highlights the main problems that can be observed when a limited number of reference sequences are available to design assays.

In a recent study, Scott and collaborators used a glass-based microarray to detect the presence or absence of capsular genes, the combination of which allowed the identification of serotypes [31]. Instead of targeting a large number of capsular genes, the method described herein relies on the genotyping of selected nucleotides to identify serotypes. The use of a limited number of targets allowed the development of a multiplex PCR, which improves the limit of detection of the assay and eliminate the need for culture.

An advantage of this assay is that raw data from experiments can be reanalyzed upon the addition of new entries to the serotyping database or modifications to the identification software. Notably, new DNA sequences for which the serotype is known can help validate the assay and improve its robustness. The sequencing of additional S. pneumoniae capsular operons for relevant serotypes could improve the specificity of this assay. By testing the assay on several S. pneumoniae strains of the same serotype, we observed divergences between the expected genotype based on already determined capsular sequences and the actual results. For example, serotypes 19B and 19C produced the same genotyping profile as 19A, while the previously determined sequences hinted to a different profile. The same was observed with 6A compared to 6B. This suggests that actual strain-to-strain sequence variability within a single serotype could impact on the results of not only this assay, but on other molecular assays too. Thus, the sequencing of more isolates of each serotype would vastly improve serotyping assays that rely on genome sequences. In the future, new serotypes could be added to the analysis software provided that their capsular genes have been sequenced. The bioinformatics approach used to create this assay could also be applied to a larger sequence dataset, which should include the genome of several strains of each serotype, in order to create an improved molecular serotyping assay. A second multiplex PCR could be devised to genotype supplementary positions and thus improve the specificity and serotype coverage of the assay. In the end, as with other serotyping assays based on PCR and genotyping, immunological methods, such as the Quellung assay, may be required to confirm serotypes [6,37].

To facilitate the use of this assay in a clinical setting and to perform large scale epidemiological studies, we adapted the assay for use in the AutoGenomics INFINITI analyzer, an automated molecular diagnostic system that both reduces the hands-on time and the number of steps required to perform the assay. Twenty-four samples can be processed simultaneously with an overall processing time of 16 hours, with less than one hour of hands-on time. This system has been successfully used for the diagnostic of multiple respiratory viruses [38]. The serotyping assay could be adapted for additional platforms, such as the Luminex (Millipore), the BeadExpress (Illumina), or amplicon sequencing.

As exemplified by our Cameroon study [35] and several others [39], the serotypes circulating in developing countries are different from those observed in developed countries. This stresses the need for appropriate monitoring of vaccinal strains for surveillance. In order to properly describe the serotypes found in a country where no previous serotype prevalence studies have been performed, a test that allows the identification of a great number of serotypes would be beneficial. An assay such as the one described herein constitutes a reliable tool for a first screening of the serotypes circulating in unstudied populations. An approach based on polymorphism detection instead of serotype-by-serotype targets would allow a more versatile first screening of previously unstudied regions, where a wide coverage of serotypes is required.


This microarray-based assay uses a single test to identify most S. pneumoniae serotypes or serogroups. It is therefore useful for many types of infections, sample sources and for different countries. The current embodiment of the assay constitutes at this time a reasonable compromise between complexity and yield. Basically, it is a proof of concept that S. pneumoniae serotype identification can be performed by typing a limited combination of carefully selected polymorphic positions. Indeed, 22 serotypes can be precisely identified and 24 can be assigned to a subgroup of serotypes by genotyping only 12 nucleotides of the capsular operon. While the assay could be improved by the addition of new targets, the concept behind the assay could be used to devise methods to solve problems of similar complexity.


The authors wish to thank Maurice Boissinot and Marie-Claude Hélie, for providing access to the Centre de recherche en Infectiologie de l’Université Laval strain collection and preparing bacterial genomic DNA, and Kathrin Mühlemann from the Swiss National Reference Laboratory for S. pneumoniae, for providing multiple serotyped strains of S. pneumoniae.

Author Contributions

Conceived and designed the experiments: FR LR JC AG. Performed the experiments: FR NB RA. Analyzed the data: FR LR JC. Contributed reagents/materials/analysis tools: FR CT BL AG. Wrote the manuscript: FR NB RA LR BL CT JC AG.


  1. 1. O’Brien KL, Wolfson LJ, Watt JP, Henkle E, Deloria-Knoll M et al. (2009) Burden of disease caused by Streptococcus pneumoniae in children younger than 5 years: global estimates. Lancet 374: 893–902. Available: Accessed 1 July 2010. doi: PubMed: 19748398.
  2. 2. Feikin DR, Klugman KP (2002) Historical changes in pneumococcal serogroup distribution: implications for the era of pneumococcal conjugate vaccines. Clin Infect Dis Off Publ Infect Dis Soc Am 35: 547–555. Available: Accessed 20 May 2011. doi: PubMed: 12173128.
  3. 3. Kyaw MH, Lynfield R, Schaffner W, Craig AS, Hadler J et al. (2006) Effect of introduction of the pneumococcal conjugate vaccine on drug-resistant Streptococcus pneumoniae. N Engl J Med 354: 1455–1463. Available: Accessed 20 May 2011. doi: PubMed: 16598044.
  4. 4. Pilishvili T, Lexau C, Farley MM, Hadler J, Harrison LH et al. (2010) Sustained reductions in invasive pneumococcal disease in the era of conjugate vaccine. J Infect Dis 201: 32–41. Available: Accessed 20 August 2010. doi: PubMed: 19947881.
  5. 5. Hicks LA, Harrison LH, Flannery B, Hadler JL, Schaffner W et al. (2007) Incidence of pneumococcal disease due to non-pneumococcal conjugate vaccine (PCV7) serotypes in the United States during the era of widespread PCV7 vaccination, 1998-2004. J Infect Dis 196: 1346–1354. Available: Accessed 1 July 2010. doi: PubMed: 17922399.
  6. 6. Siira L, Kaijalainen T, Lambertsen L, Nahm MH, Toropainen M et al. (2012) From Quellung to multiplex PCR, and back when needed, in pneumococcal serotyping. J Clin Microbiol 50: 2727–2731. Available: Accessed 13 November 2012. doi: PubMed: 22692742.
  7. 7. Resti M, Moriondo M, Cortimiglia M, Indolfi G, Canessa C et al. (2010) Community-acquired bacteremic pneumococcal pneumonia in children: diagnosis and serotyping by real-time polymerase chain reaction using blood samples. Clin Infect Dis Off Publ Infect Dis Soc Am 51: 1042–1049. Available: Accessed 2 January 2011. doi: PubMed: 20883110.
  8. 8. Hassan-King M, Baldeh I, Secka O, Falade A, Greenwood B (1994) Detection of Streptococcus pneumoniae DNA in blood cultures by PCR. J Clin Microbiol 32: 1721–1724. Available: Accessed 23 April 2011. PubMed: 7929764.
  9. 9. Lalitha MK, Thomas K, Kumar RS, Steinhoff MC (1999) Serotyping of Streptococcus pneumoniae by coagglutination with 12 pooled antisera. J Clin Microbiol 37: 263–265. Available: Accessed 14 May 2013. PubMed: 9854110.
  10. 10. Sheppard CL, Harrison TG, Smith MD, George RC (2011) Development of a sensitive, multiplexed immunoassay using xMAP beads for detection of serotype-specific streptococcus pneumoniae antigen in urine samples. J Med Microbiol 60: 49–55. Available: Accessed 14 May 2013. doi: PubMed: 20864547.
  11. 11. Leeming JP, Cartwright K, Morris R, Martin SA, Smith MD (2005) Diagnosis of invasive pneumococcal infection by serotype-specific urinary antigen detection. J Clin Microbiol 43: 4972–4976. Available: Accessed 14 May 2013. doi: PubMed: 16207950.
  12. 12. Sheppard CL, Guiver M, Hartley J, Harrison TG, George RC (2011) Use of a multiplexed immunoassay for detection of serotype-specific Streptococcus pneumoniae antigen in pleural fluid and cerebrospinal fluid specimens. J Med Microbiol 60: 1879–1881. Available: Accessed 14 May 2013. doi: PubMed: 21835973.
  13. 13. Azzari C, Moriondo M, Indolfi G, Massai C, Becciolini L et al. (2008) Molecular detection methods and serotyping performed directly on clinical samples improve diagnostic sensitivity and reveal increased incidence of invasive disease by Streptococcus pneumoniae in Italian children. J Med Microbiol 57: 1205–1212. Available: Accessed 26 March 2013. doi: PubMed: 18809546.
  14. 14. Modak JK, Steinhoff MC, Zaman K, Islam M, El Arifeen S et al. (2012) Detection and serotyping of lyophilized nonculturable pneumococcal isolates. J Clin Microbiol 50: 3388–3390. Available: Accessed 22 March 2013. doi: PubMed: 22855517.
  15. 15. Pai R, Gertz RE, Beall B (2006) Sequential multiplex PCR approach for determining capsular serotypes of Streptococcus pneumoniae isolates. J Clin Microbiol 44: 124–131. Available: Accessed 29 March 2011. doi: PubMed: 16390959.
  16. 16. Morais L, Carvalho MdaG, da G, Roca A, Flannery B, Mandomando I et al. (2007) Sequential multiplex PCR for identifying pneumococcal capsular serotypes from South-Saharan African clinical isolates. J Med Microbiol 56: 1181–1184. Available: Accessed 2 January 2011. doi: PubMed: 17761480.
  17. 17. Dias CA, Teixeira LM, Carvalho MdaG, da G, Beall B (2007) Sequential multiplex PCR for determining capsular serotypes of pneumococci recovered from Brazilian children. J Med Microbiol 56: 1185–1188. Available: Accessed 2 January 2011. doi: PubMed: 17761481.
  18. 18. Saha SK, Darmstadt GL, Baqui AH, Hossain B, Islam M et al. (2008) Identification of serotype in culture negative pneumococcal meningitis using sequential multiplex PCR: implication for surveillance and vaccine design. PLOS ONE 3: e3576. Available: Accessed 2 January 2011. doi: PubMed: 18974887.
  19. 19. Selva L, Del Amo E, Brotons P, Muñoz-Almagro C (2012) Rapid and easy identification of capsular serotypes of Streptococcus pneumoniae by use of fragment analysis by automated fluorescence-based capillary electrophoresis. J Clin Microbiol 50: 3451–3457. Available: Accessed 26 March 2013. doi: PubMed: 22875895.
  20. 20. Coskun-Ari FF, Guldemir D, Durmaz R (2012) One-step multiplex PCR assay for detecting Streptococcus pneumoniae serogroups/types covered by 13-valent pneumococcal conjugate vaccine (PCV13). PLOS ONE 7: e50406. Available: Accessed 22 March 2013. doi: PubMed: 23226519.
  21. 21. Ahn JG, Choi SY, Kim DS, Kim KH (2012) Enhanced detection and serotyping of Streptococcus pneumoniae using multiplex polymerase chain reaction. Korean journal of pediatrics 55: 424–429. Available: Accessed 22 March 2013.
  22. 22. Pimenta FC, Roundtree A, Soysal A, Bakir M, Du Plessis M et al. (2013) Sequential triplex real-time PCR assay for detecting 21 pneumococcal capsular serotypes that account for a high global disease burden. J Clin Microbiol 51: 647–652. Available: Accessed 28 March 2013. doi: PubMed: 23224094.
  23. 23. O’Sullivan MVN, Zhou F, Sintchenko V, Kong F, Gilbert GL (2011) Multiplex PCR and reverse line blot hybridization assay (mPCR/RLB). Journal of visualized experiments : JoVE. Available: . Accessed 26 January 2012.
  24. 24. Batt SL, Charalambous BM, McHugh TD, Martin S, Gillespie SH (2005) Novel PCR-restriction fragment length polymorphism method for determining serotypes or serogroups of Streptococcus pneumoniae isolates. J Clin Microbiol 43: 2656–2661. Available: Accessed 30 March 2011. doi: PubMed: 15956380.
  25. 25. Shaaly A, Tellevik MG, Langeland N, Høiby EA, Jureen R (2005) Comparison of serotyping, pulsed field gel electrophoresis and amplified fragment length polymorphism for typing of Streptococcus pneumoniae. J Med Microbiol 54: 467–472. Available: Accessed 30 March 2011. doi: PubMed: 15824425.
  26. 26. Elberse KEM, Van de Pol I, Witteveen S, Van der Heide HGJ, Schot CS et al. (2011) Population structure of invasive Streptococcus pneumoniae in The Netherlands in the pre-vaccination era assessed by MLVA and capsular sequence typing. PLOS ONE 6: e20390. Available: Accessed 22 March 2013. doi: PubMed: 21637810.
  27. 27. Leung MH, Bryson K, Freystatter K, Pichon B, Edwards G et al. (2012) Sequetyping: serotyping Streptococcus pneumoniae by a single PCR sequencing strategy. J Clin Microbiol 50: 2419–2427. Available: Accessed 13 November 2012. doi: PubMed: 22553238.
  28. 28. Bentley SD, Aanensen DM, Mavroidi A, Saunders D, Rabbinowitsch E et al. (2006) Genetic analysis of the capsular biosynthetic locus from all 90 pneumococcal serotypes. PLOS Genet 2: e31. Available: Accessed 13 August 2010. doi: PubMed: 16532061.
  29. 29. Pai R, Limor J, Beall B (2005) Use of pyrosequencing to differentiate Streptococcus pneumoniae serotypes 6A and 6B. J Clin Microbiol 43: 4820–4822. Available: Accessed 26 March 2013. doi: PubMed: 16145148.
  30. 30. Jin P, Xiao M, Kong F, Oftadeh S, Zhou F et al. (2009) Simple, accurate, serotype-specific PCR assay to differentiate Streptococcus pneumoniae serotypes 6A, 6B, and 6C. J Clin Microbiol 47: 2470–2474. Available: Accessed 22 March 2013. doi: PubMed: 19535528.
  31. 31. Scott JR, Hinds J, Gould KA, Millar EV, Reid R et al. (2012) Nontypeable pneumococcal isolates among navajo and white mountain apache communities: are these really a cause of invasive disease? J Infect Dis 206: 73–80. Available: Accessed 22 March 2013. doi: PubMed: 22536000.
  32. 32. Tomita Y, Okamoto A, Yamada K, Yagi T, Hasegawa Y et al. (2011) A new microarray system to detect Streptococcus pneumoniae serotypes. J Biomed Biotechnol: 2011: 352736. Available: . Accessed 22 March 2013.
  33. 33. Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques. Morgan Kaufmann. Available: Accessed 29 March 2011.
  34. 34. Johnston C, Hinds J, Smith A, Van der Linden M, Van Eldere J et al. (2010) Detection of large numbers of pneumococcal virulence genes in streptococci of the mitis group. J Clin Microbiol 48: 2762–2769. Available: Accessed 27 July 2011. doi: PubMed: 20519466.
  35. 35. Gervaix A, Taguebue J, Bescher BN, Corbeil J, Raymond F et al. (2012) Bacterial meningitis and pneumococcal serotype distribution in children in cameroon. Pediatr Infect Dis J 31: 1084–1087. Available: Accessed 13 November 2012. PubMed: 22653488.
  36. 36. Corless CE, Guiver M, Borrow R, Edwards-Jones V, Fox AJ et al. (2001) Simultaneous detection of Neisseria meningitidis, Haemophilus influenzae, and Streptococcus pneumoniae in suspected cases of meningitis and septicemia using real-time PCR. J Clin Microbiol 39: 1553–1558. Available: Accessed 19 May 2011. doi: PubMed: 11283086.
  37. 37. Pimenta FC, Gertz RE, Roundtree A, Yu J, Nahm MH et al. (2009) Rarely occurring 19A-like cps locus from a serotype 19F pneumococcal isolate indicates continued need of serology-based quality control for PCR-based serotype determinations. J Clin Microbiol 47: 2353–2354. Available: Accessed 7 January 2011. doi: PubMed: 19439547.
  38. 38. Raymond F, Carbonneau J, Boucher N, Robitaille L, Boisvert S et al. (2009) Comparison of automated microarray detection with real-time PCR assays for detection of respiratory viruses in specimens obtained from children. J Clin Microbiol 47: 743–750. Available: Accessed 2 December 2010. doi: PubMed: 19158263.
  39. 39. Johnson HL, Deloria-Knoll M, Levine OS, Stoszek SK, Freimanis Hance L et al. (2010) Systematic evaluation of serotypes causing invasive pneumococcal disease among children under five: the pneumococcal global serotype project. PLOS Med 7: ([MedlinePgn:]). Available: Accessed 27 March 2013 PubMed: 20957191.