Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Resolution and Characterization of Distinct cpn60-Based Subgroups of Gardnerella vaginalis in the Vaginal Microbiota

  • Teenus Paramel Jayaprakash,

    Affiliation Department of Veterinary Microbiology, Western College of Veterinary Medicine, University of Saskatchewan, Saskatoon, Saskatchewan, Canada

  • John J. Schellenberg,

    Affiliation Department of Microbiology, University of Manitoba, Winnipeg, Manitoba, Canada

  • Janet E. Hill

    Affiliation Department of Veterinary Microbiology, Western College of Veterinary Medicine, University of Saskatchewan, Saskatoon, Saskatchewan, Canada

Resolution and Characterization of Distinct cpn60-Based Subgroups of Gardnerella vaginalis in the Vaginal Microbiota

  • Teenus Paramel Jayaprakash, 
  • John J. Schellenberg, 
  • Janet E. Hill


Bacterial vaginosis (BV), characterized by a shift of the vaginal microbiota from a Lactobacillus-dominated community to a dense biofilm containing a complex mixture of organisms, is an important risk factor in poor reproductive health outcomes. The Nugent score, based on Gram stain, is used to diagnose BV and Gardnerella vaginalis abundance in the sample is one factor determining Nugent score. A high Nugent score is indicative of BV but does not always correspond to the presence of clinical symptoms. G. vaginalis is recognized as a heterogeneous group of organisms, which can also be part of the normal, healthy vaginal microbiome. In addition, asymptomatic BV and non-Gardnerella types of BV are being recognized. In an attempt to resolve the heterogeneous group of G. vaginalis, a phylogenetic tree of cpn60 universal target sequences from G. vaginalis isolates was constructed that indicates the existence of four subgroups of G. vaginalis. This subdivision, supported by whole genome similarity calculation of representative strains using JSpecies, demonstrates that these subgroups may represent different species. The cpn60 subgroupings did not correspond with the Piot biotyping scheme, but did show consistency with ARDRA genotyping and sialidase gene presence. Isolates from all four subgroups produced biofilm in vitro. We also investigated the distribution of G. vaginalis subgroups in vaginal samples from Kenyan women with Nugent scores consistent with BV, Intermediate and Normal microbiota (n = 44). All subgroups of G. vaginalis were detected in these women, with a significant difference (z = −3.372, n = 39, p = 0.001) in frequency of G. vaginalis subgroup B between BV and Normal groups. Establishment of a quantifiable relationship between G. vaginalis subgroup distribution and clinical status could have significant diagnostic implications.


Gardnerella vaginalis, first isolated by Leopold in 1953 [1], has long been recognized in vaginal samples and has been identified by several names, including Haemophilus vaginalis by Gardner and Dukes in 1955 [2]. Further characterization based on metabolic requirements and Gram staining led to its reclassification as Corynebacterium vaginale [3]. The proposal to create the genus Gardnerella and allocation of Corynebacterium vaginale and Haemophilus vaginalis to this new taxon as Gardnerella vaginalis was put forward by Greenwood and Pickett [4], based on a taxonomic study that utilized DNA-DNA hybridization, biochemical analysis of the cell wall, and electron microscopy.

G. vaginalis is strongly associated with bacterial vaginosis (BV), and is one of the most frequently isolated bacteria from women with symptoms of BV [5][7]. Abundance of G. vaginalis in vaginal samples has also been associated with infertility and preterm labour [8]. G. vaginalis has also been isolated from urine and blood and is associated with bacteremia, osteomyelitis and cervical cancer [9][11]. However, recent studies of vaginal microbiota indicate that G. vaginalis can also be a part of the vaginal microbiota in clinically healthy women [6], [12], [13].

G. vaginalis is recognized as a diverse taxon, both phenotypically and genotypically [13][15]. Eight biotypes of G. vaginalis have been identified by Piot et al. based on the presence of β-galactosidase, lipase and hippurate hydrolysis activities [14], whereas Benito et al. identified seventeen biotypes based on these characteristics in addition to fermentation of xylose, arabinose and galactose [15]. Phenotypic diversity within G. vaginalis has also been described in terms of virulence factors, particularly production of sialidase [16] and formation of biofilms [17]. Genetic heterogeneity within G. vaginalis has been demonstrated using amplified ribosomal DNA restriction analysis (ARDRA) [18]. Santiago et al. [16], identified three ARDRA genotypes of G. vaginalis, of which only two genotypes (genotypes 1 and 3) produced sialidase. However, like biotyping schemes, ARDRA can only be performed on isolates.

Genotype diversity is apparent in whole genome studies and in metagenomic studies of the human vaginal microbiome based on 16S rRNA [19] or cpn60 [6], [12]. Hummelen et al. [19] reported the presence of four types of G. vaginalis sequences differing by a single nucleotide within the 16S rRNA V6 region. Four clusters of G. vaginalis sequences, ranging between 89 and 100% sequence identity to the type strain (ATCC 14018T), were observed in a cpn60-based study of clinically healthy women by Hill et al. [12] and followed up in a larger study by Schellenberg et al. [6].

Previous work by our research group has demonstrated that cpn60 universal target sequences can resolve phenotypically distinct strains or ecotypes within an intestinal microbial community, and that these sequences are also excellent predictors of whole genome sequence relationships [20], [21]. The gene encoding the universal 60 kDa chaperonin protein (cpn60) is an established target for detection and identification of microorganisms, as well as gene-based metagenomic studies of complex microbial communities, including the vaginal microbiome [6], [12], [22][31]. An approximately 555 bp region corresponding to nucleotides 274–828 of the E. coli cpn60 gene can be amplified with degenerate, universal PCR primers [32], [33]. This universal target (UT) region is phylogenetically informative, providing more discriminating power than 16S rRNA to differentiate organisms, even at the sub-species or strain level [20], [34][42]. A highly curated reference database of chaperonin sequences, cpnDB, supports cpn60 based applications [43].

Given the observed phenotypic diversity (especially virulence factors), genotypic diversity, and the presence of G. vaginalis in women regardless of clinical status, it is critical to improve our understanding of the clinical significance of these different strains. In order to accomplish this most effectively and to lay the foundation for the development of more informative diagnostic tools for women’s health, direct culture-independent analysis of vaginal samples, exploiting a genetic target that facilitates robust resolution is required.

The objective of the current study was to investigate if previously observed cpn60 based subgroups of G. vaginalis are consistent with other (phenotypic) classification systems and/or available whole genome sequences, and to investigate the distribution of cpn60 defined subgroups of G. vaginalis in women with and without BV. Our results demonstrate that the cpn60 universal target sequence differentiates distinct subgroups within G. vaginalis and that only one of these subgroups (Subgroup B: Piot biotype 5, sialidase positive and ARDRA genotype 1) was found to be significantly more abundant in women with BV (high Nugent score) than women with normal vaginal microbiota in a retrospective analysis of metagenomic profiles of Kenyan women.

Materials and Methods

Bacterial Isolates

G. vaginalis ATCC 14018 (type strain) and ATCC 49145 were obtained from the American Type Culture Collection (Manassas, VA). Eight additional strains were isolated from Kenyan (N170, N165, N160, N158, N153, N148, N144, N143, N137, N134, N101, and N72) or Canadian (W11) women as described previously [6]. G. vaginalis isolates were cultured using the following media: ATCC #1685 broth (with or without 1% (w/v) glucose), Brain Heart Infusion broth (BHI) with 1% (w/v) glucose, egg yolk agar [14] and Columbia agar with 5% sheep blood (BD, Mississauga, ON). The GasPak EZ Pouch System (BD, Mississauga, ON) was used to provide anaerobic conditions.

DNA Extraction and PCR

DNA was extracted from isolates using a phenol-chloroform extraction method and was stored at −20°C. Primers used in the study are listed in Table 1. cpn60 UT PCR amplicons were produced for direct sequencing, using universal primers H729 and H730 as described previously [33]. Primers JH0315 and JH0316 were designed based on the 16S rRNA sequence from G. vaginalis ATCC 14018. Amplification with these primers was carried out by incubating the reactions at 94°C for 3 minutes, followed by 40 cycles of 94°C for 30 sec, 52°C for 1 min and 72°C for 90 sec, and completed with a final extension of 10 min at 72°C. Sialidase gene presence was assessed by amplifying the sialidase gene using primers GVSI forward and GVSI reverse [16]. Vaginolysin gene sequences were amplified using primers V1 and V2 as previously described [44].

Phenotyping of G. vaginalis Isolates

Representative G. vaginalis isolates with unique cpn60 sequences were phenotyped using the Piot typing scheme using assays for hippurate hydrolysis, β-galactosidase and lipase activity as described previously [14]. Lactobacillus crispatus was used as a negative control for the lipase assay. G. vaginalis ATCC 14018T was used as a positive control for all biochemical assays.

Biofilm Formation

Isolates of G. vaginalis were cultured from −80°C stocks for 72 hrs, and subcultured for 48 hrs on Columbia sheep blood agar plates anaerobically at 37°C. A loopful of culture for each isolate was used to inoculate 4 ml of either ATCC broth #1685 with 1% (w/v) glucose or Brain Heart Infusion with 1% (w/v) glucose (BHIG) and incubated anaerobically for 48 hrs at 37°C. Broth cultures were diluted 1∶100 in media and 200 µL of diluted culture was added to individual wells of a 96-well tissue culture plate and incubated anaerobically for 48 hrs at 37°C. Qualitative assessment of biofilm formation was done by washing off the planktonic cells and staining the wells with 1% crystal violet solution to visualize any biofilm.

ARDRA Genotyping

G. vaginalis isolates were genotyped using amplified rDNA restriction analysis (ARDRA) [16], [18]. Full-length 16S rRNA gene sequences were amplified using primers GV10F and ωMB (Table 1). PCR products were purified (QIAquick PCR Purification Kit, Qiagen, Inc., Toronto, ON) and subjected to overnight restriction digestion using TaqI (Life Technologies, Inc., Burlington, ON). The digestion products were resolved on a 1.5% agarose gel at 140 volts for 2 hrs. In silico ARDRA was performed for some strains for which only published genome sequence information was available (not the isolates themselves) by extracting full length 16S rRNA gene sequences from published whole genome sequences and then restricting the sequence using the program remap within the EMBOSS software suite [45].

Sequence Sources

Published genome sequences of G. vaginalis used in the study, either completed or in progress, were downloaded from NCBI’s Genome database: strains 101 (Accession AEJD00000000), 315-A (AFDI00000000), 41V (AEJE00000000), 409-05 (CP001849), 5-1 (ADAN00000000), AMD (ADAM00000000), ATCC 14018 (ADNB00000000), ATCC 14019 (CP002104) and HMP9231 (CP002725). The metagenomic cpn60 sequences and cpn60 UT sequences of G. vaginalis isolates used were from a previously published study of vaginal microbiota of commercial sex workers in Kenya [6].

Phylogenetic Analysis

cpn60 UT sequences obtained from whole genome sequences of G. vaginalis reference strains or amplified from cultured clinical isolates [6] were used to construct a phylogenetic tree, using Alloscardovia omnicolens CCUG 34444 as a root. Sequences were aligned using ClustalW (gap opening penalty = 10, gap extension penalty = 0.10) [46], followed by utilization of the Phylip software package [47] to calculate a distance matrix using dnadist and construct a tree using neighbor. The final tree was obtained from the bootstrapped consensus of 100 trees and was visualized using Dendroscope [48].

Statistical Analysis

All statistical analyses were done using SPSS Statistics, version 19.0. For the analysis of assignment of assembled reads of Nairobi metagenomic data set to G. vaginalis subgroups, one-way ANOVA was done, followed by Post-hoc analysis by Tukey’s test. Statistical analysis for the distribution of G. vaginalis subgroups in Nairobi women was done by Kruskal-Wallis H test, followed by Mann-Whitney U test.


cpn60-based Resolution of G. vaginalis Subgroups

A phylogenetic tree of G. vaginalis cpn60 UT sequences is shown in Figure 1A. This tree was used to select landmark sequences for further comparisons and for subgrouping of G. vaginalis metagenomic sequences. The discreteness of the subgroups was further supported by demonstration of a bimodal distribution of pairwise distances (inter- and intra-subgroup) between strains (Figure 1B). The four subgroups resolved were designated A, B, C and D and two representative sequences from each subgroup were selected as representatives to capture the maximum phylogenetic distance represented by the phylogenetic tree.

Figure 1. cpn60 UT sequence-based subgroups of G. vaginalis.

A. Phylogenetic tree of G. vaginalis-like cpn60 UT sequences comprising four distinct clades: A, B, C and D. Bootstrap values for each node are indicated. 101, 315-A, 41V, 409-05, 5-1, AMD, ATCC 14018T, ATCC 14019, and HMP9231 are G. vaginalis isolates with whole genome sequence information available in Genbank (Accession numbers AEJD00000000, AFDI00000000, AEJE00000000, CP001849, ADAN00000000, ADAM00000000, ADNB00000000, CP002104 and CP002725 respectively). Isolates with names starting with “N” are isolates from Kenyan women from Schellenberg et al. [6]. W11 was isolated from a Canadian woman (Schellenberg, Unpublished). Sequences highlighted in red were used as representatives of the subgroups in the distribution analysis of metagenomic sequence data. B. Pairwise distances for the 26 G. vaginalis cpn60 UT sequences included in the phylogenetic analysis. Distances for both inter-subgroup comparisons (white bars) and intra-subgroup comparisons (black bars) are indicated.

Whole Genome, cpn60 and 16S rRNA Comparisons

Whole genome similarity calculations of representative strains of G. vaginalis for which there was either complete or partial genome sequence data available were calculated with JSpecies (Table 2). Within subgroups, pairwise Average Nucleotide Identity by MUMmer (ANIm) values were >95% and cpn60 identities were ≥96%, while between subgroups ANIm values were <90% and cpn60 identities were ≤92%. 16S rRNA pairwise identities were all 98–100%. Pairwise cpn60 and 16S rRNA sequence identities for representative isolates with unique cpn60 sequences are shown in Table 3. The pairwise sequence identity for cpn60 gene sequences and 16S rRNA gene sequences for isolates within G. vaginalis subgroups are >96% and >97%, respectively. Between G. vaginalis subgroups, pairwise identities ranged between 87% and 93% for cpn60 sequences and 97% and 100% for 16S rRNA sequences.

Table 2. ANIm (first row), cpn60 UT sequence identity (second row) and 16S rRNA sequence identity (third row) for representatives of G. vaginalis subgroups A, C and D, for which whole genome sequence data was available.

Table 3. Pairwise sequence identity of cpn60 (first row of each column) and full-length 16S rRNA (second row) for G. vaginalis isolates.

Biotyping, Sialidase, ARDRA Genotyping and Biofilm Production

Results for the biotyping assays and genotyping are shown in Table 4. Piot biotypes 1, 2, 5, 7 and 8 were identified among the isolate collection, but no consistent pattern of biotype distribution and cpn60 subgroup was observed. All subgroup C isolates were lipase positive. Subgroup B and C isolates were sialidase gene positive and ARDRA genotype 1, whereas subgroup A isolates were genotype 2 with no detection of the sialidase gene. Subgroup D isolates differed in sialidase gene presence (N160 was negative, strain 101 positive) and ARDRA genotype (N160 was genotype 2, strain 101 was predicted to be genotype 1 based on in silico restriction analysis). All isolates produced biofilm in BHIG by 48 hrs, and substantial variability in biofilm production was observed in the two media tested (BHIG and ATCC broth #1685 with 1% glucose) (Figure 2). Both subgroup B isolates (N144 and N153) formed biofilm in both media, although the biofilm formed in BHIG was more extensive, completely coating the well. In subgoups A and C, at least one of the isolates failed to produce any visible biofilm in ATCC broth #1685.

Table 4. Piot biotype, sialidase gene presence, and ARDRA characterization of G. vaginalis isolates (representatives of study isolates with unique cpn60 sequences) and published whole genome sequences.

Figure 2. Biofilm formation by G. vaginalis.

Isolates were cultured in 96-well plate in two different media: ATCC broth #1685 and BHI, stained at 48 hrs, after removal of planktonic cells.

We also attempted to detect vaginolysin gene sequences in the isolates selected for phenotypic analysis. A PCR product of the expected size of 1,551 bp was obtained for only three isolates (ATCC 14018T, ATCC 49145 and N153). An amplicon of 1,200 bp was amplified from three others (N134, N137 and N158), but sequence analysis indicated a mixture of products, suggesting that this product was the result of non-target sequence amplification. Isolates N165 and N144 did yield any product after repeated attempts. The vaginolysin sequence from ATCC 49145 was 99% identical to ATCC 14018 and only 91% identical to N153.

Distribution of G. vaginalis Subgroups in Kenyan Women

A previously published cpn60 metagenomic dataset was used to investigate distribution of cpn60-based G. vaginalis subgroups in vaginal microbiome profiles derived from samples classified as BV, Intermediate or Normal based on Nugent score [6]. All unique sequences assembled from the study data (n = 831 OTU) were compared using watered-BLAST [30] to a reference database of cpn60 sequences containing one representative of each species in cpnDB (cpnDB_nr; and two representatives of each G. vaginalis subgroup as indicated in Figure 1. All assembled sequences with any of the G. vaginalis reference sequences as their best match, and meeting the minimum requirement for identification as a cpn60 sequence (60% identity over ≥100 nucleotides) were included in the analysis of distribution (n = 93). For 84/93 of the assembled metagenomic sequences, the top two hits were to the same G. vaginalis subgroup. Identities for each query and its top two hits (medians for 93 queries were 96.6% and 92.9% respectively) were significantly (p<0.0001) higher than identity to the third (median 89.4%) through eighth best hits (Figure 3). Thirty-one queries had sequence identities <95% to their best match.

Figure 3. Percent identity of metagenomic sequences to G. vaginalis reference strains.

Distribution of percent identity of G. vaginalis metagenomic sequences to their first through eighth best matches among representative sequences of the G. vaginalis subgroups. The reference database included two representatives of each subgroup, indicated in Figure 1. Significant differences in percent identity (p<0.0001) are indicated by *.

Once metagenomic sequences were assigned to a subgroup (based on the best watered-BLAST match) the distribution of the subgroups among the vaginal microbiomes of women diagnosed as BV (n = 20), Intermediate (n = 5) and Normal (n = 19), based on Nugent score was determined (Figure 4). The sequence read frequencies used in this analysis were normalized to the median library size of 15,000 reads [6]. All vaginal microbiota libraries contained sequences corresponding to more than one subgroup of G. vaginalis. Out of 44 libraries sequenced, 41, 43, 43 and 27 contained G. vaginalis subgroup A, B, C and D respectively. The majority of libraries (25/44) contained sequence from all four subgroups. The next most prevalent combination was A+B+C (n = 14 libraries), while other combinations were present in the remaining four samples (n = 1 for A+C, n = 2 for B+C, n = 1 for B+C+D, n = 1 for A+B+D). The difference in frequencies of G. vaginalis subgroup sequence reads was tested using Kruskal-Wallis H test (SPSS Statistics, version 17.0). A significant difference (χ2 value = 12.329, df = 2, p = 0.002) was observed only for subgroup B sequences. Pairwise comparison on subgroup B sequences between the clinical groups (BV, Intermediate, and Normal) was analyzed by Mann-Whitney U test and results showed a significantly greater abundance of subgroup B sequences in BV compared to Normal (z = −3.372, n = 39, p = 0.001).

Figure 4. Distribution of G. vaginalis subgroups in African women.

Relative abundance of sequence reads corresponding to Gardnerella subgroups (scaled to median library size of 15,000 reads) among clinical categories (BV, Intermediate and Normal, based on Nugent score). Boxplots were created for each Gardnerella subtype and p values calculated based on non-parametric significance tests (Mann-Whitney U test) using SPSS Statistics version 19.0.


Bacterial vaginosis is the most commonly reported vaginal infection [49], [50]. BV can be diagnosed clinically using Amsel’s criteria, which include presence of homogenous vaginal discharge, a vaginal pH of greater than 4.5, positive whiff test (production of a fishy odour on addition of 10% KOH to vaginal sample), and also presence of clue cells in at least 20% of the total cell count [51]. The Nugent score is another commonly used diagnostic tool for BV. To calculate the Nugent score, a Gram stained vaginal smear is assessed for the relative abundance of various bacterial morphotypes including Gram-positive large rods, Gram-negative/Gram-variable rods and curved Gram-variable rods. With increasing numbers of survey studies in which Nugent scores of clinically normal women are determined, the phenomenon of “asymptomatic BV” has been widely observed [52][54]. These women have high Nugent scores, but do not have symptoms of BV. The clinical significance of asymptomatic BV is unknown. Since the presence of G. vaginalis is one of the key determinants of Nugent score, one possible explanation for asymptomatic BV is the presence of large numbers of non-pathogenic G. vaginalis or other species with similar Gram stain morphology. If this is true, then the detection of G. vaginalis in general may be of questionable diagnostic value. Resolution of this important issue requires the investigation of distribution of G. vaginalis lineages among women in a variety of clinical cohorts. Tackling this on a large scale requires culture-independent tools that provide resolution of phenotypically distinct G. vaginalis subgroups when applied directly to clinical samples.

Sequence diversity within G. vaginalis has been reported in metagenomic studies of the vaginal microbiome based on 16S rRNA and cpn60 gene targets, and four subdivisions have been reported in several recent studies [6], [12], [19]. The subdivision observed by Hummelen et al. [19] was based on single nucleotide differences within the V6 region of 16S rRNA. The much lower cpn60 sequence identity between these subgroups (≤93% versus ≥98% identity for 16S rRNA) facilitated the identification of vaginal isolates corresponding to these subgroups, demonstrating that the metagenomic studies had revealed real biological diversity and not artifactual diversity resulting from PCR, sequencing and or data assembly (Figure 1). Overestimation of microbial diversity in metagenomic sequencing studies is an ongoing concern [55], so the identification and characterization of actual isolates corresponding to metagenomic sequences is reassuring and further supports the value of the cpn60 universal target for resolution of diversity at species- and strain-level [20].

Whole genome DNA–DNA hybridization persists as the gold standard method for defining bacterial species [56] although it remains unpopular due to its technical demands [57]. DNA sequence data is increasingly relied upon to support species definition and resolution, and recently whole genome sequence comparison has been suggested as a new gold standard [58]. Richter & Rosselló-Mora [58] demonstrated that average nucleotide identity (ANI) values correlate well with DNA-DNA hybridization results and suggest that an ANI values greater than ≈95–96%, calculated by either BLAST or the MUMmer rapid aligning tool, were indicative of bacteria of the same species. Another alternative was proposed by Ziegler, who developed a computational algorithm based on sequence of three genes (recN, rpoA and thdF) that corresponds well to the conclusions of DNA-DNA hybridization data [59]. In that study, the 16S rRNA gene, widely used for identifying bacterial species and metagenomic studies, was found to have the lowest correlation between sequence identity and genome sequence identity. Subsequently, Verbeke et al. [21] demonstrated that a cpn60 UT sequence alone could predict whole genome identity as well as the three gene model. The ease of amplifying and sequencing the cpn60 UT from bacteria, the curated reference database of chaperonin sequences (cpnDB, [43], and the ability of the cpn60 UT to predict whole genome sequence similarity make it the ideal target for studies of Gardnerella, or any other bacterial taxon for which subspecies resolution is of interest.

Our results show a strong relationship between cpn60 UT sequence identity and whole genome comparison with the ANIm algorithm in JSpecies (Table 2). In fact, our results suggest that subgroups A, C and D of G. vaginalis meet the whole genome sequence-based criteria for designation as different species. Although no whole genome sequence is available for a subgroup B isolate, the cpn60 sequence data for isolates in this group certainly support a similar species level status for this group (Table 3). Complete genomes of nine G. vaginalis strains determined at the time of writing, and the fact that none of them belongs to subgroup B is interesting. Our experience with culturing of subgroup B isolates suggest that their conspicuous absence from the genome sequence database is most likely due to the fact that members of this subgroup, unlike the others, only grow in anaerobic conditions and do not grow in 7% CO2, which is the atmosphere recommended for routine isolation of G. vaginalis [60].

The heterogeneity of the G. vaginalis taxon is well documented based on application of biotyping schemes. Some of the biotypes of G. vaginalis from both the Piot and Benito biotyping schemes have been associated with BV [61], [62]. Piot biotypes 1, 4 and 5 are the most frequently isolated regardless of BV status [13] and biotype 5 has been reported to be predominantly associated with healthy vaginal ecosystems [61]. Piot biotypes 7 and 8 have been reported as the most frequently isolated from BV patients with isolation rate of 32% and 20%, respectively [61]. Of the seven isolates characterized in this study, four were Piot biotype 5, supporting previous observations of the prevalence of this biotype (Table 4). Otherwise, biotyping results were not consistent with sialidase gene presence, ARDRA or observations of association with BV in the Kenyan cohort. Although we did not provide evidence of sialidase enzymatic activity in our isolates, we observed a consistent relationship between the presence or absence of the sialidase gene and ARDRA genotype in that we detected the sialidase gene in all genotype 1 isolates examined and did not detect the sialidase gene in any of the genotype 2 isolates examined. Sialidase activity is recognized as a virulence factor in G. vaginalis and is the basis for a chromogenic, BV Blue Kit for BV diagnosis [63], but the results of reports examining the correlation between sialidase gene presence and detection of sialidase enzymatic activity are variable, making it inadvisable to draw conclusions about sialidase activity based solely on gene presence [16], [64].

Vaginolysin is a protein toxin belonging to the cholesterol-dependent cytolysin family of toxins that has been previously identified in G. vaginalis [44]. To detect this purported virulence factor in the study isolates, we employed previously published PCR primers designed based on the type strain, ATCC 14018. The primers failed to amplify the target sequence from most study isolates, and among the isolates for which we did generate sequence (ATCC 14018, ATCC 49145 and N153), we observed only 91% nucleotide sequence identity between some isolates (N153 vs. either the type strain or ATCC 49145). These results suggest that these primers may be too specific for general application in G. vaginalis, rather than indicating the absence of a vaginolysin gene in the other strains included in the study.

Subgroups of G. vaginalis were not evenly distributed among vaginal microbiomes diagnosed as BV, intermediate or normal based on Nugent score (Figure 4). Although G. vaginalis sequences were ubiquitous in the study group, and most women hosted multiple subgroups of G. vaginalis, only subgroup B was significantly more abundant in BV than normal samples. Analysis of pH and clue cells in these samples showed, as expected, a negative correlation of pH and Nugent score and a positive correlation of clue cells and Nugent score (data not show). An obvious and immediate question is whether subgroup B or any other subgroup is differentially associated with symptomatic and asymptomatic BV. Although we were unable to stratify our current data by symptoms (discharge, odour), relatively low cpn60 sequence identities facilitate robust differentiation of G. vaginalis subgroups (Figure 1 and 3) making it an ideal target to exploit in culture-independent approaches to addressing these questions in future studies. High throughput sequencing of cpn60 amplicons [30], bead-based hybridization assays [65] and quantitative real-time PCR methods [66] have all been developed based on cpn60 UT sequences and offer powerful tools for investigation of microbial diversity at, and below, the species level.

The results of our work support previous observations of genotypic and phenotypic diversity within G. vaginalis and we have been successful in using cpn60 UT sequences for robust classification of available G. vaginalis strains into four subgroups. We have also provided evidence that supports the eventual reclassification of subgroups as different species of Gardnerella. The degree of cpn60 UT and whole genome sequence diversity within this taxon is beyond that associated with “ecotypes” [67] or strains and suggests that reclassification may be warranted. However, additional genotypic and phenotypic analysis of additional isolates will be required to make this case. The cpn60 UT sequence offers a robust tool for identification of subgroups within G. vaginalis that may not be discernable using other targets. This feature of the cpn60 target will facilitate future efforts to expand diagnostic panels for rapid, high throughput characterization and improved resolution of species and strain distribution in the vaginal microbiome.

Author Contributions

Conceived and designed the experiments: TPJ JJS JEH. Performed the experiments: TPJ. Analyzed the data: TPJ JJS JEH. Wrote the paper: TPJ JJS JEH.


  1. 1. Leopold S (1953) Heretofore undescribed organism isolated from the genitourinary system. U S Armed Forces Med J 4: 263–266.
  2. 2. Gardner HL, Dukes CD (1955) Haemophilus vaginalis vaginitis: a newly defined specific infection previously classified non-specific vaginitis. Am J Obstet Gynecol 69: 962–976.
  3. 3. Zinnemann K, Turner GC (1963) The taxonomic position of “Haemophilus vaginalis” [Corynebacterium vaginale]. The Journal of Pathology and Bacteriology 85: 213–219.
  4. 4. Greenwood JR, Pickett MJ (1980) Transfer of Haemophilus vaginalis Gardner and Dukes to a new genus, Gardnerella: G. vaginalis (Gardner and Dukes) comb. nov. Int J Syst Evol Microbiol 30: 170–178.
  5. 5. Hillier SL, Critchlow CW, Stevens CE, Roberts MC, Wolner-Hanssen P, et al. (1991) Microbiological, epidemiological and clinical correlates of vaginal colonisation by Mobiluncus species. Genitourin Med 67: 26–31.
  6. 6. Schellenberg JJ, Links MG, Hill JE, Dumonceaux TJ, Kimani J, et al. (2011) Molecular definition of vaginal microbiota in East African commercial sex workers. Appl Environ Microbiol 77: 4066–4074.
  7. 7. Demba E, Morison L, van der Loeff MS, Awasana AA, Gooding E, et al. (2005) Bacterial vaginosis, vaginal flora patterns and vaginal hygiene practices in patients presenting with vaginal discharge syndrome in The Gambia, West Africa. BMC Infect Dis 5: 12.
  8. 8. Menard JP, Mazouni C, Salem-Cherif I, Fenollar F, Raoult D, et al. (2010) High vaginal concentrations of Atopobium vaginae and Gardnerella vaginalis in women undergoing preterm labor. Obstet Gynecol 115: 134–140.
  9. 9. Mikamo H, Sato Y, Hayasaki Y, Kawazoe K, Izumi K, et al. (1999) Intravaginal bacterial flora in patients with uterine cervical cancer. High incidence of detection of Gardnerella vaginalis. J Infect Chemother 5: 82–85.
  10. 10. Johnson AP, Boustouller YL (1987) Extra-vaginal infection caused by Gardnerella vaginalis. Epidemiol Infect 98: 131–137.
  11. 11. Graham S, Howes C, Dunsmuir R, Sandoe J (2009) Vertebral osteomyelitis and discitis due to Gardnerella vaginalis. J Med Microbiol 58: 1382–1384.
  12. 12. Hill JE, Goh SH, Money DM, Doyle M, Li A, et al. (2005) Characterization of vaginal microflora of healthy, nonpregnant women by chaperonin-60 sequence-based methods. Am J Obstet Gynecol 193: 682–692.
  13. 13. Tosun I, Alpay Karaoglu S, Ciftci H, Buruk CK, Aydin F, et al. (2007) Biotypes and antibiotic resistance patterns of Gardnerella vaginalis strains isolated from healthy women and women with bacterial vaginosis. Mikrobiyol Bul 41: 21–27.
  14. 14. Piot P, Van Dyck E, Peeters M, Hale J, Totten PA, et al. (1984) Biotypes of Gardnerella vaginalis. J Clin Microbiol 20: 677–679.
  15. 15. Benito R, Vazquez JA, Berron S, Fenoll A, Saez-Neito JA (1986) A modified scheme for biotyping Gardnerella vaginalis. J Med Microbiol 21: 357–359.
  16. 16. Santiago GL, Deschaght P, El Aila N, Kiama TN, Verstraelen H, et al.. (2011) Gardnerella vaginalis comprises three distinct genotypes of which only two produce sialidase. Am J Obstet Gynecol 204: 450 e451–457.
  17. 17. Swidsinski A, Doerffel Y, Loening-Baucke V, Swidsinski S, Verstraelen H, et al. (2010) Gardnerella biofilm involves females and males and is transmitted sexually. Gynecol Obstet Invest 70: 256–263.
  18. 18. Ingianni A, Petruzzelli S, Morandotti G, Pompei R (1997) Genotypic differentiation of Gardnerella vaginalis by amplified ribosomal DNA restriction analysis (ARDRA). FEMS Immunol Med Microbiol 18: 61–66.
  19. 19. Hummelen R, Fernandes AD, Macklaim JM, Dickson RJ, Changalucha J, et al. (2010) Deep sequencing of the vaginal microbiota of women with HIV. PLoS ONE 5: e12078.
  20. 20. Vermette CJ, Russell AH, Desai AR, Hill JE (2010) Resolution of phenotypically distinct strains of Enterococcus spp. in a complex microbial community using cpn60 universal target sequencing. Microbial Ecol 59: 14–24.
  21. 21. Verbeke TJ, Sparling R, Hill JE, Links MG, Levin D, et al. (2011) Predicting relatedness of bacterial genomes using the chaperonin-60 universal target (cpn60 UT): application to Thermoanaerobacter species. Syst Appl Microbiol 34: 171–179.
  22. 22. Chaban B, Links MG, Hill JE (2012) A molecular enrichment strategy based on cpn60 for detection of Epsilon-Proteobacteria in the dog fecal microbiome. Microbial Ecol 63: 348–357.
  23. 23. Desai AR, Links MG, Collins SA, Mansfield GS, Drew M, et al. (2012) Effects of plant-based diets on the distal gut microbiome of rainbow trout (Oncorhynchus mykiss). Aquaculture 350–353: 134–142.
  24. 24. Desai AR, Musil KM, Carr AP, Hill JE (2009) Characterization and quantification of feline fecal microbiota using cpn60 sequence-based methods and investigation of animal-to-animal variation in microbial population structure. Vet Microbiol 137: 120–128.
  25. 25. Dumonceaux TJ, Hill JE, Hemmingsen SM, Van Kessel AG (2006) Characterization of intestinal microbiota and response to dietary virginiamycin supplementation in the broiler chicken. Appl Environ Microbiol 72: 2815–2823.
  26. 26. Dumonceaux TJ, Hill JE, Pelletier C, Paice MG, Van Kessel AG, et al. (2006) Molecular characterization of microbial communities in Canadian pulp and paper activated sludge and quantification of a novel Thiothrix eikelboomii-like bulking filament. Can J Microbiol 52: 494–500.
  27. 27. Fernando WMU, Hill JE, Zello GA, Tyler RT, Dahl WJ, et al. (2010) Diets supplemented with chickpea or its main oligosaccharide component raffinose modify fecal microbial composition in healthy adults. Beneficial Microbes 1: 197–207.
  28. 28. Hill JE, Hemmingsen SM, Goldade BG, Dumonceaux TJ, Klassen J, et al. (2005) Comparison of ileum microflora of pigs fed corn-, wheat-, or barley-based diets by chaperonin-60 sequencing and quantitative PCR. Appl Environ Microbiol 71: 867–875.
  29. 29. Hill JE, Seipp RP, Betts M, Hawkins L, Van Kessel AG, et al. (2002) Extensive profiling of a complex microbial community by high-throughput sequencing. Appl Environ Microbiol 68: 3055–3066.
  30. 30. Schellenberg J, Links MG, Hill JE, Dumonceaux TJ, Peters GA, et al. (2009) Pyrosequencing of the chaperonin-60 universal target as a tool for determining microbial community composition. Appl Environ Microbiol 75: 2889–2898.
  31. 31. Schellenberg J, Links MG, Hill JE, Hemmingsen SM, Peters GA, et al. (2011) Pyrosequencing of chaperonin-60 (cpn60) amplicons as a means of determining microbial community composition. Methods Mol Biol 733: 143–158.
  32. 32. Goh SH, Potter S, Wood JO, Hemmingsen SM, Reynolds RP, et al. (1996) HSP60 gene sequences as universal targets for microbial species identification: studies with coagulase-negative staphylococci. J Clin Microbiol 34: 818–823.
  33. 33. Hill JE, Town JR, Hemmingsen SM (2006) Improved template representation in cpn60 PCR product libraries generated from complex templates by application of a specific mixture of PCR primers. Environ Microbiol 8: 741–746.
  34. 34. Brousseau R, Hill JE, Prefontaine G, Goh SH, Harel J, et al. (2001) Streptococcus suis serotypes characterized by analysis of chaperonin 60 gene sequences. Appl Environ Microbiol 67: 4828–4833.
  35. 35. Goh SH, Facklam RR, Chang M, Hill JE, Tyrrell GJ, et al. (2000) Identification of Enterococcus species and phenotypically similar Lactococcus and Vagococcus species by reverse checkerboard hybridization to chaperonin 60 gene sequences. J Clin Microbiol 38: 3953–3959.
  36. 36. Hill JE, Paccagnella A, Law K, Melito PL, Woodward DL, et al. (2006) Identification of Campylobacter spp. and discrimination from Helicobacter and Arcobacter spp. by direct sequencing of PCR-amplified cpn60 sequences and comparison to cpnDB, a chaperonin reference sequence database. J Med Microbiol 55: 393–399.
  37. 37. Hung WC, Tseng SP, Chen HJ, Tsai JC, Chang CH, et al. (2010) Use of groESL as a target for identification of Abiotrophia, Granulicatella, and Gemella species. J Clin Microbiol 48: 3532–3538.
  38. 38. Sakamoto M, Ohkuma M (2010) Usefulness of the hsp60 gene for the identification and classification of Gram-negative anaerobic rods. J Med Microbiol 59: 1293–1302.
  39. 39. Sakamoto M, Suzuki N, Benno Y (2010) hsp60 and 16S rRNA gene sequence relationships among species of the genus Bacteroides with the finding that Bacteroides suis and Bacteroides tectus are heterotypic synonyms of Bacteroides pyogenes. Int J Syst Evol Microbiol 60: 2984–2990.
  40. 40. Minana-Galbis D, Urbizu-Serrano A, Farfan M, Fuste MC, Loren JG (2009) Phylogenetic analysis and identification of Aeromonas species based on sequencing of the cpn60 universal target. Int J Syst Evol Microbiol 59: 1976–1983.
  41. 41. Blaiotta G, Fusco V, Ercolini D, Aponte M, Pepe O, et al. (2008) Lactobacillus strain diversity based on partial hsp60 gene sequences and design of PCR-restriction fragment length polymorphism assays for species identification and differentiation. Appl Environ Microbiol 74: 208–215.
  42. 42. Jian W, Zhu L, Dong X (2001) New approach to phylogenetic analysis of the genus Bifidobacterium based on partial HSP60 gene sequences. Int J Syst Evol Microbiol 51: 1633–1638.
  43. 43. Hill JE, Penny SL, Crowell KG, Goh SH, Hemmingsen SM (2004) cpnDB: a chaperonin sequence database. Genome Res 14: 1669–1675.
  44. 44. Gelber SE, Aguilar JL, Lewis KL, Ratner AJ (2008) Functional and phylogenetic characterization of Vaginolysin, the human-specific cytolysin from Gardnerella vaginalis. J Bacteriol 190: 3896–3903.
  45. 45. Rice P, Longden I, Bleasby A (2000) EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet 16: 276–277.
  46. 46. Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucl Acids Res 22: 4673–4680.
  47. 47. Felsenstein J (1993) PHYLIP (Phylogeny Infererence Package) version 3.5c, distributed by the author, Department of Genetics, University of Washington, Seattle, USA.
  48. 48. Huson DH, Richter DC, Rausch C, Dezulian T, Franz M, et al. (2007) Dendroscope: An interactive viewer for large phylogenetic trees. BMC Bioinformatics 8: 460.
  49. 49. Fethers KA, Fairley CK, Hocking JS, Gurrin LC, Bradshaw CS (2008) Sexual risk factors and bacterial vaginosis: a systematic review and meta-analysis. Clin Infect Dis 47: 1426–1435.
  50. 50. Ness RB, Hillier SL, Kip KE, Soper DE, Stamm CA, et al. (2004) Bacterial vaginosis and risk of pelvic inflammatory disease. Obstet Gynecol 104: 761–769.
  51. 51. Amsel R, Totten PA, Spiegel CA, Chen KC, Eschenbach D, et al. (1983) Nonspecific vaginitis. Diagnostic criteria and microbial and epidemiologic associations. Am J Med 74: 14–22.
  52. 52. Gibbs RS (2007) Asymptomatic bacterial vaginosis: is it time to treat? Am J Obstet Gynecol 196: 495–496.
  53. 53. Leitich H, Kiss H (2007) Asymptomatic bacterial vaginosis and intermediate flora as risk factors for adverse pregnancy outcome. Best Pract Res Clin Obstet Gynaecol 21: 375–390.
  54. 54. Mastrobattista JM, Bishop KD, Newton ER (2000) Wet smear compared with gram stain diagnosis of bacterial vaginosis in asymptomatic pregnant women. Obstet Gynecol 96: 504–506.
  55. 55. Reeder J, Knight R (2009) The ‘rare biosphere’: a reality check. Nature Meth 6: 636–637.
  56. 56. International Committee on Systematic Bacteriology announcement of the report of the ad hoc Committee on Reconciliation of Approaches to Bacterial Systematics. J Appl Microbiol 64: 283–284.
  57. 57. Cho JC, Tiedje JM (2001) Bacterial species determination from DNA-DNA hybridization by using genome fragments and DNA microarrays. Appl Environ Microbiol 67: 3677–3682.
  58. 58. Richter M, Rossello-Mora R (2009) Shifting the genomic gold standard for the prokaryotic species definition. Proc Nat Acad Sci USA 106: 19126–19131.
  59. 59. Zeigler DR (2003) Gene sequences useful for predicting relatedness of whole genomes in bacteria. Int J Syst Evol Microbiol 53: 1893–1900.
  60. 60. Holt JG (1994) Bergey’s Manual of Determinative Bacteriology. Baltimore: Lippincott Williams & Wilkins.
  61. 61. Aroutcheva AA, Simoes JA, Behbakht K, Faro S (2001) Gardnerella vaginalis isolated from patients with bacterial vaginosis and from patients with healthy vaginal ecosystems. Clin Infect Dis 33: 1022–1027.
  62. 62. Numanovic F, Hukic M, Nurkic M, Gegic M, Delibegovic Z, et al. (2008) Importance of isolation and biotypization of Gardnerella vaginalis in diagnosis of bacterial vaginosis. Bosn J Basic Med Sci 8: 270–276.
  63. 63. Kampan NC, Suffian SS, Ithnin NS, Muhammad M, Zakaria SZ, et al. (2011) Evaluation of BV Blue Test Kit for the diagnosis of bacterial vaginosis. Sex Reprod Healthc 2: 1–5.
  64. 64. Pleckaityte M, Janulaitiene M, Lasickiene R, Zvirbliene A (2012) Genetic and biochemical diversity of Gardnerella vaginalis strains isolated from women with bacterial vaginosis. FEMS Immunol Med Microbiol 65: 69–77.
  65. 65. Dumonceaux TJ, Schellenberg J, Goleski V, Hill JE, Jaoko W, et al. (2009) Multiplex detection of bacteria associated with normal microbiota and with bacterial vaginosis in vaginal swabs using oligonucleotide-coupled fluorescent microspheres. J Clin Microbiol 47: 4067–4077.
  66. 66. Chaban B, Musil KM, Himsworth CG, Hill JE (2009) Development of cpn60-based real-time quantitative PCR assays for the detection of 14 Campylobacter species and application to screening canine fecal samples. Appl Environ Microbiol 75: 3055–3061.
  67. 67. Cohan FM (2002) What are bacterial species? Annu Rev Microbiol 56: 457–487.