Specified Species in Gingival Crevicular Fluid Predict Bacterial Diversity

Background Analysis of gingival crevicular fluid (GCF) samples may give information of unattached (planktonic) subgingival bacteria. Our study represents the first one targeting the identity of bacteria in GCF. Methodology/Principal Findings We determined bacterial species diversity in GCF samples of a group of periodontitis patients and delineated contributing bacterial and host-associated factors. Subgingival paper point (PP) samples from the same sites were taken for comparison. After DNA extraction, 16S rRNA genes were PCR amplified and DNA-DNA hybridization was performed using a microarray for over 300 bacterial species or groups. Altogether 133 species from 41 genera and 8 phyla were detected with 9 to 62 and 18 to 64 species in GCF and PP samples, respectively, per patient. Projection to latent structures by means of partial least squares (PLS) was applied to the multivariate data analysis. PLS regression analysis showed that species of genera including Campylobacter, Selenomonas, Porphyromonas, Catonella, Tannerella, Dialister, Peptostreptococcus, Streptococcus and Eubacterium had significant positive correlations and the number of teeth with low-grade attachment loss a significant negative correlation to species diversity in GCF samples. OPLS/O2PLS discriminant analysis revealed significant positive correlations to GCF sample group membership for species of genera Campylobacter, Leptotrichia, Prevotella, Dialister, Tannerella, Haemophilus, Fusobacterium, Eubacterium, and Actinomyces. Conclusions/Significance Among a variety of detected species those traditionally classified as Gram-negative anaerobes growing in mature subgingival biofilms were the main predictors for species diversity in GCF samples as well as responsible for distinguishing GCF samples from PP samples. GCF bacteria may provide new prospects for studying dynamic properties of subgingival biofilms.


Introduction
Periodontitis remains widespread in populations aged 40 years or older [1] and represents one of the most common chronic bacterial infections in man. During its elongated progression, this asymptomatic infection causes local impairment, but it may also render the patients to increased risk for nonoral infections as well as to elevated proinflammatory host responses [2] linked to systemic diseases, such as cardiovascular diseases [3,4]. The mechanisms how periodontitis induces systemic inflammation are still unclear, but may include systemic dissemination of live bacteria and components from lysed bacteria and, as recently suggested, free-soluble components from live planktonic and biofilm bacteria from periodontal pockets [5,6].
Bacterial colonization and growth on supra-and subgingival tooth surfaces causes chronic inflammation in periodontal tissues. Increased flow of gingival crevicular fluid (GCF) [7], a serum transudate or inflammatory exudate, washes periodontal pockets and thereby provides host-derived substances that shape subgin-gival bacterial populations. Enriched with bacterial material from subgingival space GCF finally enters the oral cavity at the periodontal pocket orifice. In contrast to well-studied hostassociated constituents of GCF [8,9], no previous knowledge was found of its bacterial composition.
Recent advances in molecular methods give new possibilities to identify species in complex bacterial communities such as subgingival biofilms. Open-ended molecular biological methods have revealed the presence of previously unrecognized species and bacterial groups in subgingival samples [10,11,12,13]. The collected 16S rDNA sequence data serve as a basis for developing rapid methods, such as DNA microarrays for bacterial detection and identification from clinical samples. A recently-launched DNA microarray provides simultaneous detection of hundreds of cultivable and not-yet-cultured bacterial species from oral samples [14,15].
Passive and active detachment of bacterial cells from biofilms [16] occurs as part of their biofilm life cycle [17]. Detached (planktonic) bacteria downregulate genes required for biofilm formation and upregulate expression of virulence properties for an acute infection [18]. Thus, the quality and quantity of unattached bacterial populations in periodontal pockets and the phase of periodontal infection may depend on each other. Identifying unattached bacteria may also provide insight into the activity of subgingival biofilm in vivo. In the present study, we propose that bacterial species found in GCF reflect the bacterial populations detached from subgingival biofilms. As no previous knowledge was found of bacteria in GCF samples, our main objectives were to detect and identify bacterial species and to delineate contributors to species diversity in GCF samples. The results demonstrated complex bacterial populations. Certain species/phylotypes predicted species diversity in GCF samples and discriminated GCF samples from subgingival paper point (PP) samples.

Results
Descriptive data on the demographic, systemic and dental variables of the periodontitis patients are presented in Table 1. From the total of 438 bacterial probe targets incorporated in the DNA microarray, 133 were included in the present data analyses. These 133 probe targets represent those that were detected in at least one patient in GCF and/or PP samples at levels higher than the HOMIM signal level score of 1 (Positive scores range from the lowest 1 to the highest 5). The mean weights of the GCF and PP samples per patient were 7.862.6 ug and 5.762.8 ug, respectively.
Specified bacterial species but few host-associated variables contributed significantly to the species diversity in gingival crevicular fluid samples A phylogenetic analysis of the bacterial species identified in GCF samples revealed 8 bacterial phyla/candidate phyla, Firmicutes, Bacteroidetes, Fusobacteria, Proteobacteria, Actinobacteria, Spirochaetes, Synergistetes and TM7 ( Figure 1). The number of different bacterial species (species diversity) in GCF samples ranged from 9 to 62 (mean 33.7, SD 15.3) per patient. In order to delineate which of the bacterial and host-associated study variables contributed to the species diversity per sample, we generated a multivariate PLS (Projection to latent structures by means of partial least squares) model using 166 X variables (133 bacterial species and 33 hostassociated variables) and one Y variable (number of different species per sample) (Figure 2A, B, C). The loading scatter plot ( Figure 2A) gives an overview of all variables; the position of each X variable shows its relationship to other X variables but also to the Y variable. The bacterial species are shown as numbers for which the key is given in online Supporting information, Table S1. The observed vs predicted values for the number of different species per sample demonstrated a good fit for the model (R 2 = 0.975). According to cross validation the model predicted 79% of the variation in Y.
In the loading scatter plot (Figure 2A), the X variables closest to the Y variable suggests their positive correlation to each other and to species/phylotype diversity. They mainly included members of 3 phyla, Proteobacteria, Firmicutes and Bacteroidetes. Few host-related variables were found close to Y. Conversely, in the diagonal corner of the loading plot, suggesting a negative correlation to species/ phylotype diversity, several host-associated variables were found in addition to members of 2 of the above phyla, i.e., Proteobacteria and Bacteroidetes.
A PLS regression analysis was additionally performed to evaluate the extent each X variable contributed to the bacterial species/phylotype diversity in GCF samples. After removing X variables with regression coefficients between 0.02 and 20.02 regarded as less important correlations, the remaining X variables with the highest positive and negative correlations are shown in Figure 2B. Significant positive correlations were found for species of genera belonging to 3 phyla, Firmicutes (Selenomonas, Catonella, Dialister, Veillonella, Eubacterium, Peptostreptococcus, Pseudoramibacter and Streptococcus), Bacteroidetes (Porphyromonas, Prevotella and Tannerella) and Proteobacteria (Campylobacter). Most of the streptococcal species were milleri (anginosus) group streptococci, S. intermedius, S. constellatus and S. anginosus. Only a single species, Neisseria flavescens, had a negative correlation (,20.02) to species/phylotype diversity, although not significant at the 95% confidence interval level ( Figure 2B).
Among the dental variables, Plaque Index (PlI) per patient at sampled sites and at all examined sites in the dentition was the only significant positive contributor to the species/phylotype diversity in GCF samples ( Figure 2B). Periodontal variables, such as number of teeth with clinical attachment level (CAL) 4-6 mm (NTCAL2) or with probing depth (PD) .6 mm (NTPD3), mean CAL at all sites (CAL_all), proportion of sites with PD .6 mm, and Gingival Index (GI) at sampled sites (GI_S), as well as some other host-related variables e.g., glucose level and interrupted smoking habit, correlated positively, but not at 95% confidence interval level (data not shown). Conversely, the number of teeth with CAL ,4 mm (NTCAL1) was the only significant negative contributor to species/phylotype diversity ( Figure 2B). Mean frequency of bleeding on probing (BOP) at sampled sites (BOP_S) and leukocyte count had negative regression coefficients (,20.02), but they did not reach statistical significance. Comparison between bacterial species in gingival crevicular fluid samples and subgingival paper point samples We subsequently asked whether the composition of the bacterial populations identified in GCF samples differed from those identified in PP samples obtained from the same periodontal sites as the GCF samples. The overall species/phylotype diversity in PP samples, ranging from 18 to 64 bacterial species (mean 41.8, SD 12.0) per patient, resembled that in GCF samples (data above). Figure 3 demonstrates the distribution pattern of the 133 species or groups shown as detection frequency and percentage of all samples positive for the depicted species/phylotype in the GCF and PP samples, separately. Except for 15 (11%) species found only in GCF samples and 17 (13%) species found only in PP samples, the majority (N = 101, 76%) of the 133 species/phylotypes were identified in both sample types; 23 (23%) more often in GCF than PP samples and 58 (57%) more often in PP than GCF samples ( Figure 3).
To delineate differences between the bacterial species found in GCF and PP samples a phylogenetic tree ( Figure 4) was constructed of the species/phylotypes that had statistically significant PLS regression coefficients (.0.02 or ,20.02) and thus predicted species/phylotype diversity in GCF samples (data from Figure 2C) and PP samples (data from Figure 5). From the 49 species/phylotypes included in the phylogenetic analysis, 38 (78%) were found in GCF samples, 23 (47%) in PP samples, and 12 (25%) in both. The species distribution among the 7 phyla or among all species/phylotypes did not significantly differ between GCF and PP samples in bivariate data analyses ( Figure 4).

Discussion
To our knowledge this is the first study to identify bacteria in GCF samples although host-derived constituents of GCF have been extensively studied during the past 40 years [9]. Interestingly, we found a wide diversity of different species in GCF samples despite the small sample amounts, from 4 to 12 ug per patient, and low bacterial concentrations, which could have challenged the yield of representative bacterial DNA for processing and DNA microarray analysis. The extracted DNA appeared to be of sufficient quality and quantity and the PCR amplification and microarray analysis were successful, as suggested by identification of 118 species belonging to 41 genera or higher taxa, altogether representing 8 phyla/candidate phyla in GCF samples. Bacterial species/phylotype diversity in GCF samples was comparable to that in subgingival PP samples obtained from the same periodontal sites as the GCF samples. Data on subgingival species diversity have been somewhat lower or of the same magnitude in other studies using 16S rDNA cloning [10,13,14,19] and 16S rDNA microarray [14,15].
In order to obtain representative bacterial samples for the present purposes, filter paper strips and paper points were chosen for sampling from GCF and periodontal pockets, respectively. Both devices absorb GCF with its various constituents e.g., bacteria, but the sampling procedure with paper points introduced along the root surface down to the bottom of the periodontal pockets additionally disrupts the attached subgingival biofilms, thus incorporating both unattached and attached subgingival bacteria in PP samples. Choosing these sampling methods instead of others, such as using capillary tubes for GCF and curettes for subgingival sampling, was based on the attempt to cause least possible stimulation and trauma to inflamed periodontal tissues and thus to minimize increase of GCF flow and gingival bleeding due to the mechanical sampling procedure. Bleeding by itself dilutes pocket contents and causes pressure in the pocket. In addition, strictly aseptic sampling technique and isolation of the sampling area were applied to avoid sample contamination from supragingival plaque and saliva.
The low patient number but large number of variables comprising demographic, systemic, periodontal and bacterial variables prompted us to use multivariate PLS modeling for the data analysis due to its capacity to handle complex data including low number of subjects, noise and artifacts as well as distribution and colinearity of variables [20]. Projection methods also enable effective data visualization tools that help interpret complex data and thereby also provide enhanced possibilities to comprehend implications. For the present data analyses PLS and OPLS/ 2OPLS-DA models with high predictive power were chosen according to the cross validation procedure described in the Methods section. Score and loading plots were used for an overview of the respective correlation matrices, and regression coefficients for prediction of responses, such as bacterial species/ phylotype diversity and the GCF sample or PP sample  Table S1). PLS regression coefficient plot (Panel B) identified X variables with statistically significant correlation to Y. The coefficients (.0.02 or ,20.02 are shown) are statistically significant membership. The coefficients for individual variables were generally low probably due to the complexity of the indigenous subgingival bacterial community. However, with use of confidence intervals to define statistical significance certain species were separated as the most important contributors to the responses in question. Regardless of the potent multivariate approach used for the analysis of the complex set of microbial and host variables, the questions concerning interindividual differences in GCF versus PP microbiota as well as time stability of microbial profiles still remain to be further investigated.
Our result that the statistically significant PLS regression coefficients distinguished a group of bacterial species/phylotypes among the diverse microbiota in GCF samples indicates that these species/phylotypes were the strongest contributors to and predictors for the species diversity in GCF samples. It also suggests that these species/phylotypes favored a multispecies environment. They appeared to be species traditionally classified as Gram-negative anaerobes belonging to genera Campylobacter, Selenomonas, Porphyromonas, Catonella and Tannerella, but also Gram-positive species from genera Eubacterium, Peptostreptococcus and Streptococcus. Thus, the species/phylotypes identified using PLS regression analysis as significant predictors for species/phylotype diversity in GCF samples mainly belonged to the late colonizers in dental plaque [21], and are known of their preferred growth in subgingival environment and their association with periodontitis [10,11,13,15,22]. Interestingly, milleri group streptococci, such as S. intermedius, S. constellatus and S. anginosus, related to abscesses and other pathologies, but also some mitis streptococci, known for their early plaque colonization and compatibility with dental health [23,24], were detected among the species predicting bacterial species diversity in GCF samples. However, other early colonizers on tooth surfaces, such as Actinomyces spp., Rothia dentocariosa, Gemella haemolysans, Eikenella corrodens, K. oralis and Haemophilus spp. were not represented among the predictor species. Fusobacterium spp. that coaggregate with species colonizing early e.g., Actinomyces spp. and Streptococcus spp., but also with those colonizing late e.g., P. gingivalis and Eubacterium spp., in dental plaque [21], were not found among the positive predictors for species/ phylotype diversity in GCF samples. This may mean that fusobacteria remained below the detection level or that they did not serve as a core for detached bacterial cell aggregates. Altogether, these results suggest that certain bacterial species mainly belonging to mature subgingival biofilms readily detach from subgingival biofilms or lyse/release DNA, both alternatives being able to cause host pathology in parenteral space.
None of the host-associated variables correlated significantly positively and only a single periodontal variable, number of teeth with CAL ,4 mm, correlated significantly negatively to the species/phylotype diversity in GCF samples, suggesting that the healthier the periodontium the lower the species diversity. This is in accord with studies reporting lower number of different subgingival species in subjects with healthy periodontal conditions than in those with periodontitis [10,15]. Somewhat unexpected was that host variables, particularly periodontal variables, did not more clearly correlate to bacterial diversity, although the robustness of the response variable could offer one explanation.
Although most of the 133 identified bacterial species were detected both in GCF and in PP samples some species, such as E. saphenum, F. nucleatum ss nucleatum, S. sputigena and Treponema Cluster I, frequent in traditional subgingival samples in periodontitis, were only found in GCF samples. These species could be prone to be flushed out in GCF or the result may have been affected by the generally low detection rate (Treponema Cluster I) in the present patients, dependence on the probes (E. saphenum, S. sputigena) or possibly that in PP samples they remained under detection level (F. nucleatum ss nucleatum). To exploit the potential of the full data set with quantitative bacterial data we subsequently applied OPLS/ O2PLS discriminant analysis and found that certain species significantly positively or negatively contributed to GCF sample membership and vice versa to PP sample membership.
The positive contributors to GCF sample membership were mainly species traditionally classified as Gram-negative anaerobes, such as C. rectus/concisus, P. nigrescens, D. pneumosintes and T. forsythia, that are common subgingival microbiota and associated with periodontitis. Somewhat unexpected was that Campylobacter spp. were the only significantly contributing motile species. It could have been speculated that periodontitis-associated motile bacteria actively leave biofilm and will thus be associated with GCF sample. One possible reason is that other motile bacteria remained under detection level of HOMIM analysis that discloses predominant bacteria in the sample. As seen in the results, several motile species among Selenomonas and Treponema genera were positively associated with GCF sample, but the association did not reach statistical significance.
On the other hand, the positive contributors to PP sample membership included mainly Gram-positive bacteria, such as mitis-and milleri-group streptococci and anaerobic rods, all of them current or previous members of genus Eubacterium (E. brachy, E. saphenum, E. yurii, S. satelles and P. alactolyticus). As mitis streptococci are early plaque colonizers and common in periodontal health [25,26] less is known of S. satelles [27] that recently was associated with periodontal destruction [15]. P. alactolyticus, former Eubacterium alactolyticum [28], has been found in subgingival samples of periodontally healthy children [29] but also in those of periodontitis patients [15]. The species separation between GCF and PP samples supports the capability of the employed sampling methods to collect unattached and attached microbiota. For example, Gram-negative species may occur unattached or loosely attached in periodontal pockets and were more readily absorbed with GCF to filter paper strips than the above-mentioned Gram-positive species.
Our results suggest that in periodontal pockets there are unattached bacteria and/or their species-specific DNA that can be identified from GCF samples taken at the orifice of the gingival sulcus and thus without entering the periodontal pockets. To date data are accumulating of passive and active mechanisms for bacterial dispersal from biofilms [16,30], but little knowledge is still available of the life cycle of complex natural biofilms such as human subgingival dental plaque. This is an interesting research field since the quality and quantity of bacterial species detached from subgingival biofilms may differ in different phases of periodontal inflammation. Thus, in future studies, monitoring the bacterial composition of GCF may give information of the current activity of the subgingival biofilm in vivo. Based on the present results, we conclude that the DNA microarray analysis of GCF samples from periodontitis patients revealed phylogenetically diverse bacterial populations and that species traditionally classified as Gram-negative anaerobes appeared to be the main predictors for the species diversity as well as for the distinction of GCF from PP samples.
when the error bars do not cross the 0 line. The model explained 98% and, according to cross validation, predicted 79% of the variation in Y. Observed values vs. predicted values R 2 = 0.975 (Panel C). Capital letters are patient identifiers (Panel C). X variables were scaled and centered and Y variable scaled. The confidence intervals were derived from jack knifing. GCF: gingival crevicular fluid. doi:10.1371/journal.pone.0013589.g002

Ethics statement
The study design was approved by the Ethics Committee of the Faculty of Medicine for Human Studies, Gazi University (June 16, 2006, #208) and the patients signed an informed consent.

Patients
Seventeen adult patients with periodontitis (age range 33-64 years), who met the inclusion criteria and volunteered for the study, were recruited from the Department of Periodontology, Gazi University, Ankara, Turkey (June-August 2008).
Data on demographics and smoking habit were collected with a self-reported questionnaire. Body Mass Index (BMI) was calculat-   Figure 2B for the GCF samples and from Figure 5 for the PP samples) were selected for the phylogenetic analysis. The marker bar represents 5% difference in nucleotide sequences. GF abbreviation for gingival crevicular fluid and PP for subgingival paper point samples. Black boxes indicate presence and white boxes absence of the depicted species/phylotype. doi:10.1371/journal.pone.0013589.g004 ed. Blood samples for leukocyte count and fasting blood glucose were taken and analyzed at the Gazi University Hospital Laboratory.
The diagnosis of periodontitis was based on clinical and radiographic examination. The study inclusion criteria were as follows: generally healthy adults, at least 5 teeth with clinical attachment level (CAL) exceeding 4 mm, no continuous medications, no mechanical periodontal therapy or antibiotics during the preceding 6 months, and never had periodontal surgery. Plaque Index (PlI), Gingival Index (GI), probing depths (PD), bleeding on probing (BOP), and clinical attachment level (CAL) were recorded at 6 sites (both vestibular and oral aspects of the mesial, distal and mid surfaces) per tooth from all teeth in the dentition by a periodontist, as previously described [31].

Bacterial sampling
Six caries-free periodontal sites with the deepest, inflamed periodontal pockets and attachment loss were selected for bacterial sampling during the clinical examination of the patients.
One week later, the GCF samples and subgingival PP samples were taken as follows: After removing supragingival plaque with sterile curettes using coronal strokes starting from the gingival margin, a PerioPaper strip (Oraflow Inc.) was gently placed to the orifice of the gingival sulcus to collect fluid approximately half of the strip length. Care was taken not to contaminate the strip from the supragingival tooth surfaces or saliva, or to mechanically irritate the gingiva. Approximately 10 min after the GCF sampling, subgingival samples were taken with sterile paper points (MetaAbsorbent Paperpoints; MetaBiomed Co. Ltd., Chungbuk, Korea) from the same 6 periodontal sites as the GCF samples. The PP samples were introduced to the bottom of each periodontal pocket and removed after 10 sec [32]. The GCF samples and the PP samples were transferred to separate sterile Eppendorf tubes. For each patient, one GCF sample pooled from 6 sites and one PP sample pooled from 6 pockets were immediately placed at 270uC, where they were preserved until sent to the Department of Oral Microbiology, Umeå University, Sweden, for DNA extraction. To quantify the samples, eppendorf tubes with strips and those with paper points were weighed before and after sampling.
DNA extraction and 16S rRNA gene amplification for DNA microarray analysis For DNA extraction, bacteria were eluted from 6 PerioPaper strips (GCF) and from 6 paper points (subgingival sample), respectively, per patient using 400 ml buffer (16PBS or TE). Genomic DNA was then extracted using the QiaAmp DNA isolation kit (QiaGen), according to the instructions of the manufacturer. 16S rRNA genes were amplified from each genomic DNA sample, isolated as above, by means of a double PCR approach, essentially as described earlier [15]. For this, 2 separate 25 ml PCR reactions were run, using an equimolar mixture of oligonucleotides 4F:59 and 6F:59 (each at a final concentration of 200 nM) as the forward primer, and oligonucleotides 1541R and 1492R, respectively (final concentration 400 nM) as reverse primers. The reaction mixtures also contained 16 PCR buffer II and 1.5 U AmpliTaqH DNA polymerase (Applied Biosystems), 1.5 mM MgCl 2 , 200 nM of each deoxynucleotide, and 2 ml of the DNA template. The PCR cycling conditions were as previously described [15], with the exception that 72uC was used for the elongation steps instead of 68uC. The 2 PCR reactions were pooled, purified using the QiaQuick PCR purification kit (QiaGen), and dried down using Speedvac, prior to microarray analysis (HOMIM, Boston, MA) as previously described [15].
Oral taxon (OT) designation for each species/phylotype are provided in Human Oral Microbiome Database (HOMD) www. homd.org. The HOMD links sequence data of each OT with phenotypic, phylogenetic, clinical and bibliographic information. The HOMD also provides information of the OTs included in each species cluster.

Statistical data analysis
For the present data analysis, traditional statistics did not seem to be optimal due to the low number of patients and colinearity of the variables. Therefore, a multivariate regression technique, PLS (partial least squares) projections to latent structures [20], was used for modeling, analysis and interpretation of the bacterial and hostassociated data. OPLS/O2PLS-DA (orthogonal PLS discriminant analysis) [33], a new supervised classification analysis tool, was used to quantitatively analyze the contribution of each species/ phylotype (X variables) to the class separation by creating 2 dummy Y variables, one for GCF samples and another for subgingival PP samples.
The study variables were imported into the SIMCA-P+12 software package (Umetrics Inc.) for the data analysis and visualization. The quantitative bacterial data were scored from 1 to 5. Data were imported as continuous variables for hostassociated variables (age, BMI, leukocyte count and glucose levels) and periodontal variables (means per subject of PlI, GI, BOP, PD and CAL, number of teeth, number of different bacterial species or groups per sample and weight of the GCF and PP samples) and as categorical variables (number of teeth with 3 severity categories for PD and CAL, gender, socioeconomic status [low, intermediate, high] and smoking habit [never, stopped, current]). After centering and scaling to unit variance the data were analyzed. To measure the predictive power of the model, cross validation procedure contained the following: parts of the data were kept out of model development, the kept-out parts were predicted by the model and predictions of the kept-out parts were compared with the obtained actual values. Several tools were used to visualize the results. Score plots provided correlation patterns among the patients and among the samples. Loading scatter plots revealed the impact of each X variable on the formation of the scores, since the loading scatter plots are complementary and superimposable to the respective score plots. PLS regression coefficients for the variables showed the extent each X variable contributed to Y. The precision was derived from confidence interval (95%) using jack knifing. Statistical significance prevails if the error bars in the figures do not cross the 0 line.

Data accession
The data discussed in this publication have been deposited in a MIAME compliant database, NCBI's Gene Expression Omnibus [34] and are accessible through GEO Series accession number GSE21929 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc =GSE21929).

Supporting Information
Table S1 The 133 bacterial species/phylotypes found in crevicular fluid and/or subgingival paper point samples and their respective number codes assigned to be used in Figure 2A. Found at: doi:10.1371/journal.pone.0013589.s001 (0.13 MB DOC) Figure 6. Bacterial species/phylotypes responsible for the difference in bacterial composition between GCF samples and PP samples. Multivariate OPLS/2OPLS-DA model was generated for 2 classes. The score plot (Panel A) demonstrates the relationship between the 2 types of samples and between the patients. Capital letters are patient identifiers, ''G'' (black color) for GCF samples and ''P'' (red color) for PP samples. The X variables included 133 species/phylotypes identified by DNA microarray. The binary Y variable defines the group membership to GCF or PP samples. Ellipse: Hotelling T2 (0.95). PLS regression analysis (Panel B) indicated the extent each variable positively or negatively contributed to GCF sample membership and vice versa to PP sample membership. The coefficients are statistically significant when the error bars do not cross the 0 line. The model explained 99% of the variation of Y and, according to cross validation, predicted 70% of the variation of Y. X variables were scaled and centered and Y variables scaled. The confidence intervals (95%) were derived from jack knifing. GCF: gingival crevicular fluid; PP: paper point. doi:10.1371/journal.pone.0013589.g006