Some previous studies have identified bacteria in semen as being a potential factor in male infertility. However, only few types of bacteria were taken into consideration while using PCR-based or culturing methods. Here we present an analysis approach using next-generation sequencing technology and bioinformatics analysis to investigate the associations between bacterial communities and semen quality. Ninety-six semen samples collected were examined for bacterial communities, measuring seven clinical criteria for semen quality (semen volume, sperm concentration, motility, Kruger's strict morphology, antisperm antibody (IgA), Atypical, and leukocytes). Computer-assisted semen analysis (CASA) was also performed. Results showed that the most abundant genera among all samples were Lactobacillus (19.9%), Pseudomonas (9.85%), Prevotella (8.51%) and Gardnerella (4.21%). The proportion of Lactobacillus and Gardnerella was significantly higher in the normal samples, while that of Prevotella was significantly higher in the low quality samples. Unsupervised clustering analysis demonstrated that the seminal bacterial communities were clustered into three main groups: Lactobacillus, Pseudomonas, and Prevotella predominant group. Remarkably, most normal samples (80.6%) were clustered in Lactobacillus predominant group. The analysis results showed seminal bacteria community types were highly associated with semen health. Lactobacillus might not only be a potential probiotic for semen quality maintenance, but also might be helpful in countering the negative influence of Prevotella and Pseudomonas. In this study, we investigated whole seminal bacterial communities and provided the most comprehensive analysis of the association between bacterial community and semen quality. The study significantly contributes to the current understanding of the etiology of male fertility.
Citation: Weng S-L, Chiu C-M, Lin F-M, Huang W-C, Liang C, Yang T, et al. (2014) Bacterial Communities in Semen from Men of Infertile Couples: Metagenomic Sequencing Reveals Relationships of Seminal Microbiota to Semen Quality. PLoS ONE 9(10): e110152. https://doi.org/10.1371/journal.pone.0110152
Editor: Zaid Abdo, Institution and Department: Agricultural Research Service, United States of America
Received: March 6, 2014; Accepted: August 21, 2014; Published: October 23, 2014
Copyright: © 2014 Weng et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The authors confirm that all data underlying the findings are fully available without restriction. The raw sequencing data are available at http://clinic.mbc.nctu.edu.tw/semen/.
Funding: Ministry of Science and Technology of the Republic of China financially supported this research under contract nos. NSC 101-2311-B-009-005-MY3, NSC 102-2627-B-009-001, NSC 103-2628-B-009 -001 -MY3 and NSC 103-2221-E-038-013-My2. This work was supported in part by the UST-UCSD International Center of Excellence in Advanced Bio-engineering sponsored by the Taiwan Ministry of Science and Technology I-RiCE Program under grant nos. NSC 102-2911-I-009-101, and Veterans General Hospitals and University System of Taiwan (VGHUST) Joint Research Program under grant no.VGHUST103-G5-1-2. This work was also partially supported by MOE ATU. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors and the affiliations to Health GeneTech Corporation, along with any other relevant declarations relating to employment, consultancy, patents, products in development or marketed product etc. have declared that no competing interests exist. This does not alter the authors' adherence to PLOS ONE policies on sharing data and materials.
Semen quality and male infertility
Infertility is an increasingly common condition, and the male factors (either alone or in combination with female factors) are now estimated playing a significant role in about 40%–50% of infertile couples. Despite contemporary therapies undoubtedly raising the likelihood of conception among couples suffering from male infertility, these solutions often overlook the absence of a defined etiological or pathophysiological diagnosis. Male infertility, unfortunately, is still considered “idiopathic” in a large proportion of cases –. Consequently, there is a fundamental need to carry out research directed to establish the causes (and potential means of prevention) of male infertility.
Acute and chronic infections of the genitourinary (GU) tract may induce male factor infertility. Infectious etiologies cause about 15% of male factor infertility cases . Ochsendorf  and Keck et al.  found that numerous infectious bacterial, viral, fungal, and protozoan species can enter the normal genital-urinary tract by route of sexual transmission, intracanicular spread of infected urine, or hematogenous seeding of genital organs. Infections of the testicle, epididymis, and prostate ,  can negatively affect spermatogenesis and fertility. There are multiple causes of elevated seminal leukocytes (ESL) including infectious etiologies like genital-urinary infection and non-infectious etiologies including exposure to environmental toxins, man-made products during intercourse, tobacco products, alcohol and certain medications . Other potent noninfection causes such as vasovasostomy, varicoceles, autoimmunity, defective spermatogenesis and poor sperm viability can lead to elevated seminal leukocytes , . Bacteriospermia and the recruitment of seminal leukocytes can potentially impair male fertility through the deterioration of spermatogenesis, impairment of sperm function, and genital tract dysfunction and/or obstruction.
Human microbiome and health
Microbiomes play an important role in human health, disease and some uncertain etiologies. Human skin, intestines, oral cavity, vagina and urethra can host microbial communities. The composition of microbiomes and their connection with various parts of the human body impact human health and the influence on the cause of disease –. The routine culturing methods and the polymerase chain reaction method are clinically useful for detecting specific aerobic, anaerobic or pathogenic bacteria in clinical specimens . Next-generation sequencing technology can be used to directly extract large-scale microbial DNA and RNA sequences from human mixed microbial communities, and can also be used to sequence microbiomes which cannot be cultured . Moreover, next-generation sequencing is more efficient than Sanger sequencing and less expensive . 16S ribosomal RNA analysis is the most commonly-used approach to investigate the cultured-independent microbiomes. Microbiome identification can be achieved by sequencing 16S ribosomal RNA based on next-generation sequencing and bioinformatics approaches.
Several bacteria identified in semen by previous studies have been associated with male infertility. Akutsu et al. performed PCR-based detection of 16S ribosomal RNA genes for six species and detected Lactobacillus inner and. Gardnerella vaginalis in 28% and 14% of semen samples, respectively . Domes et al. determined the incidence of bacteriospermia and found elevated seminal leukocytes (ESL) in a subfertile male population. The rate of bacteriospermia was 15% using a concurrent culture of 22 species in 7,852 samples, and four most common bacterial species identified in seminal fluid were E. fecalis (56%), E. coli (16%), GBS (13%) and S. aureus (5%) . Using a culture method for bacteria detection, Ibadin and Ibeh found 36 out of 87 semen samples of infertile men (41.4%) showed at least one pathogen. Staphylococcus aureus (16.9%), Staphylococcus saprophyticus (9.2%), Escherichia coli (6.9%), Proteus mirabilis (3.4%), Klebsiella spp (2.3%), Pseudomonas aerouginosa (1.1%) and Proteus vulgaris (2.3%) were identified as being associated with bacteriospermia and with the rate of total motility and morphologically abnormal sperms . Manca et al. performed microscopic analyses and cultures on 696 semen specimens and found Gardnerella vaginalis was the most frequently isolated bacterium, followed by Escherichia coli and Enterococcus sp. In addition, sperm concentration, motility and morphology were most likely to deteriorate in the presence of G. vaginalis and U. urealyticum . Moretti et al. performed semen culture and sperm transmission electron microscopy (TEM) analysis on 1,256 patients, with 417 samples (33.2%) showing the presence of bacterial species. The authors suggested that sperm bacterial contamination is quite frequent and could contribute to the deterioration of the sperm quality in infertile men . Kiessling et al. performed PCR amplification of bacterial rDNA on 34 semen samples, and identified gram-positive anaerobic cocci, Corynebacterium spp., Staphylococcus, Lactobacillus, Streptococcus spp., Pseudomonas spp., Haemophilus and Acinetobacter spp. as the largest groups in different specimens. The authors found the concentration and motility of rDNA-positive specimens were not statistically different from the rDNA-negative specimens. In addition, the authors thought that the abundance of bacteria in semen is not commensal, arise from infection in the male genitourinary tract, may influence fertility, and may reflect an inadequate cellular immune response . Hou et al. analyzed the microbiota of the seminal fluid from healthy and infertile men by using pyrosequencing V1–V2 regions of 16S rRNA gene . The results showed that the bacterial community could be clustered in to six group, and no significant differences were found between sperm donors and the infertility patients. However, multiple statistical tests showed the presence of Anaerococcus has negative association with sperm quality.
Bacterial identification methods performed in previous studies were either PCR-based or culture methods and, to date, a comprehensive understanding of bacterial communities in semen is still lacking. Most previous works only focused on a few types of bacteria and relied on qualitative analysis to discover associations between semen microbiomes and semen quality. We therefore present an analysis flow by combining next-generation sequencing technology and clinical semen quality examination (Fig. 1) to produce a high-resolution analysis of relationships between bacterial communities and semen quality.
As shown in Table 1, of the 96 men clinical examined for semen quality, 10 had abnormal semen volume; 13 had low semen concentration (<15×106); 12 had low motility (<40%); 44 had low Kruger's morphology (<5%); 10 had abnormal antisperm antibodies (IgA) (>30%); 8 had atypical (≧1%) and 18 were found to have leucocytes. Table S1 summarizes participant metadata, including CASA values. Thirty-six semen samples without any abnormal clinical values were defined as the normal samples.
A total of 8,337,766 sequence reads was obtained from the 96 samples with a median read length of 125 bp and a mean of 80,424 reads per study participant. Sequence reads were passed through our taxonomic mapping flow and classified to represent seminal bacteria. An average number of 135 genera and 569 species were detected in the samples, and Table S2 summarizes the taxonomic assignment of sequence reads. In Schloss et al's study, the microbiome communities should be compared using an equal number of sequences to minimize the sequence artifact generated by high-throughput sequencing . Therefore, we used the proportion instead of number of sequence reads to represent the taxonomic composition of each sample for further analysis. The distribution of relatively abundant genera is depicted in Fig. 2. The sequencing results showed that the most abundant species of bacteria in semen are Lactobacillus iners (14.09%), uncultured Prevotella sp. (3.06%), uncultured Gardnerella sp. (2.96%), Lactobacillus sp. (2.53%), uncultured Pseudomonas sp. (2.52%) and Prevotella bivia (2.22%) (Table S3). The most abundant genera of bacteria are Lactobacillus (19.9%), Pseudomonas (9.84%), Prevotella (8.51%), Gardnerella (4.21%), Rhodanobacter (2.74%), Streptococcus (2.74%), Finegoldia (2.73%) and Haemophilus (2.58%) (Table S4).
Unsupervised Clustering Analysis
A hierarchical clustering was performed, and the seminal bacterial community and clinical values of each sample are shown in Fig. 3. The results demonstrate that the bacterial communities in semen are clustered into three main groups, G1, G2 and G3 (Fig. 3C). Five out of 25 samples (20%) in Pseudomonas-predominant group (G1), 2 out of 16 samples (12.5%) in Prevotella-predominant group (G3), and 29 out of 55 samples (52.7%) in Lactobacillus-predominant group (G2) are normal samples, and both of G1 and G2 are statistically different from G2 (fisher p-value = 7.38E-03 and 4.48E-03). Figure 3 shows that the most abundant genera in G1, G2, and G3 were Pseudomonas (16.1%), Lactobacillus (32.3%), and Prevotella (26.3%), respectively. Remarkably, 29 out of 36 normal samples (80.6%) were clustered in the Lactobacillus-predominant group (G2). The results showed that seminal bacteria community types were highly associated with semen health. The genera diversity and richness analysis showed that the G1 group exhibited significantly higher levels of bacterial diversity than G2 and G3 group (Fig. 4A), and there were no significant difference of richness among three groups (Fig. 4B).
(A) Hierarchical clustering was used to generate a clustering tree depicting the bacterial diversity in 96 men with clinical values. The scale bar represents the sample distance generated by UniFrac. (B) Clinical value status is depicted in the seven horizontal bars. The red and green rectangles respectively represent abnormal and normal clinical values. B1 to B7 respectively represent the clinical value status of semen volume, sperm concentration, motility, Kruger's strict morphology, antisperm antibody (IgA), Atypical, and leucocytes. (C) G1, G2 and G3 represent the three main groups in the clustering results, which were respectively predominated by Pseudomonas, Lactobacillus and Prevotella. (D) The colored bars represent the taxonomic compositions in each sample. Less abundant taxa were grouped in the “Others” category.
To determine the functional basis of the microbiome community types (G1, G2 and G3), we investigated differences in their composition and correlation to abundance of co-occurring genera (Fig. 5). Semen microbiome community type G1 (with 25 samples) is enriched in Pseudomonas, which co-occurs, for example, with Enterobacter. Both Pseudomonas and Enterobacter are Gram-negative bacteria and several strains of these bacteria are pathogenic and cause opportunistic infections in immunocompromised hosts, most commonly in the urinary tract. Microbiome community type G2 is the most frequent, containing 55 samples, and is enriched in Lactobacillus, which is Gram-positive facultative anaerobic, and negatively correlated to Pseudomonas, Stenotrophomonas, Ochrobactrum and Janthinobacterium, which all are Gram-negative bacteria. Lactobacilli have been found to play an important role of restoring particular physiological balances such as in the vaginal microbiome –. Microbiome community type G3 contains 16 samples and is enriched in Prevotella which mainly co-occurs with Propionibacterium and Dietzia. Prevotella spp. is a type of oral and vaginal flora and has been isolated from abscesses and burns in the vicinity of the mouth, urinary tract infections, and bacteremia associated with upper respiratory tract infections.
Supervised comparison of seminal bacterial communities
To investigate the association between bacterial community and semen quality, 36 semen samples examined without any abnormal clinical values were defined as normal samples, and 33 samples with more than two abnormal clinical values were used as case samples which were low-quality and more likely to be associated with male infertility. Figure 6 shows that the most abundant genera of bacteria in the normal samples were Lactobacillus (24.7%), Pseudomonas (10.3%), Gardnerella (6.6%), Prevotella (5.4%), Rhodanobacter (2.9%) and Streptococcus (2.7%), and the most abundant genera of bacteria in the case samples were Lactobacillus (13.8%), Prevotella (11%), Pseudomonas (9.3%), Haemophilus (4.4%), Finegoldia (3.5%), Rhodanobacter (3.1%), Corynebacterium (2.8%) and Streptococcus (2.8%). The normal samples had a significantly higher proportion of Lactobacillus, Gardnerella, Propionibacterium and Atopobium, while the case samples had a significantly higher proportion of Prevotella and Aggregatibacter (Fig. 7A). Lactobacillus crispatus, Gardnerella vaginalis, Lactobacillus acidophilus had a higher proportion in the normal samples, while Prevotella bivia and Haemophilus parainfluenzae had a higher proportion in the case samples (Fig. 7B). Haemophilus parainfluenzae is as an opportunistic pathogen which causes systemic diseases including endocarditis, meningitis and bacteremia, and is often isolated from the sputa of patients with chronic obstructive lung disease. The genera diversity and richness analysis showed that there were no significant differences in diversity between the case and normal samples (Fig. 8A), and the bacterial richness of the case samples was significantly higher than that of normal samples (Fig. 8B).
Lactobacillus and Prevotella not only constituted the major proportion of seminal microbiota in the normal samples and case samples, but also had significantly different proportions between the case and normal samples. Therefore they may respectively have strong positive and negative impacts on semen quality. Pseudomonas was a common genus in both case samples (10%) and normal samples (9%), and some species were considered to be opportunistic human pathogens. The proportion of Pseudomonas was not significantly different between the case and normal samples, and therefore Pseudomonas seems to have no impact on semen quality. However, further analysis results (Table S5) showed that Pseudomonas can contribute to the deterioration of semen quality in samples containing less Lactobacillus (Fisher's exact p-value = 0.04), such as the samples in the microbiome type G2 (Pseudomonas-predominated) group. Lactobacillus might not only be a potential probiotic for maintaining semen quality, but also might protect against the negative influence of Prevotella, Haemophilus and Pseudomonas.
A principal coordinates analysis (PCoA) was performed to visualize seminal bacterial communities for the case and normal samples. As shown in Fig. 9, most normal samples (red circle) were located in the top left area, and most case samples (blue rectangle) were located on the bottom and right side. The nodes in the PCoA graph could be separated into three clusters according to the distribution of the red circles and blue rectangles. Interestingly, these three clusters also correspond to the G1, G2 and G3 groups as previously defined. Table S6 shows that the G1 and G3 groups respectively have a 5.2-fold and 8.5-fold greater likelihood of low semen quality than samples in the G2 group, which indicates that semen quality is highly associated with seminal bacterial communities.
The blue rectangles and red circles respectively represent the case and normal samples.
Tables S7∼S10 respectively list the genera and species of bacteria significantly abundant in the samples with normal or abnormal clinical values. The genera and species which were significantly associated more than two clinical criteria are respectively summarized in Tables 2 and 3. The presence of Lactobacillus and Gardnerella respectively exert a positive association with five and six clinical criteria, while the presence of Prevotella and Bordetella respectively exert a negative association with three and four clinical criteria. In addition, only one bacteria genus (Shewanella) and one species (Fusobacterium periodonticum) were found to be significantly associated with semen volume (Table S8 and Table S10), which indicates that semen volume may be mainly caused by other etiological factors.
The genera and species of bacteria associated with CASA criteria are respectively summarized in Table S11 and Table S12. Only morphometric criteria of CASA, head elongation and area were found to be associated with bacteria. Noticeably, Lactobacillus crispatus was not only associated with sperm elongation, but was also associated with Kruger's strict morphology (Table S9), which indicates that it may have a significant influence on semen morphology.
Potential markers for classification of semen microbiome communities
Analysis results demonstrate that microbiome communities are significantly associated with semen quality (Table S6). PCR-based detection can be used to determine the critical bacteria for classification of particular semen microbiome communities in practical clinical investigations, and such results might be helpful for indicating possible causes of male infertility. Here, five abundant genera of bacteria, Lactobacillus, Gardnerella, Prevotella, Pseudomonas, and Haemophilus, were collected to classify 96 samples into three different microbiome community types using the machine learning method J48 in Weka 3.6.7 . The proportion of each bacteria and the relative ratio between the different bacteria types were taken into consideration as features for rule-based clustering. Five-fold cross-validation was used to evaluate the performance of the classification model. As shown in Table S13, the classifier performed well in all three groups, and the ROC area of G1, G2 and G3 achieved 0.929, 0.953 and 0.947, respectively. As shown in Fig. 10, the classification rules are simple and are described as follows: (1) If the ratio of Lactobacillus/(Prevotella+Pseudomonas+Haemophilus) of a sample is greater than 0.57, the sample is classified to G2. (2) For a sample with a ratio of Lactobacillus/(Prevotella+Pseudomonas+Haemophilus) below 0.57, it is classified to G3 if the ratio of Prevotella/Pseudomonas is greater than 1.37; otherwise, it is classified to G1. The results show that Lactobacillus, Prevotella, Pseudomonas, and Haemophilus could potentially be markers for future clinical applications and investigations of male infertility.
This study reveals whole bacterial communities in semen and has potentially high clinical value of the high-throughput sequencing of bacterial 16S rRNA in semen specimens to help monitor semen quality. Three seminal microbiome community types were identified: G1 (Pseudomonas-predominant group), G2 (Lactobacillus-predominant group) and G3 (Prevotella-predominant group) (Fig. 3). Twenty-nine out of 36 normal samples (80.6%) were clustered in the G2 group, which accounts for more than half of the samples in G2 (52.7%). Only 5 out of 25 samples (20%) in G1 group and 2 out of 16 samples (12.5%) in G3 group were normal samples. In addition, comparative analysis shows that samples in the G1 and G3 groups respectively have a 5.2-fold and 8.5-fold greater chance than the G2 group of containing more than two abnormal clinical values. These results indicate that semen quality is highly associated with seminal bacterial communities.
As compared to Hou et al.'s study , the different results were discovered in our study. It might be caused by use of different samples, clinical examinations, sperm quality groups and clustering methods between two studies. However, most abundant genera present in Hou et al's study were also identified abundant in our data, such as Lactobacillus (19.9%), Prevotella (8.51%), Finegoldia (2.73%), Corynebacterium (2.04%), Staphylococcus (1.46%) and Veillonella (1.04%). In addition, some of our relative abundant bacteria had been identified in previous studies which used PCR-based or culture methods to detect bacteria in semen or the male genital tract, such as Lactobacillus , , , Pseudomonas , , Gardnerella , , , , Prevotella ,  and Haemophilus . It shows the comparability and reliability of our sequencing data.
As shown in Fig. 7A, genera of Lactobacillus, Gardnerella, Propionibacterium and Atopobium were relatively abundant and significantly present in the normal samples, and Prevotella and Aggregatibacter were relative abundant and significantly present in the case samples. Lactobacillus is a genus of Gram-positive anaerobic bacteria , and is a major part of the lactic acid bacteria group. Lactobacillus was detected in 95 of the 96 samples (98.96%), with proportions ranging from 0% to 61.22% with an average of 19.90%. In humans, Lactobacilli are present in the vagina  and the gastrointestinal tract. Lactobacilli can also be used to restore physiological balance such as in the vaginal eco-system –. In previous studies, Lactobacillus had been reported as a normal bacteria in semen , , and is found to have a positive impact on clinical criteria for leucocytes, Kruger's strict morphology, sperm concentration, and Atypical. Gardnerella is a genus of Gram-variable-staining anaerobic bacteria ,  of which Gardnerella vaginalis is the only species. Gardnerella was detected in 93 of the 96 samples (96.88%) in proportions ranging from 0% to 28.82% with an average of 4.21%. G. vaginalis is the predominant cause of bacterial vaginosis in women by disrupting normal vaginal microflora . Propionibacterium is a genus of Gram-positive bacteria. Its members are primarily facultative parasites and commensals of humans. Atopobium is a genus of anaerobic Gram-positive bacteria. In Akutsu et al's study, 16S ribosomal RNA genes of Lactobacillus crispatus and Atopobium vaginae were detected in vaginal fluid and female urine samples . Prevotella is a genus of Gram-negative anaerobic bacteria , and it had been reported to be involved in bacterial vaginosis , . Aggregatibacter is a genus of Gram-negative bacteria. Pseudomonas is a genus of Gram-negative aerobic bacteria . Pseudomonas has been known to cause bloodstream infections, and even to be associated with bacteraemia , . Pseudomonas aeruginosa is the type species of the genus , and it is increasingly recognized as an opportunistic pathogen of clinical relevance. Pseudomonas aeruginosa is one type of bacteria detected in the semen of infertile men . Pseudomonas putida is a bacterium commonly found in genitourinary tract infections –.
As shown in Fig. 6, Pseudomonas was a common genus in both case samples (10%) and normal samples (9%). The proportion of Pseudomonas dose not significantly differ between the case and normal samples, and thus seemingly has no association with semen quality. As seen in Table S5, Pseudomonas was found to contribute to the deterioration of semen quality in samples containing less Lactobacillus. According to the bacteria discovered in Fig. 7A and the above discussion, it indicated that Gram-positive bacteria, such as Lactobacillus, Propionibacterium and Atopobium, seem to be involved in not only facilitating semen quality maintenance, but also might protect against the negative influence of Gram-negative bacteria, such as Prevotella, Aggregatibacter and Pseudomonas. In our opinion, Lactobacillus supplements could help maintain semen quality and increase fertility potential, and future studies are recommended to validate this assumption.
Previous studies have used PCR-based or culture methods to detect Gardnerella vaginalis in semen or the male genital tract , , , . G. vaginalis is considered to be a pathogen in the lower male genital tract , , . Manca et al. found a positive correlation between the presence of G. vaginalis infections and leukocytes, while sperm concentration, motility and morphology were found to deteriorate in the presence of G. vaginalis . Andrade-Rocha concluded that the presence of G. vaginalis is not associated with either abnormal sperm characteristics or inflammatory response in infected men . However, in this study, we found a positive correlation between the proportion of G. vaginalis genera and semen quality (see Table 3), including sperm concentration, motility, Kruger's strict morphology, antisperm-antibody (IgA), Atypical and Leucocytes – findings which differ from previous studies. In our data, G. vaginalis was relatively abundant in the G2 group which is predominated by Lactobacillus. We hypothesize that G. vaginalis might act as an opportunistic microorganism which can disturb the bacterial ecosystem to shift from commensal to pathogenic bacterium in low quality semen. Therefore, the positive association between G. vaginalis and semen quality might be the accompanied effect provided by Lactobacillus. Nevertheless, the role of G. vaginalis remains unclear since it is found in both normal and abnormal fertile individuals .
The presence of Lactobacillus crispatus was found to have a positive association with quality of sperm concentration, leucocytes, IgA, and Kruger's strict morphology (Table S9). It was also found to be related to sperm elongation in CASA criteria (Table S12). Lactobacillus crispatus is a normal vaginal flora  but its significant to semen is unknown . In healthy women, Lactobacillus crispatus or Lactobacillus iners are the dominant vaginal bacterial communities . Also, in co-culture conditions, Lactobacillus crispatus reduced the viability of Gardnerella vaginalis and Prevotella bivia , and Lactobacillus crispatus was found to restore normal microbial communities in the vaginal ecosystem by displacing vaginal pathogens . Our analysis results indicate that Lactobacillus crispatus has the potentialin maintaining the semen ecosystem and semen quality, as the role in the vaginal ecosystem.
Escherichia coli was found to be associated with reduced semen density and diminished progressive motility , and could cause a decrease in in vitro viability  and motility –. In De Francesco et al's study, E. coli was the second most frequently isolated bacterium in 696 infertile men, and its concentration differed significantly between infertile and fertile males . However, it was not detected using the PCR-based method by either Jarvi  or Kiessling , and was detected in only 3 of 17 specimens by Tanner . Eyre et al. suggested that, although E. coli may be present in semen, the number of organisms are too few to compete in PCR reactions, but that they are more readily cultured than other organisms such as gram-positive anaerobic cocci (GPAC). In this study, we clearly identified the proportion of E. Coli in each sample using high-throughput sequencing technology. Escherichia coli were detected in 94 of the 96 samples (97.92%) in proportions ranging from 0% to 16.42%, with an average of 1.26%. Analysis results showed that E. Coli were not associated with semen quality (p>0.05 for the 7 detected clinical criteria, data not shown).
Little is known about the composition of the microflora in normal seminal fluid or the male genital tract , . As far as we know presently, the microflora in semen of healthy men was characterized by Gram-positive bacteria, e.g. lactobacilli, coagulase-negative staphylococci, streptococci and corynebacteria, ,  by culture detection.
Recently, the composition of microflora in semen of healthy men detecting by next generation sequencing has been reported in the work of Hou et al. . Corynebacterium (14%) was the most abundant genus in healthy men, followed by Ralstonia (13%), Lactobacillus (9%), Streptococcus (7%), Finegoldia (7%), and Anaerococcus (5%). These genera were also isolated in infertility patients and did not show significant differences between healthy men and patients.
In this study, microbiome type G2 (Lactobacillus predominated) was found to be associated with healthy semen, while G1 (Pseudomonas predominated) and G3 (Prevotella predominated) were found to be associated with low quality semen. However, the G2 group accounts for 23.6% of the case samples, which indicates that some instances of low quality semen may be caused by genetic or other etiological factors. In addition, the G1 and G3 groups respectively contained 20% and 12.5% normal samples, which largely corresponds to the results of previous studies , i.e., the bacteria contamination of semen samples of fertile individuals did not compromise sperm quality, but it is possible that the presence of bacteria further deteriorated the overall quality of seminal plasma in infertile men.
Finally, a practical criterion was developed to classify semen samples into three different microbiome community types by using four genera of bacteria: Lactobacillus, Prevotella, Pseudomonas, and Haemophilus. The fertility status and the fertility outcome of our samples are not available currently. A case-control study will be conducted in the near future to clarify this issue and validate our classification model. Although the mechanism by which pathogenic bacteria cause infertility is still unknown, these four genera could serve as potential markers for future clinical applications and investigations of male infertility.
Materials and Methods
The study was approved by the institutional review board of Mackay Memorial Hospital (Hsinchu, Taiwan). All patients provided written informed consent.
Ninety-six semen samples were collected from 96 patients at the reproductive center at Mackay Memorial Hospital (Hsinchu, Taiwan). The patients were couples suffering from either male infertility or female infertility, or infertility of unknown etiologies. The patients had been suffering from primary infertility for at least one year, and did not suffer from other significant health issues. Semen was collected by masturbation into a sterile bottle following 3–5 days of sexual abstinence. All semen samples were collected using the Copan ESwab collection Kit 480C into Eswab tubes for transport, and stored at 4°C for DNA extraction and PCR amplification of bacteria 16S rDNA.
Measurement of semen quality
All semen samples were allowed to liquefy for 30 minutes at 37°C, followed by assessment of sperm parameters according to World Health Organization (WHO) guidelines . Leukocytes were identified by peroxidase stain. Leukocyte concentrations below 1×106 cells/ml are considered normal under the WHO guidelines.
Sperm concentration and motion parameter analysis were assessed using the HTM-IVOS semen analyzer (Hamilton Thorne Research, Beverly, MA, USA) and manually monitored. Sperm motion was examined after mixing the sperm suspension and loading a 5 µl aliquot into a chamber of the 20-µl M-4 chamber MicroCell slide (Conception Technologies, La Jolla, CA, USA). The slide was then transferred to the HTM-IVOS semen analyzer where it was maintained at 37°C for 2 minutes before data assessment, which was conducted on randomly selected fields. Within each sample, computer-assisted semen analysis (CASA) was performed to measure 9 aspects of semen quality, including average path velocity (VAP), straight-line (progressive) velocity (VSL), curvilinear velocity (track speed) (VCL), lateral amplitude (ALH), beat frequency (BCF), straightness (STR), linearity (LIN), head elongation, and area.
Sperm antibody tests, including IgA classes, were conducted using SpermMar Test (Fertipro N.V., Belgium) by two well-experienced technicians. Sperm morphology was assessed using Kruger's strict criteria  after slide staining with Diff-Quik (Diagnostics AG, Medion, Switzerland) by two other well-experienced technicians to classify head defects, neck and midpiece defects and tail defects of the spermatozoa. Very small spermatozoa heads were identified as an atypical morphology indicating immaturity issues.
DNA extraction was performed directly on the samples using a QIAamp DNA Blood Mini Kit (Qiagen). Each sample was transferred to a 1.5 ml microcentrifuge tube, and centrifuged at 13,000 rpm for 2 min to pellet the bacteria. Bacterial pellets were suspended in 180 µl of the appropriate enzyme solution, and incubated for at least 30 min at 37°C. In addition, 20 µl proteinase K and 200 µl Buffer AL was added to the sample, and mixed by vortexing. Each suspension was incubated at 56°C for 30 min and then for a further 15 min at 95°C. The 1.5 ml microcentrifuge tube was briefly centrifuged to spin down the suspension. From this point, the extraction proceeded following the protocol of the QIAamp DNA Blood Mini Kit. The DNA was eluted with 30 µl Buffer AE, and centrifuged at 8,000 rpm for 1 min. The DNA extract was stored at −20°C for further analysis.
Two PCR primers, F515 (5′-GTGCCAGCMGCCGCGGTAA-3′) and R806 (5′-GGACTACHVGGGTWTCTAAT-3′), were designed to target the V4 domain of bacterial 16S rRNA as described in a previous study . The PCR amplification was performed in a 50 µl reaction volume containing 25 µl 2X Phusion Flash Master Mix (ThermoFisher), 0.5 µM of each forward and reverse primer, and 50 ng DNA template. The reaction conditions consisted of an initial 98°C for 30 sec followed by 30 cycles of 98°C for 10 sec, 54°C for 30 sec, 72°C for 30 sec, and a final extension of 72°C for 5 min. Amplified products were checked by 2% agarose gel electrophoresis and ethidium bromide staining. Amplicons were purified using the AMPure XP PCR Purification Kit (Agencourt), and quantified using a Qubit dsDNA HS Assay Kit (Qubit) on a Qubit 2.0 Fluorometer (Qubit) all according to the respective manufacturers' instructions. For V4 library preparation, Illumina adapters were attached to the amplicons using the TruSeq DNA Sample Preparation v2 Kit (Illumina). Purified libraries were applied for cluster generation and sequencing on the Miseq system.
16s rRNA sequence data quality filtering and taxonomy mapping
Paired-end sequences were obtained using an illumina sequencing machine in FASTQ format and sequence quality was assessed using the FASTX-Toolkit. The raw reads were pre-processed to classify samples by barcodes, trim barcodes and truncate low quality bases and reads. The quality of reads exceeding a Phred quality score 30 (Q30) were retained. Sequences consisting of fewer than 100 nucleotides were discarded along with any reads containing ambiguous characters. A fast and memory-efficient aligning sequencing reading tool  was adopted to map the paired-end reads to bacterial 16s ribosomal RNA (rDNA) sequences obtained from NCBI 16S ribosomal RNA sequence database and NCBI nucleotide collection database. The reads were mapped to specific bacteria if the sequence similarity exceeded 97% and paired-end reads were aligned to the same reference sequence. The raw sequencing data are available at http://clinic.mbc.nctu.edu.tw/semen/.
Bacterial community analysis
An unsupervised clustering method was applied to observe relationships between the samples and to explore taxonomic associations with semen quality. The Greengenes core set tree was used to represent the distance between bacteria , , and the phylogeny-based weighted UniFrac  was applied to calculate the difference in overall microbial community composition. The pseudo F statistic developed by Calinski and Harabasz  was applied to generate the optimum number of clusters. A dendrogram was constructed using MEGA 4.0 . Statistics for true diversity through the Shannon index and Chao richness were calculated separately for each sample using R language. Correlations of co-occurring network were computed on the basis of pairwise Spearman correlation analysis. The correlations with an absolute Spearman correlation above 0.4 were transformed to links between two genera in the genus network. The Mann–Whitney U test was used to investigate significant genera or species between case and normal samples. The bacteria with proportion more than 0.25% were collected for multiple testing with false discovery rate less than 0.05 by using adaptive Benjamini-Hochberg method . UniFrac distance matrices which represent the characteristics between microbial communities of samples dependent on phylogenetic information was transformed into principal coordinated using Principal coordinates analysis (PCoA)  to provide a visualization of the sample distribution patterns.
Taxonomic assignment of sequence reads.
Identified species of bacteria in semen.
Identified genera of bacteria in semen.
The association between Pseudomonas and semen quality in samples with and without relatively abundant Lactobacillus.
Fisher's exact test and odds ratio of low semen quality rates comparing G1 and G3 to G2.
Genera of bacteria significantly abundant in samples with normal clinical value.
Genera of bacteria significantly abundant in samples with abnormal clinical value.
Species of bacteria significantly abundant in samples with normal clinical value.
Species of bacteria significantly abundant in samples with abnormal clinical value.
Genera of bacteria significantly associated with CASA criteria.
Species of bacteria significantly associated with CASA criteria.
Performance classification of microbiome community types using four bacteria, Lactobacillus, Prevotella, Pseudomonas and Haemophilus.
Conceived and designed the experiments: SLW THC HDH. Performed the experiments: TY TLY YAC. Analyzed the data: CMC FML WCH CL CYL WYW THC. Contributed reagents/materials/analysis tools: SLW. Wrote the paper: SLW CMC WCH TLY THC HDH.
- 1. Kamischke A, Nieschlag E (1999) Analysis of medical treatment of male infertility. Hum Reprod 14 Suppl 11–23.
- 2. Carr BR (2013) Optimal diagnosis and medical treatment of male infertility. Semin Reprod Med 31: 231–232.
- 3. Rittenberg V, El-Toukhy T (2010) Medical treatment of male infertility. Hum Fertil (Camb) 13: 208–216.
- 4. Diemer T, Huwe P, Ludwig M, Hauck EW, Weidner W (2003) Urogenital infection and sperm motility. Andrologia 35: 283–287.
- 5. Ochsendorf FR (2008) Sexually transmitted infections: impact on male fertility. Andrologia 40: 72–75.
- 6. Keck C, Gerber-Schafer C, Clad A, Wilhelm C, Breckwoldt M (1998) Seminal tract infections: impact on male fertility and treatment options. Hum Reprod Update 4: 891–903.
- 7. Henkel R, Ludwig M, Schuppe HC, Diemer T, Schill WB, et al. (2006) Chronic pelvic pain syndrome/chronic prostatitis affect the acrosome reaction in human spermatozoa. World J Urol 24: 39–44.
- 8. Domes T, Lo KC, Grober ED, Mullen JB, Mazzulli T, et al. (2012) The incidence and effect of bacteriospermia and elevated seminal leukocytes on semen parameters. Fertil Steril 97: 1050–1055.
- 9. Close CE, Roberts PL, Berger RE (1990) Cigarettes, alcohol and marijuana are related to pyospermia in infertile men. J Urol 144: 900–903.
- 10. Barratt CL, Bolton AE, Cooke ID (1990) Functional significance of white blood cells in the male and female reproductive tract. Hum Reprod 5: 639–648.
- 11. Jarvi K, Noss MB (1994) Pyospermia and male infertility. Can J Urol 1: 25–30.
- 12. Stumpf RM, Wilson BA, Rivera A, Yildirim S, Yeoman CJ, et al. (2013) The primate vaginal microbiome: comparative context and implications for human health and disease. Am J Phys Anthropol 152 Suppl 57119–134.
- 13. Madupu R, Szpakowski S, Nelson KE (2013) Microbiome in human health and disease. Sci Prog 96: 153–170.
- 14. Cox MJ, Cookson WO, Moffatt MF (2013) Sequencing the human microbiome in health and disease. Hum Mol Genet 22: R88–94.
- 15. Jordan JA, Durso MB (2005) Real-time polymerase chain reaction for detecting bacterial DNA directly from blood of neonates being evaluated for sepsis. J Mol Diagn 7: 575–581.
- 16. Goldenberger D, Kunzli A, Vogt P, Zbinden R, Altwegg M (1997) Molecular diagnosis of bacterial endocarditis by broad-range PCR amplification and direct sequencing. J Clin Microbiol 35: 2733–2739.
- 17. Metzker ML (2010) Sequencing technologies - the next generation. Nat Rev Genet 11: 31–46.
- 18. Akutsu T, Motani H, Watanabe K, Iwase H, Sakurada K (2012) Detection of bacterial 16S ribosomal RNA genes for forensic identification of vaginal fluid. Leg Med (Tokyo) 14: 160–162.
Lbadin OKaI, I N. (2008) Bacteriospermia and sperm quality in infertile male patient at University of Benin Teaching Hospital, Benin City, Nigeria Malaysian Journal of Microbiology Vol 4(2) : pp. 65– 67.
- 20. De Francesco MA, Negrini R, Ravizzola G, Galli P, Manca N (2011) Bacterial species present in the lower male genital tract: a five-year retrospective study. Eur J Contracept Reprod Health Care 16: 47–53.
- 21. Moretti E, Capitani S, Figura N, Pammolli A, Federico MG, et al. (2009) The presence of bacteria species in semen and sperm quality. J Assist Reprod Genet 26: 47–56.
- 22. Kiessling AA, Desmarais BM, Yin HZ, Loverde J, Eyre RC (2008) Detection and identification of bacterial DNA in semen. Fertil Steril 90: 1744–1756.
- 23. Hou D, Zhou X, Zhong X, Settles ML, Herring J, et al. (2013) Microbiota of the seminal fluid from healthy and infertile men. Fertil Steril 100: 1261–1269.
- 24. Schloss PD, Gevers D, Westcott SL (2011) Reducing the effects of PCR amplification and sequencing artifacts on 16S rRNA-based studies. PLoS One 6: e27310.
- 25. Reid G, Dols J, Miller W (2009) Targeting the vaginal microbiota with probiotics as a means to counteract infections. Curr Opin Clin Nutr Metab Care 12: 583–587.
- 26. Osset J, Bartolome RM, Garcia E, Andreu A (2001) Assessment of the capacity of Lactobacillus to inhibit the growth of uropathogens and block their adhesion to vaginal epithelial cells. J Infect Dis 183: 485–491.
- 27. Pascual LM, Daniele MB, Ruiz F, Giordano W, Pajaro C, et al. (2008) Lactobacillus rhamnosus L60, a potential probiotic isolated from the human vagina. J Gen Appl Microbiol 54: 141–148.
- 28. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, et al. (2009) The WEKA Data Mining Software: An Update. SIGKDD Explorations 11.
- 29. Lacroix JM, Jarvi K, Batra SD, Heritz DM, Mittelman MW (1996) PCR-based technique for the detection of bacteria in semen and urine. Journal of Microbiological Methods 26: 61–71.
- 30. Andrade-Rocha FT (2009) Colonization of Gardnerella vaginalis in semen of infertile men: prevalence, influence on sperm characteristics, relationship with leukocyte concentration and clinical significance. Gynecol Obstet Invest 68: 134–136.
- 31. Smith SM, Ogbara T, Eng RH (1992) Involvement of Gardnerella vaginalis in urinary tract infections in men. J Clin Microbiol 30: 1575–1577.
- 32. Marchandin H, Teyssier C, Jumas-Bilak E, Robert M, Artigues AC, et al. (2005) Molecular identification of the first human isolate belonging to the Veillonella ratti-Veillonella criceti group based on 16S rDNA and dnaK gene sequencing. Res Microbiol 156: 603–607.
- 33. Makarova K, Slesarev A, Wolf Y, Sorokin A, Mirkin B, et al. (2006) Comparative genomics of the lactic acid bacteria. Proc Natl Acad Sci U S A 103: 15611–15616.
- 34. Dicks LM, Silvester M, Lawson PA, Collins MD (2000) Lactobacillus fornicalis sp. nov., isolated from the posterior fornix of the human vagina. Int J Syst Evol Microbiol 50 Pt 3: 1253–1258.
- 35. Bukharin OV, Kuz'min MD, Ivanov Iu B (2000) [The role of the microbial factor in the pathogenesis of male infertility]. Zh Mikrobiol Epidemiol Immunobiol 106–110.
- 36. Ivanov IB, Kuzmin MD, Gritsenko VA (2009) Microflora of the seminal fluid of healthy men and men suffering from chronic prostatitis syndrome. Int J Androl 32: 462–467.
- 37. Catlin BW (1992) Gardnerella vaginalis: characteristics, clinical considerations, and controversies. Clinical Microbiology Reviews 5: 213–237.
- 38. Harper JJ, Davis GHG (1982) Cell-Wall Analysis of Gardnerella-Vaginalis (Hemophilus-Vaginalis). International Journal of Systematic Bacteriology 32: 48–50.
- 39. Pleckaityte M, Zilnyte M, Zvirbliene A (2012) Insights into the CRISPR/Cas system of Gardnerella vaginalis. BMC Microbiol 12: 301.
- 40. Shah HN, Collins DM (1990) Prevotella, a new genus to include Bacteroides melaninogenicus and related species formerly classified in the genus Bacteroides. Int J Syst Bacteriol 40: 205–208.
- 41. Gonzalez Pedraza Aviles A, Ortiz Zaragoza MC, Irigoyen Coria A (1999) Bacterial vaginosis a "broad overview". Rev Latinoam Microbiol 41: 25–34.
- 42. Donders G (2010) Diagnosis and management of bacterial vaginosis and other types of abnormal vaginal bacterial flora: a review. Obstet Gynecol Surv 65: 462–473.
- 43. Euzeby JP (1997) List of Bacterial Names with Standing in Nomenclature: a folder available on the Internet. Int J Syst Bacteriol 47: 590–592.
- 44. Hattemer A, Hauser A, Diaz M, Scheetz M, Shah N, et al. (2013) Bacterial and clinical characteristics of health care- and community-acquired bloodstream infections due to Pseudomonas aeruginosa. Antimicrob Agents Chemother 57: 3969–3975.
- 45. Liu Y, Liu K, Yu X, Li B, Cao B (2014) Identification and control of a Pseudomonas spp (P. fulva and P. putida) bloodstream infection outbreak in a teaching hospital in Beijing, China. Int J Infect Dis 23: 105–108.
- 46. Anzai Y, Kim H, Park JY, Wakabayashi H, Oyaizu H (2000) Phylogenetic affiliation of the pseudomonads based on 16S rRNA sequence. Int J Syst Evol Microbiol 50 Pt 4: 1563–1589.
- 47. Isaiah IN, Nche BT, Nwagu IG, Nnanna II (2011) Current studies on bacterospermia the leading cause of male infertility: a protege and potential threat towards mans extinction. N Am J Med Sci 3: 562–564.
- 48. Yang CH, Young T, Peng MY, Weng MC (1996) Clinical spectrum of Pseudomonas putida infection. J Formos Med Assoc 95: 754–761.
- 49. Taneja N, Meharwal SK, Sharma SK, Sharma M (2004) Significance and characterisation of pseudomonads from urinary tract specimens. J Commun Dis 36: 27–34.
- 50. Saha R, Jain S, Kaur IR (2010) Metallo beta-lactamase producing pseudomonas species—a major cause of concern among hospital associated urinary tract infection. J Indian Med Assoc 108: 344–348.
- 51. Elsner P, Hartmann AA, Wecker I (1988) Gardnerella vaginalis is associated with other sexually transmittable microorganisms in the male urethra. Zentralbl Bakteriol Mikrobiol Hyg A 269: 56–63.
- 52. Virecoulon F, Wallet F, Fruchart-Flamenbaum A, Rigot JM, Peers MC, et al. (2005) Bacterial flora of the low male genital tract in patients consulting for infertility. Andrologia 37: 160–165.
- 53. Fraczek M, Szumala-Kakol A, Jedrzejczak P, Kamieniczna M, Kurpisz M (2007) Bacteria trigger oxygen radical release and sperm lipid peroxidation in in vitro model of semen inflammation. Fertil Steril 88: 1076–1085.
- 54. Srinivasan S, Fredricks DN (2008) The human vaginal bacterial biota and bacterial vaginosis. Interdiscip Perspect Infect Dis 2008: 750479.
- 55. Srinivasan S, Hoffman NG, Morgan MT, Matsen FA, Fiedler TL, et al. (2012) Bacterial communities in women with bacterial vaginosis: high resolution phylogenetic analyses reveal relationships of microbiota to clinical criteria. PLoS One 7: e37818.
- 56. Atassi F, Brassart D, Grob P, Graf F, Servin AL (2006) Lactobacillus strains isolated from the vaginal microbiota of healthy women inhibit Prevotella bivia and Gardnerella vaginalis in coculture and cell culture. FEMS Immunol Med Microbiol 48: 424–432.
- 57. Kaewsrichan J, Peeyananjarassri K, Kongprasertkit J (2006) Selection and identification of anaerobic lactobacilli producing inhibitory compounds against vaginal pathogens. FEMS Immunol Med Microbiol 48: 75–83.
- 58. Sanocka-Maciejewska D, Ciupinska M, Kurpisz M (2005) Bacterial infection and semen quality. J Reprod Immunol 67: 51–56.
- 59. Schulz M, Sanchez R, Soto L, Risopatron J, Villegas J (2010) Effect of Escherichia coli and its soluble factors on mitochondrial membrane potential, phosphatidylserine translocation, viability, and motility of human spermatozoa. Fertil Steril 94: 619–623.
- 60. Diemer T, Huwe P, Ludwig M, Schroeder-Printzen I, Michelmann HW, et al. (2003) Influence of autogenous leucocytes and Escherichia coli on sperm motility parameters in vitro. Andrologia 35: 100–105.
- 61. Berktas M, Aydin S, Yilmaz Y, Cecen K, Bozkurt H (2008) Sperm motility changes after coincubation with various uropathogenic microorganisms: an in vitro experimental study. Int Urol Nephrol 40: 383–389.
- 62. Jarvi K, Lacroix JM, Jain A, Dumitru I, Heritz D, et al. (1996) Polymerase chain reaction-based detection of bacteria in semen. Fertil Steril 66: 463–467.
- 63. Tanner MA, Shoskes D, Shahed A, Pace NR (1999) Prevalence of corynebacterial 16S rRNA sequences in patients with bacterial and "nonbacterial" prostatitis. J Clin Microbiol 37: 1863–1870.
- 64. Mandar R (2013) Microbiota of male genital tract: impact on the health of man and his partner. Pharmacol Res 69: 32–41.
- 65. Cao XW, Lin K, Li CY, Yuan CW (2011) [A review of WHO Laboratory Manual for the Examination and Processing of Human Semen (5th edition)]. Zhonghua Nan Ke Xue 17: 1059–1063.
- 66. Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Lozupone CA, et al. (2011) Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc Natl Acad Sci U S A 108 Suppl 14516–4522.
- 67. Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9: 357–359.
- 68. DeSantis TZ, Dubosarskiy I, Murray SR, Andersen GL (2003) Comprehensive aligned sequence construction for automated design of effective probes (CASCADE-P) using 16S rDNA. Bioinformatics 19: 1461–1468.
- 69. DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, et al. (2006) Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol 72: 5069–5072.
- 70. Lozupone C, Lladser ME, Knights D, Stombaugh J, Knight R (2011) UniFrac: an effective distance metric for microbial community comparison. ISME J 5: 169–172.
- 71. Kozak M (2012) "A Dendrite Method for Cluster Analysis" by Calinski and Harabasz: A Classical Work that is Far Too Often Incorrectly Cited. Communications in Statistics-Theory and Methods 41: 2279–2280.
- 72. Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 24: 1596–1599.
- 73. Benjamini Y, Hochberg Y (2000) On the adaptive control of the false discovery fate in multiple testing with independent statistics. Journal of Educational and Behavioral Statistics 25: 60–83.
- 74. Hamady M, Lozupone C, Knight R (2010) Fast UniFrac: facilitating high-throughput phylogenetic analyses of microbial communities including analysis of pyrosequencing and PhyloChip data. ISME J 4: 17–27.