Bite mark injuries often feature in violent crimes. Conventional morphometric methods for the forensic analysis of bite marks involve elements of subjective interpretation that threaten the credibility of this field. Human DNA recovered from bite marks has the highest evidentiary value, however recovery can be compromised by salivary components. This study assessed the feasibility of matching bacterial DNA sequences amplified from experimental bite marks to those obtained from the teeth responsible, with the aim of evaluating the capability of three genomic regions of streptococcal DNA to discriminate between participant samples. Bite mark and teeth swabs were collected from 16 participants. Bacterial DNA was extracted to provide the template for PCR primers specific for streptococcal 16S ribosomal RNA (16S rRNA) gene, 16S–23S intergenic spacer (ITS) and RNA polymerase beta subunit (rpoB). High throughput sequencing (GS FLX 454), followed by stringent quality filtering, generated reads from bite marks for comparison to those generated from teeth samples. For all three regions, the greatest overlaps of identical reads were between bite mark samples and the corresponding teeth samples. The average proportions of reads identical between bite mark and corresponding teeth samples were 0.31, 0.41 and 0.31, and for non-corresponding samples were 0.11, 0.20 and 0.016, for 16S rRNA, ITS and rpoB, respectively. The probabilities of correctly distinguishing matching and non-matching teeth samples were 0.92 for ITS, 0.99 for 16S rRNA and 1.0 for rpoB. These findings strongly support the tenet that bacterial DNA amplified from bite marks and teeth can provide corroborating information in the identification of assailants.
Citation: Kennedy DM, Stanton J-AL, García JA, Mason C, Rand CJ, Kieser JA, et al. (2012) Microbial Analysis of Bite Marks by Sequence Comparison of Streptococcal DNA. PLoS ONE 7(12): e51757. doi:10.1371/journal.pone.0051757
Editor: Ramy K. Aziz, Cairo University, Egypt
Received: May 28, 2012; Accepted: November 5, 2012; Published: December 19, 2012
Copyright: © 2012 Kennedy et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Funding came from the New Zealand Dental Research Foundation, Ministry of Science and Innovation and the New Economy Research Fund (UOOX0809). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have read the journal’s policy and have the following conflicts. Co-author Jo-Ann L. Stanton is a PLOS ONE Editorial Board Member. This does not alter the authors’ adherence to all the PLOS ONE policies on sharing data and materials.
A bite mark is defined as a physical alteration in a medium caused by contact with the teeth . Bite marks have provided crucial physical and biological evidence for the prosecution of violent crimes . Bite marks can be found in inanimate objects such as foodstuffs, however it is injuries inflicted on human tissue that comprise the majority of bite mark cases presented in court . Human bite marks are sustained predominantly in homicide, sexual assault and child abuse .
The examination of bite marks currently relies on morphometric analysis, which involves the comparison of the characteristics of a suspect’s teeth with full-scale photographs of the injury . The correlation of a bite mark to the dentition of a suspect utilizes parameters of size, shape and alignment of teeth in addition to dimensions of the dental arch . The forensic discipline of bite mark analysis is centered on two assumptions; firstly, that the characteristics of the teeth involved in biting are unique to an individual, and secondly, that this asserted uniqueness is registered in the material that is bitten. The term “forensic” means “pertaining to a court of law”  thus bite mark evidence has been admissible testimony in criminal proceedings for almost 60 years . Despite the importance placed upon this evidence, there has been rising concern regarding the lack of empirical evidence underpinning conventional bite mark analysis , , , , , , . These concerns were recognised in the National Academy of Sciences report released in 2009 which concluded that “no evidence of an existing scientific basis for identifying an individual to the exclusion of all other” could be found .
DNA profiling was developed in the 1980’s and over the last 20 years, the adaptation of this technology permits DNA from human biological sources to be used for identification purposes. In cases involving bite marks, the recovery of human DNA from saliva provides an objective form of evidence . However, nucleases, such as deoxyribonuclease I, present in saliva at relatively high concentrations , ,  contribute to the rapid degradation of exposed DNA . Because of the difficulties that can be encountered in recovering salivary DNA of sufficient quality and quantity to generate a DNA profile, an alternative objective approach to bite mark analysis has been directed toward a bacterial genotyping method , , .
Bacterial DNA is enclosed within the cell envelope, which provides a biological barrier against the degradation suffered by exposed human DNA. More than 700 bacterial species inhabit the human oral cavity  and the numerically dominant species belong to the genus Streptococcus , , . 16S rRNA gene sequence analyses ,  has shown that oral streptococci are included within four multispecies phylogenetic units: the anginosus, mitis, mutans and salivarius groups. Currently, member species of these phylogenetic units include: S. anginosus, S. constellatus and S. intermedius (anginosus group); S. australis, S. cristatus, S. gordonii, S. infantis, S. mitis, S. oralis, S. parasanguinis, S. peroris, S. pneumoniae and S. sanguinis (mitis group); S. cricetus, S. downei, S. ferus, S. macacae, S. mutans, S. orisratti, S. rattus and S. sobrinus (mutans group); S. salivarius, S. thermophilus and S. vestibularis (salivarius group).
Characterization of the microbiota of the oral cavity reveals that S. mitis, S. oralis and S. sanguinis, are the initial colonizers of the teeth , , , . Of these species, S. mitis (which exhibits considerable genotypic diversity) is the predominant organism , , , , . Humans harbour multiple strains of the same Streptococcus species with many strains seemingly unique to individuals , , . This intraspecies diversity provides the premise that oral streptococci isolated from a bite mark inflicted on human skin may be genotypically matched, with a high degree of assurance, to those from the teeth responsible , . These observations were reiterated in a third study  that circumvented the need for prior culturing by amplifying bacterial DNA directly from teeth and experimental bite marks. In that study, streptococcal DNA, amplified with primers specific for hypervariable region 9 of streptococcal 16S rRNA gene, was resolved by denaturing gradient gel electrophoresis (DGGE), and a comparison of the amplicon profiles from the bite marks and teeth matched most bite marks to the teeth responsible. However, there was a concomitant risk of false positives with the sole use of this relatively conserved locus .
Phylogenetic analysis and identification of bacterial species have been conventionally based on 16S rRNA gene sequence comparison; however, the variable regions contained within this locus are generally insufficient for distinguishing closely related streptococcal species , . Alternative gene targets that discriminate between closely related streptococci include ITS (stretch of non-coding DNA that lies between the 16S and the 23S rRNA genes) , , , , rnpB (encoding endoribonuclease P) ,  and rpoB (encoding the beta subunit of the bacterial RNA polymerase) , . The variability offered by these regions is sufficient for discriminating between streptococcal species with almost identical 16S rRNA gene sequences. Therefore, the current study focused on determining whether such variability enables the discrimination of strains. Should these alternative molecular targets facilitate strain differentiation then it may be feasible to utilize them to distinguish between individuals.
This investigation had two objectives: the first was to apply high throughput sequencing, using the GS FLX 454 technology, to assess the feasibility of matching oral streptococcal DNA sequences amplified from experimental bite marks (inflicted on human skin) to those obtained from the teeth responsible. The second was to evaluate the capability of three genomic regions of streptococcal DNA to discriminate between participant samples.
Materials and Methods
The study design was approved by the University of Otago Human Ethics Committee (January 16, 2008, reference number 06/169). Written consent was obtained from all participants.
Bite and Teeth Samples
Sixteen unrelated adult participants recruited from the staff and students of the University of Otago generated self-inflicted bites on their upper arms , , . Participants were healthy adults who had not used mouthwash in the preceding month or antibiotics in the preceding three months. Before inflicting the bites, a sterile cotton applicator moistened in 0.9 % saline, was used to swab the area of skin to be bitten, to provide an index of the bacteria naturally present on the skin and to facilitate the distinction between oral and skin bacterial sequence data. Participants firmly bit their own upper arm in the bicep region with enough force to leave clear impressions of the teeth that would last for at least five minutes. Three hours later, saline-moistened cotton applicators were used to swab the bite mark. Dry, sterile cotton applicators were used to sample the upper and lower anterior teeth at this time also. The tips of the applicators were placed into separate sterile tubes each containing 2 ml of saline, and were vortexed for 30 seconds to detach the bacteria.
Extraction and purification of bacterial DNA from the skin, bite mark and teeth samples was achieved with InstaGene™ matrix (Bio-Rad Laboratories, Hercules, CA) according to manufacturer’s protocol. Portions (1.5 mL) of the saline-suspended bacteria were centrifuged for 3 minutes at 11,000 rpm at 4°C. The supernatant was discarded and the pellet resuspended in 200 µL of InstaGene™ matrix. Preparations were incubated at 56°C for 30 minutes, vortexed for 10 seconds and heated in a boiling water bath for 8 minutes. The tubes were cooled to room temperature, vortexed for 10 seconds and centrifuged for 2.5 minutes at 11,000 rpm at 4°C. An aliquot (100 µL) of the supernatant containing extracted bacterial DNA was recovered and stored at −20°C.
The streptococcus-specific oligonucleotide primers for the amplification of approximately 245 base pair (bp) fragments of the 16S rRNA gene; 16S–23S rRNA intergenic spacer region (ITS); endoribonuclease P (rnpB); and RNA polymerase beta-subunit (rpoB) loci are given in Table 1. Primers for the 16S rRNA gene and rnpB fragments have been previously described , . Alignment of partial ITS and rpoB sequences, from numerous strains of oral streptococci catalogued in GenBank, (http://www.ncbi.nlm.nih.gov/nuccore) identified areas of high variation and primers were selected in conserved flanking regions. All primers included the GS FLX/454® (Roche) Adapter A (for forward sequencing, GCCTCCCTCGCGCCATCAG) and B (for reverse sequencing, GCCTTGCCAGCCCGCTCAG) fused to the 5′ end of each primer.
PCR was performed in simplex with 5 µL of template DNA in a total reaction volume of 50 µL consisting of 37.8 µL of nuclease-free deionised water, 5 µL of 10X Taq buffer (25 mM Tris-HCl [pH 8.0], 35 mM KCl, 2.5 mM MgCl2) (HotMaster 5 PRIME GmbH, Hamburg, Germany), 1 µL of deoxyribonucleoside triphosphates (10 mM) (Roche Diagnostics, Indianopolis, USA), 0.5 µL of each primer (0.1 µM) and 0.2 µL of Taq DNA Polymerase (5 U/µL) (HotMaster). Thermocycling was preceded by an initial denaturation at 94 °C for 1 minute with maintenance at 4 °C following the last cycle. Reactions were subjected to 35 cycles (DNA Engine Thermal Cycler, Bio-Rad, CA, USA) of denaturation at 94 °C for 30 seconds, annealing at 56 °C for 30 seconds and extension at 72 °C for 30 seconds. PCR products were purified on silicate columns (QIAquick, Qiagen GmbH, Hilden, Germany) and the concentration of each eluate was estimated visually following agarose gel (1.5%) electrophoresis and staining with ethidium bromide.
For the first 11 participants the four amplicon libraries were pooled (in equimolar amounts) to give 11 bite mark and 11 teeth samples. For participants 12–16, the amplicon libraries were not pooled. All bite mark and teeth samples were sequenced individually. Samples were loaded into a 16-lane bead deposition gasket on a 70 X 75 mm PicoTiterPlate (Roche). Sequencing was performed in both forward (A-adapter sequence) and reverse (B-adaptor sequence) directions with the standard (not Titanium) amplicon sequencing protocol for the GS FLX/454® (Roche).
The filtering pipeline designed to extract high quality reads comprised three levels. The first and third levels were executed using a customized computational pipeline and the second employed an open source workflow. In the first level, reads shorter than 220 bp were discarded and the remaining reads grouped according to their locus. In the second level, the workflow Galaxy (galaxyproject.org) removed both forward and reverse primer sequences and eliminated bases with a PHRED quality score of < 20 (removing ambiguous base calls). Where ambiguous bases occurred, the read would be truncated. Therefore, the third filtering level discarded reads shorter than 180 bp and determined the frequency at which each read was observed. Reads observed only once were discarded. For reads observed at least twice, the script indicated their frequency in the sequence header. However, in this final data set, the read was represented as the consensus read. Thus the data set comprised high quality unique reads only. A minimum of ten unique reads/data set was required for samples to be included in comparative analyses. An additional customized script enabling the direct comparison of bite mark and teeth reads disclosed the number of reads 100% identical between the two sample types.
Prior to this study, a control experiment was performed to determine the quality of the filtered and trimmed reads isolated using this customised workflow. The pipeline processed read data from a sample containing a defined amplicon mix (reference sequences for amplicons were obtained using Sanger sequencing) and an error rate of 0.106% was determined (manuscript in preparation). While the error rate is lower than the 0.25% previously reported for GS FLX platforms , a maximized stringency was maintained by including only reads of 100% identity between two sample types. The proportion of shared identical reads was calculated by dividing the number of identical reads shared between a bite mark sample and a teeth sample by the total number of reads in that bite mark sample. For 16S rRNA, ITS and rpoB, all shared identical reads were compared with sequences available in the nucleotide database of GenBank (http://blast.ncbi.nlm.nih.gov/Blast.cgi) to identify SLOTUs (species-level operational taxonomic units).
Statistical modelling provided estimates of parameters for a population based on the sample data. Logistic regression is the preferred model for analyzing binary outcome variables. The statistical parameters generated from this analysis determined: i) whether a relationship existed between the binary outcome variable and the predictor variable and; ii) the optimum proportion of shared identical reads yielding the greatest probability of correctly matching a bite mark to the corresponding teeth. Statistical analyses were undertaken with R (http://cran.r-project.org/).
The 16S rRNA, ITS and rpoB read data from each bite mark and teeth sample were compared to determine the proportion of shared identical reads between the two sample types. These proportions constituted the predictor variable. All teeth samples were assigned a binary outcome of either 0 or 1. For each bite mark, the teeth sample originating from the same participant (corresponding) was assigned 1 (to indicate an expected match) and the remaining teeth samples (non-corresponding) assigned 0 (to indicate expected non-matches). To determine whether a relationship existed between the binary outcome variable and the predictor variable, the data from each locus were fitted to individual models. The corresponding p-values indicated whether the binary outcome variable was influenced by the measured predictor variable, thus a p-value less than 0.05 indicated a relationship.
To determine the optimum proportion of shared identical reads yielding the greatest probability of correctly matching a bite mark to the corresponding teeth, the model for each locus was used to estimate values for four different parameters: sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV). These parameters assessed the ability of the predictor variable to correctly match a bite mark to the corresponding teeth.
Sensitivity is defined as the proportion of true positives correctly identified as such and specificity is the proportion of correctly identified true negatives . In this study, sensitivity is the proportion of correct bite mark and corresponding teeth matches; specificity is the proportion of correct bite mark and non-corresponding teeth matches. To estimate sensitivity and specificity, each teeth sample had to be classified definitively using a “gold-standard” assessment, in addition to being classified according to the test being assessed. The “gold-standard” assessment was the previously described binary outcome variable. The test being assessed was the ability of the predictor variable to correctly match a bite mark to the corresponding teeth; thus the second assignment of binary values depended on the proportion of shared identical reads between the two sample types. A value of 1 was given if the proportion was higher than the optimum proportion of shared identical reads yielding the greatest probability of correctly matching a bite mark to the teeth responsible. To determine this optimum proportion, a Receiver Operator Characteristic (ROC) analysis was performed. The ROC analysis assessed the performance of different proportions (ranging from the lowest to the highest proportions observed in comparative analyses) to estimate values for sensitivity, specificity, PPV and NPV.
PPV is the proportion of test positives that are truly positive and NPV is the proportion of test negatives that are truly negative . In this study, PPV is the proportion of bite mark and corresponding teeth matches (being assigned a “match” according to the predictor variable) that were correct. NPV is the proportion of bite mark and non-corresponding teeth matches (being assigned a “match” according to the predictor variable) that were correct. Wald confidence intervals were calculated for sensitivity, specificity, PPV and NPV to indicate the range of values for each parameter that are possible 95% of the time under repeated sampling.
Results and Discussion
Quality Filtering of Read Data
The total number of reads generated by the GS FLX sequencing instrument was 179,987 from all bite mark samples and 232,229 from all teeth samples, translating to 115,801 and 117,886 unique reads for bite mark and teeth samples, respectively. Following quality filtering, the total number of unique reads was 3,164 from all bite mark samples and 5,085 from all teeth samples (Figures 1–4). The average length of sequence reads was 200 bp. This was expected for these amplicons following primer sequence removal.
Comparison of the number of unique 16S rRNA reads generated from samples in which amplicons from four loci were pooled (gray) and those submitted for sequencing singly (black). Bite mark sample 2 contains less than 10 unique reads and was therefore excluded from comparative analyses for all loci.
Comparison of the number of unique ITS reads generated from samples in which amplicons from four loci were pooled (gray) and those submitted for sequencing singly (black). As with 16S rRNA, bite mark sample 2 contains less than 10 unique reads and was therefore excluded from comparative analyses for all loci.
Comparison of the number of unique rnpB reads generated from samples in which amplicons from four loci were pooled (gray) and those submitted for sequencing singly (black). All rnpB reads were excluded from comparative analyses because bite mark samples 5, 8, 11, 13, 14 contained no reads following quality filtering and bite mark samples 15 and teeth samples 10 and 15 contained less than 10 unique reads.
Comparison of the number of unique rpoB reads generated from samples in which amplicons from four loci were pooled (gray) and those submitted for sequencing singly (black). Bite mark sample 11 contains less than 10 unique reads and was therefore excluded from comparative analyses for all loci.
The amplicon libraries from the four loci generated from bite mark and teeth samples 1–11 were pooled prior to sequencing. To determine whether single amplicon sequencing enhanced the number of unique reads, five additional bite mark and teeth samples (B/T12-16) were collected and the amplicons sequenced singly (rather than combined as a pool). Under these conditions, the number of unique reads (remaining after filtering) was generally greater than from samples in which the loci were pooled (Figure 5). Furthermore, submitting higher amounts of DNA for sequencing also increased the average number of unique reads.
Comparison of the average number of unique reads generated from samples with varying amounts of DNA, containing either amplicons from one locus or amplicons from four loci.
None of the skin control samples obtained prior to biting generated detectable amplicons using the streptococcus-specific fusion primers designed in this study. Molecular approaches have identified pyogenes group streptococci (e.g. S. pyogenes) and oral streptococci from various skin sites using universal bacterial primers for the 16S rRNA gene , , , , . However, the specificity of customized primers designed specifically from oral streptococci sequences used in the current study (evidenced by the absence of amplicons from the skin controls) provides assurance that the streptococci amplified from the bite marks originated from the teeth. High stringency filtering of the data to retain only reads that are 100% identical between bite mark and teeth samples further ensured analysis of strictly oral streptococci. This latter measure was validated by performing a phylogenetic analysis of 16S rRNA, ITS and rpoB reads that were matched between the two sample types. All shared reads were confirmed as representing species of oral streptococci with S. mitis, S. oralis and S. cristatus being identified by all three loci. The variability within the 16S rRNA and ITS regions were insufficient for distinguishing between closely related oral streptococci within the mitis and salivarius groups (Figure 6).
All 16S rRNA, ITS and rpoB reads identical between bite mark and teeth samples were compiled into locus-specific files and uploaded into GenBank for standard nucleotide-nucleotide BLAST comparison to determine SLOTUs. The number of shared identical reads in each locus-specific file was 482, 639 and 178, respectively. (Pseudo-Streptococcus pseudopneumoniae; Pneu-Streptococcus pneumoniae; Ther-Streptococcus thermophilus; Vest-Streptococcus vestibularis).
Comparison of Bite Mark and Teeth Read Data
Tables 2, 3 and 4 compare the proportions of identical 16S rRNA, ITS and rpoB reads shared between bite mark and teeth samples. After filtering, each retained sample contained at least ten unique reads. Samples 2 and 11 were excluded as they contained less than ten unique reads. RnpB reads were also excluded from comparative analyses because most samples contained less than ten unique reads following filtering (Figure 3).
For pooled samples, a comparison of 16S rRNA, ITS and rpoB reads revealed that the highest proportion of identical reads occurred between bite mark and corresponding teeth samples in 8, 7 and 9 (of 9) comparisons, respectively (Tables 2–4). For individually sequenced samples (12–16), a comparison of 16S rRNA, ITS and rpoB reads revealed that the highest proportion of identical reads occurred between bite mark and corresponding teeth samples in 5 (of 5) comparisons, for each locus (Tables 2–4). A comparison of the unique reads from the teeth samples of all participants revealed that on average, 11% of 16S rRNA reads and 20% of ITS reads were common to all participants. In contrast, participants shared only 1.6% of rpoB reads.
To determine whether the greater number of 16S rRNA and ITS unique reads obtained by single amplicon sequencing improved the discriminatory capabilities of these regions, read data from pooled samples (1–11) were compared with the read data from the singly sequenced samples (12–16). The increased number of unique 16S rRNA reads from teeth samples 13 and 15 produced proportions of identical reads with bite mark samples 3, 5, 6 and 9 that were greater than those obtained with their corresponding teeth samples (Table 2). The increased number of unique ITS reads from teeth sample 16 produced proportions of identical reads with bite mark samples 9 and 15 that were greater than that from teeth sample 16 (Table 3). In contrast, the increased number of unique rpoB reads obtained from teeth samples 12–16 did not produce proportions with bite mark samples 1–11 that exceeded those obtained with their corresponding teeth samples (Table 4).
Pooled sequence data (i.e. samples 1 and 3–10) were fitted to logistic regression models as the change in methodology disqualified samples 12–16. Table 5 lists the statistical parameters determined by logistic regression modelling.
Tables 2, 3 and 4 indicate that in at least 7 (of 9) comparisons, the highest proportion of shared identical reads occurred between a bite mark and its corresponding teeth sample. This strongly suggests that matching a bite mark to the teeth responsible is dependent on the predictor variable (i.e. proportion of shared identical reads). The probabilities confirm that the binary outcome variable was influenced by the measured predictor variable (Table 5) and not by some unmeasured variable or chance.
Assessment of the ability of the predictor variable to correctly match a bite mark to the corresponding teeth was provided by model estimates for specificity, sensitivity, PPV and NPV. ROC analysis revealed the optimum proportion of shared identical reads yielding the greatest values for each of the four parameters (Table 5). For the 16S rRNA model, the sensitivity of 100% indicates that all bite marks will be matched to the corresponding teeth; however, the PPV predicts that the proportion of these matches being correct is 75% (i.e. 25% false positive rate). The occurrence of false positives was also observed in the previously reported method involving the analysis of 16S rRNA amplicon profiles resolved by DGGE . For the ITS model, a maximized sensitivity yielded a PPV of 35%, translating to a 65% chance of obtaining a false positive. The values for the rpoB model revealed maximized scores of 100% for all four diagnostic measures indicating that all bite marks will be correctly matched to the corresponding teeth. Furthermore, the 16S rRNA, ITS and rpoB models all exhibit maximum negative predictive values, assuring that all negative cases will be correctly assigned (Table 5).
Under repeated random sampling from the population, the confidence intervals indicate the boundaries that will contain the true value of each parameter 95% of the time. It is important to recognize that bite mark evidence attempts to confirm the identity of a person held on suspicion based on other evidence. In other words, the approach explored here is not aimed at identifying an assailant from the wider population in the absence of other indicative evidence. Also derived from the ROC analyses are the values for the area under the curve (AUC), which measures the overall ability to discriminate between samples from teeth responsible for a bite and those not responsible, when compared to any bite mark sample. Where perfect discrimination is attained the ROC curve yields an area of 100%. The strength of the rpoB model was reiterated with an AUC of 100% (Table 5).
Fitting a model to sample data primarily involves finding estimates of the model parameters that are in some sense “optimal” for the data. Confidence that the estimates derived from each model are optimal was established by calculating two parameters, pseudo R2 and goodness of fit, which assessed the appropriateness of each model (data not shown). The pseudo R2 was calculated to indicate the proportion of variability in the data that is explained by the model. For 16S rRNA and ITS, the models explained 71% and 34% of the variability, respectively. The pseudo R2 of 100% obtained for rpoB, revealed a model that explains all of the variability in the data, thus constituting the best model. The “goodness of fit” tests the null hypothesis that the model approximates the data; a value of ≥ 0.05 is required for the model to be deemed a good fit of the data. While the 16S rRNA and ITS models met this criterion with values of 0.3 and 0.08 respectively, the rpoB model was exceedingly strong with a value of 0.996.
Of the three loci assessed, rpoB was clearly the most satisfactory, providing unequivocal identification of the teeth responsible for each bite. The strength of this region was validated in three ways: firstly, the high stringency of the filtering process ensured that data sets contained reads of the highest quality; thus correctly matching a bite with the teeth responsible was achieved using 3% of the initial unique reads. Secondly, the average proportion of identical reads shared between bite marks and corresponding teeth samples was an order of magnitude greater than those of bite mark and non-corresponding teeth samples. This ratio was maintained when the original bite samples were compared with teeth samples 12–16, which were sequenced at greater depth. Thirdly, the predictive power of rpoB to correctly assign a bite mark to the teeth responsible was absolute and supported by both AUC and PPV.
The differing performances of the three regions in distinguishing between participants can be attributed to the target sites of each primer. The 16S rRNA and ITS primers amplify a range of streptococcal species whereas the rpoB primers were designed to amplify only S. mitis, the most prevalent species on tooth surfaces , , , , . The robustness of rpoB in distinguishing participants is due to exclusivity to a species with profound genotypic diversity therefore permitting coverage of that species at a greater depth. The variable regions enclosed within the 16S rRNA and ITS fragments do not offer the discriminatory power required to distinguish between participants as irrefutably as does the rpoB region.
From a forensic standpoint, assurance that there is temporal stability of oral streptococcal populations is crucial. Genetic analyses reveal that oral streptococcal populations are dynamic with species numbers and proportions fluctuating over time , . The mechanisms underlying these changes are not fully understood; however, the dominant strains of streptococci are generally retained over longer periods , , . Approximately 20% of all S. mitis genotypes recovered from the buccal mucosae of six participants were detected in repeated samplings over a 10-month period , and almost 50% of S. mitis and S. oralis genotypes from two individuals were detected two years after initial sampling . Rahimi et al.,  found that between 20–78% of bacterial genotypes were recovered from the same teeth 12 months later. Nevertheless, the likelihood of matching bite mark sequence data to that of a suspected assailant will be increased by prompt sampling.
In conclusion, the comparison of highly discriminatory regions of oral streptococcal DNA recovered from bite marks and teeth is capable of unequivocally matching a bite mark to the teeth responsible and may provide valuable information to corroborate other evidence in cases where the perpetrators DNA cannot be recovered.
We would like to acknowledge the technical assistance of Mrs Jenine Upritchard and the statistical support of Associate Professor David Fletcher.
Conceived and designed the experiments: DMK CM. Performed the experiments: DMK. Analyzed the data: DMK. Contributed reagents/materials/analysis tools: JLS JAG CM CJR JAK GRT. Wrote the paper: DMK JLS JAG CM CJR JAK GRT.
- 1. Herschaft EE, Alder ME, Ord DK, Rawson RB, Smith ES (2007) Manual of Forensic Odontology. New York: Impress Printing & Graphics, Inc.
- 2. Rothwell BR (1995) Bite marks in forensic dentistry: a review of legal, scientific issues. The Journal of the American Dental Association 126: 223–232. doi: 10.14219/jada.archive.1995.0149
- 3. Pretty IA, Sweet D (2000) Anatomical location of bitemarks and associated findings in 101 cases from the United States. Journal of Forensic Sciences 45: 812–814.
- 4. Freeman AJ, Senn DR, Arendt DM (2005) Seven Hundred Seventy Eight Bite Mark: Analysis by Anatomic Location, Victim, and Biter Demographics, Type of Crime, and Legal Disposition. Journal of Forensic Sciences: 1436–1443.
- 5. Al-Talabani N, Al-Moussawy ND, Baker FA, Mohammed HA (2006) Digital analysis of experimental human bitemarks: application of two new methods. Journal of Forensic Sciences 51: 1372–1375. doi: 10.1111/j.1556-4029.2006.00265.x
- 6. Ellner PD (2006) The biomedical scientist as expert witness. Washington DC: ASM Press.
- 7. Court of Criminal Appeals of Texas (1954) Doyle v. State. Texas.
- 8. Pretty IA (2001) The scientific basis for human bitemark analyses- a critical review. Science and Justice 41: 85–92. doi: 10.1016/s1355-0306(01)71859-x
- 9. Pretty IA (2005) Unresolved issues in bitemark analysis. Bitemark Evidence. New York: Marcel Dekker. 547–563.
- 10. Kieser J, Tompkins G, Buckingham D, Firth N, Swain M (2005) Forensic Pathology Reviews; Tsokos M, editor. New Jersey: Humana Press. 157–179 p.
- 11. Pretty IA (2006) The barriers to achieving an evidence base for bitemark analysis. Forensic Science International 159S: 110–120. doi: 10.1016/j.forsciint.2006.02.033
- 12. Plourd CJ (2009) Innocent people convicted by bite mark evidence: Is there still a problem? Proceedings of the American Academy of Forensic Sciences: 257.
- 13. Clement JG, Blackwell SA (2010) Is current bite mark analysis a misnomer? Forensic Science International 201: 33–37. doi: 10.1016/j.forsciint.2010.03.006
- 14. Pretty I, Sweet D (2010) A paradigm shift in the analysis of bitemarks. Forensic Science International 201: 38–44. doi: 10.1016/j.forsciint.2010.04.004
- 15. National Research Council (2009) Strengthening forensic science in the United States: a path forward. National Academies ed. Washington D.C.: National Academies Press.
- 16. Nadano D, Yasuda T, Kishi K (1993) Measurement of deoxyribonuclease I activity in human tissues and body fluids by a single radial enzyme-diffusion method. Clinical Chemistry 39: 448–452.
- 17. Tenjo E, Sawazaki K, Yasuda T, Nadano D, Takeshita H, et al. (1993) Salivary deoxyribonuclease I polymorphism separated by polyacrylamide gel-isoelectric focusing and detected by the dried agarose film overlay method. Electrophoresis 14: 1042–1044. doi: 10.1002/elps.11501401166
- 18. Nieuw Amerongen AV, Veerman ECI (2008) Saliva- the defender of the oral cavity. Oral Diseases 8: 12–22. doi: 10.1034/j.1601-0825.2002.1o816.x
- 19. Mercer DK, Scott KP, Bruce-Johnson WA, Glover LA, Flint HJ (1999) Fate of free DNA and transformation of the oral bacterium Streptococcus gordonii DL1 by plasmid DNA in human saliva. Applied and Environmental Microbiology 65: 6–10.
- 20. Borgula LM, Robinson FG, Rahimi M, Chew KEK, Birchmeier KR, et al. (2003) Isolation and genotypic comparison of oral streptocci from experimental bitemarks. The Journal of Forensic Odonto-Stomatology 21: 23–30.
- 21. Rahimi M, Heng NCK, Kieser JA, Tompkins GR (2005) Genotypic comparison of bacteria recovered from human bite marks and teeth using arbitrarily primed PCR. Journal of Applied Microbiology 99: 1265–1270. doi: 10.1111/j.1365-2672.2005.02703.x
- 22. Hsu L, Power DA, Uprithcard J, Burton JP, Friedlander R, et al. (2012) Amplification of oral streptococcal DNA from human incisors and bite marks. Current Microbiology 65: 207–211. doi: 10.1007/s00284-012-0148-x
- 23. Aas JA, Paster BJ, Stokes LN, Olsen I, Dewhirst FE (2005) Defining the normal bacterial flora of the oral cavity. Journal of Clinical Microbiology 43: 5721–5732. doi: 10.1128/jcm.43.11.5721-5732.2005
- 24. Whiley RA, Beighton D (1998) Current classification of the oral streptococci. Oral Microbiol Immunol 13: 195–216. doi: 10.1111/j.1399-302x.1998.tb00698.x
- 25. Marsh P, Martin MV (1999) Oral Microbiology. England: Wright.
- 26. Truong TL, Menard C, Mouton C, Trahan L (2000) Identification of mutans and other oral streptococci by random amplified polymorphic DNA analysis. Journal of Medical Microbiology 49: 63–71.
- 27. Kawamura Y, Hou X, Sultana F, Miura H, Ezaki T (1995) Determination of 16S rRNA sequences of Streptocccus mitis and Streptococcus gordonii and phylogenetic relationships among members of the genus Streptococcus. International Journal of Systematic Bacteriology 45: 406–408. doi: 10.1099/00207713-45-2-406
- 28. Hardie JM, Whiley RA (2006) The genus Streptococcus- Oral. In: Dworkin M, editor. The Prokaryotes: The handbook on the biology of bacteria. 3 ed. Singapore: Springer. 76–107.
- 29. Socransky SS, Manganiello AD, Propas D, Oram V, Van Houte J (1977) Bacteriological studies of developing supragingival dental plaque. Journal of Periodontal Research 12: 90–106. doi: 10.1111/j.1600-0765.1977.tb00112.x
- 30. Nyvad B, Kilian M (1987) Microbiology of the early colonization of human enamel and root surfaces in vivo. Scandanavian Journal of Dental Research 95: 369–380. doi: 10.1111/j.1600-0722.1987.tb01627.x
- 31. Pearce C, Bowden GH, Evans M, Fitzsimmons SP, Johnson J, et al. (1995) Identification of pioneer viridans streptococci in the oral cavity of human neonates. J Med Microbiol 42: 67–72. doi: 10.1099/00222615-42-1-67
- 32. Fitzsimmons S, Evans M, Pearce C, Sheridan MJ, Wientzen R, et al. (1996) Clonal diversity of Streptococcus mitis biovar 1 isolates from the oral cavity of human neonates. Clin Diagn Lab Immunol 3: 517–522.
- 33. Wisplinghoff H, Reinert RR, Cornely O, Seifert H (1999) Molecular relationships and antimicrobial susceptibilities of viridans group streptococci isolated from blood of neutropenic cancer patients. J Clin Microbiol 37: 1876–1880.
- 34. Hohwy J, Reinholdt J, Kilian M (2001) Population dynamics of Streptococcus mitis in its natural habitat. Infect Immun 69: 6055–6063. doi: 10.1128/iai.69.10.6055-6063.2001
- 35. Rudney JD, Larson CJ (1994) Use of restriction fragment polymorphism analysis of rRNA genes to assign species to unknown clinical isolates of oral viridans streptococci. J Clin Microbiol 32: 437–443.
- 36. Glazunova OO, Raoult D, Roux V (2009) Partial sequence comparison of the rpoB, sodA, groEL and gyrB genes within the genus Streptococcus. Int J Syst Evol Microbiol 59: 2317–2322. doi: 10.1099/ijs.0.005488-0
- 37. Hassan AA, Khan IU, Abdulmawjood A, Lammler C (2003) Inter- and intraspecies variations of the 16S–23S rDNA intergenic spacer region of various streptococcal species. Syst Appl Microbiol 26: 97–103. doi: 10.1078/072320203322337371
- 38. Mora D, Ricci G, Guglielmetti S, Daffonchio D, Fortina MG (2003) 16S–23S rRNA intergenic spacer region sequence variation in Streptococcus thermophilus and related dairy streptococci and development of a multiplex ITS-SSCP analysis for their identification. Microbiology 149: 807–813. doi: 10.1099/mic.0.25925-0
- 39. Chen CC, Teng LJ, Chang TC (2004) Identification of clinically relevant viridans group streptococci by sequence analysis of the 16S–23S ribosomal DNA spacer region. J Clin Microbiol 42: 2651–2657. doi: 10.1128/jcm.42.6.2651-2657.2004
- 40. Hoshino T, Izumi T, Ooshima T, Fujiwara T (2005) Method for rapid identification of oral streptococci by PCR using 16S–23S ribosomal RNA intergenic spacer gene. Pediatric Dental Journal 15: 185–190. doi: 10.1016/s0917-2394(05)70051-3
- 41. Tapp J, Thollesson M, Herrmann B (2003) Phylogenetic relationships and genotyping of the genus Streptococcus by sequence determination of the RNase P RNA gene, rnpB. Int J Syst Evol Microbiol 53: 1861–1871. doi: 10.1099/ijs.0.02639-0
- 42. Innings A, Krabbe M, Ullberg M, Herrmann B (2005) Identification of 43 Streptococcus species by pyrosequencing analysis of the rnpB gene. J Clin Microbiol 43: 5983–5991. doi: 10.1128/jcm.43.12.5983-5991.2005
- 43. Drancourt M, Roux V, Fournier PE, Raoult D (2004) rpoB gene sequence-based identification of aerobic Gram-positive cocci of the genera Streptococcus, Enterococcus, Gemella, Abiotrophia, and Granulicatella. J Clin Microbiol 42: 497–504. doi: 10.1128/jcm.42.2.497-504.2004
- 44. Kilian M, Poulsen K, Blomqvist T, Havarstein LS, Bek-Thomsen M, et al. (2008) Evolution of Streptococcus pneumoniae and its close commensal relatives. PLoS One 3: e2683. doi: 10.1371/journal.pone.0002683
- 45. Rudney JD, Pan Y, Chen R (2003) Streptococcal diversity in oral biofilms with respect to salivary function. Arch Oral Biol 48: 475–493. doi: 10.1016/s0003-9969(03)00043-8
- 46. Droege M, Hill B (2008) The Genome Sequencer FLX System–longer reads, more applications, straight forward bioinformatics and more complete data sets. J Biotechnol 136: 3–10. doi: 10.1016/j.jbiotec.2008.03.021
- 47. Kirkwood BR, Sterne JAC (2003) Medical Statistics: Blackwell.
- 48. Dekio I, Hayashi H, Sakamoto M, Kitahara M, Nishikawa T, et al. (2005) Detection of potentially novel bacterial components of the human skin microbiota using culture-independent molecular profiling. J Med Microbiol 54: 1231–1238. doi: 10.1099/jmm.0.46075-0
- 49. Gao Z, Tseng CH, Pei Z, Blaser MJ (2007) Molecular analysis of human forearm superficial skin bacterial biota. Proc Natl Acad Sci U S A 104: 2927–2932. doi: 10.1073/pnas.0607077104
- 50. Gao Z, Tseng CH, Strober BE, Pei Z, Blaser MJ (2008) Substantial alterations of the cutaneous bacterial biota in psoriatic lesions. PLoS One 3: e2719. doi: 10.1371/journal.pone.0002719
- 51. Costello EK, Lauber CL, Hamady M, Fierer N, Gordon JI, et al. (2009) Bacterial community variation in human body habitats across space and time. Science 326: 1694–1697. doi: 10.1126/science.1177486
- 52. Structure, function and diversity of the healthy human microbiome. Nature 486: 207–214.
- 53. Bek-Thomsen M, Tettelin H, Hance I, Nelson KE, Kilian M (2008) Population diversity and dynamics of Streptococcus mitis, Streptococcus oralis, and Streptococcus infantis in the upper respiratory tracts of adults, determined by a nonculture strategy. Infect Immun 76: 1889–1896. doi: 10.1128/iai.01511-07