Paleomicrobiological investigations of a 14th-century coprolite found inside a barrel in Namur, Belgium were done using microscopy, a culture-dependent approach and metagenomics. Results were confirmed by ad hoc PCR – sequencing. Investigations yielded evidence for flora from ancient environment preserved inside the coprolite, indicated by microscopic observation of amoebal cysts, plant fibers, seeds, pollens and mold remains. Seventeen different bacterial species were cultured from the coprolite, mixing organisms known to originate from the environment and organisms known to be gut inhabitants. Metagenomic analyses yielded 107,470 reads, of which known sequences (31.9%) comprised 98.98% bacterial, 0.52% eukaryotic, 0.44% archaeal and 0.06% viral assigned reads. Most abundant bacterial phyla were Proteobacteria, Gemmatimonadetes, Actinobacteria and Bacteroidetes. The 16 S rRNA gene dataset yielded 132,000 trimmed reads and 673 Operational Taxonomic Units. Most abundant bacterial phyla observed in the 16 S rRNA gene dataset belonged to Proteobacteria, Firmicutes, Actinobacteria and Chlamydia. The Namur coprolite yielded typical gut microbiota inhabitants, intestinal parasites Trichuris and Ascaris and systemic pathogens Bartonella and Bordetella. This study adds knowledge to gut microbiota in medieval times.
Citation: Appelt S, Armougom F, Le Bailly M, Robert C, Drancourt M (2014) Polyphasic Analysis of a Middle Ages Coprolite Microbiota, Belgium. PLoS ONE 9(2): e88376. doi:10.1371/journal.pone.0088376
Editor: Kostas Bourtzis, International Atomic Energy Agency, Austria
Received: September 27, 2013; Accepted: January 6, 2014; Published: February 28, 2014
Copyright: © 2014 Appelt et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: No external funding for this study.
Competing interests: The authors have declared that no competing interests exist.
Human paleomicrobiology, the quest for microbes in ancient specimens derived from humans, mainly relied on the investigations of old bone and dental pulp specimens . Such investigations characterized past pathogens but did not provide data of ancient microbiota. In complement, investigating fossilized fecal material, i.e. coprolites, previously helped to gain knowledge on ancient human gut microbiota and intestinal parasites –. Currently, less than 20 coprolites and ancient colon content samples collected from six American and two European archeological sites have been investigated using large-scale sequencing and PCR-based analyses. These investigations yielded data about ancient gut microbiota, indicating that parts of the digestive flora were preserved in such specimens , , , –. Moreover, these studies enabled to compare dietary habits of ancient populations and their impact on human gut flora composition , , . Analyzing differences in the composition of bacterial and fungal community associated with coprolites, supported the existence of two different pre-Columbian cultures in Indonesia . Furthermore, a recent study indicated that coprolites exhibited more similarities between each other, and with stools from modern rural communities, than with stools coming from modern cosmopolitan communities. These findings support that modern lifestyle may participate to changes in the composition of the human gut flora .
In the present study, a further coprolite from a Middle Ages European site was investigated using a polyphasic approach in order to expand knowledge about gut microbiota in ancient Europe. Microscopic observations, culture and metagenomics (high-throughput sequencing and 16 S rRNA gene amplicon sequencing) were used to characterize the microbiota associated with the coprolite and to identify potential pathogens. Then, ad hoc suicide PCR amplifications were used for confirmation .
In 1996, the exploration of an archeological Middle Ages site in Namur, Belgium yielded a closed barrel, such as those commonly used at that time as pits or latrines. The barrel was located at a depth of 3.80 m beneath the modern soil level and contained a 121.4 g, dark-brown, well preserved coprolite specimen. The specimen was attributed the number Z04F56 and was deposited into the laboratory of paleo- and parasitological studies of Champagne-Ardennes University, Reims, France until 2007. In 2007, the specimen was transferred to the laboratory for paleomicrobiology of Aix-Marseille University, Marseille, France for further investigations. After aseptically peeling of its external portion, the inner portion of the coprolite was re-suspended in sterile Page's amoeba saline medium (PAS) and microscopic observations revealed the presence of several eggs (Figure S1 in File S1). Thick-shelled and barrel-shaped eggs, with polar ‘plugs’ at the ends, 40–60 µm in length and 20–26 µm diameter, corresponded to the phenotypic description of Trichuris spp. eggs . More precisely, broad eggs compatible with the pig-infecting Trichuris suis species and thinner eggs compatible with human-infecting Trichuris trichiura were observed (Figure 1A and B) . Thick-shelled and brown eggs, corresponding to the description of Ascaris spp. eggs were also observed . Among them, unfertilized elongated eggs (60 µm in length), fertilized round eggs (diameter between 40 and 50 µm), as well as eggs with embryos inside (Figure 1C) were found in the Namur coprolite. Microscopic observations also revealed the presence of suspected Taenia spp. eggs, plant fibers, pollens and mold remains (Figure 1D and 1E). Cysts, plant fibers and seeds stained red using Congo red (Figure 1F-G). The cysts measured 4.1 to 13.5 µm and matched with the description of amoeba cysts .
A) Trichuris spp. egg (ø 25.83 µm; length 43.65 µm); B) Trichuris spp. egg (ø 22.50 µm; length 41.10 µm); C) Ascaris spp. fertilized egg (ø 49.92 µm) and unfertilized egg (ø 43.16 µm; length 59.92 µm); D) pollen (ø 18.26 µm); E) suspected Taenia spp. egg (ø 14.65 µm); F) suspected Acanthamoeba spp. cyst; and G) seed remains. (The scale bar on the right indicates 20 µm).
After ten-day incubation at 30°C in the presence of negative controls, small colonies were visible in the aerobic and anaerobic layers of R2A and Schaedler broths. Additionally, a tiny film was observed on the surface of the R2A solid medium. After 5-7-day subculture, matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry performed as previously described  identified eight different bacterial species including Paenibacillus macerans, Bacillus jeotgali, Staphylococcus pasteuri, Staphylococcus epidermidis, Staphylococcus cohnii, Micrococcus luteus, Pseudomonas geniculata and Stenotrophomonas maltophilia. In addition Bacillus horti and Clostridium magnum were identified by 16 S rRNA gene sequencing  (Table S3 in File S1; Figure S2 in File S1). Furthermore, culturing the specimen in anaerobic and aerobic blood culture bottles in the presence of non-inoculated bottles (negative controls) yielded one Rhodanobacter sp. organism, one Paenibacillus sp. organism, Paenibacillus macerans, Paenibacillus thiaminolyticus, Paenibacillus ehimensis, Staphylococcus arlettae, Propionibacterium acnes and Enterobacter cloacae (Table S4 in File S1; Figure S2 in File S1).
Acridine orange staining disclosed the presence of DNA in the coprolite, suggesting that molecular biological tools can be further applied to this specimen . Accordingly, high-throughput sequencing yielded a total of 37.5 millions base pairs and 107,470 reads, with an average sequence length of 375 bp and a GC content between 65 and 70% (MG-RAST accession number 4479942.3). Taxonomic assignment of the reads was performed using a BLASTX comparison with the National Center for Biotechnology Information (NCBI) database, with stringent parameters as previously described , . A significant similarity to known sequences was obtained for 31.9% of reads comprising 98.98% bacterial, 0.52% eukaryotic, 0.44% archaeal and 0.06% of viral reads. The most abundant bacterial phyla were Proteobacteria (58.12%), Gemmatimonadetes, (15.18%), Actinobacteria (6.96%) and Bacteroidetes (5.10%) (Figure 2). More precisely, the high-throughput sequencing dataset yielded Gemmatimonas, Rhizobium, Streptomyces and Burkholderia known as environmental bacteria; as well as Corynebacterium, Cytophaga, Enterobacter, Prevotella, Ruminococcus, Aeromonas, Escherichia, Lactobacillus and Bacteroides known as members of mammal gut microbiota (Table S1 in File S1 ). In particular, contig reconstruction and annotation identified contigs belonging to human gut Bacteroides species, including Bacteroides finegoldii, Bacteroides vulgatus, Bacteroides coprocola along with Bacteroides coprosuis belonging to pig gut microbiota (Table S2A in File S1).
The dataset obtained from high-throughput sequencing (external circle) and Operational Taxonomic Units (OTUs) assigned to the 16 S rRNA gene amplicon (internal circle) were used to identify bacterial phyla. Only phyla comprising more than 0.3% of the datasets are shown.
Some metagenomic reads were assigned to potential pathogenic bacteria (Figure 3, Tables S1 and S2B in File S1). Reads and contigs of amoeba-resistant bacteria (i.e. bacteria resisting to killing by amoeba) Actinobacteria spp., Pseudomonas spp., Parachlamydia acanthamoebae, Legionella drancourtii and Legionella pneumophila were found . Moreover, contig annotations yielded Burkholderia gladioli, Granulibacter bethesdensis, Leptospira borgpetersenii, Coxiella burnetii and Mycobacterium abscessus. Contigs respectively encoding a hypothetical protein and an initiation factor 3 of Brucella abortus were also found, as well as reads and a contig encoding VapC46 toxin from Mycobacterium tuberculosis (Figure 3, Tables S1 in File S1 and S2B in File S1). Two contigs assigned to Clostridium botulinum encode a perosamine synthetase and a methionyl-tRNA formyltransferase. Among Bordetella spp. sequences found in the metagenomic dataset (Figure 3), contig reconstruction and annotation identified contigs encoding a hypothetical protein of Bordetella parapertussis and a putative hydrolase of Bordetella bronchiseptica (Tables S1, S2 in File S1). For the hydrolase, phylogenetic analyses confirmed the BLAST based annotation (Figure S3 in File S1). The cultured P. macerans, M. luteus, S. maltophilia and E. cloacae were also found in the metagenomic high-throughput sequence dataset.
Only human and animal pathogens detected in silico or by at least two tests are shown. A positive test result is marked in green and negative tests are colored in red. Pathogenic microorganisms which were previously associated to ancient human coprolites or colon contents are marked by circles. *  ** 
In the high-throughput sequence dataset, 0.09% of reads identified bacterial 16 S rRNA genes. For further analysis, the V6 region was amplified by PCR. This 16 S rRNA gene dataset yielded 132,000 trimmed reads. The Bayesian microbial source-tracking approach was performed to compare the 16 S rRNA gene dataset associated with the Namur coprolite to those of modern stool samples, published coprolites, compost and soil as previously described . The results indicated that the mixture of taxa associated to the Namur coprolite had no significant matches with any of the different sources used for comparison (Figure S4 in File S1). A total of 673 Operational Taxonomic Units (OTUs) were assigned to the 16 S rRNA gene dataset (MG-RAST accession number 4480463.3). Comparisons to the laboratory gut microbiota database including all 16 S rRNA gene sequences generated by the 454 FLX titanium platform showed that OTUs associated with the Namur coprolite did not clusterize with sequences previously amplified in the laboratory. The most abundant identified phyla were Proteobacteria (85.1%), Firmicutes (7.4%), Actinobacteria (2.8%) and Chlamydia (1.8%) (Figure 2). Furthermore, Bartonella species were detected with a 16 S rRNA gene sequence identity of 98.7% (Figure 3, Table S4 in File S1); phylogenetic analyses indicated that these 16 S rRNA gene amplicons were most closely related to Bartonella henselae, Bartonella koehlerae and Bartonella quintana (Table S3 in File S1and Figure S5 in File S1).
Pathogen-specific PCR assays
Mechanical lysis of the specimen reduced PCR inhibition as measured by PCR amplification of an internal synthetic, 135-bp oligonucleotide sequence used as a PCR control. Further, microorganisms detected by microscopy or metagenomic were tentatively amplified by using species-specific PCR amplifications. Sequencing a 120-bp band generated using Ascaris spp.– specific primer pair  (Table S5 and Table S6 in File S1) yielded 98% sequence identity with Ascaris sp. DHS-2010a cytochrome b gene sequence, similar to those previously reported in a human remain  (GenBank Accession No. GU339224.1). The amplicon generated by an Acanthamoeba spp. – specific PCR showed a 99% sequence similarity with Acanthamoeba castellanii (GenBank Accession No. JF437606.1) . Bartonella species, detected in the 16 S rRNA gene rDNA dataset, were additionally amplified by PCR. The amplicon of the ribosomal RNA operon rrsC yielded 95% of sequence identity with B. henselae and 94% with B. quintana (GenBank Accession No. BX897699.1 and BX897700.1), respectively. Bordetella species, detected in the high-throughput sequencing dataset, were also amplified in the PCR assays (Figure 3). A 189-bp long fragment of the RNA polymerase C gene was generated, exhibiting 99% of sequence similarity with B. bronchiseptica (GenBank Accession No. HE965806.1, BX640437.1, HE965807.1), B. parapertussis (GenBank Accession No. HE965803.1, BX640423.1) and Achromobacter xylosoxidans (GenBank Accession No. CP002287.1). The obtained amplicon differed in one position (T → C transition) from the B. parapertussis and B. bronchiseptica reference strains. In two positions, the amplicon differed from B. bronchiseptica MO149 strain (T → C transition) as from the reference sequence of A. xylosoxidans (T → C transition and G → C mutation) (Figure S6 in File S1) respectively.
Synopsis of identified microorganisms
The identified microorganisms were categorized into two groups. The first group comprises microorganisms for which the identification was confirmed by at least two independent methods among microscopy, culture, metagenomic and the PCR assay. This group includes Ascaris spp. and Acanthamoeba spp. identified by microscopy and PCR assay; P. macerans, M. luteus, S. maltophilia and E. cloacae found by culture and metagenomic; and Bartonella spp. – related to B. henselae and B. quintana – and Bordetella spp. found in metagenomic or 16 S rRNA gene amplicon datasets and the PCR assay (Figure 3). The second group comprises of microorganisms identified only in silico by contig annotations (Figure 3). This group includes B. abortus, C. botulinum, C. burnetii ¸ G. bethesdensis, M. tuberculosis, M. abscessus, B. gladioli, L. drancourtii, L. pneumophila, L. borgpetersenii, P. acanthamoebae, B. finegoldii, B. vulgatus, B. coprocola and B. coprosuis (Figure S2 in File S1).
Paleomicrobiological investigations of a total of fourteen human coprolites and one colon sample have been reported from six different American archeological sites , , , , . As for Europe, only two colon content specimens from two different sites, respectively dated to 3,350–3,100 BC and 1918 AD and no coprolite have been analyzed , . In the present study, a coprolite collected from a medieval site in Belgium was investigated using a polyphasic approach.
This coprolite was recovered from a sealed barrel which was still intact at the time of its discovery, and only the internal portion of the coprolite was investigated. Current recommendations for paleomicrobiology and paleoparasitology studies were strictly enforced in order to minimize in-laboratory contaminations , –. No positive control was used and negative controls incorporated during all the experimentations remained negative. Nevertheless, the data herein reported – the presence of amoebal cysts, plant fibers, seeds, pollens and mold remains – indicate that the coprolite obviously contained environmental flora as previously reported in other investigations , , , . Accordingly, source tracking of the 16 S rRNA gene amplicon dataset yielded no hits, as previously also observed for two pre-Columbian American coprolites . More coprolites need to be investigated before source tracking can be informative. Furthermore, the observation of pig- and human-infecting Trichuris spp. eggs in the coprolite indicates a possible common use of the barrel for animal and human feces. This point was supported by the results obtained in the metagenomic datasets, with bacterial sequences assigned to species found in modern gut microbiota of humans and pigs. Nevertheless, parts of the Namur coprolite correspond to human gut microbiota. Gut bacteria phyla – Alpha-, Beta- and Gammaproteobacteria and Bacteroidetes – were found herein as described for ancient coprolites and colon contents from American and European archeological sites , , , . Furthermore, Corynebacterium, Enterobacter and Prevotella are common inhabitants of the modern human digestive tract, further detected in the Namur coprolite , , , , . Moreover, intestinal human parasites including Trichuris and Ascaris were also identified. The sequence obtained for the intestinal parasite Ascaris spp. supports the human origin of the Namur coprolite. Indeed, the same Ascaris spp. sequences were previously amplified from the remains of a human pelvic bone from a medieval Korean tomb .
On this basis, the Namur coprolite was further used as a source to detect past pathogens. Modern stool specimens diagnose systemic infections in humans and primates including malaria, rickettsiosis and tuberculosis –. Likewise, two pathogens Haemophilus parainfluenzae and Clostridium botulinum were already found in ancient fecal material ,  and Neisseria, Yersinia, Shigella and Mycobacterium sequences were identified in WGS datasets of pre-Columbian American coprolites . To go further, it was herein attempted to ascertain the presence of potential pathogenic agents that might be associated to the analyzed specimen, in silico or by molecular tests. Bacterial pathogens belonging to Bartonella and Bordetella were detected here for the first time in a coprolite sample. Bartonella species are responsible for zoonoses  and Bordetella species cause respiratory tract infections .
Using a polyphasic approach including culture-dependent and culture-independent techniques, several microorganisms were consistently identified in one coprolite dating back to the European Middle Ages. These microorganisms included both common members of the gut microbiota and systemic pathogens. Coprolites are a source of knowledge regarding both microbiota and pathogens circulating in ancient populations and could be investigated using techniques routinely used for the modern diagnosis in clinical microbiology.
Materials and Methods
According to the French legislation, no permits are required for the investigation of coprolite in general and for this study in particular.
After excavation, the coprolite was stored in a sterile forensic specimen bag. In 2006, the coprolite was sent to our laboratory, where it was handled only in a positive pressure room with isolated ventilation under strict aseptic conditions. Workbenches were stringently disinfected using absolute ethanol and UV-irradiation for at least 30 min. Non-disposable instruments were autoclaved. Reagents and chemicals were from new stocks aliquoted into sterile, single-use tubes and immediately discarded after use. The external portion of the coprolite was aseptically removed; only the internal portion of the coprolite was used. DNA -extraction, PCR and post-PCR experiments were performed in separate rooms in isolated work areas. Positive controls were strictly avoided. Negative controls for DNA -extraction and PCR were used at a 1∶4 control: specimen ratio , –.
A 350-mg sample taken from the interior portion of the coprolite specimen was rehydrated for three days in 5 mL of phosphate buffered saline (PAS;Biotechnologie Appliquée, Taden, France) prior to observation under a microscope. Congo-red staining was applied to stain the cellulosic material of the coprolite. Then, 500 mg samples were taken from the interior of the coprolite specimen and rehydrated for two days in 5 mL of PAS. The diluted samples were fixed with absolute methanol on a glass slide, coated with 1 mg/mL Congo-red solution (Microm, Francheville, France) and incubated for 1 h at room temperature. The slide was carefully rinsed with 1 M sodium hydroxide (Aldrich-Chemie GmbH, Steinheim, Germany) and air-dried. Microscopic observations were performed using a Leica DM2500 microscope (Leica Microsystems, Nanterre, France) at 40× and 100× magnifications. Pictures were taken using a Nikon digital sight DS-U1 camera with Lucia G software (Nikon Instruments, Champigny sur Marne, France). Measurements were collected using the ImageJ software (http://rsbweb.nih.gov/ij/). Further, 500 mg of the coprolite were rehydrated into 1 mL PBS (Biotechnologie Appliquée), fixed with absolute ethanol on glass slides and stained using acridine orange solution as previously described .
DNA recovery from the coprolite
Two independent extraction protocols were used to obtain ancient DNA of the widest possible length range and to remove pigments, inhibitors of molecular detection methods. One gram of the coprolite was solubilized overnight at 4°C in 1 mL TE buffer (ethylenediamine tetraacetic acid; buffered solution, Tris HCl 10 mM, EDTA 1 mM, pH 8). A 500-µL aliquot of the solution was used for total DNA extraction as previously described , except that incubations into TE and digestion buffers were shortened to 1 day. TE buffer without coprolite was used as a negative control. For the DNA extraction using the PowerSoilR DNA Isolation Kit (MoBio Laboratories, Inc., Carlsbad, USA) , 500 µL of solubilized coprolite specimen were also used. Incubation was extended to 24 h/56°C in PowerSoilR bead tubes containing sodium dodecyl sulfate (SDS) and digestion buffer C1, under contentiously rotation followed by shaking in a Bio 101 FastPrep instrument (Qbiogene) at level 6.5 (full speed) for 95 s. DNA extraction was performed according to the manufacturer's instructions. Extraction batches contained also negative controls composed of PowerSoilR bead tubes without coprolite. Total DNA extracts from both protocols above were pooled together.
The coprolite was inoculated into broth in vertical tubes producing a gradient of oxygen, to recover aerobic, microaerophilic and anaerobes bacteria which may have survived inside the coprolite. Schaedler (Neogen, Lansingen, Michigan) and R2A (Neogen) broths with an additive of 1.5% of agarose were used as culture media. Sterile screw capped tubes (BIO-RAD, Marnes-la-Coquette, France) were filled with 15 mL of growth media and boiled for 30 min and then incubated at 47°C for 10 min. A 1-g inner portion of the coprolite was removed under anaerobic conditions and solubilized in 2 mL of sterile PBS (Biotechnologie Appliquée). Then, 200 µL of this suspension were injected into tubes containing Schaedler agar tubes under strict anaerobic atmosphere. In parallel, 1 mL of this suspension was incubated at 70°C for 20 min and then injected under strict anaerobiosis into tubes containing R2A broth. Tubes were incubated at 30°C for daily inspection. When visible growth was observed, the tubes were sliced with a sterile glass cutter under strict anaerobiosis and subcultured on Schaedler or R2A agar plates under various atmospheres. The appropriated atmospheres were created in sterile incubation bags using gas generating pouch systems (BD Gas PakTM EZ, Maryland, USA). During the entire procedure, two negative controls (broth with and without PBS were carried out. Additionally, BD BACTEC, Lytic/10 Anaerobic and Aerobic bottles enriched with 7 mL of defibrinated sheep blood (bioMérieux, Marcy l'Etoile, France) were used for culturing. 1-g of the inner portion of the coprolite was solubilized into 1 mL of PBS (Biotechnologie Appliquée) and 300 µL of such suspension were injected into the culture bottle. After 2-day incubation at 37°C, 100 µL of the anaerobic and aerobic culture liquid were serially diluted (D1:D10−10) and 10 µL of each dilution were plated onto COS culture plates (bioMérieux) and incubated at 37°C under strict anaerobe, microaerophile and aerobe atmosphere created by the use of gas generating pouch systems. Culture bottles with 300 µL of sterile PBS were run in parallel as negative controls. Likewise, COS plates (bioMérieux) inoculated with 10 µL culture liquid form the negative bottle culture and a COS plates (bioMérieux) that were opened under the Biosafety cabinet level 2 during the whole time of manipulation, were used as negative controls during subculture. Colonies were identified by MALDI-TOF mass spectrometry. Briefly, colonies were directly spotted on the mass-spectrometer plate and overlaid by appropriate matrix, before introduction into a Microflex mass spectrometer (Bruker Daltonics, Wissembourg, France). The spectrum of 2,000 – 20,000 – dalton peaks was compared to those in the database, which comprised of 3,768 reference bacterial spectra, in order to achieve identification . When MALDI-TOF identification failed, colonies were identified by 16 S rRNA gene sequence analysis as previously described .
High-throughput pyrosequencing of the 16 S rRNA gene V6 region
The V6 region was amplified using 454-adapter primers . PCR was performed in a final volume of 50 µL containing 1×PCR buffer, 2 µL of 25 mM MgCl2, 200 µM of each d’NTP, 1 µL of 10 pM of each primer (10 pM), 31.15 µL ddH2O, 1 unit of HotStar Taq Polymerase (Invitrogen, Villebon sur Yvette, France) and 57–112 ng of DNA-extract. The amplification was performed by incubating at 95°C for 15 min, followed by 31 cycles of denaturation at 95°C for 45 sec, annealing at 58°C for 45 sec, elongation at 72°C for 90 sec, followed by an final elongation at 72°C for 10 min in an ABI Thermocycler (Applied Biosystems Gene Amp PCR System 2700, Villbon sur Yvette, France). PCR products were purified using Ampure beads (AgentcourtR AMPureR XP, Beckman Coulter, USA) and checked using a BioAnalyzer (Agilent 2,100 Bioanalyzer, Agilent Technology, Lithuania) LabChip with a DNA chip 7,500 at 548 bp (Agilent DNA 7,500 Reagent). Quantification of PCR products was performed using a Tecan GENios fluorometer was perfomed using a Quant-iTTM PicoGreenR ds DNA Assay Kit (Invitrogen) following the Amplicon Library Preparation Method in a manual provided by Roche (Roche, Mannheim, Germany). The concentration of the library was 27.7 ng/µL, corresponding to 8.84 E×10 molecules/µL. Clone amplification was performed using a GS Titanium LV emPCR Kit (Lib-A) v2 using only DNA Capture Beads A (Roche). Sequencing was performed with a GS FLX Titanium XLR70 Sequencing Kit (Roche).
The 16 S rRNA gene pyrosequencing data were processed using Mothur package 1.5 . No ambiguous bases ‘N’ and only one mismatch were allowed in the read and primer sequences. The quality read trimming used a moving window of 50 bp and required that the average quality score over the region did not drop below 35. The trimmed reads were dereplicated and aligned using the Sylva bacteria reference alignment provided by the Mothur (http://www.mothur.org/). The multiple sequence alignment was filtered using a minimum read length of 200 bp. In addition, a pre-clustering step  was performed before chimera detection using the Uchime tool in Mothur. A distance matrix was built based on multiple sequence alignment, and OTUs clustering was performed at a 97% sequence identity. The taxonomic classification from phylum to genus level of each representative OTU sequence was performed using the RDP classifier tool and the RDP training set 9-database (http://www.mothur.org/). The relative abundance of reads per phyla was deduced from this classification. To exclude laboratory contaminations, we built an in-house gut microbiota database. This gut microbiota database contained 5,500,000 16 S rRNA gene sequences using all of the data generated by the 454FLX titanium platform of the URMITE laboratory. The database includes data from obese (SRX118214), Senegalese (SRX118212, SRX118213), HIV (SRX209782), anorexic (SRX209240), post- antibiotic treatment (SRX189054, SRX189053) and drug resistance tuberculosis (SRX204218) stool specimens. The OTUs identified in the Namur coprolite were clustered at 97% sequence identity with sequences in our gut microbiota database. The OTUs and corresponding sequences that did not match with any sequence of the database were evaluated.
To detect potential pathogens in the 16 S rRNA gene pyrosequencing data, we used an alternative method that consisted of binning reads according to their taxonomic classification using BLAST searches against the Ribosomal Database Project (RDP) database. First, the OTU-based quality read trimming filter was applied. However, no multiple sequence alignment and no distance matrix were performed to increase read length and therefore, improve the ability of assignment at the species level. Two databases were created using selected criteria from the Hierarchy Browser of the RDP 16 S rRNA database, release 10 (http://rdp.cme.msu.edu/). A “Type” database was built with sequences labeled “Type strains”, “Isolates” and size “length >1,200 bp” with “good quality”. A “Non-type” database was built using “Non-type strains”, “Isolates”, size “length >1,200 bp” and “good quality”. The two databases were formatted using Taxcollector . The species level was defined with a minimum sequence identity of 98.7%  with the best BLAST hit from the “Type” database. The multiple best BLAST hit cases were checked for the most representative species (>50% of the multiple best BLAST hits). A second BLAST round was performed from the remaining reads with the same cutoffs but using the “Non-type” database. A total of 121 reads were assigned to the genus Bartonella, including 53 reads that were significantly assigned to B. quintana and 47 reads to B. henselae. For some 16 S rDNA amplicons phylogenetic trees were constructed. Multiple sequence alignment was performed using MUSCLE  and curated by Gblocks . The phylogenetic tree was built using the PhyML algorithm  with a bootstrap of 100 and the nucleotide substitution model HKY85 . These tasks were all performed using the pipeline www.phylogenie.fr . The phylogenetic trees were visualized using trees DrawTree .
To determine the potential sources of bacteria contained in the coprolite sample (defined as sink) we used the SourceTracker software package  as previously reported . The Source Tracker sofware uses a Bayesian model that estimate the different source proportion found in a community sample (sink). The different sources examined (16 S rRNA datasets) correspond to 88 soil samples , 602 multiple human body sites (skin, nasal, tongues, urine) including 45 U.S. gut samples , 20 infant guts , 3 Polynesian guts from our laboratory, 60 mammal guts , eight different coprolites  and one compost . If the community tested corresponds to a mixture of taxa that do not match with any of the source environments used, that portion of the community is classified as “unknown”. The 16 S rRNA reads were processed for all source and sink samples using the quantitative insights into microbial ecology 1.7.0 release (QIIME) . High-quality read sequences (quality score >30, exact match to primer, and containing no ambiguous characters) were trimmed from the initial dataset. When compared datasets from non-overlapping amplicons the taxonomy binning with the closed-reference OTU picking strategy and using the Greengene gg_otus-12_10 release were performed. Then, all the taxonomy tables were converted and merged, for all the mapping files of the various source/sink datasets for SourceTracker analysis with the defaults parameters (iterations for Gibbs sampling = 100, rarefaction depth = 1000, alpha1 = 1e−3, alpha2 = 1e−1). The modification of the Source tracker parameters did not change the results obtained for the analysis of the Namur coprolite. To control the analysis a soil sample  and a previously investigated coprolite ZA04 were used as positive controls  (Figure S4 in File S1). The coprolite specimen matched to the Sources of primate/mammalian gut and Burkina Faso children gut.
The shotgun strategy was chosen for high-throughput pyrosequencing on a 454 Life Sciences Genome sequencer FLX instrument using titanium chemistry (Genome Sequencer RLX, Roche). Sequencing was performed using eight regions of the PicoTiterPlate. The concentration of extracted DNA was measured with the QuAnt_IT Picogreen Kit (Invitrogen) on a Tecan Fluorometer (GENios) at a concentration of 28.2 ng/µL. A total of 500 ng of DNA was nebulized. The library was constructed according to the 454-titanium shotgun protocol and the manufacturer's instructions. DNA fragmentation was visualized using the BioAnalyzer 2,100 on a LabChip with high sensitivity and an optimal size of 872 bp. The DNA stock was measured on a TBS fluorometer at 8.776 E+08 molecules/µL. The library was clonally amplified with 3cpb in 3 emPCR reactions using the GS Titanium SV emPCR Kit (Lib-L) version 2.The titration yield was 12.31%. In total 340,000 beads per project and per region were loaded onto the GS Titanium PicoTiterPlate Kit 70×75 which corresponded for this project with 383 µL of the clonal amplification and sequenced with the GS Titanium Sequencing Kit XLR70. The run was performed overnight, and then analyzed using the cluster. Using the Camera 2  QC-Filter and the 454 duplicate clustering tool, reads that were low quality (average score <19) or that were <60 bp and identical duplicates (default sequence identity = 0.96) artificially produced by titanium technology, were deleted. Reads were blasted against the NCBI non-redundant protein database using a translated nucleotide query (BLASTX). The best BLAST-hits with ≥50% identity, ≥50 score and E-values <1e−05 ,  were retained. Reads were assembled reads into contigs using a GS De Novo Assembler (Roche) with the following parameters: minimum overlap length of 35 bp, minimum identity of 98%. ORF searching was performed using Prodigal , and ORFs were blasted against the NCBI non-redundant database (BLASTP, E<1e−05). When possible, phylogenetic trees of the ORFs of contigs, encoding proteins that were associated to potential bacterial pathogens, were built. Regions that were homologous to the translated ORFs were searched using BLASTP against the non-redundant NCBI database. For multiple alignments of the sequences MUSCLE  was used and then curated by Gblocks . The phylogenetic tree was built using the PhyML algorithm  with a bootstrap of 100 and the protein substitution model WAG. These tasks were all performed using the pipeline www.phylogenie.fr . For the visualization of the phylogenetic trees DrawTree  was used. Principal coordinates analysis was performed using the correlated tool on the MG-RAST  metagenomic analysis server, to evaluate similarity between coprolite, soil and modern feces samples at the metabolic taxonomic level. Metabolic classifications were generated by a BLAST search against the SEED database using an E-value <1e -05, a minimum identity cutoff of 80% and a minimum alignment length cutoff of 20 bp. The data were normalized to values between 0 and 1, distances were calculated using the Bray-Curtis method. The results of the analysis are represented in the Figure S7 in File S1.
Supporting information. Note S1. Specific PCR-amplifications and Sanger-sequencing, quantitative real time PCR. Table S1. Typical gastro-intestinal, environmental and pathogenic bacteria assigned to the coprolite metagenome#. #Metagenomic reads were blasted against the NCBI protein database using a translated nucleotide query. Underrepresented taxonomic genera are not shown. Table S2. Number and size of contigs that were assigned to (A) Bacteroides spp. and (B) to bacterial pathogens associated to the coprolite sample#. #The contig identifier, its length (bp) and the annotation according to the best BLAST hit (BLASTX versus the non-redundant NCBI database, E-value<1e−05) are summarized. The E-value, the hit accession identifier and the percent of identity are also provided. Contigs of °human and *pig gut microbiota Bacteroides species. Table S3. Cultured microorganisms#. # Reported are the culture conditions, the cultured microorganisms and the mode leading to species identification. Reported is also if the cultured bacteria were found in the high-throughput pyrosequencing dataset. ° When the identification was based on BLAST annotation of the amplified 16 S rRNA gene region additional phylogenetic trees were constructed (Figures S2). Table S4. Bacterial pathogens identified from the amplified 16 S rRNA V6 region#. # The species level was defined with a minimum sequence identity of 98.7% using BLAST similarity searches against RDP databases. Table S5. Primers used to amplify DNA from intestinal parasites, bacterial pathogens and amoebae. Table S6. The quantitative real-time PCR systems that were tested. Figure S1. Working overview of the polyphasic approach used to analyze the Namur coprolite. Figure S2. Phylogenetic of 16 S rRNA gene sequences generated form cultivated bacteria species. The tree was constructed using the PhyML algorithm with a bootstrap of 100. The bootstrap support is reported for each branch. Phylogenetic tree of 16 S rDNA amplicons closely related to (A) Bacillus horti, (B) Paenibacillus spp., (C) Rhodanobacter spp. and (D) Clostridium magnum. Figure S3. Phylogenetic tree of a hydrolase. A phylogenetic tree was generated from the translated open reading frame of a contig encoding a hydrolase close to Bordetella species. The tree was constructed using the PhyML algorithm with a bootstrap of 100. The bootstrap support is reported for each branch. Figure S4. Bayesian source-tracking results. (A) The mixture of taxa associated to the 16 S rDNA gene amplicon dataset of the coprolite specimen was compared to known dataset of various environments. To control the workflow used to perform the analyses, two known samples (B) one coprolite previously investigated  and (C) a soil sample  were positively tested. Figure S5. Phylogenetic tree of 16 S rDNA amplicons matching to Bartonella sp. The tree was constructed using the PhyML algorithm with a bootstrap of 100. The bootstraps are reported for each branch. Phylogenetic tree of 16 S rDNA amplicons closely related to (A) B. henselae, B. koehlerae and (B) B. quintana. Figure S6. Alignment and of the amplicon matching to Bordetella and Achromobacter. The sequence alignment was performed using CLUSTALW multiple alignment tool . Figure S7. Metabolic comparison of modern metagenomes to the coprolite metagenome. The Principal coordinates analysis was based on read classification according to BLASTX searches against the SEED Database. For each metagenome included the MG-RAST accession number is given. Compared metagenomes are from soil (yellow cluster), healthy mammalian and human feces (blue cluster); and the coprolite (red). The coprolite metagenome does not group with either the modern gut or soil microbiota.
The authors acknowledge Prof. Francoise Bouchet, from the Reims Champagne Ardenne University for providing the coprolite specimen.
Conceived and designed the experiments: MD. Performed the experiments: SA. Analyzed the data: FA CR. Contributed reagents/materials/analysis tools: MLB. Wrote the paper: SA MD.
- 1. Drancourt M, Raoult D (2005) Palaeomicrobiology: current issues and perspectives. Nat Rev Microbiol 3: 23–35.
- 2. Ubaldi M, Luciani S, Marota I, Fornaciari G, Cano RJ, et al. (1998) Sequence analysis of bacterial DNA in the colon of an Andean mummy. Am J Phys Anthropol 107: 285–295.
- 3. Dittmar K (2009) Old parasites for a new world: the future of paleoparasitological research. a review. J Parasitol 95: 365–371.
- 4. Tito RY, Knights D, Metcalf J, Obregon-Tito AJ, Cleeland L, et al. (2012) Insights from characterizing extinct human gut microbiomes. PLoS One 7: e51146.
- 5. Tito RY, Macmil S, Wiley G, Najar F, Cleeland L, et al. (2008) Phylotyping and functional analysis of two ancient human microbiomes. PLoS One 3: e3703.
- 6. Cleeland LM, Reichard MV, Tito RY, Reinhard KJ, Lewis CM Jr (2013) Clarifying Prehistoric Parasitism from a Complementary Morphological and Molecular Approach. J Archaeol Sci 40: 3060–3066.
- 7. Luciani S, Fornaciari G, Rickards O, Labarga CM, Rollo F (2006) Molecular characterization of a pre-Columbian mummy and in situ coprolite. Am J Phys Anthropol 129: 620–629.
- 8. Rollo F, Ermini L, Luciani S, Marota I, Olivieri C (2006) Studies on the preservation of the intestinal microbiota's DNA in human mummies from cold environments. Med Secoli 18: 725–740.
- 9. Cano RJ, Tiefenbrunner F, Ubaldi M, Del Cueto C, Luciani S, et al. (2000) Sequence analysis of bacterial DNA in the colon and stomach of the Tyrolean Iceman. Am J Phys Anthropol 112: 297–309.
- 10. Santiago-Rodriguez TM, Narganes-Storde YM, Chanlatte L, Crespo-Torres E, Toranzos GA, et al. (2013) Microbial communities in pre-columbian coprolites. PLoS One 8: e65191.
- 11. Raoult D, Aboudharam G, Crubezy E, Larrouy G, Ludes B, et al. (2000) Molecular identification by “suicide PCR” of Yersinia pestis as the agent of medieval black death. Proc Natl Acad Sci U S A 97: 12800–12803.
- 12. Sheorey H, Biggs BA, Traynor P (2011) Nematodes. In: Versalovic J, Caroll KC, Funke G, Jorgensen JH, Landry ML, Warnock DW, Editors. Manual of Clinical Microbiology, 10th Edition: American Society of Microbiology, Washington DC. pp. 2200–2212.
- 13. Rocha GCd, Serra-Freire NM (2009) Paleoparasitology at “Place d'Armes”. Namur, Belgium: a biostatistics analysis of trichurid eggs between the Old and New World. Rev Bras Parasitol Vet 18: 70–74.
- 14. Visvesvara GS, Moura H, Schuster FL (2007) Pathogenic and opportunistic free-living amoebae: Acanthamoeba spp., Balamuthia mandrillaris, Naegleria fowleri, and Sappinia diploidea. FEMS Immunol Med Microbiol 50: 1–26.
- 15. Seng P, Abat C, Rolain JM, Colson P, Lagier JC, et al. (2013) Identification of rare pathogenic bacteria in a clinical microbiology laboratory: impact of matrix-assisted laser desorption ionization-time of flight mass spectrometry. J Clin Microbiol 51: 2182–2194.
- 16. Drancourt M, Bollet C, Carlioz A, Martelin R, Gayral JP, et al. (2000) 16 S ribosomal DNA sequence analysis of a large collection of environmental and clinical unidentifiable bacterial isolates. J Clin Microbiol 38: 3623–3630.
- 17. Kunin V, Copeland A, Lapidus A, Mavromatis K, Hugenholtz P (2008) A bioinformatician's guide to metagenomics. Microbiol Mol Biol Rev 72: 557–578.
- 18. Turnbaugh PJ, Hamady M, Yatsunenko T, Cantarel BL, Duncan A, et al. (2009) A core gut microbiome in obese and lean twins. Nature 457: 480–484.
- 19. Greub G, Raoult D (2004) Microorganisms resistant to free-living amoebae. Clin Microbiol Rev 17: 413–433.
- 20. Loreille O, Roumat E, Verneau O, Bouchet F, Hanni C (2001) Ancient DNA from Ascaris: extraction amplification and sequences from eggs collected in coprolites. Int J Parasitol 31: 1101–1106.
- 21. Oh CS, Seo M, Lim NJ, Lee SJ, Lee EJ, et al. (2010) Paleoparasitological report on Ascaris aDNA from an ancient East Asian sample. Mem Inst Oswaldo Cruz 105: 225–228.
- 22. Schroeder JM, Booton GC, Hay J, Niszl IA, Seal DV, et al. (2001) Use of subgenic 18 S ribosomal DNA PCR and sequencing for genus and genotype identification of Acanthamoebae from humans with keratitis and from sewage sludge. J Clin Microbiol 39: 1903–1911.
- 23. Cooper A, Poinar HN (2000) Ancient DNA: do it right or not at all. Science 289: 1139.
- 24. Hofreiter M, Serre D, Poinar HN, Kuch M, Paabo S (2001) Ancient DNA. Nat Rev Genet 2: 353–359.
- 25. Willerslev E, Cooper A (2005) Ancient DNA. Proc Biol Sci 272: 3–16.
- 26. Pääbo S, Poinar H, Serre D, Jaenicke-Despres V, Hebler J, et al. (2004) Genetic analyses from ancient DNA. Annu Rev Genet 38: 645–679.
- 27. Chaves SAdM, Reinhard KJ (2006) Critical analysis of coprolite evidence of medicinal plant use, Piauí, Brazil. Palaeogeography, Palaeoclimatology, Palaeoecology 237: 110–118.
- 28. Backhed F, Ley RE, Sonnenburg JL, Peterson DA, Gordon JI (2005) Host-bacterial mutualism in the human intestine. Science 307: 1915–1920.
- 29. Keita AK, Socolovschi C, Ahuka-Mundeke S, Ratmanov P, Butel C, et al. (2013) Molecular evidence for the presence of Rickettsia felis in the feces of wild-living African apes. PLoS One 8: e54679.
- 30. Jirku M, Pomajbikova K, Petrzelkova KJ, Huzova Z, Modry D, et al. (2012) Detection of Plasmodium spp. in human feces. Emerg Infect Dis 18: 634–636.
- 31. El Khechine A, Henry M, Raoult D, Drancourt M (2009) Detection of Mycobacterium tuberculosis complex organisms in the stools of patients with pulmonary tuberculosis. Microbiology 155: 2384–2389.
- 32. Ricardog M, Kempf V, Chomel BB, Breitschyerdt E (2011) Bartonella. In: Versalovic J, Caroll KC, Funke G, Jorgensen JH, Landry ML, Warnock DW, Editors. Manual of Clinical Microbiology, 10th Edition: American Society of Microbiology, Washington DC. pp. 786–799.
- 33. Wirsing von König C, Riffelmann M, Coenye T (2011) Bordetella and Related Genera. In: Versalovic J, Caroll KC, Funke G, Jorgensen JH, Landry ML, Warnock DW, Editors. Manual of Clinical Microbiology, 10th Edition: American Society of Microbiology, Washington DC. pp 739–751.
- 34. Iniguez AM, Reinhard K, Carvalho Goncalves ML, Ferreira LF, Araujo A, et al. (2006) SL1 RNA gene recovery from Enterobius vermicularis ancient DNA in pre-Columbian human coprolites. Int J Parasitol 36: 1419–1425.
- 35. Lagier JC, Armougom F, Million M, Hugon P, Pagnier I, et al. (2012) Microbial culturomics: paradigm shift in the human gut microbiome study. Clin Microbiol Infect 18: 1185–1193.
- 36. Schloss PD, Gevers D, Westcott SL (2011) Reducing the effects of PCR amplification and sequencing artifacts on 16 S rRNA-based studies. PLoS One 6: e27310.
- 37. Huse SM, Welch DM, Morrison HG, Sogin ML (2010) Ironing out the wrinkles in the rare biosphere through improved OTU clustering. Environ Microbiol 12: 1889–1898.
- 38. Giongo A, Davis-Richardson AG, Crabb DB, Triplett EW (2010) TaxCollector: Modifying Current 16 S rRNA Databases for the Rapid Classification at Six Taxonomic Levels. Diversity 2: 1015–1025.
- 39. Schlaberg R, Simmon KE, Fisher MA (2012) A systematic approach for discovering novel, clinically relevant bacteria. Emerg Infect Dis 18: 422–430.
- 40. Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: 1792–1797.
- 41. Talavera G, Castresana J (2007) Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol 56: 564–577.
- 42. Guindon S, Delsuc F, Dufayard JF, Gascuel O (2009) Estimating maximum likelihood phylogenies with PhyML. Methods Mol Biol 537: 113–137.
- 43. Hasegawa M, Kishino H, Yano T (1985) Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J Mol Evol 22: 160–174.
- 44. Dereeper A, Guignon V, Blanc G, Audic S, Buffet S, et al. (2008) Phylogeny.fr: robust phylogenetic analysis for the non-specialist. Nucleic Acids Res 36: W465–469.
- 45. Chevenet F, Brun C, Banuls AL, Jacq B, Christen R (2006) TreeDyn: towards dynamic graphics and annotations for analyses of trees. BMC Bioinformatics 7: 439.
- 46. Knights D, Kuczynski J, Charlson ES, Zaneveld J, Mozer MC, et al. (2011) Bayesian community-wide culture-independent microbial source tracking. Nat Methods 8: 761–763.
- 47. Flores GE, Bates ST, Knights D, Lauber CL, Stombaugh J, et al. (2011) Microbial biogeography of public restroom surfaces. PLoS One 6: e28132.
- 48. Lauber CL, Hamady M, Knight R, Fierer N (2009) Pyrosequencing-based assessment of soil pH as a predictor of soil bacterial community structure at the continental scale. Appl Environ Microbiol 75: 5111–5120.
- 49. Costello EK, Lauber CL, Hamady M, Fierer N, Gordon JI, et al. (2009) Bacterial community variation in human body habitats across space and time. Science 326: 1694–1697.
- 50. De Filippo C, Cavalieri D, Di Paola M, Ramazzotti M, Poullet JB, et al. (2010) Impact of diet in shaping gut microbiota revealed by a comparative study in children from Europe and rural Africa. Proc Natl Acad Sci U S A 107: 14691–14696.
- 51. Muegge BD, Kuczynski J, Knights D, Clemente JC, Gonzalez A, et al. (2011) Diet drives convergence in gut microbiome functions across mammalian phylogeny and within humans. Science 332: 970–974.
- 52. Watanabe K, Nagao N, Toda T, Kurosawa N (2008) Changes in bacterial communities accompanied by aggregation in a fed-batch composting reactor. Curr Microbiol 56: 458–467.
- 53. Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, et al. (2010) QIIME allows analysis of high-throughput community sequencing data. Nat Methods 7: 335–336.
- 54. Sun S, Chen J, Li W, Altintas I, Lin A, et al. (2011) Community cyberinfrastructure for Advanced Microbial Ecology Research and Analysis: the CAMERA resource. Nucleic Acids Res 39: D546–551.
- 55. Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, et al. (2010) Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11: 119.
- 56. Glass EM, Wilkening J, Wilke A, Antonopoulos D, Meyer F (2010) Using the metagenomics RAST server (MG-RAST) for analyzing shotgun metagenomes. Cold Spring Harb Protoc 2010: pdb prot5368.
- 57. Greenberg DE, Ding L, Zelazny AM, Stock F, Wong A, et al. (2006) A novel bacterium associated with lymphadenitis in a patient with chronic granulomatous disease. PLoS Pathog 2: e28.
- 58. Greub G (2009) Parachlamydia acanthamoebae, an emerging agent of pneumonia. Clin Microbiol Infect 15: 18–28.