Table 1.
Mock bacterial composition of in vitro and in silico datasets.
Fig 1.
A schematic flowchart for processing of paired-end sequences with MGmapper.
MGmapper processes fastq reads in four steps. These consist of: (I) Trimming and mapping reads against a phiX bacteriophage to remove potential positive control reads. (II) Mapping to specified reference databases, post-processing of BWA-mem alignments to remove reads with low alignment score or insufficient alignment coverage. (III) Identification of best hits in bestmode: Assignment of a read-pairs to only one specific reference sequence based on the highest sum of alignment scores. In fullmode, assigned a read-pair to a reference sequence even if a higher alignment score is found when mapping to another reference sequence database. This will provide best target match, considering only the sequences present in one particular reference database. (IV) Compilation of abundance statistics, read and nucleotide counts, depth, coverage, and summary reports.
Table 2.
Read count statistics and reference sequence information.
Table 3.
Benchmarking of the in vitro data mapped against a bacteria reference sequence database.
Table 4.
Benchmarking of the in silico data mapped against a bacteria reference sequence database.
Table 5.
Read count benchmark at genus level.
Table 6.
Read count benchmark at species level.