Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Phylogenomic analysis and Mycobacterium tuberculosis antibiotic resistance prediction by whole-genome sequencing from clinical isolates of Caldas, Colombia

  • Lusayda Sánchez-Corrales,

    Roles Conceptualization, Formal analysis, Investigation, Project administration, Writing – original draft

    Affiliation Maestría en Investigación en Enfermedades Infecciosas, Universidad de Santander, Bucaramanga, Santander, Colombia

  • Olga Lucía Tovar-Aguirre,

    Roles Conceptualization, Formal analysis, Project administration, Writing – original draft

    Affiliation Programa de Bacteriología, Universidad Católica de Manizales, Manizales, Caldas, Colombia

  • Narmer Fernando Galeano-Vanegas,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Software, Writing – review & editing

    Affiliations Instituto de Investigación en Microbiología y Biotecnología Agroindustrial, Universidad Católica de Manizales, Manizales, Caldas, Colombia, Departamento de Biotecnología, BIOS Centro de Bioinformática y Biología Computacional, Manizales, Caldas, Colombia

  • Paula Alejandra Castaño Jiménez,

    Roles Formal analysis, Investigation, Methodology

    Affiliation Programa de Bacteriología, Universidad Católica de Manizales, Manizales, Caldas, Colombia

  • Ruth Arali Martínez-Vega,

    Roles Funding acquisition, Resources, Validation

    Affiliation Facultad de Salud, Universidad de Santander, Bucaramanga, Santander, Colombia

  • Carlos Ernesto Maldonado-Londoño,

    Roles Data curation, Formal analysis, Software

    Affiliation Centro Nacional de Investigaciones de Café –Cenicafé, Manizales, Caldas, Colombia

  • Johan Sebastián Hernández-Botero,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Software, Validation, Writing – review & editing

    Affiliations Grupo de Investigación Médica, Escuela de Medicina, Universidad de Manizales, Manizales, Caldas, Colombia, Grupo de Resistencia Antibiótica de Manizales, Manizales, Caldas, Colombia

  • Fernando Siller-López

    Roles Conceptualization, Investigation, Project administration, Supervision, Writing – review & editing

    fsiller@ucm.edu.co

    Affiliations Programa de Bacteriología, Universidad Católica de Manizales, Manizales, Caldas, Colombia, Programa de Microbiología, Universidad Libre, Pereira, Risaralda, Colombia

Abstract

Mycobacterium tuberculosis (M. tuberculosis) was the pathogen responsible for the highest number of deaths from infectious diseases in the world, before the arrival of the COVID-19 pandemic. Whole genome sequencing (WGS) has contributed to the understanding of genetic diversity, the mechanisms involved in drug resistance and the transmission dynamics of this pathogen. The object of this study is to use WGS for the epidemiological and molecular characterization of M. tuberculosis clinical strains from Chinchiná, Caldas, a small town in Colombia with a high incidence of TB. Sputum samples were obtained during the first semester of 2020 from six patients and cultured in solid Löwenstein-Jensen medium. DNA extraction was obtained from positive culture samples and WGS was performed with the Illumina HiSeq 2500 platform for subsequent bioinformatic analysis. M. tuberculosis isolates were typified as Euro-American lineage 4 with a predominance of the Harlem and LAM sublineages. All samples were proven sensitive to antituberculosis drugs by genomic analysis, although no phenotype antimicrobial tests were performed on the samples, unreported mutations were identified that could require further analysis. The present study provides preliminary data for the construction of a genomic database line and the follow-up of lineages in this region.

Introduction

M. tuberculosis (MTB) is considered one of the most lethal pathogens, with a toll of 1 billion people suffering tuberculosis (TB)-related deaths in the last two centuries. The World Health Organization (WHO) in their annual report of 2019 depicted an estimate of 10 million new cases and 1.7 million deaths per year [1], making MTB the deadliest single infectious agent before the arrival of SARS-CoV-2. As in many developing countries, Colombia has presented a steady increase in tuberculosis incidence in the last six years from 23 to 27.6 cases per 100,000 habitants in 2019. In Caldas, a department located in the center of the country, TB is considered endemic causing 34 new cases and 2.31 deaths per 100,000 habitants at its peak in 2017, making this malady a primary public health concern in the region due to the progressive aging of the population, emerging immunosuppressive treatments, sociodemographic and comorbidities conditions. Chinchiná (47,929 habitants in 2012), a small town in the center of Caldas, is key to the region’s coffee harvesting processes which lead to the arrival of hundreds of coffee pickers from all over the country at harvesting season. Endemic social issues like poverty and homelessness make a perfect combination for TB to thrive in Chinchiná resulting in one of the highest TB incidence rates in the country (71.5 cases per 100,000 habitants) [2]. WGS offers an opportunity for the molecular epidemiology surveillance for TB in the region. The development of local capacities bioinformatic analysis and the reduction on overall genome sequencing costs make whole genome sequencing (WGS) an increasingly accessible alternative for molecular epidemiologic studies [3]. With the genetic information obtained by WGS compared to other analysis, drug resistance profiles can be predicted for most of the current antibiotic treatment, enabling a wide monitoring of drug resistance.

The aim of this study is to explore the potential use of WGS in the context of routine surveillance, describing the epidemiological and molecular characteristics of circulating M. tuberculosis strains from Chinchiná, Caldas using minimal sequencing and computational resources.

Materials and methods

Population and sample

Patients suspected of pulmonary tuberculosis due to respiratory symptoms screened by routine surveillance in the first semester of 2020 were considered for the study. Demographic information of the patients was obtained from clinical records and epidemiological data from the National surveillance system (SIVIGILA) of Colombia. Seventeen cases of TB were notified from the health authorities of Chinchiná, Caldas to the SIVIGILA system. Six subjects were not considered to participate in the study since no sample was collected due to loss of follow up by the emerging coronavirus pandemic quarantine. Sputum samples were collected from 11 patients at the Hospital San Marcos in Chinchiná, Caldas, and cultured in Lowenstein-Jensen media for the microbiological and genomic analysis of M. tuberculosis. At the beginning of the COVID-19 pandemic, between March to June 2020, there were no growth-dependent or molecular susceptibility tests as part of the standard of care, therefore, no phenotypic or genotypic antimicrobial resistance data was available for the analysis. Positive cultures with colonies suspected of MTB were send to the Laboratory of Microbiology of the Universidad Católica de Manizales, Caldas for DNA extraction.

Seven samples passed the DNA quality control for the sequencing platform, whereas one sample was withdrawn because it was not taxonomically assigned to MTB genome after the bioinformatic analyzes; finally, six samples were considered for the study (Fig 1).

DNA extraction

Genomic DNA was extracted from colonies using the Bacteria DNA preparation kit following the manufacturer’s protocol (Jena Bioscience, Germany). The quality and quantity of isolated DNA was measured using a Nanodrop 2000 spectrophotomer (ThermoScientific, Waltham, MA) and a Quantus fluorometer (Promega, Madison, WI). DNA integrity was assessed by visualization in 1% agarose gel electrophoresis. DNA samples that fulfill the quality standards in terms of integrity, purity and quantity were sent to whole genome sequencing at NovoGene (Beijing, China).

Whole genome sequencing

Sequencing was performed on PCR-free libraries on the Illumina Hiseq 2500 platform (Illumina Inc., San Diego, CA), producing 150 bp paired end reads. Quality control data of the DNA samples pre- and post-sequencing are presented on S1 and S2 Tables of the S1 File.

Bioinformatic pipeline

The Illumina reads were analyzed in a standard PC (Intel Core i7-7700, 16 Gb of RAM, 4 TB HDD) with Linux subsystem for Windows (Ubuntu 20.04.1 LTS). An in-house bioinformatics workflow was installed using Miniconda (Conda 4.10.3 Python 3.9.5) with the core programs of the unified analysis variant pipeline of the Related Sequencing TB data platform (ReSeqTB/UVP) to identify and assign lineage to the MTB isolates sequence data [4]. (S3 Table in S1 File). Briefly, reads were classified by Kraken version 2 to detect possible contamination or the presence of other mycobacteria; [5] additionally, reads were checked, validated, and trimmed with FastQC version 0.11.9 [6], FastQVAlidator (http://genome.sph.umich.edu/wiki/FastQValidator), and Trimmomatic [7], respectively. Processed reads were mapped using BWA-mem [8] using H37Rv (NCBI ID: NC_000962.3) as reference genome. Alignment quality control was assesed using QuallyMap [9] and Alfred was used to evaluate the depth of coverage in all loci of interest associated with resistance lineage assignment [10]. Genomes with > 75X of depth and > 95% reference mapped with all ReSeqTB/UVP loci of interest were considered for downstream analysis. Variant calling using GATK [11] with hard-filter settings was used to eliminate artefactual false-positive variants. [12] Finally, NGSEP version 4.0 [13] and SnpEff 5.0 [14] was used to perform functional annotation of single nucleotide variants (SNVs) and insertions or deletions (INDELs). By parsing the annotated VCF it was possible to identify variants with low, medium, and high confidence (LR+>10; p<0,05) of association with antituberculous resistance and, also, identify variants not yet described in the ReSeqTB database (V. 2019-I) [15]. The pipeline for WGS TB resistance analysis is available on demand on protocols.io (https://www.protocols.io/private/A0CEADBAFBBB11EB878B0A58A9FEAC02). Links to references of the software used for the bioinformatic pipeline are summarized on S4 Table of the S1 File.

De novo assembly and annotation.

The final high-quality reads were assembled using Megahit version 1.2.9 [16]. (S5 Table in S1 File). The assembly was carried out with a kmer range from 29 to 141 at intervals of 20 and 800 bp as minimum contig size. Kaiju version 1.7.3 [17,18] was used to perform contig taxonomic assignment. Quast version 4.3 [19] was used to generate metrics and evaluate the quality of the assemblies using M. tuberculosis strain HR37v (NC_000962) as reference genome. The completeness of the assembly process was verified with Busco version 4.1.4 [20]. The structural and functional annotation was performed with Prokka version 1.12–14 [21].

Lineage assignment and phylogeny.

M. tuberculosis complex (MTBC) lineages/sublineages and in silico spoligotyping were determined using TB-Profiler version 2.8.13 [22]. The variants on genes rpoB, inhA, embB, embC, gyrA, gyrB, katG, panD, pncA, recA, 16S, pssC, ahpC, cycA, clpC, aftA, atpE, mas, eccC5, kasA, gpsI, mas, and rpsA, were used as a second approach to classify the isolates on specific lineages and sub-lineages as was proposed by Coll et al. [23] Whole genome dinucleotide variants that passed the quality filters, including the phylogenetically informative SNPs for MTBC typing, and excluding regions of erroneous mapping [24,25], were used to perform clustering by phylogenetic analysis using the software Mega X 10.1. Initially, a maximum likelihood (ML) analysis was run to find the best nucleotide substitution model for the dataset, the phylogenetic analysis was done by the ML method and general time reversible model with a bootstrapping test of 1000 replications, a non-rooted phylogenetic tree to show sample’s clustering was obtained [26].

Analysis of variants associated with first- and second-line drug resistance.

After variant calling and annotation, variants on each resistance gene (according to ReSeqTB/UVP) with a high confidence of association with antimicrobial resistance (LR+ > 10; p <0,05) were parsed from the variant call format (VCF) files (for a complete list of genes tested per drug, see Supplementary file, S6 Table in S1 File).

Genes related to resistance to first-line drug therapy [rifampicin (R), isoniazid (H), pyrazinamide (Z), and ethambutol (E)] and second-line drugs [fluoroquinolone (FQ), linezolid (Lnz), bedaquilina (BDQ), delamanid (DLM)] were considered for the analysis. Variants of uncertain effect in genes related to resistance to therapy, and not yet listed in the ReSeqTB database (V. 2019-I) were also evaluated. Finally, all variants (SNV, and INDELS) in drug targets were tested with PROVEAN algorithm to predict deleterious effects in proteins [27].

Ethics statement.

The local San Marcos hospital sent all positive MTB cultures to DNA extraction after finished all diagnostic procedures during patient care. Clinical data of patients from surveillance records were collected and anonymized after patient care. All samples were deemed of retrospective nature by the clinical and academic ethics committee (San Marcos Hospital–2019–2907, and Universidad Católica de Manizales–DIP-1260-001-4184).

Results

Sociodemographic and clinical characteristics

All patients were of low socioeconomic status with a median age of 32 years. HIV comorbidity was present in two TB patients. Bacillary load was greater than two crosses in the first smear microscopy in four patients, however, no relation of this issue was observed with the clinical outcome (Table 1). A complete sociodemographic, epidemiological, and clinical data is listed on S7 Table in S1 File.

thumbnail
Table 1. Sociodemographic and clinical characteristics of patients with tuberculosis.

https://doi.org/10.1371/journal.pone.0258402.t001

Lineage assignment and phylogeny

The analysis showed that all isolates belong to the Euro-American lineage (Lineage 4), divided in LAM and Harlem sublineages (Fig 2).

thumbnail
Fig 2. Clustering by phylogenetic analysis and lineage assignment of six M. tuberculosis isolates form Chinchiná (Caldas, Colombia).

https://doi.org/10.1371/journal.pone.0258402.g002

In total, 1753 dinucleotide variants of high quality were found in the genome. After filtering regions of erroneous mapping, a final data set of 1386 SNVs was obtained. The un-rooted tree shows two main clades, with a branch support of 100% according to the bootstrap test.

The phylogenetic tree shows a reliable clustering. An appreciable genetic distance is not observed since there are few variations in the selected phylogeny genes.

Clade one contained samples TB02 (4.3.4.1), TB03 (4.3.4.2) and TB09 (4.3.3) and assigned to the spoligotype LAM and the closest relationship to the reference H37Rv genome. The second clade groups samples TB04, TB05, and TB06, assigned to the spoligotype Haarlem sublineage 4.1.2.1 (Fig 2). Of note, according to phylogeny analysis, the two closest isolates were TB05 and TB06 both from patients with previous antituberculous treatment.

Phylogeny inferred by maximum likelihood method using the general time reversible model, bootstrap test of 1000 replications. Values on nodes corresponds to the bootstrap value in percentage.

WGS prediction of drug resistance

According to the genomic analysis all isolates are sensitive to all the first-line drugs. Several canonical genes related to first-line treatment showed different substitutions previously reported in the ReSeqTB database with 57.8% synonymous mutations and 42.1% non-synonymous. All substitutions showed no association with clinically significant drug resistance (all positive likelihood ratio (LR)<10). A mutation in ponA1 (1893_1894insTCG) showed LR+ >10, nevertheless, was non-significant (p = 0,3818). Of 54 mutations across all samples reported in ReSeqTB for first-line drugs, only three mutations (5,5%) were classified as deleterious by the PROVEAN score (S8 Table in S1 File).

Regarding second-line treatments, several mutations previously described in ReSeqTB for FQN were found. Mutations in two genes, gyrA (with both Glu21Gln and Gly668Asp SNV in all but one isolate) and eccC5 (with one non-synonymous substitution in one sample). The rest of the mutational spectrum for quinolones are available in S3 Table in S1 File. Of note, one gyrA mutation (739G>A, Gly247Ser) with the prediction of deleterious effect (PROVEAN score of -5,942) was present in one sample. However, none of the variants related to FQN genes showed impact on drug resistance (LR+ <10, p value>0,05). Of those drugs recently added to MDR treatment, loci related to LNZ resistance (genes rrl, rplC) showed no SNV or INDEL. Another cornerstone of MDR-TB treatment, the repurposed drug BDQ, showed one synonymous substitution atpeE (24C>T) in one sample. Genes related to new drugs DMN and protamanid showed a single substitution in fbiC (Ala132Val) in one isolate, and substitution Lys270Met in fgd1 was found in four samples (S8 Table in S1 File). Finally, ten variants were predicted as deleterious in proteins associated with antimicrobial resistance genes. Of note, a Ala132Val mutation in fbiC gene related to DMN and a Thr202Ala mutation in thyA gene related to PAS deem deleterious PROVEAN scores with a predicted effect on the target protein which were no described in the ReSeqTB database (Table 2).

thumbnail
Table 2. Mutations no reported in ReseqTB in loci related to antituberculosis therapy detected by WGS analysis from isolates of Chinchiná.

https://doi.org/10.1371/journal.pone.0258402.t002

In genomes from two HIV patient isolates (TB02, TB04), the number of missense variants (541 and 533) was within range (485–546) comparing to the rest of the dataset. Both samples had 15 SNVs in genes related to first-line treatment and 10 to 17 SNVs for the second-line treatment loci, respectively (S8 Table in S1 File).

Discussion

The present study is the first to provide information on the phylogenetic characteristics and the identification of the predominant strains in Chinchiná, a municipality with a high incidence of TB; this study alsodescribes for the first time the use of WGS in the surveillance of TB drug susceptibility in Colombia. By developing a strategy of outsourcing library creation and sequencing, plus using standard computer capabilities, our team showed that it is possible to add WGS results to the surveillance data without significant changes in the laboratory setup or considerable investments in computing power. Furthermore, to our knowledge, it provides the first high-quality WGS dataset of M. tuberculosis from Colombia in compliance with new standards in terms of depth, coverage, tools, and quality controls during the bioinformatic pipeline (Q scores > 30, depth > 125x, MAP-Q > 58, all UVP-Loci of Interest Covered) [4,25,28]. The dataset of our study will allow the inclusion of MTB WGS from Colombia for comparative analyses using current pipelines deployed for surveillance [4,20,25,28].

Although there was a limitation on the surveillance activities due to the COVID-19 pandemic and no availability of phenotypic or molecular tests to assess drug susceptibility, the incidence of TB cases in Chinchiná for the first semester of 2020 (N = 17) was similar to reports from previous years: 2019 (N = 20), 2018 (N = 22), and 2017 (N = 19). The sample was also representative on demographic and clinical characteristics of patients, predominantly male, uneducated, with low income, and no formal jobs following local and regional patterns of mobility and social risk factors [2,29]. Furthermore, all isolates were susceptible to antimicrobial drugs according to the WGS analysis, which agrees with local epidemiology (TB-DR prevalence of 1–2%) [2,29].

The Euro-American lineage found in this study presents a variable distribution in Colombia. Similar results have been found in other regions of the world. Reports from Argentina, Brazil, México and Peru have established that the predominant strains have evolved from this lineage [30]. This lineage has been described as more transmittable than others [31]. Half of the cases analyzed corresponded to the LAM sublineage, the most frequent and globally extended of the Euro-American lineages. The other half, to the Haarlem sublineage, probably consequence of human migration and their commercial activities [32]. The European origin and the influence of European settlers in America and Colombia are confirmed. Similar results have been described by other authors in regions of the country such as Cali [33] Bogotá [34] and Medellín [35]. Contrary to these reports, in Buenaventura, a region of Valle del Cauca, the Beijing lineage, clearly associated with resistant cases has been identified [33]. Most recent reports on TB-MDR Colombian isolates using spoligotyping showed unique TB-MDR strains with no support of active circulation of MDR clones and no relation to specific lineage (Beijing vs. LAM) [36]. Moreover, there is an extensive circulation (53%) of orphan lineages between the indigenous population [36]. Both studies emphasize the importance of using WGS as the primary method for phylogenetic analysis due to high resolution and discriminatory power and potentially characterize new lineages and outbreaks [37,38].

All mutations regarding first-line drugs showed no association with resistance when using a clinical relational database for interpreting the significance of variants (10.1038/s41598-018-33731-1), even though 5,5% are predicted to show deleterious effects in the proteins using the PROVEAN score [39].

The Asp103Asp mutation is a silent mutation in the rpoB gene reported by other authors in mono-resistant isolates [40]. The Gly594Glu mutation has been considered compensatory in isolates resistant to rifampicin, although it is not found in the resistance determining region (codon 432 to 438).

Mutations in mshA (Ans111Ser) were detected in three isolates, although it has been reported as a variant related to the Haarlem lineage and responsible for conferring resistance to Ethionamide [41,42] there is no significant association with drug resistance in the ReSeqTB database [4].

Regarding BDQ resistance by atpeE mutations, one isolate showed a non-repoted synonymous mutation (24C>T) outside codon positions 53 and 72, the key genomic region with most substitutions related to an increase in MIC for BDQ [43]. DMN, and more recently pretomanid for TB-XDR treatment, use similar pathways and genes for activation. fgd1 encodes a key enzyme for the recycle of the target F420 of DMN; in our study, we found three isolates with Lys270Met substitutions in patients not previously exposed to the drug, this substitution was phylogenetically restricted to the Harleem lineage which is regarded as neutral [43]. fbiC encodes a protein that participates in F420 biosynthesis and harbors key mutations found in resistant MTB isolates. The mutation Ala132Val we detected in one isolate was not described in the systematic review on DMN resistance by Kadura et al., [43] however, it showed a deleterious effect on the protein according to the PROVEAN score (-3,392); thus, the presence of this mutation in TB isolates not previously exposed to the drug requires future surveillance as not seems to be lineage-restricted [44].

No differences in SNV and other variants were observed when comparing patients living with HIV within our sample. Although with did not find statistical significance due to the small sample, our results are broadly consistent with the literature. Previous reports found that HIV co-infection did not affect the mutation rate and was not associated with faster de novo development of drug resistance [45,46].

As for the patients with previous treatment, our results showed full genetic susceptibility to current antimicrobial drugs and successful clinical outcomes after the new HRZE treatment. Although there was no previous microbial culture in our samples, the WGS results in the context of adequate therapeutic adherence and no cavitation with fibrosis in radiologic studies (associated with poor tissue penetration of drugs) [47] seem to indicate that a re-infection with a drug sensitive strain by re-exposure to a known focus would be the most plausible explanation for the re-treatment cases.

There were some limitations in our study, since due to the COVID pandemic, the researchers did not have access to phenotypic or molecular tests that would have made it possible to compare data with our genomic findings; and, due to international shipping for WGS and the corresponding response times, the delivery of the results added approximately four weeks to analyze them. Currently, we are improving those aspects for future WGS analyzes of MTB surveillance projects.

Conclusions

Recent studies in top medical journals have shown the enormous potential of WGS in the surveillance of tuberculosis but at the cost of high investments in laboratory infrastructure, sequence capacity, and processing power. However, our results indicate that it is possible to develop genomics surveillance applications at the regional level without considerable investments in sequencing platforms, laboratory upgrades, or computational infrastructure. A decentralized strategy of drug susceptibility analysis with WGS of first and second-line anti-TB drugs could accelerate the adoption of these methodologies in developing regions. The future of this approach depends on a collaboration of multiple institutions, both public and private, to overcome the limitations in turnaround time, number of samples, large phylogenetic analysis, computational requirements, and rigorous validation processes required to include WGS data in the surveillance arsenal against TB.

Acknowledgments

We thank the health professionals of the Hospital San Marcos of Chinchiná who provided insight and expertise that greatly assisted the research, and the epidemiological support obtained from the Dirección Territorial de Salud de Caldas.

References

  1. 1. World Health Organization. Global tuberculosis report. Geneva2019.
  2. 2. Dirección Territorial de Salud de Caldas. Análisis de Situación de Salud con el Modelo de los Determinantes Sociales de Salud del departamento de Caldas, 2019. Grupo de Vigilancia en Salud Pública 2019. p. 1–98.
  3. 3. Acquah R, Furin J. Universal regimens or universal access to drug susceptibility testing for tuberculosis? Lancet Infect Dis. 2019;19(3):224–5. Epub 2019/02/13. pmid:30744961.
  4. 4. Ezewudo M, Borens A, Chiner-Oms A, Miotto P, Chindelevitch L, Starks AM, et al. Integrating standardized whole genome sequence analysis with a global Mycobacterium tuberculosis antibiotic resistance knowledgebase. Sci Rep. 2018;8(1):15382. Epub 2018/10/20. pmid:30337678; PubMed Central PMCID: PMC6194142.
  5. 5. Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. bioRxiv: Genome Biology; 2019. p. 1–13. pmid:31779668
  6. 6. Andrews S. FastQC: A Quality Control Tool for High Throughput Sequence Data. 2010.
  7. 7. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. Epub 2014/04/04. pmid:24695404; PubMed Central PMCID: PMC4103590.
  8. 8. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60. Epub 2009/05/20. pmid:19451168; PubMed Central PMCID: PMC2705234.
  9. 9. Okonechnikov K, Conesa A, Garcia-Alcalde F. Qualimap 2: advanced multi-sample quality control for high-throughput sequencing data. Bioinformatics. 2016;32(2):292–4. Epub 2015/10/03. pmid:26428292; PubMed Central PMCID: PMC4708105.
  10. 10. Rausch T, Hsi-Yang Fritz M, Korbel JO, Benes V. Alfred: interactive multi-sample BAM alignment statistics, feature counting and feature annotation for long- and short-read sequencing. Bioinformatics. 2019;35(14):2489–91. Epub 2018/12/07. pmid:30520945; PubMed Central PMCID: PMC6612896.
  11. 11. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–303. Epub 2010/07/21. pmid:20644199; PubMed Central PMCID: PMC2928508.
  12. 12. Chowell G, Hyman JM. Mathematical and statistical modeling for emerging and re-emerging infectious diseases. Mathematical and Statistical Modeling for Emerging and Re-emerging Infectious Diseases 2016. p. 1–356.
  13. 13. Tello D, Gil J, Loaiza CD, Riascos JJ, Cardozo N, Duitama J. NGSEP3: accurate variant calling across species and sequencing protocols. Bioinformatics. 2019;35(22):4716–23. Epub 2019/05/18. pmid:31099384; PubMed Central PMCID: PMC6853766.
  14. 14. Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012;6(2):80–92. Epub 2012/06/26. pmid:22728672; PubMed Central PMCID: PMC3679285.
  15. 15. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9. Epub 2014/03/20. pmid:24642063.
  16. 16. Stucki D, Brites D, Jeljeli L, Coscolla M, Liu Q, Trauner A, et al. Mycobacterium tuberculosis lineage 4 comprises globally distributed and geographically restricted sublineages. Nat Genet. 2016;48(12):1535–43. Epub 2016/11/01. pmid:27798628; PubMed Central PMCID: PMC5238942.
  17. 17. Menzel P, Ng KL, Krogh A. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat Commun. 2016;7:11257. Epub 2016/04/14. pmid:27071849; PubMed Central PMCID: PMC4833860.
  18. 18. Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, et al. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature. 1998;393(6685):537–44. Epub 1998/06/20. pmid:9634230.
  19. 19. Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29(8):1072–5. Epub 2013/02/21. pmid:23422339; PubMed Central PMCID: PMC3624806.
  20. 20. Iketleng T, Lessells R, Dlamini MT, Mogashoa T, Mupfumi L, Moyo S, et al. Mycobacterium tuberculosis Next-Generation Whole Genome Sequencing: Opportunities and Challenges. Tuberc Res Treat. 2018;2018:1298542. Epub 2019/01/12. pmid:30631597; PubMed Central PMCID: PMC6304523.
  21. 21. Getahun H, Matteelli A, Chaisson RE, Raviglione M. Latent Mycobacterium tuberculosis infection. N Engl J Med. 2015;372(22):2127–35. Epub 2015/05/29. pmid:26017823.
  22. 22. Phelan JE, O’Sullivan DM, Machado D, Ramos J, Oppong YEA, Campino S, et al. Integrating informatics tools and portable sequencing technology for rapid detection of resistance to anti-tuberculous drugs. Genome Med. 2019;11(1):41. Epub 2019/06/27. pmid:31234910; PubMed Central PMCID: PMC6591855.
  23. 23. Coll F, McNerney R, Guerra-Assuncao JA, Glynn JR, Perdigao J, Viveiros M, et al. A robust SNP barcode for typing Mycobacterium tuberculosis complex strains. Nat Commun. 2014;5:4812. Epub 2014/09/02. pmid:25176035; PubMed Central PMCID: PMC4166679.
  24. 24. Roetzer A, Diel R, Kohl TA, Ruckert C, Nubel U, Blom J, et al. Whole genome sequencing versus traditional genotyping for investigation of a Mycobacterium tuberculosis outbreak: a longitudinal molecular epidemiological study. PLoS Med. 2013;10(2):e1001387. Epub 2013/02/21. pmid:23424287; PubMed Central PMCID: PMC3570532.
  25. 25. Meehan CJ, Goig GA, Kohl TA, Verboven L, Dippenaar A, Ezewudo M, et al. Whole genome sequencing of Mycobacterium tuberculosis: current standards and open issues. Nat Rev Microbiol. 2019;17(9):533–45. Epub 2019/06/19. pmid:31209399.
  26. 26. Stecher G, Tamura K, Kumar S. Molecular Evolutionary Genetics Analysis (MEGA) for macOS. Mol Biol Evol. 2020;37(4):1237–9. Epub 2020/01/07. pmid:31904846; PubMed Central PMCID: PMC7086165.
  27. 27. Choi Y, Chan AP. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31(16):2745–7. Epub 2015/04/09. pmid:25851949; PubMed Central PMCID: PMC4528627.
  28. 28. Consortium CR, the GP, Allix-Beguec C, Arandjelovic I, Bi L, Beckert P, et al. Prediction of Susceptibility to First-Line Tuberculosis Drugs by DNA Sequencing. N Engl J Med. 2018;379(15):1403–15. Epub 2018/10/04. pmid:30280646; PubMed Central PMCID: PMC6121966.
  29. 29. Duenes M, Cardona D. [Factors related to treatment adherence in patients with tuberculosis in Pereira, Colombia, 2012–2013]. Biomedica. 2016;36(3):423–31. Epub 2016/11/22. pmid:27869390.
  30. 30. Woodman M, Haeusler IL, Grandjean L. Tuberculosis Genetic Epidemiology: A Latin American Perspective. Genes (Basel). 2019;10(1). Epub 2019/01/19. pmid:30654542; PubMed Central PMCID: PMC6356704.
  31. 31. Coscolla M, Gagneux S. Consequences of genomic diversity in Mycobacterium tuberculosis. Semin Immunol. 2014;26(6):431–44. Epub 2014/12/03. pmid:25453224; PubMed Central PMCID: PMC4314449.
  32. 32. Mokrousov I, Vyazovaya A, Iwamoto T, Skiba Y, Pole I, Zhdanova S, et al. Latin-American-Mediterranean lineage of Mycobacterium tuberculosis: Human traces across pathogen’s phylogeography. Mol Phylogenet Evol. 2016;99:133–43. Epub 2016/03/24. pmid:27001605.
  33. 33. Garcia DF, Astudillo M. MIRU-VNTR genotyping of Mycobacterium tuberculosis in a population of patients in Cali, Colombia, 2013–2015. Biomedica. 2019;39(s1):71–85. Epub 2019/09/19. pmid:31529850.
  34. 34. Cerezo-Cortes MI, Rodriguez-Castillo JG, Hernandez-Pando R, Murcia MI. Circulation of M. tuberculosis Beijing genotype in Latin America and the Caribbean. Pathog Glob Health. 2019;113(8):336–51. Epub 2020/01/07. pmid:31903874; PubMed Central PMCID: PMC7006823.
  35. 35. Realpe T, Correa N, Rozo JC, Ferro BE, Gomez V, Zapata E, et al. Population structure among mycobacterium tuberculosis isolates from pulmonary tuberculosis patients in Colombia. PLoS One. 2014;9(4):e93848. Epub 2014/04/22. pmid:24747767; PubMed Central PMCID: PMC3991582.
  36. 36. Rodriguez-Castillo JG, Llerena C, Argoty-Chamorro L, Guerra J, Couvin D, Rastogi N, et al. Population structure of multidrug-resistant Mycobacterium tuberculosis clinical isolates in Colombia. Tuberculosis (Edinb). 2020;125:102011. Epub 2020/11/03. pmid:33137696.
  37. 37. Homolka S, Projahn M, Feuerriegel S, Ubben T, Diel R, Nubel U, et al. High resolution discrimination of clinical Mycobacterium tuberculosis complex strains based on single nucleotide polymorphisms. PLoS One. 2012;7(7):e39855. Epub 2012/07/07. pmid:22768315; PubMed Central PMCID: PMC3388094.
  38. 38. Walker TM, Ip CL, Harrell RH, Evans JT, Kapatai G, Dedicoat MJ, et al. Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study. Lancet Infect Dis. 2013;13(2):137–46. Epub 2012/11/20. pmid:23158499; PubMed Central PMCID: PMC3556524.
  39. 39. Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS One. 2012;7(10):e46688. Epub 2012/10/12. pmid:23056405; PubMed Central PMCID: PMC3466303.
  40. 40. Jagielski T, Bakula Z, Brzostek A, Minias A, Stachowiak R, Kalita J, et al. Characterization of Mutations Conferring Resistance to Rifampin in Mycobacterium tuberculosis Clinical Strains. Antimicrob Agents Chemother. 2018;62(10). Epub 2018/08/01. pmid:30061294; PubMed Central PMCID: PMC6153850.
  41. 41. de Souza VC, Antunes D, Santos LH, Goliatt PVZC, Caffarena ER, Guimarães ACR, et al. Insights into the Mechanism of Ethionamide Resistance in Mycobacterium tuberculosis through an in silico Structural Evaluation of EthA and Mutants Identified in Clinical Isolates. Catalysts. 2020;10(5):543.
  42. 42. Rueda J, Realpe T, Mejia GI, Zapata E, Rozo JC, Ferro BE, et al. Genotypic Analysis of Genes Associated with Independent Resistance and Cross-Resistance to Isoniazid and Ethionamide in Mycobacterium tuberculosis Clinical Isolates. Antimicrob Agents Chemother. 2015;59(12):7805–10. Epub 2015/09/16. pmid:26369965; PubMed Central PMCID: PMC4649173.
  43. 43. Kadura S, King N, Nakhoul M, Zhu H, Theron G, Koser CU, et al. Systematic review of mutations associated with resistance to the new and repurposed Mycobacterium tuberculosis drugs bedaquiline, clofazimine, linezolid, delamanid and pretomanid. J Antimicrob Chemother. 2020;75(8):2031–43. Epub 2020/05/04. pmid:32361756; PubMed Central PMCID: PMC7825472.
  44. 44. Rifat D, Li SY, Ioerger T, Shah K, Lanoix JP, Lee J, et al. Mutations in fbiD (Rv2983) as a Novel Determinant of Resistance to Pretomanid and Delamanid in Mycobacterium tuberculosis. Antimicrob Agents Chemother. 2020;65(1). Epub 2020/10/21. pmid:33077652; PubMed Central PMCID: PMC7927868.
  45. 45. Eldholm V, Rieux A, Monteserin J, Lopez JM, Palmero D, Lopez B, et al. Impact of HIV co-infection on the evolution and transmission of multidrug-resistant tuberculosis. Elife. 2016;5. Epub 2016/08/10. pmid:27502557; PubMed Central PMCID: PMC4978521.
  46. 46. Koch AS, Brites D, Stucki D, Evans JC, Seldon R, Heekes A, et al. The Influence of HIV on the Evolution of Mycobacterium tuberculosis. Mol Biol Evol. 2017;34(7):1654–68. Epub 2017/04/04. pmid:28369607; PubMed Central PMCID: PMC5455964.
  47. 47. Marrone MT, Venkataramanan V, Goodman M, Hill AC, Jereb JA, Mase SR. Surgical interventions for drug-resistant tuberculosis: a systematic review and meta-analysis. Int J Tuberc Lung Dis. 2013;17(1):6–16. Epub 2012/12/13. pmid:23232000.