Skip to main content
Advertisement
  • Loading metrics

Estimating the global burden of viable Mycobacterium tuberculosis infection: A mathematical modelling study

?

This is an uncorrected proof.

Abstract

Background

Estimating the proportion of individuals currently infected with Mycobacterium tuberculosis (Mtb) is key for informing global health policies. Although a substantial portion of the global population exhibit tuberculous immunoreactivity, not all have a viable Mtb infection. Moreover, individuals with recent infections are at a higher risk of developing tuberculosis (TB). Here, we present estimates of the global burden of viable Mtb infection, using new insights into the natural history of TB.

Methods and findings

We constructed country-specific trends in annual risk of infection considering estimates of TB burden, immunoreactivity reversion, and age-specific mixing. We applied these trends to a deterministic mathematical model incorporating reinfection and self-clearance to estimate recent (within 2 years) and total viable Mtb infections. Empirical data on self-clearance are limited, so rates were informed by modelling estimates. In 2022, we estimated that 133.7 million people (95% uncertainty interval [UI]: 104.0, 171.1) had a recent Mtb infection, representing 1.7% (95% UI: 1.3, 2.2) of the global population. In total, 288.9 million people (95% UI: 242.2, 342.7)—or 3.7% (95% UI: 3.1, 4.3) globally—were estimated to harbour a viable Mtb infection. Among those recently infected, 12.0% (95% UI: 11.4, 12.7) were children under 15 years of age. Most recent infections were found in the World Health Organization regions of South-East Asia (49.0%; 95% UI: 37.2, 62.4), the Western Pacific (19.7%; 95% UI: 12.6, 30.5), and Africa (17.9%; 95% UI: 12.9, 24.1). India, Indonesia, and China had the highest burden, with 39.1 million (95% UI: 18.0, 73.6), 12.0 million (95% UI: 5.8, 22.9), and 11.2 million (95% UI: 5.0, 25.5) people, respectively, recently infected with Mtb. Sensitivity analyses of varying self-clearance scenarios showed significant changes in global estimates of viable Mtb infection, particularly in total burden, with lower self-clearance rates. Overall uncertainty in the estimates was considerable, reflecting limitations in the underlying data informing key model parameters.

Conclusions

Our findings offer global burden estimates of viable Mtb infection and reveal a sizable population recently infected with Mtb and at high risk of progression to disease. New diagnostic tools that can detect individuals with viable Mtb—particularly those who would benefit from TB preventive therapy—are urgently needed.

Author summary

Why was this study done?

  • Estimates of Mycobacterium tuberculosis (Mtb) infection reflect immunoreactivity, rather than the presence of viable bacteria.
  • As most people who develop tuberculosis (TB) do so within two years of infection, capturing the recency of infection improves the relevance of burden estimates.
  • Estimating recent Mtb infection supports more targeted TB prevention strategies.

What did the researchers do and find?

  • We developed a mathematical model that incorporated reversion-adjusted and age-specific annual risk of infection trends and accounted for self-clearance of Mtb infection.
  • We estimated that around 134 million people—about 1.7% of the global population—carried viable Mtb from a recent infection in 2022.
  • Nearly 87% of all recent infections were estimated to be in the World Health Organization regions of South-East Asia, the Western Pacific, and Africa.
  • Sensitivity analyses reveal that our estimates are sensitive to varying assumptions about self-clearance rates.

What do these findings mean?

  • Our findings provide national, regional, and global estimates of Mtb infection burden, emphasising the distinction between viable infection and immunoreactivity.
  • A diagnostic test that can detect viable Mtb infection is urgently needed to identify those who would benefit most from TB preventive therapy.
  • Future estimates should address the main limitations of this study by using more contemporary, empirically derived annual risk of infection data through repeated immunoreactivity surveys, along with empirical measures of self-clearance.

Introduction

Estimates of Mycobacterium tuberculosis (Mtb) infection burden are fundamental in shaping global health strategies for tuberculosis (TB) [1]. These estimates guide key interventions, such as the provision of TB preventive therapy (TPT) to individuals infected with Mtb (in the absence of disease), which is vital for reducing TB incidence [2]. Therefore, estimates must capture the number of individuals harbouring viable Mtb infection, i.e., an infection capable of causing disease [3]. Furthermore, these estimates should account for the recency of infection, as most individuals who progress to disease do so within two years of infection [4]. Focussing on recent viable Mtb infection sets a medically actionable target—serving as a programmatically relevant upper bound—for optimising the prevention cascade and the population to target with TPT [1,3]. Additionally, such estimates would offer deeper insights into the reservoir fuelling ongoing Mtb transmission in the coming years [1].

Previous studies have estimated that a substantial portion of the global population had ‘latent’ TB infection (LTBI), an asymptomatic state defined by the presence of immunoreactivity to tuberculous antigens [57]. Although these estimates essentially reflect individuals exposed to Mtb and still exhibiting immunoreactivity, they are often used interchangeably to indicate current infection [1,8]. As our understanding of the natural history of TB has evolved [9,10], long-standing assumptions that informed those estimates warrant reconsideration.

Firstly, estimates of LTBI used the annual risk of infection (ARI) as a metric for the force of infection experienced by a population. The ARI is derived from surveys of immunoreactivity prevalence, and assumes immunoreactivity persists over time [11]; however, immunoreactivity can wane and in several cases revert [12], leading to significant underestimation of the actual ARI [13,14]. Secondly, the ARI is often extrapolated from surveys in children [11]; however, Mtb transmission to children is less common than in adolescents and adults [15]. As a result, infection incidence (and the estimated ARI) may be underestimated in older age groups, who have higher contact rates with individuals with infectious TB [14,15]. Both reversion and contact patterns suggest that the true force of infection in adults is likely higher than previously assumed. If so, this would significantly increase the estimated global burden of infection, especially given the long-held assumption of lifelong Mtb infection. However, this assumption has been challenged by estimates suggesting that a large proportion (>90%) of individuals self-clear infection without treatment and are no longer at risk of TB in the absence of reinfection [8,9,16,17]. Therefore, accounting for self-clearance in our estimates would likely yield a lower global burden of infection than previously thought [9].

These progressive insights underscore the need for estimates that account for the dynamic nature of Mtb infection. By incorporating these factors, more accurate assessments of the current global burden of viable Mtb infection can be made, particularly to guide effective TB prevention strategies. Additionally, improved characterisation of viable Mtb infection—as part of the ongoing pathological process within the spectrum of TB disease—is needed to better identify and target individuals in whom typical bacteriological confirmation is not possible. In this study, we estimate the global burden of viable Mtb infection using a mathematical modelling approach that incorporates recent insights into TB natural history.

Methods

Annual risk of infection

To estimate the burden of viable Mtb infection, we constructed national ARI trajectories spanning from 1950 to 2022, based on the methods used by Houben and Dodd [6]. The trajectories were constructed using a Gaussian process (GP) regression, a flexible, non-parametric framework combining different sources of estimates with the assumption of a normal approximation to the likelihood. These were fitted to 171 countries (comprising 99.6% of the world population) using two sources of ARI estimates. Direct ARI estimates were obtained from nationally representative immunoreactivity surveys identified in previous searches (Table A and Text A in S1 File). Most surveys used tuberculin skin test (TST) positivity prevalence and were conducted in children aged 6–9 years old. For estimates reported as a single value without presentation of uncertainty, an additional step was taken to quantify measurement precision (Text B in S1 File). Moreover, indirect ARI estimates were derived from TB prevalence estimates using the revised Stýblo rule and adjusted to account for the influence of age and human immunodeficiency virus (HIV) on smear positivity (Text C in S1 File). TB prevalence estimates were calculated by converting the most recent World Health Organization (WHO) TB incidence estimates (from 2000 to 2022) by applying an average duration of disease (Text D in S1 File). For country-years in which both direct and indirect ARI estimates were available, these are compared in Table B in S1 File. Furthermore, accounting for the impact of immunoreactivity reversion and that the true ARI is roughly three times higher than calculated, all ARI estimates were adjusted based on this underestimation [13]. Additionally, measurement uncertainty was also increased by 50% to account for this adjustment (Text E in S1 File). The GP regression with a linear trend was applied to the reversion-adjusted data on ARI (on a log scale) and the measurement precision per country. For each country, we generated 1,000 simulated ARI trajectories spanning 1950–2022 (Figs A–F in S1 File).

Mixing and age-specific risks of infection

To address the higher risk of infection in adults compared to children, we used age-specific estimates of TB incidence as a proxy for TB prevalence and then applied contact mixing matrices to derive the corresponding hazard ratios for each age group. We defined three distinct age groups to encapsulate varying ARIs: under 15 years, 15–45 years, and 45 years and older. Using the estimated TB incidence disaggregated by age and country [18], we calculated the relative TB incidence per capita, using the under-15-year-old age group as the reference, assuming that this age group contributes minimally to transmission (Fig G in S1 File). Similarly, by employing the synthetic country-specific contact mixing matrices developed by Prem and colleagues [19], we obtained the average number of contacts between the defined age groups (Fig H in S1 File). We then integrated these data to estimate the relative ARI for each age group, again using the under-15-year-old age group as the reference (Fig I in S1 File). Finally, ARI trajectories were subdivided into three age groups and adjusted based on relative ARI; for countries with missing data, we applied the regional relative ARI.

Self-clearance rates

Using a Bayesian approach, self-clearance rates were calibrated by tracking an infected cohort over time, using a simplified version of the model structure described below. This structure comprised sequential infected compartments representing increasing time since infection, allowing for self-clearance while excluding infection or reinfection. Informed by the pathways described in the TB natural history model by Horton and colleagues [10], the proportions of the cohort that self-clear or recover (i.e., no longer harbour viable Mtb infection) were 80.9% (95% uncertainty interval [UI]: 65.1, 90.6%) at year 1, 91.9% (95% UI: 83.3, 96.3%) at year 2, and 97.2% (95% UI: 93.9, 98.7%) at year 10, which were used as calibration targets. Due to the lack of data on the proportion after 10 years, we assumed a proportion of 99.0% (95% UI: 97.8, 99.6) and calibrated the associated post-10-year self-clearance rate under two different scenarios: one where this was reached by year 20 post-infection (high self-clearance) and another for year 50 (low self-clearance) (Fig J in S1 File). The model by Horton and colleagues captures detailed progression and regression across multiple TB states, ranging from Mtb infection through asymptomatic and symptomatic infectious disease. As such, the complement of these calibration targets (i.e., the proportion that does not self-clear or recover), and thus our estimates, represent individuals who harbour viable Mtb bacilli, including those with TB disease [20]. Furthermore, in this study, the term self-clearance is used as an overarching concept encompassing both biological clearance without progression and natural recovery after progression to disease. All rates were assigned uninformed uniform priors and posterior estimates were calculated using a Markov chain Monte Carlo algorithm in Stan via R statistical software [21,22]. We assumed that self-clearance rates were constant and were not influenced by age or year. Further details on calibration are available in Text F in S1 File.

Model structure

We developed a deterministic model tracking Mtb infection and self-clearance of infection across 5-year age groups (Fig K in S1 File). Parameters and descriptions can be found in Table C in S1 File. Once individuals are infected, they can progress through four infection states reflecting various times since infection. In each infection state, individuals can self-clear infection and revert to not being infected. Individuals who have been distally infected (i.e., more than two years) are at risk of reinfection, adjusted by a protection factor [23]. The force of infection was determined by the reversion-adjusted and age-group-specific ARI trajectory for each country. The self-clearance rates were obtained from the calibrated scenarios. No correlations were assumed between parameters, except in a designated sensitivity analysis. The model was constructed using R version 4.3.2 for statistical computing and graphics [22]. Further details and model equations are available in the Texts G and H in S1 File.

Model run and outputs

For each country, all 1,000 ARI trajectories from 1950 to 2022 were used to estimate the proportion of each age group harbouring viable Mtb at various times since infection. These proportions were combined with population estimates obtained from the United Nations World Population Prospects [24]. Estimates for absolute numbers and prevalence of viable Mtb infection were summarised and explored by age group, country, region, and globally. For the main results, we report the estimates based on the high self-clearance scenario as a conservative approach to estimating viable Mtb infection in 2022, with robustness compared against the low self-clearance scenario.

Statistical analysis

All analyses were performed using R [22]. Model outputs are summarised and reported as medians with corresponding 95% UI, calculated as the 2.5%–97.5% percentile range.

Sensitivity analyses

We conducted two sensitivity analyses to assess the impact of assumptions around self-clearance. First, we assumed that self-clearance parameters were correlated across countries: for each model run, a sampled set of self-clearance values was drawn and applied uniformly to all countries, reflecting the assumption that natural history processes may be consistent across settings. Second, we conducted a parameter-driven analysis in which self-clearance rates were systematically scaled by ±25%, ±50%, and ±75% relative to the median estimates from the calibrated high self-clearance scenario to explore how these rates influence global burden estimates.

GATHER reporting

This study was reported in accordance with the Guidelines for Accurate and Transparent Health Estimates Reporting (GATHER) [25]. Fig 1 provides a comprehensive conceptual overview of the study, including methods and data sources. The GATHER checklist is also available in the S2 File.

thumbnail
Fig 1. Diagram of the conceptual overview of the study.

Conceptual overview of study as per Guidelines for Accurate and Transparent Health Estimates Reporting (GATHER) [25]. This diagram summarises the key data sources, analytical steps, and modelling components used to estimate the global burden of viable Mycobacterium tuberculosis infection. The left panels outline the derivation of ARI data: direct estimates from immunoreactivity surveys and indirect estimates derived from WHO TB incidence data via a revised Stýblo rule, with adjustments for smear positivity in children and people living with HIV. All ARI estimates were further adjusted for underestimation due to immunoreactivity reversion. The combined ARI dataset was smoothed using Gaussian process and stratified by age using TB incidence per capita and contact mixing matrices. These age-specific ARIs informed the force of infection input in the ODE model. Additional inputs included self-clearance rates (calibrated via MCMC to time point-specific targets) and demographic data on population size and age structure. Model outputs include national, regional, and global estimates of viable Mtb infection by age group. ARI, Annual risk of infection; HIV, Human immunodeficiency virus; MCMC, Markov chain Monte Carlo; Mtb, Mycobacterium tuberculosis; ODE, Ordinary differential equation; TB, Tuberculosis; WHO, World Health Organization.

https://doi.org/10.1371/journal.pmed.1004920.g001

Results

Annual risk of infection estimates

Fig 2 shows the fitted ARI trajectories with and without reversion and age-specific adjustments for India, Indonesia, and China—countries with the highest estimated TB incidence in 2022. Compared with the under-15-year-old population, the average relative ARI was 3.3 (95% UI: 2.9, 3.9) for the 15- to 45-year-old population and 2.9 (95% UI: 2.5, 3.3) for those aged 45 and older. Regional relative ARIs by age group are available in Table D in S1 File. For India, Indonesia, and China, the estimated ARI for the 15-to-45-year age group was 8.4% (95% UI: 4.1, 16.2), 13.6% (95% UI: 7.1, 25.3), and 2.0% (95% UI: 0.9, 4.3), respectively. ARI estimates for 2022 for the remaining countries on WHO’s list of 30 high TB burden countries are shown in Table E in S1 File. When compared to the unadjusted ARI estimates for 2014 from Houben and Dodd [25], the ARI for the under-15-year age group was 3.1 times higher (interquartile range: 2.5, 4.0), reflecting the increase in the force of infection due to adjustments for reversion underestimation (Table F in S1 File).

thumbnail
Fig 2. Fitted annual risk of infection trajectories and adjustments.

Fitted annual risk of Mycobacterium tuberculosis infection trajectories for selected countries and corresponding adjustments. The first column shows ARI trajectories without adjustment, the second column shows trajectories adjusted for immunoreactivity reversion, and the third column shows the ARI trajectories subdivided by age group, using children under 15 years of age as the reference group for relative ARIs. Greater uncertainty is observed in earlier years, which narrows as ARI data becomes available from immunoreactivity surveys or TB prevalence estimates. Data points represent available ARI data, with black circles representing TB prevalence estimates and black triangles representing nationally representative immunoreactivity surveys; error bars reflect measurement precision to ±one standard deviation. Lines represent the mean ARI, and the shaded area shows ±one standard deviation from the Gaussian process regression with a linear trend. ARI, Annual risk of infection; CHN, China; IDN:, Indonesia; IND, India.

https://doi.org/10.1371/journal.pmed.1004920.g002

Global burden of infection estimates

We estimate that 133.7 million people (95% UI: 104.0, 171.1) had a recent Mtb infection in 2022, representing 1.7% (95% UI: 1.3, 2.2) of the global population. In total, 288.9 million people (95% UI: 242.2, 342.7)—or 3.7% (95% UI: 3.1, 4.3) globally—were harbouring a viable Mtb infection. Estimates of both total and recent infection were robust across scenarios, with minimal differences observed between the high and low long-term self-clearance assumptions (Table 1 versus Table G in S1 File for absolute numbers; Table 2 versus Table H in S1 File for proportions of the population).

thumbnail
Table 1. Number of individuals with viable Mycobacterium tuberculosis infection in 2022.

https://doi.org/10.1371/journal.pmed.1004920.t001

thumbnail
Table 2. Proportion of population with viable Mycobacterium tuberculosis infection in 2022.

https://doi.org/10.1371/journal.pmed.1004920.t002

Regional and country-level infection estimates

Of all recent infections, approximately 87% were found in the three WHO regions of South-East Asia (49.0%; 95% UI: 37.2, 62.4), the Western Pacific (19.7%; 95% UI: 12.6, 30.5), and Africa (17.9%; 95% UI: 12.9, 24.1) (Table I in S1 File). At the country level, India, Indonesia, and China collectively represented around 50% of global recent infections, with 39.1 million (95% UI: 18.0, 73.6), 12.0 million (95% UI: 5.8, 22.9), and 11.2 million (95% UI: 5.0, 25.5) people recently infected, respectively (Table J in S1 File). The remaining countries in the top 10 were all from WHO’s list of 30 high TB burden countries [26]. Fig 3 shows the regional and country-level variation in the prevalence of recent viable Mtb infections. Here, the countries with the highest prevalence of recent viable Mtb infections were the Philippines (7.7%; 95% UI: 3.2, 16.4), North Korea (5.4%; 95% UI: 2.2, 12.0), and Timor-Leste (5.3%; 95% UI: 2.2, 11.7) (Table K in S1 File). Estimates of viable Mtb infection for all 171 countries are available in the S1 Data.

thumbnail
Fig 3. Prevalence of recent viable Mycobacterium tuberculosis infection in 2022.

Median estimated population prevalence of recent viable Mycobacterium tuberculosis infection by country in 2022. Recent infection is defined as occurring within the past two years. Vector map data obtained from Natural Earth (https://www.naturalearthdata.com/).

https://doi.org/10.1371/journal.pmed.1004920.g003

Age trends

The proportions of individuals with viable Mtb infection by age group and region are shown in Fig 4 and Table L in S1 File. Across all regions, higher infection estimates are concentrated in the adult population younger than 45 years old, in line with the increased ARI. Among those recently infected, 16.1 million (95% UI: 12.7, 20.3) were children under 15 years of age, representing 12.0% (95% UI: 11.4, 12.7) of all recent infections (Tables 1 and 2). There was a substantial regional disparity in recent infections among children, ranging from 7.9% (95% UI: 7.3, 8.6) in the European Region to 18.3% (95% UI: 17.6, 19.0) in the African Region, likely due to differences in population structure (Table 2).

thumbnail
Fig 4. Prevalence of viable Mycobacterium tuberculosis infection by age and WHO region in 2022.

Median estimated proportion of the population per age group and by WHO region infected with viable Mycobacterium tuberculosis in 2022. Values are presented as percentages (%). Recent infections are defined as those occurring within the past two years, while distal infections are those acquired more than two years ago. The coarse age group distribution in the estimates reflects the annual risk of infection disaggregated into the following groups: under 15 years old, 15–45 years old, and 45 years and older. Estimates are based on a scenario assuming high long-term self-clearance rates. WHO, World Health Organization; AFR, African Region; AMR, Region of the Americas; EMR, Eastern Mediterranean Region; EUR, European Region; SEA, South-East Asia Region; WPR, Western Pacific Region.

https://doi.org/10.1371/journal.pmed.1004920.g004

Sensitivity analyses

Under the scenario assuming correlated self-clearance rates across countries, the estimated burden of viable Mtb infection in 2022 was similar to the main analysis, though with wider UIs: 287.2 million people (95% UI: 196.1, 393.8) in total, and 132.2 million people (95% UI: 79.5, 202.2) with recent infection (Table M in S1 File). Under varying self-clearance scenarios, global estimates of viable Mtb infection increased with lower self-clearance rates, particularly for total burden (Fig 5 and Table N in S1 File). The proportion of infections that were recent declined as the total burden rose accordingly. At higher self-clearance levels, total infection estimates remained relatively stable.

thumbnail
Fig 5. Global estimates of viable Mycobacterium tuberculosis infection under varying self-clearance scenarios.

Absolute number of individuals infected with viable Mycobacterium tuberculosis in 2022. Values are shown in millions (M) with error bars indicating 95% uncertainty intervals (UI). Recent infections are defined as those occurring within the past two years, while distal infections are those acquired more than two years ago. Estimates reflect systematic variation of self-clearance rates relative to the median estimates from the high self-clearance scenario.

https://doi.org/10.1371/journal.pmed.1004920.g005

Discussion

Our findings provide an estimate of the global burden of Mtb infection, with approximately 134 million people recently infected in 2022 and therefore at immediate risk of developing TB. In total, 289 million people—3.7% of the world’s population—were harbouring a viable Mtb infection. These estimates show significant geographical variability, with infections concentrated in WHO regions of South-East Asia, the Western Pacific, and Africa. These findings underscore the need for enhanced diagnostic and management strategies to identify individuals with viable Mtb infection—particularly those who may benefit from TPT.

Our estimates further highlight the important distinction between viable Mtb infection and positive immunoreactivity tests. Despite working on a similar construction of ARI trajectories, there is a stark numerical contrast between our viable Mtb infection estimates and the latest LTBI estimates [9]. The main contributor to this difference is the inclusion of self-clearance of infection in our model, thereby shifting the estimate definition from (historical) exposure to Mtb (as measured by immunoreactivity) to viable Mtb infection. This was suggested by Emery and colleagues, where the population with viable Mtb infection was markedly smaller (up to 20%) than assumed in India, China, and Japan [1,18]. Similarly, our estimates indicate a lower overall proportion of Mtb infection, even after adjusting for the increased ARI experienced (considering immunoreactivity reversion and age-specific risks). Self-clearance is now widely acknowledged, as reflected by a shift in WHO reporting, which now describes the quarter of the population figure as “having been infected” rather than “currently infected” [13,14]. Self-clearance, therefore, outweighed the increased force of infection generated by a revised understanding of the true ARI underneath empirical measurements [6]. However, the shift in ARI has led to a higher estimate of the recently infected population than previous estimates (1.7 versus 0.8%), suggesting a more rapid turnover of the population at high risk of developing TB [12].

Although reversion and self-clearance are likely related phenomena, they represent conceptually distinct processes in the context of this analysis. Reversion, defined as the loss of immunoreactivity [13,14], can lead to the underestimation of ARI if not accounted for, as individuals with prior infection may test negative [8,9,16,17]. In contrast, self-clearance refers to the biological elimination of viable Mtb bacilli from the body [8]. While reversion may occur in individuals who have self-cleared, not all reversion indicates true clearance, nor does all self-clearance necessarily result in reversion [26].

While this study attempts to quantify the extent of viable Mtb infection, our ability to detect it remains hampered by the limitations of currently available tests. These include the TST and interferon-gamma release assays (IGRA), which detect immunoreactivity to tuberculous antigens but do not directly detect the presence of the organism itself [27]. Another limitation can be observed in individuals with non-specific chest radiography abnormalities who are treated for TB despite lacking bacteriological confirmation and clear evidence of Mtb infection. Additionally, current immunoreactivity tests are inaccurate surrogates of viability as they can remain detectable (positive) after TPT or treatment [28]. This poses a significant challenge for national TB programmes, as TPT is often offered to those with a single positive immunoreactivity test [29], whereas the number of those that would truly benefit is likely smaller, especially in absence of an indicative patient history, such as recent contact with a TB patient. To improve and expand the use of TPT—and to aid in the diagnosis of bacteriologically negative TB disease—there is a clear need for improved biomarkers of Mtb infection, some of which have been explored recently [30,31].

Our modelling study is not without limitations. A major limitation concerns the estimation of the ARI. Empirical ARI estimates remain scarce, so contemporary ARIs had to be inferred indirectly from WHO TB incidence estimates, which provided the necessary data points for the GP regression. In country-years where both direct and indirect ARI estimates were available, more than half had non-overlapping uncertainty intervals; however, the GP regression, as a non-parametric framework, was able to accommodate this discrepancy. Ideally, population-level trends in Mtb transmission should be evaluated through repeated immunoreactivity surveys to obtain direct ARI estimates. While still requiring adjustments to account for reversion, recent surveys using IGRA or new TB antigen-based skin tests would improve ARI estimates, especially as they reduce false-positive results among children who received Bacillus Calmette Guérin (BCG) vaccination [27].

Another major limitation concerns the evidence base for self-clearance. Although a few studies summarise supportive evidence [1,4,8,9], empirical data remain limited. As our results show, assumptions about self-clearance are a major source of uncertainty in our estimates. Our sensitivity analysis on systematically varied self-clearance rates showed that global burden estimates were particularly sensitive to lower self-clearance rates, highlighting the need for better empirical data to inform this parameter. Additionally, these self-clearance rates were applied uniformly, but they may plausibly vary by age and HIV status; we currently lack the data to inform this heterogeneity. Future work should explore and incorporate such variation where possible.

Furthermore, several broader modelling simplifications were required to generate global estimates. We treated all individuals as experiencing one of three age-specific ARIs, disregarding population and individual factors that may affect the force of infection. ARI estimates could have been further refined by breaking them down into distinct 5-year age groups; however, given that self-clearance had the strongest effect on model outputs, we do not expect that adding further granularity to the ARI estimates would have changed the qualitative findings. Additionally, the reversion underestimation factor was simplified to a single value and applied uniformly to all ARI estimates, despite known variations across ages and populations [31]. We accounted for reduced measurement precision to reflect the additional uncertainty introduced by this assumption. Furthermore, although our model does not explicitly include a compartment for individuals who have self-cleared infection, we incorporated protection against reinfection by reducing the force of infection for individuals with current infection; however, this may underestimate the potential contribution of immunological memory in those who have cleared infection. Finally, because the calibration targets for the proportion self-cleared or recovered were derived from a model that accounts for progression and regression across multiple TB disease states and does not include treatment-driven recovery, our estimates of viable Mtb infection include individuals with TB disease. As a result, the burden, particularly the fraction that might be considered medically actionable, may represent an upper bound. We recognise that addressing this limitation fully would require a more complex model, incorporating explicit representation of multiple TB disease states and consideration of treatment-driven recovery. We see this as an important and natural direction for future work. Nevertheless, while these necessary simplifications may have introduced some biases in our estimates, they were appropriate to ensure the feasibility of assessing Mtb infection burden at national, regional, and global levels.

Our findings provide valuable global estimates of the burden of viable Mtb infection, emphasising the crucial distinction from positive immunoreactivity tests. By more accurately accounting for the true force of infection and the immune system’s ability to clear infections, we identified a sizable population recently infected and at high risk of progressing to disease. This approach reinforces the importance of better characterising viable Mtb infection within the spectrum of TB disease [20], as opposed to the concept of ‘latency’, which imposes a rigid separation from disease and relies on the notion of dormant infection to explain future risk. There is an urgent need for enhanced diagnostic tools capable of detecting viable Mtb infection, allowing us to identify individuals who may benefit from TB preventive therapy, as well as those with bacteriologically unconfirmed disease who might otherwise go undiagnosed.

Supporting information

S1 File. Supplementary material.

Additional methodological details, supplementary tables, and figures.

https://doi.org/10.1371/journal.pmed.1004920.s001

(PDF)

S2 File. GATHER checklist.

GATHER checklist is made available under the Creative Commons Attribution 4.0 (CC BY 4.0) license. The original checklist can be accessed at: https://doi.org/10.1371/journal.pmed.1002056.

https://doi.org/10.1371/journal.pmed.1004920.s002

(PDF)

S3 File. PMED Inclusivity in Global Research.

Checklist addressing ethical, cultural, and scientific considerations for inclusivity in global research.

https://doi.org/10.1371/journal.pmed.1004920.s003

(PDF)

S1 Data. Country-level viable Mycobacterium tuberculosis infection estimates.

Country-level estimates of viable Mycobacterium tuberculosis infection, including recent and all infections, reported as absolute numbers and population prevalence, overall and among children.

https://doi.org/10.1371/journal.pmed.1004920.s004

(XLSX)

Acknowledgments

The authors would like to acknowledge Prof. Marcel Behr and Prof. Dave Moore for their valuable insights on the methodology employed and estimates generated. Additionally, the authors extend their gratitude to Dr. Tomos Prŷs-Jones for his assistance in developing the pipeline to run the analyses on the high-performance computing platform. This research was supported in part through computational resources provided by the London School of Hygiene and Tropical Medicine.

References

  1. 1. Behr MA, Edelstein PH, Ramakrishnan L. Rethinking the burden of latent tuberculosis to reprioritize research. Nat Microbiol. 2024;9(5):1157–8. pmid:38671272
  2. 2. World Health Organization. The end TB strategy. Geneva, Switzerland: WHO; 2015. Available from: https://www.who.int/publications/i/item/WHO-HTM-TB-2015.19
  3. 3. Matteelli A, Churchyard G, Cirillo D, den Boon S, Falzon D, Hamada Y, et al. Optimizing the cascade of prevention to protect people from tuberculosis: a potential game changer for reducing global tuberculosis incidence. PLOS Glob Public Health. 2024;4(7):e0003306. pmid:38954723
  4. 4. Behr MA, Edelstein PH, Ramakrishnan L. Revisiting the timetable of tuberculosis. BMJ. 2018;362:k2738. pmid:30139910
  5. 5. Dye C, Scheele S, Dolin P, Pathania V, Raviglione MC. Consensus statement. Global burden of tuberculosis: estimated incidence, prevalence, and mortality by country. WHO Global Surveillance and Monitoring Project. JAMA. 1999;282(7):677–86. pmid:10517722
  6. 6. Houben RMGJ, Dodd PJ. The global burden of latent tuberculosis infection: a re-estimation using mathematical modelling. PLoS Med. 2016;13(10):e1002152. pmid:27780211
  7. 7. GBD Tuberculosis Collaborators. Global, regional, and national burden of tuberculosis, 1990-2016: results from the Global Burden of Diseases, Injuries, and Risk Factors 2016 Study. Lancet Infect Dis. 2018;18:1329–49.
  8. 8. Behr MA, Edelstein PH, Ramakrishnan L. Is Mycobacterium tuberculosis infection life long?. BMJ. 2019;367:l5770. pmid:31649096
  9. 9. Emery JC, Richards AS, Dale KD, McQuaid CF, White RG, Denholm JT, et al. Self-clearance of Mycobacterium tuberculosis infection: implications for lifetime risk and population at-risk of tuberculosis disease. Proc Biol Sci. 2021;288(1943):20201635. pmid:33467995
  10. 10. Horton KC, Richards AS, Emery JC, Esmail H, Houben RMGJ. Reevaluating progression and pathways following Mycobacterium tuberculosis infection within the spectrum of tuberculosis. Proc Natl Acad Sci U S A. 2023;120(47):e2221186120. pmid:37963250
  11. 11. Cauthen GM, Pio A, ten Dam HG. Annual risk of tuberculous infection. 1988. Bull World Health Organ. 2002;80(6):503–11; discussion 501-2. pmid:12132011
  12. 12. Dale KD, Schwalb A, Coussens AK, Gibney KB, Abboud AJ, Watts K, et al. Overlooked, dismissed, and downplayed: reversion of Mycobacterium tuberculosis immunoreactivity. Eur Respir Rev. 2024;33(173):240007. pmid:39048129
  13. 13. Schwalb A, Emery JC, Dale KD, Horton KC, Ugarte-Gil CA, Houben RMGJ. Impact of reversion of Mycobacterium tuberculosis immunoreactivity tests on the estimated annual risk of tuberculosis infection. Am J Epidemiol. 2023;192(12):1937–43. pmid:36749011
  14. 14. Dowdy DW, Behr MA. Are we underestimating the annual risk of infection with Mycobacterium tuberculosis in high-burden settings?. Lancet Infect Dis. 2022;22(9):e271–8. pmid:35526558
  15. 15. Dodd PJ, Looker C, Plumb ID, Bond V, Schaap A, Shanaube K, et al. Age- and sex-specific social contact patterns and incidence of Mycobacterium tuberculosis Infection. Am J Epidemiol. 2016;183:156–66.
  16. 16. Ferebee SH. Controlled chemoprophylaxis trials in tuberculosis. A general review. Bibl Tuberc. 1970;26:28–106. pmid:4903501
  17. 17. Opie EL, Aronson JD. Tubercule bacilli in latent tuberculosis lesions and in lung tissue without tuberculosis lesions. Arch Pathol. 1927;4:1–21.
  18. 18. World Health Organization. Global tuberculosis report 2023. Geneva, Switzerland: WHO; 2023. Available from: https://iris.who.int/bitstream/handle/10665/373828/9789240083851-eng.pdf?sequence=1
  19. 19. Prem K, van Zandvoort K, Klepac P, Eggo RM, Davies NG, Centre for the Mathematical Modelling of Infectious Diseases COVID-19 Working Group. Projecting contact matrices in 177 geographical regions: an update and comparison with empirical data for the COVID-19 era. PLoS Comput Biol. 2021;17:e1009098.
  20. 20. Coussens AK, Zaidi SMA, Allwood BW, Dewan PK, Gray G, Kohli M, et al. Classification of early tuberculosis states to guide research for improved care and prevention: an international Delphi consensus exercise. Lancet Respir Med. 2024;12(6):484–98. pmid:38527485
  21. 21. Stan Development Team. Stan Modeling Language Users Guide and Reference Manual, v2.35. 2024. Available from: https://mc-stan.org
  22. 22. RCore Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2024. Available from: https://www.r-project.org/
  23. 23. Andrews JR, Noubary F, Walensky RP, Cerda R, Losina E, Horsburgh CR. Risk of progression to active tuberculosis following reinfection with Mycobacterium tuberculosis. Clin Infect Dis. 2012;54(6):784–91. pmid:22267721
  24. 24. United Nations. World population prospects—population division. In: World population prospects; 2022 [Internet]. [cited Jun 2023]. Available from: https://population.un.org/wpp/
  25. 25. Stevens GA, Alkema L, Black RE, Boerma JT, Collins GS, Ezzati M, et al. Guidelines for accurate and transparent health estimates reporting: the GATHER statement. Lancet. 2016;388(10062):e19–23. pmid:27371184
  26. 26. World Health Organization. WHO global lists of high burden countries for tuberculosis (TB), TB/HIV and multidrug/rifampicin-resistant TB (MDR/RR-TB), 2021-2025: background document. Geneva, Switzerland: WHO; 2021 Jun. Available from: https://iris.who.int/bitstream/handle/10665/341980/9789240029439-eng.pdf
  27. 27. Getahun H, Matteelli A, Chaisson RE, Raviglione M. Latent Mycobacterium tuberculosis infection. N Engl J Med. 2015;372(22):2127–35. pmid:26017823
  28. 28. Johnson DF, Malone LL, Zalwango S, Mukisa Oketcho J, Chervenak KA, Thiel B, et al. Tuberculin skin test reversion following isoniazid preventive therapy reflects diversity of immune response to primary Mycobacterium tuberculosis infection. PLoS One. 2014;9(5):e96613. pmid:24796677
  29. 29. Hamada Y, Gupta RK, Quartagno M, Izzard A, Acuna-Villaorduna C, Altet N, et al. Predictive performance of interferon-gamma release assays and the tuberculin skin test for incident tuberculosis: an individual participant data meta-analysis. EClinicalMedicine. 2023;56:101815. pmid:36636295
  30. 30. Belay M, Tulu B, Younis S, Jolliffe DA, Tayachew D, Manwandu H, et al. Detection of Mycobacterium tuberculosis complex DNA in CD34-positive peripheral blood mononuclear cells of asymptomatic tuberculosis contacts: an observational study. Lancet Microbe. 2021;2(6):e267–75. pmid:34100007
  31. 31. Scriba TJ, Fiore-Gartland A, Penn-Nicholson A, Mulenga H, Kimbung Mbandi S, Borate B, et al. Biomarker-guided tuberculosis preventive therapy (CORTIS): a randomised controlled trial. Lancet Infect Dis. 2021;21(3):354–65. pmid:33508224