Figures
Abstract
Active tuberculosis (TB) is caused by Mycobacterium tuberculosis (Mtb) bacteria and is characterized by multiple phases of infection, leading to difficulty in diagnosing and treating infected individuals. Patients with latent tuberculosis infection (LTBI) can reactivate to the active phase of infection following perturbation of the dynamic bacterial and immunological equilibrium, which can potentially lead to further Mtb transmission. However, current diagnostics often lack specificity for LTBI and do not inform on TB reactivation risk. We hypothesized that immune profiling readily available QuantiFERON-TB Gold Plus (QFT) plasma supernatant samples could improve LTBI diagnostics and infer risk of TB reactivation. We applied a whispering gallery mode, silicon photonic microring resonator biosensor platform to simultaneously quantify thirteen host proteins in QFT-stimulated plasma samples. Using machine learning algorithms, the biomarker concentrations were used to classify patients into relevant clinical bins for LTBI diagnosis or TB reactivation risk based on clinical evaluation at the time of sample collection. We report accuracies of over 90% for stratifying LTBI + from LTBI– patients and accuracies reaching over 80% for classifying LTBI + patients as being at high or low risk of reactivation. Our results suggest a strong reliance on a subset of biomarkers from the multiplexed assay, specifically IP-10 for LTBI classification and IL-10 and IL-2 for TB reactivation risk assessment. Taken together, this work introduces a 45-minute, multiplexed biomarker assay into the current TB diagnostic workflow and provides a single method capable of classifying patients by LTBI status and TB reactivation risk, which has the potential to improve diagnostic evaluations, personalize treatment and management plans, and optimize targeted preventive strategies in Mtb infections.
Citation: Meserve K, Chapman CA, Xu M, Zhou H, Robison HM, Hilgart HR, et al. (2025) Multiplexed cytokine profiling identifies diagnostic signatures for latent tuberculosis and reactivation risk stratification. PLoS ONE 20(4): e0316648. https://doi.org/10.1371/journal.pone.0316648
Editor: Fumihiro Yamaguchi, Showa University Fujigaoka Hospital, JAPAN
Received: July 17, 2024; Accepted: December 15, 2024; Published: April 9, 2025
Copyright: © 2025 Meserve et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information.
Funding: This work was supported by the National Institute of Allergy and Infectious Diseases at the United States National Institutes of Health [AI141591 to R.C.B. and P.E.]. This paper’s contents are solely the responsibility of the authors and do not necessarily represent the official views of the National Institutes of Health, Mayo Clinic, the University of Michigan at Ann Arbor, or any organization. No other financial or material support for this work was provided to the authors and participants. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."
Competing interests: We have read the journal's policy and the authors of this manuscript have the following competing interests. R.C.B. has a minor financial interest in Genalyte, Inc. P.E. and T.P., and their institution have filed two patent applications related to immunodiagnostic laboratory methodologies for latent tuberculosis infection (Patent numbers: 9678071 and 10401360), which are not included in this manuscript. To date, there has been no income or royalties associated with those filed patent applications. E.S.T serves as a consultant for Roche Diagnostics (Basel, Switzerland), Euroimmun US (Mountain Lakes, NJ, USA), and Seriummune Inc. (Goleta, CA, USA) on topics outside the scope of this manuscript. R.C.B., P.E., T.P., and E.S.T. have no other conflicts to declare. The remaining authors have no conflicts to declare. These do not alter our adherence to PLOS ONE policies on sharing data and materials.”
Introduction
Active tuberculosis (TB) is a complex disease resulting from infection with Mycobacterium tuberculosis (Mtb), which can have pulmonary and/or extrapulmonary involvement. TB infection is estimated to be present in around 25% of the world’s population and continues to be a fatal disease, with an attributed 10.6 million cases and 1.3 million deaths worldwide in 2022 [1,2]. TB disease is currently regarded as a continuum of various phases of infection, with each phase requiring different methods of diagnosis and treatment regimes. This adds to the complex task of global TB eradication [3–5]. Upon infection, the host immune response attempts to contain and clear the Mtb, but under many circumstances only succeeds in sequestering the bacteria in granulomas found in different stages of maturation primarily in the lungs and lymph nodes [6–8]. Granuloma development aids Mtb persistence by isolating the bacteria from the host’s immune response, but also protects the host from continued bacterial growth and replication. The dynamic bacterial and immunological equilibrium induced by the granulomatous immune response results in a phase of TB infection called latent TB infection (LTBI), in which the host is asymptomatic with no indications of active disease on radiographs [8]. It is estimated that from all Mtb-infected individuals worldwide, 20% manifest clinically as TB disease and 80% persist as LTBI, totaling an estimated 2 billion people [6,9,10].
Approximately 90% of LTBI patients who are immunocompetent will stay asymptomatic and non-contagious [4,6]. The remaining patients will eventually reactivate from LTBI to active TB through a disruption in the dynamic equilibrium resulting in rupture of the protective granuloma and immune response. The Mtb, no longer held in a quiescent state, will then begin to replicate usually within the lungs and lymph node compartments of the host. These infection reactivation and disease progression events facilitate further transmission of Mtb. Many TB reactivation events occur within 24 months of initial infection, highlighting the importance of diagnosing the early latent phase [4]. Therefore, a key to eradicating TB involves both the identification of individuals with LTBI and the stratification of patients according to their risk of reactivation from LTBI to active TB, since preventative antibiotic therapy would be most beneficial for high-risk individuals.
To reach this ambitious goal, high performance diagnostics for accurate LTBI diagnosis and risk stratification are urgently needed. There currently is no stand-alone gold standard test for LTBI. Diagnosis is typically achieved via positive immune responses to Mtb in an interferon-gamma release assay (IGRA) and/or tuberculin skin test (TST), in conjunction with the absence of active TB markers in clinical-radiological examinations. The TST is a less specific test compared to IGRAs, since immune responses to the purified protein derivative utilized are cross-reactive against other mycobacterial and Bacillus Calmette-Guérin (BCG) vaccine antigens.
One of the available IGRAs is the QuantiFERON-TB Gold Plus test (QFT), which identifies individuals who have been infected with or exposed to Mtb. The QFT assay involves stimulation of lymphocytes in whole-blood samples with Mtb-specific antigens and controls followed by quantitation of interferon-gamma (IFN-γ) in the separated plasma by using an enzyme-linked immunosorbent assay (ELISA) or chemiluminescence immunoassay (CLIA) [11,12]. IGRAs have limitations when applied to LTBI diagnostics, including indeterminate test results in immunosuppressed patients, specifically those with altered IFN-γ host responses. IGRAs are also unable to differentiate between LTBI and active TB and have less than 3% predictive value in determining risk of TB reactivation over time [5,13–16]. Recent research into determining individuals at very high risk of TB reactivation (incipient TB) has focused on whole-blood gene expression or ribonucleic acid (RNA) signatures, which can yield false positive results with viral infections, are generally less stable than protein biomarkers, and require RNase-free sample processing [17–21]. Therefore, there is a need for improved protein-based biomarker signatures for both LTBI diagnosis and TB reactivation risk stratification.
Our work is motivated by the World Health Organization’s goal of decreasing the annual TB incidence rate by 80% before 2030. We aim to expand the TB diagnostic toolbox with a multiplexed immunoassay tool that addresses both LTBI status and risk of TB reactivation using a sample input of readily available stimulated QFT plasma. Multiplexed assays are an area of intense interest in clinical diagnostics, as the complex biological processes of the immune system are more informatively understood using multi-biomarker profiles compared to any single biomarker [22–27]. In a TB context, multiplexed assays can identify host biomarkers beyond IFN-γ as indicators of Mtb infection phase, potentially reducing false negatives in patients with IFN-γ dysregulation or due to IFN-γ independent immune responses against Mtb [26,28]. There has been a significant focus on developing multi-biomarker signatures for determining active TB status [29–35] and some studies have incorporated multi-biomarker signatures for other phases of TB infection, such as LTBI [36–40]. These studies demonstrate promise for using cytokine biomarkers to differentiate various phases of TB infection. However, many do not address variability in immunological response between patients and none feature applicability towards combining both LTBI determination and TB reactivation risk stratification in the same assay at initial assessment.
This work utilizes a silicon photonic microring resonator biosensor platform capable of rapid, multiplexed biomarker detection to quantify host proteins in QFT-stimulated plasma samples. The biomarker concentrations are used to classify patients into relevant clinical bins for LTBI diagnosis or TB reactivation risk inference using machine learning algorithms. The 45-minute assay with automated fluid handling is faster and requires less manual manipulation than traditional ELISA assays and, when coupled with machine learning, provides a method to identify important diagnostic biomarkers for LTBI and TB reactivation risk classifications. We have previously applied this multiplexed microring platform in various clinical capacities, including LTBI-relevant diagnostics [41–47]. However, these studies utilized samples emanating from in vitro stimulation of peripheral blood mononuclear cells (PBMCs) with six different antigens or controls, which is technically laborious and time consuming. Notably, though, this work demonstrated the applicability of the platform coupled with machine learning methods to generate biomarker profiles that were not only biologically driven, but also led to predictive accuracies for LTBI and high-risk status that suggested a strong potential for clinical diagnostics. We now postulate that employing this multiplexed assay workflow and bioinformatic analysis with readily available residual QFT plasma will create a more rapid profiling approach directly amenable to the current clinical workflow for TB diagnostics. The goal of this work is to determine the diagnostic accuracy of our method to diagnose LTBI and identify individuals with increased risk (high risk) and reduced risk (low risk) of TB reactivation.
Herein, we report predictive accuracies for classifying LTBI positive individuals from a population of patients who received QFT testing at the Mayo Clinic using both absolute and precision normalized biomarker concentrations. We further stratify the LTBI positive patient group into high- or low-risk of reactivation and report predictive accuracies for classifying both risk designations. Further, we identify the most important biomarker variables for all the models and highlight a dependence on six of the thirteen biomarkers in the panel. Through this implementation, we demonstrate how our microring resonator platform is amenable to the current TB diagnostic pipeline and can increase the amount of clinical information gleaned from a single test. This work enables improvement in LTBI diagnosis and has the potential to optimize individual preventive treatment and management decisions for subjects exposed to Mtb.
Methods
Subject enrollment and clinical designations
This study was approved by the Mayo Clinic Institutional Review Board (IRB #09–003253) and Olmsted County Public Health Services. All study participants signed an informed written consent. Subjects were enrolled at the Mayo Clinic in Rochester, Minnesota, between August 3, 2017 and December 6, 2023. Risk factors for TB infection, TB progression, and/or TB reactivation were obtained through a questionnaire and review of medical records as previously described [48,49]. LTBI diagnoses were made based on the Centers for Disease Control and Prevention (CDC) current guidelines criteria, TB risk factors, and prior TST and QuantiFERON-TB Gold In-Tube or Gold In-Tube Plus results (Qiagen, Germantown, MD) [49,50]. The study subjects included TB unexposed individuals with negative IGRA testing results (LTBI negative cases), and subjects with LTBI at varying risk for developing active TB infection, including untreated LTBI patients and patients who had received treatment for LTBI. A modified multifactorial predictive modeling platform (Online TST/IGRA interpreter) was applied to estimate the cumulative risk of TB reactivation in all subjects [51,52]. The platform accounts for known risk factors for TB progression and reactivation, such as human immunodeficiency virus (HIV) infection, diabetes mellitus, treatment with TNF-α antagonists, and body weight, and is adjusted by LTBI treatment effect.
Three clinical designations were used in this study: LTBI status, high-risk LTBI status, and low-risk LTBI status. Together, these designations provide information relevant to TB reactivation risk assessments and, thus, individual treatment recommendations relevant for clinicians and public health providers who screen, diagnose, and manage LTBI patients who have a possible or confirmed exposure to an active pulmonary TB patient and who are at risk of TB reactivation. CDC guidelines for diagnosis of LTBI were used to classify subjects as LTBI positive (LTBI+) or negative (LTBI−) [50]. Subjects with a LTBI+ diagnosis were deemed at high risk of TB reactivation if they could be classified as either (1) individuals born in and/or from TB endemic areas having untreated LTBI with both TST+ and prior IGRA+ results, (2) having untreated LTBI with TST+ conversion, prior IGRA− results, and prior TB exposures, (3) having prior close TB exposure and subsequent IGRA+ results, (4) being immunosuppressed with untreated LTBI, or (5) having risk factors, such as TB exposure, IGRA+ results, and a cumulative risk of TB reactivation ≥ 4.1% based on the adjusted TST/IGRA interpreter. Subjects with a LTBI+ diagnosis were further classified as low risk of TB reactivation if they were LTBI+, completed guidelines-based preventative treatment for LTBI, and had a low likelihood for subsequent TB exposures at the time of study enrollment, such as those living in low TB endemic areas.
Sample collection and QuantiFERON testing
Three milliliters of blood were collected from each subject and sent for QFT testing at the Mayo Clinic’s clinical laboratories. Initial study participants (39/165 patients) were analyzed using the QuantiFERON®-TB Gold In-Tube (Qiagen) kit. The QFT assay was performed according to manufacturer recommendations that included three separate in-tube stimulation conditions: a negative control (NIL), positive control (MIT), and the Mtb-specific peptide mixture for CD4+ cell stimulation (AG) [51,53]. During the study, the Mayo Clinic laboratory transitioned to the QuantiFERON-TB Gold Plus (Qiagen) kit [54]. This updated kit contained the stimulation categories of NIL, MIT, and Mtb-specific peptide mixture (now termed TB1 and shown to be akin to the AG stimulation in the original QFT assay). The updated kit included an additional Mtb-specific stimulation, TB2, that contained the same peptide cocktail as TB1 along with proprietary peptides to elicit both CD4+ and CD8+ T-cell responses [55]. For the algorithms developed in this work, biomarker profiling data from the TB2 tube was not used since the NIL, MIT, and AG/TB1 stimulations are ubiquitous for both QFT assays used across all patients in the study.
Patient blood was stimulated in-tube and the plasma was separated from the red blood cells and analyzed for IFN- γ concentration by an ELISA or CLIA. The results were interpreted as per the kit instructions. While TB2 tubes were excluded from biomarker profiling for continuity to the original QFT assay, the tubes were still used in clinical designation after transitioning to the updated QFT assay, as per Qiagen instructions. In the original QFT assay, a positive result was declared if measured levels of IFN-γ included NIL ≤ 8.0 IU/mL, the IFN- γ concentration difference between TB1 and NIL tubes (TB1-NIL) ≥ 0.35 IU/mL, and TB1-NIL ≥ 25% of the NIL IFN-γ value. In the updated QFT assay, a positive result was declared if IFN-γ concentrations in NIL ≤ 8.0 IU/mL, TB1-NIL or TB2-NIL ≥ 0.35 IU/mL, and TB1-NIL or TB2-NIL ≥ 25% of the NIL IFN-γ value. Samples that were declared indeterminant were excluded from the study. Leftover plasma from each QFT tube was stored at -80°C after testing and thawed immediately before multiplexed cytokine analyses.
Multiplexed biomarker detection
Silicon photonic microring resonators were used to perform multiplexed assays for quantifying thirteen protein biomarkers in this report. Briefly, changes in the local refractive index at the surface of each sensor causes a shift in the resonant wavelength of light supported by each microring structure. Capture antibodies specific to the target proteins were arrayed across the sensor chip surface such that there were four technical replicates per target. The sensing scheme resembled a sandwich-style ELISA, with the immobilized capture antibody, target protein, and biotinylated tracer antibody forming a highly specific sandwich complex that was tagged with streptavidin-conjugated horse-radish peroxidase (SA-HRP) [56–58]. The final assay step included an amplification reagent (4-chloro-1-napthol) that was catalytically converted to an insoluble product by the HRP, resulting in local deposition of a precipitate onto the microring that in turn generated a greater shift in resonant wavelength proportional to the target analyte concentration. Multiplexed calibration curves (S1 Fig in S2 File) were used to convert the resulting shifts in resonant wavelength to concentrations for each of the thirteen biomarkers in the QFT plasma samples. The technology provided automated, simultaneous analysis of two samples with near real-time output response. The optimized assay featured a time-to-result of 45 minutes, required less than 250 μL of QFT plasma sample, and provided limits of detection in the picogram per milliliter range. A detailed description of the technology, sensing principles, instrumentation, sensor chip preparation, assay parameters, and calibration construction can be found in the Supplemental Information.
Assay panel and patient sample analysis
The 13-plex cytokine and chemokine panel deployed in this study was selected based on previously published work [46,47] and consisted of IL-1β [59], IL-2 [60], IL-6 [61], IL-10 [62], IL-15 [63], IL-17 [64], CCL2/MCP-1 [65], CCL3/MIP-1α [66], CCL4/MIP-1β [67], CCL8/MCP-2 [68], IFN- γ [69], IP-10 [70,71], and TNF-⍺[72]. The antibody pairs and standard proteins for each of these targets are detailed in S1 Table in S2 File. Each patient sample was analyzed simultaneously on a single sensor chip at two dilutions in under 45 minutes using residual QFT stimulated plasma (example sample data output shown in S2 Fig in S2 File). Sample dilutions of 10X and 2X were chosen to allow coverage of the necessary dynamic ranges for all cytokines in the multiplexed panel. Each plasma sample was removed from −80°C, thawed at 4°C, and diluted two- and ten-fold in 1X phosphate-buffered saline containing 0.5% bovine serum albumin to a total volume of 350 μL. The instrumental method for sample analysis is described in the Supplemental Information. Institutionally approved biohazard safety level 2 precautions were exercised while handling the QFT plasma samples.
Microring data analysis was completed as described previously [73]. The relative shift before the amplification step was subtracted from the relative shift at the end of the final buffer rinse to determine a final net shift for each of the thirteen targets per sample across both dilutions. Using the calibration curves, the net shifts for both analyzed dilutions (10X and 2X) were converted to analyte concentration using the corresponding calibration (10% plasma and 50% plasma, respectively). The technical replicates (n = 4) were averaged for each target. The dilution that resulted in a concentration closest to the inflection point of the specific target’s calibration curve was selected, corrected for dilution factor, and used for further analysis. Data points that exceeded the saturating point of a target’s specific calibration were assigned a value ten times greater than the highest measurable concentration, and data points below the specific target’s limit of detection were assigned as zero. The analysis workflow yielded one concentration value for each of the thirteen biomarkers across the three QFT simulated plasma samples, resulting in 39 absolute biomarker concentrations per patient for analysis.
Precision normalization
We normalized biomarker levels within the various QFT samples for each patient to determine if accounting for patient-to-patient variation in baseline immunity would result in more accurate models. For each individual biomarker, the normalization procedure included subtracting the negative control concentration from that of the Mtb-specific stimulation to correct for basal cytokine immune response (TB1-NIL), subtracting the positive control concentration from that of the Mtb-specific stimulation to assess TB-specific response against mitogenic response (TB1-MIT), and subtracting the positive control concentration from that of the negative control as a measure of functional (inducible) immune response of the subject (MIT-NIL). Taken together, the three normalized conditions calculated for each of the 13 measured biomarker concentrations led to 39 variables per patient for analysis.
Statistical testing
Comparing the biomarker concentrations between clinical designations was completed using two-sample Wilcoxon rank-sum tests, also known as Mann-Whitney U tests or Wilcoxon-Mann-Whitney (WMW) tests. A WMW test is a non-parametric statistical hypothesis test that quantifies differences in the distribution of data between two populations [74,75]. WMW tests were chosen for comparing the biomarkers between clinical designations because the samples between groups meet the independent, ordinal, and continuous test assumptions, intervals may not be constant, outliers are present, and this test is suited for these sample sizes. The null hypothesis is that the distribution of the groups being compared are identical, with the alternative hypothesis being that the distribution of the groups is not identical. The absolute cytokine concentrations were log-transformed for data visualization.
Random forest machine learning
This study employs the random forest approach to classify individuals into LTBI and TB reactivation risk clinical designations based on collected biomarker features [76]. Random forest is an ensemble learning method that constructs and aggregates a multitude of decision trees to determine a classification outcome and can detect nonlinear effects of covariate features. The aggregation of many trees increases flexibility and improves accuracy when compared to using a single decision tree, while feature selection ranks a feature’s effect on the classification of a clinical designation, identifying the most important variables for classification [77]. All informatics and data plotting were performed using R version 4.3.1.
The specifics of the random forest algorithm, variable selection, and informatic handling of repeated entries are described in the Supplemental Information. The visual representation of random forest outputs are receiver operator characteristic (ROC) curves [78]. ROC curves graphically depict the trade-off between the true positive rate and false positive rate across various classification thresholds. They can be compared using the area under the curve (AUC), a metric of the model’s ability to discriminate between clinical classification bins. An AUC of one indicates 100% sensitivity and specificity across all thresholds, while an AUC of 0.5 is equivalent to random chance. All ROC curves in our analysis are calculated based on the out-of-bag predictions from random forests [76]. Out-of-bag predictions mimic the leave-one-out mechanism used for evaluating a machine learning model and are less vulnerable to overfitting.
With a dataset comprising 78 possible biomarker features for each subject, the presence of numerous potentially non-informative variables introduces non-robustness and variance into our final label estimation. Our classification pipeline was comprised of two main parts: (1) biomarker feature importance evaluation and (2) variable selection and classification model construction. While there are many existing variable selection methods [79–81], random forest provides a sensible and robust metric, termed variable importance, to gauge each variable’s contribution to prediction accuracy [76]. Empirical confidence intervals can be constructed by leveraging variable importance scores from each tree, which ensures robustness in importance score estimation [82]. Variables with high importance rankings, identified under a predetermined threshold, were integrated into a refined (reduced) random forest model for further training. Classification performance evaluation, AUC, was carried out using out-of-bag samples containing the selected informative variables.
A key challenge in this study arose from repeated records, or multiple samples at different time points from a unique subject, within the dataset. This could lead to correlations between in-bag and out-of-bag data and result in potential biases in prediction accuracy estimates. To mitigate this issue, we proposed a modification to the random forest algorithm. Initially, a single datum was sampled for each individual tree to ensure uniqueness. Subsequently, within the bagging process, repeated observations were managed to ensure proper assignment to in-bag or out-of-bag samples, thereby impeding the robustness of prediction accuracy estimates. Further details are provided in the Supplementary Information.
Results
Patient demographics
This study enrolled 144 individual patients, and all provided a blood draw for QFT testing (Table 1). Seventeen of the patients provided a later second time point and four provided a third time point as part of an ongoing longitudinal profiling project, leading to a total of 165 sample sets included in our analysis. Each set of QFT samples was analyzed independently, and the machine learning algorithm accounted for repeat measurements to minimize correlation bias. Of the 165 samples, 78 came from LTBI+ individuals. LTBI+ subjects were younger and more likely to be male, non-Caucasian, and born outside the United States than the LTBI− subjects. There was no statistical difference in the frequency of BCG vaccination and immunosuppression across these two groups. Within the LTBI+ study population, 50 samples were obtained from individuals designated to be at high reactivation risk and 25 were derived from persons designated as being at low reactivation risk, with the remaining three being indeterminant risk. The high-risk patients were more likely to be male, but no other demographic data was significantly different between these study groups. As expected, the treatment-adjusted predicted cumulative TB risk and annual TB risks, as well as the QFT (TB1 – NIL) IFN-γ concentrations, were statistically lower in the low-risk group compared with the high-risk study group (Table 1).
Classification of LTBI positive subjects using absolute and normalized cytokine concentrations
Using the 165 sample sets in our study, we first aimed to differentiate LTBI+ and LTBI− subjects using the random forest machine learning algorithm adjusted for repeated samples. Leave-one-out cross-validation was completed to determine the ability of the biomarker combinations to identify LTBI+ patients from all others in the study population. The absolute cytokine concentration values from the stimulated plasma samples resulted in a predictive accuracy (AUC) of 89.0% (Fig 1A) and the normalized cytokine values resulted in an accuracy of 89.2% (Fig 1B) using the full data sets of 39 absolute or normalized variables, respectively. Variable reduction was completed by using the variable importance metric from the full data set (S2 and S3 Tables in S2 File) to refine the model. This is done by restricting the model to variables that have the highest standardized importance (variable importance divided by its estimated standard deviation) and refit with the reduced set of variables. The reduced model utilizing eight (out of 39) variables (Fig 1C) achieved a prediction accuracy of 90.0% using absolute cytokine values, while the reduced model with a size of six (Fig 1D) resulted in a prediction accuracy of 91.3% using normalized cytokine values. The normalized cytokine value-derived model improved the classification accuracy by just 1.3% compared to the absolute cytokine value-derived model, indicating the normalized values did not substantially enhance differentiation of the LTBI+ patients from the remaining patients in the study population.
Random forest classification is represented as receiver operator characteristic (ROC) curves when using absolute (A) and normalized (B) stimulated biomarker concentrations as the input variables. The most important variables identified through the variable importance metrics using the full set of absolute (C) and normalized (D) biomarker concentrations were used to generate the reduced data models. The variable importance metrics for all input variables in both models are presented in the Supplemental Information.
The response of IP-10 in Mtb-specific stimulations is the dominant predictive biomarker in both models. There is an overlap in the important biomarkers between models that include IP-10, CCL8, IL-2, and IL-10, which suggests the importance of these biomarkers regardless of which data type was used to generate the model. Five of the eight variables in the reduced absolute cytokine value-derived model are associated with the TB1 stimulated plasma, while all six in the reduced normalized cytokine value-derived model are associated with the TB1 stimulation. This indicates reliance on Mtb-specific peptide stimulations for making the LTBI classification designation. Variables associated with NIL and MIT stimulated plasma samples are still present in the absolute cytokine reduced models, albeit with lower importance metrics. This indicates that contributions from a subject’s functional immune response have value in classifying LTBI status. A key result to highlight is that the reduced models that achieved 90%+ accuracy were generated using different stimulations of just four (normalized) or five (absolute) biomarkers. These results suggest a dependence on a select few cytokines in the multiplexed profiling panel.
The comparison of cytokine distributions between LTBI+ and LTBI− subjects in our population corroborates these selected important variables. All variables used to construct the reduced absolute cytokine-derived model have statistically different cytokine distributions between LTBI+ and LTBI− populations (Fig 2, S4 Table in S2 File). The greatest significance falls in the TB1 stimulated plasma, with IP-10 and CCL8 biomarkers being the most significantly different. Biomarkers IL-10 and IL-15 are significantly different across all three stimulation conditions. Similarly, the top five variables in the normalized cytokine-derived model have statistically different cytokine distributions between positive and negative populations (Fig 3, S4 Table in S2 File). The greatest significance falls in the TB1-related normalized cytokine values, with IP-10, CCL8, and IL-2 showing the greatest significance under TB1-NIL normalized conditions. IP-10 has the most distinction in population distributions between LTBI+ and LTBI− for two normalized conditions, TB1-NIL and TB1−MIT. Biomarkers IL-10 and IL-15 are only significantly different in the MIT-NIL normalized condition and are also the only biomarkers to have significant differences under this normalized condition.
Target concentrations from stimulation conditions of TB1 (A), NIL (B), and MIT (C) that were significantly different between clinical populations of LTBI+ and LTBI− patients are shown. Median value of clinical population represented as a black line. Data visualized on a log scale. Wilcox-Mann-Whitney tests were completed to test significant differences in target concentrations between clinical population distributions. * p ≤ 0.05, ** p ≤ 0.01, *** p ≤ 0.001, **** p ≤ 0.0001.
Normalized conditions of TB1-NIL (A), TB1-MIT (B), and MIT-NIL (C) were significantly different between clinical populations of LTBI+ and LTBI− patients in some measured targets. Median value of clinical population represented as a black line. Wilcox-Mann-Whitney tests were completed to test for significant differences in normalized target concentrations between clinical population distributions. * p ≤ 0.05, ** p ≤ 0.01, *** p ≤ 0.001, **** p ≤ 0.0001.
High risk stratification among LTBI+ subjects
We next aimed to differentiate patients classified as high risk of disease reactivation from all others in the LTBI+ population, which included patients at low risk of reactivation and those at an indeterminant risk. The LTBI+ subset of our study cohort constitutes the clinically relevant population for assessing the risk of reactivation. We used the clinical indicators described in the methods section to classify a patient as high TB reactivation risk given the information provided at sample collection and completed analysis with both absolute and normalized conditions for risk status, as was described for our LTBI status classification workflow.
The absolute cytokine values from the stimulated plasma samples resulted in an accuracy of 74.9% (Fig 4A), and the normalized cytokine values resulted in an accuracy of 63.3% (Fig 4B) using the full data sets of 39 absolute or normalized variables, respectively. The variable importance metrics for the full model are provided for the absolute and normalized biomarkers in S5 and S6 Tables in S2 File, respectively, and the top variables were used to generate the reduced model. The refined absolute value-derived model used six variables (Fig 4C) and achieved an accuracy of 81.9%. With the normalized value input, the refined model resulted in an accuracy of 72.7% using seven variables (Fig 4D). Therefore, the refined absolute value-derived model had an ability to distinguish high-risk classified individuals from all others in the study with 9.2% greater accuracy than the normalized value-derived model.
Random forest classification is represented as receiver operator characteristic (ROC) curves when using absolute (A) and normalized (B) stimulated biomarker concentrations. The most important variables from the full data set were identified through the variable importance metrics of absolute (C) and normalized (D) biomarker concentrations. Reduced data models were generated using the top biomarkers presented here. The variable importance metrics for all input variables in both models are presented in the Supplemental Information.
Within the absolute cytokine value-derived model, the six variables used for the highest-performing reduced model consisted of just four protein biomarkers: IL-10, IL-6, IL-2, and TNF-α. All three stimulation conditions were present, with three emanating from TB1, two from NIL, and one from MIT. Concentrations of IL-10 after stimulation by all three conditions represent half of the top six variables, indicating a strong reliance on IL-10 for high-risk prediction. The variable importance metrics for the normalized value-derived model reveal a similar reliance on IL-10 and IL-2 associated variables, but overall, they have lower importance metrics. Other protein biomarkers, CCL3 and IL-17, are included in the reduced model using normalized conditions. Interestingly, across both models the top IL-10 biomarker is associated with non-Mtb-specific stimulation results, meaning the high-risk classification may be dependent on overall immune function of the LTBI+ patients.
Low risk stratification among LTBI+ subjects
We also aimed to differentiate patients classified as having a low risk of disease reactivation from all others in the LTBI+ population, which included patients at a high risk of reactivation and those at an indeterminant risk. We used the same clinical indicators and machine learning method employed in the high-risk classification. The absolute cytokine values from the stimulated QFT plasma samples resulted in an accuracy of 67.1% (Fig 5A), and the normalized cytokine values resulted in an accuracy of 62.5% (Fig 5B) using the full data sets of 39 absolute or normalized variables, respectively. The variable importance metrics for the full model are provided for the absolute and normalized biomarkers in S7 and S8 Tables in S2 File, respectively, and the top variables were used to generate the reduced model. The refined model used six variables in the absolute value-derived model (Fig 5C) and achieved an accuracy of 78.4%. With the normalized value input, the refined model resulted in an accuracy of 73.6% using four variables (Fig 5D). The refined absolute value-derived model had an ability to distinguish low-risk classified individuals from all others in the study with 4.8% greater accuracy than the normalized value-derived model with six input variables.
Random forest classification is represented as receiver operator characteristic (ROC) curves when using absolute (A) and normalized (B) stimulated biomarker concentrations. The most important variables identified through the variable importance metrics using absolute (C) and normalized (D) biomarker concentrations were used to generate the reduced data models. The variable importance metrics for all input variables in both models are presented in the supplemental information.
The six absolute-related variables used to generate the reduced model consisted of four different biomarkers, IL-2, IL-10, CCL8, and IP-10 with all three stimulations represented. The three unique biomarkers used to generate the reduced model with normalized conditions were similar, IL-2, IL-10, and CCL8, with all three normalized conditions present. Similar to the high-risk classification, there is a high reliance on IL-10 and IL-2 associated variables to accomplish the low-risk classification.
Cytokine distributions between high and low risk patients
We next sought to further elucidate the differences of single biomarkers between the high and low reactivation risk populations. All LTBI+ patients who were classified as either high or low reactivation risk were included and WMW tests were completed to determine if there were significant differences in the distributions of the cytokines between high and low risk groups (S9 Table in S2 File). Targets with significant differences between absolute cytokine concentrations in the QFT stimulated plasma samples from high and low risk individuals are depicted in Fig 6. The ten variables with significantly different distributions between groups showed higher median concentrations in high-risk individuals than in low-risk individuals. IL-10 and IL-2 were imperative in classifying both bins using the random forest method, and both biomarkers also showed significance in absolute level discrimination. IL-10 distributions are significantly different across all three QFT stimulations, while IL-2 has significantly different distributions in TB1 and MIT stimulated plasma samples. Other variables that exhibit a significant difference between high and low risk groups include CCL8 (TB1 and NIL), TNF-α (TB1 and NIL), and IP-10 (TB1). WMW testing was also conducted using normalized variables. Only four significant differences resulted, with two associated with IL-10 and two with IL-2 (S9 Table in S2 File). This compliments the machine learning results, in that the absolute concentrations are better at distinguishing between risk groups than normalized concentrations.
Targets under stimulation conditions of TB1 (A), NIL (B), and MIT (C) that were significantly different between clinical populations of high and low risk LTBI+ patients are shown. Median value of clinical population represented as a black line. Data visualized on log scale. Wilcox-Mann-Whitney tests were completed to test for significant differences in normalized target concentrations between clinical population distributions. * p ≤ 0.05, ** p ≤ 0.01, *** p ≤ 0.001, **** p ≤ 0.0001.
Discussion
This study successfully determined the diagnostic accuracy and identified biomarker signatures for LTBI and TB reactivation risk-based classifications by using QFT stimulated plasma samples coupled with microring resonator-based multiplexed cytokine profiling and machine learning feature selection methods. The 45-minute microring resonator multiplexed biomarker assay required just 210 μL of QFT plasma to simultaneously analyze thirteen cytokines at two sample dilutions. The multiplexed assay had been previously tested using cell stimulation supernatant samples, which are technically laborious to obtain [46,47]. Using QFT-stimulated plasma as the sample input places our method directly in the current TB diagnostic and clinical laboratory workflow, which decreases post-collection sample processing and lowers turnaround times. We successfully transitioned our assay workflow into plasma matrices and analyzed 495 QFT plasma samples.
The enrolled population for this study was representative of the type of individuals and patients commonly screened and evaluated for LTBI and TB in practices across the United States. This includes foreign-born individuals from high TB endemic areas, many with prior history of BCG vaccination, individuals with prior Mtb exposures, immunosuppressed patients, and healthcare workers with patient contact. A challenge to this type of study, and all LTBI studies, is that there is no gold standard diagnostic test for LTBI, and available diagnostic tests are imperfect, which introduces some inherent and unavoidable uncertainty into the initial clinical designations [83]. However, our study subjects were carefully selected to minimize heterogeneity in the study groups and current clinical diagnostic standards were rigorously followed [49]. Our study also included relevant clinical characteristics to analyze our immunoprofiling data, which has been suggested for optimal subject interpretation [84]. Simultaneous testing from the same blood samples and the use of internal and quality controls for all immunoassays also minimizes the risk of false-positive reactions and/or assay-related variability in our study results.
We report a multiplexed protein panel coupled with a reduced random forest model that differentiated LTBI+ and LTBI− patients with an accuracy of 90% or better using absolute or normalized biomarker values. The normalized cytokine values only improved the predictive accuracy marginally but displayed a greater separation in cytokine distributions between LTBI designations when compared to the absolute cytokine levels. The normalization could potentially account for patient-to-patient variability by subtracting the patient-specific cytokine concentrations induced by negative- and positive-control antigen stimulations. Using normalized conditions can improve the generalizability of the model and potentially reduce misclassification based on inherent variability in the basal immune response at the time of blood draw. Additionally, we combined both absolute and normalized cytokine concentrations into one model, resulting in 78 input variables per patient for classification (S3 Fig in S2 File). The accuracy did not improve from the models using absolute and normalized cytokine concentrations independently, suggesting that each individual data set is sufficient to reach the maximum accuracy given this set of biomarkers in our population. Although adding more variables potentially improves the amount of information contained in the data, the sample size in our study may not be able to support such a large model with high prediction accuracy. This phenomenon is commonly known as the curse of dimensionality in statistics.
The most important variables for making the LTBI status classification were similar between the absolute cytokine concentration-derived model and normalized cytokine level-derived model. The main classification biomarker across both methods was IP-10, a chemokine with high correlation to TB infection. Additionally, CCL8, IL-2, and IL-10 are present in the reduced models for both variable inputs and IL-15 is present in the absolute variable-derived model. These targets have been previously identified using other multiplexed biomarker analysis as having a relationship to LTBI or TB designation [36,37,46,47,68,71,85–92]. Our identification of diagnostic biomarkers with literature precedent lends credibility to our overall analytic and informatics approach. Notably, though, our results were achieved using a more rapid and automated assay format relative to previous analyses. Furthermore, all three stimulation conditions were represented in both absolute and normalized important variables, indicating that reliance on Mtb-specific stimulation responses and responses derived from inherent immune system activation are both important to classify LTBI patients. The control antigen stimulations are not typically used in LTBI classifications, but they provide added significance in our biomarker profiling results. Interestingly, absolute concentrations of IP-10, CCL8, and IL-2 are significantly higher in the LTBI+ population, but all remaining significantly different biomarkers are lower in this population. This indicates that increased levels of our top predictive variables and decreased levels of other predictive variables are being taken together by the model to perform the LTBI status classification.
Overall, both absolute and normalized models achieved 90% or better accuracy for differentiating LTBI+ patients from the remainder of our sample cohort. The normalized biomarker-derived model resulted in a marginally higher AUC and required fewer variables to achieve this classification. The normalized cytokine concentrations reveal great importance for IP-10, CCL8, IL-2, and IL-10 to identify LTBI+ subjects and we demonstrated the applicability of our multiplexed microring platform to rapidly analyze the residual QFT samples.
Protein-based risk classification is an important aspect of our study. Most TB reactivation risk classifications are based on RNA signatures. RNA is biochemically less stable than proteins and requires more specialized (RNAse-free) sample processing. Here, we integrated a risk of TB reactivation assessment into the same workflow as standard QFT testing and our LTBI classification method. The TB reactivation risk classification was completed within the LTBI+ population (n = 78) using the same biomarker measurements used for the LTBI status assessment model to classify LTBI+ patients at high or low risk of reactivation upon initial QFT assessment.
Compared to the LTBI classification, the risk classification models have less predictive accuracy. This could be due to a greater variability in subjects classified clinically as having a certain TB risk level. The classification of risk has no gold standard and relies on the analysis of many epidemiological and clinical factors determined with clinical expertise. However, the estimated annual and cumulative TB risk score based on unbiased multifactorial predicting modeling also confirmed the statistical differences in risk of TB reactivation between the low-risk and high-risk study groups (Table 1). Conversely, while LTBI status also does not have a defined gold standard, it is more objectively defined using clinical and quantitative metrics, including QFT results and additional testing. The lower predictive accuracy for risk classification could also be due to lower signal size, as the risk of reactivation assessment was restricted to the LTBI+ population. This population is likely more homogenous than the overall study population, and therefore less amenable to subcategorization via machine learning models.
Our interest in identifying orthogonal risk signatures for both low and high reactivation risk is based on an emerging role of how diagnostics could guide patient interventions and management. Although present guidelines recommend antibiotic therapy for LTBI+ subjects in the United States, concerns for targeted interventions and antibiotic stewardship motivate a need to identify subsets of patients that would benefit most from therapeutic intervention or longitudinal monitoring. Only 5 to 10% of LTBI+ patients reactivate to active TB, and the incidence of reactivation declines over time since the primary infection or exposure [4]. A biomarker-based signature that could establish a patient as being at high TB reactivation risk would provide a clear identification of patients most likely to benefit from preventive therapy. Conversely, a signature establishing low risk of TB reactivation at initial QFT sampling might allow those patients to be monitored for signs of potential emergence of disease, rather than the immediate initiation of therapy. This would reduce the chance of unnecessary adverse effects associated with antimicrobials in the large proportion of LTBI+ patients who will never progress to active TB. Furthermore, a diagnostic signature that develops over time could provide even more guidance for longitudinal monitoring of populations at a higher risk of contracting LTBI and progressing to active disease, such as LTBI+ patients, close contacts, or healthcare workers in TB endemic areas.
The high- and low-risk classifications in our study are inversely related but unique in terms of the goal population of the respective classification. The high-risk classification aims to identify the biomarker signature that uniquely identifies the high TB reactivation risk population from all others (low risk and indeterminant) in the LTBI+ cohort. Conversely, the low-risk classification aims to identify biomarkers that can specifically select the low-risk population from all others. Initially, we presumed that different variables or biomarkers would classify the different populations. The variable importance results revealed that similar variables are important for both classifications. The high overlap in important variables and statistically significant differences in concentration distributions between high and low reactivation risk groups implies the reactivation risk can be thought of as a spectrum, with concentrations of specific biomarker conditions responsible for differentiating both high and low risk individuals, as opposed to different protein biomarkers delineating the separate risk populations.
Both reactivation risk models had the greatest predictive accuracy when derived using the absolute cytokine values compared to normalized values. The global response of certain biomarkers, mainly IL-10 and IL-2, in the QFT stimulations is indicative of the immune surveillance to infection. This host response led to greater designation accuracy and greater significant differences between cytokine levels in clinical populations, in comparison to the normalized conditions. Absolute values of IL-10 and IL-2 account for four of the six variables in the reduced models for both classifications. All the IL-10 and IL-2 associated important variables show statistically significant differences in cytokine concentration distributions between respective clinical populations, with higher concentrations in the population at high risk of TB reactivation. The higher levels of IL-10 in the group at a high risk of TB reactivation may be indicative of an attenuation of the Th1 response that inhibits the formation of mature fibrotic granuloma through suppression of macrophage and dendritic cell functions, leading to a disruption in the dynamic structure of the granuloma [93]. The higher levels of IL-2 may be associated with antigenic memory response and the selected Th2 induced IL-10 response [23]. The increased levels of IL-10 and IL-2 in our LTBI+ group at a high reactivation risk suggests they could be mediators of TB reactivation and disease progression, and our results indicate they can differentiate our LTBI+ subject population by clinically designated reactivation risk at the time of QFT sample collection.
The normalized variables led to a marginally better predictive accuracy for LTBI status designation, while the absolute value variables proved to be best for the risk prediction model. In comparing the protein biomarkers necessary to construct the best reduced models across all three designations, a group of just six proteins stands out, namely IP-10, CCL8, IL-2, IL-10, IL-6, and TNF-α. This biomarker reduction has implications for future work on reducing panel size towards developing smaller biomarker panels amenable across multiple cytokine detection platforms. However, it is important to note that our more highly multiplexed technology was essential in allowing us to identify this reduced number of target analytes. All three QFT stimulations were in each of the reduced variable lists, so eliminating a stimulation is not supported by our results.
IFN-γ is a cytokine heavily utilized in TB diagnostics. However, IFN-γ is not a selected biomarker in any of our reduced models. IFN-γ is a strong predictor of active TB, as IFN-γ is one of the drivers of inducing or clearing granuloma formation [7,26]. However, recent literature details specific instances where LTBI equilibrium is IFN-γ independent with all other immunological elements within comparable ranges. There are multiple factors that could account for this established TB biomarker not contributing to our LTBI clinical significance. One explanation is that the dynamic equilibrium that is yielding the latent Mtb infection state and the propensity for it to shift towards TB progression either does not rely on IFN-γ production, or that other peripheral immune pathways contribute more to this dynamic shift. Alternatively, it could be attributed to the clinical designations themselves, where the LTBI+ designation is determined by the QFT IFN-γ results; therefore, heterogeneous populations in both clinical categories lead to an insignificant contribution.
A limitation of our efforts to selectively identify LTBI+ populations is the near absence of active TB patients within the study population. Developing diagnostics that can distinguish LTBI+ from active TB populations is important, as these groups traditionally exhibit similar IGRA and molecular testing results. However, methods to detect active TB infection are well established and the low frequency of active TB in our study cohort accurately reflects the overall population of individuals receiving QFT testing at the Mayo Clinic. This implies there is a greater need in this population to distinguish LTBI+ individuals from other QFT tested patients than to distinguish LTBI+ and active TB individuals. Nevertheless, future work will aim to include an active TB population, including early forms of TB, to test the sensitivity and specificity of classifying LTBI+ patients from active TB patients. Another limitation of our approach is that while the LTBI status is more objectively defined, the risk classifications involve more subjective assessments. This can impart high variability in patient classification and lead to suboptimal predictive accuracies. However, these limitations represent a significant challenge across all LTBI diagnostic research, and the absence of gold standards highlights the urgency for continued research efforts. Finally, while cytokine biomarker panels containing a greater number of targets are commercially available, many still require manual manipulation which can impart operator variability. The biosensor platform employed in this study was limited to sixteen analytes, including control markers. However, each analyte was measured in four technical replicates and all samples were processed through automated microfluidic handling.
A goal of our study was to use QFT samples collected at an initial screen to classify risk of reactivation from LTBI to active TB. Patients with high risk have a greater potential to progress to active TB, while those with low risk have a lower potential to reactivate. Treatment and monitoring for the two populations can look different. Prognostic biomarkers of reactivation or disease progression have been identified through longitudinal studies that monitor biomarker profiles in patients before and after they progress to disease. These biomarkers are useful but reactivating to active TB over the course of a study can be related to environmental or host factors not captured with the biomarker profiles. Additionally, if treatments are provided to those identified as LTBI+ early in the study, the prognostic biomarkers are limited to a population of treated patients, rather than to all LTBI+ individuals. Furthermore, most work is focused on progression biomarkers and does not aim to find non-progression biomarkers. Our work was not focused on using progression to active TB as an endpoint, rather, we aimed to use biomarkers to classify patients by reactivation risk upon initial QFT sample collection. The implications of identifying these biomarkers are that they can be generalized to all LTBI+ patients through larger, multi-center studies and could be used to adjust patient monitoring schedules. Those identified as low risk could receive specific management recommendations with less frequent monitoring, which would be more feasible for individuals, decrease the risk of medication side effects, and potentially save time and resources. Future work in this ongoing study includes increasing patient cohort size, inclusion of subjects with active TB, those with early forms of TB, and recent close contact of pulmonary TB patients, and working with populations in high-TB burden environments to apply our workflow in high need areas.
Conclusion
This work successfully applied a microring resonator multiplexed sensing platform within a clinically relevant workflow to the TB diagnostic space. Our goals were to use the concentrations of thirteen cytokines in QFT stimulated plasma to identify biomarker signatures that could select LTBI+ subjects and classify their risk of reactivation to active TB. Within one 13-plex cytokine biomarker panel, we demonstrated accuracies over 90% to differentiate LTBI+ patients and accuracies of nearly 80% for stratifying those patients as high or low risk of reactivation. Variable importance metrics highlighted a significant reliance on various normalized conditions of IP-10, CCL8, IL-10, and IL-2 for making the LTBI classification, while concentrations from various absolute concentrations of IL-10 and IL-2 were extremely important for classifying LTBI + patients as high or low TB reactivation risk. Our work employed a rapid technique with high multiplexing capabilities to profile clinically relevant samples and identify biomarkers of importance consistent with previous literature. The inclusion of low TB reactivation risk designation in conjunction with high TB reactivation risk designation provides a unique way to stratify LTBI+ patients with potential future implications in clinical monitoring and treatment management.
Supporting information
S1 File. Supporting information file.
The supporting information file contains additional experimental details, including a multiplexed calibration curve for all thirteen targets in the panel; example data trace from a patient specimen; LTBI classification ROC curves; antibody and reagent information; variable importance metrics (VIMs) for absolute and normalized cytokine values, median values of absolute and normalized cytokine levels for different clinical designations.
https://doi.org/10.1371/journal.pone.0316648.s001
(CSV)
S2 File. Supporting information file.
The supporting information file contains additional experimental details, including a multiplexed calibration curve for all thirteen targets in the panel; example data trace from a patient specimen; LTBI classification ROC curves; antibody and reagent information; variable importance metrics (VIMs) for absolute a normalized cytokine values. The description for the CSV file should be "Clinical subject data and median values of absolute and normalized cytokine levels for different clinical designations.
https://doi.org/10.1371/journal.pone.0316648.s002
(DOCX)
Acknowledgments
We are grateful to the study participants and their families, the staff of the Mayo Clinic Institutional Review Board, the Pulmonary Research Unit and the staff and colleagues of the Public Health Olmsted County Tuberculosis Clinic, Mayo Mycobacterial and Bronchiectasis Clinic, the Mayo Clinic Center for Tuberculosis, and the Mayo Infectious Disease Serology Laboratory for their diligent and valuable support. Anusha Vajrala and Antigone Wilson (UM Chemistry) are also acknowledged for their support.
We also acknowledge a continued collaboration with Genalyte, Inc. regarding the development of multiplexed immunoassays; however, Genalyte, Inc. had no role in assay performance or data interpretation.
References
- 1. Tuberculosis (TB). n.d. [cited 19 Feb 2024. ]. Available: https://www.who.int/news-room/fact-sheets/detail/tuberculosis
- 2. Global Tuberculosis Report. 2024 [cited 26 Apr 2024. ]. Available: https://www.who.int/teams/global-tuberculosis-programme/tb-reports
- 3. Migliori GB, Ong CWM, Petrone L, D’Ambrosio L, Centis R, Goletti D. The definition of tuberculosis infection based on the spectrum of tuberculosis disease. Breathe (Sheff). 2021;17(3):210079. pmid:35035549
- 4. Escalante P, Vadiyala MR, Pathakumari B, Marty PK, Van Keulen VP, Hilgart HR, et al. New diagnostics for the spectrum of asymptomatic TB: from infection to subclinical disease. Int J Tuberc Lung Dis. 2023;27(7):499–505. pmid:37353874
- 5. Pai M, Behr MA, Dowdy D, Dheda K, Divangahi M, Boehme CC, et al. Tuberculosis. Nat Rev Dis Primers. 2016;2:16076. pmid:27784885
- 6. Ernst JD. The immunological life cycle of tuberculosis. Nat Rev Immunol. 2012;12(8):581–91. pmid:22790178
- 7. Flynn JL, Chan J. Immunology of tuberculosis. Annu Rev Immunol. 2001;19:93–129. pmid:11244032
- 8. Mukherjee S, Perveen S, Negi A, Sharma R. Evolution of tuberculosis diagnostics: From molecular strategies to nanodiagnostics. Tuberculosis (Edinb). 2023;140:102340. pmid:37031646
- 9. Houben RMGJ, Dodd PJ. The global burden of latent tuberculosis infection: a re-estimation using mathematical modelling. PLoS Med. 2016;13(10):e1002152. pmid:27780211
- 10. CDCGlobal. Controlling the Global TB Epidemic. In: Centers for Disease Control and Prevention [Internet]. 2024 [cited 29 May 2024. ]. Available: https://www.cdc.gov/globalhivtb/who-we-are/about-us/globaltb/globaltb.html
- 11. Pai M, Denkinger CM, Kik SV, Rangaka MX, Zwerling A, Oxlade O, et al. Gamma interferon release assays for detection of Mycobacterium tuberculosis infection. Clin Microbiol Rev. 2014;27(1):3–20. pmid:24396134
- 12. Mack U, Migliori GB, Sester M, Rieder HL, Ehlers S, Goletti D, et al. LTBI: latent tuberculosis infection or lasting immune responses to M. tuberculosis? A TBNET consensus statement. Eur Respir J. 2009;33(5):956–73. pmid:19407047
- 13. Rangaka MX, Wilkinson KA, Glynn JR, Ling D, Menzies D, Mwansa-Kambafwile J, et al. Predictive value of interferon-γ release assays for incident active tuberculosis: a systematic review and meta-analysis. Lancet Infect Dis. 2012;12(1):45–55. pmid:21846592
- 14. Getahun H, Matteelli A, Chaisson RE, Raviglione M. Latent Mycobacterium tuberculosis infection. N Engl J Med. 2015;372(22):2127–35. pmid:26017823
- 15. Diel R, Loddenkemper R, Nienhaus A. Predictive value of interferon-γ release assays and tuberculin skin testing for progression from latent TB infection to disease state: a meta-analysis. Chest. 2012;142(1):63–75. pmid:22490872
- 16. Abubakar I, Drobniewski F, Southern J, Sitch AJ, Jackson C, Lipman M, et al. Prognostic value of interferon-γ release assays and tuberculin skin test in predicting the development of active tuberculosis (UK PREDICT TB): a prospective cohort study. Lancet Infect Dis. 2018;18(10):1077–87. pmid:30174209
- 17. Kaforou M, Broderick C, Vito O, Levin M, Scriba TJ, Seddon JA. Transcriptomics for child and adolescent tuberculosis. Immunol Rev. 2022;309(1):97–122. pmid:35818983
- 18. Suliman S, Thompson EG, Sutherland J, Weiner J 3rd, Ota MOC, Shankar S, et al. Four-Gene Pan-African Blood Signature Predicts Progression to Tuberculosis. Am J Respir Crit Care Med. 2018;197(9):1198–208. pmid:29624071
- 19. Leong S, Zhao Y, Ribeiro-Rodrigues R, Jones-López EC, Acuña-Villaorduña C, Rodrigues PM, et al. Cross-validation of existing signatures and derivation of a novel 29-gene transcriptomic signature predictive of progression to TB in a Brazilian cohort of household contacts of pulmonary TB. Tuberculosis (Edinb). 2020;120:101898. pmid:32090859
- 20. Zak DE, Penn-Nicholson A, Scriba TJ, Thompson E, Suliman S, Amon LM, et al. A blood RNA signature for tuberculosis disease risk: a prospective cohort study. Lancet. 2016;387(10035):2312–22. pmid:27017310
- 21. Xi X, Li T, Huang Y, Sun J, Zhu Y, Yang Y, et al. RNA biomarkers: frontier of precision medicine for Cancer. Noncoding RNA. 2017;3(1):9. pmid:29657281
- 22. Porter BW, Venkatappa TK. Uncloaking an ancient adversary: Can pathogen biomarker elicitors play a role in confirming extrapulmonary TB and latent TB infection?. Tuberculosis (Edinb). 2018;113:30–7. pmid:30514511
- 23. Essone PN, Leboueny M, Maloupazoa Siawaya AC, Alame-Emane AK, Aboumegone Biyogo OC, Dapnet Tadatsin PH, et al. M. tuberculosis infection and antigen specific cytokine response in healthcare workers frequently exposed to tuberculosis. Sci Rep. 2019;9(1):8201. pmid:31160610
- 24. Kim SY, Park MS, Kim YS, Kim SK, Chang J, Lee HJ, et al. The responses of multiple cytokines following incubation of whole blood from TB patients, latently infected individuals and controls with the TB antigens ESAT-6, CFP-10 and TB7.7. Scand J Immunol. 2012;76(6):580–6. pmid:22946827
- 25. Kellar KL, Gehrke J, Weis SE, Mahmutovic-Mayhew A, Davila B, Zajdowicz MJ, et al. Multiple cytokines are released when blood from patients with tuberculosis is stimulated with Mycobacterium tuberculosis antigens. PLoS One. 2011;6(11):e26545. pmid:22132075
- 26. Chegou NN, Heyckendorf J, Walzl G, Lange C, Ruhwald M. Beyond the IFN-γ horizon: biomarkers for immunodiagnosis of infection with Mycobacterium tuberculosis. Eur Respir J. 2014;43(5):1472–86. pmid:24311770
- 27. Wang X, Jiang J, Cao Z, Yang B, Zhang J, Cheng X. Diagnostic performance of multiplex cytokine and chemokine assay for tuberculosis. Tuberculosis (Edinb). 2012;92(6):513–20. pmid:22824465
- 28. Lu LL, Smith MT, Yu KKQ, Luedemann C, Suscovich TJ, Grace PS, et al. IFN-γ-independent immune markers of Mycobacterium tuberculosis exposure. Nat Med. 2019;25(6):977–87. pmid:31110348
- 29. MacLean E, Broger T, Yerlikaya S, Fernandez-Carballo BL, Pai M, Denkinger CM. A systematic review of biomarkers to detect active tuberculosis. Nat Microbiol. 2019;4(5):748–58. pmid:30804546
- 30. Chegou NN, Sutherland JS, Malherbe S, Crampin AC, Corstjens PLAM, Geluk A, et al. Diagnostic performance of a seven-marker serum protein biosignature for the diagnosis of active TB disease in African primary healthcare clinic attendees with signs and symptoms suggestive of TB. Thorax. 2016;71(9):785–94. pmid:27146200
- 31. Chegou NN, Sutherland JS, Namuganga A-R, Corstjens PL, Geluk A, Gebremichael G, et al. Africa-wide evaluation of host biomarkers in QuantiFERON supernatants for the diagnosis of pulmonary tuberculosis. Sci Rep. 2018;8(1):2675. pmid:29422548
- 32. Manngo PM, Gutschmidt A, Snyders CI, Mutavhatsindi H, Manyelo CM, Makhoba NS, et al. Prospective evaluation of host biomarkers other than interferon gamma in QuantiFERON Plus supernatants as candidates for the diagnosis of tuberculosis in symptomatic individuals. J Infect. 2019;79(3):228–35. pmid:31319143
- 33. Jacobs R, Malherbe S, Loxton AG, Stanley K, van der Spuy G, Walzl G, et al. Identification of novel host biomarkers in plasma as candidates for the immunodiagnosis of tuberculosis disease and monitoring of tuberculosis treatment response. Oncotarget. 2016;7(36):57581–92. pmid:27557501
- 34. Nogueira BMF, Krishnan S, Barreto-Duarte B, Araújo-Pereira M, Queiroz ATL, Ellner JJ, et al. Diagnostic biomarkers for active tuberculosis: progress and challenges. EMBO Mol Med. 2022;14(12):e14088. pmid:36314872
- 35. Mihret A, Bekele Y, Bobosha K, Kidd M, Aseffa A, Howe R, et al. Plasma cytokines and chemokines differentiate between active disease and non-active tuberculosis infection. J Infect. 2013;66(4):357–65. pmid:23178506
- 36. Qiu B, Liu Q, Li Z, Song H, Xu D, Ji Y, et al. Evaluation of cytokines as a biomarker to distinguish active tuberculosis from latent tuberculosis infection: a diagnostic meta-analysis. BMJ Open. 2020;10(10):e039501. pmid:33033030
- 37. Awoniyi DO, Teuchert A, Sutherland JS, Mayanja-Kizza H, Howe R, Mihret A, et al. Evaluation of cytokine responses against novel Mtb antigens as diagnostic markers for TB disease. J Infect. 2016;73(3):219–30. pmid:27311746
- 38. Won E-J, Choi J-H, Cho Y-N, Jin H-M, Kee HJ, Park Y-W, et al. Biomarkers for discrimination between latent tuberculosis infection and active tuberculosis disease. J Infect. 2017;74(3):281–93. pmid:27871809
- 39. Wang S, Li Y, Shen Y, Wu J, Gao Y, Zhang S, et al. Screening and identification of a six-cytokine biosignature for detecting TB infection and discriminating active from latent TB. J Transl Med. 2018;16(1):206. pmid:30029650
- 40. La Manna MP, Orlando V, Li Donni P, Sireci G, Di Carlo P, Cascio A, et al. Identification of plasma biomarkers for discrimination between tuberculosis infection/disease and pulmonary non tuberculosis disease. PLoS One. 2018;13(3):e0192664. pmid:29543810
- 41. Stepanovich GE, Chapman CA, Meserve KL, Sturza JM, Ellsworth LA, Bailey RC, et al. Chorioamnionitis-exposure alters serum cytokine trends in premature neonates. J Perinatol. 2023;43(6):758–65. pmid:36539561
- 42. Qavi AJ. Rapid detection of an Ebola biomarker with optical microring resonators. OPEN ACCESS. 2022;22.
- 43. Graybill RM, Cardenosa-Rubio MC, Yang H, Johnson MD, Bailey RC. Multiplexed microRNA expression profiling by combined asymmetric PCR and label-free detection using silicon photonic sensor Arrays. Anal Methods. 2018;10(14):1618–23. pmid:30275912
- 44. Wade JH, Alsop AT, Vertin NR, Yang H, Johnson MD, Bailey RC. Rapid, multiplexed phosphoprotein profiling using silicon photonic sensor arrays. ACS Cent Sci. 2015;1(7):374–82. pmid:26539563
- 45. Washburn AL, Shia WW, Lenkeit KA, Lee S-H, Bailey RC. Multiplexed cancer biomarker detection using chip-integrated silicon photonic sensor arrays. Analyst. 2016;141(18):5358–65. pmid:27400767
- 46. Robison HM, Escalante P, Valera E, Erskine CL, Auvil L, Sasieta HC, et al. Precision immunoprofiling to reveal diagnostic signatures for latent tuberculosis infection and reactivation risk stratification. Integr Biol (Camb). 2019;11(1):16–25. pmid:30722034
- 47. Robison HM, Chapman CA, Zhou H, Erskine CL, Theel E, Peikert T, et al. Risk assessment of latent tuberculosis infection through a multiplexed cytokine biosensor assay and machine learning feature selection. Sci Rep. 2021;11(1):20544. pmid:34654869
- 48. El Sahly HM, Adams GJ, Soini H, Teeter L, Musser JM, Graviss EA. Epidemiologic differences between United States- and foreign-born tuberculosis patients in Houston, Texas. J Infect Dis. 2001;183(3):461–8. pmid:11133378
- 49. Lewinsohn DM, Leonard MK, LoBue PA, Cohn DL, Daley CL, Desmond E, et al. Official American thoracic society/infectious diseases society of america/centers for disease control and prevention clinical practice guidelines: diagnosis of tuberculosis in adults and children. Clin Infect Dis. 2017;64(2):111–5. pmid:28052967
- 50. CDC. Clinical Overview of Latent Tuberculosis Infection. In: Tuberculosis (TB) [Internet]. 2024 [cited 1 Oct 2024. ]. Available: https://www.cdc.gov/tb/hcp/clinical-overview/latent-tuberculosis-infection.html
- 51. Escalante P, Peikert T, Van Keulen VP, Erskine CL, Bornhorst CL, Andrist BR, et al. Combinatorial immunoprofiling in latent tuberculosis infection. toward better risk stratification. Am J Respir Crit Care Med. 2015;192(5):605–17. pmid:26030344
- 52. Menzies D, Gardiner G, Farhat M, Greenaway C, Pai M. Thinking in three dimensions: a web-based algorithm to aid the interpretation of tuberculin skin test results. Int J Tuberc Lung Dis. 2008;12(5):498–505. pmid:18419884
- 53.
QuantiFERON-TB Gold (QFT) ELISA Package Insert. 2023.
- 54. QuantiFERON-TB Gold Plus ELISA Kit Package Insert - QIAGEN. 2024 [cited 19 Feb 2024. ]. Available: https://www.qiagen.com/us/resources/resourcedetail?id=ac068fc7-a994-4443-ac7c-dda43ce2bc5e&lang=en
- 55. Theel ES, Hilgart H, Breen-Lyles M, McCoy K, Flury R, Breeher LE, et al. Comparison of the QuantiFERON-TB gold plus and QuantiFERON-TB gold in-tube interferon gamma release assays in patients at risk for tuberculosis and in health care workers. J Clin Microbiol. 2018;56(7):e00614-18. pmid:29743310
- 56. Aydin S. A short history, principles, and types of ELISA, and our laboratory experience with peptide/protein analyses using ELISA. Peptides. 2015;72:4–15. pmid:25908411
- 57.
Cerda-Kipper AS, Montiel BE, Hosseini S. Immunoassays | Radioimmunoassays and Enzyme-Linked Immunosorbent Assay⋆. In: Worsfold P, Poole C, Townshend A, Miró M, editors. Encyclopedia of Analytical Science (Third Edition). Oxford: Academic Press; 2019. 55–75. https://doi.org/10.1016/b978-0-12-409547-2.14510-x
- 58.
Alhajj M, Zubair M, Farhana A. Enzyme Linked Immunosorbent Assay. StatPearls. Treasure Island (FL): StatPearls Publishing; 2023. Available: http://www.ncbi.nlm.nih.gov/books/NBK555922/
- 59. Krishnan N, Robertson BD, Thwaites G. Pathways of IL-1β secretion by macrophages infected with clinical Mycobacterium tuberculosis strains. Tuberculosis (Edinb). 2013;93(5):538–47. pmid:23849220
- 60. Millington KA, Innes JA, Hackforth S, Hinks TSC, Deeks JJ, Dosanjh DPS. Dynamic relationship between IFN-γ and IL-2 profile of Mycobacterium tuberculosis-specific T cells and antigen load. J Immunol. 2007;178:5217–26.
- 61. Martinez AN, Mehra S, Kaushal D. Role of interleukin 6 in innate immunity to Mycobacterium tuberculosis infection. J Infect Dis. 2013;207(8):1253–61. pmid:23359591
- 62. Redford PS, Murray PJ, O’Garra A. The role of IL-10 in immune regulation during M. tuberculosis infection. Mucosal Immunology. 2011;4(3):261–70.
- 63. Umemura M, Nishimura H, Hirose K, Matsuguchi T, Yoshikai Y. Overexpression of IL-15 in vivo enhances protection against Mycobacterium bovis bacillus Calmette-Guérin infection via augmentation of NK and T cytotoxic 1 responses. J Immunol. 2001;167(2):946–56. pmid:11441103
- 64. Okamoto Yoshida Y, Umemura M, Yahagi A, O’Brien RL, Ikuta K, Kishihara K, et al. Essential role of IL-17A in the formation of a mycobacterial infection-induced granuloma in the lung. J Immunol. 2010;184(8):4414–22. pmid:20212094
- 65. Hasan Z, Cliff JM, Dockrell HM, Jamil B, Irfan M, Ashraf M, et al. CCL2 responses to Mycobacterium tuberculosis are associated with disease severity in tuberculosis. PLoS One. 2009;4(12):e8459. pmid:20041183
- 66. Singh B, Chitra J, Selvaraj P. CCL2, CCL3 and CCL4 gene polymorphisms in pulmonary tuberculosis patients of South India. Int J Immunogenet. 2014;41(2):98–104. pmid:23998687
- 67. Chegou NN, Black GF, Kidd M, van Helden PD, Walzl G. Host markers in QuantiFERON supernatants differentiate active TB from latent TB infection: preliminary report. BMC Pulm Med. 2009;9:21. pmid:19445695
- 68. Ruhwald M, Bjerregaard-Andersen M, Rabna P, Eugen-Olsen J, Ravn P. IP-10, MCP-1, MCP-2, MCP-3, and IL-1RA hold promise as biomarkers for infection with M. tuberculosis in a whole blood based T-cell assay. BMC Res Notes. 2009;2:19. pmid:19193208
- 69. Sutherland JS, Young JM, Peterson KL, Sanneh B, Whittle HC, Rowland-Jones SL, et al. Polyfunctional CD4(+) and CD8(+) T cell responses to tuberculosis antigens in HIV-1-infected patients before and after anti-retroviral treatment. J Immunol. 2010;184(11):6537–44. pmid:20435929
- 70. Tang NL-S, Fan HPY, Chang KC, Ching JKL, Kong KPS, Yew WW, et al. Genetic association between a chemokine gene CXCL-10 (IP-10, interferon gamma inducible protein 10) and susceptibility to tuberculosis. Clin Chim Acta. 2009;406(1–2):98–102. pmid:19523460
- 71. Corrêa R da S, Rodrigues LS, Pereira LHL, Nogueira OC, Leung J, Sousa M da S, et al. Neutrophil CD64 expression levels in IGRA-positive individuals distinguish latent tuberculosis from active disease. Mem Inst Oswaldo Cruz. 2019;114:e180579. pmid:30970080
- 72. Harari A, Rozot V, Bellutti Enders F, Perreau M, Stalder JM, Nicod LP, et al. Dominant TNF-α+ Mycobacterium tuberculosis-specific CD4+ T cell responses discriminate between latent infection and active disease. Nat Med. 2011;17(3):372–6. pmid:21336285
- 73. Meserve K, Qavi AJ, Aman MJ, Vu H, Zeitlin L, Dye JM, et al. Detection of biomarkers for filoviral infection with a silicon photonic resonator platform. STAR Protoc. 2022;3(4):101719. pmid:36153732
- 74. Mann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. Ann Math Statist. 1947;18(1):50–60.
- 75.
McKnight PE, Najab J. Mann-Whitney U Test. The Corsini Encyclopedia of Psychology. John Wiley & Sons, Ltd; 2010. 1–1. https://doi.org/10.1002/9780470479216.corpsy0524
- 76. Breiman L. Random Forests. Mach Learn. 2001;45(1):5–32.
- 77. Zhu R, Zeng D, Kosorok MR. Reinforcement Learning Trees. J Am Stat Assoc. 2015;110(512):1770–84. pmid:26903687
- 78. Fawcett T. ROC graphs: Notes and practical considerations for researchers. Mach Learn. 2004;31(1):1–38.
- 79. Fan J, Lv J. A Selective overview of variable selection in high dimensional feature space. Stat Sin. 2010;20(1):101–48. pmid:21572976
- 80. Guyon I, Eliseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003;3:1157–82.
- 81. Pan R, Wang H, Li R. Ultrahigh-dimensional multiclass linear discriminant analysis by pairwise sure independence screening. J Am Stat Assoc. 2016;111(513):169–79. pmid:28127109
- 82. Ishwaran H, Lu M. Standard errors and confidence intervals for variable importance in random forest regression, classification, and survival. Stat Med. 2019;38(4):558–82. pmid:29869423
- 83.
Pai M, Behr M. Latent mycobacterium tuberculosis infection and interferon-gamma release assays. 2016.
- 84. Burel JG, Qian Y, Lindestam Arlehamn C, Weiskopf D, Zapardiel-Gonzalo J, Taplitz R, et al. An integrated workflow to assess technical and biological variability of cell population frequencies in human peripheral blood by flow cytometry. J Immunol. 2017;198(4):1748–58. pmid:28069807
- 85. Liu M, Guo S, Hibbert JM, Jain V, Singh N, Wilson NO, et al. CXCL10/IP-10 in infectious diseases pathogenesis and potential therapeutic implications. Cytokine Growth Factor Rev. 2011;22(3):121–30. pmid:21802343
- 86. Petrone L, Cannas A, Vanini V, Cuzzi G, Aloi F, Nsubuga M, et al. Blood and urine inducible protein 10 as potential markers of disease activity. Int J Tuberc Lung Dis. 2016;20(11):1554–61. pmid:27776600
- 87. Corstjens PLAM, Tjon Kon Fat EM, de Dood CJ, van der Ploeg-van Schip JJ, Franken KLMC, Chegou NN, et al. Multi-center evaluation of a user-friendly lateral flow assay to determine IP-10 and CCL4 levels in blood of TB and non-TB cases in Africa. Clin Biochem. 2016;49(1–2):22–31. pmid:26285074
- 88. Li H, Ren W, Liang Q, Zhang X, Li Q, Shang Y, et al. A novel chemokine biomarker to distinguish active tuberculosis from latent tuberculosis: a cohort study. QJM. 2023;116(12):1002–9. pmid:37740371
- 89. Borgström E, Andersen P, Atterfelt F, Julander I, Källenius G, Maeurer M, et al. Immune responses to ESAT-6 and CFP-10 by FASCIA and multiplex technology for diagnosis of M. tuberculosis infection; IP-10 is a promising marker. PLoS One. 2012;7(11):e43438. pmid:23144772
- 90. Biselli R, Mariotti S, Sargentini V, Sauzullo I, Lastilla M, Mengoni F, et al. Detection of interleukin-2 in addition to interferon-gamma discriminates active tuberculosis patients, latently infected individuals, and controls. Clin Microbiol Infect. 2010;16(8):1282–4. pmid:19886902
- 91. Rubbo P-A, Nagot N, Le Moing V, Brabet M, Bourdin A, Nogué E, et al. Multicytokine detection improves latent tuberculosis diagnosis in health care workers. J Clin Microbiol. 2012;50(5):1711–7. pmid:22403417
- 92. Frahm M, Goswami ND, Owzar K, Hecker E, Mosher A, Cadogan E, et al. Discriminating between latent and active tuberculosis with multiple biomarker responses. Tuberculosis (Edinb). 2011;91(3):250–6. pmid:21393062
- 93. Cyktor JC, Carruthers B, Kominsky RA, Beamer GL, Stromberg P, Turner J. IL-10 inhibits mature fibrotic granuloma formation during Mycobacterium tuberculosis infection. J Immunol. 2013;190(6):2778–90. pmid:23396944