Figures
Abstract
Bacteremia is a major contributor to global morbidity and mortality, particularly in low- and middle-income countries where diagnostic delays and empirical antimicrobial misuse exacerbate resistance. This study assessed the accuracy of OneChoice®, an artificial intelligence (AI)-based Clinical Decision Support System (CDSS), in guiding antimicrobial therapy for bloodstream infections (BSIs) in Lima, Peru. A cross-sectional, observational design was used, comparing therapeutic recommendations generated by OneChoice®—based on molecular (FilmArray®) and phenotypic (MALDI-TOF MS, VITEK2) data—with the clinical decisions of 94 physicians (35 infectious disease [ID] specialists and 59 non-specialists) across 366 survey-based evaluations of bacteremia cases. Concordance between CDSS and physician decisions was analyzed using Cohen’s Kappa and logistic regression. The overall concordance rate was 96.1% when considering any suggested treatment, and 74.6% for the top recommendation, with a substantial agreement (κ = 0.70). ID specialists showed significantly higher concordance (κ = 0.78) than non-ID physicians (κ = 0.61), and specialization was the strongest predictor of agreement (OR = 2.26, p = 0.001). Escherichia coli cases had the highest concordance, while Pseudomonas aeruginosa showed the lowest. The CDSS reduced inappropriate antibiotic use, particularly unnecessary carbapenem prescriptions. These findings support the utility of AI-CDSS tools in enhancing antimicrobial stewardship and standardizing care, especially in resource-limited healthcare settings.
Citation: Gómez de la Torre JC, Frenkel A, Chavez-Lencinas C, Rendon A, Fabian M, Caceres-DelAguila JA, et al. (2026) Comparison of OneChoice AI-based clinical decision support recommendations with infectious disease specialists and non-specialists for bacteremia treatment in Lima, Peru. PLoS One 21(4): e0331266. https://doi.org/10.1371/journal.pone.0331266
Editor: Benjamin M. Liu, Children’s National Hospital, George Washington University, UNITED STATES OF AMERICA
Received: August 20, 2025; Accepted: February 1, 2026; Published: April 17, 2026
Copyright: © 2026 Gómez de la Torre et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting information files. In addition, the document of ‘Supp 3.pdf’ has been uploaded to https://doi.org/10.6084/m9.figshare.31281952 to ensure permanent accessibility.
Funding: The author(s) received no specific funding for this work.
Competing interests: Ari Frenkel is Chief Science Officer of Arkstone Medical Solutions, the company that produces the OneChoice report evaluated in this study. JC Gómez de la Torre works as the Director of Molecular Informatics at Arkstone Medical Solutions and as the Medical Director at Roe Lab in Perú. At the same time, Alicia Rendon and Miguel Hueda Zavaleta serve as Quality Assurance Managers at Arkstone Medical Solutions. These affiliations may be perceived as potential conflicts of interest. However, the study’s design, data collection, analysis, interpretation, manuscript preparation, and the decision to publish the results were conducted independently, with no undue influence from the authors’ affiliations or roles within the company.
1. Introduction
Sepsis and bloodstream infections (BSIs) represent a critical global health issue, particularly in low- and middle-income countries (LMICs), where delayed diagnoses and treatment gaps compound mortality [1]. In 2020 alone, an estimated 11 million deaths globally were attributed to sepsis, making it one of the leading causes of death worldwide [2]. In Latin America, limited diagnostic capacity and inappropriate antibiotic use exacerbate the crisis, leading to increasing multidrug-resistant (MDR) infections [2].
Rapid and accurate diagnosis is essential for effective management, as delays in appropriate antimicrobial therapy have been associated with increased mortality [3]. Traditional diagnostic methods, such as blood cultures and phenotypic antimicrobial susceptibility testing (AST), remain the gold standard but suffer from long turnaround times, typically requiring 24–72 h for actionable results [4–6]. This diagnostic delay is directly associated with increased mortality and inappropriate empirical therapy [6].
BSIs caused by organisms like Escherichia coli, Klebsiella pneumoniae, and Pseudomonas aeruginosa are increasingly resistant to standard antibiotics, driving the need for faster and more precise diagnostic tools [7]. Recent advances in molecular diagnostics, including FilmArray® (BioFire Diagnostics, LLC, Salt Lake City, UT, USA) and GeneXpert® (Cepheid, Sunnyvale, CA, USA), have significantly reduced this time from days to hours, allowing for faster pathogen identification and resistance profiling [8]. However, effectively integrating such diagnostic tools into clinical workflows remains challenging, necessitating more robust decision support mechanisms.
When integrated with artificial intelligence (AI), these tools form the backbone of clinical decision support systems (CDSSs) and can provide personalized antimicrobial recommendations [9]. AI-based CDSSs, such as OneChoice® developed by Arkstone, are particularly promising for infectious disease management because they synthesize vast datasets in real time and align with clinical guidelines. This AI-CDSS integrates molecular (e.g., FilmArray) and phenotypic (e.g., VITEK2 and MALDI-TOF MS) diagnostic outputs to offer antimicrobial treatment recommendations. Its design aligns with global and regional stewardship goals, including those from the Surviving Sepsis Campaign and IDSA [10,11]. The OneChoice® CDSS, powered by a machine learning model, has demonstrated strong internal validity in antimicrobial stewardship. In one study that examined the accuracy of its internal validation, the model revealed that it can accurately distinguish trained from novel data (100.0% accuracy) and showed 84.0% agreement with clinical standards in minor discrepancies while avoiding major discrepancies entirely. These results, confirmed through multiple validation methods and enhanced by human-in-the-loop (HITL) oversight, support the system’s potential as a reliable tool for optimizing antimicrobial therapy [12].
However, OneChoice® is currently classified as a clinical decision support tool and has not yet received formal regulatory approval as a medical device from the U.S. Food and Drug Administration (FDA) or CE marking under European Union Medical Device Regulation. The system is intended to assist clinical decision-making and does not replace physician judgment. The recommendations generated should be interpreted within the context of individual patient characteristics and local guidelines, and the tool should be used under appropriate clinical supervision.
These systems have demonstrated potential in reducing inappropriate antibiotic use, optimizing dosing, and ensuring compliance with antimicrobial stewardship principles [13]. While initial results show promise in high concordance with expert ID physicians’ recommendations, real-world validation remains limited, especially in settings such as Peru.
Artificial intelligence (AI) and machine learning (ML) advancements have revolutionized diagnostics in infectious diseases. AI models are increasingly integrated into CDSSs, providing real-time, tailored therapeutic recommendations based on molecular and phenotypic data [13]. Such systems have effectively reduced diagnostic turnaround times and improved clinical outcomes [14]. AI-CDSS applications include early sepsis detection, resistance prediction, and antimicrobial therapy optimization [15,16]. Moreover, in regions like Latin America, AI-driven CDSSs may help bridge technological gaps and support overburdened healthcare systems [17].
Given the complex interplay of clinical, microbiological, and pharmacological factors in BSI management, AI-driven CDSSs have the potential to standardize therapeutic approaches and minimize errors due to human variability. AI-enhanced platforms can analyze diverse datasets in real time, offering context-specific recommendations that align with stewardship principles [18]. OneChoice® is designed to mitigate the therapeutic lag between pathogen detection and treatment initiation, particularly critical in time-sensitive infections such as sepsis. OneChoice® has demonstrated that it can deliver rapid and accurate therapeutic recommendations in cases of bacteremia. An 80.0% concordance was observed between the recommendations generated from molecular and phenotypic results. Furthermore, molecular-based recommendations were available up to 29 h earlier than those based on phenotypic data, suggesting that integrating molecular diagnostics with AI-driven CDSSs could significantly accelerate clinical decision making in bacteremia. Notably, most discrepancies were due to the inability of certain molecular platforms to detect specific phenotypic resistance mechanisms [19].
Despite the promise of CDSS platforms, several barriers persist. These include clinician skepticism, system interoperability issues, and a lack of real-world validation [14,20]. Additionally, implementation in LMICs like Peru is hindered by limited infrastructure and low CDSS awareness among healthcare professionals [17].
Studies evaluating CDSSs such as OneChoice® remain limited, and few investigations have rigorously assessed its alignment with expert clinical judgment in LMICs, particularly among non-specialist physicians.
In Peru, a country with documented high resistance rates and an underfunded infectious disease infrastructure [17], the clinical validation of AI-CDSS tools is critically needed. This study is among the first to evaluate the agreement between an AI-CDSS and physicians across specialties, providing crucial insights into how AI can bridge knowledge gaps and enhance precision medicine. This study compares the CDSS recommendations with therapeutic decisions made by infectious disease (ID) specialists and non-ID specialists, assessing their concordance. The findings will help inform implementation frameworks, training needs, and policy recommendations for AI’s ethical and scalable deployment in infectious disease care.
2. Materials and methods
2.1. Study design and setting
This study employed a cross-sectional, observational design, conducted in tertiary healthcare facilities in Lima, Peru. The study utilized molecular and phenotypic diagnostic methods routinely applied to guide antimicrobial therapy. The primary objective was to compare the CDSS recommendations with therapeutic decisions made by infectious disease (ID) specialists and non-ID specialists, assessing their concordance. Confirmed bacteremia cases identified between January and December 2024 were included. These cases were diagnosed using routine molecular and phenotypic methods in clinical practice, with results available from both OneChoice® and OneChoice® Fusion reports. Demographic, clinical, microbiological, and CDSS-related data were collected between June 15 and June 25, 2025. Subsequently, electronic surveys were administered to infectious disease (ID) specialists and non-ID specialists from June 26 to July 15, 2025.
2.2. Participants and selection criteria
The study involved 94 physicians, comprising 35 infectious disease specialists and 59 non-ID specialists. Participants were selected based on their involvement in bacteremia management within the healthcare facilities. The inclusion criteria mandated that participating physicians had at least three years of clinical experience in infectious disease management. The study included confirmed bacteremia cases, selected based on strict inclusion criteria to ensure the reliability of the dataset. Eligible cases involved patients with positive blood cultures confirmed through FilmArray® molecular testing and phenotypic identification using MALDI-TOF MS and VITEK2 systems. Cases were excluded if they had incomplete laboratory data, invalid susceptibility results, or were determined to be contaminants rather than true BSIs.
2.3. Data acquisition and characteristics
Data for the study were acquired from confirmed bacteremia cases, integrating microbiological, clinical, and therapeutic information. The dataset included results from molecular diagnostics using the FilmArray Blood Culture Identification (BCID) Panel (BioFire Diagnostics, LLC, Salt Lake City, UT, USA) and phenotypic antimicrobial susceptibility tests conducted with Matrix-assisted laser desorption ionisation time-of-flight (MALDI-TOF MS) (bioMérieux, Inc. France) and VITEK2 system (bioMérieux, Inc. France). The initial OneChoice® report (S1 Appendix) was generated based on data from molecular results alone, while the final OneChoice® (Fusion) report (S2 Appendix) incorporated both molecular and phenotypic test results.
The study focused on bacterial BSIs, selecting cases with available molecular and phenotypic testing results. Clinical case surveys were administered to physicians, presenting cases twice, initially with molecular results alone, and later with combined molecular and phenotypic data. Responses were collected to evaluate the agreement between physicians and OneChoice®. In total, 366 case evaluations were collected, allowing for a direct comparison of CDSS recommendations versus clinical judgment (S3 Appendix). All data were fully anonymized before analysis, and authors had no access to identifying patient information during or after data collection.
2.4. Study procedures and tools/Instruments/Materials/Equipment
The experimental procedures followed a structured approach to ensure consistency and reproducibility. Blood samples from patients with suspected bacteremia were processed using the FilmArray BCID Panel, which operates under specific conditions: samples were incubated at 37°C for 24 hours before analysis. The MALDI-TOF MS was calibrated daily using standard bacterial strains to ensure accuracy in phenotypic identification. The VITEK2 system was employed for susceptibility testing, using standard inoculum sizes and incubation periods as per manufacturer guidelines.
Each physician received clinical case surveys electronically, with cases presented randomly to minimize bias. Surveys were designed using Microsoft Forms, ensuring uniformity in data collection (S3 Appendix). The CDSS utilized proprietary algorithms to generate therapeutic recommendations, which were then compared to those of physicians.
2.5. Data preparation
Before analysis, all collected data were anonymized to protect patient confidentiality. Data entry was performed using Microsoft Excel, with each case assigned a unique numeric code. The dataset underwent rigorous cleaning procedures to ensure accuracy, including validation checks for duplicate entries and cross-referencing against laboratory records to confirm diagnostic results (S4 Appendix).
In this study, concordance was defined as agreement between the therapeutic recommendation provided by the CDSS and the expert review based on molecular and/or phenotypic data by the Infectious and Tropical Disease and Non-Infectious Disease Specialists. Discordance refers to any disagreement with expert judgment regarding antimicrobial selection, dosing, or spectrum. Discordant cases were further categorized by type of mismatch, including incorrect antibiotic choice; incorrect dosing or interval; both incorrect antibiotic choice and dosing; unnecessary broad-spectrum coverage; unnecessary broad-spectrum with carbapenem overuse; or a combination of these. Multidrug-resistant (MDR) organisms were defined as non-susceptible to at least one agent in three or more antimicrobial categories, while extensively drug-resistant (XDR) organisms were non-susceptible to all but one or two categories, according to international consensus definitions [21].
2.6. Data analysis
Statistical analyses were conducted using STATA v.17. Descriptive statistics summarized physician responses, concordance rates, and frequency distributions of bacterial species and resistance genes. Cohen’s Kappa coefficient (κ) assessed inter-rater agreement, with interpretations based on established scales for agreement levels. Chi-square (χ²) tests examined associations between categorical variables, such as specialty, experience level, and concordance rates. Fisher’s exact test was applied for bacteria-associated bacteremia cases with limited sample sizes.
Multivariate logistic regression identified independent predictors of concordance, adjusting for specialty, experience, and antimicrobial resistance status. Model specifications included interaction terms to explore the influence of combined factors on therapeutic decision-making.
To assess the concordance between the CDSS and the clinical judgment of the participants, each therapeutic recommendation generated by the CDSS was independently reviewed by two infectious disease specialists (JCGdlT and CChL). Cases were classified as concordant or discordant depending on whether the participant’s decision aligned with the CDSS recommendation. For discordant cases, an adjudication process was conducted to determine whether the discrepancy favored the CDSS, the participant, or both. When the two infectious disease specialists disagreed in their classifications, a third specialist (MH-Z) evaluated the case, and a consensus decision was reached. In addition, all discrepancies were categorized into five mutually exclusive groups: incorrect antibiotic choice; combined errors in antibiotic selection and dosing or duration; unnecessary use of broad-spectrum carbapenems; unnecessary use of broad-spectrum antibiotics only; and discrepancies attributable to both perspectives.
This study was approved by the Institutional Ethics Committee of the Universidad Privada de Tacna, Peru (FACSA-CEI/093-06-2025). Informed consent was obtained from all participating physicians prior to survey administration, after being informed about the purpose of the study, the voluntary nature of their participation, confidentiality safeguards, and their right to withdraw at any time. The consent process was documented electronically through Microsoft Forms. Cases of bacteremia and associated microbiological data were derived from routine clinical care performed between January and December 2024, originally collected as part of standard clinical practice for diagnostic and therapeutic purposes. For the secondary use of these pre-existing clinical data in research, the Ethics Committee granted a waiver of informed consent based on the following criteria: (1) the research involved no more than minimal risk to participants; (2) the waiver did not adversely affect the rights and welfare of the participants; (3) the research could not practicably be carried out without the waiver; and (4) all data were fully anonymized prior to analysis, with patient names replaced by unique numerical codes to prevent identification. The study adhered to international ethical standards, ensuring confidentiality, data integrity, and research reproducibility. The findings will be submitted for peer-reviewed publication and presented at scientific conferences to promote awareness of CDSS integration in antimicrobial stewardship.
3. Results
A total of 366 bacteremia cases were evaluated, of which 206 (56.3%) were reviewed by infectious disease (ID) specialists and 160 (43.7%) by non-infectious disease (non-ID) physicians. A significant difference was observed in years of experience between the two groups (p = 0.001); 71.2% of ID specialists had between 1 and 5 years of experience, whereas 52.0% of non-ID physicians had between 16 and 20 years. The most frequently isolated bacterial species were Escherichia coli (35.5%), Klebsiella pneumoniae/Klebsiella aerogenes (15.8%), and Salmonella typhi/enterica (13.7%). Klebsiella spp. and Salmonella spp. were more commonly observed in cases evaluated by ID specialists (65.6% and 64.0%, respectively). Additionally, Serratia marcescens was exclusively identified in cases reviewed by ID specialists. Regarding resistance genes, 59.0% of isolates showed no detectable resistance genes, with a significant difference between groups (p = 0.047). ID specialists more frequently evaluated bacteremia cases involving strains with CTM-X (61.3% vs. 38.7%) and NDM genes (60.0% vs. 40.0%). All cases with VIM beta-lactamase genes were reported in the non-ID group. Overall, 39.9% of isolates were classified as multidrug-resistant, with no significant differences between groups (p = 0.640) (Table 1).
In terms of concordance between the system’s recommendations and expert evaluation, 74.6% of cases showed overall agreement, with a significantly higher rate in the ID group (61.2% vs. 38.8%, p = 0.001). Disagreement was more common among non-ID physicians (58.1% vs. 41.9%) (Table 1). Finally, A detailed assessment of discrepancies in therapeutic decisions revealed that 92.5% of discordant cases favored the CDSS recommendations, while 7.5% reflected consideration of both CDSS and physician input. The most frequent type of mismatch was incorrect antibiotic selection (48.4%), followed by combined errors in antibiotic choice, dosing, or interval (19.4%), and unnecessary broad-spectrum use (8.6%). Additionally, 10.8% of mismatches involved unnecessary carbapenem overuse, underscoring the importance of carbapenem stewardship in the context of resistance. Notably, most dosing-related errors occurred in the non-ID group (Table 1, Fig 1).
The overall concordance rate between OneChoice® recommendations and physician decisions was 96.1% when considering any of the four alternative treatment options provided by the system. Concordance was slightly lower when evaluating the best-recommended option, with rates of 74.3% for molecular-based results and 74.8% for phenotypic results. The Kappa Index, a statistical measure of agreement beyond chance, was calculated for all participants and stratified by specialty, the overall Kappa Index was 0.70, indicating substantial agreement. Infectious disease specialists demonstrated a higher concordance with a Kappa value of 0.78 than non-ID specialists, who had a Kappa value of 0.61 (Table 2).
The study further explored concordance rates based on physician experience, revealing higher agreement among more experienced physicians, particularly among ID specialists. Concordance rates among ID specialists ranged from 70.8% for those with 16–20 years of experience to 84.2% for those with more than 20 years. In contrast, non-ID specialists showed an overall lower concordance rate of 66.3%, with the highest agreement observed in the 16–20-year experience group at 80.7% (Table 3). The analysis of bacterial species and concordance rates revealed that Escherichia coli had the highest concordance rate at 80.0%. In contrast, Pseudomonas aeruginosa exhibited the lowest agreement at 53.8%, with a statistically significant p-value of 0.012. Conversely, Staphylococcus haemolyticus showed 100.0% concordance (Table 3).
A multivariate logistic regression analysis was performed to identify factors influencing concordance, adjusting for specialty, years of experience, and MDR status. The results, summarized in Table 4, indicated that being an ID specialist was the strongest predictor of concordance, with an odds ratio (OR) of 2.26 and a p-value of 0.001. Years of experience did not show a statistically significant effect on concordance, with an OR of 1.27 (p = 0.562) for physicians with 16–20 years of experience and an OR of 1.06 (p = 0.843) for those with more than 20 years. MDR status showed a trend towards higher concordance (OR = 1.52, p = 0.104), suggesting that physicians are more likely to follow CDSS recommendations when dealing with multidrug-resistant infections. However, this result was not statistically significant (Table 4).
Finally, we evaluated the overall level and nature of the discrepancies between the CDSS and the study participants. Concordance between the CDSS recommendations and the participants’ assessments occurred in 74.6% of cases (273/366), while the remaining 25.4% showed discordance (Fig 2a). Among the discordant cases, the vast majority (92.5%) favored the CDSS (Onechoice) recommendation, while only 7.5% were considered acceptable by both parties (CDSS and expert reviewers) (Fig 2b). Likewise, the most frequent types of discrepancies were incorrect antibiotic choice (51.7%), combined errors in antibiotic selection, dosing, or duration (20.7%), unnecessary use of broad-spectrum or carbapenem agents (11.5%), unnecessary use of broad-spectrum antibiotics (9.2%), and discrepancies attributable to both perspectives (6.9%) (Fig 2c). These findings suggest that most discrepancies were due to inappropriate antibiotic selection and that when disagreement occurred, the CDSS recommendations were more accurate in most cases.
4. Discussion
This study evaluated the concordance between the therapeutic recommendations provided by an artificial intelligence–based clinical decision support system (CDSS), OneChoice®, and expert physician judgment in real-world bacteremia cases. Our findings demonstrate a substantial level of agreement (κ = 0.70), particularly among infectious disease specialists (κ = 0.78), suggesting that the system can offer clinically relevant support when interpreted by trained professionals. These results contribute to the growing body of evidence on the role of CDSS in the management of bloodstream infections (BSIs) and antimicrobial stewardship, particularly in high-resistance settings like Peru [22]. The consistently high antimicrobial resistance rates reported in national surveillance data underscore the urgent need for innovative, effective interventions [22,23]. The observed lower concordance among non-infectious disease specialists (κ = 0.61) further emphasizes the importance of clinical expertise and reinforces that such tools should serve as supervised decision aids rather than autonomous systems, especially in complex infectious disease contexts.
Our results demonstrate a high concordance (96.1%) between the CDSS used and physician-prescribed treatments, reflecting the potential for AI-driven tools to support antimicrobial decision-making in real-world settings [16]. Substantial agreement (Kappa Index = 0.70) further supports its reliability. These findings align with existing literature highlighting the utility of machine learning-based systems in infectious disease management [14,15].
This study is among the first to evaluate the accuracy of the OneChoice® CDSS in a Peruvian clinical setting and across physician specializations. Higher concordance rates among ID specialists (Kappa = 0.78) compared to non-ID physicians (Kappa = 0.61) highlight the benefit of domain-specific expertise in interpreting CDSS outputs [16]. AI tools such as OneChoice® may be instrumental in bridging the knowledge gap among non-specialist practitioners [14].
Logistic regression identified ID specialization as the primary predictor of concordance (OR = 2.26, p = 0.001), reinforcing the role of targeted training. Interestingly, years of clinical experience showed no significant effect on concordance, suggesting expertise specificity is more relevant than time in practice.
Moreover, the system’s ability to prevent unnecessary carbapenem use in 10.8% of discordant cases aligns with stewardship goals and can reduce selective pressures leading to resistance [13]. Nathan D. Nielsen et al. have noted that CDSS can improve antibiotic appropriateness in critical care settings [24].
This CDSS demonstrated particular strength in cases involving multidrug-resistant (MDR) organisms, such as Pseudomonas aeruginosa, where therapeutic decisions are inherently complex. Notably, P. aeruginosa had the lowest concordance rate (53.8%, p = 0.012), highlighting the critical need for structured tools in guiding antibiotic selection against resistant pathogens [25]. This aligns with prior studies emphasizing that AI-based diagnostics can enhance early pathogen recognition and streamline antimicrobial therapy [26].
While logistic regression analyses identified ID specialization as the strongest predictor of concordance (OR = 2.26, p = 0.001), years of clinical experience did not significantly influence agreement rates. This suggests that expertise in infectious diseases may play a more critical role in aligning with AI-generated recommendations than overall experience [26].
In contrast to the study by Montiel-Romero et al., which reported moderate concordance (κ = 0.48) in antibiotic selection and low concordance (κ = 0.39) in identifying resistance mechanisms between ChatGPT® and infectious disease (ID) specialists in simulated clinical cases [27], our study evaluated an artificial intelligence–based clinical decision support system (OneChoice®) applied to real-world bacteremia cases. We found a higher level of agreement (κ = 0.70) between the system’s recommendations and physician evaluations. Moreover, concordance was significantly higher among ID specialists (κ = 0.78) compared to non-specialists (κ = 0.61). These findings suggest that, unlike general language models such as ChatGPT®, systems specifically trained on structured clinical data may provide more reliable support in real clinical settings. Nevertheless, consistent with the conclusions of Montiel-Romero et al., our results underscore the importance of using these tools as supervised support rather than as a replacement for clinical judgment, particularly in scenarios where microbiological and pharmacological interpretation is critical for decision-making.
Despite these promising results, the discussion must acknowledge certain limitations. First, the sample size (366 physician surveys) and cross-sectional design limit generalizability and introduce potential confounders unaccounted for in regression modeling. Reliance on survey responses may introduce subjective bias, particularly if treatment deviated from guidelines. Second, our findings are specific to bacteremia and may not extend to non-bacteremic infections such as respiratory, urinary tract, or intra-abdominal syndromes. The real-world impact of this CDSS should be interpreted within the context of bloodstream infections. Third, we focused on concordance as a process metric rather than patient-centered outcomes such as mortality, length of stay, or clinical improvement. While substantial agreement with specialist judgment demonstrates clinical relevance, it does not directly prove improved patient outcomes. Fourth, findings from this Peruvian tertiary care setting may not generalize to healthcare systems with different diagnostic infrastructure, stewardship program maturity, or clinical workflows. Finally, lower concordance among non-specialists (κ = 0.61) highlights risks of overreliance on AI recommendations without adequate clinical reasoning, particularly in complex scenarios or when input data are incomplete.
Our findings demonstrate that the CDSS developed by Arkstone provides reliable therapeutic recommendations that align closely with clinical decision-making, particularly among infectious disease specialists. The substantial overall concordance and Kappa Index values indicate that the CDSS is a valuable tool for guiding antimicrobial therapy in bacteremia cases. The higher concordance rates observed among ID specialists and more experienced physicians suggest that specialized training and clinical expertise enhance alignment with CDSS recommendations. Furthermore, the detailed analysis of discrepancies highlights specific areas where the CDSS can aid in optimizing antibiotic selection and dosing, thereby contributing to improved antimicrobial stewardship.
Overall, this CDSS shows significant promise in aligning therapeutic recommendations with best practices, especially for common pathogens and experienced physicians. By reducing variation and enhancing evidence-based decision-making, such tools can support stewardship goals and improve care quality in resource-constrained settings.
AI-driven CDSS represent a promising advancement in infectious disease management. By enhancing diagnostic precision and standardizing antimicrobial therapy, these systems have the potential to mitigate the global burden of antimicrobial resistance and improve patient outcomes [28]. However, future CDSS iterations must incorporate clinical presentation, local epidemiology, host factors, and dynamic patient status to provide comprehensive support across diverse infectious syndromes beyond bacteremia. Future research should prioritize prospective, multicenter studies assessing CDSS impact on patient-centered outcomes, antimicrobial resistance trends, and healthcare resource utilization across diverse settings. Key directions include: (1) expansion of CDSS validation beyond bacteremia to encompass respiratory, urinary, and intra-abdominal infections; (2) integration of dynamic clinical data streams, local epidemiological inputs, and rapid molecular diagnostics such as FilmArray® BCID2 to enable context-sensitive decision support [29]; (3) incorporation of real-time resistance tracking and genomic insights to refine AI-CDSS applications; and (4) development of multidisciplinary implementation frameworks and training programs to ensure CDSS complements, rather than supplants, expert clinical judgment.
5. Conclusions
This study demonstrates that the OneChoice® AI-based clinical decision support system achieves substantial concordance (κ = 0.70) with physician therapeutic decisions in bacteremia management, with particularly strong agreement among infectious disease specialists (κ = 0.78). The system showed a high overall concordance rate of 96.14% when considering any suggested treatment option, and 74.59% for the top recommendation. Infectious disease specialization was identified as the strongest predictor of concordance with CDSS recommendations (OR = 2.26, p = 0.001). The system demonstrated potential in reducing inappropriate antibiotic prescriptions, particularly unnecessary carbapenem use, supporting antimicrobial stewardship goals in a high-resistance setting. These findings support the utility of AI-CDSS tools as supervised decision aids for enhancing antimicrobial therapy standardization, especially in resource-limited healthcare settings.
Supporting information
S1 Appendix. Example of OneChoice® report based on molecular results alone.
https://doi.org/10.6084/m9.figshare.31281913.
https://doi.org/10.1371/journal.pone.0331266.s001
(PDF)
S2 Appendix. Example of OneChoice® Fusion report incorporating molecular and phenotypic data.
https://doi.org/10.6084/m9.figshare.31281946.
https://doi.org/10.1371/journal.pone.0331266.s002
(PDF)
S3 Appendix. Clinical case survey instrument used for physician–CDSS agreement analysis.
https://doi.org/10.6084/m9.figshare.31281952.
https://doi.org/10.1371/journal.pone.0331266.s003
(PDF)
Acknowledgments
We thank all the personnel in the Arkstone Medical Solutions and Roe clinical laboratory who have actively been working.
References
- 1. Rudd KE, Johnson SC, Agesa KM, et al. Global, regional, and national sepsis incidence and mortality, 1990–2017: analysis for the Global Burden of Disease Study. Lancet. 2020;395(10219):200–11.
- 2.
World Health Organization. Sepsis: Global epidemiology and strategies for prevention. WHO Fact Sheet. 2020.
- 3. Shin DJ, Andini N, Hsieh K, Yang S, Wang T-H. Emerging Analytical Techniques for Rapid Pathogen Identification and Susceptibility Testing. Annu Rev Anal Chem (Palo Alto Calif). 2019;12(1):41–67. pmid:30939033
- 4. Zeng X, Lloyd KM, Hamdy RF, Shapiro CA, Fisher MA, Lin J, et al. Identification and characterization of an invasive, hyper-aerotolerant Campylobacter jejuni strain causing bacteremia in a pediatric leukemia patient. ASM Case Rep. 2025;1(3):e00060-24. pmid:41246370
- 5. Laque-Ale A, Hueda-Zavaleta M, Gómez de la Torre JC, Alvarado L, Cáceres Del Águila JA. Diagnostic performance of the time to positivity of blood cultures to distinguish true bacteremia from contaminants based on an automated system. Rev Peru Med Exp Salud Publica. 2023;40(4):451–8. pmid:38597473
- 6. Peri AM, Stewart A, Hume A, Irwin A, Harris PNA. New Microbiological Techniques for the Diagnosis of Bacterial Infections and Sepsis in ICU Including Point of Care. Curr Infect Dis Rep. 2021;23(8):12. pmid:34149321
- 7. Dettori S, Portunato F, Vena A, Giacobbe DR, Bassetti M. Severe infections caused by difficult-to-treat Gram-negative bacteria. Curr Opin Crit Care. 2023;29(5):438–45. pmid:37641512
- 8. Tjandra KC, Ram-Mohan N, Abe R, Hashemi MM, Lee J-H, Chin SM, et al. Diagnosis of Bloodstream Infections: An Evolution of Technologies towards Accurate and Rapid Identification and Antibiotic Susceptibility Testing. Antibiotics (Basel). 2022;11(4):511. pmid:35453262
- 9. AlGain S, Marra AR, Kobayashi T, Marra PS, Celeghini PD, Hsieh MK, et al. Can we rely on artificial intelligence to guide antimicrobial therapy? A systematic literature review. Antimicrob Steward Healthc Epidemiol. 2025;5(1):e90. pmid:40226293
- 10. Evans L, Rhodes A, Alhazzani W, Antonelli M, Coopersmith CM, French C, et al. Surviving sepsis campaign: international guidelines for management of sepsis and septic shock 2021. Intensive Care Med. 2021;47(11):1181–247. pmid:34599691
- 11. Tamma PD, Aitken SL, Bonomo RA, Mathers AJ, van Duin D, Clancy CJ. 2023 guidance on the treatment of antimicrobial resistant gram-negative infections. Clinical Infectious Diseases. 2023.
- 12. Frenkel A, Rendon A, Chavez-Lencinas C, Gomez De la Torre JC, MacDermott J, Gross C, et al. Internal Validation of a Machine Learning-Based CDSS for Antimicrobial Stewardship. Life (Basel). 2025;15(7):1123. pmid:40724626
- 13. Branda F, Scarpa F. Implications of Artificial Intelligence in Addressing Antimicrobial Resistance: Innovations, Global Challenges, and Healthcare’s Future. Antibiotics (Basel). 2024;13(6):502. pmid:38927169
- 14. Peiffer-Smadja N, Rawson TM, Ahmad R, Buchard A, Georgiou P, Lescure F-X, et al. Machine learning for clinical decision support in infectious diseases: a narrative review of current applications. Clin Microbiol Infect. 2020;26(5):584–95. pmid:31539636
- 15. Anahtar MN, Yang JH, Kanjilal S. Applications of Machine Learning to the Problem of Antimicrobial Resistance: an Emerging Model for Translational Research. J Clin Microbiol. 2021;59(7):e0126020. pmid:33536291
- 16. Jian M-J, Lin T-H, Chung H-Y, Chang C-K, Perng C-L, Chang F-Y, et al. Artificial Intelligence-Clinical Decision Support System in Infectious Disease Control: Combatting Multidrug-Resistant Klebsiella pneumoniae with Machine Learning. Infect Drug Resist. 2024;17:2899–912. pmid:39005853
- 17. Fabre V, Cosgrove SE, Secaira C, Tapia Torrez JC, Lessa FC, Patel TS, et al. Antimicrobial stewardship in Latin America: Past, present, and future. Antimicrob Steward Healthc Epidemiol. 2022;2(1):e68. pmid:36483374
- 18. Alsulimani A, Akhter N, Jameela F, Ashgar RI, Jawed A, Hassani MA, Dar SA. The Impact of Artificial Intelligence on Microbial Diagnosis. Microorganisms. 2024;12(6):1051.
- 19. Gomez de la Torre JC, Frenkel A, Chavez-Lencinas C, Rendon A, Cáceres JA, Alvarado L, et al. AI-based treatment recommendations enhance speed and accuracy in bacteremia management: a comparative study of molecular and phenotypic data. Life. 2025;15(6):864.
- 20. Kelly CJ, Karthikesalingam A, Suleyman M, Corrado G, King D. Key challenges for delivering clinical impact with artificial intelligence. BMC Med. 2019;17(1):195. pmid:31665002
- 21. Magiorakos A-P, Srinivasan A, Carey RB, Carmeli Y, Falagas ME, Giske CG, et al. Multidrug-resistant, extensively drug-resistant and pandrug-resistant bacteria: an international expert proposal for interim standard definitions for acquired resistance. Clin Microbiol Infect. 2012;18(3):268–81. pmid:21793988
- 22. Krapp F, García C, Hinostroza N, Astocondor L, Rondon CR, Ingelbeen B, et al. Prevalence of Antimicrobial Resistance in Gram-Negative Bacteria Bloodstream Infections in Peru and Associated Outcomes: VIRAPERU Study. Am J Trop Med Hyg. 2023;109(5):1095–106. pmid:37722663
- 23. Rondon C, Garcia C, Krapp F, Machaca I, Olivera M, Fernández V, et al. Antibiotic point prevalence survey and antimicrobial resistance in hospitalized patients across Peruvian reference hospitals. J Infect Public Health. 2023;16 Suppl 1(Suppl 1):52–60. pmid:37957105
- 24. Nielsen ND, Dean JT, Shald EA, Conway Morris A, Povoa P, Schouten J, et al. When to Stop Antibiotics in the Critically Ill? Antibiotics (Basel). 2024;13(3):272.
- 25. Oliver A, et al. Pseudomonas aeruginosa antimicrobial susceptibility profiles, resistance mechanisms and international clonal lineages: update from ESGARS-ESCMID/ISARPAE Group. Clin Microbiol Infect. 2024;30(4):469–80.
- 26. Cesaro A, Hoffman SC, Das P, de la Fuente-Nunez C. Challenges and applications of artificial intelligence in infectious diseases and antimicrobial resistance. NPJ Antimicrob Resist. 2025;3(1):2. pmid:39843587
- 27. Montiel-Romero S, Rajme-López S, Román-Montes CM, López-Iñiguez A, Rivera-Villegas HO, Ochoa-Hein E, et al. Recommended antibiotic treatment agreement between infectious diseases specialists and ChatGPT®. BMC Infect Dis. 2025;25(1):38. pmid:39773383
- 28. Rabaan AA, Bakhrebah MA, Alotaibi J, Natto ZS, Alkhaibari RS, Alawad E, et al. Unleashing the power of artificial intelligence for diagnosing and treating infectious diseases: A comprehensive review. J Infect Public Health. 2023;16(11):1837–47. pmid:37769584
- 29. Holma T, Torvikoski J, Friberg N, Nevalainen A, Tarkka E, Antikainen J, et al. Rapid molecular detection of pathogenic microorganisms and antimicrobial resistance markers in blood cultures: evaluation and utility of the next-generation FilmArray Blood Culture Identification 2 panel. Eur J Clin Microbiol Infect Dis. 2022;41(3):363–71. pmid:34350523