Fig 1.
Initial training and performance evaluation of the machine learning model.
(A) High-throughput screening data generated by screening a compound library of 29,537 compounds against B. cenocepacia K56-2 wild-type. Using B-score ≤ -17.5 as a threshold, the screening yielded 256 active compounds. Dark blue and red represent inactive and active compounds, respectively. (B) The machine learning model was trained using a D-MPNN approach, which extracts compounds’ local features, such as atom and bond features. The model was fed more than 200 additional global molecular descriptors to further increase the accuracy. Dataset was split into 80:10:10 ratio to train, validate and test the model. (C) ROC-AUC plot evaluating model performance after training. The model attained a ROC-AUC of 0.823. Parts of panel B are modified from Yang et al. [11]. Fig 1B was created with https://biorender.com/.
Table 1.
Deep learning models trained with different combinations of molecule-level features.
Fig 2.
In vitro testing of top-ranked predicted compounds from an FDA-approved compound library.
(A) Schematic of the screening protocol. Eighty-one commercially available compounds (from the top 100) were screened. (B) The screening identified 21 bioactive compounds with a positive predictive value (PPV) of 25.9%. Dark blue and red represent inactive and active compounds, respectively. (C) The top 100 ranked compounds selected for empirical testing belong to different drug families. Most of the compounds exhibiting bioactivity were known antibiotics or antimicrobial compounds. (D) The ratio of OD600nm and prediction scores were plotted against the predicted rank of the corresponding compounds. The results show a linear correlation (Pearson correlation, R = 0.54) between the prediction score and bioactivity. The predicted score is the probability of a compound being active as calculated by the ML model. The predicted rank is the order of the compounds based on the predicted score, where compounds with the higher predicted scores are ranked higher. The red and blue triangles show the gradient of predicted rank and growth (measured as OD600nm), respectively. Dark blue and red indicate compounds’ probability of being inactive and active, respectively. Results are the average of at least three independent biological replicates. Fig 2A was created with https://biorender.com/.
Fig 3.
In vitro testing of top-ranked predicted compounds from an unprecedented natural product library.
(A) The 43 commercially available compounds (from the 100 top ranked unique compounds) were screened against B. cenocepacia K56-2. The screening yielded 5 bioactive compounds with a hit rate 10 times higher than a conventional screening (hit rate = 11.63%). Dark blue and red are non-inhibitory and inhibitory compounds, respectively, based on the residual growth (RG) threshold of 0.8. (B) Screening these 43 compounds against the ESKAPE pathogens yielded 22 bioactive compounds that displayed broad-spectrum growth inhibitory activity against diverse pathogens (Positive predictive value (PPV) = 51.16%). Dark blue and red are non-inhibitory and inhibitory compounds, respectively. The structures of the compounds that exhibited growth inhibitory activity against B. cenocepacia K56-2 and the ESKAPE pathogens are shown beside the plots. Results are average of at least three independent biological replicates. Error bars indicate mean ± SD. AB = A. baumannii 1225, BC = B. cenocepacia K56-2, EC = E. cloacae ENT001_EB001, PA = P. aeruginosa PAO1, MRSA = Methicillin-resistant S. aureus ATCC33592.
Fig 4.
Enhanced sensitivity of the CRISPRi knockdown mutants indicated RpoB as the in vivo target of STL558147.
(A) Chemical structures of STL558147 and Rifampicin. (B-D) Comparison of hypersensitive CRISPRi knockdown mutants to novobiocin (B), rifampicin (C) and STL558147 (D). Blue indicates more growth (less inhibition), and red indicates less growth (more inhibition). Results are average of at least three independent biological replicates.
Fig 5.
Synergy maps of STL558147 and rifampicin combined with other antibiotics against B. cenocepacia K56-2.
Synergy plots of STL558147 (A) and rifampicin (B) with ceftazidime, colistin, and polymyxin B. The synergy scores were calculated based on the widely used Bliss independence [52] and Loewe additivity [53] models. The most synergistic area in each combination is highlighted with a rectangular box inside the plot. Green (negative δ-scores) indicate antagonistic interactions, and red (positive δ-scores) indicate synergistic interactions. Synergy scores >15, between -5 to 15, and < -15 were considered synergistic, additive and antagonistic, respectively. Results are average of at least three independent biological replicates. Synergy scores are shown as mean ± SEM. Synergy scores were calculated using SynergyFinder 2.0 [31].
Fig 6.
Screening of PHAR261659 analogs.
PHAR261659 analogs with different side chains were selected based on lower predicted logP values. STL529920, a stereoisomer of PHAR261659, exhibited growth inhibitory activity against all six pathogens tested. The activity of growth inhibitory and non-growth inhibitory compounds are shown in red and blue, respectively. Results are the average of three independent biological replicates. Error bars indicate mean ± SD.