Deep learning based predictive modeling to screen natural compounds against TNF-alpha for the potential management of rheumatoid arthritis: Virtual screening to comprehensive in silico investigation

Tasnia Nabi; Tanver Hasan Riyed; Akid Ornob

doi:10.1371/journal.pone.0303954

Peer Review History

Original SubmissionMay 5, 2024
18 Jul 2024 Decision Letter - Sadiq Umar, Editor PONE-D-24-18002Deep learning based predictive modeling to screen natural compounds against TNF-alpha for the potential management of Rheumatoid Arthritis: Virtual screening to comprehensive in silico investigationPLOS ONE Dear Dr. Ornob, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Aug 30 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Sadiq Umar Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse. 3. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. Additional Editor Comments (if provided): [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes ******** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes ****** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No ****** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes ****** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The manuscript analyzes a deep learning (DL) approach to finding natural TNF-alpha inhibitors, a potential treatment for rheumatoid arthritis (RA). The authors use a DL model trained on known TNF-alpha inhibitors to predict the bioactivity of natural compounds in the Selleckchem database. The most promising candidates are then analyzed using in silico methods to assess drug-likeness, binding affinity, and stability. This DL method for virtually screening natural compounds against TNF-alpha is a promising strategy for drug discovery. The analysis is strengthened by using a well-established database and incorporating multifaceted in silico analyses. The manuscript is well-organized and provides a detailed explanation of the methodology. However, there are some methodological weaknesses: Targeting only TNF-alpha may be insufficient for a complex disease like RA. The authors should justify why the model can address broader aspects of the disease. The manuscript doesn't adequately explain why 2AZ5 (structure of TNF-alpha with a small molecule inhibitor) and CASTp/Deepsite were chosen for site identification in the virtual screening. If the active site features identified by the RCSB PDB structure (2AZ5) were compared to those identified by CASTPA/deepsite, the results should be presented and discussed. A major concern is the apparent absence of validation for the virtual screening protocol, especially since TNF-alpha is a well-studied target. Without prospective validation, the authors should consider retrospective studies, as suggested in the following references, to gain a better understanding of the protocol's validity: doi/10.1021/jm300687e for dude selection and https://doi.org/10.1039/C8RA09318K for method execution ****** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Shafi Ullah Khan ******** [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. https://doi.org/10.1371/journal.pone.0303954.r001
Revision 1
20 Sep 2024 Author Response Comment 1: Targeting only TNF-alpha may be insufficient for a complex disease like RA. The authors should justify why the model can address broader aspects of the disease. Response 1: We thank the reviewer for the thoughtful comment and acknowledge that the pathogenesis of Rheumatoid Arthritis (RA) involves a complex and multifactorial network of various cytokines, cells, and immune pathways (1). We have chosen TNF-alpha (TNFα) as our model protein since it is involved multi-directionally in the pathogenesis of RA and represents the largest cytokine target for RA therapeutics (2,3). Secreted by Th1 cells and macrophages, TNFα was initially thought to play a synergistic role to enhance the destructive behaviour of IL-1 (4). However, follow up studies found that excessive activation of TNF-α signaling led to arthritis even in the absence of functional T and B cells (5). Later studies reported that even the membrane-bound form of TNFα (mTNFα) can lead to the full expression of arthritis (6). Additionally, synovial fibroblasts, activated by TNFα, release other pro-inflammatory cytokines such as IL-6, IL-1β which further accelerates cartilage and bone erosion (7). By targeting TNFα, our model indirectly modulates these interconnected pathways, potentially leading to broader therapeutic effects against RA. Comment 2: The manuscript doesn't adequately explain why 2AZ5 (structure of TNF-alpha with a small molecule inhibitor) and CASTp/Deepsite were chosen for site identification in the virtual screening. If the active site features identified by the RCSB PDB structure (2AZ5) were compared to those identified by CASTPA/deepsite, the results should be presented and discussed. Response 2: We thank the reviewer for his valuable comments and for making this interesting observation. The 2AZ5 structure of the TNFα protein has been extensively studied in literature for different virtual screening studies (8). The structure is complexed with a small molecule inhibitor and has become a benchmark for drug discovery studies aiming to inhibit the inflammatory activity of the TNFα protein. We report missing residues in the 2AZ5 structure which was filled via homology modeling in Swiss-Model (9). Briefly, the 2AZ5 target protein was uploaded on the Swiss-Model web server and a template search was initiated which identified 2E7A as the highest ranked template based on sequence coverage, similarity, and accuracy of the predicted structure. The resulting model had GMQE score of 0.87, QMEANDisCO (Global) score of 0.82±0.05, and Ramachandran outlier of 0.46%, confirming the validity of its structure. The CASTp webserver was used to predict several binding pockets and cavities in our processed protein structure (10). Deepsite extended this capability by using deep convolutional neural networks to forecast which of these pockets are more likely to interact with ligands, therefore offering a functional viewpoint on the geometrically detected sites (11). Interestingly, the most probable binding sites in our model overlapped with those found in the 2E7A structure. These common residues were GLY59, CYS60, PRO91, CYS92, GLN93, ARG94, THR96, ALA100, GLU110, PRO104, and TRP105 in chain A and PRO61, HIS64, SER90, PRO91, CYS92, and GLN93 in chain B. Some of these sites also appear in the 3rd largest binding pocket in the 2AZ5 structure - CYS92 in chain A and CYS60, PRO91, CYS92, GLN93, and ARG94 in chain B. We do not see shared active site features identified in the RCSB PDB structure in our model possibly due to subtle changes in geometric and topological features during homology modeling. Prior work involving 2AZ5 protein structure have identified even more distinct active site features. Saddala and Huang (12) reported Val91, Asn92, Leu93, and Phe124 in chain A and His15, Val17, Ala18, Pro20, Arg32 Ala33, Asn34, Ala35, Phe144, Glu146, Ser147, Gly148, Gln149 and Val150 in chain B as potential binding sites. These sites belong to the 5th largest binding pocket (by volume) of 2AZ5 as identified by the CASTp webserver underscoring the fact that protein preparation steps may lead to differential active site prediction. Furthermore, the molecular interaction studies (Fig 7) reveal the multiple and diverse binding between our top-ranked hits and the target protein in the active site regions. Notably, veratramine, which exhibits the highest interaction density, show the lowest binding free energy (ΔGbind = -54.91 kcal/mol) in MM/GBSA analysis. Note: When analyzing active sites, it is important to note that there is a shift of 9 residues between our model and the 2AZ5, 2E7A structures as these proteins start from residue 10 (S1 Fig). We re-indexed our protein sequence to start from 1 during the protein preparation step. Therefore, GLY59 reported in our work will correspond to GLY68 in the 2E7A structure. Conversely, PRO100 in the 2E7A structure will correspond to PRO91 in our protein model. Comment 3: A major concern is the apparent absence of validation for the virtual screening protocol, especially since TNF-alpha is a well-studied target. Without prospective validation, the authors should consider retrospective studies, as suggested in the following references, to gain a better understanding of the protocol's validity: doi/10.1021/jm300687e for dude selection and https://doi.org/10.1039/C8RA09318K for method execution Response 3: Once again, we thank the reviewer for his insightful feedback and agree that retrospective validation is required to confirm the effectiveness of our virtual screening protocol. The referenced computational tools suggested by the reviewer are widely used in developing virtual screening protocols. However, due to budgetary limitations and time constraints associated with obtaining unpaid licences for these tools, we opted to use open-source alternatives. These open-source solutions are widely accepted, well-validated, and maintain the integrity of our validation process (13). Molecular fingerprints have been used in recent literature to validate virtual ligand screening methods and have achieved similar or even superior results compared to 3D shape-based methods for many of the DUD targets (14,15) Given this, we validated our virtual screening using both Morgan and Layered fingerprint descriptors. The Morgan fingerprint (MF) and LayeredFingerprint (16) was calculated using the RDKit Python package. MF had a (radius = 2 and a fingerprint size =2048) setting. The LayeredFingerprint was computed with a minimum path length of 1, a maximum path length of 9, and a fingerprint size of 2048. For benchmarking, we generated and utilized 25 decoy molecules for each of the top 20 active compounds from the DUD-E (17,18) database, and included Schisantherin A (19), a known natural TNF-alpha inhibitor, as a control drug. The anti-inflammatory activity of Schisantherin A against TNF-alpha is well-documented in both computational and in-vitro studies (20). ROC curve analysis showed an AUC of 0.90 for Morgan fingerprints, demonstrating strong predictive power, and 0.80 for Layered fingerprints, indicating reasonable performance as depicted in Fig 1. Fig 1. Retrospective validation of our virtual screening protocol. a) ROC plot based on Tanimoto scores for MF and Layered fingerprint descriptors b) Heat map showing Tanimoto similarity between control (index 1), actives (index 2-21) and decoys (22-50) and c) Box plots with distribution of Tanimoto scores between active-control and decoy-control for both the fingerprint descriptors The Enrichment Factor at 1% was 24.81 for Morgan and 12.40 for Layered, highlighting the model’s ability to prioritize active compounds over decoys. The calculated Enrichment Factor (EF) at 1% for binding affinity was 18.18. Considering our limitations in using paid software and that the ROC-AUC metric may not always be optimal for evaluating binding affinity or docking scores in virtual screening, we used EF at top 1% to assess the performance of our model in prioritizating top-ranking compounds (21). A similarity analysis using Tanimoto coefficients compared active compounds and decoys with Schisantherin A. For clarity, we visualized the first 50 compounds, starting with Schisantherin A as the control, followed by 20 active compounds and 29 decoys. The heatmap reveals a clear distinction between the control, active, and decoy compounds. The control compound (index 1) shows notable similarity with many of the active compounds (indices 2-21), while the active compounds themselves form clusters of moderate-to-high Tanimoto similarity, indicating shared structural features. In contrast, the decoys (indices 22-50) generally display lower similarity to both the control and active compounds, with only a few exceptions. Overall, the demarcations in the heat map indicate that active compounds are more similar to each other and the control, while the decoys remain largely dissimilar, validating the effectiveness of the virtual screening process. The boxplots demonstrate statistically significant differences in similarity values between active and decoy compounds for both Morgan (p <10-5) and Layered fingerprint (p <10-6) techniques. These results highlight that active compounds consistently show greater similarity than decoys, validating the effectiveness of the virtual screening process. References 1. Kondo N, Kuroda T, Kobayashi D. Cytokine Networks in the Pathogenesis of Rheumatoid Arthritis. Int J Mol Sci. 2021 Oct 10;22(20):10922. 2. Jang D in, Lee AH, Shin HY, Song HR, Park JH, Kang TB, et al. The Role of Tumor Necrosis Factor Alpha (TNF-α) in Autoimmune Disease and Current TNF-α Inhibitors in Therapeutics. Int J Mol Sci. 2021 Mar 8;22(5):2719. 3. Choy EHS, Panayi GS. Cytokine Pathways and Joint Inflammation in Rheumatoid Arthritis. New England Journal of Medicine. 2001 Mar 22;344(12):907–16. 4. van de Loo AA, van den Berg WB. Effects of murine recombinant interleukin 1 on synovial joints in mice: measurement of patellar cartilage metabolism and joint inflammation. Ann Rheum Dis. 1990 Apr 1;49(4):238–45. 5. Keffer J, Probert L, Cazlaris H, Georgopoulos S, Kaslaris E, Kioussis D, et al. Transgenic mice expressing human tumour necrosis factor: a predictive genetic model of arthritis. EMBO J. 1991 Dec;10(13):4025–31. 6. Georgopoulos S, Plows D, Kollias G. Transmembrane TNF is sufficient to induce localized tissue toxicity and chronic inflammatory arthritis in transgenic mice. J Inflamm. 1996;46(2):86–97. 7. McInnes IB, Schett G. Pathogenetic insights from the treatment of rheumatoid arthritis. The Lancet. 2017 Jun;389(10086):2328–37. 8. Parves MdR, Mahmud S, Riza YM, Sujon KM, Uddin MAR, Chowdhury MdIA, et al. Inhibition of TNF-Alpha Using Plant-Derived Small Molecules for Treatment of Inflammation-Mediated Diseases. In: The 1st International Electronic Conference on Biomolecules: Natural and Bio-Inspired Therapeutics for Human Diseases. Basel Switzerland: MDPI; 2020. p. 13. 9. Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018 Jul 2;46(W1):W296–303. 10. Binkowski TA. CASTp: Computed Atlas of Surface Topography of proteins. Nucleic Acids Res. 2003 Jul 1;31(13):3352–5. 11. Jiménez J, Doerr S, Martínez-Rosell G, Rose AS, De Fabritiis G. DeepSite: protein-binding site predictor using 3D-convolutional neural networks. Bioinformatics. 2017 Oct 1;33(19):3036–42. 12. Saddala MS, Huang H. Identification of novel inhibitors for TNFα, TNFR1 and TNFα-TNFR1 complex using pharmacophore-based approaches. J Transl Med. 2019 Dec 2;17(1):215. 13. Bento AP, Hersey A, Félix E, Landrum G, Gaulton A, Atkinson F, et al. An open source chemical structure curation pipeline using RDKit. J Cheminform. 2020 Dec 1;12(1):51. 14. Cereto-Massagué A, Ojeda MJ, Valls C, Mulero M, Garcia-Vallvé S, Pujadas G. Molecular fingerprint similarity search in virtual screening. Methods. 2015 Jan;71:58–63. 15. Zhou H, Skolnick J. Utility of the Morgan Fingerprint in Structure-Based Virtual Ligand Screening. J Phys Chem B. 2024 Jun 6;128(22):5363–70. 16. Pattanaik L, Coley CW. Molecular Representation: Going Long on Fingerprints. Chem. 2020 Jun;6(6):1204–7. 17. Khan SU, Ahemad N, Chuah LH, Naidu R, Htar TT. Sequential ligand- and structure-based virtual screening approach for the identification of potential G protein-coupled estrogen receptor-1 (GPER-1) modulators. RSC Adv. 2019;9(5):2525–38. 18. Ni B, Wang H, Khalaf HKS, Blay V, Houston DR. AutoDock-SS: AutoDock for Multiconformational Ligand-Based Virtual Screening. J Chem Inf Model. 2024 May 13;64(9):3779–89. 19. Boyenle ID, Adelusi TI, Ogunlana AT, Oluwabusola RA, Ibrahim NO, Tolulope A, et al. Consensus scoring-based virtual screening and molecular dynamics simulation of some TNF-alpha inhibitors. Inform Med Unlocked. 2022;28:100833. 20. Ci X, Ren R, Xu K, Li H, Yu Q, Song Y, et al. Schisantherin A Exhibits Anti-inflammatory Properties by Down-Regulating NF-κB and MAPK Signaling Pathways in Lipopolysaccharide-Treated RAW 264.7 Cells. Inflammation. 2010 Apr 14;33(2):126–36. 21. Wójcikowski M, Ballester PJ, Siedlecki P. Performance of machine-learning scoring functions in structure-based virtual screening. Sci Rep. 2017 Apr 25;7(1):46710. Attachments Attachment Submitted filename: Response to Reviewers.docx https://doi.org/10.1371/journal.pone.0303954.r002
3 Oct 2024 Decision Letter - Sadiq Umar, Editor Deep learning based predictive modeling to screen natural compounds against TNF-alpha for the potential management of Rheumatoid Arthritis: Virtual screening to comprehensive in silico investigation PONE-D-24-18002R1 Dear Dr. Ornob, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Sadiq Umar Academic Editor PLOS ONE https://doi.org/10.1371/journal.pone.0303954.r003
Formally Accepted
14 Oct 2024 Acceptance Letter - Sadiq Umar, Editor PONE-D-24-18002R1 PLOS ONE Dear Dr. Ornob, I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team. At this stage, our production department will prepare your paper for publication. This includes ensuring the following: * All references, tables, and figures are properly cited * All relevant supporting information is included in the manuscript submission, * There are no issues that prevent the paper from being properly typeset If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps. Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. If we can help with anything else, please email us at customercare@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Sadiq Umar Academic Editor PLOS ONE https://doi.org/10.1371/journal.pone.0303954.r004

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .