Peer Review History
| Original SubmissionAugust 13, 2025 |
|---|
|
PCOMPBIOL-D-25-01638 ECOD: Classification of domains in AFDB Swiss-Prot structure predictions PLOS Computational Biology Dear Dr. Schaeffer, Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jan 14 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: * A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below. * A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. * An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter We look forward to receiving your revised manuscript. Kind regards, Chaok Seok Academic Editor PLOS Computational Biology Nir Ben-Tal Section Editor PLOS Computational Biology Journal Requirements: 1) We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex. If you are providing a .tex file, please upload it under the item type u2018LaTeX Source Fileu2019 and leave your .pdf version as the item type u2018Manuscriptu2019. 2) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines: https://journals.plos.org/ploscompbiol/s/figures 3) Please amend your detailed Financial Disclosure statement. This is published with the article. It must therefore be completed in full sentences and contain the exact wording you wish to be published. State what role the funders took in the study. If the funders had no role in your study, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.". If you did not receive any funding for this study, please simply state: u201cThe authors received no specific funding for this work.u201d Reviewers' comments: Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The manuscript by Schaeffer et al., "ECOD: Classification of domains in AFDB Swiss-Prot structure predictions" applies a previously developed DPAM pipeline to classify the protein domains from the AF-predicted structures. The manuscript substantially expands the coverage of classified domains by including over 500,000 structures from Swiss-Prot. Furthermore, the submission substantially expands the phylogenetic coverage of species for which the domains have been annotated, going far beyond the 48 model organisms previously annotated by ECOD. Expanding ECOD with predicted structures is overall an important work, but the manuscript in the current form doesn’t fully reflect it. There are several major issues that the authors need to address before the manuscript becomes suitable for publication in PLoS Computational Biology: 1. With expanded phylogeny, ECOD should now offer an advanced phylogenetic browser/filtering, i.e., the ability to select, compare, or exclude domains across one or multiple taxonomic groups. 2. "4 superkingdoms" (line 93). The Consensus Statement by the International Committee on Taxonomy of Viruses indicates that "the current view [is] that viruses have multiple origins (polyphyly) and that their diversity cannot be represented by a single virosphere-wide tree" (doi: 10.1038/s41564-020-0709-x). Therefore, grouping viruses into a single fourth superkingdom is not supported. In line with this view, the NCBI changed the naming scheme**,** replacing "superkingdom" with "Acellular root" for viruses (https://ncbiinsights.ncbi.nlm.nih.gov/2025/02/27/new-ranks-ncbi-taxonomy/). Please comply throughout the text with the current consensus. 3. "Initial analysis shows Swiss-Prot contains a significantly higher density of functional annotations compared to reference proteomes" (lines 81-82). This should say Swiss-Prot entries and UniProt entries from reference proteomes. Additionally, is this initial analysis part of your manuscript? This is not clear; otherwise, please provide a reference. 4. The manuscript should provide a clear description of new domains identified in this study (maybe in the form of a table): which architectures and X groups expanded due to discoveries of new domains and which phylogenetic groups contributed to this expansion. 5. The authors state that "the propagation of functional annotation from experimentally characterized proteins and their domains to their homologous yet hypothetical or uncharacterized domains can lead to biological insights" (lines 54-56). Yes, but it can also lead to errors. While there is clearly value in functional annotation, please be specific and provide examples and references. Minor comments: 6. The authors start this manuscript with the following sentence: "Protein domains are independent evolutionary units that convey function and fitness" (line 51). This appears to be an overstatement; please provide references showing that protein domains in general provide fitness or remove statements that are unspecific, vague, and unverifiable assumptions. 7. Figure 1. Panel D: labels are cropped and inconsistent. The figure legend for panel D also needs revision as the names on the plot do not correspond to the names in the legend. In the plot we have "EC Annotations", "EC Proteins", and "Unique ECs"; in the legend we have "proteins with at least one EC number", "total EC annotations", and "unique EC identifiers". 8.) Figure 2. Panel A: the y-axis title is cropped. Please fix the color codes: in Panel B, as per the figure legend, domain count is shown in blue and residue count in green. However, it is confusing that the label "Simple Topology" is shown in the same green as residue count bars, and the label "Good" is shown in the same blue as domain count bars. Are the four categories "Partial", "Simple Topology", "Low Confidence", and "Good" mutually exclusive? Please define. 9. Panel C appears to follow the same color code as panel B, but this is not specified; please clarify. The bars in panel D appear to follow the colors indicated by the labels in panel B; please clarify. 10. Please provide a definition for "Simple Topology". The text states that "Simple topology domains (9.20% of domains but only 2.54% of residues) typically represent smaller structural units." Is this a or a definition? 11. Lines 263-269 and 290-297 describe the same results. Please avoid repetition. Reviewer #2: This manuscript presents an extension of the Evolutionary Classification of Protein Domains (ECOD) to include AlphaFold Database (AFDB) structure predictions for all Swiss-Prot proteins. Using the Domain Parser for AlphaFold Models (DPAM) pipeline, the authors classify over 1 million domains derived from more than 540,000 protein structure predictions, achieving broad taxonomic and functional coverage. The study integrates structure-based domain classification with Swiss-Prot’s curated functional annotations and provides a comprehensive dataset deposited in Zenodo and the ECOD database. Overall, this is a valuable resource because of the following strengths: 1. The work expands ECOD’s coverage into a major curated dataset (Swiss-Prot), representing an important addition to structural bioinformatics resources. 2. The authors describe a systematic and reproducible pipeline (DPAM + ECOD + Pfam integration), combining sequence, structure, and taxonomic analysis. 3. The reported high DPAM probability (mean 0.992) and extensive pLDDT validation underscore robust classification. 4. Data are openly available via Zenodo and ECOD. 5. The manuscript provides taxonomic, functional, and evolutionary insights—e.g., the discovery of >100,000 domains lacking Pfam mappings—highlighting areas for future functional annotation and structural validation. However, there are still two issues for the authors to address: • A more detailed analysis or discussion of the difference and similarity between ECOD extension with CATH’s use of AFDB can strengthen the work. • Typographical errors (e.g., “Classificatioxn”) should be corrected. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy.. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: While revising your submission, we strongly recommend that you use PLOS’s NAAS tool (https://ngplosjournals.pagemajik.ai/artanalysis) to test your figure files. NAAS can convert your figure files to the TIFF file type and meet basic requirements (such as print size, resolution), or provide you with a report on issues that do not meet our requirements and that NAAS cannot fix. After uploading your figures to PLOS’s NAAS tool - https://ngplosjournals.pagemajik.ai/artanalysis, NAAS will process the files provided and display the results in the "Uploaded Files" section of the page as the processing is complete. If the uploaded figures meet our requirements (or NAAS is able to fix the files to meet our requirements), the figure will be marked as "fixed" above. If NAAS is unable to fix the files, a red "failed" label will appear above. When NAAS has confirmed that the figure files meet our requirements, please download the file via the download option, and include these NAAS processed figure files when submitting your revised manuscript. Reproducibility: To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols |
| Revision 1 |
|
PCOMPBIOL-D-25-01638R1 ECOD: Classification of domains in AFDB Swiss-Prot structure predictions PLOS Computational Biology Dear Dr. Schaeffer, Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Apr 04 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: * A letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below. * A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. * An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. We look forward to receiving your revised manuscript. Kind regards, Chaok Seok Academic Editor PLOS Computational Biology Nir Ben-Tal Section Editor PLOS Computational Biology Reviewers' comments: Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The revised version of the manuscript "ECOD: Classification of domains in AFDB Swiss-Prot structure predictions" by Grishin and co-workers essentially addressed all the concerns raised in the original submission. I recommend this manuscript for publication in PLOS Computational Biology after a couple of minor glitches are fixed. 1. In the responses, the authors stated that the advanced taxonomic search is available from the main ECOD page http://prodata.swmed.edu/ecod/af2_pdb/. It appears that this useful feature is available from http://prodata.swmed.edu/ecod/af2_pdb/search/advanced. Please make it available at the front page or provide an explicit link in the manuscript. It may also be beneficial to deploy an SSL connection to the server using Certbot or other services. 2. While the 4 superkingdom issue has been addressed in the revised version, line 93 still states "with domains spanning 4 superkingdoms, 98 phyla, and 10,254 species". Please fix. Reviewer #3: In this manuscript, Schaeffer and co-workers apply the Domain Parser for AlphaFold Models (DPAM) to annotate AlphaFold-predicted structures in the manually curated UniProtKB/Swiss-Prot database. While the overall objective is timely and provides valuable insight to related fields, several conceptual and presentation issues limit the impact and interpretability of the work and should be revised for publication. Major points: 1) A key result is the reported discrepancy between DPAM-parsed domains and Pfam annotations. The authors attribute this largely to “novel evolutionary” relationships, but this explanation is insufficiently justified. It remains unclear whether these differences stem from structure-based versus sequence-based similarity, differences in domain definition and annotation philosophy, or limitations of AlphaFold predictions. The manuscript would benefit from a more rigorous discussion supported by concrete examples. 2) Similar to pt 1), in lines 325-326, what does the fact that there are many unannotated but with high-quality ordered structures mean? Are they truly new domains, structures, that were absent in experimental DBs? Or is it due to disparity in sequence similarity and structure similarity? 3) Although ECOD was published some time ago, still, a proper introduction of the hierarchy, organization, and the rationale of the design the ECOD classification framework should be provided. The meanings of the H-, T-, and X-level annotations should be briefly explained to make the results accessible to a broader audience. 4) Even with rather low similarity, 40%, the number of singletons is almost the half of the entire clusters. Does this mean that the current parsing scheme is too strict or sensitive to small changes in structures? Authors should provide more explanation and discussion on these large number of singletons. 5) Figure 2D is not referenced in the text and the color codes of Figure2D are not presented in the legend. 6) Figure 3 is small and difficult to interpret. Minor point: 1) In line 88, “Swiss-Prot/UniProtKB” should be corrected to “UniProtKB/Swiss-Prot.” 2) In line 313, pfam -> Pfam ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy.. Reviewer #1: No Reviewer #3: Yes: Juyong LeeJuyong Lee [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: While revising your submission, we strongly recommend that you use PLOS’s NAAS tool (https://ngplosjournals.pagemajik.ai/artanalysis) to test your figure files. NAAS can convert your figure files to the TIFF file type and meet basic requirements (such as print size, resolution), or provide you with a report on issues that do not meet our requirements and that NAAS cannot fix. After uploading your figures to PLOS’s NAAS tool - https://ngplosjournals.pagemajik.ai/artanalysis, NAAS will process the files provided and display the results in the "Uploaded Files" section of the page as the processing is complete. If the uploaded figures meet our requirements (or NAAS is able to fix the files to meet our requirements), the figure will be marked as "fixed" above. If NAAS is unable to fix the files, a red "failed" label will appear above. When NAAS has confirmed that the figure files meet our requirements, please download the file via the download option, and include these NAAS processed figure files when submitting your revised manuscript. Reproducibility: To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols |
| Revision 2 |
|
Dear Dr. Schaeffer, We are pleased to inform you that your manuscript 'ECOD: Classification of domains in AFDB Swiss-Prot structure predictions' has been provisionally accepted for publication in PLOS Computational Biology. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. Best regards, Chaok Seok Academic Editor PLOS Computational Biology Nir Ben-Tal Section Editor PLOS Computational Biology *********************************************************** Reviewer's Responses to Questions Comments to the Authors: Please note here if the review is uploaded as an attachment. Reviewer #1: The revised version of the manuscript has been further improved and addressed the majority of concerns raised in the previous submission. I recommend this manuscript for publication after a couple of minor issues are fixed 1) Font in the legend to Fig 4A is to small. Please fix 2) Individual panels in Fig 5 are not called within the main text "At 99% identity, nearly all clusters contained 279 domains from a single phylum, consistent with recent evolutionary divergence (Fig. 5)" Expand the text to contextualize the data shown in Figs. 5A and 5B. Reviewer #3: The authors addressed all my concerns successfully. ********** Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: None Reviewer #3: Yes ********** PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our For information about this choice, including consent withdrawal, please see our Privacy Policy.. Reviewer #1: No Reviewer #3: Yes: Juyong LeeJuyong Lee |
| Formally Accepted |
|
PCOMPBIOL-D-25-01638R2 ECOD: Classification of domains in AFDB Swiss-Prot structure predictions Dear Dr Schaeffer, I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. For Research, Software, and Methods articles, you will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing. Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work! With kind regards, Anita Estes PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol |
Open letter on the publication of peer review reports
PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.
We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.
Learn more at ASAPbio .