Peer Review History

Original SubmissionSeptember 8, 2020
Decision Letter - Jeff Atkins, Editor

PONE-D-20-28051

Machine Learning Models Based on Remote and Proximal Sensing as Potential Methods for In-Season Biomass Yields Prediction in Commercial Sorghum Fields

PLOS ONE

Dear Dr. Habyarimana,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jan 29 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Jeff Atkins

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. In your Methods section, please provide additional location information of the study sites, including geographic coordinates for the data set if available.

3. In your Methods section, please provide additional information regarding the permits you obtained for the work. Please ensure you have included the full name of the authority that approved the field site access and, if no permits were required, a brief statement explaining why.

4. Thank you for stating the following after the Conclusion Section of your manuscript:

'Funding: This research was funded by the European Union, grant number 732064 (H2020-ICT-2016-1-innovation action and the APC was funded by the European Union through the project Data-driven Bioeconomy (www.databio.eu) and the Ministero delle Politiche Agricole, Alimentari, Forestali e del Turismo (Rome, Italy) through the project Risorse GeneticheVegetali (RGV/FAO) 2014–2019.'

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

a. Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

'E.H. received funds from the European Union, grant number 732064 (H2020-ICT-2016-1-innovation action. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.'

b. Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

5. We note that Figure 1 in your submission contains map/satellite images which may be copyrighted.

All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (a) present written permission from the copyright holder to publish these figure specifically under the CC BY 4.0 license, or (b) remove the figure from your submission:

a. You may seek permission from the original copyright holder of Figure 1 to publish the content specifically under the CC BY 4.0 license. 

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

b. If you are unable to obtain permission from the original copyright holder to publish this figure under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

The following resources for replacing copyrighted map figures may be helpful:

USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/

The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/

Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.html

NASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/

Landsat: http://landsat.visibleearth.nasa.gov/

USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/#

Natural Earth (public domain): http://www.naturalearthdata.com/

6. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information

Additional Editor Comments:

Thank you for your submission and my apologies for delays caused by COVID-19. I have considered the input of both reviewers, whom I respect and value greatly. R2 brings up the point of small sample size in testing, and you do address that in your discussion. R1 brings up points about your modelling efforts and how to better clarify those efforts. Addressing those and other concerns they both raise will be helpful. To R1's point on small sample size--this is concerning, but I think framing this paper more as a proof of concept (which you do at points) would be more advisable. Your introduction reads like a review. I think carefully paying attention to where you can augment the existing text to make sure readers understand this is an attempt to push current methods towards opening up new avenues of research would be helpful. Thank you.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: No

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript compared several crop yield models with several different remotely sensed optical features in several sorghum fields in Italy. It's an interesting topic and fit to the scope of PLOS ONE. While I have several concerns before the consideration of the publication:

My main concerns are about the feature selections used in learning model:

1.1. there is no description or discussion on how these features were selected

1.2. there are 24 features/inputs in the model, while most of them are same features, like 14 features are fAPAR that captured in different time, please add more information about how did you handle the correlations between these inputs to avoid model overfitting

1.3. The description of different learning models is inadequate, more details are needed to show the differences of these learning models and why they are used in this study

1.4. Section 3.1 should be in methodology as it's the description of model inputs

Reviewer #2: This manuscript reports the result of applying different machine learning techniques for Sorghum biomass Prediction. Authors analysed different Sentinel-2 images and collected ground data from different fields using hand held sensors for chlorophyll and NDVI measurements from 2 different years. In general, the topic is current, interesting and the manuscript has adequate information for methodology and results. However, the ground observation number is not suitable for such analysis considering the different varieties, years and fields. Moreover, I have the following specific comments:

L40: This sentence does not add value. It is just defining the word forecast by using the same word. Please revise.

L41: (simplifies), I suggest to replace it with the word support.

L71: I don't think that it is possible to estimate corn yield 4 months before harvesting. Please revise this information.

L228: How did you apply the PROSAIL model to retrieve the fAPAR?

L252-261: Move this part to your introduction section.

L280: The number of observations is very low. I suggest to collect more ground observations as it is not possible to depend on these current results.

L285-306: This information is not related to your methodology and elaborating more in a well known information. I suggest reducing this part and moving it to the introduction section.

L336 Please define where to find this information.

For all figures: Please maintain a consistent style for figure legends, X and Y labels and values.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Revision 1

Response to reviewers

Dear Editor,

Thank you very much for considering my paper for publication in the journal PLOS ONE.

It’s a pleasure for myself and on behalf of the co-author, to be able use the Editor and Reviewer constructive comments and suggestions to improve this manuscript.

I systematically went through the comments and suggestions, providing our comments, answers, and actions we took in response to Editor and Reviewers concerns.

Below, our answers are provided in italicized font preceded with the capital letter “R” followed by a column “:”

Cordially.

Ephrem.

-------------------

PONE-D-20-28051

Machine Learning Models Based on Remote and Proximal Sensing as Potential Methods for In-Season Biomass Yields Prediction in Commercial Sorghum Fields

PLOS ONE

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming.

R: Style and file naming were corrected according to PLOS ONE standard.

2. In your Methods section, please provide additional location information of the study sites, including geographic coordinates for the data set if available.

R: Geographic coordinates were added to Table 1

3. In your Methods section, please provide additional information regarding the permits you obtained for the work. Please ensure you have included the full name of the authority that approved the field site access and, if no permits were required, a brief statement explaining why.

R: On lines 146-149 in the Manuscript file a sentence was added to explain there were formal contracts that regulated the trials in private commercial fields.

4. Thank you for stating the following after the Conclusion Section of your manuscript:

'Funding: This research was funded by the European Union, grant number 732064 (H2020-ICT-2016-1-innovation action and the APC was funded by the European Union through the project Data-driven Bioeconomy (www.databio.eu) and the Ministero delle Politiche Agricole, Alimentari, Forestali e del Turismo (Rome, Italy) through the project Risorse GeneticheVegetali (RGV/FAO) 2014–2019.'

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

a. Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

'E.H. received funds from the European Union, grant number 732064 (H2020-ICT-2016-1-innovation action. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.'

R: As suggested, funding-related information was removed from the manuscript.

b. Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

R: Funding statements were included in the cover letter as suggested.

5. We note that Figure 1 in your submission contains map/satellite images which may be copyrighted.

All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (a) present written permission from the copyright holder to publish these figure specifically under the CC BY 4.0 license, or (b) remove the figure from your submission:

a. You may seek permission from the original copyright holder of Figure 1 to publish the content specifically under the CC BY 4.0 license.

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.

R: This figure was properly cited in the manuscript. It was originally published in Agronomy (Agronomy 2019, 9(4), 203; https://doi.org/10.3390/agronomy9040203) under (CC BY) license (http://creativecommons.org/licenses/by/4.0/) which permits unrestricted use. Since the original paper was authored by myself and with a license that permits unrestricted use, there is no need for a written permission to copyright owner. These are the rules for the articles published in MDPI journals:

copyright is retained by the authors (https://www.mdpi.com/authors/rights).

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

R: The text was added in the figure caption of the copyrighted Fig 1.

6. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information

R: The captions of the S1 Table was added at the end of the manuscript as suggested.

Additional Editor Comments:

Thank you for your submission and my apologies for delays caused by COVID-19. I have considered the input of both reviewers, whom I respect and value greatly. R2 brings up the point of small sample size in testing, and you do address that in your discussion. R1 brings up points about your modelling efforts and how to better clarify those efforts. Addressing those and other concerns they both raise will be helpful. To R1's point on small sample size--this is concerning, but I think framing this paper more as a proof of concept (which you do at points) would be more advisable. Your introduction reads like a review. I think carefully paying attention to where you can augment the existing text to make sure readers understand this is an attempt to push current methods towards opening up new avenues of research would be helpful. Thank you.

R: Small sample size: this issue was addressed in the manuscript particularly on lines 467-479 of the manuscript.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: No

________________________________________

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

________________________________________

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

________________________________________

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

________________________________________

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript compared several crop yield models with several different remotely sensed optical features in several sorghum fields in Italy. It's an interesting topic and fit to the scope of PLOS ONE. While I have several concerns before the consideration of the publication:

My main concerns are about the feature selections used in learning model:

1.1. there is no description or discussion on how these features were selected

R: The process of selecting the features used in the models was explained in the Manuscript particularly on lines 246-257, 259-267, and 289-300.

1.2. there are 24 features/inputs in the model, while most of them are same features, like 14 features are fAPAR that captured in different time, please add more information about how did you handle the correlations between these inputs to avoid model overfitting

R: Collecting data at 15-day intervals and averages reduced the correlation between inputs. On the other hand, several other steps were taken to avoid model overfitting and this was explained in the manuscript e.g., (1) Our work is a proof of concept accommodating small sample size, and, among models we included low-complexity models such as Bayesian ridge regression (Manuscript lines 274-275, 468-473), (2) implementing the “one standard error rule” and using fortnightly data (Manuscript lines 467-473).

1.3. The description of different learning models is inadequate, more details are needed to show the differences of these learning models and why they are used in this study

R: The models were comprehensively explained in the manuscript, particularly in lines 263-278, 471-480.

1.4. Section 3.1 should be in methodology as it's the description of model inputs

R: Section 3.1 “Descriptive statistics of the features used in modelling” present the analytics outcome and should therefore remain under Results’ section.

Reviewer #2: This manuscript reports the result of applying different machine learning techniques for Sorghum biomass Prediction. Authors analysed different Sentinel-2 images and collected ground data from different fields using hand held sensors for chlorophyll and NDVI measurements from 2 different years. In general, the topic is current, interesting and the manuscript has adequate information for methodology and results. However, the ground observation number is not suitable for such analysis considering the different varieties, years and fields. Moreover, I have the following specific comments:

R: The ground observation number can be considered suitable as this work is a proof of concept (Manuscript lines 468-469) aiming mainly at opening new research venues (lines 123-125). In addition, small-sized samples are expected to be a rule not an exception in agricultural sciences, particularly at demonstration or pilot levels requiring commercial fields (lines 592-593).

L40: This sentence does not add value. It is just defining the word forecast by using the same word. Please revise.

R: the sentence was edited (Manuscript lines 41-42)

L41: (simplifies), I suggest to replace it with the word support.

R: replaced as suggested (Manuscript line 42)

L71: I don't think that it is possible to estimate corn yield 4 months before harvesting. Please revise this information.

R: The Reviewer was right. There were typos in previous version, we fixed the errors as shown on lines 71-74 of the manuscript.

L228: How did you apply the PROSAIL model to retrieve the fAPAR?

R: We edited the text for a better reading, and the use of PROSAIL model was better described in the manuscript on lines 228-245.

L252-261: Move this part to your introduction section.

R: since this are short specific descriptions of the algorithms used in the work, we consider they should remain under Materials and methods”.

L280: The number of observations is very low. I suggest to collect more ground observations as it is not possible to depend on these current results.

R: The ground observation number can be considered suitable as this work is a proof of concept (Manuscript lines 468-469) aiming mainly at opening new research venues (lines 123-125). In addition, small-sized samples are expected to be a rule not an exception in agricultural sciences, particularly at demonstration or pilot levels requiring commercial fields (lines 592-593).

L285-306: This information is not related to your methodology and elaborating more in a well known information. I suggest reducing this part and moving it to the introduction section.

R: since the information is about the description of the model evaluation metrics implemented in the study as can be seen in the produced tables and figures, the information should remain under Materials and methods.

L336 Please define where to find this information.

R: prior to arriving at the Table 2, the information in question would have been read in paragraph above.

For all figures: Please maintain a consistent style for figure legends, X and Y labels and values.

R: each figure is a different graphic representation and is therefore a different figure. Yet, we tried to be as consistent as we could.

________________________________________

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachments
Attachment
Submitted filename: Response to Reviewers.docx
Decision Letter - Jie Zhang, Editor

PONE-D-20-28051R1

Machine learning models based on remote and proximal sensing as potential methods for in-season biomass yields prediction in commercial sorghum fields

PLOS ONE

Dear Dr. Habyarimana,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please address the over fitting issue raised by reviewer 1, in particular how the linear model parameters are obtained.

Please submit your revised manuscript by Apr 22 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Jie Zhang

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments (if provided):

Reviewer 1 has a valid point on the fact that less number of samples the inputs can have a serious impact on the model generalisation, i.e. the over fitting problem. This is especially the case for linear model if the conventional MLR is used for finding model parameters. To cope this problem, principal component regression, or partial least square regression, or ridge regression should be used. The authors need to clarify how they obtained the linear model parameters.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: No

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors didn't address my concerns about the model overfitting issue at all. Instead of rebuttal using statement in the manuscript (which have no proof/reference, such as '15-day intervals and averages reduced the correlation between inputs'), please provide statistical evidence that your model is not overfitting, such as training accuracy and testing accuracy when you do machine learning modeling. or at least provide "adjusted R-square" which is adjusted for the number of parameters, or ANOVA test to proof the independence of your inputs, or at least correlation scatter plots between inputs. But any way, 24 parameters for 23 observations are way too many to avoid overfitting. Please read this post by Jim Frost for reference https://statisticsbyjim.com/regression/overfitting-regression-models/.

Reviewer #2: Authors replied to my comments, improved their manuscript and I have no further comments.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Revision 2

A copy of the Responses to Reviewers was uploaded in the system but, below are the responses as diretly entered to the attention of the Journal office.

Additional Editor Comments (if provided):

Reviewer 1 has a valid point on the fact that less number of samples the inputs can have a serious impact on the model generalisation, i.e. the over fitting problem. This is especially the case for linear model if the conventional MLR is used for finding model parameters. To cope this problem, principal component regression, or partial least square regression, or ridge regression should be used. The authors need to clarify how they obtained the linear model parameters.

RESPONSE: We agree with the Editor on these issues of overfitting and multicollinearity, particularly in the linear model. Usually a few options are available in order to minimize the effect of multicollinearity and these can include: using models that are resilient to sizeable between-feature correlations, such as PLS, PCA, and other machine learning algorithms least squares. Alternatively, as we did in this work, we can identify and remove those features that contribute the most to the between-feature correlations.

All models implemented in this study were multicollinearity resistant except the linear model [60] which therefore required additional measures to avoid overfitting (Lines 293-301). In the linear model, in order to minimize the effect of collinearity, solve the issue of the number of predictor variables being greater than the sample size, estimate all parameters including the constant, and finally be able to fit the overall model during the training stage, we opted to reduce the number of predictors by using an algorithm to remove a subset of those features involved with the most high pairwise correlations such that the sample (training set) size is two more than the number of predictors (allowing for a residual degree of freedom), and all of the remaining pairwise Pearson correlation coefficients are below a 0.90 threshold [60].

Reviewer #1: The authors didn't address my concerns about the model overfitting issue at all. Instead of rebuttal using statement in the manuscript (which have no proof/reference, such as '15-day intervals and averages reduced the correlation between inputs'), please provide statistical evidence that your model is not overfitting, such as training accuracy and testing accuracy when you do machine learning modeling. or at least provide "adjusted R-square" which is adjusted for the number of parameters, or ANOVA test to proof the independence of your inputs, or at least correlation scatter plots between inputs. But any way, 24 parameters for 23 observations are way too many to avoid overfitting. Please read this post by Jim Frost for reference https://statisticsbyjim.com/regression/overfitting-regression-models/.

RESPONSE: We thank the Reviewer as his remarks helped us better rephrase and explain how we prepared the data and implemented the models (Lines 293-301). All models implemented in this study were multicollinearity resistant except the linear model [60] and measures to avoid overfitting were applied. For instance, the “one standard error” rule of Breiman et al. [61] was used, and the methods’ built-in features were invoked to automatically select features, tune hyperparameters to the data set, and select the best final model for the downstream validation step. In the process of data preparation, zero-variance features were removed and those remaining were centered and scaled in order to avoid features with zero or near-zero variance which can behave like second intercepts [63]. In the linear model, in order to minimize the effect of collinearity, solve the issue of the number of predictor variables being greater than the sample size, estimate all parameters including the constant, and finally be able to fit the overall model during the training stage, we opted to reduce the number of predictors by using an algorithm to remove a subset of those features involved with the most high pairwise correlations such that the sample (training set) size is two more than the number of predictors (allowing for a residual degree of freedom), and all of the remaining pairwise Pearson correlation coefficients are below a 0.90 threshold [60].

Several metrics of model performance were provided throughout the manuscript e.g., in Figures 5 and 7 for the model calibration stage, in Figures 6 and 8 and Table 3 for the validation stage.

Attachments
Attachment
Submitted filename: Response to Reviewers.pdf
Decision Letter - Jie Zhang, Editor

Machine learning models based on remote and proximal sensing as potential methods for in-season biomass yields prediction in commercial sorghum fields

PONE-D-20-28051R2

Dear Dr. Habyarimana,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Jie Zhang

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

The authors have adequately addressed the reviewers and editor's comments and the revised manuscript can be accepted.

Reviewers' comments:

Formally Accepted
Acceptance Letter - Jie Zhang, Editor

PONE-D-20-28051R2

Machine learning models based on remote and proximal sensing as potential methods for in-season biomass yields prediction in commercial sorghum fields

Dear Dr. Habyarimana:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Jie Zhang

Academic Editor

PLOS ONE

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .