Batch-produced, GIS-informed range maps for birds based on provenanced, crowd-sourced data inform conservation assessments

Ryan M. Huang; Wilderson Medina; Thomas M. Brooks; Stuart H. M. Butchart; John W. Fitzpatrick; Claudia Hermes; Clinton N. Jenkins; Alison Johnston; Daniel J. Lebbin; Binbin V. Li; Natalia Ocampo-Peñuela; Mike Parr; Hannah Wheatley; David A. Wiedenfeld; Christopher Wood; Stuart L. Pimm

doi:10.1371/journal.pone.0259299

Peer Review History

Original SubmissionJuly 23, 2021
28 Sep 2021 Decision Letter - Tim A. Mousseau, Editor PONE-D-21-23994Batch-produced, GIS-informed range maps for birds based on provenanced, crowd-sourced data inform conservation assessmentsPLOS ONE Dear Stuart, At long last, both reviews are in! Congratulations! Both the reviewers and I strongly feel this represents a very valuable (and useful) addition to the literature. However, there were a few small suggestions that probably should be tended to. Hence the "minor revision". But these are largely cosmetic and should not cause too much grief. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Nov 12 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Tim A. Mousseau Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. We note that Figure(s) 1 , in your submission contain [map/satellite] images which may be copyrighted. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright. We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission: 1. You may seek permission from the original copyright holder of Figure(s) [#] to publish the content specifically under the CC BY 4.0 license. We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text: “I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.” Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission. In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].” 2. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only. The following resources for replacing copyrighted map figures may be helpful: USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/ The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/ Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.html NASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/ Landsat: http://landsat.visibleearth.nasa.gov/ USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/# Natural Earth (public domain): http://www.naturalearthdata.com/ 3. Thank you for stating the following in the Acknowledgments Section of your manuscript: "Funding: The authors thank their home institutions for providing support" We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: "The authors thank their home institutions for providing support" Please include your amended statements within your cover letter; we will change the online submission form on your behalf Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ******** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ****** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ****** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ****** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The authors of this manuscript have produced a useful tool for generating provenanced range maps so that they can be used to inform conservation assessments. However, despite the apparent utility of the tool to batch process large numbers of species, the authors repeatedly mention the need for closer inspection of the data in order to filter out incorrect records. The title and the abstract both emphasise the batch processing aspects, but much of the main text refers to the need for careful inspection of the data, so it is not clear how much of the process is automated and how much additional time is needed to manually correct data. It would be good of the authors could provide some estimates of how much time it takes to collate the data sets, how long it takes the protocol to do the batch processing, and how much time has to be spent manually checking and correcting data. The authors do admit to the concerns about the quality and reliability of the eBird data (lines 102-107) and explain that the protocol does allow users to check each data point to make decisions about its veracity. However, I feel that the authors have under-stated this issue and it may be a significant issue in other parts of the world with many amateur observers who use eBird as their personal listing software and who are not careful about recording the exact location of each observation and instead use it to record their observations over a much larger area than actually indicated. A stronger caveat on the reliability and quality of the data is required. It is not clear from the analysis to what extent the authors of the paper have inspected all the outlier records to check on their validity, as this is not stated I would assume not. Hence the results showing significant differences between the published ranges and the Areas of Habitat may be due to the inclusion of incorrect records and this needs to be clearly indicated. However, that said, it is clear from the results obtained that there are a number of cases where these differences warrant a re-evaluation of the published ranges. But it would help if the authors were a bit more up-front about the issues of using eBird (and other citizen science data – e.g. the iNaturalist data uploaded to GBIF may even be worse because of issues concerning how records are assigned ‘research grade’ status). Applying the methodology to birds in other habitats clearly has its limitations as indicated by the authors (lines 384-416). It is even more limited for other taxonomic groups given the patchy nature of records held by other citizen science initiatives. The taxonomic issues (lines 429-444) will pose problems for all taxonomic groups and only the adoption of a common taxonomy would resolve this (see Garnett et. al. 2020 - https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3000736). Despite the limitations it is still a useful proof of concept and hopefully it will be easy to modify the code to allow users to add additional data sources in the future (as indicated in lines 374-382). A minor point – on line 73, “Extent of Occurrence” is used, however, the IUCN Red List Categories and Criteria (https://portals.iucn.org/library/node/10315) and the Guidelines for Using the IUCN Red List Categories and Criteria (https://nc.iucnredlist.org/redlist/content/attachment_files/RedListGuidelines.pdf) both use “extent of occurrence”. The authors are also to be congratulated on creating a tool that produces interactive HTML maps allowing users to zoom in on details and the opportunity to check individual data points and their metadata, etc. This kind of functionality helps to increase user confidence and certainly adds to the transparency of the process. A clear advantage of the protocol is its transparency and that it is explicit about what we know and what we don’t know. I support the publication of this manuscript with some minor corrections to expand on the underlying caveats concerning the use of the eBird data, how automated the process really is, and how easy it can be extended to other taxonomic groups and data sources. Reviewer #2: # Summary to author As someone who has recently been working a lot on refining Red List/BirdLife range maps to Area of Habitat, I think anything that makes the process easier and more repeatable is a good thing. Thus, I like the underlying tools of this paper, particularly the transparency of the map-making process and its ability to be refined. The paper is generally well written and the examples given are informative. I really like the integration of eBird data - this is a great idea, and a huge advance. Where I think the paper currently falls down is the justification for certain limitations of the study. In particular, I don’t follow the restriction to forest-dwelling species. I understand that forest cover data is widely available globally, at a high resolution, but why is that strictly necessary? Why didn’t the authors include all habitat types using, for example, the 2020 Jung et al. IUCN habitat types map? This would of course only give habitat presence/absence in a cell rather that any continuous measure, but then it could be applied at a much larger scale, to all species. I am not requesting that the authors completely re-do this analysis, but rather that they better justify in the text why the focus is on forest species, and how this limitation can/will be overcome in the future. There is some discussion of this point (e.g. lines 89-91), but I think it is insufficient at present. My other comments are relatively minor, please see below. I think this is a very useful contribution to the field! # Methods Lines 114-118: I’m not entirely following this. You identified open habitat species using BirdLife habitat descriptions? And had made the decision a priori to exclude them because they occupy low forest cover habitat? Or you identified that you should exclude these species based on the discovery that they occupy low forest cover habitat? Lines 129-131: this doesn’t make it clear what you ultimately did with these 99 species Lines 138-139: can you add a citation for eBird’s best practices? Line 144: I think more upfront explanation on what an alpha hull actually is (and why you used it) would be useful. Probably just move lines 224-227 up to here. Line 167: be explicit in what these four values are (your min & max based on GLOBE, plus the Red List min and max?) Lines 172-173: why 75%? I presume this is an arbitrary value that can be changed by the user? Lines 178-179: this is crucial, so I think it should be more detailed. Or maybe have a flowchart? If I am following this correctly, the process is: (1) the alpha hull is created based on the range and occurrence points? (N.B. I don’t know what “median inter-presence distance” as the alpha actually means). (2) Min/max elevation and forest cover are identified based on distribution of occurrence points. (3) Alpha is refined to forest cover and elevation that falls within the range identified by (2)? (4) Within this AOH, you identify cells that are potentially occupied/unoccupied based solely on their proximity to presence/absence points, not anything to do with the environmental characteristics of that cell? Line 184: what does “previously calculated” mean? By whom? You or the Red List? Or both, as in step (2) above? Lines 185-186: I’m not sure what this means. Species would not be included on the checklist in cells where the habitat is thought to be unsuitable? Is the point that presences outside of range polygon can be included, while absences cannot (because they wouldn’t appear on the checklist)? # Results Line 230: many of the circle points also show a wide scatter Lines 239-241: perhaps you should be more careful in the way the “AOH 12% larger than published ranges” finding is presented, since it includes the potentially unsuitable areas as well. It seems to me that the “potentially suitable AOH” is more accurate, since it is more strongly informed by actual observations. Probably (?) a lot of the “potentially unsuitable AOH” is indeed unsuitable but for reasons we can’t pick up through the environmental layers we currently have, only through repeatedly not observing the species there. Lines 304-315: very interesting examples. Really exemplifies the utility of this approach. # Discussion Lines 3421-343: as above, I think more caution should be added to this point. Lines 391-393: this starts to get at my concern discussed above re focus on forest species, but I’m not quite convinced. Surely you could limit by habitat, it would just be binary (habitat is present/absent in that cell) rather than a continuous measure like forest cover? But that will surely be true for a long time to come, because any continuous measure of non-forest “quality” is hard to measure remotely? Basically, I am concerned that we will be neglecting many non-forest species globally and for a long time if we ignore them until we have data as good as we have for forest species. Lines 434-438: I would suggest adding some kind of text ref here before the numeric “[1]”, so that it is easier to read Lines 457-458: could you add a very brief summary of the supplementary findings here for the lazy/busy reader? # Figures and tables As a general point, the figures are a bit low res at present. Please ensure they are provided at a higher resolution for publication. Figure 1: in the legend it refers to blue lines, but I can’t really make them out in the figure. Figure 3: please explain the solid black dots in more detail, in both the figure legend and its caption. ****** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review?** For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. https://doi.org/10.1371/journal.pone.0259299.r001
Revision 1
14 Oct 2021 Author Response Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. We note that Figure(s) 1 , in your submission contain [map/satellite] images which may be copyrighted. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright. We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission: 1. You may seek permission from the original copyright holder of Figure(s) [#] to publish the content specifically under the CC BY 4.0 license. We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text: “I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.” Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission. In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].” 2. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only. The following resources for replacing copyrighted map figures may be helpful: USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/ The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/ Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.html NASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/ Landsat: http://landsat.visibleearth.nasa.gov/ USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/# Natural Earth (public domain): http://www.naturalearthdata.com/ The original figure attributions were incorrect and the basemaps were derived from USGS data and the captions now reflect the correction. The only exception is Figure 2 which uses an ESRI basemap for which ESRI provides permission for reproduction for research purposes: https://www.esri.com/en-us/legal/copyright-trademarks 3. Thank you for stating the following in the Acknowledgments Section of your manuscript: "Funding: The authors thank their home institutions for providing support" We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: "The authors thank their home institutions for providing support" Please include your amended statements within your cover letter; we will change the online submission form on your behalf We have now removed all funding references from the Acknowledgements section Reviewer #1: The authors of this manuscript have produced a useful tool for generating provenanced range maps so that they can be used to inform conservation assessments. However, despite the apparent utility of the tool to batch process large numbers of species, the authors repeatedly mention the need for closer inspection of the data in order to filter out incorrect records. The title and the abstract both emphasise the batch processing aspects, but much of the main text refers to the need for careful inspection of the data, so it is not clear how much of the process is automated and how much additional time is needed to manually correct data. It would be good of the authors could provide some estimates of how much time it takes to collate the data sets, how long it takes the protocol to do the batch processing, and how much time has to be spent manually checking and correcting data. The authors do admit to the concerns about the quality and reliability of the eBird data (lines 102-107) and explain that the protocol does allow users to check each data point to make decisions about its veracity. However, I feel that the authors have under-stated this issue and it may be a significant issue in other parts of the world with many amateur observers who use eBird as their personal listing software and who are not careful about recording the exact location of each observation and instead use it to record their observations over a much larger area than actually indicated. A stronger caveat on the reliability and quality of the data is required. It is not clear from the analysis to what extent the authors of the paper have inspected all the outlier records to check on their validity, as this is not stated I would assume not. Hence the results showing significant differences between the published ranges and the Areas of Habitat may be due to the inclusion of incorrect records and this needs to be clearly indicated. However, that said, it is clear from the results obtained that there are a number of cases where these differences warrant a re-evaluation of the published ranges. But it would help if the authors were a bit more up-front about the issues of using eBird (and other citizen science data – e.g. the iNaturalist data uploaded to GBIF may even be worse because of issues concerning how records are assigned ‘research grade’ status). The filtering methodology described in the paper is the result of spending a significant amount of time reviewing and investigating data for potential outliers. Together with eBird’s existing review process, the data are of higher quality than the reviewer implies here. As a result, the methods presented allow for batch processing. The issue of individual review is dependent on the ultimate use of the maps. Individual review may not be needed when aggregating result across many species, but applications focused on a single or few species will want a detailed review process, which our transparent methodology allows for. We have now included text that says so. The science behind iNaturalist was conceived in the lab that produced this current paper and we remain keenly interested in its development. In addition, we are currently examining eBird, iNaturalist, and GBIF data for projects that naturally emerge from this effort. Our experiences are that mapping species distributions in the ways we do here is often the best way to identify outliers and check their plausibility. As we stress continually, transparency — the ability to check who claimed which species, where, and when is vital to such efforts. Applying the methodology to birds in other habitats clearly has its limitations as indicated by the authors (lines 384-416). It is even more limited for other taxonomic groups given the patchy nature of records held by other citizen science initiatives. The taxonomic issues (lines 429-444) will pose problems for all taxonomic groups and only the adoption of a common taxonomy would resolve this (see Garnett et. al. 2020 - https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3000736). Despite the limitations it is still a useful proof of concept and hopefully it will be easy to modify the code to allow users to add additional data sources in the future (as indicated in lines 374-382). We agree! Another motivation of this effort is its shining light on distributions that in turn inform taxonomic issues. In doing so, sometimes records show geographically distinct populations, but in others, continuous distributions for which more detailed studies might show clinal variation. A minor point – on line 73, “Extent of Occurrence” is used, however, the IUCN Red List Categories and Criteria (https://portals.iucn.org/library/node/10315) and the Guidelines for Using the IUCN Red List Categories and Criteria (https://nc.iucnredlist.org/redlist/content/attachment_files/RedListGuidelines.pdf) both use “extent of occurrence”. Changed to lower case The authors are also to be congratulated on creating a tool that produces interactive HTML maps allowing users to zoom in on details and the opportunity to check individual data points and their metadata, etc. This kind of functionality helps to increase user confidence and certainly adds to the transparency of the process. A clear advantage of the protocol is its transparency and that it is explicit about what we know and what we don’t know. We appreciate the kind words and are very happy that the reviewer acknowledges the value of transparency! I support the publication of this manuscript with some minor corrections to expand on the underlying caveats concerning the use of the eBird data, how automated the process really is, and how easy it can be extended to other taxonomic groups and data sources.\\ We have added the relevant caveats and explanations Reviewer #2: # Summary to author As someone who has recently been working a lot on refining Red List/BirdLife range maps to Area of Habitat, I think anything that makes the process easier and more repeatable is a good thing. Thus, I like the underlying tools of this paper, particularly the transparency of the map-making process and its ability to be refined. The paper is generally well written and the examples given are informative. I really like the integration of eBird data - this is a great idea, and a huge advance. Where I think the paper currently falls down is the justification for certain limitations of the study. In particular, I don’t follow the restriction to forest-dwelling species. I understand that forest cover data is widely available globally, at a high resolution, but why is that strictly necessary? Why didn’t the authors include all habitat types using, for example, the 2020 Jung et al. IUCN habitat types map? This would of course only give habitat presence/absence in a cell rather that any continuous measure, but then it could be applied at a much larger scale, to all species. I am not requesting that the authors completely re-do this analysis, but rather that they better justify in the text why the focus is on forest species, and how this limitation can/will be overcome in the future. There is some discussion of this point (e.g. lines 89-91), but I think it is insufficient at present. We are exploring alternatives in current work to improve upon exactly this point. The idea of matching each observation to the mapped habitat at that location is indeed seductive and one of the author’s has done so in a paper in press at Conservation Biology (https://www.biorxiv.org/content/10.1101/2021.06.08.447053v1). Nonetheless, our experiences are (1) the habitat classification categories are too coarse, (2) when mapped, grasslands and other open area habitats are still subject to errors in accuracy, and (3) that even when one gets past these issues, the connection to species preferences is difficult. Jung et al. include one of this paper’s authors and we are familiar with it. (1) Many of the habitat classification/global land cover data, such as the Jung et al or ESA CCI maps, divide the land into a relatively few number of categories. Instead, using a continuous scale of forest cover allows us to create maps of species ranges at a finer resolution which we believe is appropriate for what we consider the first version of a proof of concept. (2) Accurate remote sensing and mapping of grasslands and their threats is difficult. Even by Jung et al’s own validation, their accuracy for many grassland and shrubland level 2 habitats is only ~50%. We do not intend for this to be a limit to our methodologies, but the resolution of such a challenge is beyond the scope of this paper. (3) In current work, we are examining the habitat choices of ~100 endemic species that occur across a wide range of habitats. We ask which species occurrences are in habitats more often than one would expect by chance, given the habitat’s extent. One quickly sees that while there are some clear preferences, such habitats sometimes do not account for more than two-thirds of the species occurrences. So does one only include habitats in which the species occurs more often than by chance, or all habitats to include (say) 95% of the occurrences. The former has large errors of omission, the latter large errors of commission. Similarly, were we to use Jung et al.’s classification, would we use all habitats or only those top preferences? Given these complications, we find the rigorously quantified measurement of forest cover provides a simple way forward. It’s one for which we can readily explain the consequences of changing the thresholds we employ. Yes, we do not cover all the species — which is why we are exploring extensions to our methods in current work. At the moment, we have added some text in the Limitations mentioning errors of omission and commission in a habitat class matching approach. My other comments are relatively minor, please see below. I think this is a very useful contribution to the field! # Methods Lines 114-118: I’m not entirely following this. You identified open habitat species using BirdLife habitat descriptions? And had made the decision a priori to exclude them because they occupy low forest cover habitat? Or you identified that you should exclude these species based on the discovery that they occupy low forest cover habitat? Because of the batch nature of our methods, we ran all the species based on range size and listed forest habitat cover statistics. When we did so, it was immediately apparent that some species were outliers — most of them known from our personal knowledge. Miners, cinclodes, and helmetcrest hummingbirds were obvious examples. With that insight, we then went to the BirdLife classifications and used the first-mentioned major habitat to decide whether the species was predominantly a grassland or shrubland species. Lines 129-131: this doesn’t make it clear what you ultimately did with these 99 species Now clarified Lines 138-139: can you add a citation for eBird’s best practices? Now included Line 144: I think more upfront explanation on what an alpha hull actually is (and why you used it) would be useful. Probably just move lines 224-227 up to here. Moved and expounded Line 167: be explicit in what these four values are (your min & max based on GLOBE, plus the Red List min and max?) Done Lines 172-173: why 75%? I presume this is an arbitrary value that can be changed by the user? That is correct Lines 178-179: this is crucial, so I think it should be more detailed. Or maybe have a flowchart? If I am following this correctly, the process is: (1) the alpha hull is created based on the range and occurrence points? (N.B. I don’t know what “median inter-presence distance” as the alpha actually means). (2) Min/max elevation and forest cover are identified based on distribution of occurrence points. (3) Alpha is refined to forest cover and elevation that falls within the range identified by (2)? (4) Within this AOH, you identify cells that are potentially occupied/unoccupied based solely on their proximity to presence/absence points, not anything to do with the environmental characteristics of that cell? This is all correct. Figure 1 is intended to serve as a flow chart analog with a visual example and we believe a separate flowchart would be redundant. That being said, we have expounded on the alpha hull description for a clearer understanding for your N.B. Line 184: what does “previously calculated” mean? By whom? You or the Red List? Or both, as in step (2) above? Clarified in text Lines 185-186: I’m not sure what this means. Species would not be included on the checklist in cells where the habitat is thought to be unsuitable? Is the point that presences outside of range polygon can be included, while absences cannot (because they wouldn’t appear on the checklist)? That is not the correct interpretation and we’ve now tried to clarify in the text. We don’t count checklists in unsuitable habitat for absences, but we do for presences since we are more conservative about what constitutes a true absence, but have higher confidence in what is considered a true presence. # Results Line 230: many of the circle points also show a wide scatter We wrote “a wider scatter” — which is correct, but yes, all the points have a wide scatter. Lines 239-241: perhaps you should be more careful in the way the “AOH 12% larger than published ranges” finding is presented, since it includes the potentially unsuitable areas as well. It seems to me that the “potentially suitable AOH” is more accurate, since it is more strongly informed by actual observations. Probably (?) a lot of the “potentially unsuitable AOH” is indeed unsuitable but for reasons we can’t pick up through the environmental layers we currently have, only through repeatedly not observing the species there. The division in the two types of AOH are not based on “suitability” but rather potential occupancy. AOH by definition is all the area that appears to be suitable elevation and landcover and is originally defined in Brooks et al. 2019. For this reason we have chosen to keep the text unchanged. Lines 304-315: very interesting examples. Really exemplifies the utility of this approach. Thanks! # Discussion Lines 3421-343: as above, I think more caution should be added to this point. See above response Lines 391-393: this starts to get at my concern discussed above re focus on forest species, but I’m not quite convinced. Surely you could limit by habitat, it would just be binary (habitat is present/absent in that cell) rather than a continuous measure like forest cover? But that will surely be true for a long time to come, because any continuous measure of non-forest “quality” is hard to measure remotely? Basically, I am concerned that we will be neglecting many non-forest species globally and for a long time if we ignore them until we have data as good as we have for forest species. Exactly so! There are 15 open habitat species that are threatened, of which the white-bellied cinclodes is critically endangered. Our using that as an example is not just because Pimm has a photograph of it. Based on habitat maps, this species should have a large amount of habitat. Clearly most of that is unsuitable, so existing habitat maps are a poor guide. (The species also occurs more widely than previously known, showing the importance of crowd-sourced data.). Mapping grassland species from remote sensing requires considerable care, as Pimm’s work on the Cape Sable seaside sparrow shows. Such efforts are likely to be one-off, specialized maps, so the best we can do is to acknowledge existing limitations. Lines 434-438: I would suggest adding some kind of text ref here before the numeric “[1]”, so that it is easier to read We’ve now added the subject Lines 457-458: could you add a very brief summary of the supplementary findings here for the lazy/busy reader? We have added a couple of sentences to this point. # Figures and tables As a general point, the figures are a bit low res at present. Please ensure they are provided at a higher resolution for publication. Figure 1: in the legend it refers to blue lines, but I can’t really make them out in the figure. Made clearer Figure 3: please explain the solid black dots in more detail, in both the figure legend and its caption. Explained Attachments Attachment Submitted filename: Response to Reviewers.docx https://doi.org/10.1371/journal.pone.0259299.r002
18 Oct 2021 Decision Letter - Tim A. Mousseau, Editor Batch-produced, GIS-informed range maps for birds based on provenanced, crowd-sourced data inform conservation assessments PONE-D-21-23994R1 Dear Stuart, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Tim A. Mousseau Academic Editor PLOS ONE https://doi.org/10.1371/journal.pone.0259299.r003
Formally Accepted
3 Nov 2021 Acceptance Letter - Tim A. Mousseau, Editor PONE-D-21-23994R1 Batch-produced, GIS-informed range maps for birds based on provenanced, crowd-sourced data inform conservation assessments Dear Dr. Pimm: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Tim A. Mousseau Academic Editor PLOS ONE https://doi.org/10.1371/journal.pone.0259299.r004

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .