Peer Review History

Original SubmissionJuly 16, 2019
Decision Letter - Rob J. De Boer, Editor, Cecile Viboud, Editor

Dear Dr Ciavarella,

Thank you very much for submitting your manuscript 'Deriving fine-scale models of human mobility from aggregated origin-destination flow data' for review by PLOS Computational Biology. Your manuscript has been fully evaluated by the PLOS Computational Biology editorial team and in this case also by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the manuscript as it currently stands. While your manuscript cannot be accepted in its present form, we are willing to consider a revised version in which the issues raised by the reviewers have been adequately addressed. We cannot, of course, promise publication at that time.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

Your revisions should address the specific points made by each reviewer. Please return the revised version within the next 60 days. If you anticipate any delay in its return, we ask that you let us know the expected resubmission date by email at ploscompbiol@plos.org. Revised manuscripts received beyond 60 days may require evaluation and peer review similar to that applied to newly submitted manuscripts.

In addition, when you are ready to resubmit, please be prepared to provide the following:

(1) A detailed list of your responses to the review comments and the changes you have made in the manuscript. We require a file of this nature before your manuscript is passed back to the editors.

(2) A copy of your manuscript with the changes highlighted (encouraged). We encourage authors, if possible to show clearly where changes have been made to their manuscript e.g. by highlighting text.

(3) A striking still image to accompany your article (optional). If the image is judged to be suitable by the editors, it may be featured on our website and might be chosen as the issue image for that month. These square, high-quality images should be accompanied by a short caption. Please note as well that there should be no copyright restrictions on the use of the image, so that it can be published under the Open-Access license and be subject only to appropriate attribution.

Before you resubmit your manuscript, please consult our Submission Checklist to ensure your manuscript is formatted correctly for PLOS Computational Biology: http://www.ploscompbiol.org/static/checklist.action. Some key points to remember are:

- Figures uploaded separately as TIFF or EPS files (if you wish, your figures may remain in your main manuscript file in addition).

- Supporting Information uploaded as separate files, titled Dataset, Figure, Table, Text, Protocol, Audio, or Video.

- Funding information in the 'Financial Disclosure' box in the online system.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see here

We are sorry that we cannot be more positive about your manuscript at this stage, but if you have any concerns or questions, please do not hesitate to contact us.

Sincerely,

Cecile Viboud

Associate Editor

PLOS Computational Biology

Rob De Boer

Deputy Editor

PLOS Computational Biology

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In “Deriving fine-scale models of human mobility from aggregated origin-destination flow data”, the authors use aggregated mobility data to develop a finer spatial scale model of mobility. The authors build off of the two most commonly applied spatial interaction models (gravity model and radiation model) and use those frameworks to estimate travel at a finer spatial scale. The authors use population and distance data to estimate these travel patterns (down to 1 km). Using previously published mobility data calculated using mobile phone calling records, the authors first fit spatial interaction models to the origin-destination trip counts. The authors the use these origin-destination trip counts to assume that the sum of travel on finer spatial scales is equal to the aggregate. By assuming the finer scale data is equal to the sum of the aggregate, they then fit the same spatial interaction models to these data. Overall, the analysis presented is adequate but would be greatly strengthened by 1) elaborating on the implications and interpretation of the models fit, 2) providing quantitative results of the implications of these finer scale models, and 3) demonstrating the implications for a biological purpose. The primary concern for the methods is the omission (not the fault of the authors) of within administrative unit travel and the implications on the parameters estimated.

Major Comments:

a) The authors do not analyze any travel within a spatial unit. Although the authors state that the within administrative unit data is unavailable (based on the published work where the authors obtained these data), this omission (which is not the fault of the authors) may be biasing estimates from the models. In particular, the role of distance and the estimated parameter (since very short distances – or distances of length 0) may be impacted. On the larger spatial scale of the data, this omission may be less relevant since travel is likely to occur over larger distances (given the administrative unit geography). However, this assumption (that within spatial unit travel can be omitted from the analysis), may be less valid for the finer spatial units analyzed. For example, on the 1km grid scale, very local travel is highly likely since these represent very close and small spatial units. Including data on these scales would likely down weight the distance parameter. However, travel patterns on these scales are not included since they were not included in the originally published data set. Hence the authors cannot state that they are estimating travel between these fine spatial units when actually they are estimating travel between fine spatial units assuming that travel only occurs between large spatial units. This fact underpins the entire analysis, yet the authors do not carefully and clearly address these points in the analysis.

b) The assumption in the analysis is that the same factors such as population size and distance are driving the heterogeneity of travel for finer spatial units. For example, if X trips occur between two districts (or aggregated administrative unit), then the assumption is that the number of trips (that will sum to X) that occur from the grid cells comprising those districts are also driven by population size and distance factors. Since trip data on finer spatial units is unavailable to directly fit parameter estimates, it is unclear if these factors or other factors are the most relevant data sets.

c) The authors should provide additional support and analyses of the model fits. Currently, enough detail is not provided. In particular, a more thorough comparison and investigation of the estimated model fits versus the data should be provided since it forms the primary result.

d) In general, the conclusions do not appear to be the same for Kenya and Namibia. They find opposite results in Kenya and Namibia but do not discuss the implications of these findings further.

e) And in particular, the finer spatial scale models do not perform better in Namibia. The authors should provide additional analyses and interpretation of this finding. Further, in the discussion the authors discuss the sparseness of the population distribution on finer scales. It should be possible to analyze this hypothesis further to provide quantitative support. It is unfair to state in the abstract and results that the finer-spatial models outperformed the aggregate models.

f) The authors do not provide any interpretation of the model parameters or the differences on various spatial scales. Given that this is the one of the primary results of the manuscript, additional analyses and interpretation should be provided.

g) The authors do not provide any implications for biological applications. Although the overall motivation is to study infectious disease dynamics, they do not provide any results that evaluate the impact of different models on disease dynamics. For example, the radiation model may outperform the gravity model in some instances, however these differences may not be relevant for disease transmission given the life history of the pathogen.

h) In the gravity model, the authors have added additional parameters (such as 10^kappa – gamma*epsilon) but do not provide additional support or interpretation for the added these parameters.

Minor Comments:

a) The authors should provide higher resolution figures since the current versions, although interpretable, are blurry to view.

b) There are a number of typos in the manuscript particularly regarding verb tense.

c) It is unclear exactly how the bottom row of figures in Figure S3 was calculated.

d) The authors should provide code to reproduce their analysis and the predicted model output.

Reviewer #2: In the proposed article, the authors fit different models (gravity and radiation models) to CDR data at different levels of spatial resolution using datasets from Kenya and Namibia. The suitability of often-used gravity models is only partially understood and appears to depend on spatial resolution and across locations. This paper provides an approach to explore these questions. I only have a few comments.

The appropriateness of the different model formulations depends on the quality of the CDR data used to fit the models. This appears particularly problematic for the Kenyan dataset, as there was no direction data available for the flows. The authors come up with an approach that sums over both directions, however, the uncertainty generated by such an approach was not clear to me. As a sensitivity analysis, the authors could aggregate the Namibian dataset to make an equivalent to the Kenyan one (ie with no direction information) and use the same approach. This would at least provide some insight as to the consistency of the results and the potential pitfalls of losing direction information.

It was unclear how the authors dealt with within-unit travel – in particular, what distance did they use in the gravity models (e.g., distance of 0, 0.5 x cell width, population weighted average based on LandScan……)? – I could imagine this could be quite important to model fit as most travel with be within the same unit.

It was difficult to assess model fit – the summary measures do not explore the fit of individual district pairs. The authors could plot the observed number of trips between each district pair with that predicted from the different models (aggregating the smaller units where necessary). That would allow readers to assess whether there are any systematic differences.

Relatedly, it would also be good to understand the robustness of the fit to held-out datasets (e.g., fitting in one part of the country and estimating in another)

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: No: The authors do not provide their code and the model output.

Reviewer #2: No: Not provided directly but available from previous publications

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Revision 1

Attachments
Attachment
Submitted filename: Response to Reviewers.docx
Decision Letter - Rob J. De Boer, Editor, Cecile Viboud, Editor

Dear Ms. Ciavarella,

We are pleased to inform you that your manuscript 'Deriving fine-scale models of human mobility from aggregated origin-destination flow data' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Cecile Viboud

Associate Editor

PLOS Computational Biology

Rob De Boer

Deputy Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #2: The authors have done a very impressive, comprehensive reanalysis of the data, incorporating the concerns of the reviewers. The paper now does a good job of discussing the complexities of using simple models to fit data, including the limitations. I have no further comments.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Formally Accepted
Acceptance Letter - Rob J. De Boer, Editor, Cecile Viboud, Editor

PCOMPBIOL-D-19-01187R1

Deriving fine-scale models of human mobility from aggregated origin-destination flow data

Dear Dr Ciavarella,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Alice Ellingham

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .