Peer Review History

Original SubmissionJanuary 22, 2021
Decision Letter - Anthony C Constantinou, Editor

PONE-D-21-02337

Explaining the difference between men's and women's football

PLOS ONE

Dear Dr. Pappalardo,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

This is a strong paper that shows that the traditional machine learning classifiers, which you could also argue they are not necessarily state-of-the-art machine learning techniques since they have been around for decades, can identify female and male teams with high accuracy based on the set of input variables specified. As suggested by the reviewers, the paper requires minor revision before it can be accepted for publication. Please refer to the reviewers' comments. 

If you decide to resubmit a revised version, please provide point-to-point responses to each of the comments made by the reviewers. In your response, please explain what revisions have been made to address each of the points raised by the reviewers. If a comment is not addressed, please justify this decision in your response to the reviewer.

Please submit your revised manuscript by May 22 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Anthony C Constantinou

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

4. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: I Don't Know

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I found this to be a very innovative and informative paper. As the authors note, comparisons of male and female soccer have only recently received empirical attention, and this study provides an informative overview of major differences in technical play, with a focus on spatio-temporal events, along with individual and collective performance. Findings are descriptively strong and make considerable (intuitive) sense and therefore help to build a foundation for a new line of research. I must admit that I do not have the expertise to evaluate the statistical analyses, so I hope that this is covered by other reviewers. But focusing on my expertise and what I can evaluate, I believe this paper is a very strong one.

Some suggestions for revisions are the following:

(1) I found the paper a real pleasure to read. At the same time, the paper’s readability can be improved if the authors use labels that help readers to immediately grasp the the meaning. For example, the indices of H, PR, and FC are not linked to any meaning, it seems. Why not use meaningful labels, which the authors do do for FC (Flow centrality). Also, H and FC are two indicators of “collective” performance. It was not clear to me whether they were correlated. For example, when I look at Table 3, it seems that H and FC are not strongly correlated, because there are not so many overlapping countries in the top 10.

(2) One sizeable differences between men and women is the # of fouls (and more free kicks among women than men). The authors do pay much attention to it, but this seems quite interesting. This allows for at least a bit more interpretation.

(3) The conclusions are straightforward. But I wonder whether it is desirable to provide a bit more discussion to the major findings, linking to the broader literature. What comes to mind is a body of literature examining the role differences in biological make-up between men and women or the role of “cognition” (e.g., executive functioning) that might be relevant to understanding performance in soccer (see research by Lot Verburgh et al., 2014, PlosOne).

(4) The paper needs to be checked on typos (e.g., length rather than length, fro rather than for, etc). Also, I noted that the Dutch name “Paul A.M. Van Lange” should be read as “Van Lange” (not “Lange”) both in the text and references.

Again, very nice paper.

Paul Van Lange

Reviewer #2: Review of manuscript PONE-D-21-02337

The manuscript reports a study attempting to identify variables that can distinguish between men’s and women’s football (soccer). The aim was to train an AI-model so that it can recognize whether a team is male or female. The results reveal differences between men’s and women’s football on several variables and the model, based on computed performance indicators from selected variables, was able to correctly identify a team’s sex in around nine out of ten cases.

I will not claim to be an expert on the specific methods used for building the model, so my comments are related to the choice of variables, the interpretation of the results, and the conclusions. In particular, I am concerned with the validity of the results, and consequently their usefulness.

I find the study interesting, especially the first part attempting to identify variables that differ between men’s and women’s football. I would have liked to see more of this information in the paper and not only as supplementary information (for example the heatmaps showing areas where free-kicks and shots were taken), and I would have liked to see more discussion related to these differences and their possible consequences.

According to the authors, “current studies focus on the physical features” (line 42), while their variables measure technical performance. However, some of the variables may be different between sexes due to other factors than technical abilities, and they may not be so one-dimensional as they may seem. In fact, there are several possible confounding variables that could compromise the interpretation of the results.

I will give a few examples below:

• Several of the variables that are defined as technical, are in fact highly dependent on physiological factors. For example, passing length, as well as shooting distance require (leg-) muscle strength and are also dependent on biomechanical factors that are different between the sexes, notably torques. Thus, the differences may be rather obvious, and they may not reveal any important information about technical performance.

• There are trade-offs between tactical- and technical variables such as for example pass length and pass accuracy. A team that plays longer passes may well use this as a strategy against teams that are more passing-oriented, or as a general strategy if the players are not so technically fluent. Also, a team that plays out from the back would generate plenty more passes, and also higher passing accuracy due to many passes being less risky, whereas a team that more often played out long from the goalkeeper would generate longer passes on average, and at the same time increase the risk, thus decrease the accuracy. Hence, the variable may be contaminated by differences in playing styles, regardless of sex.

• The average pass length, as well as the average shot distance, is not very much shorter in women (1 m, and 1.5 m, respectively) with considerable overlap as is evident from the standard deviations. This means that, although there is an average difference between the sexes, there are so large within-sex variations that the variable is rather poor at distinguishing between teams. For illustration, in the FIFA WC 2018, according to fbref.com the average shot distance varied between 22 m (Saudi Arabia) and 15 m (Serbia). 15 of the male teams had average shot distances below the female average (18.39 m, according to the present manuscript). In the Women’s FIFA WC 2019, the average shot distance varied between 25 m (Argentina) and 15 m (the Champions, USA). Six female teams had average shot distances above the male average of 19.99 m.

My biggest concern is with the validity of the results, thus what can be concluded from them, apart from the fact that it is possible, in most cases to identify a team by its sex. Exactly what is the model identifying? I am not completely convinced that it is performance quality, and that it is performance quality that differs between sexes.

The algorithm was generally able to categorize matches and teams by their sex (how were the matches selected; randomly?). However it made errors in around ten percent of cases.

A model is only as good as it’s variables, and variables that are used to conclude about differences in quality of performance, would need to be validated against actual performance, which is what we can deduct from Table 3. None of the three indicators seem particularly sensitive to sex differences, with both male and female teams among the top scorers. I would have liked to see the complete rankings, which I suspect may lend some explanation to the fact that the model sometimes mischaracterizes teams as belonging to the opposite sex.

For illustration, passing accuracy for the male teams in the most recent FIFA World Cup (still according to fbref.com), varied between 89% (Spain) and 60% (Iran), with the World Champions, France, at around 80%, and the runners up, Croatia, at 78%. In the women’s WC, passing accuracy varied between 83% (Japan) and 59% (Cameroon), with both the Champions, USA, and Netherlands, the runners up, at around 77%. It should also be noted that these numbers are dependent on the relative percentage of short, medium, and long passes for which the long passes in particular were less accurate.

More importantly, however, is the fact that the scores on the indicators used in the model are not very different between male and female teams (lines 139-171). Albeit the average may be different, there is considerable overlap. Furthermore, most of the highest scoring teams on the three variables were not performing particularly well in the tournament. Only one of the quarter-finalists (QF) for each sex (England, in both cases) were among the top ten scorers on the Havg, whereas two female QFs (USA and England), and no male QFs were among the top ten scorers on the FCavg. Some of the top scoring teams were even among the poorest performers in the tournament. It should be conceded that the PRavg, had five female QFs (USA, France, Germany, Sweden, England) among the top ten scorers, along with three male QFs (Belgium, Croatia, Russia), however PR-scores in general were not different between men and women (Fig. 2).

Thus, the model may not recognize male or female teams, but instead teams playing with a certain style. Furthermore, that playing style, whether by a male or a female team, may not be a valid indicator of the quality of performance. For these reasons, I would urge the authors to be much more prudent in their interpretation of their results, and in particular, their conclusions.

I apologize in advance for any errors, and trust that the authors notify the editor in such cases.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Paul Van Lange

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Revision 1

see attached file entitled "Revision R1.pdf"

Attachments
Attachment
Submitted filename: Revision R1 .pdf
Decision Letter - Anthony C Constantinou, Editor

Explaining the difference between men's and women's football

PONE-D-21-02337R1

Dear Dr. Pappalardo,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Anthony C Constantinou

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I very much appreciate the revisions and the explanations of the revisions. What was already a very strong paper -- with very informative and novel data -- has become even stronger. I believe the paper is ready to be published.

Reviewer #2: Review of revised manuscript PONE-D-21-02337R1

Meanwhile, I have read the revised manuscript, and find it much improved. The authors have done a good job with the revision and have amended many of the issues identified by both reviewers.

I appreciate the inclusion of the heatmaps and the figures depicting the distribution of actions across pitch zones, also that many statements have been moderated and the interpretations of the findings are generally more cautious.

I am still not convinced that the algorithm distinguishes between male and female teams based on performance quality, as is stated. This is due to issues regarding the variables included in the algorithm, mentioned in my previous review.

Regardless, I find the study interesting, and the overall discussion is relevant and interesting. Given that the procedures are better explained, and the variables better justified, together with the generally more modest conclusions, I believe that this discussion could now be moved to the general scientific community, as the paper would be publishable in its present form.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Formally Accepted
Acceptance Letter - Anthony C Constantinou, Editor

PONE-D-21-02337R1

Explaining the difference between men’s and women’s football

Dear Dr. Pappalardo:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Anthony C Constantinou

Academic Editor

PLOS ONE

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .