Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Gender disparities among authors of retracted publications in medical journals: A cross-sectional study

Abstract

Background

Gender disparities in scientific authorship are well documented, yet little is known about gender representation among authors of retracted publications.

Methods

We analyzed 878 retracted publications from 131 high-impact medical journals across nine clinical disciplines (anesthesiology, dermatology, general internal medicine, gynecology/obstetrics, neurology, oncology, pediatrics, psychiatry, and radiology). Gender was inferred using Gender API for all, first, and last authors. Two analytic samples were constructed based on prediction confidence thresholds (≥60% and ≥70%). We examined gender distribution across authorship positions, number of retractions per author, and disciplinary representation. Wilcoxon rank-sum and chi-squared tests were used to assess group differences. Gender proportions were compared with publication benchmarks from 2008–2017, restricting retraction data to the same period for comparability.

Results

Among 4,136 authors, 3,909 had full first names, and gender could be assigned to 3,865 (98.9%). In the sample with prediction confidence ≥60% (n = 3,743), 863 (23.1%) were identified as women. They accounted for 16.5% (123/747) of first and 12.7% (87/687) of last authors. They had significantly fewer retractions per author and were less likely to have >5 retractions (all authors: 3 women [8.1%] vs 34 men [91.9%], p < 0.001). Across most disciplines, their representation was below publication benchmarks. Dermatology (retractions = 80.0%, publications = 48.9–51.8%) and radiology (retractions = 40.0%, publications = 31.0-36.8%) were exceptions among first authors, while pediatrics (retractions = 50.0%, publications = 37.0%−42.6%) was an exception among last authors, though all based on small numbers.

Conclusions

Women are markedly underrepresented among authors of retracted publications, particularly in cases involving multiple retractions. Further research is needed to clarify underlying mechanisms.

Introduction

The retraction of scientific articles serves a crucial function in correcting the academic record and is widely regarded as a key mechanism for preserving the integrity of biomedical literature [14]. Retractions may result from a range of issues, including honest error, publisher mistakes, or research misconduct such as fabrication, falsification, and plagiarism [29]. Although retractions remain relatively rare [2,3,8,9], their impact can be substantial, particularly in high-impact medical journals where retracted findings may have already influenced clinical practice or policy.

A growing body of research has examined the characteristics of retracted publications, including country of origin [24,6,1012], discipline [3,4,6,11], and reason for retraction [29]. Fewer studies have explored the demographics of authors, particularly gender [7,1316], and those that did reveal important limitations in scope, methodology, and data completeness. To our knowledge, only three studies benchmarked gender distribution in retractions against overall publication output [7,15,16].

In a preliminary analysis of 438 retracted publications in medical journals, we found a marked underrepresentation of women among first and last authors, especially in misconduct-related cases [7]. This analysis was limited by a relatively small retraction sample and a mismatch in time frames, with retractions covering 2003–2022 and publication data only 2008–2017.

Two recent studies have extended this line of inquiry beyond medicine. Zheng et al. combined Web of Science (WoS) retraction and publication data with retraction reasons from Retraction Watch Database (RWD) and found that male authors generally had higher retraction rates than female authors, particularly for misconduct [15]. However, patterns varied by field: male authors experienced significantly higher retraction rates in biomedical and health sciences, as well as in life and earth sciences, whereas female authors had higher retraction rates in mathematics and computer science. The main analyses focused on first authors, but similar patterns were observed for corresponding authors. Yet, the study’s validity is limited by: (1) reliance on WoS, which is less comprehensive than RWD [1]; (2) restriction to first authors for the main analyses; (3) substantial missingness in gender attribution (only 53% of retracted articles and 77% of non-retracted articles), raising concerns about selection bias and representativeness; and (4) possible bias from a non-comparable reference group, as gender could be determined for a much higher proportion of non-retracted authors than for retracted authors. The low gender match rates reported by Zheng et al. are likely due to their use of a stringent ≥90% confidence threshold for gender inference—based on a combination of tools (Gender API, Genderize.io, and Gender Guesser)—which naturally leads to more names being labeled ‘unknown’ and excluded from analysis [17].

Maddi et al. combined OpenAlex publication data with retraction data from RWD and found that retraction risk varied by team composition: mixed-gender teams were more likely to face retractions than all-male or all-female teams, whereas individually authored publications were less likely to be retracted [16]. Larger teams had a lower likelihood of retraction, while medium-sized teams (3–10 authors) faced a higher risk. Retraction reasons also differed by gender, with male-led publications more often retracted for serious ethical violations and female-led publications more often retracted for procedural errors or updates in rapidly evolving fields. The study’s limitations include: (1) reliance on two data sources with different coverage, indexing practices, and metadata completeness, introducing potential systematic bias; (2) gender attribution being possible for only 67% of authors in the publication dataset (not reported for the retraction dataset), which may skew results; and (3) use of genderize.io without specifying the probability threshold, reducing reproducibility and interpretability.

Taken together, these studies suggest that women are underrepresented among authors of retracted publications, particularly in misconduct-related cases, mirroring broader patterns of gender disparity in research. The underrepresentation of women in science is well documented [1832]. They remain a minority in senior academic roles [21,23] and hold fewer authorship [18,19,23,32] and editorial leadership positions in scientific journals [31]. This imbalance may influence not only publishing opportunities but also exposure to scrutiny and patterns of retraction.

Differences in the indexing of retracted publications across major bibliographic databases complicate efforts to analyze retraction patterns [1,3336]. In a recent study, we demonstrated that while RWD outperformed PubMed and the WoS Core Collection in identifying retracted publications, none of the databases offered complete coverage [1]. The present study builds upon this prior work by using the same dataset of 878 retracted publications from 131 high-impact medical journals across nine clinical disciplines to examine gender disparities among authors of retracted articles.

We aim to quantify the gender distribution of all authors, first authors, and last authors of retracted publications, and to compare these distributions with established benchmarks for female authorship in biomedical literature [18]. We also assess whether gender differences are more pronounced in misconduct-related retractions and examine variation across medical specialties. By highlighting these patterns, our study contributes to the broader conversation on equity, transparency, and trust in scientific publishing.

Based on previous findings, we hypothesize that women are underrepresented among the authors of retracted publications. We further expect that this underrepresentation is more pronounced in misconduct-related retractions, and that gender disparities vary across medical disciplines. Finally, we hypothesize that the proportion of retracted publications with female first or last authors is lower than the baseline proportion of female authors observed in biomedical publishing at large.

Methods

Study design and objective

This cross-sectional study aimed to evaluate gender disparities among authors of retracted publications in high-impact medical journals. Specifically, we assessed the gender distribution of authors whose articles were retracted, both overall and in cases involving research misconduct. We analyzed gender representation among all authors, first authors, and last authors of retracted publications. We also assessed variation in gender representation across disciplines and compared the gender composition of retracted articles to known publication patterns in biomedical research.

This study builds on a previously developed dataset designed to compare the performance of RWD, PubMed, and the WoS Core Collection in identifying retracted publications in medicine [1]. The dataset included all retracted publications indexed up to December 15, 2024, in 131 high-impact medical journals across nine clinical disciplines: anesthesiology, dermatology, general internal medicine, neurology, obstetrics/gynecology, oncology, pediatrics, psychiatry, and radiology/nuclear medicine/medical imaging (Table 1). Journals were selected from Clarivate’s 2023 Journal Citation Reports (JCR) as the 15 with the highest impact factor per discipline. Overlaps were resolved by allowing journals to appear in two disciplines when appropriate, resulting in 131 unique journals.

thumbnail
Table 1. Journals included in the study, grouped by discipline and ranked by 2023 Journal Citation Reports (JCR) impact factor.

https://doi.org/10.1371/journal.pone.0335059.t001

Retraction data sources and extraction

Retractions were identified by searching three databases: RWD, PubMed, and the WoS Core Collection. Searches were conducted using journal names, ISSNs, and eISSNs. Retractions were included if indexed in any of the three databases. Retrieved records were cleaned and de-duplicated using PubMed IDs (PMIDs) and article titles. When PMIDs were unavailable, matching was done manually using article metadata. The final dataset represents the union of retractions retrieved from the three databases.

We used metadata from RWD to classify retractions as related or unrelated to research misconduct. Articles were classified as misconduct-related if the reason for retraction included fabrication or falsification of data, images, or results; plagiarism; manipulation of results or images; authorship fraud; fake peer review (i.e., submission of fabricated reviewer reports, often via falsified reviewer identities); salami slicing (i.e., division of one substantial study into multiple smaller publications to inflate output); use of paper mills (i.e., manuscripts produced by third-party organizations that sell fraudulent research); ethical violations; or sabotage of materials. The full list of criteria is available in the Supplementary Material (S1 Appendix). This classification method has been applied in prior studies investigating retraction causes [7,10]. Records with missing retraction reasons were excluded from misconduct-specific analyses but were retained for general gender analyses.

Gender assignment

Gender was inferred using Gender API (https://gender-api.com), a service that predicts binary gender (male/female) based on first names and provides a confidence score [37]. For each author, the first name was extracted. Authors with a single-letter first name were excluded, as were those for whom Gender API provided no prediction or a confidence score below the inclusion thresholds. Unisex or ambiguous names were automatically assigned lower confidence scores by Gender API and therefore frequently fell below our inclusion thresholds.

Two datasets were created based on gender assignment confidence: one including authors with confidence ≥60%, and another limited to confidence ≥70%. The ≥ 60% sample served as the basis for the main analyses, while the ≥ 70% sample was used for sensitivity checks. Analyses were conducted separately for both datasets, following the methodology of prior studies using algorithmic gender inference [7,18,20]. Gender was determined for all authors listed in each retracted article, as well as separately for first and last authors, using the full unprocessed names as they appeared in the dataset. To assess inference accuracy, we manually verified the gender classification for a random sample of 200 names and found no misclassifications.

To determine the number of unique authors, we standardized names to improve matching. Standardization involved (i) trimming leading/trailing spaces, (ii) removing punctuation, hyphens, and parentheses, (iii) converting to lowercase, and (iv) removing diacritical marks. Authors were then classified based on last and first names. We manually reviewed potential duplicates in which the same first and last name appeared with variations in intermediate names or initials, and considered them the same individual if they were affiliated with the same institution.

In total, 807 retracted articles included at least one author name. These articles contained 4,136 individual author entries, of which 3,909 included full first names (i.e., no initials). Gender could be assigned to 3,865 of these authors (98.9%). A total of 3,743 authors met the 60% confidence threshold, and 3,555 met the 70% threshold. The 4,136 authorships corresponded to 2,864 unique individuals, among whom 2,663 had full first names. Gender was assigned to 2,621 unique authors (98.4%), with 2,505 and 2,329 meeting the 60% and 70% confidence thresholds, respectively. The full anonymized dataset for the 2,663 unique authors is available as Supporting Information (S1 Data).

For first authorship analysis, we identified 772 retracted publications with first authors having full first names. Gender could be inferred for 767 of them, with 747 meeting the 60% and 721 the 70% threshold. For last authors, 701 had full first names, and gender could be assigned to 697 (687 at 60% and 669 at 70%). The high match rates observed for Gender API are consistent with previous research reporting that the proportion of non-classifications (‘naCoded’) can be as low as 0.34% [37].

Statistical analyses

We computed the proportion of male and female authors among all retracted publications, and separately for first and last authors. We repeated these calculations for the subset of retractions related to misconduct, allowing comparisons between overall retractions and misconduct-specific retractions. We also examined gender differences in the number of retracted publications per author.

To test differences in retraction volume by gender, we used the Wilcoxon rank-sum test to compare the median number of retracted publications per author between men and women. Authors were also grouped by number of retractions (1, 2–5, and >5), and gender distributions across these categories were compared using the chi-squared test.

We then stratified the dataset by clinical discipline and calculated the proportion of male and female first and last authors per specialty. These proportions were compared with data from a previously published study by Hart & Perlis, which analyzed gender representation in biomedical authorship across ten medical specialties for the years 2008–2017 [18]. Table 2 summarizes key parameters of our dataset alongside the Hart & Perlis data, providing context for comparing female authorship in retracted publications with its overall representation. For comparability across disciplines, we restricted our retraction dataset to 2008–2017, consistent with Hart & Perlis.

thumbnail
Table 2. Comparison of methods used to collect retraction data and publication benchmark data (publication data from Hart & Perlis [18]).

https://doi.org/10.1371/journal.pone.0335059.t002

Statistical analyses were performed using Stata version 15.1. The study followed STROBE (Strengthening the Reporting of Observational Studies in Epidemiology) guidelines.

Ethics statement

This bibliometric study relied exclusively on published records and did not involve patient or personal data. Therefore, ethical approval was not required under Swiss law. Author names were part of the publication metadata used for analysis, but these identifying data have been removed from the Supporting Information file prior to publication to ensure compliance with PLOS ONE’s data sharing policy.

Results

A total of 878 retracted publications were identified among 422,827 publications across 131 high-impact journals spanning nine medical disciplines, corresponding to a retraction rate of 2.08 per 1,000 publications. Among the 811 retracted publications with available data on reasons, 66.8% were attributed to misconduct.

Fig 1 shows the distribution of publication and retraction years for these 878 retracted publications. Articles were published between 1965 and 2024, with a median publication year of 2009 (interquartile range [IQR]: 16 years), and retracted between 1975 and 2024, with a median retraction year of 2017 (IQR: 10 years). Fig 2 illustrates the delay between publication and retraction, which ranged from 0 to 54 years (median: 4 years, IQR: 9). Other results not directly related to gender—including author count per article, article types, countries of affiliation, and retraction patterns by journal and discipline—are presented in a separate article submitted from the same project.

thumbnail
Fig 1. Publication and retraction years of 878 retracted publications from 131 high-impact journals across nine medical disciplines.

https://doi.org/10.1371/journal.pone.0335059.g001

thumbnail
Fig 2. Delay in years between publication and retraction for 878 retracted publications from 131 high-impact journals across nine medical disciplines.

https://doi.org/10.1371/journal.pone.0335059.g002

The gender distribution varied depending on whether all author entries or unique individuals were considered. Among all authors with gender prediction confidence ≥60%, 2,880 (76.9%) were identified as men and 863 (23.1%) as women. When restricted to unique individuals, men represented 69.1% (n = 1,732) and women 30.9% (n = 773).

Disparities were more pronounced when focusing on first and last authors. Among first authors, 83.5% of all entries were men and 16.5% women; for unique individuals, the breakdown was 70.5% and 29.5%. For last authors, men accounted for 87.3% of all entries and 79.1% of unique individuals.

Gender gaps widened when considering only publications retracted for misconduct. Among first authors, men made up 88.5% of all entries and women 11.5%. For last authors, men represented 90.4% and women 9.6%.

Gender differences were also apparent across categories of retraction frequency (Table 3). Men were significantly more likely than women to have multiple retractions across all authorship positions. For instance, among those with more than five retracted publications (n = 37), 91.9% were men and only 8.1% were women. Although the median number of retractions was identical by gender (1; IQR = 0), counts differed significantly between men and women. Figs 3 and 4 further illustrate these patterns. Fig 3 presents a box plot showing the number of retractions per author by gender, while Fig 4 displays a scatter plot ranking authors by number of retractions and gender.

thumbnail
Table 3. Gender distribution by author position and retraction count, at a gender prediction confidence threshold of 60%, based on 878 retracted publications from 131 high-impact medical journals (n = 2,864 unique authors, of whom 2,663 had full first names and 2,621 could be assigned a gender).

https://doi.org/10.1371/journal.pone.0335059.t003

thumbnail
Fig 3. Box plot of the number of retractions per author, by gender, at a gender prediction confidence threshold of 60%, based on 878 retracted publications from 131 high-impact medical journals (n = 2,505 unique authors: 1,732 men and 773 women).

https://doi.org/10.1371/journal.pone.0335059.g003

thumbnail
Fig 4. Scatter plot showing the number of retractions per author by gender, at a gender prediction confidence threshold of 60%, based on 878 retracted publications from 131 high-impact medical journals (n = 2,505 unique authors: 1,732 men and 773 women).

Authors are sorted on the x-axis by number of retractions, from highest to lowest.

https://doi.org/10.1371/journal.pone.0335059.g004

Tables 4 and 5 present the proportion of women among first and last authors by discipline for both the full retraction sample and the subset of retractions due to misconduct. For comparability with publication benchmarks reported by Hart & Perlis [18]—who estimated that, in 2008–2017, women accounted overall for 41.3–45.4% of first authors and 26.1–33.4% of last authors, and across specialties for 31.0–59.2% of first authors and 17.7–44.4% of last authors—we also report retraction data restricted to 2008–2017.

thumbnail
Table 4. Proportion of women among first authors by discipline, for all retracted publications and those retracted for misconduct, based on 878 retracted publications from 131 high-impact journals across nine medical disciplines (disciplines listed in alphabetical order). Data are shown for names with gender prediction confidence ≥60%.

https://doi.org/10.1371/journal.pone.0335059.t004

thumbnail
Table 5. Proportion of women among last authors by discipline, for all retracted publications and those retracted for misconduct, based on 878 retracted publications from 131 high-impact journals across nine medical disciplines (disciplines listed in alphabetical order). Data are shown for names with gender prediction confidence ≥60%.

https://doi.org/10.1371/journal.pone.0335059.t005

Overall, and in nearly all specialties, the proportion of women in these authorship positions was substantially lower than the corresponding publication benchmarks, with the gender disparity even more pronounced in retractions related to misconduct. For instance, among 366 retracted publications with identifiable first authors in anesthesiology, just 6.8% had female first authors (10.2% when limited to 2008–2017). Likewise, among 332 anesthesiology papers with identified last authors, only 4.2% were female, both overall and for 2008–2017. When restricting the analysis to retractions due to misconduct, the female underrepresentation was even stronger, with women accounting for only 5.7% of first authors and 3.7% of last authors. By contrast, Hart & Perlis reported that women accounted for 33.5–36.7% of first authors and 23.7–26.0% of last authors in anesthesiology publications.

Dermatology and pediatrics were the only two disciplines in which the proportion of women among retracted authors exceeded the publication benchmarks in both the whole retraction sample and the subset restricted to 2008–2017. In dermatology, women accounted for 61.5% of first authors in the whole sample and 80.0% in 2008–2017, both above the estimated 48.9–51.8% benchmark for female first authors. In pediatrics, women represented 46.7% of last authors in the whole sample and 50.0% in 2008–2017, compared with an expected range of 37.0–42.6%. A similar, though less consistent, pattern was observed in radiology, where women represented 40.0% of first authors of retracted articles in 2008-2017, exceeding the 31.0-36.8% benchmark, although their proportion over the entire study period was lower (20.8%). The findings for these disciplines are based on small numbers of retracted articles and should therefore be interpreted with caution.

Secondary analyses (gender assignment confidence ≥70%)

Results at the ≥ 70% confidence threshold closely mirrored those obtained at ≥60%. Overall, 2,738 authors (77.0%) were identified as men and 817 (23.0%) as women; among unique individuals, men accounted for 68.8% (n = 1,602) and women 31.2% (n = 727).

Gender gaps were greater among senior authorship positions: 83.9% of first authors and 87.7% of last authors were men, compared with 16.1% and 12.3% women, respectively; among unique individuals, men represented 70.4% of first authors and 79.4% of last authors. Disparities were even wider in misconduct-related retractions, with men comprising 88.6% of first authors and 90.9% of last authors. Men were also more likely to accumulate multiple retractions (S1 Table).

Discussion

Summary of the findings

In this cross-sectional study of 878 articles retracted from 131 high-impact medical journals, we examined gender disparities among their authors. Women were consistently underrepresented, particularly in first and last authorship positions, and this disparity was even more pronounced in retractions related to misconduct. Women were also significantly less likely to have multiple retractions. These gender disparities were observed across most disciplines, with the exception of dermatology and radiology among first authors, and pediatrics among last authors, where the proportion of female authors of retracted publications exceeded general authorship benchmarks—though these findings were based on small sample sizes.

Comparison with existing literature

Our findings are consistent with prior literature examining gender and retractions. In a preliminary study by our research team, we examined 438 retracted publications in medicine and found that women represented only 25% of first and 14% of last authors, with even lower proportions in misconduct-related cases [7]. Pinho-Gomes et al. conducted a broader analysis of over 35,000 retracted biomedical publications and similarly reported that women were significantly underrepresented—accounting for 27% of first and 24% of last authors overall, and only 19% and 14%, respectively, in fraud-related retractions [14]. Decullier & Maisonneuve analyzed a smaller sample of 113 retractions and found that misconduct (fraud or plagiarism) was significantly more frequent among male-authored publications (59%) than among those authored by women (29%) [13]. Notably, neither of the latter two studies included a comparison group reflecting the general gender distribution among all publications. More recently, Zheng et al. confirmed male overrepresentation in retractions across multiple disciplines, with patterns varying by field [15], while Maddi et al. highlighted the role of team composition, showing higher retraction risk in mixed-gender teams [16].

Together, these studies—including our own—suggest that women are consistently underrepresented among authors of retracted publications, particularly those retracted for misconduct. These gendered patterns may reflect broader inequalities in academic positions, authorship roles, and exposure to investigative or editorial scrutiny. However, further research is needed to disentangle potential drivers such as behavioral, cultural, or systemic factors, including gender bias in retraction practices themselves.

Several studies have examined the reasons for retraction and patterns of retraction across disciplines and countries. Fang et al. found that misconduct accounts for the majority of retractions (67%), a finding consistent with our study, where the same proportion of retractions were attributed to misconduct [2,38]. A recent analysis by our research group comparing the performance of three major bibliographic databases (RWD, PubMed, WoS Core collection) showed inconsistencies in retraction indexing and emphasized the importance of using multiple sources to obtain comprehensive retraction data [1]. We used this same dataset, enhanced with gender inference, to explore author-level characteristics.

Implications for practice and research

The underrepresentation of women among retracted authors, particularly for misconduct-related retractions, may reflect systemic gender imbalances in academia rather than differences in scientific integrity. Women continue to be underrepresented in senior academic positions [21,23], which may reduce both their visibility and their vulnerability to scrutiny or allegations of misconduct. Alternatively, it is also possible that the types of research or positions held by women expose them to fewer opportunities for retraction-inducing misconduct.

Our findings underscore the importance of context when interpreting retraction data. Retractions are not only about correcting the literature but also about understanding broader issues of research culture, responsibility, and inequality. Bibliometric analyses should consider demographic variables, including gender, to ensure that corrective mechanisms do not disproportionately affect certain groups.

Future research should explore how institutional policies, peer review practices, and editorial oversight may contribute to observed disparities. Qualitative studies could also help understand the social and professional dynamics that lead to retractions, including gendered experiences of scrutiny, pressure, or misconduct allegations.

Limitations

Our study has several limitations. First, gender was inferred algorithmically, which may not accurately reflect individuals’ self-identified gender. Although we used two confidence thresholds (≥60% and ≥70%), some misclassification is possible. Second, we excluded authors with abbreviated or ambiguous first names, which may introduce selection bias. Third, retraction reasons were classified based on metadata, and the accuracy of these classifications can vary across journals and time periods. Furthermore, our analysis focused only on high-impact journals, which may not reflect gender disparities in lower-impact or non-English-language journals. Fourth, while our comparisons with Hart & Perlis inherently account for discipline and publication year, we could not adjust for other potential confounders such as team size, open access status, or geographic region, which may influence publishing patterns and retraction dynamics. Fifth, although we applied standardization and manual checks to identify unique authors, minor errors in disambiguation cannot be excluded; however, this metric was a secondary outcome and is unlikely to affect our main findings. Finally, while our findings show associations between gender and retraction patterns, they do not establish causality.

Conclusion

This study demonstrates that women are underrepresented among authors of retracted publications in high-impact medical journals, particularly in misconduct-related cases and in key authorship positions. These disparities were observed consistently across most medical disciplines and align with broader patterns of gender imbalance in academic publishing.

Although this study did not aim to explore the reasons behind these differences, the findings underscore the importance of further research to understand the underlying factors. A better understanding of the social, institutional, and editorial dynamics surrounding retractions could help ensure that the scientific correction process is both rigorous and equitable.

Supporting information

S1 Appendix. Criteria for identifying misconduct-related retractions using Retraction Watch Database.

https://doi.org/10.1371/journal.pone.0335059.s001

(DOCX)

S1 Data. Anonymized dataset of 2,663 unique authors of retracted publications from 131 high-impact medical journals (n = 878 articles).

Includes gender assignment (ga_gender), confidence score from Gender API (ga_accuracy), and number of samples used for the gender inference (ga_samples). All personally identifiable information has been removed.

https://doi.org/10.1371/journal.pone.0335059.s002

(XLSX)

S1 Table. Gender distribution by author position and retraction count, at a gender prediction confidence threshold of 70%, based on 878 retracted publications from 131 high-impact medical journals (n = 2,864 unique authors, of whom 2,663 had full first names and 2,621 could be assigned a gender).

https://doi.org/10.1371/journal.pone.0335059.s003

(DOCX)

Acknowledgments

I thank Melissa Sebo for assistance with project administration and the Center for Scientific Integrity, which maintains the Retraction Watch database, for making their data publicly available.

References

  1. 1. Sebo P, Sebo M. Comparing the performance of Retraction Watch Database, PubMed, and Web of Science in identifying retracted publications in medicine. Account Res. 2025:1–25. pmid:40156496
  2. 2. Fang FC, Steen RG, Casadevall A. Misconduct accounts for the majority of retracted scientific publications. Proc Natl Acad Sci U S A. 2012;109(42):17028–33. pmid:23027971
  3. 3. Grieneisen ML, Zhang M. A comprehensive survey of retracted articles from the scholarly literature. PLoS One. 2012;7(10):e44118. pmid:23115617
  4. 4. Koo M, Lin S-C. Retracted articles in scientific literature: a bibliometric analysis from 2003 to 2022 using the Web of Science. Heliyon. 2024;10(20):e38620. pmid:39640756
  5. 5. Lievore C, Rubbo P, Dos Santos CB, Picinin CT, Pilatti LA. Research ethics: a profile of retractions from world class universities. Scientometrics. 2021;126(8):6871–89. pmid:34054160
  6. 6. Ribeiro MD, Vasconcelos SMR. Retractions covered by Retraction Watch in the 2013–2015 period: prevalence for the most productive countries. Scientometrics. 2018;114(2):719–34.
  7. 7. Sebo P, Schwarz J, Achtari M, Clair C. Women are underrepresented among authors of retracted publications: retrospective study of 134 medical journals. J Med Internet Res. 2023;25:e48529. pmid:37801343
  8. 8. Steen RG, Casadevall A, Fang FC. Why has the number of scientific retractions increased? PLoS One. 2013;8(7):e68397. pmid:23861902
  9. 9. Wager E, Williams P. Why and how do journals retract articles? An analysis of Medline retractions 1988-2008. J Med Ethics. 2011;37(9):567–70. pmid:21486985
  10. 10. Sebo P, Sebo M. Geographical disparities in research misconduct: analyzing retraction patterns by country. J Med Internet Res. 2025;27:e65775. pmid:39808480
  11. 11. Candal-Pedreira C, Ross JS, Ruano-Ravina A, Egilman DS, Fernández E, Pérez-Ríos M. Retracted papers originating from paper mills: cross sectional study. BMJ. 2022;379:e071517. pmid:36442874
  12. 12. Rivera H, Teixeira da Silva JA. Retractions, fake peer reviews, and paper mills. J Korean Med Sci. 2021;36(24):e165. pmid:34155837
  13. 13. Decullier E, Maisonneuve H. Retraction according to gender: a descriptive study. Account Res. 2023;30(6):356–61. pmid:34615410
  14. 14. Pinho-Gomes A-C, Hockham C, Woodward M. Women’s representation as authors of retracted papers in the biomedical sciences. PLoS One. 2023;18(5):e0284403. pmid:37134049
  15. 15. Zheng E-T, Fu H-Z, Thelwall M, Fang Z. Do male leading authors retract more articles than female leading authors? J Informetr. 2025;19(3):101682.
  16. 16. Maddi A, Monneau E, Guaspare-Cartron C, Gargiulo F, Dubois M. The retraction gender gap: are mixed teams more vulnerable? Quant Sci Stud. 2025;6:351–74.
  17. 17. Studies (CWTS) C for S and T. CWTS Leiden Ranking. In: CWTS Leiden Ranking [Internet]. Centre for Science and Technology Studies (CWTS). [cited 12 Aug 2025]. Available from: http://www.leidenranking.com
  18. 18. Hart KL, Perlis RH. Trends in proportion of women as authors of medical journal articles, 2008-2018. JAMA Intern Med. 2019;179(9):1285–7. pmid:31135815
  19. 19. Sebo P, Clair C. Gender gap in authorship: a study of 44,000 articles published in 100 high-impact general medical journals. Eur J Intern Med. 2022;97:103–5. pmid:34598855
  20. 20. Sebo P, Clair C. Gender inequalities in citations of articles published in high-impact General Medical Journals: a cross-sectional study. J Gen Intern Med. 2023;38(3):661–6. pmid:35794309
  21. 21. Richter KP, Clark L, Wick JA, Cruvinel E, Durham D, Shaw P, et al. Women Physicians and Promotion in Academic Medicine. N Engl J Med. 2020;383(22):2148–57. pmid:33252871
  22. 22. Lautenberger D, Dandar VM, Raezer CL. The State of Women in Academic Medicine: The Pipeline and Pathways to Leadership, 2013-2014. Washington, DC: Association of American Medical Colleges; 2014.
  23. 23. Sebo P, de Lucia S, Vernaz N. Gender gap in medical research: a bibliometric study in Swiss university hospitals. Scientometrics. 2020;126(1):741–55.
  24. 24. Sebo P, Maisonneuve H, Fournier JP. Gender gap in research: a bibliometric study of published articles in primary health care and general internal medicine. Fam Pract. 2020;37(3):325–31. pmid:31935279
  25. 25. Moss-Racusin CA, Dovidio JF, Brescoll VL, Graham MJ, Handelsman J. Science faculty’s subtle gender biases favor male students. Proc Natl Acad Sci U S A. 2012;109(41):16474–9. pmid:22988126
  26. 26. Knobloch-Westerwick S, Glynn CJ, Huge M. The matilda effect in science communication: an experiment on gender bias in publication quality perceptions and collaboration interest. Science Communication. 2013;35(5):603–25.
  27. 27. Shaik F, Fusulier B. Academic Careers and Gender Inequality: Leaky Pipeline and Interrelated Phenomena in Seven European Countries. 2015. Available from: https://dial.uclouvain.be/pr/boreal/object/boreal:168170
  28. 28. Boiko JR, Anderson AJM, Gordon RA. Representation of women among academic grand rounds speakers. JAMA Intern Med. 2017;177(5):722–4. pmid:28264094
  29. 29. Fournier LE, Hopping GC, Zhu L, Perez-Pinzon MA, Ovbiagele B, McCullough LD, et al. Females are less likely invited speakers to the international stroke conference: time’s up to address sex disparity. Stroke. 2020;51(2):674–8. pmid:31902331
  30. 30. Carley S, Carden R, Riley R, May N, Hruska K, Beardsell I, et al. Are there too few women presenting at emergency medicine conferences? Emerg Med J. 2016;33(10):681–3. pmid:27534977
  31. 31. Pinho-Gomes A-C, Vassallo A, Thompson K, Womersley K, Norton R, Woodward M. Representation of women among editors in chief of leading medical journals. JAMA Netw Open. 2021;4(9):e2123026. pmid:34495341
  32. 32. Filardo G, da Graca B, Sass DM, Pollock BD, Smith EB, Martinez MA-M. Trends and comparison of female first authorship in high impact medical journals: observational study (1994-2014). BMJ. 2016;352:i847. pmid:26935100
  33. 33. Schneider J, Lee J, Zheng H, Salami MO. Assessing the agreement in retraction indexing across 4 multidisciplinary sources: Crossref, Retraction Watch, Scopus, and Web of Science. International Conference on Science, Technology and Innovation Indicators. 2023.
  34. 34. Ortega JL, Delgado-Quirós L. The indexation of retracted literature in seven principal scholarly databases: a coverage comparison of dimensions, OpenAlex, PubMed, Scilit, Scopus, The Lens and Web of Science. Scientometrics. 2024;129(7):3769–85.
  35. 35. Schmidt M. An analysis of the validity of retraction annotation in pubmed and the web of science. J Assoc Inf Sci Technol. 2017;69(2):318–28.
  36. 36. Donner P. Document type assignment accuracy in the journal citation index data of Web of Science. Scientometrics. 2017;113(1):219–36.
  37. 37. Sebo P. Performance of gender detection tools: a comparative study of name-to-gender inference services. J Med Libr Assoc. 2021;109(3):414–21. pmid:34629970
  38. 38. Marcus A, Oransky I. What studies of retractions tell us. J Microbiol Biol Educ. 2014;15(2):151–4. pmid:25574267