Citation: Lumbard H, Routledge D (2025) Open science and transparency are our strongest tools in the fight against fraudulent publishing activities. PLoS Med 22(9): e1004774. https://doi.org/10.1371/journal.pmed.1004774
Published: September 26, 2025
Copyright: © 2025 Lumbard, Routledge. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The author(s) received no specific funding for this work.
Competing interests: I have read the journal’s policy, and the authors of this manuscript have the following competing interests: HL and DR are current paid employees of the Public Library of Science.
Scientific progress depends on trust that research findings are robust, reproducible, and honestly reported. In recent years, this trust has been increasingly threatened by the rise of systematic research fraud, including paper mills, review mills, and other coordinated attempts to manipulate the scholarly record. In this context, data transparency must be reframed not only as a scientific virtue, but as a frontline defense against research misconduct.
Data sharing has long been advocated to allow replication, accelerate discovery, and foster collaboration [1]. Yet, as the academic publishing ecosystem confronts new and evolving threats, it is becoming clear that data transparency also plays a critical role in detecting fraudulent or fabricated research [2]. Shared data can expose inconsistencies, reveal implausible patterns, and empower both journals and third-party watchdogs to conduct deeper investigations.
Paper mills—services that produce and sell fraudulent manuscripts—often operate at scale and target journals across disciplines, including those in clinical and global health fields. These papers may present fabricated data, recycled content, and/or attempt to manipulate peer review processes [3]. In recent years, publishers have seen growing numbers of paper mill submissions; it is estimated that in 2022, 1.5%–2% of scientific papers came from paper mills, with this number increasing to 3% in medicine and biology [4]. Meanwhile, review mills compromise editorial integrity by infiltrating peer review systems, often with fake reviewer identities, inauthentic favorable reviews, and/or coordinated citation practices [5]. The consequences of these actions are not superficial; they distort the scientific record and could ultimately misinform clinical practice, public health policy, and funding priorities.
In clinical research, false data can lead to incorrect treatment recommendations, patient harm, and loss of trust in evidence-based medicine. In global health, where research often informs urgent responses to health crises, guides allocation of limited resources, and influences national and international health policies, the risks are magnified. Fraudulent studies in these areas not only undermine science but can also cause direct damage to vulnerable populations and health systems already under strain [6,7].
PLOS Medicine, like many journals and publishers, has taken significant steps to detect and address these threats, including enhanced pre- and post-publication checks. These efforts are strengthened through collaboration with initiatives such as United2Act, which coordinates action against paper mill activities, alongside growing research into this malpractice, and increased awareness among editors, reviewers, and readers [8–10]. Together with the development of more sophisticated detection tools, this heightened vigilance has already led to the identification and retraction of thousands of fraudulent articles across multiple publishers [11,12]. Yet, as in medicine, prevention remains the best treatment: while retrospective action is laudable, earlier detection and deterrence must be our focus. However, editorial detection can only go so far without institutional support, coordinated action, and, critically, transparent and accessible data.
While legitimate barriers to data sharing exist—particularly concerning patient privacy, national regulations, and ethical safeguards—blanket refusals to share data, or vague statements that data are “available on request,” do not meet the threshold for good practice. Increasingly, such statements raise red flags, especially in studies with statistical anomalies, improbable sample sizes, or poorly described methods. At PLOS Medicine, we have mandated since 2014 that authors make all data needed to replicate their study’s findings publicly available at the time of publication, without restriction.
This ensures transparency, reproducibility, and the widest possible benefit from the research. If there are legitimate legal or ethical reasons why a dataset cannot be shared openly, authors must explain these restrictions clearly and indicate how others can request access to the data in a responsible way.
In addition to promoting data availability and transparent reporting, we (the scientific community) must act in unity to protect the scholarly record and the real-world systems it informs.
We call on:
- Authors to prioritize data availability and to plan for open data from the outset of research.
- Funders and institutions to invest in ethical, secure infrastructure for data sharing—especially in under-resourced settings—and require a robust data management plan to ensure transparency, reproducibility, and responsible stewardship of research data.
- Journals and publishers to adopt robust data policies, support editors in investigations, and share intelligence about suspected fraud.
- Peer reviewers to scrutinize data availability statements and question inconsistencies.
- Readers and the wider research community to remain vigilant and engaged in post-publication review, open commentary, and data reanalysis.
We also urge collaborative action across the publishing industry, clinicians, medical researchers, and global health stakeholders to develop shared standards, tools, and coordinated responses to detect and deter paper and review mills. No single journal, institution, or sector can tackle this challenge alone.
Transparency is not a panacea, but it is a powerful tool. By making data more open and research processes more visible, we strengthen the foundations of trustworthy, evidence-based science. In doing so, we safeguard the integrity of clinical and global health research and uphold our shared responsibility to the people and communities that research ultimately aims to serve.
References
- 1. Bloom T, Ganley E, Winker M. Data access for the open access literature: PLOS’s Data Policy. PLoS Biol. 2014;12(2):e1001797.
- 2. Wittau J, Seifert R. How to fight fake papers: a review on important information sources and steps towards solution of the problem. Naunyn Schmiedebergs Arch Pharmacol. 2024;397(12):9281–94. pmid:38970685
- 3. Parker L, Boughton S, Bero L, Byrne JA. Paper mill challenges: past, present, and future. J Clin Epidemiol. 2024;176:111549. pmid:39389277
- 4. Van Noorden R. How big is science’s fake-paper problem? Nature. 2023;623(7987):466–7. pmid:37949983
- 5. Oviedo-García MÁ. The review mills, not just (self-)plagiarism in review reports, but a step further. Scientometrics. 2024;129:5805–13.
- 6. Steen RG. Retractions in the medical literature: how many patients are put at risk by flawed research? J Med Ethics. 2011;37(11):688–92. pmid:21586404
- 7. Mackey TK, Liang BA. Combating healthcare corruption and fraud with improved global health governance. BMC Int Health Hum Rights. 2012;12:23. pmid:23088820
- 8. Byrne JA, Abalkina A, Akinduro-Aje O, Christopher J, Eaton SE, Joshi N, et al. A call for research to address the threat of paper mills. PLoS Biol. 2024;22(11):e3002931. pmid:39576835
- 9. Day A. Exploratory analysis of text duplication in peer-review reveals peer-review fraud and paper mills. Scientometrics. 2022;127(10):5965–87.
- 10. Candal-Pedreira C, Guerra-Tort C, Ruano-Ravina A, Freijedo-Farinas F, Rey-Brandariz J, Ross JS, et al. Retracted papers originating from paper mills: a cross-sectional analysis of references and citations. J Clin Epidemiol. 2024;172:111397. pmid:38815634
- 11. Richardson RAK, Hong SS, Byrne JA, Stoeger T, Amaral LAN. The entities enabling scientific fraud at scale are large, resilient, and growing rapidly. Proc Natl Acad Sci U S A. 2025;122(32):e2420092122. pmid:40758886
- 12. Candal-Pedreira C, Ross JS, Ruano-Ravina A, Egilman DS, Fernández E, Pérez-Ríos M. Retracted papers originating from paper mills: cross sectional study. BMJ. 2022;379:e071517. pmid:36442874