Figures
Abstract
As the use and availability of at-home antigen tests for SARS-CoV-2 infection have increased, the number of individuals with SARS-CoV-2 infections that are reported to state COVID-19 surveillance systems have decreased. Self-reported infection dates are critical to accurately track incidence and outbreaks of COVID-19 and for continued research on illness progression; however, the reliability of self-reported infection dates is unknown to date. To assess accuracy of self-reported test dates, we utilized self-reported SARS-CoV-2 testing data from the Arizona CoVHORT Study (CoVHORT) and laboratory-confirmed testing data collected by the Arizona Department of Health Services (ADHS) and calculated the difference in days between dates to examine their percent agreement. We used logistic regression to assess if any participant characteristics were associated with self-reporting a test date >7 days different than the laboratory confirmed date. A total of 1,900 CoVHORT participants aged 18 years or older were included in our analyses. Most participants (82.5%) reported a test date within 7 days of the laboratory confirmed date of their illness. Increasing age and number of weeks between testing positive and self-reporting the test date were both significantly associated with a difference of 7 days or greater between dates. There was an 84% increase (OR=1.84, 95% CI = 1.11–3.06) in likelihood of inaccurately self-reporting their SARS-CoV-2 test date for participants aged 55 years and older and a 2% increase (OR=1.02, 95% CI = 1.02–1.03) for each elapsed week following their SARS-CoV-2 test. We observed an 82% percent agreement (dates within 7 days of each other) between self-reported and laboratory confirmed test dates, suggesting that self-reported SARS-CoV-2 test dates are sufficient for identifying and tracking Long COVID or Post-COVID Conditions when a laboratory-confirmed test date is not available. However, increasing age and greater time between test date and date of self-report were found to decrease the agreement between self-reported and laboratory confirmed test dates.
Citation: Catalfamo CJ, Jacobs ET, Falk LP, Lauro P, Ernst KC, Farland LV, et al. (2025) Concordance between self-reported SARS-CoV-2 positivity and laboratory-confirmed positivity. PLoS One 20(10): e0334102. https://doi.org/10.1371/journal.pone.0334102
Editor: Timothy J. Wade, Retired-United States Environmental Protection Agency, UNITED STATES OF AMERICA
Received: October 14, 2024; Accepted: September 23, 2025; Published: October 17, 2025
Copyright: © 2025 Catalfamo et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The dataset utilized for the current study is available via Open Science Framework (https://osf.io/ngrc5). Note: sharing of key variables, such as test dates, even after removal of other protected health information from the dataset would be in violation our institutional IRB approval and our cooperative agreement with the Arizona Department of Health Services. Requests for full access to data can be made to Dr. Patrick Wightman (wightman@arizona.edu), who serves as Honest Broker for all data utilized for this study (cc: kpogreba@arizona.edu).
Funding: This research was funded in part by the Arizona Biomedical Research Centre (https://www.azdhs.gov/biomedical/) under RFGA2022-010-14 as made available through the Arizona Department of Health Services (awarded to KPB). The content and findings are solely the responsibility of the authors and do not necessarily represent the official views of the Arizona Department of Health Services, Arizona Biomedical Research Commission.
Competing interests: The authors declare that they have no competing interests.
Introduction
The SARS-CoV-2 pandemic had immense global impacts on morbidity and mortality and has led to extensive societal changes since the first scientific report was published in February 2020 [1]. While early response and research efforts by necessity focused on the impact of acute illness, a collection of longer-term sequelae were identified early in the pandemic and have been further refined over time. These symptoms include cardiovascular abnormalities, gastrointestinal disorders, fatigue, shortness of breath, and “brain fog” or cognitive difficulties [1–14]. Further characterization of long-term symptoms was achieved when the Centers for Disease Control and Prevention defined new or ongoing post-infectious symptoms occurring 4 weeks or more following acute infection as post-COVID condition, or PCC [15,16]. These conditions have also been referred to as post-acute sequelae of COVID-19 (PASC) or, colloquially, as Long COVID [1–14].
To date, numerous studies have been conducted to assess the risk factors for and symptoms of PCC using various epidemiological methods ranging from retrospective medical record reviews to formal prospective studies [5,7–10,14], with many of these studies employing time since self-reported SARS-CoV-2 positivity. This research strategy arose for several reasons, including reducing the burden on study participants, protecting patient privacy, and the increased adoption of home-testing kits. The first of these kits were granted emergency use approval in the United States by the Food and Drug Administration in November 2020 [17]. Uptake of home testing became widespread throughout 2021 and 2022, with the provision of free tests by the United States government under the Biden Administration beginning in January 2022 [18]. Because many studies of acute COVID-19, as well as PCC, rely on accuracy of the positive test date or the date the illness began, it is of key importance to understand the accuracy of self-reported infection dates by comparing them to laboratory-confirmed dates.
Therefore, the primary objective of the present study was to compare self-reported data on testing positive for SARS-CoV-2 with laboratory-confirmed data obtained through COVID-19 surveillance and case investigations conducted by the Arizona Department of Health Services (ADHS). Our secondary objective was to assess whether the accuracy of self-reported positive tests varied by participant characteristics.
Methods
Study design and setting
This prospective cohort study analyzed data from participants enrolled in the Arizona CoVHORT Study (CoVHORT) [19] between May 28, 2020, and April 1, 2022. The study was conducted in Arizona using a population-based approach that recruited participants statewide through academic health department partnerships, phased mailing campaigns, and recruitment materials at vaccine distribution sites [20]. Data collection occurred through electronic surveys administered via Research Electronic Data Capture (REDCap) [21], with participants prospectively providing self-reported SARS-CoV-2 testing information at enrollment and follow-up assessments. Self-reported data were subsequently linked to laboratory-confirmed test results from the Arizona Department of Health Services (ADHS) surveillance system (MEDSIS). The analysis period covered SARS-CoV-2 positive test results occurring between January 1, 2020, and April 1, 2022.
Study participants
Participant information for the present study was collected from two primary sources: CoVHORT and the Arizona Department of Health Services (ADHS) surveillance dataset (MEDSIS), which captures all laboratory-reported positive SARS-CoV-2 test results in the state as part of routine public health surveillance. All labs and hospitals report positive PCR and antigen tests to ADHS which includes the date of testing. We included CoVHORT participants who were 18 years of age or older at enrollment, had self-reported a positive PCR or antigen test date for SARS-CoV-2 infection at baseline or during follow-up, and were enrolled in CoVHORT between May 28, 2020, and April 1, 2022. Laboratory-confirmed test data from MEDSIS included SARS-CoV-2 positive test results that occurred prior to April 2, 2022. For participants with multiple SARS-CoV-2 infections reported to the state surveillance system, only information pertaining to the first reported infection was included in the analysis. We excluded all participants who were missing a self-reported SARS-CoV-2 test or a matched laboratory-reported test.
Ethical approval
Participants completed all of their surveys electronically via Research Electronic Data Capture (REDCap) [21] and all participants provide written informed consent upon joining the study. The University of Arizona Institutional Review Board provided ethical approval for the completion of the study (Protocol #2003521636).
Available data
CoVHORT collected information about participant demographics at baseline and assessed SARS-CoV-2 infection status at baseline and during follow-up. Demographic information collected at baseline included age (18−24 years, 25−34 years, 35−44 years, 45−54 years, 55−64 years, or ≥65 years), gender (female or male), race (White, American Indian/Alaska Native, Asian, Black or African American, Native Hawaiian/Pacific Islander, Mixed race, or prefer not to answer), ethnicity (Non-Hispanic, Hispanic, or prefer not to answer), highest level of education completed (High School graduate or less, some college, or college graduate or greater), annual household income (<$25,000, $25,000 to $49,999, $50,000 to $74,999, or ≥$75,000), self-reported employment status (full-/part-time employed, unemployed, or retired), and student status (full-time student or not full-time student). At baseline and during follow-up, participants were asked if they have had a COVID-like illness and/or whether they had tested positive for a SARS-CoV-2 infection, including the type of test used (laboratory conducted polymerase chain reaction (PCR) or antigen tests, or at-home antigen tests) and the date of their test prior to joining the study.
Case definition
For this study, a “case” was defined as a CoVHORT participant who self-reported a positive SARS-CoV-2 test (PCR or antigen) and had a corresponding laboratory-confirmed positive test recorded in the ADHS surveillance system (MEDSIS). Cases were identified through a data linkage process that matched self-reported positive test dates from CoVHORT participants to their laboratory-reported positive test dates in MEDSIS. The most-likely match between self-reported and laboratory-reported test dates was determined by identifying the pair with the smallest difference in number of days among all potential matches for each participant.
Outcomes
The primary outcome of this study was to assess the accuracy of self-reported SARS-CoV-2 test dates by comparing participant self-reported positive test dates to laboratory-confirmed PCR or antigen test dates with independent records in the MEDSIS surveillance system. Accuracy was measured by calculating the concordance between self-reported and laboratory-reported test dates, expressed as percent agreement. The difference in days between participant self-reported dates and laboratory-reported dates was used to categorize participants into accuracy groups, with accurate self-reporting defined as test dates differing by seven days or fewer from the laboratory-reported date, and inaccurate self-reporting defined as test dates differing by more than 7 days from the laboratory-reported date.
Statistical analyses
We first examined concordance between positive test dates reported to MEDSIS and participant self-reported test dates using percent agreement. We calculated the difference, in number of days, between the most-likely self-reported test date match to participants’ laboratory-reported SARS-CoV-2 diagnostic test date. The difference in days between participant self-reported dates and laboratory reported dates were used to categorize participants, and the number and of participants in each group was used to calculate the percent agreement.
Second, we used logistic regression to evaluate factors associated with inaccurate self-reporting of test dates (defined as reporting a test date greater than 7 days different than the laboratory reported date). Independent variables included demographic characteristics (age, gender, race, ethnicity, highest level of education completed, annual household income, self-reported employment status, and student status), symptomatic status at time of diagnostic test for SARS-CoV-2 infection (asymptomatic or symptomatic), and the number of weeks between the date of the diagnostic test and when participants reported the positive test to CoVHORT. We performed univariate logistic regressions to examine the association between self-reporting a test date greater than 7 days different than the laboratory reported test date with each of the independent variables specified. In multivariable models, we included participant age at baseline, with adjustment for number of weeks between the date of the laboratory reported SARS-CoV-2 diagnostic test and the date participants reported their positive test to CoVHORT. Statistical significance was defined as a p-value of 0.05. All analyses were conducted using STATA 18 [22].
Results
A total of 1900 participants recruited into the CoVHORT study between May 28, 2020, through April 1, 2022, who had a positive laboratory test for SARS-COV-2 reported to MEDSIS between January 1, 2020, and April 1, 2022 (Table 1). The median time between participants’ SARS-CoV-2 laboratory test and the date they reported their test to CoVHORT was 5 weeks (IQR: 1.3 weeks – 16.1 weeks).
Overall, 82.5% (n = 1568) self-reported a positive test date within 7 days of the laboratory test date (Table 2). Participant self-reported positive test dates agreed exactly with the laboratory test dates for 28.3% (n = 538) of the cohort. Among participants who did not self-report a positive test date that agreed exactly with the laboratory reported date, most self-reported a positive test date that was after the laboratory test date (46.63% vs 12.53%).
Table 3 presents the likelihood of disagreement by participant characteristics. For every ten years of age, the odds of disagreement (>7 days difference) increased by 15% (OR 1.15). Individuals who self-reported their employment status as being retired were observed to have a 50% increase in the odds of disagreement (OR 1.50). Those aged 55–64 years (OR 2.12) and 65 + years (OR 1.96) had approximately two-fold odds of disagreement compared to 18–24-year-olds. These associations of age persisted after adjusting for number of weeks since testing positive to reporting illness to CoVHORT (Table 4).
Disagreement became more likely with increasing time intervals (number of weeks) between infection status and reporting illness to CoVHORT (OR 1.02)(Table 4); this association remained unchanged after adjusting for age. Testing positive four weeks prior to reporting the illness to CoVHORT was associated with a 9.4% increased likelihood of an inaccurate self-report of test dates, and an interval of three months was associated with a 30.8% increased likelihood.
Discussion
The results of this study show that the vast majority of participants in a population-based cohort study can accurately self-report their laboratory test date. Approximately 82% of participants reported a test date that was within 7 days of the laboratory-confirmed date collected by the state health department, 89% were within 14 days, and 92% were within 28 days. Those aged 55 years or above or reported being retired were significantly more likely to self-report dates that differed by 7 days or more from laboratory-confirmed testing than younger groups and those who are currently employed at work or as a student. Finally, a longer length of time between testing positive and completing the survey was significantly and inversely associated with the accuracy of test date reporting.
Participant responses were less accurate among participants aged 55 years and above as compared to younger respondents. This finding has been demonstrated in prior studies of the impact of age on survey responses [23]. The accuracy of self-reports requires several aspects of cognitive processes, including understanding survey items, recall ability, and information processing [23]. In addition, individuals with frequent healthcare utilization may face additional challenges in distinguishing between multiple similar medical encounters, potentially increasing variability in recall of specific test dates and medical events [24–26]. Despite the variation in report accuracy by age, a stronger relationship was observed for the length of time between the laboratory-confirmed test date and the date of self-report, which supports prior work. For example, it was demonstrated that monthly self-reports of healthcare utilization effects greater accuracy regarding healthcare utilization than do yearly surveys [24], indicating that reports which are more proximal to the events being queried are more reliable. This phenomenon was first described in detail by Ebbinghaus in 1885, has been supported by a wealth of studies related to memory capacity, and arises due to numerous cognitive processes [27]. Hence, regular administration of surveys regarding COVID-19 test dates, as with the majority of other self-reported data, yield benefits in regard to accuracy of reporting.
Overall, we observed moderately high accuracy between the participant self-reported diagnostic test dates and those that were reported to ADHS by the laboratories that processed participant samples, with 82.5% of participants reporting a date within 7 days of the laboratory reported date. Although, slightly less accurate, our results are similar to other studies examining the reliability and accuracy of self-reported COVID vaccination status and other conditions. Among patients diagnosed with a SARS-CoV-2 infection who visited Canadian emergency departments, 96% of patients correctly reported their vaccination status at telephone follow-up compared to their vaccination record as in the Quebec Vaccination Registry [28]. A separate study conducted among adolescent girls further supports the reliability of self-reported diagnoses, which found a high overall agreement between self-report, medical record, and state surveillance record for past chlamydia and gonorrhea diagnoses [29].
Much about the acute-illness and long-term sequelae remain unknown despite the numerous studies of COVID-19 that have been conducted since the first appearance of SARS-CoV-2. Accurate infection dates, symptom start dates and dates of diagnosis, are crucial to ongoing research of illness progression, including PCC. However, with the widespread availability and utilization of at-home antigen tests, an increasing proportion of COVID-19 cases are likely not reported to health department surveillance systems and therefore are not associated with any laboratory confirmed testing information. The reliability of self-reported infection dates observed in this study support their utilization in future research, including those associated with at-home tests that have not been previously reported to any health-department based surveillance system.
Strengths of this study include the ability to collaborate with the Arizona Department of Health Services to obtain accurate data for laboratory testing dates, a large study population, and well-characterized participant data that permitted the evaluation of characteristics that were related to accuracy of testing self-reports. Limitations include a largely white and highly educated study population, affecting the generalizability of these findings.
In conclusion, these findings demonstrate that self-reported COVID-19 test dates exhibit high concordance with laboratory-based test dates reported to the state health department in Arizona. These findings should provide confidence in using these data to assess the presence or absence of PCC among individuals who have tested positive for SARS-CoV-2.
References
- 1. Zhu N, Zhang D, Wang W, Li X, Yang B, Song J, et al. A Novel Coronavirus from Patients with Pneumonia in China, 2019. N Engl J Med. 2020;382(8):727–33. pmid:31978945
- 2. Bell ML, Catalfamo CJ, Farland LV, Ernst KC, Jacobs ET, Klimentidis YC, et al. Post-acute sequelae of COVID-19 in a non-hospitalized cohort: Results from the Arizona CoVHORT. PLoS One. 2021;16(8):e0254347. pmid:34347785
- 3. Fernández-de-Las-Peñas C, Palacios-Ceña D, Gómez-Mayordomo V, Florencio LL, Cuadrado ML, Plaza-Manzano G, et al. Prevalence of post-COVID-19 symptoms in hospitalized and non-hospitalized COVID-19 survivors: A systematic review and meta-analysis. Eur J Intern Med. 2021;92:55–70. pmid:34167876
- 4. Michelen M, Manoharan L, Elkheir N, Cheng V, Dagens A, Hastie C, et al. Characterising long COVID: a living systematic review. BMJ Glob Health. 2021;6(9):e005427. pmid:34580069
- 5. Asadi-Pooya AA, Akbari A, Emami A, Lotfi M, Rostamihosseinkhani M, Nemati H, et al. Long COVID syndrome-associated brain fog. J Med Virol. 2022;94(3):979–84. pmid:34672377
- 6. Hossain MA, Hossain KMA, Saunders K, Uddin Z, Walton LM, Raigangar V, et al. Prevalence of Long COVID symptoms in Bangladesh: a prospective Inception Cohort Study of COVID-19 survivors. BMJ Glob Health. 2021;6(12):e006838. pmid:34906986
- 7. Kayaaslan B, Eser F, Kalem AK, Kaya G, Kaplan B, Kacar D, et al. Post-COVID syndrome: A single-center questionnaire study on 1007 participants recovered from COVID-19. J Med Virol. 2021;93(12):6566–74. pmid:34255355
- 8. Fumagalli C, Zocchi C, Tassetti L, Silverii MV, Amato C, Livi L, et al. Factors associated with persistence of symptoms 1 year after COVID-19: A longitudinal, prospective phone-based interview follow-up cohort study. Eur J Intern Med. 2022;97:36–41. pmid:34903448
- 9. Menges D, Ballouz T, Anagnostopoulos A, Aschmann HE, Domenghino A, Fehr JS, et al. Burden of post-COVID-19 syndrome and implications for healthcare service planning: A population-based cohort study. PLoS One. 2021;16(7):e0254523. pmid:34252157
- 10. Knight DRT, Munipalli B, Logvinov II, Halkar MG, Mitri G, Dabrh AMA, et al. Perception, Prevalence, and Prediction of Severe Infection and Post-acute Sequelae of COVID-19. Am J Med Sci. 2022;363(4):295–304. pmid:35016849
- 11. Kozak R, Armstrong SM, Salvant E, Ritzker C, Feld J, Biondi MJ, et al. Recognition of Long-COVID-19 Patients in a Canadian Tertiary Hospital Setting: A Retrospective Analysis of Their Clinical and Laboratory Characteristics. Pathogens. 2021;10(10):1246. pmid:34684195
- 12. Sudre CH, Murray B, Varsavsky T, Graham MS, Penfold RS, Bowyer RC, et al. Attributes and predictors of long COVID. Nat Med. 2021;27(4):626–31. pmid:33692530
- 13. Kim Y, Kim S-W, Chang H-H, Kwon KT, Bae S, Hwang S. Significance and Associated Factors of Long-Term Sequelae in Patients after Acute COVID-19 Infection in Korea. Infect Chemother. 2021;53(3):463–76. pmid:34405592
- 14. Austhof E, Bell ML, Riddle MS, Catalfamo C, McFadden C, Cooper K, et al. Persisting gastrointestinal symptoms and post-infectious irritable bowel syndrome following SARS-CoV-2 infection: results from the Arizona CoVHORT. Epidemiol Infect. 2022;150:e136. pmid:35801302
- 15. Prevention CfDCa. Post-COVID Conditions: Overview for Healthcare Providers 2022 [10/11/2022]. Available from: https://www.cdc.gov/coronavirus/2019-ncov/hcp/clinical-care/post-covid-conditions.html
- 16. Ely EW, Brown LM, Fineberg HV. Long Covid Defined. N Engl J Med. 2024;391(18):1746–53. pmid:39083764
- 17. Administration FaD. Coronavirus (COVID-19) Update: FDA Authorizes First COVID-19 Test for Self-Testing at Home 2020 [cited 2023 May 17th]. Available from: https://www.fda.gov/news-events/press-announcements/coronavirus-covid-19-update-fda-authorizes-first-covid-19-test-self-testing-home
- 18. Government US. Fact Sheet: The Biden Administration to Begin Distributing At-Home, Rapid COVID- 19 Tests to Americans for Free. 2022. [cited 2023 May 17th]. Available from: https://www.whitehouse.gov/briefing-room/statements-releases/2022/01/14/fact-sheet-the-biden-administration-to-begin-distributing-at-home-rapid-covid-19-tests-to-americans-for-free/
- 19. Catalfamo CJ, Heslin KM, Shilen A, Khan SM, Hunsaker JR, Austhof E, et al. Design of the Arizona CoVHORT: A Population-Based COVID-19 Cohort. Front Public Health. 2021;9:620060. pmid:33643990
- 20. Pogreba Brown K, Austhof E, Rosa Hernández AM, McFadden C, Boyd K, Sharma J, et al. Training and Incorporating Students in SARS-CoV-2 Case Investigations and Contact Tracing. Public Health Rep. 2021;136(2):154–60. pmid:33301694
- 21. Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)--a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009;42(2):377–81. pmid:18929686
- 22.
StataCorp. Stata Statistical Software: Release 18. College Station, TX: StataCorp LLC; 2023.
- 23. Knäuper B, Carrière K, Chamandy M, Xu Z, Schwarz N, Rosen NO. How aging affects self-reports. Eur J Ageing. 2016;13(2):185–93. pmid:28804377
- 24. Roberts RO, Bergstralh EJ, Schmidt L, Jacobsen SJ. Comparison of self-reported and medical record health care utilization measures. J Clin Epidemiol. 1996;49(9):989–95. pmid:8780606
- 25. Short ME, Goetzel RZ, Pei X, Tabrizi MJ, Ozminkowski RJ, Gibson TB, et al. How accurate are self-reports? Analysis of self-reported health care utilization and absence when compared with administrative data. J Occup Environ Med. 2009;51(7):786–96. pmid:19528832
- 26. Bhandari A, Wagner T. Self-reported utilization of health care services: improving measurement and accuracy. Med Care Res Rev. 2006;63(2):217–35. pmid:16595412
- 27. Murre JMJ, Dros J. Replication and Analysis of Ebbinghaus’ Forgetting Curve. PLoS One. 2015;10(7):e0120644. pmid:26148023
- 28. Archambault PM, Rosychuk RJ, Audet M, Bola R, Vatanpour S, Brooks SC, et al. Accuracy of Self-Reported COVID-19 Vaccination Status Compared With a Public Health Vaccination Registry in Québec: Observational Diagnostic Study. JMIR Public Health Surveill. 2023;9:e44465. pmid:37327046
- 29. Niccolai LM, Kershaw TS, Lewis JB, Cicchetti DV, Ethier KA, Ickovics JR. Data collection for sexually transmitted disease diagnoses: a comparison of self-report, medical record reviews, and state health department reports. Ann Epidemiol. 2005;15(3):236–42. pmid:15723771