Figures
Abstract
With the increasing burden of professional burnout in physicians, attention is being paid to optimizing sleep health, starting in training. The multiple dimensions of physicians’ sleep are not well described due to obstacles to easily and reliably measuring sleep. This pilot study tested the feasibility of using commercial wearable devices and completing manual sleep logs to describe sleep patterns of medical students and residents. Prospective pilot study of 50 resident physicians and medical students during a single year of training. Participants completed a manual sleep log while concurrently wearing the Fitbit Inspire device for 14-consecutive days over three clinical rotations of varying work schedules: light, medium, and heavy clinical rotations. Study completion was achieved in 24/50 (48%) participants. Overall correlation coefficients between the sleep log and Fitbit were statistically low; however, the discrepancies were acceptable, i.e., Fitbit underestimated time in bed and total sleep time by 4.3 and 2.7 minutes, respectively. Sleep onset time and waketime were within 8 minutes, with good agreement. Treatment of sleep episodes during the day led to variance in the data. Average missingness of collected data did not vary between medical students or residents or by rotation type. When comparing the light to heavy rotations, hours slept went from 7.7 (±0.64) to 6.7 (±0.88), quality-of-life and sleep health decreased and stress, burnout, and medical errors increased. Burnout was significantly associated with worse sleep health, hours worked, and quality-of-life. Prospective data collection of sleep patterns using both sleep logs and commercial wearable devices is burdensome for physicians-in-training. Using commercial wearable devices may increase study success as long as attention is paid to daytime sleep. In future studies investigating the sleep of physicians, the timing of data collection should account for rotation type.
Citation: Hassinger AB, Kwon M, Wang J, Mishra A, Wilding GE (2024) Pilot study comparing sleep logs to a commercial wearable device in describing the sleep patterns of physicians-in-training. PLoS ONE 19(7): e0305881. https://doi.org/10.1371/journal.pone.0305881
Editor: Ibrahim Sadek, Helwan University Faculty of Engineering, EGYPT
Received: January 22, 2024; Accepted: June 5, 2024; Published: July 22, 2024
Copyright: © 2024 Hassinger et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data cannot be shared publicly due to ethical restrictions put in place by the study institution. The FitBit data were extracted from the personal devices using a cloud platform created by the institutional IT team. The study team does not have direct access to this cloud platform. The authors are willing to share the remainder of the de-identified dataset stored in their REDCap platform upon request for researchers who meet the criteria for access to confidential data. This would require assent from the study team or the institutional IRB with enactment of a data use agreement (contact via the Director of Research Compliance at UB: karalus@buffalo.edu).
Funding: This study was funded by an internal grant through the University at Buffalo Blue Sky Program https://buffalobluesky.buffalo.edu/ (ABH). MK holds a postdoctoral position funded by the National Institute of Heart, Lung and Blood Institute of the NIH (T32 HL07953). The funders did not play a role in any aspect of the study.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Professional burnout is an epidemic affecting a large proportion of healthcare providers, as early as the first year of medical school [1]. Data show 50% of medical students [2], 56% of interns [3], 40% of practicing surgeons [4], and 46% of all attending physicians across medicine sub-specialties [5] experience burnout which may result from emotional exhaustion and high workload leading to inadequate and irregular sleep. The consequences of burnout include committing medical errors, depression, suicide, substance abuse, and is suspected to be driving a physician shortage [2,6–11].
One of the modifiable risk factors consistently associated with burnout and physician well-being is sleep [12,13]. Although sleep duration or total number of hours slept is the most often described metric, evidence is emerging that sleep health has multiple dimensions that describe consistent sleep at the correct time without interruption. Work hour rules have forced healthcare workers into shorter shifts abruptly changing from nighttime to daytime, often against the natural circadian rhythm. These types of abrupt shift changes have been documented to cause depression, burnout, cognitive deficits, and motor vehicle accidents among emergency room physicians [14]. However, their impact on physicians-in-training is not thoroughly explored.
Gaining insight into the sleep health of residents and medical students is hindered by the difficulty of characterizing the granular sleep patterns in this actively overworked group. Research-grade actigraphy devices are the gold standard for measuring sleep duration, timing, efficiency, and regularity in outpatient settings; however, they are expensive and require specialized software and expertise to analyze and interpret the data. On the other hand, there are affordable and user-centric commercial wearable devices that offer a modest amount of sleep data that can be transmitted through cloud systems, eliminating the requirement for proprietary software. For this reason, there is a growing volume of studies employing such commercial devices, for example, in research involving surgery residents [15], and internal medicine residents [16]. Nevertheless, in these investigations, there is still a lack of comparisons with manual sleep logs to ascertain the variance between self-report and sleep data derived from commercial wearable devices.
This pilot study compared manual sleep logs to commercial wearable devices in medical students and residents to determine measurement discrepancy to inform feasibility and design of future large-scale studies. The primary aim of this study was to evaluate the correlation between sleep variables (i.e., time in bed [TIB], total sleep time [TST], sleep efficiency [SE%], sleep onset time, and waketime) reported on manual sleep log and that measured by the Fitbit Inspire.TM Our hypothesis was that there would be insignificant differences in TIB, TST, SE%, sleep onset time, and waketime collected via Fitbit when compared to the manual sleep log. Second, we sought to determine study design viability for physicians-in-training to complete the entire protocol.
Materials and methods
This prospective observational trial enrolled 25 medical students and 25 resident physicians over a single year of training from July 1, 2020 to June 30, 2021. At enrollment, participants were given a Fitbit InspireTM and directed to complete a baseline survey. Instructions were then provided on how to register the Fitbit device and synch it with a de-identified cloud platform which was developed for research purposes only. Study participation involved completing a manual sleep log (i.e., paper diary or on a personalized online link to a secure web-based data collection tool [REDCap], based on participant preference) for 14-consecutive days while wearing the Fitbit at three timepoints over the one year of training. For ethical reasons, only the dates entered in the manual sleep log were extracted from the Fitbit database.
Participants were instructed on selecting the rotations during which they should wear the device. The rotations were described as: (1) a light rotation with no nights or weekend shifts and primarily outpatient work, such as an elective or clinic-based rotation, (2) a medium rotation that involved clinical time in a hospital but no night shifts or weekends, (3) a heavy rotation that involved night and weekend shifts, such as in the Emergency Department or Intensive Care Unit.
In addition to entering their sleep patterns on sleep logs for 14-consecutive days on each rotation, participants were also asked to provide the following via REDCap survey: number of days off over the 4-week rotation, missed significant life experiences, commission of a medical error, health-related quality-of-life score, the abridged Maslach Burnout Scale [17], 2-items from the Patient Health Questionnaire related to depression [18], and a 10-item Perceived Stress Scale (PSS) [19]. PSS score above 12 indicated moderate to severe stress, as has been previously reported [19]. Frequency of habits known to impact sleep (i.e., screen time, alcohol, caffeine, and tobacco use before bed) were also elicited.
A composite sleep health score was calculated using 6 domains: regularity (i.e., “How often do you have a set sleep schedule?”), duration (i.e., “How often do you get 6 to 8 hours of sleep per day?”), efficiency (i.e., “How often do you spend less than 30 minutes awake total while trying to sleep?”), timing (i.e., “How often are you asleep between 2 and 4 am?”), quality (i.e., “How often do you wake up refreshed?”) and daytime sleepiness (i.e., “How often do you stay awake all day without napping or dozing off?”) [20]. A six-point Likert scale quantified frequency of each domain with higher number indicative of healthier sleep patterns. This sleep health score has been validated in large population studies and been shown to correlate well with other measures of sleep as well as daytime health [21–26].
At the time of study completion or exit, all participants were prompted to complete an Exit Questionnaire with 5 questions about the feasibility of completing all the required tasks for the study (i.e., reasons for not completing, length taken to complete, other recommendations, etc.). Monetary compensation was provided to participants who completed the baseline survey, sleep logs, surveys and wearing the Fitbit device at all three rotation time points, and the Exit Questionnaire.
The study was approved after full review by the Institutional Review Board of the University at Buffalo. All participants provided written informed consent.
Statistical analysis
Descriptive statistics were used to summarize the sample and inferential statistics to compare the sleep and rotational data between each rotation type. Regression was used to correlate burnout scores, stress, quality-of-life, hours worked, and hours slept. Bland-Altman plots and concordance correlation coefficients (CCC) for agreement were used to assess the agreement between sleep variables obtained from the sleep log and from the Fitbit device for each individual subject [27]. The strength of the association between the two methods of obtaining sleep variables was measured with Pearson correlation coefficients.
We additionally examined the relationship between the data input missingness of the sleep log and both rotation type and training level (i.e., medical student or resident). Descriptive statistics were computed to examine the distribution of the missingness of each rotation type. The comparison of missingness between different rotation types was performed by Kruskal-Wallis rank sum test. Due to the hierarchical nature of the data (i.e., observations nested within participants), the assumption of independence of observations was violated and thus, multilevel modelling was used. Linear mixed effects model was used to investigate different independent variables’ effect on missingness. Statistical analyses were performed using R software version 4.2.2 and SPSS, version 29.
Results
The description of the enrolled participants is displayed in Table 1. Of the 25 medical students recruited, 12 were 3rd years and 13 were 4th year medical students. Residents were interns (9, 36%), 2nd years (9, 36%) or 3rd years (7, 28%) in Internal Medicine (11, 44%), Pediatrics (11, 44%) or Medicine-Pediatric (3, 12%) programs. The majority were female (62%), white (61%), non-Hispanic (90%), and were in a relationship (70.8%). Compared to medical students, residents more often had their origins in different geographic regions or countries other than US, with Canada and various Asian countries being the most common.
Comparison between manual sleep log and Fitbit values
There was a total of 1,446 paired sleep log entries (n = 723) and Fitbit sleep episodes (n = 723) available for direct comparison of TIB, TST, SE% (i.e., TST/TIB x 100%), sleep onset time, and waketime. Table 2 shows the comparisons between the sleep variables obtained from the manual logs as compared to the Fitbit device. While the absolute mean differences for TST and TIB were 2.7 minutes and 4.3 minutes respectively, the correlations were low. Timing of sleep onset and waketime were in better agreement and had a discrepancy of 36 and 8 minutes respectively. The limits of agreement on the Bland Altman plots were wide and affected by outliers (Fig 1). The largest variance was seen in sleep episodes that occurred during the day as seen on the plots for sleep onset and waketimes (Fig 1D and 1E).
A-E. Bland-Altman plots comparing sleep data from FitBit to sleep logs. Plots compare (A) time in bed (TIB), (B) total sleep time (TST), (C) sleep efficiency (SE%), (D) sleep onset time, and (E) waketime as collected from sleep logs and the Fitbit Inspire device in 50 resident physicians and medical students. The data-points in red are from the light rotation, green from the medium and blue from the heavy clinical rotation. Mean difference ± 1.96 standard deviations (SD) are displayed for each Bland-Altman plot.
Study protocol compliance
Once all 50 participants received Fitbit, a total of 32 (64%) followed the instructions to synch the device to the cloud for de-identified data collection. Less than half (24, 48%) of participants completed the study in its entirety (i.e., entered all three rotations and the Exit Questionnaire) and only 15 (30%) wore the Fitbit device for the full 6 weeks. There was no difference in study completion in residents (52%) as compared to medical students (44%) or between rotation type.
The most common reason for not completing the study were: “My work schedule got in the way” (10, 67%), “It was too difficult to keep the sleep log” (9, 60%), “Forgot to wear my FitBit” (8, 53%), “I forgot to complete the surveys during the correct rotation” (5, 33%) and “Too many questions” (5, 33%). The estimated time to complete each data collection form was between 10–20 minutes for 61% of the participants and over 20 minutes for the remaining 39% of participants. Over half (53%) used the paper log to collect data daily and then enter it electronically at the end of the 2-week period (60%). Recurrent themes from the narrative feedback on the study design were that synching and wearing the Fitbit was too hard and the electronic format for data collection (i.e., REDCap survey system) took too much time. Others suggested creating a user-friendly smartphone application for data collection.
Missingness analysis
There were no statistical differences in missingness of data between rotation type or medical students when compared to residents (Fig 2). When conducting additional analyses including sex, age, training level, working in your home area, insomnia, mood disorder and body mass index (BMI), the only variable associated with missing data was BMI.
Missingness of data collection of sleep patterns in 25 medical students and 25 residents during three different types of rotations: Light, medium and heavy clinical burden (A) and between the two training levels (B). The differences did not meet statistical significance.
Differences between types of rotations and participants
Self-reported daily hours worked increased from the light to the medium and the heavy clinical rotations with a decline in total sleep time from 7.7 hours to 6.7 hours from the light to the heavy rotations (p<0.001). Sleep health scores, burnout, stress and quality of life scores worsened as the clinical burden increased. The proportions of participants who missed a life event, spent not enough time with family and who reported committing an error also increased with the heavier clinical rotations (Table 3).
There were significant inverse correlations between burnout and sleep health scores, total number of hours slept per day, and quality of life in each type of clinical rotation (Table 3). Average daily hours worked was associated with burnout in heavy clinical rotation (r = 0.449, p = 0.013), but not in the other rotation-types. When adjusting for all of these variables, only sleep health score remained independently associated with burnout scores on the light rotation (β = -0.459, p = 0.004) and on the medium rotation (β = -0.581, p = 0.008).
Discussion
This study aimed to assess the consistency of data provided by commercially available devices compared to manual sleep logs among physicians-in-training. Although the statistical correlations of the Fitbit TST, TIB, and SE% were found to be weak in our current study, the reported discrepancies for each individual measure may still be acceptable in practice. For instance, discrepancies of 2 minutes or a 2% difference in SE% may be tolerable. A review of literature reveals prior research that supports these findings. Studies employing Fitbit devices in general healthy adults indicates similar agreement between sleep duration reported in sleep logs and that derived from the Fitbit Inspire 2 [28] and Fitbit Flex [29] devices. In other studies of healthy young adults, Fitbit was able to estimate TIB, TST, SE%, sleep onset latency, and wake after sleep onset with minimal bias during the nocturnal sleep episodes [30] but mishandled daytime sleep episodes [31]. This aligns with our observation of low CCC values related to data variability from daytime sleep episodes.
In the studies investigating multiple dimensions of sleep health in physicians, few directly compare sleep log data to that from commercial wearable devices. Existing studies found in this area either rely on sleep logs alone, such as study of sleep banking in internal medicine residents by Cushman et al [32], or on Fitbit data alone, such as the study of 26 surgical residents wearing Fitbit devices on night float and home call [33]. To our knowledge, no study in physicians-in-training has directly compared sleep logs to commercial actigraphy device, which would allow for an evaluation of the external validity of our findings.
A consistent challenge in the literature regarding the sleep habits of physicians or physicians-in-training is the difficulty in obtaining comprehensive data. This challenge was evident in our study’s completion rate. Similarly, in the aforementioned Cushman study, only 25 of 50 eligible residents (50%) were able to complete a manual sleep log over the intended 11 days [32]. This issue is not unique in physicians; it is also seen in studies involving non-physicians. For instance, a Mayo clinic study of 84 hypersomnia patients evaluated by actigraphy reported that only 25 (30%) had complete capture of sleep log data alongside actigraphy use [34].
To inform future studies and contribute to the literature, it is important to note that the agreement between sleep logs and commercial actigraphy measurements met the recommended thresholds set by the American Academy of Sleep Medicine (AASM) for actigraphy to evaluate sleep disorders. For the AASM, the minimal allowable mean difference between clinical-grade actigraphy and sleep logs are within 2.5–5.0% for SE%, and 20 minutes of TST, sleep onset time and waketime [35]. While the difference in sleep onset time and the thresholds of the 95% CI of our data fell outside of the acceptable range per the AASM [35], this could be attributed to our small sample size and combining night time with daytime sleep episodes.
Our findings contribute to the growing body of evidence suggesting that studies on the sleep of physicians should minimize the data capture burden to reduce study attrition rates. The results that clinical rotation work load and training level did not correlate with ability to complete the study protocol provide guidance for future study design. Additionally, the finding that the only demographic variable associated with poorer study participation was BMI–a known risk factor for study attrition previously reported in the literature [36]–warrants further investigation in future research. Future studies should also carefully consider the duration of data collection. If the primary outcome is nightly TST, a sampling duration of 3 to 5 nights offers sufficient accuracy. However, for a comprehensive examination of monthly variations in habitual sleep, a more extensive period of 9 to 14 nights is recommended [37]. Based on our findings, a data collection of 9–10 days of wearing a commercial device to capture nighttime sleep measured in conjunction with a log of daytime sleep and shift-work times may be optimal for data collection and reliability in describing sleep patterns of physicians.
Given the exploratory nature of this pilot study, our initial intent did not include having a sample size adequate for detecting significant differences in sleep health and burnout across various rotations. Despite low compliance rates, we were still able to demonstrate that residents and medical students experience significant loss of sleep health and quality of life with an increase in stress and burnout as clinical duties increase. Our findings suggest that clinical rotation type is an important variable to include in future studies involving sleep in this population. This is consistent with a prospective observational trial of 26 surgical residents who wore Fitbit devices during home call, night float and in-house call. Those authors also found that changes in clinical burden detrimentally impacted physical activity and sleep hours [33]. Our results provide evidence that more work is needed in understanding how residents and medical students sleep in all clinical settings so we can ensure sustainability of this crucial workforce.
Several limitations must be acknowledged in this study. Firstly, poor protocol compliance rates hindered the attainment of robust sleep data. Secondly, the study was conducted at a single site, which limits the generalizability of the findings. Thirdly, the small sample size may have contributed to the substantial variability observed in the sleep data. Thus, further testing with a larger sample size across multiple sites is warranted to enhance the representativeness of the findings. Fourthly, self-reported information on sleep parameters may be susceptible to recall bias. However, the use of a sleep diary over 14 consecutive days likely mitigated this bias compared to single questions or recall-based scales. Fifthly, while Fitbit models have demonstrated reliability in determining sleep duration, pattern, and quality over consecutive nights under normal living conditions [38], caution is needed in interpreting results. Our study was conducted under unrestricted conditions in home settings, where various physical or environmental factors may influence uninterrupted sleep, especially in day and night shift working conditions. Additionally, our observations regarding daytime sleep suggest avenues for future research. Including larger and more diverse occupations of participants, particularly healthcare workers on regular night shifts, and tracking sleep over periods longer than two weeks could offer valuable insights but may be hampered by data capture concerns.
Conclusion
Commercial wearable devices such as Fitbit provided acceptable estimates of total sleep time, efficiency, sleep onset time, and waketime in physicians-in-training. Wearing the device and completing a sleep log for 14-consecutive days simultaneously is not a pragmatic study design in this population. Daytime sleep episodes complicate analyses of physician sleep health. Future studies with streamlined sleep data collection are urgently needed as our evidence shows physicians-in-training have poor sleep health leading to burnout, stress, and a lower quality-of-life.
References
- 1. West CP, Dyrbye LN, Erwin PJ, Shanafelt TD. Interventions to prevent and reduce physician burnout: a systematic review and meta-analysis. Lancet. 2016;388:2272–2281. pmid:27692469
- 2. Dyrbye LN, Thomas MR, Massie FS, Power DV, Eacker A, Harper W, et al. Burnout and suicidal ideation among US medical students. Ann Intern Med. 2008;149:334–41.
- 3. Rosen IM, Gimotty PA, Shea JA, Bellini LM. Evolution of sleep quantity, sleep deprivation, mood disturbances, empathy and burnout among interns. Acad Med. 2006;81(1):82–85. pmid:16377826
- 4. Shanafelt TD, Balch CM, Bechamps GJ, Russell T, Dyrbye L, Satele D, et al. Burnout and career satisfaction among American surgeons. Ann Surg. 2009 Sep;250(3):463–71. pmid:19730177
- 5. Shanafelt TD, Boone S, Tan L, Dyrbye LN, Sotile W, Satele D, et al. Burnout and satisfaction with work-life balance among US physicians relative to the general US population. Arch Intern Med. 2012 Oct 8;172(18):1377–85. pmid:22911330
- 6. Shanafelt TD, Bradley KA, Wipf JE, Back AL. Burnout and self-reported patient care in an internal medicine residency program. Ann Intern Med. 2002 Mar 5;136(5):358–67. pmid:11874308
- 7. Firth-Cozens J. Individual and organizational predictors of depression in general practitioners. Br J Gen Pract. 1998 Oct;48(435):1647–51. pmid:10071396
- 8. Shanafelt TD, Balch CM, Dyrbye L, Bechamps G, Russell T, Satele D, et al. Special report: suicidal ideation among American surgeons. Arch Surg. 2011 Jan;146(1):54–62. pmid:21242446
- 9. Wallace JE, Lemaire JB, Ghali WA. Physician wellness: a missing quality indicator. Lancet. 2009;374:1714–21. pmid:19914516
- 10. Cooper RA, Getzen TE, McKee HJ, Laud P. Economic and demographic trends signal an impending physician shortage. Health Aff (Millwood). 2002;21:140–154. pmid:11900066
- 11. Shanafelt TD, Mungo M, Schmitgen J, Storz KA, Reeves D, Hayes SN, et al. Longitudinal study evaluating the association between physician burnout and changes in professional work effort. Mayo Clin Proc. 2016 Apr;91(4):422–31. pmid:27046522
- 12. Stewart NH, Arora VW. The impact of sleep and circadian disorders on physician burnout. Chest. 2019;156(5):1022–1030. pmid:31352036
- 13. Parthasarathy S Sleepand the medical profession. Curr Opin Pulm Med. 2005;11:507–512. pmid:16217176
- 14. Nelson D. Prevention and treatment of sleep deprivation among emergency physicians. Ped Emerg Care. 2007;23(7):498–503. pmid:17666938
- 15. Kelly-Schuette K, Shaker T, Carroll J, Davis AT, Wright GP, Chung M. A prospective observational study comparing effects of call schedules on surgical resident sleep and physical activity using the fitbit. J Grad Med Edu. 2021;13(1):113–118. pmid:33680310
- 16. El Baba S, Antoun J, Mladenovic A, Tamim H, Naous J. Impact of sleep on medical residents’ emotions: a cross-sectional study using a wearable device. Fam Med. 2021;53(4):275–281. pmid:33887049
- 17. West CP, Dyrbye LN, Satele DV, Sloan JA, Shanafelt TD. Concurrent validity of single-item measures of emotional exhaustion and depersonalization in burnout assessment. J Gen Intern Med. 2012;27(11):1445–52. pmid:22362127
- 18. Kroenke K, Spitzer RL, Williams JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. 2001;16(9)606–613. pmid:11556941
- 19. Cohen S, Kamarck T, Mermelstein R. A global measure of perceived stress. J Health Soc Behav. 1983;24(4):385–96. pmid:6668417
- 20. Buysse DJ. Sleep Health: can we define it? Does it matter? Sleep. 2014;37(1):9–17. pmid:24470692
- 21. Brindle RC, Yu L, Buysse DJ, Hall MH. Empirical derivation of cutoff values for the sleep health metric and its relationship to cardiometabolic morbidity: results from the Midlife in the United States (MIDUS) study. Sleep. 2019;42(9):zsz116. pmid:31083710
- 22. Dalmases M, Benitez ID, Mas A, Garcia-Codina O, Medina-Bustos A, Escarrabill J, et al. Assessing sleep health in a European population: results of the Catalan Health Survey 2015. PLoS ONE. 2018;13(4): e0194495. pmid:29668685
- 23. Bruxton OM, Lee S, Marino M, Beverly C, Almeida DM, Berkman L. Sleep health and predicted cardiometabolic risk scores in employed adults from two industries. J Clin Sleep Med. 2018;14(3):371–383. pmid:29458700
- 24. Becker NB, Santos Martins RI, Jesus SN, Chiodelli R, Rieber MS. Sleep health assessment: a scale validation. Psychiatry Res. 2018;259:51–55. pmid:29028524
- 25. Furihata R, Hall MH, Stine KL, Ancoli-Israel S, Smagula SF, Cauley JA, et al. An aggregate measure of sleep health is associated with prevalent and incident clinically significant depression symptoms among community-dwelling older women. Sleep. 2017;40(3):zsw075. pmid:28364417
- 26. Hassinger AB, Velez C, Wang J, Mador MJ, Wilding GE, Mishra A. Association between sleep health and rates of self-reported medical errors in Intern Physicians: an ancillary analysis of the Intern Health Study. Journal of Clinical Sleep Medicine. 2024;20(2):221–227. pmid:37767811
- 27. Lin LIK. A concordance correlation coefficient to evaluate reproducibility. Biometrics. 1989;45(1), 255–268. pmid:2720055
- 28. Jang H, Lee S, Son Y, Seo S, Baek Y, Mun S, et al. Exploring variations in sleep perception: Comparative study of chatbot sleep logs and Fitbit Sleep Data. JMIR mHealth and uHealth. 2023;11:e49144. pmid:37988148
- 29. Thota D. Evaluating the relationship between Fitbit sleep data and self-reported mood, sleep, and environmental context factors in healthy adults: pilot observational cohort study. JMIR Form Res. 2020;4(9):e18086.
- 30. Chinoy ED, Cuellar JA, Jameson JT, Markwald RR. Performance of four commercial wearable sleep-tracking devices tested under unrestricted conditions at home in healthy young adults. Nat Sci Sleep. 2022;22(14):493–516. pmid:35345630
- 31. Chinoy ED, Cuellar JA, Jameson JT, Markwald RR. Daytime sleep-tracking performance of four commercial wearable devices during unrestricted home sleep. Nat Sci Sleep. 2023; 1(15): 151–164. pmid:37032817
- 32. Cushman P, Schueller HS, Cushman J, Markert RJ. Improving performance on night shift: a study of resident sleep strategies. J Clin Sleep Med. 2023;19(5):935–940.
- 33. Kelly-Scuette K, Shaker T, Carroll J, Davis AT, Wright GP, Chung M. A prospective observational study comparing effects of call schedules on surgical resident sleep and physical activity using the Fitbit. J Grad Med Edu. 2021;113–118.
- 34. Auger RR, Varghese R, Silber M, Slocumb NL. Total sleep time obtained from actigraphy versus sleep logs in an academic sleep center and impact on further sleep testing. Nat Sci Sleep. 2013;5:125–131. pmid:24124399
- 35. Smith MT, McCrae CS, Cheung J, Martin JL, Harrod CG, Heald JL, et al. Use of actigraphy for the evaluation of sleep disorders and circadian rhythm sleep-wake disorders: an American Academy of Sleep Medicine systematic review, meta-analysis and GRADE assessment. J Clin Sleep Med. 2018;14(7):1209–1230. pmid:29991438
- 36. Cornish RP, Macleod J, Boyd A, Tilling K. Factors associated with participation over time in the Avon Longitudinal Study of Parents and Children: a study using linked education and primary care data. Int J Epidem. 2021:293–302. pmid:33057662
- 37. Lau TY, Ong JL, Ng BKL, Chan LF, Koek D, Tan CS, et al. Minimum number of nights for reliable estimation of habitual sleep using a commercial sleep tracker. Sleep Adv. 2022:1–10.
- 38. Haghayegh S, Khoshnevis S, Smolensky MH, Diller KR, Castriotta RJ. Accuracy of wristband Fitbit models in assessing sleep: systematic review and meta-analysis. JMIR. 2019;21(11):e16273. pmid:31778122