Delirium diagnosis without a gold standard: Evaluating diagnostic accuracy of combined delirium assessment tools

Stephana J. Moss; Chel Hee Lee; Christopher J. Doig; Liam Whalen-Browne; Henry T. Stelfox; Kirsten M. Fiest

doi:10.1371/journal.pone.0267110

Abstract

Background

Fluctuating course of delirium and complexities of ICU care mean delirium symptoms are hard to identify or commonly confused with other disorders. Delirium is difficult to diagnose, and clinicians and researchers may combine assessments from multiple tools. We evaluated diagnostic accuracy of different combinations of delirium assessments performed in each enrolled patient.

Methods

Data were obtained from a previously conducted cross-sectional study. Eligible adult patients who remained admitted to ICU for >24 hours with at least one family member present were consecutively enrolled as patient-family dyads. Clinical delirium assessments (Intensive Care Delirium Screening Checklist [ICDSC] and Confusion Assessment Method-ICU [CAM-ICU]) were completed twice daily by the bedside nurse or a trained research assistant, respectively. Family delirium assessments (Family Confusion Assessment Method and Sour Seven) were completed once daily by family members. We pooled all delirium assessment tools in a single two-class latent class model and also conducted pairwise (i.e., combined clinical or combined family assessments) Bayesian analyses.

Results

Seventy-three patient-family dyads were included. Among clinical delirium assessments, the ICDSC had lower sensitivity (0.72; 95% Bayesian Credible [BC] interval 0.54–0.92) and higher specificity (0.90; 95%BC, 0.82–0.97) using Bayesian analyses compared to pooled latent class analysis and CAM-ICU had higher sensitivity (0.90; 95%BC, 0.70–1.00) and higher specificity (0.94; 95%BC, 0.80–1.00). Among family delirium assessments, the Family Confusion Assessment Method had higher sensitivity (0.83; 95%BC, 0.71–0.92) and higher specificity (0.93; 95%BC, 0.84–0.98) using Bayesian analyses compared to pooled latent class analysis and the Sour Seven had higher specificity (0.85; 95%BC, 0.67–0.99) but lower sensitivity (0.64; 95%BC 0.47–0.82).

Conclusions

Results from delirium assessment tools are often combined owing to imperfect reference standards for delirium measurement. Pairwise Bayesian analyses that explicitly accounted for each tool’s (performed within same patient) prior sensitivity and specificity indicate that two combined clinical or two combined family delirium assessment tools have fair diagnostic accuracy.

Citation: Moss SJ, Hee Lee C, Doig CJ, Whalen-Browne L, Stelfox HT, Fiest KM (2022) Delirium diagnosis without a gold standard: Evaluating diagnostic accuracy of combined delirium assessment tools. PLoS ONE 17(4): e0267110. https://doi.org/10.1371/journal.pone.0267110

Editor: Enrico Mossello, University of Florence, ITALY

Received: April 9, 2021; Accepted: April 3, 2022; Published: April 18, 2022

Copyright: © 2022 Moss et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Data cannot be shared publicly because of patient confidentiality. Data may be available upon reasonable request from the University of Calgary research ethics board and Alberta Health Services research and innovation administration (contact via chreb@ucalgary.ca and research.administration@ahs.ca) for researchers who meet the criteria for access to confidential data.

Funding: Supported, in part, by grants from the Medical Services Incorporated Foundation, Canadian Institutes of Health Research, and Alberta Health Services (to Dr. Fiest). This research is funded by Canadian Frailty Network (Technology Evaluation in the Elderly Network), which is supported by the Government of Canada through the Networks of Centres of Excellence program (to Dr. Fiest). Dr. Moss was supported by a CIHR Doctoral Research Award. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Background

One third of patients admitted to the intensive care unit (ICU) will develop delirium [1, 2] that is characterized by acute onset and a fluctuating course of symptoms that are hard to identify or commonly confused with other disorders [3]. Delirium is often underdiagnosed and unmanaged [4] despite several delirium assessment tools available for use by both the patient care team and visiting family members [5]. Family members know a patient best and engaging them in delirium detection may help to make earlier note of unrecognized patient changes [6].

A systematic review and psychometric analysis on five clinical delirium assessment tools (i.e., administered by a trained clinical/research observer) from 36 patient cohorts reported the Intensive Care Delirium Screening Checklist (ICDSC) and Confusion Assessment Method-ICU (CAM-ICU) had the highest validity and reliability among critically ill adults [7]. Family delirium assessments (i.e., administered by a family member or friend) have fair, but lower, diagnostic accuracy in critically ill adults in comparison, and in addition to, clinical delirium assessments [8].

The diagnostic accuracy of a delirium assessment tool is determined by comparing the results from a delirium assessment to a gold standard test [with 100% accuracy] [9]. There is no gold standard delirium diagnostic tool and available reference standards are imperfect owing to the fluctuating course of delirium and complexities associated with ICU care [10]. This often leads researchers and patient care teams to combine results from several delirium assessments to increase accuracy of diagnosis [11]. Most literature on this topic reports sensitivity and specificity estimates from combinations of delirium assessments performed in different ICU patients [7].
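As a minimal sketch of the comparison described above, the following Python function tabulates an index test against a reference and returns sensitivity and specificity. The function name and the toy data are hypothetical and are not taken from the study. The calculation treats the reference as error-free, which is the assumption that does not hold for delirium.

```python
def diagnostic_accuracy(index_results, reference_results):
    """Return (sensitivity, specificity) of an index test against a reference.

    Both arguments are equal-length sequences of booleans (True = delirium
    present). The reference is treated as perfectly accurate.
    """
    if len(index_results) != len(reference_results):
        raise ValueError("sequences must have the same length")
    tp = fp = tn = fn = 0
    for index_pos, ref_pos in zip(index_results, reference_results):
        if ref_pos and index_pos:
            tp += 1          # true positive
        elif ref_pos and not index_pos:
            fn += 1          # false negative
        elif not ref_pos and index_pos:
            fp += 1          # false positive
        else:
            tn += 1          # true negative
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("reference must contain positive and negative cases")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


# Hypothetical toy data for six patients (not study data).
index = [True, True, False, False, True, False]
reference = [True, True, True, False, False, False]
sens, spec = diagnostic_accuracy(index, reference)
```

When the reference itself misclassifies patients, both estimates inherit that error, which is the motivation for the reference-free methods discussed next.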

Latent class models are employed widely to estimate diagnostic accuracy without a gold standard [12] as this approach does not require a reference standard be selected arbitrarily as a criterion standard, which is considered more realistic and reflective of practice [13]. Latent class analyses rely on the central assumption of conditional independence [14]. Another approach is pairwise Bayesian analysis, which estimates the probability of an event based on prior knowledge of conditions related to the event and is common in psychology as a way to improve diagnostic accuracy. This approach can explicitly account for prior uncertainty in the sensitivity and specificity for each individual diagnostic tool [15]. The objective of this study was to use latent class models and pairwise Bayesian analyses to evaluate diagnostic accuracy of two combined clinical or two combined family delirium assessment tools used within the same critically ill adult patient.

Methods

Study design and setting

This diagnostic evaluation study used data from a previously published cross-sectional study [8] and is reported according to Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines for cross-sectional studies (S1 Table).

Participants

Recruitment of participants was performed at a large, tertiary care academic hospital (Foothills Medical Centre [FMC], Calgary, Canada) within a single-payer health system. A multidisciplinary care team staffs the 28 closed beds of the FMC medical-surgical ICU. Eligibility criteria for study participation are listed in S2 Table. Eligibility for participation was assessed daily by a trained research assistant following approval to approach a family granted by the bedside nurse. Patients who met eligibility criteria with at least one present family member (e.g., spouse, child, friend) who provided informed consent were consecutively enrolled as a patient-family dyad.

Procedure

The study collected data in ICU up to a maximum of five days. Patient and family demographics (e.g., age, sex) were collected at first delirium assessment. Patient clinical characteristics (e.g., admitting diagnosis, Acute Physiology and Chronic Health Evaluation-II [APACHE-II]) were obtained from eCritical, a bedside clinical information system validated for research purposes [16].

Measures

Clinical delirium assessments.

Two clinical delirium assessments were completed by assessors (i.e., nurse or research assistant) blind to the results of the family administered delirium assessments. The ICDSC was conducted twice daily (once per 12-hour nursing shift; standard of care) and is valid and reliable to screen for delirium on eight independent domains (i.e., hallucinations/delusions/psychosis, level of consciousness, inattention, disorientation, psychomotor agitation, inappropriate speech or mood, sleep-wake cycle disturbance, and symptom fluctuations) [17]. ICDSC delirium assessments yield an ordinal score ranging from 0 to 8 that can be dichotomized to classify patients as consistent with delirium (score of ≥4) or not having delirium (score of 0 to 3) [18]. ICDSC assessments have a reported sensitivity of 64% and specificity of 90% in this sample of patient-family dyads [8]. The second clinical assessment was the four-item dichotomous CAM-ICU (i.e., scored “delirium present” or “delirium absent”), administered twice daily (9:00AM-11:00AM; 2:00PM-4:00PM) to all eligible patients by a trained research assistant. The CAM-ICU has 76% sensitivity and 83% specificity in this sample [8].

Family administered delirium assessments.

Two family administered delirium assessments were performed once daily by family members of critically ill patients blind to results from clinical assessments. Family members assessed patient delirium using the Family Confusion Assessment Method (FAM-CAM) and the Sour Seven. Using the FAM-CAM, family members answered 11 items regarding sudden changes to patient attention, orientation, perception or concentration [19]. The FAM-CAM is a dichotomous (i.e., scored “delirium present” or “delirium absent”) family-administered delirium assessment tool with 54% sensitivity and 77% specificity in this sample [8]. The Sour Seven was also used to assess patient delirium symptoms related to altered awareness, disordered thinking, and reduced attention [20]. The Sour Seven is scored out of 18 with a cutpoint of ≥4 (i.e., indicating probable delirium); a cutpoint score of ≥4 in this sample has 73% sensitivity and 69% specificity [8].
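The two scored tools described above (ICDSC and Sour Seven) are reduced to a yes/no result by a cutpoint. The sketch below illustrates that dichotomization in Python. The function and constant names are hypothetical; only the score ranges and cutpoints come from the text.

```python
ICDSC_CUTPOINT = 4        # score >= 4 is consistent with delirium (range 0 to 8)
SOUR_SEVEN_CUTPOINT = 4   # score >= 4 indicates probable delirium (range 0 to 18)


def icdsc_positive(score):
    """Dichotomize an ICDSC score (0 to 8) at the cutpoint of 4."""
    if not 0 <= score <= 8:
        raise ValueError("ICDSC score must be between 0 and 8")
    return score >= ICDSC_CUTPOINT


def sour_seven_positive(score):
    """Dichotomize a Sour Seven score (0 to 18) at the cutpoint of 4."""
    if not 0 <= score <= 18:
        raise ValueError("Sour Seven score must be between 0 and 18")
    return score >= SOUR_SEVEN_CUTPOINT


# Hypothetical scores for one patient (not study data).
paired_family_result = (icdsc_positive(3), sour_seven_positive(7))
```

The CAM-ICU and FAM-CAM are already dichotomous, so no cutpoint step is needed for them.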

Data analysis

Data are presented as numbers/percentage, mean or median. Measures of diagnostic accuracy (e.g., sensitivity, specificity) are reported with accompanying 95% Confidence Intervals (CIs) or 95% Bayesian Credible (BC) intervals, as appropriate. From the primary study sample (of 147 dyads) we included 73 patient-family dyads that contributed one set of data for both pairwise assessments within 6-hour time windows, chosen to account for the fluctuation in delirium presentation [3]. To be included in analyses, each patient-family dyad needed to have a CAM-ICU, FAM-CAM and Sour Seven that were all recorded within 6 hours of when the ICDSC (i.e., standard of care) was performed (i.e., pairwise clinical assessments [ICDSC + CAM-ICU] and pairwise family assessments [FAM-CAM + Sour Seven], with each delirium assessment performed no longer than 6 hours after the ICDSC) [21]. Each dyad contributed data for one pair of clinical delirium assessments and one pair of family delirium assessments; when multiple pairs of data for a single dyad were reported, the pair that reported the most severe scores was used [22]. We used complete case analysis and no imputation techniques for missing data were employed. Statistical analyses were conducted in SAS 9.4 (SAS Institute Inc.), Stata IC v.16 (StataCorp LLC), and R (R Core Team, 2020). The poLCA (Polytomous Variable Latent Class Analysis) package was used to conduct latent class analysis [23] and the WinBUGS statistical software was used for Bayesian analysis [24–26].
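The 6-hour inclusion rule can be sketched as follows. This is one possible reading of the rule (each other assessment recorded no more than 6 hours after the ICDSC), not the authors' actual data-processing code. The function names and timestamps are hypothetical.

```python
from datetime import datetime, timedelta

WINDOW = timedelta(hours=6)


def within_window(icdsc_time, other_time, window=WINDOW):
    """True if other_time falls no more than `window` after icdsc_time."""
    delta = other_time - icdsc_time
    return timedelta(0) <= delta <= window


def complete_sets(icdsc_times, cam_icu_times, fam_cam_times, sour_seven_times):
    """Return the ICDSC times that have all three other tools in the window."""
    eligible = []
    for t in icdsc_times:
        others = (cam_icu_times, fam_cam_times, sour_seven_times)
        if all(any(within_window(t, o) for o in times) for times in others):
            eligible.append(t)
    return eligible


# Hypothetical timestamps for one dyad on one day (not study data).
day = datetime(2020, 1, 1)
icdsc = [day.replace(hour=8), day.replace(hour=20)]
cam_icu = [day.replace(hour=9, minute=30), day.replace(hour=14, minute=30)]
fam_cam = [day.replace(hour=13)]
sour_seven = [day.replace(hour=13)]
eligible = complete_sets(icdsc, cam_icu, fam_cam, sour_seven)
```

In this toy case only the 08:00 ICDSC has a CAM-ICU, FAM-CAM and Sour Seven within the following 6 hours, so the dyad would contribute that one set. Choosing the most severe set when several qualify would need a severity ordering that the text does not spell out, so it is left out of the sketch.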

We employed two statistical techniques to evaluate diagnostic accuracy of combined delirium assessments, accounting for each tool’s prior sensitivity and specificity, and with the novelty of being used within the same critically ill patient [26–29]. For the first technique, we used latent class analysis [27, 28] that pooled all four delirium assessments (i.e., ICDSC + CAM-ICU + FAM-CAM + Sour Seven) to simultaneously estimate the sensitivity and specificity for each individual assessment tool. To satisfy the central assumption for the present analyses to report estimates of diagnostic accuracy, all four (clinical and family) delirium assessments were pooled. A two-class model was selected based on the lowest Bayesian Information Criterion (S3 Table). Results from latent class analyses should be interpreted as the diagnostic criteria for an individual delirium assessment tool if that tool were to be used alone, without considering each tool’s prior estimated diagnostic criteria (i.e., sensitivity and specificity) [30].
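To make the two-class latent class model concrete, the sketch below fits one by expectation-maximization under conditional independence, using only the Python standard library. It is an illustration of the general technique, not the poLCA implementation used in the study. The function name, starting values, and toy response patterns are assumptions.

```python
def fit_two_class_lca(results, n_iter=500, tol=1e-10, eps=1e-6):
    """Fit a two-class latent class model to binary test results by EM.

    results: list of tuples of 0/1, one tuple per patient, one entry per tool.
    Returns (prevalence, sensitivities, specificities). Tests are assumed
    conditionally independent given the latent delirium class.
    """
    n, k = len(results), len(results[0])
    prev, sens, spec = 0.5, [0.8] * k, [0.8] * k  # assumed starting values

    def clamp(p):
        return min(max(p, eps), 1.0 - eps)  # keep likelihoods strictly positive

    for _ in range(n_iter):
        # E-step: posterior probability that each patient is in the delirium class.
        weights = []
        for row in results:
            p_del, p_no = prev, 1.0 - prev
            for y, se, sp in zip(row, sens, spec):
                p_del *= se if y else 1.0 - se
                p_no *= 1.0 - sp if y else sp
            weights.append(p_del / (p_del + p_no))
        # M-step: update parameters from the weighted counts.
        total_del = sum(weights)
        total_no = n - total_del
        new_prev = clamp(total_del / n)
        new_sens = [clamp(sum(w * r[j] for w, r in zip(weights, results)) / total_del)
                    for j in range(k)]
        new_spec = [clamp(sum((1 - w) * (1 - r[j]) for w, r in zip(weights, results)) / total_no)
                    for j in range(k)]
        change = max([abs(new_prev - prev)]
                     + [abs(a - b) for a, b in zip(new_sens, sens)]
                     + [abs(a - b) for a, b in zip(new_spec, spec)])
        prev, sens, spec = new_prev, new_sens, new_spec
        if change < tol:
            break
    return prev, sens, spec


# Hypothetical response patterns for four tools in 20 patients (not study data).
toy = [(1, 1, 1, 1)] * 6 + [(1, 1, 1, 0)] * 2 + [(0, 0, 0, 0)] * 10 + [(0, 0, 0, 1)] * 2
prevalence, sensitivities, specificities = fit_two_class_lca(toy)
```

Starting sensitivity and specificity above 0.5 steers the first class toward the "delirium" label. With real data, several random starts and a check of the information criteria would be needed, as the model selection step above describes.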

We used pairwise Bayesian analyses for combined delirium assessments (i.e., pairwise clinical assessments, ICDSC + CAM-ICU; or pairwise family assessments, FAM-CAM + Sour Seven) [26, 29]. We chose these combinations a priori because combining multiple clinical or multiple family assessment tools is more practical and feasible than combining across the two groups. Using the BUGS software [24], random samples were drawn from the posterior distributions of the parameters using Markov chain Monte Carlo. We report summary statistics based on these random samples, for which convergence was assessed. We ran 15,000 iterations of the Gibbs sampler; three different chains were generated, and we used the Gelman-Rubin diagnostic to assess convergence [31]. The first 5,000 iterations were discarded before posterior summaries were computed. Results from Bayesian analyses should be interpreted as diagnostic accuracy of combined delirium assessment tools all used within the same critically ill adult patient, with consideration of each tool’s prior estimated diagnostic criteria.
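The convergence check can be illustrated with a short Python sketch of the basic Gelman-Rubin potential scale reduction factor, applied to three chains of 15,000 draws with the first 5,000 discarded. The draws here are simulated from a Beta distribution purely as stand-ins for sampler output. They are not WinBUGS results, and the function name and seed are hypothetical.

```python
import random
from statistics import mean, variance


def gelman_rubin(chains):
    """Basic potential scale reduction factor for equal-length chains."""
    m, n = len(chains), len(chains[0])
    if m < 2 or n < 2 or any(len(c) != n for c in chains):
        raise ValueError("need at least two chains of equal length >= 2")
    within = mean([variance(c) for c in chains])        # W: mean within-chain variance
    between = n * variance([mean(c) for c in chains])   # B: between-chain variance
    if within == 0:
        raise ValueError("within-chain variance is zero")
    pooled = (n - 1) / n * within + between / n         # pooled variance estimate
    return (pooled / within) ** 0.5


# Simulated stand-in draws for one parameter (e.g., a sensitivity).
rng = random.Random(2022)
n_iter, burn_in, n_chains = 15_000, 5_000, 3
chains = [[rng.betavariate(8, 2) for _ in range(n_iter)] for _ in range(n_chains)]
kept = [chain[burn_in:] for chain in chains]             # discard burn-in draws
r_hat = gelman_rubin(kept)
```

Values of the factor close to 1 are conventionally read as no evidence against convergence; values well above 1 suggest the chains have not mixed.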

Ethical approval

The study was approved by the Conjoint Health Research Ethics Board at the University of Calgary (REB 16–2060).

Results

Population characteristics

A total of 73 patient-family member dyads from the full study sample [8] were included in these analyses (Fig 1). We excluded 85/147 dyads that did not provide data suitable to conduct pairwise Bayesian estimates (i.e., dyads that did not have the CAM-ICU, FAM-CAM and Sour Seven all recorded within 6 hours of the ICDSC). Demographic and clinical characteristics of the 73 included patient-family member dyads are shown in Table 1. Critically ill patients were primarily admitted with a medical diagnosis (n = 35, 48.0%) and were on average 57.8 years (SD, 15.8 yr); 67% (n = 49) of patients were male. Family members were on average 53.5 years (SD 14.7 yr) and mostly (n = 61, 83.6%) female. The majority (n = 45, 61.6%) of family members reported having completed some university/college education, or greater. The median Richmond Agitation-Sedation Scale score at the time of the ICDSC assessment included in analysis was 0 (interquartile range, -1 to 0) (i.e., “alert and calm”).

Fig 1. Participant flow diagram.

https://doi.org/10.1371/journal.pone.0267110.g001

Table 1. Demographic and clinical characteristics of 73 included patients and family members.

https://doi.org/10.1371/journal.pone.0267110.t001

Results of delirium assessments

Results of patient delirium assessments are presented in Table 2. On average, clinical delirium assessment tools were performed 1.83 hours (SD 1.54) apart. The majority (n = 40, 54.8%) of patients were classified “without delirium” by both clinical assessments for delirium (i.e., ICDSC and CAM-ICU). Many patients (n = 30, 41.1%) had conflicting assessment results between the CAM-ICU and ICDSC. Only 4.1% (n = 3) of patients were classified “with delirium” by both clinical assessments; this is common in our cohort when patients present with symptoms of delirium below the clinical threshold (identified by the ICDSC but not by the CAM-ICU) [32]. Family delirium assessments using the FAM-CAM and Sour Seven, which were done by the same family member at the same time, demonstrated more consistent results than clinical assessments. Twenty-eight patients (38.4%) were classified “without delirium” and 23 patients (31.5%) were classified “with delirium” by both the FAM-CAM and Sour Seven conducted within a 6-hour window.

Table 2. Patient delirium assessment results, paired.

https://doi.org/10.1371/journal.pone.0267110.t002

Diagnostic accuracy from latent class analysis

Estimates of diagnostic accuracy from a two-class latent model are shown in Table 3. Among clinical assessments the ICDSC had lower sensitivity (ICDSC, 0.79 [95% CI 0.58–0.93]; CAM-ICU, 0.88 [95% CI 0.68–0.97]) and negative predictive value (ICDSC, 0.91 [95% CI 0.80–0.97]; CAM-ICU, 0.93 [95% CI 0.81–0.99]) than the CAM-ICU. In contrast, the ICDSC had higher specificity (ICDSC, 1.00 [95% CI 0.93–1.00]; CAM-ICU, 0.82 [95% CI 0.68–0.91]) and positive predictive value (ICDSC, 1.00 [95% CI 0.83–1.00]; CAM-ICU, 0.70 [95% CI 0.51–0.85]) than the CAM-ICU.

Table 3. Diagnostic criteria for clinical and family delirium assessments from latent class analysis for pooled, whole sample (n = 73).

https://doi.org/10.1371/journal.pone.0267110.t003

Estimates of diagnostic accuracy from latent class analysis were lower for family delirium assessments than for clinical delirium assessments. Sensitivity of the FAM-CAM and Sour Seven were 0.79 (95% CI 0.58–0.93) and 0.67 (95% CI 0.45–0.84), respectively. Specificity estimates were 0.63 (95% CI 0.48–0.77) and 0.69 (95% CI 0.55–0.83) for the FAM-CAM and Sour Seven, respectively. Positive predictive values (FAM-CAM, 0.51 [95% CI 0.34–0.68]; Sour Seven, 0.53 [95% CI 0.33–0.70]) were lower than negative predictive values (FAM-CAM, 0.86 [95% CI 0.71–0.96]; Sour Seven, 0.81 [95% CI 0.66–0.91]).
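Predictive values depend on prevalence as well as on sensitivity and specificity. The Python sketch below applies Bayes' theorem to show that relationship. The function name is hypothetical, and the prevalence of 0.33 is an assumption borrowed from the "one third of ICU patients" figure in the Background; it is not a value reported for this analysis.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) from sensitivity, specificity and prevalence."""
    for name, p in (("sensitivity", sensitivity),
                    ("specificity", specificity),
                    ("prevalence", prevalence)):
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1")
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    true_neg = specificity * (1.0 - prevalence)
    false_neg = (1.0 - sensitivity) * prevalence
    ppv = true_pos / (true_pos + false_pos)   # P(delirium | positive test)
    npv = true_neg / (true_neg + false_neg)   # P(no delirium | negative test)
    return ppv, npv


# FAM-CAM latent class estimates from the text, with an ASSUMED prevalence of 0.33.
ppv, npv = predictive_values(sensitivity=0.79, specificity=0.63, prevalence=0.33)
```

With these inputs the function returns roughly 0.51 and 0.86, which sits close to the FAM-CAM predictive values reported above. This is an arithmetic illustration only and not a statement about how the reported values were derived.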

Diagnostic accuracy from Bayesian analyses

Estimates for sensitivity and specificity from pairwise Bayesian analyses are shown in Table 4. In combining results from clinical delirium assessments, the ICDSC had lower sensitivity (0.72 [95% BC 0.54–0.92]) and specificity (0.90 [95% BC 0.82–0.97]) than the CAM-ICU (sensitivity, 0.92 [95% BC 0.70–1.00]; specificity, 0.94 [95% BC 0.80–1.00]). In combining results from family delirium assessments, the FAM-CAM had higher sensitivity and specificity compared to the Sour Seven. Sensitivity estimates were 0.83 (95% BC 0.73–0.92) and 0.64 (95% BC 0.47–0.82) and specificity estimates were 0.93 (95% BC 0.84–0.98) and 0.85 (95% BC 0.67–0.99) for the FAM-CAM and Sour Seven, respectively.

Table 4. Diagnostic criteria for clinical and family delirium assessments from pairwise Bayesian analyses for whole sample (n = 73).

https://doi.org/10.1371/journal.pone.0267110.t004

Discussion

The motivation for this study was to employ two statistical techniques to evaluate diagnostic accuracy of combined clinical or family delirium assessment tools all used within the same critically ill adult patient. The first technique, latent class analysis, assessed performance characteristics (i.e., sensitivity, specificity) without being informed by each tool’s prior estimated results. In contrast, Bayesian analysis specifically accounted for each tool’s prior performance characteristics in estimating post-test probabilities or performance. Both analyses performed comparably and suggest that two combined clinical or two combined family delirium assessment tools have fair diagnostic accuracy.

A gold standard tool [test with 100% diagnostic accuracy] is not available for delirium detection and the development of such a test is likely not feasible or practical [26, 33]. The delirium literature recommends using either the 10th revision of the International Statistical Classification of Diseases and Related Health Problems (ICD-10) or the Diagnostic and Statistical Manual of Mental Disorders, 5th Edition (DSM-5) as the reference standard, as applied by trained clinicians [34]. It is common to see reference to earlier DSM versions (e.g., 4th Edition, DSM-4) as they are considered easier to operationalize than the DSM-5 [35]; the Confusion Assessment Method (CAM) was developed based on the DSM-3-R as a user-friendly diagnostic tool to screen for delirium in most clinical populations [36]. The lack of a gold standard and the plethora of available (but imperfect) reference standards [5] mean that combinations of results from many delirium assessment tools are used to determine if delirium is present.

Thomas et al. [37] referred to the DSM-5 to estimate diagnostic accuracy compared to a combination of the ICD-10 and the CAM in a sample of hospitalized elderly patients, and reported higher sensitivity and specificity for the CAM when used in combination with the ICD-10. Shenkin et al. [11] assessed diagnostic accuracy of the 4 A’s Test (4AT)—a short delirium assessment tool—and the CAM against the DSM-4 in a sample of older acute medical inpatients. The 4AT had 76% sensitivity and 94% specificity while the CAM had 40% sensitivity and 100% specificity. Since no delirium assessment tool is perfect [10], by arbitrarily selecting an imperfect delirium reference it is impossible to account for the true uncertainty in estimated diagnostic criteria for delirium assessments.

Latent class analysis is frequently used to estimate diagnostic accuracy from imperfect diagnostic tests [27, 28]. This statistical technique does not require arbitrary selection of a reference standard and is thought to be more realistic and reflective of practice. One limitation of this approach is that latent class models assume conditional independence of test results within each patient given the latent true delirium status, referred to as the central assumption of conditional independence (i.e., test results are independent given latent classes) [14]. To satisfy this assumption, we pooled all delirium assessment tools in a single two-class latent model, which prevented considering each tool’s prior estimated diagnostic criteria.

Our results are in line with our earlier report [8] such that for delirium diagnosis in ICU, a balance of sensitivity and specificity in diagnostic accuracy should be considered given that delirium diagnosis in critically ill patients is often subjective and challenging [4]. For example, in clinical practice administration of a pharmacological treatment requires high specificity (few false positive delirium cases) whereas for screening purposes high sensitivity (few false negative delirium cases) is desired.

We have previously shown feasibility and acceptability of employing family administered delirium assessment tools in adult ICU [38], and reported that diagnostic accuracy from family delirium assessments is still fair, but lower, in comparison and in addition to clinical delirium assessments [8]. In the present study we found that combining results from family assessments had higher sensitivity and specificity compared to either the FAM-CAM or Sour Seven used alone, which were also higher compared to estimates reported from our earlier diagnostic accuracy study in this sample [8]. While we do not propose that family members can accurately “diagnose” delirium, it is possible that engaging family members in delirium detection might be helpful to identify earlier symptoms. Future research should map items across the FAM-CAM and Sour Seven to create a single family tool with maximized sensitivity and specificity and validate this tool in a large sample of patients.

The results of our study must be interpreted cautiously and there are several limitations worth noting. First, this study was conducted as a secondary analysis of data collected in an earlier observational study. Second, our data were collected from patients admitted to a single medical center. Although this center serves a large catchment area of approximately 1.8 million people and should be reasonably representative of multisystem ICU patients in academic hospitals, these findings may not be generalizable to all ICU patients. Third, nearly 20% of our sample were patients with a neurological admission status in whom delirium diagnosis is especially challenging as neurologic patients fall more under the term acute encephalopathy rather than delirium [39]. Our ability to disentangle what is acute encephalopathy from delirium in the acutely neurologically injured patient using the GCS was limited; increased false positives in our sample would have resulted in reduced diagnostic accuracy. Fourth, achieving repeated assessments over five days was challenging as family members were variably present at the bedside; this limited our sample size given that we included only dyads with results from delirium assessments recorded no longer than 6 hours apart. Fifth, we compared different diagnostic accuracy estimates from diverse statistical techniques that variably considered underlying uncertainty in the accuracy of different delirium assessment tools. Our results indicated that combined delirium assessment tools are more accurate; overestimation is possible considering our small sample size. New, combined versions of clinical and family delirium assessment tools require development, testing, and validation to provide additional evidence to substantiate our hypothesis.

Conclusions

Caring for critically ill patients includes multiple tools to identify ICU delirium. Results from delirium assessment tools employed at different times during patient care are often combined owing to no gold standard and imperfect reference standards for delirium, but using several tools at a single time is impractical and infeasible. Using pairwise Bayesian analyses to explicitly account for each tool’s prior sensitivity and specificity, all in the same patient within 6 hours, we report that two combined clinical or two combined family delirium assessment tools have fair diagnostic accuracy.

Supporting information

S1 Table. STROBE statement; checklist of items that should be included in reports of cross-sectional studies.

https://doi.org/10.1371/journal.pone.0267110.s001

(DOCX)

S2 Table. Study eligibility criteria.

https://doi.org/10.1371/journal.pone.0267110.s002

(DOCX)

S3 Table. Summary of information criteria and estimated and observed probabilities of patient delirium using latent class analysis with a two-class model.

https://doi.org/10.1371/journal.pone.0267110.s003

(DOCX)

References

1. Leslie DL, Inouye SK. The importance of delirium: economic and societal costs. J Am Geriatr Soc. 2011;59 Suppl 2:S241–3. Epub 2011/12/07. pmid:22091567.
2. Morandi A, Jackson JC, Ely EW. Delirium in the intensive care unit. Int Rev Psychiatry. 2009;21(1):43–58. Epub 2009/02/17. pmid:19219712.
3. Diwell RA, Davis DH, Vickerstaff V, Sampson EL. Key components of the delirium syndrome and mortality: greater impact of acute change and disorganised thinking in a prospective cohort study. BMC Geriatr. 2018;18(1):24. Epub 2018/01/27. pmid:29370764.
4. Rizzo JA, Bogardus ST Jr., Leo-Summers L, Williams CS, Acampora D, Inouye SK. Multicomponent targeted intervention to prevent delirium in hospitalized older patients: what is the economic value? Med Care. 2001;39(7):740–52. Epub 2001/07/18. pmid:11458138.
5. Carvalho JP, de Almeida AR, Gusmao-Flores D. Delirium rating scales in critically ill patients: a systematic literature review. Rev Bras Ter Intensiva. 2013;25(2):148–54. Epub 2013/08/07. pmid:23917980.
6. Bigatello LM, Amirfarzan H, Haghighi AK, Newhouse B, Del Rio JM, Allen K, et al. Effects of routine monitoring of delirium in a surgical/trauma intensive care unit. J Trauma Acute Care Surg. 2013;74(3):876–83. Epub 2013/02/22. pmid:23425751.
7. Gelinas C, Berube M, Chevrier A, Pun BT, Ely EW, Skrobik Y, et al. Delirium Assessment Tools for Use in Critically Ill Adults: A Psychometric Analysis and Systematic Review. Crit Care Nurse. 2018;38(1):38–49. Epub 2018/02/14. pmid:29437077.
8. Fiest KM, Krewulak KD, Ely EW, Davidson JE, Ismail Z, Sept BG, et al. Partnering With Family Members to Detect Delirium in Critically Ill Patients. Crit Care Med. 2020;48(7):954–61. Epub 2020/04/26. pmid:32332281.
9. Devlin JW, Fraser GL, Joffe AM, Riker RR, Skrobik Y. The accurate recognition of delirium in the ICU: the emperor’s new clothes? Intensive Care Med. 2013;39(12):2196–9. Epub 2013/10/12. pmid:24114318.
10. Pun BT, Devlin JW. Delirium monitoring in the ICU: strategies for initiating and sustaining screening efforts. Semin Respir Crit Care Med. 2013;34(2):179–88. Epub 2013/05/30. pmid:23716309.
11. Shenkin SD, Fox C, Godfrey M, Siddiqi N, Goodacre S, Young J, et al. Delirium detection in older acute medical inpatients: a multicentre prospective comparative diagnostic test accuracy study of the 4AT and the confusion assessment method. BMC Med. 2019;17(1):138. Epub 2019/07/25. pmid:31337404.
12. Collins J, Huynh M. Estimation of diagnostic test accuracy without full verification: a review of latent class methods. Stat Med. 2014;33(24):4141–69. Epub 2014/06/10. pmid:24910172.
13. Rindskopf D, Rindskopf W. The value of latent class analysis in medical diagnosis. Stat Med. 1986;5(1):21–7. Epub 1986/01/01. pmid:3961312
14. Hui SL, Walter SD. Estimating the error rates of diagnostic tests. Biometrics. 1980;36(1):167–71. Epub 1980/03/01. pmid:7370371
15. Tommasi M, Ferrara G, Saggino A. Application of Bayes’ Theorem in Valuating Depression Tests Performance. Front Psychol. 2018;9:1240. Epub 2018/08/08. pmid:30083119.
16. Brundin-Mather R, Soo A, Zuege DJ, Niven DJ, Fiest K, Doig CJ, et al. Secondary EMR data for quality improvement and research: A comparison of manual and electronic data collection from an integrated critical care electronic medical record system. J Crit Care. 2018;47:295–301. Epub 2018/08/14. pmid:30099330.
17. Pun BT, Ely EW. The importance of diagnosing and managing ICU delirium. Chest. 2007;132(2):624–36. Epub 2007/08/19. pmid:17699134.
18. Bergeron N, Dubois MJ, Dumont M, Dial S, Skrobik Y. Intensive Care Delirium Screening Checklist: evaluation of a new screening tool. Intensive Care Med. 2001;27(5):859–64. Epub 2001/06/30. pmid:11430542.
19. Steis MR, Evans L, Hirschman KB, Hanlon A, Fick DM, Flanagan N, et al. Screening for delirium using family caregivers: convergent validity of the Family Confusion Assessment Method and interviewer-rated Confusion Assessment Method. J Am Geriatr Soc. 2012;60(11):2121–6. pmid:23039310.
20. Shulman RW, Kalra S, Jiang JZ. Validation of the Sour Seven Questionnaire for screening delirium in hospitalized seniors by informal caregivers and untrained nurses. BMC Geriatr. 2016;16:44. Epub 2016/02/18. pmid:26879927.
21. Asafu-Adjei JK, Betensky RA. A Pairwise Naive Bayes Approach to Bayesian Classification. Intern J Pattern Recognit Artif Intell. 2015;29(7). Epub 2016/04/19. pmid:27087730.
22. Chang C, editor. Response feature analysis of longitudinal data in medical research. 2000.
23. Linzer DA, Lewis J. poLCA: Polytomous Variable Latent Class Analysis. R package version 1.4. 2013. https://cran.r-project.org/web/packages/poLCA/README.html.
24. Crainiceanu CM, Goldsmith AJ. Bayesian Functional Data Analysis Using WinBUGS. J Stat Softw. 2010;32(11). Epub 2010/01/01. pmid:21743798.
25. Estimation of Disease Prevalence and the Parameters of One, Two or Three Diagnostic Tests. http://www.medicine.mcgill.ca/epidemiology/Joseph/software/BayesDiagnosticTests/InstallInstructions.html.
26. Joseph L, Gyorkos TW, Coupal L. Bayesian estimation of disease prevalence and the parameters of diagnostic tests in the absence of a gold standard. Am J Epidemiol. 1995;141(3):263–72. Epub 1995/02/01. pmid:7840100.
27. Faraone SV, Tsuang MT. Measuring diagnostic accuracy in the absence of a "gold standard". Am J Psychiatry. 1994;151(5):650–7. Epub 1994/05/01. pmid:8166304.
28. Walter SD, Irwig LM. Estimation of test error rates, disease prevalence and relative risk from misclassified data: a review. J Clin Epidemiol. 1988;41(9):923–37. Epub 1988/01/01. pmid:3054000.
29. Dendukuri N, Joseph L. Bayesian approaches to modeling the conditional dependence between multiple diagnostic tests. Biometrics. 2001;57(1):158–67. Epub 2001/03/17. pmid:11252592.
30. Ulbricht CM, Chrysanthopoulou SA, Levin L, Lapane KL. The use of latent class analysis for identifying subtypes of depression: A systematic review. Psychiatry Res. 2018;266:228–46. Epub 2018/04/02. pmid:29605104.
31. Vats D, Knudson C. Revisiting the Gelman-Rubin Diagnostic. 2018.
32. Krewulak KD, Rosgen BK, Ely EW, Stelfox HT, Fiest KM. The CAM-ICU-7 and ICDSC as measures of delirium severity in critically ill adult patients. PLoS One. 2020;15(11):e0242378. Epub 2020/11/17. pmid:33196655.
33. Dendukuri N, Rahme E, Belisle P, Joseph L. Bayesian sample size determination for prevalence and diagnostic test studies in the absence of a gold standard test. Biometrics. 2004;60(2):388–97. Epub 2004/06/08. pmid:15180664.
34. Leonard MM, Nekolaichuk C, Meagher DJ, Barnes C, Gaudreau JD, Watanabe S, et al. Practical assessment of delirium in palliative care. J Pain Symptom Manage. 2014;48(2):176–90. Epub 2014/04/29. pmid:24766745.
35. Lawlor PG, Bush SH. Delirium diagnosis, screening and management. Curr Opin Support Palliat Care. 2014;8(3):286–95. Epub 2014/07/09.
36. Inouye SK, van Dyck CH, Alessi CA, Balkin S, Siegal AP, Horwitz RI. Clarifying confusion: the confusion assessment method. A new method for detection of delirium. Ann Intern Med. 1990;113(12):941–8. Epub 1990/12/15. pmid:2240918.
37. Thomas C, Kreisel SH, Oster P, Driessen M, Arolt V, Inouye SK. Diagnosing delirium in older hospitalized adults with dementia: adapting the confusion assessment method to international classification of diseases, tenth revision, diagnostic criteria. J Am Geriatr Soc. 2012;60(8):1471–7. Epub 2012/08/14. pmid:22881707.
38. Krewulak KD, Sept BG, Stelfox HT, Ely EW, Davidson JE, Ismail Z, et al. Feasibility and acceptability of family administration of delirium detection tools in the intensive care unit: a patient-oriented pilot study. CMAJ Open. 2019;7(2):E294–E9. Epub 2019/04/28. pmid:31028053.
39. Patel MB, Bednarik J, Lee P, Shehabi Y, Salluh JI, Slooter AJ, et al. Delirium Monitoring in Neurocritically Ill Patients: A Systematic Review. Crit Care Med. 2018;46(11):1832–41. Epub 2018/08/25. pmid:30142098.


Supporting information

S1 Table. STROBE statement; checklist of items that should be included in reports of cross-sectional studies.

S2 Table. Study eligibility criteria.

S3 Table. Summary of information criteria and estimated and observed probabilities of patient delirium using latent class analysis with a two-class model.
