The HIV Dementia Scale (HDS) and International HIV Dementia Scale (IHDS) are brief tools that have been developed to screen for and aid diagnosis of HIV-associated dementia (HAD). They are increasingly being used in clinical practice for minor neurocognitive disorder (MND) as well as HAD, despite uncertainty about their accuracy.
Methods and Findings
A systematic review of the accuracy of the HDS and IHDS was conducted. Studies were assessed on Standards for Reporting Diagnostic Accuracy criteria. Pooled sensitivity, specificity, likelihood ratios (LR) and diagnostic odds ratios (DOR) were calculated for each scale as a test for HAD or MND. We retrieved 15 studies of the HDS, 10 of the IHDS, and 1 of both scales. Thirteen studies of the HDS were conducted in North America, and 7 of the IHDS studies were conducted in sub-Saharan Africa. Estimates of accuracy were highly heterogeneous between studies for the HDS but less so for the IHDS. Pooled DOR for the HDS was 7.52 (95% confidence interval 3.75–15.11), sensitivity and specificity for HAD were estimated at 68.1% and 77.9%, and sensitivity and specificity for MND were estimated at 42.0% and 91.2%. Pooled DOR for the IHDS was 3.49 (2.12–5.73), sensitivity and specificity for HAD were 74.3% and 54.7%, and sensitivity and specificity for MND were 64.3% and 66.0%.
Both scales were low in accuracy. The literature is limited by the lack of a gold standard, and variation in estimates of accuracy is likely to be due to differences in reference standard. There is a lack of studies comparing both scales, and they have been studied in different populations, but the IHDS may be less specific than the HDS. These rapid tests are not recommended for diagnostic use, and further research is required to inform their use in asymptomatic screening.
Citation: Haddow LJ, Floyd S, Copas A, Gilson RJC (2013) A Systematic Review of the Screening Accuracy of the HIV Dementia Scale and International HIV Dementia Scale. PLoS ONE 8(4): e61826. https://doi.org/10.1371/journal.pone.0061826
Editor: Shibo Jiang, Shanghai Medical College, Fudan University, China
Received: October 2, 2012; Accepted: March 12, 2013; Published: April 16, 2013
Copyright: © 2013 Haddow et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The authors have no support or funding to report.
Competing interests: The authors have declared that no competing interests exist.
HIV-associated neurocognitive disorders (HAND) are defined as impairment of multiple cognitive domains in association with HIV, in the absence of other causes for the impairment. HAND may affect up to half of all HIV positive (HIV+) individuals, even in regions with good access to antiretroviral therapy (ART). Symptomatic HAND (HIV-associated dementia [HAD] or minor neurocognitive disorder [MND]) is recommended as a reason to initiate or modify ART in recent European and British clinical guidelines.
The “Frascati criteria” are a research classification system that defines HAD, the most severe grade of HAND, as impairment in at least two cognitive domains, scoring at least 2 standard deviations (SD) below demographically-appropriate means, with marked impairment of activities of daily living (ADL) caused by the cognitive deficits. The two milder grades of HAND, much more common than HAD, are MND (defined as at least 1 SD below the mean in two domains with at least mild impairment of ADL) and asymptomatic neurocognitive impairment (ANI) (defined similarly to MND but without impairment of ADL).
Fulfilment of the Frascati criteria requires neuropsychological (NP) testing of at least five cognitive domains from a possible seven, assessment of ADL, and exclusion of other diagnoses. The criteria are further limited by the lack of a standardised method of grading ADL, uncertainty about the clinical significance and possible oversensitivity for mild impairment, and lack of confirmatory neuropathological, imaging or laboratory biomarkers. The 1991 American Academy of Neurology (AAN) criteria are simpler to use, in that they only require that clinical diagnosis is “supplemented by” neuropsychological assessment, but are otherwise very similar to the Frascati criteria. The 1988 Memorial Sloan-Kettering (MSK) criteria are based largely on clinical assessment and therefore may be more subjective, and are suited to an era prior to the availability of ART when HAD was terminally progressive.
Given the complexity of diagnosis, there is a role for rapid tests that can be incorporated into routine asymptomatic screening. The HIV Dementia Scale (HDS) was developed in 1995 as a “brief but sensitive instrument to identify [HIV-associated] dementia”. The scale comprises four tests of subcortical cognitive domains (attention, motor speed, construction, and working memory). In response to culturally-specific elements of the HDS and difficulties with the administration of the anti-saccadic errors test, the International HIV Dementia Scale (IHDS) was developed as an alternative in 2005. Both tests provide a score but have a standardised cut-off for determining a positive or negative result. Both were proposed as rapid tests for screening (i.e. for individuals free of significant symptoms) and not diagnostic tests to confirm disease in patients with signs or symptoms of HAND, and patients who test positive with either the HDS or IHDS should undergo further assessment for diagnosis. Other brief clinical screening tools and computerised cognitive test batteries have been used, but there are fewer studies of their accuracy.
The HDS and IHDS have been used in recent clinical studies in North and Central America, sub-Saharan Africa, South Asia and Europe, and have been considered for inclusion as screening tools in expert HIV treatment guidelines (although the IHDS has recently been replaced with a three-symptom questionnaire in updated European guidelines), but important questions remain. First, they were devised for identifying HAD, and their performance in detecting milder neurocognitive impairment may be quite different. Second, it is unknown whether one scale has better accuracy than the other. Third, the study methods, settings and estimates vary considerably between diagnostic accuracy studies. To enable evidence-based use of these tests in clinical practice, we conducted a systematic review to estimate the accuracy of each scale for the diagnosis of HAD and MND when compared to standard diagnostic criteria.
Search strategy and selection criteria
A literature search was conducted in July 2011 and repeated in January 2013 by the first author, including PubMed and PsycInfo indexes, searchable online HIV/AIDS conference proceedings, specialist journals, and major online sources of HIV-related information. Search terms were formulated to capture all studies using the HDS or IHDS alongside another diagnostic method for HAND in a sample of HIV positive adults (Table 1). Manual searches included reference lists of relevant articles identified in automated searches, conference proceedings, and requests for unpublished data to authors of major articles. PubMed and PsycInfo searches were limited to 1994 onwards (the year prior to publication of the HDS) and conference abstracts were limited to available years (mainly 2001 onwards).
From this initial search, studies were excluded if they duplicated data reported in another study in the search, and were only included if they used either the HDS or IHDS to assess individual HIV+ adults, as well as an appropriate reference standard for comparison. In this review, the highest-quality reference standard was a standardised clinical definition (Frascati, AAN, or MSK) supported by a NP battery evaluating at least five broad cognitive domains (attention and working memory; verbal and/or visual learning and recall; processing speed; executive functions; motor skills). Studies using other reference standards such as a detailed NP battery only, clinical opinion or brief NP tools were reviewed but not included in all stages of the analysis (see below).
Assessment of study quality
Data collected for each study included study identifiers, the year(s) in which the work was conducted, geographical region, details of HIV positive study participants (number, age, education, degree of immunodeficiency, ART coverage, drug and alcohol use, psychiatric conditions, and relevant co-morbidities), test of interest (HDS or IHDS), reference standard for comparison, possible sources of bias and error, and the results of the test of interest and reference standard. Authors of papers with useable data were contacted to clarify their methodology.
Possible sources of bias and error were identified from a pre-specified list of quality criteria, based on Standards for Reporting Diagnostic Accuracy (STARD) guidelines. Criteria to assess selection methods were the target population, inclusion and exclusion criteria, sampling methodology (consecutive, random, or opportunistic), information about eligible patients who were not recruited, and whether there was an a priori power calculation. Criteria relating to diagnostic methods included whether assessors completing the test and the reference standard were blinded to each other's assessment, adequacy and appropriateness of methods used for the reference standard, methods of ensuring validity and reliability of the assessments, and time lag or drop-outs between assessments. Studies were also assessed on whether the patient sample was adequately described, and whether there were any characteristics of the sample that might reduce its generalizability.
Collection of screening or diagnostic accuracy data
The numbers of true and false positives (TP, FP) and true and false negatives (TN, FN) among HIV+ study subjects were determined using standard cut-offs for the test of interest (less than or equal to 10 for both scales). The reference standard was categorised as having either a severe or a moderate threshold. Severe threshold reference standards were those using Frascati or AAN criteria for HAD, or the MSK grading for AIDS Dementia Complex (grade 1 to 4). All three of these standards are similar in threshold, although the Frascati definition for HAD may represent the more severe end of the impairment spectrum. Severe impairment in studies employing NP batteries was defined by similar criteria to HAD, namely impairment to ≥2 SD below normative means in at least two out of five cognitive domains. Moderate threshold reference standards were those that used MND (Frascati criteria), Minor Cognitive-Motor Disorder (MCMD; AAN criteria), or MSK grade 0.5 as a cut-off, with more severe impairment also included as a positive diagnosis. There is slightly less agreement between MND, MCMD, and MSK grade 0.5 than for more severe impairment. Moderate impairment in studies using NP batteries was defined by similar criteria to MND, namely impairment in at least two domains at a level of at least 1 SD below expected means.
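The dichotomisation step described above can be sketched in Python as follows. This is a minimal illustration, not the review's own procedure: the function name, argument names and example scores are hypothetical, and only the cut-off (a score of 10 or below counts as test-positive) comes from the text.

```python
def two_by_two(scores, impaired, cutoff=10):
    """Cross-tabulate scale scores against a dichotomous reference standard.

    scores   -- HDS or IHDS totals, one per participant
    impaired -- True where the reference standard is positive
    cutoff   -- a score at or below this value counts as test-positive
    """
    tp = fp = fn = tn = 0
    for score, is_impaired in zip(scores, impaired):
        test_positive = score <= cutoff
        if test_positive and is_impaired:
            tp += 1
        elif test_positive:
            fp += 1
        elif is_impaired:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


# Hypothetical scores and reference diagnoses, for illustration only.
table = two_by_two([8, 10, 11, 12, 9, 14],
                   [True, True, True, False, False, False])
print(table)  # (2, 1, 1, 2)
```

A study reporting only continuous scores, with no way of applying the cut-off per participant, cannot be reduced to such a table, which is why those studies were excluded.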
If the necessary values could not be extracted from published papers, but it was apparent that the necessary source data might exist elsewhere (e.g. if test scores were reported as a continuous distribution), the corresponding author was contacted to request these data. If it was not possible to dichotomise both the test of interest and the reference standard, the study was excluded from the analysis.
The accuracy of the test of interest in each study was quantified by the sensitivity (true positive rate), specificity (1−false positive rate), positive likelihood ratio (LR+; equal to sensitivity ÷ [1−specificity]), negative likelihood ratio (LR−; equal to [1−sensitivity] ÷ specificity), and diagnostic odds ratio (DOR; equal to [TP×TN] ÷ [FP×FN]). Positive and negative LR can be multiplied by the assumed odds of a diagnosis being present before conducting the test (prior odds) to determine the final odds of a diagnosis being present (posterior odds). According to Jaeschke et al, tests with LR+ >5 or LR− <0.2 provide strong evidence for or against the diagnosis, and LR+ >10 and LR− <0.1 provide convincing diagnostic evidence in most scenarios. 95% confidence intervals (CI) were calculated for each measure.
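As a minimal sketch of these definitions, the Python function below derives each measure from a single two-by-two table. The function name and the example counts are hypothetical and are not taken from any study in the review; confidence intervals are omitted.

```python
def accuracy_measures(tp, fp, fn, tn):
    """Return accuracy measures for one 2x2 table (all cells assumed non-zero)."""
    sensitivity = tp / (tp + fn)               # true positive rate
    specificity = tn / (tn + fp)               # 1 - false positive rate
    lr_pos = sensitivity / (1 - specificity)   # positive likelihood ratio
    lr_neg = (1 - sensitivity) / specificity   # negative likelihood ratio
    dor = (tp * tn) / (fp * fn)                # diagnostic odds ratio
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "lr_pos": lr_pos,
        "lr_neg": lr_neg,
        "dor": dor,
    }


# Hypothetical counts, for illustration only.
measures = accuracy_measures(tp=30, fp=20, fn=10, tn=80)
for name, value in measures.items():
    print(f"{name}: {value:.3f}")
```

Note that the DOR equals LR+ divided by LR−, so a test can have a respectable DOR while neither likelihood ratio reaches the thresholds quoted above.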
Four groups of studies were defined, according to the test of interest (separate analyses were performed for the HDS and IHDS) and the reference standard threshold (severe or moderate). Some studies reported more than one grade of impairment and therefore contributed to more than one group. Studies were then pooled if they used comprehensive criteria (Frascati, AAN or MSK), but were discounted if they used only a NP battery or a brief tool as the reference standard. Studies were not excluded on the basis of other quality criteria. Where there were two reference standards applied to the same sample, the more comprehensive standard was retained.
For each of these four pools of studies, heterogeneity between estimates of sensitivity and specificity was assessed by chi-squared tests, ignoring studies with cell sizes of <5. Heterogeneity between estimates of LRs was assessed using the I-squared measure. Reasons for heterogeneity between studies were later assessed by meta-regression of LRs with study characteristics as the independent variable.
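For readers unfamiliar with the I-squared measure, the sketch below computes it from Cochran's Q using the usual definition, I² = max(0, (Q − df) / Q) × 100. The review does not state which software or variance estimator was used, so the inputs here (log likelihood ratios with their variances) and the function name are assumptions made for illustration.

```python
def i_squared(estimates, variances):
    """Percentage of variation across studies attributable to heterogeneity.

    estimates -- per-study effect estimates on an additive scale (e.g. log LR)
    variances -- the corresponding within-study variances
    """
    weights = [1.0 / v for v in variances]     # inverse-variance weights
    pooled = sum(w * y for w, y in zip(weights, estimates)) / sum(weights)
    # Cochran's Q: weighted squared deviations from the pooled estimate.
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, estimates))
    df = len(estimates) - 1
    if q <= 0:
        return 0.0
    return max(0.0, (q - df) / q) * 100.0


# Hypothetical log LR+ values and variances, for illustration only.
print(i_squared([1.2, 0.8, 1.9, 0.5], [0.10, 0.08, 0.15, 0.12]))
```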
Pooled sensitivity and specificity were then calculated as averages, weighted by sample size, and pooled LR+ and LR− were calculated using standard meta-analysis methods for risk ratios with a random effects model. These methods have the potential to underestimate test accuracy in the presence of diagnostic threshold variation; such variation was assessed using Spearman's rank test for correlation between sensitivity and specificity. A summary DOR that is constant across diagnostic thresholds removes this source of error.
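The sample-size weighting can be illustrated as below. The text does not say whether the weight was the total study size or the number of participants with and without the condition; this sketch assumes total study size, and the function name and tables are hypothetical.

```python
def pooled_sens_spec(tables):
    """Sample-size-weighted mean sensitivity and specificity.

    tables -- iterable of (tp, fp, fn, tn) tuples, one per study.
    Assumption: each study is weighted by its total sample size.
    """
    total_n = 0
    sens_sum = 0.0
    spec_sum = 0.0
    for tp, fp, fn, tn in tables:
        n = tp + fp + fn + tn
        total_n += n
        sens_sum += n * tp / (tp + fn)
        spec_sum += n * tn / (tn + fp)
    return sens_sum / total_n, spec_sum / total_n


# Two hypothetical studies, for illustration only.
sens, spec = pooled_sens_spec([(30, 20, 10, 80), (10, 10, 10, 30)])
print(f"pooled sensitivity {sens:.3f}, pooled specificity {spec:.3f}")
```

Because sensitivity and specificity are averaged separately, studies operating at different effective thresholds pull the two averages in opposite directions, which is the source of underestimation mentioned above.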
Summary DORs were calculated using the Littenberg-Moses method, in which the linear relationship D = a+bS is examined in a regression model (where D = ln(DOR) = logit[TPR] − logit[FPR] and S = logit[TPR] + logit[FPR]), with points weighted by the square root of sample size. When calculating DORs, it was possible to combine studies with severe and moderate reference thresholds, and those with detailed NP batteries as the reference standard were re-incorporated into the analysis. A continuity correction was applied, because some studies had FN or FP equal to zero.
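A rough sketch of this regression is given below, taking D as the log diagnostic odds ratio (logit[TPR] − logit[FPR]), as in the standard formulation of the method. Several details are assumptions rather than the review's stated procedure: the continuity correction is taken to be 0.5 added to every cell, the weight is the square root of the total study size, and the function name and example tables are hypothetical.

```python
import math


def logit(p):
    return math.log(p / (1 - p))


def littenberg_moses(tables, correction=0.5):
    """Fit D = a + b*S by weighted least squares.

    tables -- iterable of (tp, fp, fn, tn) tuples, one per study
    Returns (a, b); exp(a) is the summary DOR when b is close to zero.
    """
    xs, ys, ws = [], [], []
    for tp, fp, fn, tn in tables:
        n = tp + fp + fn + tn
        # Continuity correction (assumed form) guards against zero cells.
        tp, fp, fn, tn = (c + correction for c in (tp, fp, fn, tn))
        tpr = tp / (tp + fn)
        fpr = fp / (fp + tn)
        ys.append(logit(tpr) - logit(fpr))   # D = ln(DOR)
        xs.append(logit(tpr) + logit(fpr))   # S, a proxy for threshold
        ws.append(math.sqrt(n))
    sw = sum(ws)
    mean_x = sum(w * x for w, x in zip(ws, xs)) / sw
    mean_y = sum(w * y for w, y in zip(ws, ys)) / sw
    sxx = sum(w * (x - mean_x) ** 2 for w, x in zip(ws, xs))
    sxy = sum(w * (x - mean_x) * (y - mean_y) for w, x, y in zip(ws, xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


# Hypothetical studies, for illustration only.
a, b = littenberg_moses([(30, 20, 10, 40), (20, 10, 20, 60), (12, 0, 8, 40)])
print(f"intercept a = {a:.3f}, slope b = {b:.3f}, exp(a) = {math.exp(a):.2f}")
```

A slope near zero indicates that the DOR does not vary with threshold, in which case exp(a) serves as a single summary DOR.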
DORs are a single composite measure of both true- and false-positive rates, and therefore less clinically useful than other measures. To assist interpretation, predicted specificity and LRs were calculated from the average sensitivity and the summary DOR.
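The back-calculation follows from the identity DOR = odds(TPR) ÷ odds(FPR). The sketch below is an illustration with a hypothetical function name; the example inputs are the pooled HDS sensitivity for HAD and the HDS summary DOR reported in this review (68.1% and 7.52).

```python
def predict_from_dor(sensitivity, dor):
    """Predict specificity and likelihood ratios at a given sensitivity."""
    odds_tpr = sensitivity / (1 - sensitivity)
    odds_fpr = odds_tpr / dor                 # DOR = odds(TPR) / odds(FPR)
    fpr = odds_fpr / (1 + odds_fpr)
    specificity = 1 - fpr
    lr_pos = sensitivity / fpr
    lr_neg = (1 - sensitivity) / specificity
    return specificity, lr_pos, lr_neg


spec, lr_pos, lr_neg = predict_from_dor(0.681, 7.52)
print(f"specificity {spec:.3f}, LR+ {lr_pos:.2f}, LR- {lr_neg:.2f}")
```

With these inputs the predicted specificity is approximately 78%, with LR+ close to 3.1 and LR− close to 0.41.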
Studies included in the review (Figure 1; Flowchart S1)
Footnote: * Of the final 26 studies in the review, one comprised two separate populations, which are treated as two different studies in all further analyses.
The literature search generated 3698 unique citations, of which 56 reported using the HDS or IHDS and a reference standard in the same HIV positive sample. Of these, 28 were discarded because they did not dichotomise both test results, and one was excluded because it used a non-standard cut-off. A further study in rural Zambia was excluded because it was observed that all 48 HIV positive participants and all 15 HIV-negative controls scored positively (impaired) on the HDS. The remainder comprised 15 studies of the HDS, ten of the IHDS, and one of both the HDS and IHDS. One of the IHDS studies reported results from two populations and was considered as two separate studies in all further analyses.
Of the 27 studies meeting inclusion criteria (Table 2), several sampled high risk target populations, including patients in the pre-highly active ART (HAART) era, admissions to specialist AIDS facilities, patients with low CD4+ counts, patients in regions with limited access to ART, and individuals with psychiatric illness or drug abuse. Nearly all studies recruited patients unselected for neurocognitive symptoms, apart from four studies that specifically targeted those with cognitive complaints or neurology clinic referrals. In contrast, two studies targeted virologically stable patients, and two studies excluded patients known to have significant dementia. Eighteen studies excluded patients with confounding conditions, mainly neurological disorders (n = 14 studies), psychiatric conditions (n = 11), systemic illnesses (n = 10), and drug or alcohol use (n = 8). Thirteen of the HDS studies were conducted in the USA or Canada, with one HDS study in each of South Africa, Switzerland, and Puerto Rico. In contrast, only three of the IHDS studies were conducted in North America, one in India, and one in Italy, with the remainder from sub-Saharan Africa. Characteristics of the patients in HDS studies (n = 3143) and IHDS studies (n = 942) are shown in Table 3.
Methodology and study quality
Methodological characteristics relating to study quality are summarised in Figure 2. Eighteen studies were specifically designed to assess one of the screening tools. The sampling method was random or consecutive in only seven samples (allowing for some ambiguity in reporting). The number of eligible patients who were not recruited was available for seven studies. No published articles reported any justification for their sample size, but one author disclosed that they had performed a power calculation.
Olive-green bars indicate fulfilment of study quality criteria, red bars indicate non-fulfilment, and blue bars indicate that this feature was not reported or available from correspondence with the study author. The study by Skinner et al applied both scales to the same patient sample and is represented in both graphs A and B. HAND: HIV-associated neurocognitive disorder.
Two studies used inadequate reference standards: the Mini Mental State Examination, or a short NP battery only. Five studies used standard definitions based on clinical assessment but either did not report their NP battery or used a battery assessing fewer than five domains, and six studies used a NP battery only. Norms were generally based on published demographically-standardised data, but two studies collected normative data from local HIV-negative samples, and six studies based in Africa used norms primarily derived from US populations. Methods for quality control of the HDS or IHDS included test-retest methods, specialist supervision and training, and improved standardisation, and expert panel review was used for the reference standard in some studies.
Full or partial blinding between assessments occurred in seven studies, and most studies did not report the use or non-use of blinding. Lack of blinding was usually due to assessments being done by the same investigator. Verification bias was difficult to exclude with available information, but three studies had a time-lag between assessments.
Estimates of accuracy of the HIV Dementia Scale
Sensitivity estimates for detecting severe HAND with the HDS ranged widely from 35.7–91.7%, specificity 60.4–100%, LR+ 1.89–6.29, and LR− 0.12–0.72 (Table 4). After removing studies with NP batteries or brief tools as reference standards, there was evidence of heterogeneity between these estimates (p = 0.10 for LR+; p = 0.06 for LR−; p = 0.03 for sensitivity; p<0.001 for specificity). There was borderline evidence of a correlation between sensitivity and specificity across these studies (Spearman's ρ = −0.68 for nine observations, p = 0.062). Pooling seven studies that used a comprehensive reference standard gave sensitivity 68.1% (95% CI 59.2–75.9%), specificity 80.2% (76.6–83.5%), LR+ 3.76 (2.65–5.33), LR− 0.42 (0.29–0.63).
Sensitivity estimates for detecting moderate-to-severe HAND were again in a wide range from 9.1–61.5%, specificity 62.5–97.8%, LR+ 1.33–7.00, LR− 0.47 to 0.93. There was also heterogeneity between estimates (p = 0.03 for sensitivity; p<0.001 for specificity; p = 0.03 for LR+), although not for LR− (p = 0.28), but little evidence of correlation between sensitivity and specificity in this pool of studies (ρ = −0.77 for six observations, p = 0.07). Pooled estimates were sensitivity 42.0% (34.6–49.7%), specificity 83.3% (78.4–87.3%), LR+ 3.18 (1.70–5.95), LR− 0.70 (0.60–0.81).
The summary DOR for the HDS was estimated at 7.52 (3.75–15.11) (Figure 3). Predictions of test accuracy for the HDS were made using the above pooled sensitivity estimates (68.1% and 42.0%), giving specificity of 77.9% for severe HAND and 91.2% for moderate-to-severe HAND, LR+ of 3.08 and 4.79, and LR− of 0.41 and 0.64, respectively. Repeat analysis using only studies with the highest-quality reference standards and populations unselected for neurocognitive symptoms gave slightly poorer accuracy estimates as follows: DOR 5.25 (1.42–19.44); sensitivity 55.6%, specificity 80.8%, LR+ 2.89, LR− 0.55 (severe HAND); sensitivity 38.0%, specificity 89.5%, LR+ 3.64, LR− 0.69 (moderate-to-severe HAND).
Blue crosses indicate sensitivity and specificity estimates from individual studies using comprehensive reference standards, labelled by first author. Red circles indicate studies using neuropsychological (NP) test batteries or brief NP tests as the reference standard, again labelled by first author. Solid diamonds indicate predicted values based on pooled sensitivity and summary diagnostic odds ratio. A, Reference standard = AIDS dementia complex, HIV-associated dementia, or severe impairment on NP battery. B, Reference standard = mild neurocognitive disorder, minor cognitive/motor disorder, or moderate impairment on NP battery. CI: confidence interval; DOR: diagnostic odds ratio.
Estimates of accuracy of the International HIV Dementia Scale
For the IHDS, sensitivity estimates for detecting severe HAND ranged from 57.1–100%, specificity 22.2–65.6%, LR+ 1.05–2.00, and LR− 0.31–0.82 (Table 4). Two sets of estimates came from the same study, one using MSK grading and one using Frascati criteria; the latter was dropped from further analysis because the researchers found limitations to using Frascati criteria in rural Kenya. There was strong evidence of heterogeneity between studies in the specificity estimates (p<0.001), and correlation between sensitivity and specificity across IHDS studies using a valid reference standard (ρ = −0.69 for nine observations, p = 0.04), but little evidence of heterogeneity of sensitivity and LR estimates (p>0.10). Pooling studies that used a valid reference standard gave sensitivity 74.3% (67.1–80.3%), specificity 47.8% (43.9–51.8%), LR+ 1.56 (1.36–1.79), LR− 0.52 (0.40–0.68).
Sensitivity estimates for detecting moderate-to-severe HAND with the IHDS ranged from 53.8–87.5%, with specificity 45.0–80.6%, LR+ 1.32–2.78, LR− 0.25–0.62, with one conspicuous outlier of low sensitivity and high specificity. There was strong evidence of heterogeneity between specificity estimates (p = 0.004), borderline evidence of heterogeneity between sensitivity estimates (p = 0.07), and no evidence of heterogeneity between LR estimates (p>0.10). There was no evidence of a correlation between sensitivity and specificity (ρ = −0.80 for four observations, p = 0.20). Pooled estimates were sensitivity 64.3% (55.6–72.1%), specificity 49.6% (43.7–55.6%), LR+ 1.73 (1.17–2.55), LR− 0.55 (0.41–0.74).
In summary, there was evidence of heterogeneity in specificity among IHDS studies, with considerable overlap between the ranges of estimates for detecting severe HAND and those for detecting moderate HAND. A summary ROC curve was fitted with a pooled DOR of 3.49 (2.12–5.73) (Figure 4). Predictions for the IHDS were again made using pooled sensitivity estimates (74.3% and 64.3%), giving specificity of 54.7% for severe HAND and 66.0% for moderate-to-severe HAND, LR+ of 1.64 and 1.89, and LR− of 0.47 and 0.54, respectively. Repeat analysis using only studies with high-quality reference standards and populations unselected for neurocognitive symptoms gave similar results: DOR 3.54 (2.07–6.05); sensitivity 73.4%, specificity 56.2%, LR+ 1.68, LR− 0.47 (severe HAND); sensitivity 61.9%, specificity 68.5%, LR+ 1.97, LR− 0.56 (moderate-to-severe HAND).
Blue crosses indicate sensitivity and specificity estimates from individual studies, labelled by first author. Crosses labelled “Sacktor Uganda” and “Sacktor US” correspond to two separate studies published in a single paper. The cross labelled “Sacktor MCN” corresponds to baseline data from a multicentre trial of minocycline for treatment of cognitive impairment. The two points labelled “Meyer” are derived from the same study; “(Frascati)” and “(MSK)” denote the reference standard in each case. Red circles indicate studies using neuropsychological (NP) test batteries or brief NP tests as the reference standard, again labelled by first author. Solid diamonds indicate predicted values based on pooled sensitivity and summary diagnostic odds ratio. A, Reference standard = AIDS dementia complex, HIV-associated dementia, or severe neurocognitive impairment. B, Reference standard = mild neurocognitive disorder, minor cognitive/motor disorder, or mild/moderate neurocognitive impairment. CI: confidence interval; DOR: diagnostic odds ratio.
Analysis of sources of heterogeneity
Analysis of study methodological features showed higher average DOR (20.5 vs. 6.85, p = 0.001) and lower average LR− (0.26 vs. 0.59, p = 0.01) in two studies comparing the HDS to a severe-impairment reference standard when the target population was highly immunodeficient. When compared to the IHDS, the HDS had a significantly higher summary DOR (p = 0.009) and LR+ (p = 0.019), but both scales had similar LR− (p = 0.98). This comparison should, however, be interpreted with caution, given the differences between target populations in studies of each scale. The single study that used both scales in the same population was of small sample size and failed to find a difference between the two.
We have systematically reviewed 15 studies of the HDS, ten of the IHDS, and one that included both scales. Most studies in the review apply to the original intended role of the HDS and IHDS (screening rather than diagnosis), in that participants were not selected on the basis of symptoms. Summary estimates for the HDS as a test for HAD or an equivalent diagnosis (severe HAND) were: sensitivity 68%, specificity 78%, LR+ 3.1, LR− 0.41, but its accuracy appeared to be lower when analysis was limited to studies with high-quality reference standards and unselected populations. When using the HDS as a test for MND or equivalent (all symptomatic HAND), estimates of accuracy were: sensitivity 42%, specificity 91%, LR+ 4.8, LR− 0.64. Summary estimates for the IHDS as a test for severe HAND were: sensitivity 74%, specificity 55%, LR+ 1.6, LR− 0.47. When using the IHDS as a test for all symptomatic HAND, estimates of accuracy were: sensitivity 64%, specificity 66%, LR+ 1.9, LR− 0.54. These summary estimates and most individual study estimates for both scales failed to achieve accepted levels of accuracy to provide strong evidence for a diagnosis of HAND, confirming their unsuitability for diagnostic purposes when used alone.
Comparing the two scales, the HDS had a higher DOR and LR+ than the IHDS, but the only direct comparison of both scales within the same sample failed to find a difference between the two, and was limited by its small sample size. Furthermore, the two scales were studied in different settings, with most of the HDS studies conducted in North America, and most of the IHDS studies conducted in Africa. Unfortunately, while the IHDS was developed with resource-limited settings in mind, it is not free from culturo-linguistic effects. The four-word recall task (in both tests) must be modified for different languages, and it was shown in an Indian population that education was associated with IHDS score, but HIV status was not. The two scales were also studied in different years, and considerable changes in our understanding of HIV pathogenesis and treatment occurred in the decade between the publication of the HDS in 1995 and the IHDS in 2005.
Estimates of screening accuracy showed wide variation between studies, particularly for the HDS. We did not find strong evidence of a diagnostic threshold effect. However, tests of correlation used to demonstrate this effect are known to have low statistical power, and the reference diagnosis of HAD is complex and subject to variations of interpretation. It is therefore plausible that differences between reference standards contributed to the varying accuracy of these well-standardised diagnostic tools.
Regarding other sources of variability, an increased DOR and lower LR− were seen in two studies assessing the HDS in patients with more advanced immunodeficiency. Spectrum bias is a form of selection bias that may occur when the study population is sampled from a limited or specialised clinical setting and therefore has a narrow spectrum of disease. This form of bias could have increased sensitivity in samples of more severely-impaired patients, such as those recruited in Africa, in the pre-HAART era, or in hospital wards. Spectrum bias could also reduce specificity in those in whom it was difficult to exclude competing diagnoses, such as in resource-limited settings, or conversely increase specificity in samples with fewer competing diagnoses. Non-random, non-consecutive sampling strategies are known to lead to over-estimation of accuracy.
There were a number of methodological limitations to this review. First, the literature search and data extraction were carried out by a single author (LJH). Second, the literature search could have missed studies not cited in the target data sources, or articles in which it was not clear from the abstract that neurocognitive testing was done. To minimise this, researchers in the field were asked about the existence of unpublished datasets. Third, it was not always possible to generate two-by-two tables from available data, usually because HDS and IHDS scores were reported as continuous variables. In a few studies, the estimated values were not consistent with other information in the same article, suggesting other unknown errors in the results. This was despite requests for reconfigured data directly from researchers.
More importantly, the review is limited by the lack of a clinical gold standard for neurocognitive impairment in HIV, whether this be neurological criteria, neuroimaging findings, biomarkers in cerebrospinal fluid, or histopathology. The Frascati criteria are relatively detailed, objective, and appropriate for a research definition, so the analysis in this review provides the best available estimates of the accuracy of the HDS and IHDS when used as screening tools for MND or HAD. However, current data do not clearly inform clinicians of the natural history or appropriate treatment of these conditions, particularly milder impairment, and this limits our ability to predict the effects of screening.
British HIV Association (BHIVA) guidelines do not comment on screening for HAND, whereas the European AIDS Clinical Society (EACS) guidelines recommend a brief symptom questionnaire in all patients at regular intervals, and a recent review made similar recommendations but did not support one screening test over another. The general rule that one should minimise false positives if the confirmatory test is expensive or invasive favours the HDS over the IHDS, and the penalty for missing an asymptomatic case of HAND is arguably not high, so the lower-sensitivity test is acceptable. The prevalence of HAD was 2–4% of HIV positive individuals in recent surveys in the US and Switzerland, lower than the prevalence in most studies included in this review. At this low prior probability, one might confidently exclude the diagnosis with a negative HDS, but the posterior probability would be less than 15% after a positive HDS. In comparison, when used as a test for MND, a positive HDS result would give a posterior probability of approximately 55% in the presence of a prior probability of 20%.
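The conversion from prior to posterior probability used in this argument can be sketched as follows. The function name is hypothetical; the example uses the upper end of the quoted HAD prevalence (4%) together with the predicted HDS likelihood ratios for severe HAND reported above (LR+ 3.08, LR− 0.41).

```python
def posterior_probability(prior, likelihood_ratio):
    """Convert a prior probability to a posterior probability via odds."""
    prior_odds = prior / (1 - prior)
    posterior_odds = prior_odds * likelihood_ratio
    return posterior_odds / (1 + posterior_odds)


after_positive = posterior_probability(0.04, 3.08)   # positive HDS
after_negative = posterior_probability(0.04, 0.41)   # negative HDS
print(f"after a positive HDS: {after_positive:.1%}")
print(f"after a negative HDS: {after_negative:.1%}")
```

Under these inputs a positive result raises the probability of HAD to roughly 11%, and a negative result lowers it to under 2%.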
A screening test is an intervention that should be subject to interventional research like any other, and for it to be routinely used in clinical practice, the evidence base should address the next steps in the clinical pathway. For example, we need to evaluate how to investigate patients further, how to predict their outcome, and how to modify medical therapy in the light of a positive or negative screening test. On the tests themselves, studies are needed to determine their repeatability, intra-subject variation, and learning effects, and to understand the causes of false positive and false negative results (not explored in the studies reviewed). Further studies of the HDS and IHDS should adhere to STARD guidelines. Specific settings of interest are the use of the HDS in an African or other resource-limited setting, or the IHDS in a North American or European setting with high ART coverage and relatively preserved immune function. There may be a role for studying the scales specifically in older adults, given the growing proportion of HIV+ individuals over the age of 50 and their greater risk of HAND, although their ability to distinguish between HAND and non-HIV causes of NCI has not been assessed. One could also model theoretical screening programmes for neurocognitive impairment within HIV positive populations of known prevalence.
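As a rough illustration of what modelling a theoretical screening programme might involve, the sketch below computes expected outcomes when a hypothetical cohort is screened for HAD with each scale. The cohort size of 1000 and the 3% prevalence are assumptions chosen for illustration (the midpoint of the 2–4% range cited earlier), the sensitivity and specificity values are the pooled estimates from the abstract, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScreeningOutcome:
    """Expected counts when a cohort is screened once."""
    true_positives: float
    false_positives: float
    false_negatives: float
    true_negatives: float

    @property
    def referrals(self):
        """Everyone who screens positive and needs confirmatory testing."""
        return self.true_positives + self.false_positives

    @property
    def ppv(self):
        """Positive predictive value (0.0 if nobody screens positive)."""
        return self.true_positives / self.referrals if self.referrals else 0.0


def model_screening(cohort_size, prevalence, sensitivity, specificity):
    """Expected screening outcomes for a cohort of known prevalence."""
    cases = cohort_size * prevalence
    non_cases = cohort_size - cases
    true_positives = cases * sensitivity
    false_positives = non_cases * (1.0 - specificity)
    return ScreeningOutcome(
        true_positives=true_positives,
        false_positives=false_positives,
        false_negatives=cases - true_positives,
        true_negatives=non_cases - false_positives,
    )


# Assumed scenario: 1000 people, 3% HAD prevalence (illustrative values).
scales = {"HDS": (0.681, 0.779), "IHDS": (0.743, 0.547)}
for name, (sens, spec) in scales.items():
    outcome = model_screening(1000, 0.03, sens, spec)
    print(f"{name}: {outcome.referrals:.0f} referrals, "
          f"{outcome.false_negatives:.1f} missed cases, PPV {outcome.ppv:.1%}")
```

In this scenario the IHDS detects about two more cases per 1000 screened but generates roughly twice as many false-positive referrals, which is the trade-off between the two scales described in this section. A fuller model would also need costs and outcomes of confirmatory testing, which the current literature does not supply.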
In conclusion, in current clinical practice, interpretation of the results of assessment with the HDS or IHDS requires an appreciation of their limited accuracy, the lack of generalisability of existing research, and the heterogeneity of estimates. The HDS appears to be more accurate overall and its higher specificity probably makes it the preferred test for detecting asymptomatic HAND, although the IHDS may be preferred in situations where sensitivity is most important, at the expense of specificity. Having reviewed the evidence, we advise against their further use as diagnostic tests for HAND in symptomatic patients, even in resource-limited settings, and believe that studies reporting their use should acknowledge their limited validity.
Flowchart in PRISMA format.
Checklist of PRISMA reporting standards.
The authors wish to thank the following researchers who have made contributions to this project by providing details of methodology, releasing unpublished data, re-analysing published data, or entering into correspondence on their work: Andreas Antinori (National Institute for Infectious Diseases, Rome); Joseph Berger (University of Kentucky College of Medicine); Gretchen Birbeck (Michigan State University); Youngjee Choi (Washington University School of Medicine, St Louis); Paola Cinque (Vita-Salute University, Milan); David Clifford (Washington University School of Medicine, St Louis); Lucette Cysique (University of New South Wales); Lijuan Deng (Harvard School of Public Health); Renaud Du Pasquier (Centre Hospitalier Universitaire Vaudois, Switzerland); Keith Ganasen (University of Stellenbosch, South Africa); Assawin Gongvatana (Alpert Medical School of Brown University); David Hardy (Loyola Marymount University, Los Angeles); Glenn Jones (Louisiana State University Health Sciences Center); John Joska (University of Cape Town); Michelle Kvalsund (Michigan State University); Anthony Lee (Harvard School of Public Health); Maureen Lyon (Children's National Medical Center, Washington DC); Justin McArthur (Johns Hopkins Medicine); AnaClaire Meyer (University of California, San Francisco); Kyle Minor (Louisiana State University Health Sciences Center); Sachiko Miyahara (Harvard School of Public Health); Erin Morgan (University of California, San Diego); Christopher Power (University of Alberta); Simon Rackstraw (Mildmay Hospital, London); Ned Sacktor (Johns Hopkins Medicine); Cecilia Shikuma (University of Hawaii); Samanta Simioni (Centre Hospitalier Universitaire Vaudois, Switzerland); Dinesh Singh (Medical Research Council of South Africa); Clifford Smith (Rush University Medical Center in Chicago); Drenna Waldrop-Valverde (University of Miami); Alan Winston (Imperial College London); Valerie Wojna (University of Puerto Rico); Steven Woods (University of California, San Diego).
We also thank Professor Robert Miller for comments and advice on the final version for publication.
Performed literature search and collected data: LH. Conceived and designed the experiments: LH SF. Analyzed the data: LH SF AC. Wrote the paper: LH SF AC RG.
- 1. Antinori A, Arendt G, Becker JT, Brew BJ, Byrd DA, et al. (2007) Updated research nosology for HIV-associated neurocognitive disorders. Neurology 69: 1789–1799.
- 2. Simioni S, Cavassini M, Annoni JM, Rimbault Abraham A, Bourquin I, et al. (2010) Cognitive dysfunction in HIV patients despite long-standing suppression of viremia. AIDS 24: 1243–1250.
- 3. Heaton RK, Clifford DB, Franklin DRJ, Woods SP, Ake C, et al. (2010) HIV-associated neurocognitive disorders persist in the era of potent antiretroviral therapy: CHARTER Study. Neurology 75: 2087–2096.
- 4. Reiss P, Battegay M, Clumeck N, Mulcahy F, Arribas J, et al. (2011) European AIDS Clinical Society Guidelines. Paris, France.
- 5. Williams IG, Churchill D, Anderson J, Boffito M, Bower M, et al. (2012) British HIV Association guidelines for the treatment of HIV-1-positive adults with antiretroviral therapy. HIV Medicine 13: 1–85.
- 6. Gisslen M, Price RW, Nilsson S (2011) The definition of HIV-associated neurocognitive disorders: are we overestimating the real prevalence? BMC Infectious Diseases 11: 356.
- 7. American Academy of Neurology AIDS Task Force (1991) Nomenclature and research case definitions for neurologic manifestations of human immunodeficiency virus-type 1 (HIV-1) infection. Report of a Working Group. Neurology 41: 778–785.
- 8. Price RW, Brew BJ (1988) The AIDS dementia complex. Journal of Infectious Diseases 158: 1079–1083.
- 9. Power C, Selnes OA, Grim JA, McArthur JC (1995) HIV Dementia Scale: a rapid screening test. Journal of Acquired Immune Deficiency Syndromes and Human Retrovirology 8: 273–278.
- 10. Sacktor NC, Wong M, Nakasujja N, Skolasky RL, Selnes OA, et al. (2005) The International HIV Dementia Scale: a new rapid screening test for HIV dementia. AIDS 19: 1367–1374.
- 11. Davis HF, Skolasky RL, Selnes OA, Burgess DM, McArthur JC (2002) Assessing HIV-associated dementia: modified HIV dementia scale versus the Grooved Pegboard. AIDS Reader 12: 29–31.
- 12. Kvalsund MP, Haworth A, Murman DL, Velie E, Birbeck GL (2009) Closing gaps in antiretroviral therapy access: human immunodeficiency virus-associated dementia screening instruments for non-physician healthcare workers. American Journal of Tropical Medicine and Hygiene 80: 1054–1059.
- 13. Skinner S, Adewale AJ, DeBlock L, Gill MJ, Power C (2009) Neurocognitive screening tools in HIV/AIDS: comparative performance among patients exposed to antiretroviral therapy. HIV Medicine 10: 246–252.
- 14. Lyon ME, McCarter R, D'Angelo LJ (2009) Detecting HIV associated neurocognitive disorders in adolescents: what is the best screening tool? Journal of Adolescent Health 44: 133–135.
- 15. Jones BN, Teng EL, Folstein MF, Harrison KS (1993) A new bedside test of cognition for patients with HIV infection. Annals of Internal Medicine 119: 1001–1004.
- 16. Minor KS, Jones GN, Stewart DW, Hill BD, Kulesza M (2010) Comparing two measures of psychomotor performance in patients with HIV: the Coin Rotation Test and the Modified HIV Dementia Screen. Journal of Acquired Immune Deficiency Syndromes 55: 225–227.
- 17. Chan LG, Kandiah N, Chua A (2012) HIV-associated neurocognitive disorders (HAND) in a South Asian population-contextual application of the 2007 criteria. BMJ Open 2: e000662.
- 18. Garvey LJ, Yerrakalva D, Winston A (2009) Correlations between computerized battery testing and a memory questionnaire for identification of neurocognitive impairment in HIV type 1-infected subjects on stable antiretroviral therapy. AIDS Research & Human Retroviruses 25: 765–769.
- 19. Valcour VG, Paul R, Chiao S, Wendelken LA, Miller B (2011) Screening for cognitive impairment in human immunodeficiency virus. Clinical Infectious Diseases 53: 836–842.
- 20. Maruff P, Thomas E, Cysique LA, Brew BJ, Collie A, et al. (2009) Validity of the CogState brief battery: relationship to standardized tests and sensitivity to cognitive impairment in mild traumatic brain injury, schizophrenia, and AIDS dementia complex. Archives of Clinical Neuropsychology 24: 165–178.
- 21. Gibbie T, Mijch A, Ellen S, Hoy J, Hutchison C, et al. (2006) Depression and neurocognitive performance in individuals with HIV/AIDS: 2-year follow-up. HIV Medicine 7: 112–121.
- 22. Gonzalez R, Heaton RK, Moore DJ, Letendre S, Ellis RJ, et al. (2003) Computerized reaction time battery versus a traditional neuropsychological battery: detecting HIV-related impairments. Journal of the International Neuropsychological Society 9: 64–71.
- 23. Morgan EE, Woods SP, Scott JC, Childers M, Beck JM, et al. (2008) Predictive validity of demographically adjusted normative standards for the HIV Dementia Scale. Journal of Clinical and Experimental Neuropsychology 30: 83–90.
- 24. Wojna V, Skolasky RL, McArthur JC, Maldonado E, Hechavarria R, et al. (2007) Spanish validation of the HIV dementia scale in women. AIDS Patient Care STDS 21: 930–941.
- 25. Joska JA, Westgarth-Taylor J, Hoare J, Thomas KGF, Paul R, et al. (2011) Validity of the International HIV Dementia Scale in South Africa. AIDS Patient Care STDS 25: 95–101.
- 26. Valcour VG, Shiramizu BT, Sithinamsuwan P, Nidhinandana S, Ratto-Kim S, et al. (2009) HIV DNA and cognition in a Thai longitudinal HAART initiation cohort: the SEARCH 001 Cohort Study. Neurology 21: 992–998.
- 27. Waldrop-Valverde D, Nehra R, Sharma S, Malik A, Jones D, et al. (2010) Education effects on the International HIV Dementia Scale. Journal of Neurovirology 16: 264–267.
- 28. Garvey LJ, Surendrakumar V, Winston A (2011) Low rates of neurocognitive impairment are observed in neuro-asymptomatic HIV-infected subjects on effective antiretroviral therapy. HIV Clinical Trials 12: 333–338.
- 29. Waters L, Patterson B, Scourfield A, Hughes A, de Silva S, et al. (2012) A dedicated clinic for HIV-positive individuals over 50 years of age: a multidisciplinary experience. International Journal of STD and AIDS 23: 546–552.
- 30. Roundtable Discussion (2011) Presentation of guidelines for treatment of HIV infection of the CNS in different countries. Fourth International Meeting on HIV Infection and the Central Nervous System. Monte Porzio Catone, Rome, Italy.
- 31. Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, et al. (2003) Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative. British Medical Journal 326: 41–44.
- 32. Cherner M, Cysique LA, Heaton RK, Marcotte TD, Ellis RJ, et al. (2007) Neuropathologic confirmation of definitional criteria for human immunodeficiency virus-associated neurocognitive disorders. Journal of Neurovirology 13: 23–28.
- 33. Gandhi NS, Moxley RT, Creighton J, Vornbrock Roosa H, Skolasky RL, et al. (2010) Comparison of scales to evaluate the progression of HIV-associated neurocognitive disorder. HIV Therapy 4: 371–379.
- 34. Jaeschke R, Guyatt GH, Sackett DL (1994) Users' guides to the medical literature. VI. How to use an article about a diagnostic test. B: What are the results and will they help me in caring for my patients? Journal of the American Medical Association 271: 703–707.
- 35. Deeks JJ (2001) Systematic reviews of evaluations of diagnostic and screening tests. In: Egger M, Davey Smith G, Altman D, editors. Systematic Reviews in Health Care: Meta-analysis in Context. London, UK: BMJ Publishing Group. pp. 248–284.
- 36. Midgette AS, Stukel TA, Littenberg B (1993) A meta-analytical method for summarizing diagnostic test performance: receiver-operating characteristic-curve point estimates. Medical Decision Making 13: 253–257.
- 37. Moses LE, Shapiro D, Littenberg B (1993) Combining independent studies of a diagnostic test into a summary ROC curve: data-analytic approaches and some additional considerations. Statistics in Medicine 12: 1293–1316.
- 38. Andres MA, Feger U, Nath A, Munsaka S, Jiang CS, et al. (2011) APOE ε4 allele and CSF APOE on Cognition in HIV-Infected Subjects. Journal of Neuroimmune Pharmacology 6: 389–398.
- 39. Arendt G, Koutsilieri E, Sopper S, Husstedt IW, Maschke M, et al. (2009) The influence of cytokines on neuropsychological performance in HIV patients. 12th European AIDS Conference. Cologne, Germany.
- 40. Chang L, Wong V, Nakama H, Watters M, Ramones D, et al. (2008) Greater than age-related changes in brain diffusion of HIV patients after 1 year. Journal of Neuroimmune Pharmacology 3: 265–274.
- 41. Chang L, Ernst T, Leonido-Yee M, Speck O (2000) Perfusion MRI detects rCBF abnormalities in early stages of HIV-cognitive motor complex. Neurology 54: 389–396.
- 42. Dougherty RH, Skolasky RL, McArthur JC (2002) Progression of HIV-associated dementia treated with HAART. AIDS Reader 12: 69–74.
- 43. Ernst T, Chang L, Arnold S (2003) Increased glial metabolites predict increased working memory network activation in HIV brain injury. Neuroimage 19: 1686–1693.
- 44. Kim DH, Jewison DL, Milner GR, Rourke SB, Gill MJ, et al. (2001) Neurocognitive symptoms and impairment in an HIV community clinic. Canadian Journal of Neurological Science 28: 228–231.
- 45. Guillemi S, Gil D, Hull M, Harris M, Hsiung R, et al. (2011) Results of CSF viral load in HIV-positive patients on stable antiretroviral therapy (ART) attending a neurocognitive disorder clinic (NDC). 6th IAS Conference on HIV Pathogenesis, Treatment and Prevention. Rome, Italy.
- 46. Pessin H, Rosenfeld B, Burton L, Breitbart W (2003) The role of cognitive impairment in desire for hastened death: a study of patients with advanced AIDS. General Hospital Psychiatry 25: 194–199.
- 47. von Giesen HJ, Köller H, de Nocker D, Haslinger BA, Arendt G (2003) Long-term safety and efficacy of NNRTI within the central nervous system. HIV Clinical Trials 4: 382–390.
- 48. Wang GJ, Chang L, Volkow ND, Telang F, Logan J, et al. (2004) Decreased brain dopaminergic transporters in HIV-associated dementia patients. Brain 127: 2452–2458.
- 49. Ashby J, Foster CJ, Garvey LJ, Wan T, Allsop JM, et al. (2011) Cerebral function in perinatally HIV-infected young adults and their HIV-uninfected sibling controls. 17th Annual Conference of the British HIV Association. Bournemouth, UK.
- 50. Birbeck GL, Kvalsund MP, Byers PA, Bradbury R, Mang'ombe C, et al. (2011) Neuropsychiatric and socioeconomic status impact antiretroviral adherence and mortality in rural Zambia. American Journal of Tropical Medicine and Hygiene 85: 782–789.
- 51. Choi Y, Townend J, Vincent T, Zaidi I, Sarge-Njie R, et al. (2011) Neurologic manifestations of human immunodeficiency virus-2: dementia, myelopathy, and neuropathy in West Africa. Journal of Neurovirology 17: 166–175.
- 52. Clifford DB, Mitike MT, Mekonnen Y, Zhang J, Zenebe G, et al. (2007) Neurological evaluation of untreated human immunodeficiency virus infected adults in Ethiopia. Journal of Neurovirology 13: 67–72.
- 53. Gasnault J, Dulioust A, Sellier P, Dolphin P, Paquet C (2010) COGNIVIH, une échelle brève de dépistage des troubles neurocognitifs chez les personnes vivant avec le VIH: données préliminaires. 3e Forum de Recherches Fondamentales et Cliniques sur le VIH. Paris, France.
- 54. Holguin A, Banda M, Willen EJ, Malama C, Chiyenu KO, et al. (2011) HIV-1 effects on neuropsychological performance in a resource-limited country, Zambia. AIDS and Behavior 15: 1895–1901.
- 55. Lawler K, Mosepele M, Ratcliffe S, Seliolwe E, Steele K, et al. (2010) Neurocognitive impairment among HIV-positive individuals in Botswana: a pilot study. Journal of the International AIDS Society 13: 15.
- 56. Pumpradit W, Ananworanich J, Lolak S, Shikuma C, Paul R, et al. (2010) Neurocognitive impairment and psychiatric comorbidity in well-controlled human immunodeficiency virus-infected Thais from the 2NN Cohort Study. Journal of Neurovirology 16: 76–82.
- 57. Royal W, Cherner M, Carr J, Habib A, Akomolafe A, et al. (2012) Clinical features and preliminary studies of virological correlates of neurocognitive impairment among HIV-infected individuals in Nigeria. Journal of Neurovirology 18: 191–199.
- 58. Sacktor N, Nakasujja N, Skolasky R, Robertson K, Wong M, et al. (2006) Antiretroviral therapy improves cognitive impairment in HIV+ individuals in sub-Saharan Africa. Neurology 67: 311–314.
- 59. Zhang Y, Qiao L, Ding W, Wei F, Zhao Q, et al. (2012) An initial screening for HIV-associated neurocognitive disorders of HIV-1 infected patients in China. Journal of Neurovirology 18: 120–126.
- 60. Oshinaike OO, Akinbami AA, Ojo OO, Ojini IF, Okubadejo UN, et al. (2012) Comparison of the Minimental State Examination Scale and the International HIV Dementia Scale in Assessing Cognitive Function in Nigerian HIV Patients on Antiretroviral Therapy. AIDS Research and Treatment 2012: 581531.
- 61. Nakasujja N, Miyahara S, Evans S, Lee A, Musisi S, et al. (2013) Randomized trial of minocycline in the treatment of HIV-associated cognitive impairment. Neurology 80: 196–202.
- 62. Perez-Valero I, Heaton RK, Letendre SL, McCutchan JA, Clifford D, et al. (2012) Validation of the EACS guidelines 2011 algorithm for detecting HAND in the CHARTER cohort. 19th Conference on Retroviruses and Opportunistic Infections. Seattle, USA. Abstract 508.
- 63. Avison MJ, Nath A, Greene-Avison R, Schmitt FA, Bales RA, et al. (2004) Inflammatory changes and breakdown of microvascular integrity in early human immunodeficiency virus dementia. Journal of Neurovirology 10: 223–232.
- 64. Berghuis JP, Uldall KK, Lalonde B (1999) Validity of two scales in identifying HIV-associated dementia. Journal of Acquired Immune Deficiency Syndromes 21: 134–140.
- 65. Bottiggi KA, Chang JJ, Schmitt FA, Avison MJ, Mootoor Y, et al. (2007) The HIV Dementia Scale: Predictive power in mild dementia and HAART. Journal of the Neurological Sciences 260: 11–15.
- 66. Carey CL, Woods SP, Rippeth JD, Gonzalez R, Moore DJ, et al. (2004) Initial validation of a screening battery for the detection of HIV-associated cognitive impairment. Clinical Neuropsychology 18: 234–248.
- 67. Cloak CC, Chang L, Ernst T (2004) Increased frontal white matter diffusion is associated with glial metabolites and psychomotor slowing in HIV. Journal of Neuroimmunology 157: 147–152.
- 68. Ganasen KA, Fincham D, Smit J, Seedat S, Stein D (2008) Utility of the HIV Dementia Scale (HDS) in identifying HIV dementia in a South African sample. Journal of the Neurological Sciences 269: 62–64.
- 69. Gongvatana A, Woods SP, Taylor MJ, Vigil O, Grant I (2007) Semantic clustering inefficiency in HIV-associated dementia. Journal of Neuropsychiatry and Clinical Neuroscience 19: 36–42.
- 70. Hardy DJ, Hinkin CH, Levine AJ, Castellon SA, Lam MN (2006) Risky decision making assessed with the gambling task in adults with HIV. Neuropsychology 20: 355–360.
- 71. Richardson MA, Morgan EE, Vielhauer MJ, Cuevas CA, Buondonno LM, et al. (2005) Utility of the HIV dementia scale in assessing risk for significant HIV-related cognitive-motor deficits in a high-risk urban adult sample. AIDS Care 17: 1013–1021.
- 72. Smith CA, van Gorp WG, Ryan ER, Ferrando SJ, Rabkin J (2003) Screening subtle HIV-related cognitive dysfunction: the clinical utility of the HIV Dementia Scale. Journal of Acquired Immune Deficiency Syndromes 33: 116–118.
- 73. Sakamoto M, Marcotte TD, Umlauf A, Franklin DRJ, Heaton RK, et al. (2013) Concurrent classification accuracy of the HIV Dementia Scale for HIV-Associated Neurocognitive Disorders in the CHARTER cohort. Journal of Acquired Immune Deficiency Syndromes 62: 36–42.
- 74. Kwasa J, Cettomai D, Lwanya E, Osiemo D, Oyaro P, et al. (2012) Lessons learned developing a diagnostic tool for HIV-associated dementia feasible to implement in resource-limited settings: pilot testing in Kenya. PLoS One 7: e32898.
- 75. Meyer AC, Cettomai D, Kwasa J, Oyaro P, Osiemo D, et al. (2011) Diagnostic tools and culturally-specific norms for the diagnosis of HIV-associated cognitive impairment in western Kenya. 6th IAS Conference on HIV Pathogenesis, Treatment and Prevention. Rome, Italy.
- 76. Nakasujja N, Skolasky RL, Musisi S, Allebeck P, Robertson KR, et al. (2010) Depression symptoms and cognitive function among individuals with advanced HIV infection initiating HAART in Uganda. BMC Psychiatry 10: 44.
- 77. Sacktor N, Miyahara S, Deng L, Evans S, Schifitto G, et al. (2011) Minocycline treatment for HIV-associated cognitive impairment: results from a randomized trial. Neurology 77: 1135–1142.
- 78. Singh D, Goodkin K (2011) Diagnostic utility of the International HIV Dementia Scale for asymptomatic HIV-associated neurocognitive impairment and HIV-Associated Neurocognitive Disorder in South Africa. [unpublished article]
- 79. Singh D, Sunpath H, John S, Eastham L, Gouden R (2008) The utility of a rapid screening tool for depression and HIV dementia amongst patients with low CD4 counts-a preliminary report. African Journal of Psychiatry 11: 282–286.
- 80. Muniyandi K, Venkatesan J, Arutselvi T, Jayaseelan V (2012) Study to assess the prevalence, nature and extent of cognitive impairment in people living with AIDS. Indian Journal of Psychiatry 54: 149–153.
- 81. Antinori A, Balestra P, Lorenzini P, Libertone R, Cataldo G, et al. (2012) Comparison of screening tools for the detection of neurocognitive impairment in HAART-treated patients. Journal of the International AIDS Society 15: 18286.
- 82. Lijmer JG, Mol BW, Heisterkamp S, Bonsel GJ, Prins MH, et al. (1999) Empirical evidence of design-related bias in studies of diagnostic tests. Journal of the American Medical Association 282: 1061–1066.
- 83. Letendre SL, Grant I (2012) Assessment, diagnosis and treatment of Human Immunodeficiency Virus (HIV)-Associated Neurocognitive Disorders (HAND): a consensus report of the Mind Exchange program. Clinical Infectious Diseases Nov 21: [Epub ahead of print].
- 84. High KP, Brennan-Ing M, Clifford DB, Cohen MH, Currier J, et al. (2012) HIV and aging: state of knowledge and areas of critical need for research. A report to the NIH Office of AIDS Research by the HIV and Aging Working Group. Journal of Acquired Immune Deficiency Syndromes 60: 1–18.
- 85. Valcour V, Shikuma C, Shiramizu B, Watters M, Poff P, et al. (2004) Higher frequency of dementia in older HIV-1 individuals: the Hawaii Aging with HIV-1 Cohort. Neurology 63: 822–827.
- 86. Woods SP, Rippeth JD, Frol AB, Levy JK, Ryan E, et al. (2004) Interrater reliability of clinical ratings and neurocognitive diagnoses in HIV. Journal of Clinical and Experimental Neuropsychology 26: 759–778.