Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Manual ability in hand surgery patients: Validation of the ABILHAND scale in four diagnostic groups

  • Ghady El Khoury ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Methodology, Writing – original draft

    ghady.elkhoury@uclouvain.be

    Affiliations Service d’Orthopédie et Traumatologie, Cliniques Universitaires Saint-Luc, Brussels, Belgium, Institute of Neuroscience, Université Catholique de Louvain, Brussels, Belgium

  • Olivier Barbier,

    Roles Conceptualization, Resources, Writing – review & editing

    Affiliation Service d’Orthopédie et Traumatologie, Cliniques Universitaires Saint-Luc, Brussels, Belgium

  • Xavier Libouton,

    Roles Conceptualization, Resources, Writing – review & editing

    Affiliation Service d’Orthopédie et Traumatologie, Cliniques Universitaires Saint-Luc, Brussels, Belgium

  • Jean-Louis Thonnard,

    Roles Conceptualization, Methodology, Writing – review & editing

    Affiliation Institute of Neuroscience, Université Catholique de Louvain, Brussels, Belgium

  • Philippe Lefèvre,

    Roles Funding acquisition, Supervision, Writing – review & editing

    Affiliations Institute of Neuroscience, Université Catholique de Louvain, Brussels, Belgium, Institute of Information and Communication Technologies, Electronics and Applied Mathematics, Université Catholique de Louvain, Louvain-La-Neuve, Belgium

  • Massimo Penta

    Roles Conceptualization, Formal analysis, Validation, Writing – review & editing

    Affiliations Institute of Neuroscience, Université Catholique de Louvain, Brussels, Belgium, Arsalis SRL, Glabais, Belgium

Abstract

Background

Patients treated in hand surgery (HS) belong to different demographic groups and have varying impairments related to different pathologies. HS outcomes are measured to assess treatment results, complication risks and intervention reliability. A one-dimensional and linear measure would allow for unbiased comparisons of manual ability between patients and different treatment effects.

Objective

To adapt the ABILHAND questionnaire through Rasch analysis for specific use in HS patients and to examine its validity.

Methods

A preliminary 90-item questionnaire was presented to 216 patients representing the diagnoses most frequently encountered in HS, including distal radius fracture (n = 74), basal thumb arthritis (n = 66), carpal tunnel syndrome (n = 53), and heavy wrist surgery (n = 23). Patients were assessed during the early recovery and in the late follow-up period (0–3 months, 3–6 months and >6 months), leading to a total of 305 assessments. They rated their perceived difficulty with queried activities as impossible, difficult, or easy. Responses were analyzed using the RUMM2030 software. Items were refined based on item-patient targeting, fit statistics, differential item functioning, local independence and item redundancy. Patients also completed the QuickDASH, 12-item Short Form Survey (SF-12) and a numerical pain scale.

Results

The rating scale Rasch model was used to select 23 mostly bimanual items on a 3-level scale, which constitute a unidimensional, linear measure of manual ability with good reliability across all included diagnostic groups (Person-Separation Index = 0.90). The resulting scale was found to be invariant across demographic and clinical subgroups and over time. ABILHAND-HS patient measures correlated significantly (p<0.001) with the QuickDASH (r = -0.77), SF-12 Physical Component Summary (r = 0.56), SF-12 Mental Component Summary (r = 0.31), and pain scale (r = -0.49).

Conclusion

ABILHAND-HS is a robust person-centered measure of manual ability in HS patients.

Introduction

In hand surgery (HS), as in other medical specialties, outcome evaluations are needed to assess the effectiveness and reliability of the intervention, as well as to reinforce patient education regarding the risks and outcomes of the procedure and, potentially, to justify therapeutic practices to payers [1]. Physician-documented reports of HS outcomes based on clinical examination and imaging should be complemented with patient reported outcomes assessed by questionnaires designed to capture patients’ perspectives with respect to the impact of their conditions and interventions on their daily lives [24].

Current views of health and disability have been shaped by the World Health Organization's International Classification of Functioning, Disability, and Health [5], which parses disease consequences into three domains: impairment of anatomical structures (e.g. bones, muscles, ligaments) or body functions (e.g. motor skills, sensitivity), activity limitations (e.g. manual activities), and participation restrictions (e.g. in hobbies and work). The impact of a pathology or a surgical intervention in these three domains is also conditioned by personal factors (motivation, capacity to develop compensatory strategies) and environmental factors (social or professional context). Although impairment measurements such as imaging can provide clues regarding functional prognosis, it does not provide good information about performance in everyday life, especially of the hands, which are important for a great variety of activities [68]. For example, demonstration of a bone fracture union is insufficient to determine whether a patient is capable of resuming their usual activities or returning to work [912].

The patient-reported questionnaires that have been most commonly used in HS [13] are the Disability of the Arm, Shoulder and Hand questionnaire (DASH) [14], the Patient Rated Wrist Evaluation (PRWE) [15], and the Carpal Tunnel Questionnaire (CTQ) [16]. Each of these questionnaires has been reported to have good psychometric properties, but each has a particular focus on its own area(s) of disablement. The DASH assesses body functions, activities, and participation [17] and can be divided into 3 subscales based on dimensionality [18]. Meanwhile, the PRWE is specific to the wrist joint and the CTQ is specific to carpal tunnel syndrome (CTS). The Michigan Hand Outcomes Questionnaire (MHQ) [19] is a multidimensional hand-specific outcomes instrument consisting of six subscales, measuring overall hand function, activities of daily living, pain, work performance, aesthetics, and satisfaction. It measures impairment by hand (left and right separately), rather than overall disability. Interpretation of total scores on multi-dimensional instruments can be less than straightforward given that patients can show simultaneous improvement in one domain with deterioration in another [20, 21]. Assessment of functional recovery on a unidimensional [22] and linear [21] scale would allow for quantitative comparisons of ability among different patients and treatments. Such a scale can be developed with state-of-the-art psychometric methods, such as the Rasch model [23, 24].

The ABILHAND questionnaire is a Rasch-model built measure of manual ability [25] that provides an invariant linear scale and allows for quantitative comparisons of manual ability between patients and over time. The scale has been validated in populations with rheumatoid arthritis [26], chronic stroke [8], pediatric cerebral palsy [27], systemic sclerosis [28] and neuromuscular diseases [29]. These previous validations have shown that the difficulty of most manual activities was diagnosis-dependent [30]. Therefore, the objective of this work was to adapt the ABILHAND scale to the most frequent diagnoses treated in HS.

Methods

Questionnaire adaptation to HS patients

The ABILHAND is a measure of manual ability that assesses one’s ability to manage daily activities requiring upper limb use, regardless of strategy [25]. The necessary permissions were obtained from the developer of the original questionnaire to modify it. To develop a HS-adapted ABILHAND, a preliminary item list was compiled from previous versions of the ABILHAND questionnaire, the DASH, PRWE, CTQ, and MHQ items together with some new items. This pool of items was submitted to nine HS experts (hand surgeons, physical medicine and rehabilitation physicians, physical therapists, and occupational therapists), who were asked to assess each item’s relevance to hand surgery patients on a yes/no basis and propose additional items that might be affected by the relevant pathologies (e.g. sensation for CTS and wrist loading for distal radius fractures (DRF)). A final list of 90 items constituted the experimental ABILHAND-HS questionnaire (S1 Appendix).

Patients

A convenience sample of 216 patients was recruited from February 2018 to February 2019 at the HS consultation center at Cliniques Universitaires Saint-Luc, Belgium representing the following four diagnostic categories: CTS, DRF, basal thumb arthritis (BTA), and heavy wrist surgery (HWS, including 1st row carpectomy and partial or total wrist arthrodesis). The inclusion criteria for patients were being >18 years old and being able to read and understand French. The exclusion criteria included comorbidities that may impede manual ability substantially (i.e. tremor, paralysis and active rheumatologic disease) and any mental or cognitive dysfunction (i.e. dementia and mental retardation). The patient characteristics are summarized in Table 1. Patients provided written informed consent to participate. This study was approved by the ethical committee of Cliniques Universitaires Saint-Luc-Université catholique de Louvain (N° B403201523492).

Procedures

The French-language experimental ABILHAND-HS items were presented in five random orders to avoid a systematic item sequence bias. Patients were asked to indicate their perceived difficulty associated with completing the activities without technical or human assistance, independent of the hand used to perform the activity on a three-level scale: impossible (0), difficult (1), or easy (2) [8]. Activities not attempted during the last week were treated as missing responses. Patients also completed the QuickDASH [31], 12-item Short Form Survey (SF-12) questionnaire [32] and a 10-level numerical pain scale, for external validation purposes.

Patients were first assessed as soon as they presented to their hand surgery consultation appointments and had experienced manual activities in their own environment: after hand surgery and cast removal for DRF, BTA and HWS and at the first consultation for non-operated CTS and BTA. For the first assessment, patients were interviewed by the principal investigator in order to ensure clarity, obtain feedback from participants, and make sure instructions are properly followed. Patients were also asked to suggest additional items they felt the questionnaire was missing. However, these were either gender related (e.g. fastening a bra) or very specific and were thus not retained. Follow-up assessments were completed in our consulting office or returned by mail, leading to a total of 305 completed assessments, which provides sufficient power to support the planned Rasch analysis [33].

Rasch analysis

The 90-item experimental ABILHAND-HS questionnaire responses were analyzed using the Rasch model in RUMM2030 software (RUMM Laboratory Pty Ltd., Perth, Australia). The Rasch model [24], a prescriptive model, requires that specified response probabilities depend on only item difficulty and patient ability. Polytomous datasets with thresholds between successive response categories can be analyzed with either a rating scale model that constrains all threshold locations to be equal across items [34] or a partial credit model that allows threshold locations to vary across items [35]. Patient abilities and item difficulties are located along a common linear, unidimensional continuum that defines the latent variable of interest (i.e. manual ability). The locations are expressed in logits, calculated as the logarithm of the pass/fail probability ratio of an item or threshold. The logit locations were converted into centiles to facilitate clinical interpretation on a linear scale ranging from 0% (smallest ability) to 100% (largest ability) [36]. Expected responses, determined based on the patient and item locations, were compared to the responses actually reported to compute residual and fit statistics, which were then used to assess the scale’s unidimensionality [37]. A good fit of the data with the model affirms invariant locations along the continuum and indicates that the measure can be used to compare manual ability across patients and diagnoses.

Item selection

From the experimental version of the questionnaire, the ABILHAND-HS was refined through successive analyses of 305 assessments with the goal of selecting items that define a unidimensional and clinically relevant scale of manual ability. P values < 0.05 were considered significant for each of the following analysis steps:

  1. Item-patient targeting. Based on examination of patient distributions and item locations, items that showed a floor effect (too easy) or did not target the patients sample ability were removed.
  2. Rating scale. Items with disordered thresholds and items with thresholds that were too narrow (<1.4 logits) or too wide (>5 logits) were removed before applying the rating scale model [38].
  3. Unidimensionality. Only items that delineated a common manual ability construct according to the following four criteria were retained: (1) standardized residuals obtained over three class intervals had to be within ±2.5 with a non-significant χ2 [37]; (2) no observable major differential item functioning (DIF) [39], uniform or non-uniform, shown by a 2-way analysis of variance of the residuals with Bonferroni correction [40], according to gender (male vs. female), age (above vs. below the median age of 63 years), pathology (CTS vs. DRF vs. BTA vs. HWS), involved hand (dominant vs. non-dominant), level of education (basic vs. superior), and follow-up (0–3 months vs. 3–6 months vs. >6 months); (3) overall fit of the response set based on a non-significant item-trait interaction χ2 [37]; and (4) statistically similar patient locations, according to paired t-tests, calculated with items that loaded either positively or negatively on the first residuals principal component [4143].
  4. Local independence. When items were found to be querying redundant content [44], demonstrated by a residual correlation > 0.3, the item with the poorer fit statistic was deleted [45].
  5. Item redundancy. To shorten the scale, when two or more items had similar locations on the continuum, the one with the best fit was retained.

Scale reliability

The Person-Separation Index (PSI), i.e. the proportion of total variance (including error) that is attributed to patient location variance, was used to determine the ABILHAND-HS scale’s reliability and its degree of precision with the dataset, and thus how many statistically different ability strata can be distinguished along the scale [46].

Construct validity

The construct validity of the ABILHAND-HS was examined with a comparison of means for associations with gender, involved hand, and diagnosis. The relationships of the ABILHAND-HS with age, the QuickDASH scale, the numerical pain scale, the SF-12 Physical Component Summary (PCS), and the SF-12 Mental Component Summary (MCS) were assessed with a correlation analysis.

Patient perceptions were compared between ABILHAND-HS and QuickDASH items by adding the six QuickDASH activity items to the anchored data matrix. The locations of similar items were then compared between the scales.

Statistical analyses

Statistical analyses were completed in IBM SPSS Statistics for Windows, version 25 (IBM Corp., Armonk, N.Y., USA). Data normality was verified for statistical tests using the Shapiro-Wilk test and Q-Q plots. Parametric tests were used for normal data and continuous variables, non-parametric tests for non-normal data and ordinal variables. A Mann-Whitney u-test (two-tailed) was used for gender differences, an independent-samples t-test (two-tailed) for association with the involved hand, and an analysis of variance for diagnosis. Pearson correlation coefficient was calculated for association with age, while relationships with the QuickDASH scale, the numerical pain scale, the SF-12 Physical Component Summary (PCS), and the SF-12 Mental Component Summary (MCS) were assessed with Spearman correlation coefficients. P values < 0.05 were considered significant. Mean values are reported with standard deviations (SD). Chi-square and t values are reported with degrees of freedom (df).

Results

Item selection for the ABILHAND-HS scale

Successive analyses led to the selection of 23 items defining a unidimensional manual ability scale in HS. Of the 90 experimental items, 34 were removed because they were too easy (e.g. ‘Drinking a glass of water’), 3 items had too-narrow thresholds (e.g. ‘Using a touch screen’), 4 items were misfitting (e.g. ‘Carrying a shopping bag’), and 26 items had a location redundant with another better fitting item (e.g. ‘Peeling onions’ was deleted in favor of ‘Peeling potatoes with a knife’).

Metric properties

The calibration obtained for the 23 mostly bimanual activities retained for ABILHAND-HS is reported in Table 2 in descending difficulty order. The standardized residuals obtained matched the expected standard normal distribution for items [mean (SD), -0.30 (0.99)] and for patients [0.31 (0.97)], indicating that the ABILHAND-HS scale is globally unidimensional. An invariant item location was obtained for more- and less- able patients as shown by a nonsignificant item-trait interaction (χ² = 57.76, 46 df, p = 0.11). An invariant patient ability was obtained with items with different content as shown by a non significant t-test when using items that loaded positively or negatively on the first principal residual component (t = 1.24, 304 df, p = 0.22).

Analysis of DIF of the ABILHAND-HS with six criteria yielded only four instances of uniform DIF among the 23 items (Table 3). A small magnitude DIF was revealed among diagnoses (Fig 1) with no substantial impact on scale invariance, as evidenced by a good overall fit. Note that items were not specifically calibrated to the HWS group because of a limited sample size (n = 23) [47]. No DIF was observed between the first and last assessments, showing satisfactory invariance to support the scale follow-up stability. Likewise, an intraclass correlation coefficient across the first and last assessments was 0.94, indicating excellent item-difficulty-hierarchy consistency and providing confidence for data pooling over different time points [48]. The PSI in this sample was equal to 0.90, indicating the distinguishability of four strata of manual ability [46].

thumbnail
Fig 1. Differential item functioning (DIF) plots comparing the item difficulty hierarchy between subgroups.

In each plot, the lines represent the 95% confidence interval of an ideal invariance between subgroups; the items are represented by the dots or by their letter if they display significant DIF. The most difficult items (dots) are plotted in the top right part of each plot. When comparing the item difficulty hierarchy between each diagnostic group relative to the whole sample, most of the ABILHAND-HS items lie within 95% confidence interval of the ideal invariance, indicating an invariant difficulty across diagnostic groups. When comparing the item difficulty hierarchy between the first and last assessment, all items fall within the 95% confidence interval of an ideal invariance, affirming invariance of item difficulties between the assessments.

https://doi.org/10.1371/journal.pone.0242625.g001

Scale description

The ABILHAND-HS structure and targeting of HS patients are illustrated in Fig 2, showing an average patients’ manual ability of 1.17 logits (SD = 1.85 logits; i.e. 58 (15) centiles). Twenty-four patients (7.9%) were able to perform all 23 activities easily, and were thus identified as extreme patients. Extreme patients tended to be younger men evaluated more than 6 months after treatment, and were more likely to have a CTS rather than a HWS. The three response categories were well distinguished in HS patients, with an inter-threshold distance of 2.93 logits (24 centiles), indicating that, regardless of patient ability, rating an item as ‘easy’ is about 20 (i.e. e2.93 = 18.7) times more difficult than rating it as ‘impossible’. Although the threshold distribution (range, -4.15 to 5 logits) was well targeted to the range of patient abilities, the patients’ ability levels skewed high, indicating that the scale could measure patients that are more severely disabled than in this sample.

thumbnail
Fig 2. Structure of the ABILHAND-HS scale.

Top: distribution of manual ability measures for the whole sample expressed in logits (log of the pass/fail probability ratio) and centiles (fraction of the measurement range). Twenty-four patients (7.9%) were able to perform all 23 activities easily, and were thus identified as extreme patients. None of the participants reported that they could not perform any of the 23 activities. Middle: most probable patient response to each item based on the patient manual ability and on the difficulty of the item’s response category. The average item difficulty was set to 0 logits and the items are ordered from most (top) to least (bottom) difficult. The distance between thresholds (middle bar) is constant for all items (2.93 logits or 24 centiles). A patient with a manual ability measure of 0 logits would be expected to perform the first 3 activities easily, to have some difficulty with the following 17 activities, and to be unable to perform the 3 most difficult activities. A patient with a measure of 2.1 logits should be able to perform all activities easily or with some difficulty. Bottom: conversion of ordinal raw scores into a linear continuum of manual ability for complete response sets. The raw scores ranged from 0 to 46 (sum of scores of 0–2 for 23 items). This curve is linear in its central (30th~70th percentile) range, with sigmoid flattening outside the central range, highlighting a non-linear relationship, especially at the extremities of the score range.

https://doi.org/10.1371/journal.pone.0242625.g002

Construct validity

ABILHAND-HS measures were normally distributed across the whole sample and subgroups, except for men (W = 0.97; 100 df; p = 0.038). An effect of gender on ABILHAND-HS manual ability measures was observed, with men [1.88 (2.39) logits; median 1.74 logits] reporting a significantly higher mean manual ability than women [1.32 (1.93) logits; median 1.4 logits; U = 8642; p = 0.026]. Manual ability was not found to be significantly associated with age (R = -0.04; p = 0.47), the hand involved (t = 0.96; 303 df; p = 0.37), or the patient’s diagnosis (F = 1.92; 3 df; p = 0.12). Although variance across diagnosis groups was not significant, we did observe a broad spectrum of manual ability. Patients with CTS reported the highest manual ability [1.9 (2.0) logits], followed by patients with BTA [1.5 (2.3) logits], DRF [1.4 (2.1) logits], and HWS [0.9 (1.8) logits].

The relationships between ABILHAND-HS measures and scores obtained with other instruments are shown in Fig 3. Briefly, ABILHAND-HS correlated strongly with QuickDASH scores, moderately with SF-12 PCS scores and pain scale scores, and weakly with SF-12 MCS scores. We observed substantial similarity with respect to manual ability scale locations between the ABLHAND-HS and QuickDASH activity items (Fig 4).

thumbnail
Fig 3. Correlations of ABILHAND-HS scores with QuickDASH, PCS, MCS, and numerical pain scale scores.

Spearman correlation coefficients are indicated in the top right of each graph. All correlations were statistically significant (p < 0.001).

https://doi.org/10.1371/journal.pone.0242625.g003

thumbnail
Fig 4. Comparison of difficulty levels (vertical axis) between similar items of the ABILHAND-HS (left) and QuickDASH (right) scales.

QuickDASH item responses were added to the anchored data matrix of ABILHAND-HS responses to equate both measures.

https://doi.org/10.1371/journal.pone.0242625.g004

Discussion

Here, we report the adaptation and validation of an ABILHAND-HS questionnaire for use with HS patients. Impairments present in our study cohort included weakness (e.g. following DRF), loss of sensation (e.g. in CTS), and stiffness (e.g. in BTA), with some patients presenting with a combination of these impairments. The ABILHAND-HS was constructed to measure manual ability on a common, linear, and unidimensional scale wherein the 23 activities retained delineate an invariant item difficulty hierarchy independent of patient diagnosis. All ABILHAND-HS activities with the exception of one involve both hands and, consistent with our clinical experience, the most difficult ones require high levels of force (e.g. ‘doing push-ups’ loads the wrist in extension). Of the experimental 90 items, those that could be interpreted in different ways, for instance using the injured or uninjured hand, were misfitting and thus omitted (e.g. ‘carrying a shopping bag’). The sample size was adequate for the statistical interpretation of fit statistics [33], and was within the same range of studies dealing with the development of outcome measures [14, 15, 19, 31]. The fit statistics for the 23 retained items support the item hierarchy invariance across the latent trait [49]. A few instances of minor DIF were retained to maintain the scale’s construct validity [50]. The resulting scale is well targeted to the studied HS population, despite a small persistent ceiling effect, most likely due to missing responses for the most difficult activities. This observation of apparent ceiling effect involves 7.9% (24/305) of the records, which is well below the maximum recommended allowance of 15% [51].

Although reliability indices should be compared with caution across potentially different study conditions, it is noteworthy that the PSI obtained for the ABILHAND-HS (0.90) was higher than prior values obtained for the activities subscales of the PRWE [52] (0.78 and 0.81 for the usual and specific activities subscales, respectively, in DRF patients), for the Patient-Rated Wrist and Hand Evaluation [53] (0.83 in HS patients), for the QuickDASH scale [54] (0.84 in patients with various upper limb dysfunctions) and for the Manual Ability Measure [55] (MAM-16; 0.83 for HS patients), while being equal to that for the DASH manual functioning subscale [18]. PSI values reflect sensitivity to clinical evolution over time, with greater values indicating a greater number of distinguishable ability strata. We obtained person separation among patients using three response levels (impossible, difficult, and easy), consistent with previous studies showing patients unable to discriminate more than three levels of difficulty (ABILHAND [8], DASH activity items [18] and QuickDASH activity items [54]).

Accurate communication of scale administration instructions is critical for targeting patient manual ability as defined by the ABILHAND-HS. Generally, patients focus on their ability to perform the queried activities with their injured hand; likewise, the PRWE explores use of the affected hand explicitly [15]. The ABILHAND-HS, like the QuickDASH, is oriented towards real daily life behaviors and is intended to be independent of the limb(s) or strategy used and unbiased by activities that are never performed with the affected hand or avoided during recovery [8]. Our findings of stable item calibrations and lack of DIF across the assessments indicate that the ABILHAND-HS can be used confidently to assess the patient recovery at different time points during follow-up. Moreover, the stability of items hierarchy between the first and last evaluation indicate that the results were not influenced by the method of administration (interview with the investigator versus self-reported).

ABILHAND-HS construct validation results fit well with our clinical observations. The patients with the highest manual ability scores on the ABILHAND-HS also had the highest SF-12 PCS and SF-12 MCS scores as well as the lowest QuickDASH and numerical pain scale scores, which was also observed in other validation studies [15, 19]. Correlations of ABILHAND-HS with other instruments, including the QuickDASH, SF-12 and a pain scale are also consistent with prior findings suggesting that generic instruments are less sensitive than specific ones [56]. The present ABILHAND-HS manual ability scores were not related significantly to age, consistent with other versions of the ABILHAND [8, 26, 29]. Our findings of a small, but significant gender effect, with men tending to report a higher manual ability than women (mean difference, 0.56 logits), varied across HS diagnoses but, generally, were consistent with previous reports in patients with DRFs and wrist arthrodesis [5759]. A possible explanation is that manual ability is related to grip strength, which is more important in men compared to women [8, 60]. The construct validity of the ABILHAND-HS was further supported by our confirmation of a similar item difficulty hierarchy for QuickDASH items in our patients sample. Notably, those ABILHAND-HS activities that require a great amount of force (e.g. ‘Opening a screw-topped jar’) have been reported to likewise be among the most difficult items in the DASH and QuickDASH [18, 54] and in the MAM-16 [55].

The ABILHAND-HS, developed using Rasch methodology, has several advantages over questionnaires developed using classical test theory. These are summarized in Table 4. Firstly, the ABILHAND-HS can tolerate missing responses, which enables it to remain valid even in patients who scarcely perform some of the queried activities. Secondly, the ability to analyze response patterns can identify those patients whose responses do not fit the model due to random or careless answers, a particular injury or comorbidities. Finally, the high precision of the ABILHAND-HS items minimizes the need for interpretation, thereby allowing more reliable comparisons between patients (e.g. recreational activities involving force or impact are broken down into the items ‘doing push-ups’, ‘practicing a racket sport’).

Limitations of this research include a sample of patients with hand and wrist disabilities from one hand surgery outpatient clinic. The unbalanced diagnostic groups and genders might have influenced the item calibrations. However, the gender distribution is similar in studies involving DRF and CTS [15, 16, 31] and the DIF analysis allowed us to select items where the diagnosis and gender effects were absent. Future studies with a larger sample size should confirm or refine our findings. Furthermore, the way participants responded to the 90 items in the questionnaire may not be the same as responses to the final 23-item instrument. One limitation to the availability of Rasch model-based questionnaires is that they have a complex statistical background and require the use of dedicated computer programs that are not easy to learn and implement. To facilitate and spread the use of the ABILHAND-HS, a website (www.rehab-scales.org) developed by Université catholique de Louvain and Arsalis, a spin-off of the ABILHAND authors’ laboratory, can be used to convert the questionnaire raw scores into manual ability measures. The web service is free-to-use for daily practice in clinical and research applications although a license is required for commercial applications and for clinical trials.

Conclusion

ABILHAND-HS was demonstrated to be a successful adaptation for application in HS patients. The resulting scale was shown to be a valid, patient-oriented, clinically meaningful and precise instrument. It targets commonly performed manual activities and allows stable and linear measurement of manual ability over multiple time points in patients treated for DRF, BTA, CTS, or HWS. The scale reveals unexpected responses that may provide clues regarding the patient’s clinical state, as summarized at www.rehab-scales.org. The questionnaire is available online, and the web service is free-to-use for daily practice in clinical and research applications although a license is required for commercial applications and for clinical trials. Future research should include more patients with HWS, as well as other diagnoses such as tendinopathies, ligamentous injuries and complex hand injuries, and an assessment of scale responsiveness.

Supporting information

S2 Text. The ABILHAND-HS questionnaire (English version) instructions and scoring sheets.

The scoring sheets are available in 10 different randomizations, as found on www.rehab-scales.org.

https://doi.org/10.1371/journal.pone.0242625.s002

(DOCX)

S3 Text. The ABILHAND-HS questionnaire (French version) instructions and scoring sheets.

The scoring sheets are available in 10 different randomizations, as found on www.rehab-scales.org.

https://doi.org/10.1371/journal.pone.0242625.s003

(DOCX)

S1 Appendix. The experimental 90-item ABILHAND questionnaire.

https://doi.org/10.1371/journal.pone.0242625.s004

(XLSX)

References

  1. 1. Dubert T. Outcome measurements in hand and upper limb surgery. Chirurgie de la Main. 2014;33: 235–246. pmid:24993591
  2. 2. Berkanovic E, Hurwicz M-L, Lachenbruch PA. Concordant and discrepant views of patients’ physical functioning. Arthritis & Rheumatism. 1995;8: 94–101. pmid:7794992
  3. 3. Hewlett SA. Patients and Clinicians Have Different Perspectives on Outcomes in Arthritis. J Rheumatol. 2003;30: 877–9. pmid:12672220
  4. 4. Ziebland S, Fitzpatrick R, Jenkinson C. Tacit models of disability underlying health status instruments. Social science & medicine. 1993;37: 69–75. pmid:8332927
  5. 5. World Health Organization, editor. International classification of functioning, disability and health: ICF. Geneva: World Health Organization; 2001.
  6. 6. Barbier O, Penta M, Thonnard J-L. Outcome evaluation of the hand and wrist according to the International Classification of Functioning, Disability, and Health. Hand Clinics. 2003;19: 371–378. pmid:12945633
  7. 7. Bobos P, Lalone EA, Grewal R, MacDermid JC. Do Impairments Predict Hand Dexterity After Distal Radius Fractures? A 6-Month Prospective Cohort Study. HAND. 2018;13: 441–447. pmid:28366013
  8. 8. Penta M, Tesio L, Arnould C, Zancan A, Thonnard J-L. The ABILHAND Questionnaire as a Measure of Manual Ability in Chronic Stroke Patients: Rasch-Based Validation and Relationship to Upper Limb Impairment. Stroke. 2001;32: 1627–1634. pmid:11441211
  9. 9. Giladi AM, Chung KC. Measuring Outcomes in Hand Surgery. Clinics in Plastic Surgery. 2013;40: 313–322. pmid:23506772
  10. 10. Jaremko JL, Lambert RGW, Rowe BH, Johnson JA, Majumdar SR. Do radiographic indices of distal radius fracture reduction predict outcomes in older adults receiving conservative treatment? Clinical Radiology. 2007;62: 65–72. pmid:17145266
  11. 11. Synn AJ, Makhni EC, Makhni MC, Rozental TD, Day CS. Distal Radius Fractures in Older Patients: Is Anatomic Reduction Necessary? Clin Orthop Relat Res. 2009;467: 1612–1620. pmid:19082864
  12. 12. Young BT, Rayan GM. Outcome following nonoperative treatment of displaced distal radius fractures in low-demand patients older than 60 years. The Journal of Hand Surgery. 2000;25: 19–28. pmid:10642469
  13. 13. Changulani M, Okonkwo U, Keswani T, Kalairajah Y. Outcome evaluation measures for wrist and hand–which one to choose? International Orthopaedics. 2008;32: 1–6. pmid:17534619
  14. 14. Hudak PL, Amadio PC, Bombardier C. Development of an upper extremity outcome measure: the DASH (disabilities of the arm, shoulder and hand) [corrected]. The Upper Extremity Collaborative Group (UECG). Am J Ind Med. 1996;29: 602–608. pmid:8773720
  15. 15. MacDermid JC, Turgeon T, Richards RS, Beadle M, Roth JH. Patient rating of wrist pain and disability: a reliable and valid measurement tool. J Orthop Trauma. 1998;12: 577–586. pmid:9840793
  16. 16. Levine DW, Simmons BP, Koris MJ, Daltroy LH, Hohl GG, Fossel AH, et al. A self-administered questionnaire for the assessment of severity of symptoms and functional status in carpal tunnel syndrome. JBJS. 1993;75: 1585–1592. pmid:8245050
  17. 17. Coenen M, Kus S, Rudolf K-D, Müller G, Berno S, Dereskewitz C, et al. Do patient-reported outcome measures capture functioning aspects and environmental factors important to individuals with injuries or disorders of the hand? Journal of Hand Therapy. 2013;26: 332–342. pmid:23911076
  18. 18. Franchignoni F, Giordano A, Sartorio F, Vercelli S, Pascariello B, Ferriero G. Suggestions for Refinement of the Disabilities of the Arm, Shoulder and Hand Outcome Measure (DASH): A Factor Analysis and Rasch Validation Study. Archives of Physical Medicine and Rehabilitation. 2010;91: 1370–1377. pmid:20801254
  19. 19. Chung KC, Pillsbury MS, Walters MR, Hayward RA. Reliability and validity testing of the Michigan Hand Outcomes Questionnaire. The Journal of Hand Surgery. 1998;23: 575–587. pmid:9708370
  20. 20. Merbitz C, Morris J, Grip JC. Ordinal scales and foundations of misinference. Archives of physical medicine and rehabilitation. 1989;70: 308–312. pmid:2535599
  21. 21. Wright BD, Linacre JM. Observations are always ordinal; measurements, however, must be interval. Arch Phys Med Rehabil. 1989;70: 857–860. pmid:2818162
  22. 22. Thurstone L. The measurement of values. Chicago: The University of Chicago Press; 1959.
  23. 23. Grimby G, Tennant A, Tesio L. The use of raw scores from ordinal scales: Time to end malpractice? Journal of Rehabilitation Medicine. 2012;44: 97–98. pmid:22334345
  24. 24. Rasch G. Probabilistic models for some intelligence and attainment tests. Chicago, IL: The University of Chicago Press; 1980.
  25. 25. Penta M, Thonnard J-L, Tesio L. ABILHAND: a Rasch-built measure of manual ability. Archives of physical medicine and rehabilitation. 1998;79: 1038–1042. pmid:9749680
  26. 26. Durez P, Fraselle V, Houssiau F, Thonnard J, Nielens H, Penta M. Validation of the ABILHAND questionnaire as a measure of manual ability in patients with rheumatoid arthritis. Ann Rheum Dis. 2007;66: 1098–1105. pmid:17170054
  27. 27. Arnould C, Penta M, Renders A, Thonnard J-L. ABILHAND-Kids: a measure of manual ability in children with cerebral palsy. Neurology. 2004;63: 1045–1052. pmid:15452296
  28. 28. Vanthuyne M, Smith V, Arat S, Westhovens R, de Keyser F, Houssiau FA, et al. Validation of a manual ability questionnaire in patients with systemic sclerosis. Arthritis & Rheumatism. 2009;61: 695–703. pmid:19405012
  29. 29. Vandervelde L, Van den Bergh PYK, Penta M, Thonnard JL. Validation of the ABILHAND questionnaire to measure manual ability in children and adults with neuromuscular disorders. Journal of Neurology, Neurosurgery & Psychiatry. 2010;81: 506–512. pmid:19726413
  30. 30. Arnould C, Vandervelde L, Batcho CS, Penta M, Thonnard J-L. Can manual ability be measured with a generic ABILHAND scale? A cross-sectional study conducted on six diagnostic groups. BMJ open. 2012;2: e001807. pmid:23117570
  31. 31. Beaton DE, Wright JG, Katz JN. Development of the QuickDASH: Comparison of Three Item-Reduction Approaches. Journal of Bone and Joint Surgery (American). 2005;87A: 9. pmid:15866967
  32. 32. Ware J, Kosinski M, Keller SD. A 12-Item Short-Form Health Survey: construction of scales and preliminary tests of reliability and validity. Med Care. 1996;34: 220–233. pmid:8628042
  33. 33. Hagell P, Westergren A. Sample Size and Statistical Conclusions from Tests of Fit to the Rasch Model According to the Rasch Unidimensional Measurement Model (Rumm) Program in Health Outcome Measurement. J Appl Meas. 2016;17: 416–431. pmid:28009589
  34. 34. Andrich D. A rating formulation for ordered response categories. Psychometrika. 1978;43: 561–573.
  35. 35. Masters GN. A rasch model for partial credit scoring. Psychometrika. 1982;47: 149–174.
  36. 36. van Nes SI, Vanhoutte EK, van Doorn PA, Hermans M, Bakkers M, Kuitwaard K, et al. Rasch-built Overall Disability Scale (R-ODS) for immune-mediated peripheral neuropathies. Neurology. 2011;76: 337–345. pmid:21263135
  37. 37. Andrich D, Sheridan B, Luo G. Polytomous data. In: Interpreting RUMM2030: part II. Western Australia: Perth; 2013.
  38. 38. Linacre JM. Optimizing rating scale category effectiveness. J Appl Meas. 2002;3: 85–106. pmid:11997586
  39. 39. Holland PW, Wainer H. Differential item functioning. Hillsdale, NJ, US: Lawrence Erlbaum Associates, Inc; 1993.
  40. 40. Armstrong RA. When to use the Bonferroni correction. Ophthalmic Physiol Opt. 2014;34: 502–508. pmid:24697967
  41. 41. Linacre JM. Detecting multidimensionality: which residual data-type works best? J Outcome Meas. 1998;2: 266–283. pmid:9711024
  42. 42. Pallant JF, Tennant A. An introduction to the Rasch measurement model: an example using the Hospital Anxiety and Depression Scale (HADS). Br J Clin Psychol. 2007;46: 1–18. pmid:17472198
  43. 43. Smith EV. Detecting and evaluating the impact of multidimensionality using item fit statistics and principal component analysis of residuals. J Appl Meas. 2002;3: 205–231. pmid:12011501
  44. 44. Wright BD. Local dependency, correlations and principal components. Rasch Meas Trans 1996;10:509–11.
  45. 45. Ramp M, Khan F, Misajon RA, Pallant JF. Rasch analysis of the Multiple Sclerosis Impact Scale (MSIS-29). Health Qual Life Outcomes. 2009;7: 58. pmid:19545445
  46. 46. Fisher W. Reliability, Separation, Strata Statistics. Rasch Meas Trans. 1992;6: 238.
  47. 47. Chen W-H, Lenderking W, Jin Y, Wyrwich KW, Gelhorn H, Revicki DA. Is Rasch model analysis applicable in small sample size pilot studies for assessing item characteristics? An example using PROMIS pain behavior item bank data. Qual Life Res. 2014;23: 485–493. pmid:23912855
  48. 48. Chang W-C, Chan C. Rasch analysis for outcomes measures: some methodological considerations. Archives of Physical Medicine and Rehabilitation. 1995;76: 934–939. pmid:7487434
  49. 49. Tennant A, Conaghan PG. The Rasch measurement model in rheumatology: What is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis & Rheumatism. 2007;57: 1358–1362.
  50. 50. Hagquist C, Andrich D. Recent advances in analysis of differential item functioning in health research using the Rasch model. Health and Quality of Life Outcomes. 2017;15: 181. pmid:28927468
  51. 51. McHorney CA, Tarlov AR. Individual-patient monitoring in clinical practice: are available health status surveys adequate? Quality of Life Research. 1995;4: 293–307. pmid:7550178
  52. 52. Esakki S, MacDermid JC, Vincent JI, Packham TL, Walton D, Grewal R. Rasch analysis of the patient-rated wrist evaluation questionnaire. Arch Physiother. 2018;8:5. pmid:29497563
  53. 53. Packham T, MacDermid JC. Measurement properties of the Patient-Rated Wrist and Hand Evaluation: Rasch analysis of responses from a traumatic hand injury population. Journal of Hand Therapy. 2013;26: 216–224. pmid:23561017
  54. 54. Franchignoni F, Ferriero G, Giordano A, Sartorio F, Vercelli S, Brigatti E. Psychometric properties of QuickDASH–A classical test theory and Rasch analysis study. Manual Therapy. 2011;16: 177–182. pmid:21044859
  55. 55. Chen CC, Granger CV, Peimer CA, Moy OJ, Wald S. Manual Ability Measure (MAM-16): a preliminary report on a new patient-centred and task-oriented outcome measure of hand function. Journal of Hand Surgery. 2005;30: 207–216. pmid:15757777
  56. 56. Aktekin LA, Eser F, Başkan BM, Sivas F, Malhan S, Öksüz E, et al. Disability of Arm Shoulder and Hand Questionnaire in rheumatoid arthritis patients: relationship with disease activity, HAQ, SF-36. Rheumatol Int. 2011;31: 823–826. pmid:20680284
  57. 57. Amorosa LF, Vitale MA, Brown S, Kaufmann RA. A Functional Outcomes Survey of Elderly Patients who Sustained Distal Radius Fractures. Hand (N Y). 2011;6: 260–267. pmid:22942849
  58. 58. Cowie J, Anakwe R, McQueen M. Factors Associated with One-Year Outcome after Distal Radial Fracture Treatment. Journal of Orthopaedic Surgery. 2015;23: 5. pmid:25920638
  59. 59. Owen DH, Agius PA, Nair A, Perriman DM, Smith PN, Roberts CJ. Factors predictive of patient outcome following total wrist arthrodesis. The Bone & Joint Journal. 2016;98-B: 647–653. pmid:27143736
  60. 60. Arnould C, Penta M, Thonnard J. Hand impairments and their relationship with manual ability in children with cerebral palsy. Journal of Rehabilitation Medicine. 2007;39: 708–714. pmid:17999009