Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Validating the Michigan Hand Outcomes Questionnaire in patients with rheumatoid arthritis using Rasch analysis

  • Mayank Jayaram,

    Roles Conceptualization, Writing – original draft, Writing – review & editing

    Affiliation University of Michigan Medical School, Ann Arbor, Michigan, United States of America

  • Chang Wang,

    Roles Formal analysis, Writing – original draft, Writing – review & editing

    Affiliation University of Michigan School of Public Health, Ann Arbor, Michigan, United States of America

  • Lu Wang,

    Roles Formal analysis, Supervision, Writing – review & editing

    Affiliation University of Michigan School of Public Health, Ann Arbor, Michigan, United States of America

  • Kevin C. Chung

    Roles Conceptualization, Supervision, Writing – review & editing

    kecchung@med.umich.edu

    Affiliation Section of Plastic Surgery, Michigan Medicine, Ann Arbor, Michigan, United States of America

Abstract

Introduction

The Michigan Hand Outcomes Questionnaire (MHQ) is a patient-reported outcome measure previously validated in patients with rheumatoid arthritis (RA) using classical test theory. Rasch analysis is a more rigorous method of questionnaire validation that has not been used to test the psychometric properties of the MHQ in patients with RA. The objective of this study is to evaluate the validity and reliability of the MHQ for measuring outcomes in patients with RA with metacarpophalangeal joint deformities.

Methods

We performed a Rasch analysis using baseline data from the Silicone Arthroplasty in Rheumatoid Arthritis (SARA) prospective cohort study. All domains were tested for threshold ordering, item fit, targeting, differential-item functioning, unidimensionality, and internal consistency.

Results

The Function and Work domains showed excellent fit to the Rasch model. After making adjustments, the Pain, Activities of Daily Living (ADL) and Satisfaction domains also fulfilled all Rasch model criteria. The Aesthetics domain met the majority of Rasch criteria, but could not be tested for unidimensionality.

Conclusions

After collapsing disordered thresholds and removing misfitting items, the MHQ demonstrated reliability and validity for assessing outcomes in patients with RA with metacarpophalangeal joint deformities. These results suggest that interpreting individual domain scores may provide more insight into a patient’s condition rather than analyzing an overall MHQ summary score. However, more Rasch analyses are needed in other RA populations before making adjustments to the MHQ.

Introduction

Rheumatoid arthritis (RA) is a progressive autoimmune disease that primarily affects joints in the hands, wrists, and knees, and can lead to severe pain and swelling in the affected region. Nearly 1.3 million adults in the United States are living with RA, and approximately 70% will experience some impairment to hand performance as a result of the disease [1, 2]. RA often leads to disfiguration at the metacarpophalangeal (MCP) joint, resulting in ulnar deviation and reduced satisfaction with the appearance of the hand [3]. Silicone metacarpophalangeal joint arthroplasty (SMPA) can be used to treat MCP joint deformities in patients with RA; however, there is no consensus on the long-term benefits of this procedure [4]. As a result, it is necessary to develop and validate reliable instruments in the RA population that incorporate the patient’s perspective into assessment of treatment outcomes [5].

Patient-reported outcome measures (PROMs) are assessment tools used in clinical research to gain insight into how a patient perceives outcomes following treatment. PROMs can be helpful in hand/wrist conditions to inform investigators about how quality of life measures such as perceived pain after injury can influence outcomes [6]. The Michigan Hand Outcomes Questionnaire (MHQ) is a 37-item PROM frequently used to measure outcomes in musculoskeletal disorders involving the hand/wrist. Since its development, the MHQ has been used in several multinational clinical trials to study conditions such as RA, distal radius fractures, and finger amputations [79]. The MHQ has been translated into 12 languages and validated in numerous upper extremity musculoskeletal conditions [1014].

Previous methods used to validate the MHQ for RA relied on classical test theory (CTT). However, Rasch analysis provides an alternative method of questionnaire validation that more comprehensively tests the psychometric properties of a questionnaire [15]. Specifically, Rasch analysis investigates each item for biases, redundancy, and ambiguous wording [16]. Results from Rasch analyses can be used to eliminate poorly functioning items in the questionnaire and optimize a PROM for the condition of interest.

The purpose of this study is to use Rasch analysis to rigorously test the psychometric properties of the MHQ in a RA study population. Results from this study can strengthen the credibility of the MHQ as a valid and reliable outcome measure for individuals with RA-related hand deformities.

Materials and methods

MHQ

An expert panel comprised of patients with hand disorders, hand therapists, and hand surgeons were involved in the development of the MHQ [11]. These individuals used previously validated hand questionnaires to categorize a preliminary list of 100 items into 6 domains (Function, Activities of Daily Living (ADLs), Work, Pain, Aesthetics, Satisfaction) that each measure a unique construct related to hand performance. Two psychometricians evaluated the initial 100 items in the MHQ for item redundancy and ambiguous wording and then factor analysis was performed to further optimize the number of items in each scale. Following factor analysis, the MHQ was pilot tested in a cohort of 200 patients before the final version was developed. The finalized MHQ contains 37 items with 5 response options per question. Each of the 6 domains is scored on a scale between 0 and 100. For the pain domain, lower scores represent less pain, and for all other domains, higher scores indicate better hand health.

Study sample

For this analysis, we used data from the Silicone Arthroplasty in Rheumatoid Arthritis (SARA) prospective cohort study. SARA is one of the largest studies to collect information on patients with rheumatoid arthritis with severe deformities at the MCP joint [17]. Patients were recruited from two sites in the United States and one site in the United Kingdom. Study participants were divided into two cohorts: surgical treatment with SMPA or medical management without surgery (non-SMPA). The MHQ was used to measure patient-reported outcomes at baseline and at several intervals following treatment in the follow-up period. Rasch analysis was performed using baseline MHQ scores, and all participants were combined into one cohort for analysis.

Rasch analysis

A key requirement for Rasch analysis is unidimensionality, so the entire methodology was applied to each domain. The methodology for this analysis is based on Tennant and Conaghan’s criteria for Rasch analysis [18]. This method has been previously used to validate the MHQ [19].

Model choice

The likelihood ratio test was performed to determine which Rasch model to use. The rating scale model (RSM) assumes each response option in a category are equidistant. The RSM is nested within the partial credit model (PCM), which assumes the threshold distance between response options are not equally spaced. A non-significant likelihood ratio indicates equidistant response options and fulfils the assumptions of the RSM allowing us to simplify the model and facilitate interpretation. A significant likelihood ratio indicates thresholds between response options are not equally spaced and the partial credit model (PCM) should be used [18].

Threshold ordering

A threshold is observed when an individual has an equal chance of selecting between two response options in a questionnaire. In the MHQ, a threshold would occur if an individual is equally likely to select “very good” or “good” to describe his/her finger mobility. However, if individuals have difficulty discriminating between these response options, disordered thresholds could result. For example, disordered thresholds would occur if individuals with excellent hand function select “good” to describe their finger mobility, whereas individuals with worse hand function select “very good.” In this scenario the ordering of response options by severity of hand dysfunction is not in congruence with the expectations of the questionnaire, and individuals will not be accurately sorted by hand ability. Disordered thresholds are the result of ambiguous wording or too many response options for the item and are addressed by collapsing categories to preserve the latent structure of the MHQ [20].

Item fit

Item fit measures how well observed data meets the expectations of the Rasch model. If responses to certain items are different from model expectations, the item misfits the Rasch model. Item fit was assessed at the individual item and overall domain levels. Individual item fit was assessed by deriving chi-square (X2) statistics from the residual sum of squares [21]. Significant p-values (p <0.05) following Bonferroni adjustment indicate poor fit to the Rasch model [18]. Bonferroni adjustment is a method used to test item fit for a group of items within a specific domain. For a group of items with size k, Bonferroni adjustment compares the minimal p-value with 0.05/k. Therefore, in our case with 5 items, we compare to a significance level of 0.01.

Overall domain fit was assessed using the information-weighted (infit) and outlier-sensitive (outfit) mean square statistics (MNSQ). If the items perfectly fit the Rasch model, they will have MNSQ statistics approximately equal to 1 [22]. Infit MNSQ scores <1 indicate overfit or redundancy, and outfit MNSQ scores >1 indicate underfit and large deviations between observed and expected behavior to the Rasch model [22]. MNSQ scores between 0.6 and 1.4 are acceptable for overall fit to the Rasch model [23]. An overall test of model fit for each domain was also performed using Andersen’s conditional likelihood ratio with a significance of p <0.05 [24].

Targeting

All items in the MHQ were stratified by item difficulty with easier questions located at one end of the continuum and difficult questions located at the other end. Similarly, all people were separated across the same continuum based on person-ability. A well targeted measure is one that has mean item difficulties at a similar location to average person-ability. Poorly targeted measures have items that are too difficult or too simple to accurately evaluate the study population. Targeting will be assessed by visually inspecting the person-item map to ensure each domain captures the broad range of ability levels of the study population [25].

Differential-item functioning

Differential-item functioning (DIF) occurs when individuals respond differently to an item based on characteristics such as age, gender, socioeconomic status, etc. DIF can be classified as uniform or non-uniform. Uniform DIF occurs when the differences in responses are consistent across the characteristic being measured and is addressed by splitting the groups and independently calibrating each subgroup to the Rasch model [26]. Non-uniform DIF occurs when differences in response options are inconsistent across the trait being measured and implies an inherent issue with the item that may be causing the abnormal response pattern. Non-uniform DIF is addressed by removing the item from the Rasch model [27].

For this analysis, DIF was tested for dominant hand, education level, and location (US or UK). Three models were generated. Model 1 assumes no DIF exists, model 2 assumes uniform DIF exists, and model 3 assumes non-uniform DIF exists. If there is a significant difference between models 1 and 2, the item exhibits uniform DIF, and if there is a significant difference between models 2 and 3, the item has non-uniform DIF and will be removed from analysis [28].

Unidimensionality

Each domain was analyzed for unidimensionality using the Martin-Lӧf test. The Martin-Lӧf test is used to evaluate if all the items in a domain are related by a single factor indicating unidimensionality. A significant result (p<0.05) indicates multidimensionality [29]. Response dependency is another method to test for unidimensionality and occurs when a participant’s answer to one item influences their response to another item in the questionnaire, thereby invalidating local independence. If the correlation between two items is > 0.3 from the average residual correlation for all items in the domain, the items demonstrate response dependency [30].

Internal consistency

Internal consistency measures reliability and was calculated using Cronbach’s α. A Cronbach’s α >0.70 indicates high reliability, while a Cronbach’s α >0.90 indicates high internal consistency with redundancy [31]. All the statistical analyses were performed using R version 4.0.2 with a significance level of p-value<0.05.

Results

The final study population included 162 participants from SARA who completed the MHQ at enrollment. The majority of enrolled participants were white (89%), female (73%), and had an income of ≤$50,000/year (72%) (Table 1). The likelihood ratio was significant for all domains except Function. Therefore, the RSM was used for Function and the PCM was used for all other domains. The conditional likelihood ratio for all domains was p>0.05 indicating good model fit for each domain (Table 2).

thumbnail
Table 1. Demographic characteristics for the Silicone Arthroplasty in Rheumatoid Arthritis Cohort.

https://doi.org/10.1371/journal.pone.0254984.t001

thumbnail
Table 2. Andersen’s conditional likelihood ratio (CLR) for each domain.

https://doi.org/10.1371/journal.pone.0254984.t002

Function

No disordered thresholds were observed (Fig 1A) and all items fit well to the Rasch model (Table 3). Function was well targeted and no DIF was observed for dominant hand, education level, or location. Function had high internal consistency (Cronbach’s α = 0.87), was unidimensional according to the Martin-Lӧf test (p = 0.99), and demonstrated no response dependency (Table 4). Overall, Function fit the Rasch model extremely well.

thumbnail
Fig 1.

Person-Item Map and Threshold Ordering for Function (1a, left), ADLs after collapsing disordered thresholds (1b, middle), and Work (1c, right).

https://doi.org/10.1371/journal.pone.0254984.g001

ADLs

Two items (wash your hair and tie shoelaces/knots) had disordered thresholds with difficulty discriminating between “very difficult” and “moderately difficult” (Fig 2A).

thumbnail
Fig 2.

Disordered thresholds for ADLs (2a, left) and Satisfaction (2b, right).

https://doi.org/10.1371/journal.pone.0254984.g002

Additionally, five items (turn a door knob, pick up a coin, hold a glass of water, turn a key in a lock, and tie shoelaces/knots) had non-uniform DIF for dominant hand.

After collapsing thresholds (Fig 1B) and removing all items with non-uniform DIF, the remaining items showed excellent fit to the Rasch model (Table 3). No additional DIF was observed for dominant hand, education level, or location and the items were well-targeted to the study population. ADL had high internal consistency (Cronbach’s α = 0.94), was unidimensional according to the Martin-Lӧf test (p = 1), and had no unusual patterns in the residuals indicating no response dependency (Table 4). After removing a few items and collapsing disordered thresholds, ADLs showed excellent fit to the Rasch model.

Work

No disordered thresholds were observed (Fig 1C) and all items fit well to the Rasch model (Table 3). One item “How often did you accomplish less in your work because of problems with your hand/wrist” showed uniform DIF for education level that was resolved after independently calibrating the domain for each subgroup. Work was well targeted and no other DIF was observed for hand dominance, education level or location. Work showed high internal consistency with a Cronbach’s α of 0.90, was unidimensional according to the Martin-Lӧf test (p = 0.23), and did not show any response dependency (Table 4). Overall, Work fit well to the Rasch model.

Pain

No disordered thresholds were observed (Fig 3A) and all items fit the Rasch model well (Table 3). Pain was well targeted, did not show DIF for hand dominance or education level, and had excellent internal consistency with a Cronbach’s alpha of 0.85 (Table 4). Item 5 (How often did the pain in your hand make you unhappy?) showed non-uniform DIF for location and was removed from analysis. Pain was unidimensional according to the Martin-Lӧf test (p = 0.81) and no unusual patterns were observed in the residuals indicating local independence. Overall, after removing item 5, Pain showed excellent fit to the Rasch model.

thumbnail
Fig 3.

Person-Item Map and Threshold Ordering for Pain (3a, left), Aesthetics (3b, middle), and Satisfaction after collapsing disordered thresholds (3c, right).

https://doi.org/10.1371/journal.pone.0254984.g003

Aesthetics

No disordered thresholds were observed (Fig 3B) and all items fit well (Table 3). Non-uniform DIF for education level was observed for item 4 (“The appearance (look) of my hand sometimes made me uncomfortable in public”). No DIF was observed for dominant hand. Aesthetics had good internal consistency with a Cronbach’s α of 0.76 and did not show any unusual patterns in the residuals indicating no response dependency (Table 4). After removing item 4, there were not enough items to test for unidimensionality. Overall, Aesthetics showed adequate fit to the Rasch model, but could not be tested for unidimensionality following removal of item 4.

Satisfaction

Disordered thresholds were observed for items 1–4 (satisfaction with overall function of your hand, motion of the fingers in your hand, motion of your wrist, and strength of your hand) with participants having difficulty distinguishing between “somewhat dissatisfied” and “neither satisfied nor dissatisfied” (Fig 2B). Additionally, item 3 (motion of your wrist) showed poor fit (p = 0.001) to the Rasch model.

Following removal of item 3 and collapsing of disordered thresholds (Fig 3C), all items had excellent fit to the Rasch model (Table 3). No DIF was observed for hand dominance or education level and Satisfaction had high internal consistency with a Cronbach’s α of 0.87 (Table 4). Satisfaction was unidimensional according to the Martin-Lӧf test (p = 0.97) and no response dependency was observed in the residuals. Overall, after removing item 3 and collapsing disordered thresholds, Satisfaction showed excellent fit to the Rasch model.

Discussion

Our results show that after removing several misfitting items and collapsing disordered thresholds, the MHQ fits well to the Rasch model. Although the MHQ was derived from CTT, Rasch analysis showed that the MHQ has excellent adaptability as an interval-level instrument after making certain adjustments. Because the MHQ was derived from CTT, it is expected that some adjustments will be needed to convert an ordinal scale into an interval level measurement. However, all domains demonstrated local independence, internal consistency, and excellent targeting to the SARA cohort. Additionally, two domains (Function and Work) satisfied all Rasch criteria. The remaining four domains (ADLs, Pain, Aesthetics, and Satisfaction) required some adjustments before fitting the Rasch model.

In ADLs, disordered thresholds were observed for two items (wash your hair and tie shoelaces/knots) with participants unable to distinguish between “very difficult” and “moderately difficult.” Collapsing response options such that “very difficult” and “moderately difficult” are combined into one category addressed this problem, while maintaining a consistent ordering of thresholds across the continuum. Additionally, five items (turn a door knob, pick up a coin, hold a glass of water, turn a key in a lock, and tie shoelaces/knots) showed non-uniform DIF when the dominant hand was injured. Previous studies on patients with RA have demonstrated that individuals with increased mechanical stress experience increased damage to their dominant hand, which could result in the abnormal response pattern observed in ADLs [32, 33]. However, ADLs showed no DIF for education level indicating acceptable construct validity across other demographic variables. In Pain, non-uniform DIF for location was observed for item 5 (How often did the pain in your hand make you unhappy?). This may demonstrate that the association between pain and mood may vary across different cultures. No DIF was observed for hand dominance or education for the Pain domain.

In Aesthetics, item 4 (“The appearance (look) of my hand sometimes made me uncomfortable in public”) showed non-uniform DIF for education level, indicating inconsistencies in the way participants with similar levels of hand deformities answered this item based on their education level. This could result from varying life experiences that cause some individuals to experience more anxiety about the appearance of their hand in public settings. After removing item 4 from analysis, all items fit well with high internal consistency and excellent targeting, however, we could not test Aesthetics for unidimensionality because there were too few items remaining in the domain. These results suggest that including an additional item in Aesthetics may help to more accurately assess the association between hand appearance and RA outcomes.

Finally, Satisfaction had disordered thresholds in four of the six items with participants having difficulty discriminating between “neither satisfied nor dissatisfied” and “somewhat dissatisfied.” This could be the result of too many response options or ambiguity in the phrasing of these response categories. However, collapsing these responses addressed this limitation and maintained the latent structure of the domain. Additionally, item 3 (how satisfied are you with the motion of your wrist?) showed misfit to the Rasch model. This item could misfit because some participants who rely more on their wrist for daily activities may be less satisfied with their wrist motion than others with less reliance on their hand/wrist.

After these modifications, all domains satisfied Rasch criteria with the exception of Aesthetics which could not be tested for unidimensionality. Within the domains of the MHQ, the Function and Work domain scores showed ideal fit to the Rasch model. The remaining 4 domains required some adjustments before fitting the Rasch model. These results suggest that the Function and Work domains may be more reliable and valid than the other domains for interpreting results from MHQ administration. As a result, averaging all domains of the MHQ to create a summary score may not be indicative of a patient’s true experience living with rheumatoid arthritis because each domain may not equally contribute to a patient’s overall assessment of their hand outcome. By identifying the domains that are most accurate for measuring outcomes in patients with RA, we suggest clinicians focus more on the interpretation of individual domain scores when assessing outcomes in patients with RA. Additionally, we hope future investigators will use Rasch analysis to investigate the psychometric properties of outcome instruments used in RA to observe if other instruments would also benefit from a domain-specific interpretation. These results can help personalize RA treatment and optimize the way the MHQ is interpreted and applied in clinical practice.

Although certain domains fit the Rasch model better than others, we currently do not recommend making modifications to the MHQ. This is the first Rasch analysis performed on the MHQ in a RA population with MCP joint deformities and certain items that misfit in this cohort may fit well in other RA populations. By removing certain items in the MHQ after a single analysis, we may reduce the content validity of the overall assessment tool. Repeated Rasch analyses in other RA populations are needed to identify specific items that consistently misfit the Rasch model. If future analyses demonstrate similar item misfit, it presents an opportunity to develop an alternative version of the MHQ that is more specific to the RA population. Because the MHQ was developed as a generalized PROM that can be used to evaluate a variety of upper extremity musculoskeletal conditions, there may be certain items/domains that are more important in certain conditions. For example, our study demonstrates that the Function and Work domains may be more accurate for assessing outcomes in a RA population. However, other domains in the MHQ such as ADLs or Satisfaction could be more important in traumatic injuries such as finger amputations or distal radius fractures. Therefore, making significant changes to the MHQ could affect its ability to be used for other upper extremity conditions. Repeated Rasch analyses of the MHQ in other RA populations, however, could lead to the development of an alternative, RA-specific version of the MHQ that is more accurate for patients with RA.

Similarly, several items in the MHQ required collapsing of disordered thresholds to fit the Rasch model. This suggests that there are too many response options in the MHQ and that fewer response options may retain the properties of the questionnaire while maintaining its validity and reliability. Although certain categories required collapsing of thresholds, it does not limit the current version of the MHQ, which can still discriminate among patients with different levels of hand performance. In its current state, we recommend clinicians continue to use the complete MHQ for analyzing patients with RA. After more Rasch analyses are performed, an alternative, RA-specific version of the MHQ can be developed and a conversion system will be created that can easily convert scores between the old MHQ and the newer more RA-specific version of the MHQ.

Overall, the few adjustments required to satisfy Rasch criteria are not large enough to justify a modification to the MHQ at this time, and more studies are needed to investigate if similar misfit occurs in other populations before attempting to develop an RA-specific alternative to the MHQ. Moreover, these additional studies can provide clarity in how much each domain contributes to the MHQ to develop a summary score that weights each domain appropriately in a RA population.

One limitation is that the majority of enrolled SARA participants were white females, and these results may not translate across other racial and gender characteristics. Additionally, this Rasch analysis was performed using baseline data at single point in time, and does not analyze the psychometric properties of the MHQ following RA treatment over time. Finally, this study is specific to patients with MCP joint deformities and patients with RA with other hand/wrist problems may not respond the same way.

After removing misfitting items and collapsing disordered thresholds, all domains in the MHQ except for Aesthetics showed high validity and reliability in patients with RA with MCP joint deformities. The Aesthetics domain fulfilled all criteria except for unidimensionality. A domain-specific interpretation of the MHQ may provide more insight into RA outcomes than an overall summary score. Additionally, more studies are needed to identify items in the MHQ that consistently misfit in a RA cohort before any adjustments are considered in the MHQ.

Supporting information

Acknowledgments

The authors would like to thank Sandra Kotsis for her help in reviewing and revising the manuscript.

References

  1. 1. Chung KC, Burns PB, Wilgis EFS, Burke FD, Regan M, Kim HM, et al. A Multicenter Clinical Trial in Rheumatoid Arthritis Comparing Silicone Metacarpophalangeal Joint Arthroplasty With Medical Treatment. J Hand Surg. 2009; 34(5):815–23. pmid:19410984
  2. 2. Helmick CG, Felson DT, Lawrence RC, Gabriel S, Hirsch R, Kwoh CK, et al. Estimates of the prevalence of arthritis and other rheumatic conditions in the United States. Part I. Arthritis Rheum. 2008;58(1):15–25. pmid:18163481
  3. 3. Chung KC, Burns PB, Kim HM, Burke FD, Wilgis EFS, Fox DA. Long-term followup for rheumatoid arthritis patients in a multicenter outcomes study of silicone metacarpophalangeal joint arthroplasty. Arthritis Care Res. 2012;64(9):1292–300.
  4. 4. Abboud JA, Beredjiklian PK, Bozentka DJ. Metacarpophalangeal joint arthroplasty in rheumatoid arthritis. J Am Acad Orthop Surg. 2003;11(3):184–91. pmid:12828448
  5. 5. Malm K, Bergman S, Andersson ML, Bremander A, Larsson I. Quality of life in patients with established rheumatoid arthritis: A phenomenographic study. SAGE Open Med. 2017;5:2050312117713647. pmid:28611920
  6. 6. Dacombe PJ, Amirfeyz R, Davis T. Patient-Reported Outcome Measures for Hand and Wrist Trauma: Is There Sufficient Evidence of Reliability, Validity, and Responsiveness? Hand (N Y). 2016;11(1):11–21.
  7. 7. Sears ED, Burns PB, Chung KC. Relationship between Patient Expectations and Clinical Measures in Patients Undergoing Rheumatoid Hand Surgery from the Silicone Arthroplasty in Rheumatoid Arthritis (SARA) Study. Plast Reconstr Surg. 2015;136(6):775e–81e. pmid:26595031
  8. 8. Chung KC, Kim HM, Malay S, Shauver MJ. The Wrist and Radius Injury Surgical Trial: 12-Month Outcomes from a Multicenter International Randomized Clinical Trial. Plast Reconstr Surg. 2020;145(6):1054e–66e. pmid:32195857
  9. 9. Chung KC, Yoon AP, Malay S, Shauver MJ, Wang L, Kaur S. Patient-Reported and Functional Outcomes After Revision Amputation and Replantation of Digit Amputations: The FRANCHISE Multicenter International Retrospective Cohort Study. JAMA Surg. 2019;154(7):637–46. pmid:30994871
  10. 10. [Available from: http://mhq.lab.medicine.umich.edu/translations].
  11. 11. Chung KC, Pillsbury MS, Walters MR, Hayward RA. Reliability and validity testing of the Michigan Hand Outcomes Questionnaire. J Hand Surg Am. 1998;23(4):575–87. pmid:9708370
  12. 12. Waljee JF, Chung KC, Kim HM, Burns PB, Burke FD, Wilgis EF, et al. Validity and responsiveness of the Michigan Hand Questionnaire in patients with rheumatoid arthritis: a multicenter, international study. Arthritis Care Res (Hoboken). 2010;62(11):1569–77. pmid:20521331
  13. 13. Kotsis SV, Chung KC. Responsiveness of the Michigan Hand Outcomes Questionnaire and the Disabilities of the Arm, Shoulder and Hand questionnaire in carpal tunnel surgery. J Hand Surg Am. 2005;30(1):81–6. pmid:15680560
  14. 14. Chung KC. MHQ Michigan Hand Outcomes Questionnaire 2014 [Available from: http://mhq.lab.medicine.umich.edu/translations.
  15. 15. Rusch T, Lowry PB, Mair P, Treiblmaier H. Breaking free from the limitations of classical test theory: Developing and measuring information systems scales using item response theory. Information & Management. 2017;54(2):189–203.
  16. 16. Salaffi F, Franchignoni F, Giordano A, Ciapetti A, Gasparini S, Ottonello M. Classical test theory and Rasch analysis validation of the Recent-Onset Arthritis Disability questionnaire in rheumatoid arthritis patients. Clin Rheumatol. 2013;32(2):211–7. pmid:23065133
  17. 17. Chung KC, Kotsis SV, Burns PB, Burke FD, Wilgis EFS, Fox DA, et al. Seven-Year Outcomes of the Silicone Arthroplasty in Rheumatoid Arthritis Prospective Cohort Study. Arthritis Care Res.2017;69(7):973–81. pmid:27696739
  18. 18. Tennant A, Conaghan PG. The Rasch measurement model in rheumatology: What is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis Rheum. 2007;57(8):1358–62. pmid:18050173
  19. 19. Jayaram M, Wang C, Shauver MJ, Wang L, Chung KC. Using Rasch analysis to validate the Michigan Hand Outcomes Questionnaire from the WRIST Trial. Plast Reconstr Surg, In press.
  20. 20. Esakki S, MacDermid JC, Vincent JI, Packham TL, Walton D, Grewal R. Rasch analysis of the patient-rated wrist evaluation questionnaire. Archives of Physiotherapy: Springer Science and Business Media LLC; 2018. p. 5. pmid:29497563
  21. 21. Heffernan E, Maidment DW, Barry JG, Ferguson MA. Refinement and Validation of the Social Participation Restrictions Questionnaire. Ear Hear. 2019;40(2):28–39. pmid:29905669
  22. 22. Bond T. G., & Fox C. M. Applying the Rasch model fundamental measurement in the human sciences (2nd ed.). Mahwah, NJ: Lawrence Erlbaum Associates 2007.
  23. 23. Mohaffyza Mohamad M, Lisa Sulaiman N, Chee Sern L, Mohd Salleh K. NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Peer-review under responsibility of Faculty of Technical and Vocational Education Measuring the Validity and Reliability of Research Instruments. Procedia-Social and Behavioral Sciences2015. p. 5–6.
  24. 24. Andersen EB. A goodness of fit test for the Rasch model. Psychometrika. 1973;38(1):123–40.
  25. 25. Boone WJ. Rasch analysis for instrument development: Why,when,and how? CBE Life Sci Educ. 2016;15(4):rm4. pmid:27856555
  26. 26. Robinson M, Johnson AM, Walton DM, MacDermid JC. A comparison of the polytomous Rasch analysis output of RUMM2030 and R (ltm/eRm/TAM/lordif). BMC Med Res Methodol.2019;19(1):36. pmid:30786868
  27. 27. Pallant JF, Tennant A. An introduction to the Rasch measurement model: An example using the Hospital Anxiety and Depression Scale (HADS). Br J Clin Psychol.2007;46(Pt 1):1–18. pmid:17472198
  28. 28. Choi SW, Gibbons LE, Crane PK. lordif: An R Package for Detecting Differential Item Functioning Using Iterative Hybrid Ordinal Logistic Regression/Item Response Theory and Monte Carlo Simulations. J StatSoftw.2011;39(8):1–30.
  29. 29. Christensen K.B., Bjourner J.B., Kreiner S. et al. Testing uindimensionality in poytomous Rasch models. Psychometrika 67, 563–574 (2002).
  30. 30. Christensen K.B., Makransky G, Horton M. Critical Values for Yen’s Q3: Identification of Local Dependence in the Rasch Model Using Residual Correlations. Applied Psychological Measurement. 2017;41(3):178–194. pmid:29881087
  31. 31. Tsang S, Royse CF, Terkawi As. Guidelines for developing, translating, and validating a questionnaire in perioperative and pain medicine. Saudi J Anaesth.2017;11(Suppl 1):S80–89. pmid:28616007
  32. 32. Yaku A, Hashimoto M, Furu M, Ito H, Yamakawa N, Yamamoto W, et al. Relationship between handedness and joint involvement in rheumatoid arthritis. Sci Rep. 2016;6:39180. pmid:27976695
  33. 33. Owsianik WDJ, Kundi A, Whitehead JN, Kraag GR, Goldsmith C. Radiological articular involvement in the dominant hand in rheumatoid arthritis. Ann Rheum Dis.1980;39(5):508–10. pmid:7436583