Figures
Abstract
Children with severe motor and intellectual disabilities experience chronic pain but cannot communicate verbally. However, no Japanese tool currently exists for assessing pain in this population. This study aimed to develop and evaluate the reliability and validity of a Japanese version of the Paediatric Pain Profile, which is a behavioral rating scale to assess pain in children with severe neurological disabilities. The sample comprised 30 children with severe motor and intellectual disabilities at three hospitals in Japan. Three specialist nurses rated low and high pain video scenes of the children (twice at 1-week intervals) using the Face, Legs, Activity, Cry, Consolability behavioral scale and a translated Japanese version of the Paediatric Pain Profile. On the basis of their ratings, we calculated the internal consistency, test–retest reliability, and intra- and inter-observer reliabilities of the Paediatric Pain Profile. Additionally, we assessed concurrent validity using the Face, Legs, Activity, Cry, Consolability behavioral scale and construct validity using low versus high pain scenes. Both internal consistency (low pain: alpha = 0.735; high pain: alpha = 0.928) and test–retest reliability (r = 0.846) of the Japanese version of the Paediatric Pain Profile were good. Intra-observer reliability was substantial (r = 0.748), whereas inter-observer reliability was only moderate (r = 0.529). However, the concurrent validity with Face, Legs, Activity, Cry, Consolability scores was good (r = 0.629) and construct validity was confirmed (p < 0.001). We confirmed the validity of the Japanese version of the Paediatric Pain Profile, but reliable pain assessment may require repeated ratings by the same person. To accurately assess pain in children with severe motor and intellectual disabilities, healthcare staff must be properly trained and become more skilled in using the Japanese version of the Paediatric Pain Profile.
Citation: Okita M, Nio K, Murabata M, Murata H, Iwamoto S (2020) Reliability and validity of the Japanese version of the Paediatric Pain Profile for children with severe motor and intellectual disabilities. PLoS ONE 15(12): e0243566. https://doi.org/10.1371/journal.pone.0243566
Editor: Marzia Lazzerini, IRCCS materno infantile Burlo Garofolo, ITALY
Received: May 22, 2020; Accepted: November 23, 2020; Published: December 22, 2020
Copyright: © 2020 Okita et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting information files.
Funding: This study was supported by JSPS KAKENHI Grant Number JP19K11031 and Mie Medical Research foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The concept of “severe motor and intellectual disabilities” (SMID) is a novel concept that originated in Japan. The definition of a person with SMID is an individual who is bedridden or able to sit, crawl, or walk with support, and has a profound intellectual disability (IQ < 35; Diagnostic and Statistical Manual, Fourth Edition) [1]. Individuals with SMID have motor abilities comparable to levels IV–V of the Gross Motor Function Motor Classification System (GMFCS) [2], and they only communicate nonverbally or using body language. The concept of SMID is very similar to that of severe multiple disabilities [3]. The main causes of SMID are hypoxic encephalopathy/postnatal distress (20.0% of cases); unknown prenatal causes (10.7%); external causes, including brain injury (8.3%); and meningitis/encephalitis (7.0%) [4]. There are approximately 43,000 SMID patients in Japan, and their number is increasing. Some of these patients require advanced home medical care, such as mechanical ventilation.
Previous studies show that children with severe neurological or cognitive impairments experience chronic pain [5–10] from gastrointestinal dysfunction, musculoskeletal complications, and therapeutic procedures, as well as from other conditions common to all children (e.g., otitis media, dysmenorrhea, or appendicitis) [7, 10, 11]. These children experience difficulty in pain management, and their pain has been poorly recognized and inadequately managed [12]. Despite analgesic use, the frequency of pain episodes and level of distress are invariant over time, which indicates the inadequacy of treatment [13].
There have been no studies of day-to-day pain in SMID children in Japan because these children have previously lived only in dedicated facilities or at home. Recently, children with SMID who require advanced medical care (e.g., tracheostomy management, ventilator management, or tube feeding) have started to attend school or to go on outings. A common tool is needed to explore the experiences of children with SMID as their social interactions increase. However, there are no assessment tools to evaluate day-to-day pain in this population. The Paediatric Pain Profile (PPP) is a 20-item behavioral rating scale that assesses pain in children with severe neurological disabilities. Each item is rated on a four-point scale (0–3) as occurring “not at all” to “a great deal” in a particular given time period. Hunt et al. have assessed the validity and reliability of the scale [14].
Thus, we were able to develop a Japanese version of the PPP to assess pain in children with SMID, and to evaluate several types of reliability and validity of the scale.
Methods
Translation
We used the back-translation method. In the first step, we obtained permission from the developer (Dr. Anne Hunt) and the copyright-holding facility (University College London Business) to translate the PPP into Japanese. Next, two pediatricians, a palliative care doctor, and two nursing researchers translated the PPP into Japanese. In the third step, we conducted a pilot test on two children with SMID. In the fourth step, a professional bilingual (English and Japanese) translator back-translated the Japanese version of the PPP into English. Only a few minor changes were needed to the back-translated Japanese version of the scale. Therefore, there were no substantial problems with the translation and a final Japanese version of the PPP was produced (Fig 1).
Abbreviations: PPP, Paediatric Pain Profile; SMID, severe motor and intellectual disabilities; UCLB, University College London Business.
Procedures
First, 30 children were filmed during their daily nursing care. The care included one of nine activities: suction, rehabilitation, dietary and oral care, repositioning, transfer, tracheal cannula care, blood sampling, bed bathing, changing clothes. We recorded approximately 10–50 minutes of movies per child and extracted and edited low pain and high pain recordings from the movie. Finally, each child’s movie was edited to produce approximately 30 seconds of low pain observations and approximately 30 seconds of high pain observations. One-minute movie episodes (i.e., 30 seconds of low pain and 30 seconds of high pain observations per child) were then individually rated by each of three evaluators using a 20-item Japanese version of the PPP.
Next, we recorded the following participant characteristics: age, sex, and current medical care.
Each of three specialist SMID nurses evaluated each child’s pain using the Japanese version of the PPP and the Japanese version of the Face, Legs, Activity, Cry, Consolability (FLACC) behavioral scale [15], while watching a video recording of the child’s daily care session. The FLACC behavioral scale has shown high reliability and validity for assessing acute pain in nonverbal children [16–18]. The Japanese version of the scale has also shown good to excellent reliability and validity. This scale comprises five items, each of which is rated on a three-point scale (0–2).
Finally, 1 week after the test, the same nurses used the PPP to evaluate each child’s pain again while watching the same video recordings.
Statistical analyses
For the analyses, we referred to methods for testing reliability and validity used in previous studies on the PPP [14, 19]. The rating of the video observations is a less common type of reliability check, so we could briefly mention the successful use of video observations by Hunt et al. (2004) (reference 14) to support our own use of this method.
The reliability of a scale is the extent to which measurements using that scale are reproducible; that is, whether measurements of individuals on different occasions, by different observers, or using similar or parallel tests produce the same or similar results [20]. To assess reliability, intra- and inter-observer reliability were examined using intraclass correlation coefficients (ICCs). Test–retest reliability was calculated using Spearman’s rank correlation coefficient for the retest interval (1 week). Cronbach’s alpha, a statistic that describes the average correlation between each of the items on a scale, was used to assess the internal consistency of the scale. Cronbach’s alpha values of 0.7 or greater were considered indicative of excellent internal consistency. The kappa coefficient index of Landis & Koch [21] has often been used to assess ICC values. According to their guidelines, the strength of agreement indicated by an alpha of 0.00–0.20 is “slight”; 0.21–0.40 is “fair”; 0.41–0.60 is “moderate”; 0.61–0.80 is “substantial”; and 0.81–1.00 is “almost perfect”.
To show that a test measures what it was intended to measure requires some evidence of validity [20]. Construct validity is the most important test and is based on logical positivistic assumptions, which require a coherent and well-articulated theory from which to ground validity claims [22]. We calculated the concurrent validity between the PPP and FLACC scores using Spearman’s rank correlation coefficient. In addition, the construct validity of the PPP was evaluated using the Wilcoxon signed-rank test for paired data to compare scores between the “low pain” and “high pain” scenes. Comparisons at p < 0.05 were considered statistically significant.
All analyses were conducted using the Statistical Package for the Social Sciences (SPSS Inc., IBM Company, Chicago, IL, USA) version 25.0.
Participants and sample size
Participants were children with SMID (1–18 years of age) who were patients at three specialized hospitals in Japan from February 2019 to August 2019. Exclusion criteria were the loss of one or more movements, of any facial expressions, or of any eye-catching expressions.
The sample size was based on a power calculation for proportions and was calculated based on the results of a previous study [14]. The correlation for the level of agreement between observers needed to be at least 0.6. We determined that a sample of 30 children would be required for α = 0.05 and a test power (1 − β) of 0.95.
Ethics
The study was first approved by the medical ethics review board of Mie University (U2019-003) and subsequently approved by the medical ethics review board of each of the research hospitals (National Hospital Organization Mie Hospital; 30–35, MEIWA Hospital; G18, Mie prefectural Medical Center for Childgrowth Development and Disability; none). The parents of all subjects provided written informed consent before their participation.
Results
Characteristics
Of the total sample of 30 children, 23 had medical complexity. Table 1 shows the characteristics of children with SMID in this study.
Internal consistency
Internal consistency was good, as shown by a Cronbach’s alpha of 0.928 for “high pain” scenes and 0.735 for “low pain” scenes.
Intra-observer reliability
Each of the three nurses watched and scored the pain-related recordings of the 30 children twice. Table 2 shows the analysis results for the ICC based on a 95% confidence interval for the agreement rate of the two evaluations using a one-way variance model.
The intra-observer agreement rate was r = 0.748. Using the criteria provided by Landis & Koch [21], which have often been used in previous studies, this agreement was judged to be substantial.
Inter-observer reliability
Each of the three nurses watched and scored the pain-related recordings (of both the “low pain” and “high pain” scenes) of the 30 children twice. Table 2 shows the analysis results for the inter-observer agreement rates using a two-way mixed model, including the 95% confidence intervals and the ICCs. The inter-observer agreement rate was r = 0.529, which indicates moderate reliability.
Test–retest reliability
Three nurses scored the 20 items of the PPP twice. The Spearman’s rank correlation between the test–retest scores was r = 0.846, which indicates a very strong positive correlation (Fig 2).
Abbreviations: PPP, Paediatric Pain Profile.
Concurrent validity
Three nurses evaluated the pain of 30 children using the PPP and FLACC scales. The Spearman’s rank correlation between the PPP and FLACC scores was r = 0.629, which indicates a positive correlation (Fig 3).
Abbreviations: FLACC, Face, Legs, Activity, Cry, Consolability Behavioral Sscale; PPP, Paediatric Pain Profile.
Construct validity
Table 3 shows the results of the PPP construct validity analysis, which was conducted using the Wilcoxon signed-rank test for paired data to compare scores between “low pain” and “high pain” scenes.
“High pain” scores were significantly higher than “low pain” scores (p < 0.001).
Discussion
In this study, a Japanese version of the PPP was developed and used by nurses to evaluate the pain of children with SMID in day-to-day care settings. The results demonstrated the reliability and validity of the Japanese version of the PPP.
The PPP reliability results were satisfactory, as indicated by the substantial intra-observer agreement (ICC = 0.748), the moderate inter-observer agreement (ICC = 0.529), the very strong positive test–retest correlation (r = 0.846), and the internal consistency of the scale (ɑ = 0.735–0.928). This suggests that if the same person (e.g., a parent, a physician, a visiting nurse, or a home care teacher) performed continuous evaluations, a more consistent evaluation would be obtained. The intra-observer reliability was better than the inter-observer reliability in this study. This was also the case in a previous study [19], in which the intra-observer reliability (ICC = 0.90) was higher than the inter-observer reliability (ICC = 0.62). In other words, PPP assessment is not suitable for cases in which the observer changes frequently, such as on a hospital ward, or for a single evaluation by more than one observer. The inter-observer reliability in this study was moderate because approximately 40% of the children were tracheostomized (dysphonic), which meant that it was difficult to evaluate PPP item 4 (“Cries /moans/groans/screams or whimpers”). In addition, approximately 75% of the children were tube-fed, so it may have been difficult to evaluate item 7 (“Is reluctant to eat/difficult to feed”). Items with an ICC < 0.4 were considered to contain terms that were confusing for the observers, or terms that did not correctly express the behavioral response to pain shown by children with SMID.
Regarding the validity of the scale, there was a strong positive correlation between the Japanese version of the PPP and the Japanese version of the FLACC, which assesses postoperative acute pain in non-communicative children. In addition, PPP scores increased significantly from “low pain” to “high pain” scenes. Therefore, we consider the Japanese version of the PPP to be a suitable tool for assessing pain in children with SMID.
Pain is always subjective. However, we must consider potential biases in self-reporting that reflect the level of cognitive function, expectations, and needs of the patient [23]. Nonverbal expressions are generally considered more reliable than self-reports because they add context and meaning to the language and do not involve as much conscious control in language use [24]. Proxy assessments of pain in nonverbal children are also influenced by the children’s ability to express themselves and by potential biases [25]. In addition, observers may differ in their assessments of pain based on their own particular experience, resulting in differences, for example, in how the child’s behavioral response is perceived or an assumption that the child must be in pain.
Questions for future studies are why these nonverbal behavioral responses to pain appear; the sequence of events leading to pain-induced distress in the child; and how the observer can resolve the situation. To address these issues, we recommend that staff gain more experience in using pain scales, as Barney et al. [26] have suggested. Medical and support staff need to develop in parallel an educational program to operators and to parents. The present findings confirm that this scale can be applied to children with severe neurological or cognitive impairment, such as spinal muscular atrophy, and we hope that the Japanese PPP will be used to investigate pain in palliative care research.
Conclusion
We demonstrated the reliability and validity of a Japanese version of the PPP. The results suggest that the Japanese version of the PPP is valid but that reliable assessment of pain in children with SMID may require repeated ratings by the same person.
Supporting information
S1 Fig. The Japanese version of the Paediatric Pain Profile.
https://doi.org/10.1371/journal.pone.0243566.s001
(TIF)
S1 Table. This file contains all the data reported in the results.
https://doi.org/10.1371/journal.pone.0243566.s003
(XLSX)
Acknowledgments
We thank Doran Amos, PhD, and Diane Williams, PhD, from Edanz Group (https://en-author-services.edanzgroup.com/ac) for editing a draft of this manuscript and helping to draft the abstract.
References
- 1. Oshima K. Basic issue on severe motor and intellectual disabilities. Jpn J Public Health. 1971;35(11):648–55. (in Japanese)
- 2. Palisano R, Rosenbaum P, Walter S, Russell D, Wood E, Galupp B. Development and reliability of a system to classify gross motor function in children with cerebral palsy. Dev Med Child Neurol. 1997 Apr;39(4):214–23. pmid:9183258.
- 3. Mensch SM, Echteld MA, Lemmens R, Oppewal A, Evenhuis HM, Rameckers EAA. The relationship between motor abilities and quality of life in children with severe multiple disabilities. J Intellect Disabil Res. 2019 Feb;63(2):100–12. pmid:30175518.
- 4. Hanaoka T, Mita K, Hiramoto A, Suzuki Y, Maruyama S, Nakadate T, et al. Survival prognosis of Japanese with severe motor and intellectual disabilities living in public and private institutions between 1961 and 2003. J Epidemiol. 2010;20(1):77–81. pmid:19946176.
- 5.
Bottos S, Chambers CT. The epidemiology of pain in developmental disabilities. In: Symons F, Oberlander T, eds. Pain in individuals with developmental disabilities. Baltimore, MD: Paul H. Brookes; 2006. p. 67–87.
- 6. Friedrichsdorf SJ, Postier AC, Andrews GS, Hamre KE, Steele R, Siden H. Pain reporting and analgesia management in 270 children with a progressive neurologic, metabolic or chromosomally based condition with impairment of the central nervous system: cross-sectional, baseline results from an observational, longitudinal study. J Pain Res. 2017 Jul;10:1841–52. pmid:28831272
- 7. Hunt A, Mastroyannopoulou K, Goldman A, Seers K. Not knowing—the problem of pain in children with severe neurological impairment. Int J Nurs Stud. 2003 Feb;40(2):171–83. pmid:12559141.
- 8. Parkinson KN, Dickinson HO, Arnaud C, Lyons A, Colver A. SPARCLE group. Pain in young people aged 13 to 17 years with cerebral palsy: cross-sectional, multicentre European study. Arch Dis Child. 2013 Jun;98(6):434–40. pmid:23606716
- 9. Riquelme I, Cifre I, Montoya P. Age‐related changes of pain experience in cerebral palsy and healthy individuals. Pain Med. 2011 Apr;12(4):535–45. pmid:21463475.
- 10. Steele R, Siden H, Cadell S, Davies B, Andrews G, Feichtinger L, et al. Charting the territory: symptoms and functional assessment in children with progressive, non-curable conditions. Arch Dis Child. 2014 Aug;99 (8):754–62. pmid:24833792.
- 11. Breau LM, Camfield CS, McGrath PJ, Finley GA. The incidence of pain in children with severe cognitive impairments. Arch Pediatr Adolesc Med. 2003 Dec;157(12):1219–26. pmid:14662579.
- 12. Warlow TA, Hain RDW. ‘Total Pain’ in children with severe neurological impairment. Children (Basel). 2018 Jan;5(1):13. pmid:29346304
- 13. Siden H, Steele R. Charting the territory: children and families living with progressive life-threatening conditions. Paediatr Child Health. 2015 Apr;20(3):139–44. pmid:25914572
- 14. Hunt A, Goldman A, Seers K, Crichton N, Mastroyannopoulou K, Moffat V, et al. Clinical validation of the Paediatric Pain Profile. Dev Med Child Neurol. 2004 Jan;46(1):9–18. pmid:14974642.
- 15. Matsuishi Y, Hoshino H, Shimojo N, Enomoto Y, Kido T, Hoshino T, et al. Verifying the validity and reliability of the Japanese version of the Face, Legs, Activity, Cry, Consolability (FLACC) behavioral scale. PLoS One. 2018 Mar;13(3):e0194094. pmid:29534083
- 16. Merkel SI, Voepel-Lewis T, Shayevitz JR, Malviya S. The FLACC: a behavioral scale for scoring postoperative pain in young children. Pediatr Nurs. 1997 May-Jun;23(3):293–7. pmid:9220806.
- 17. Malviya S, Voepel-Lewis T, Burke C, Merkel S, Tait AR. The revised FLACC observational pain tool: improved reliability and validity for pain assessment in children with cognitive impairment. Paediatr Anaesth. 2006 Mar;16(3):258–65. pmid:16490089.
- 18. Kabes AM, Graves JK, Norris J. Further validation of the nonverbal pain scale in intensive care patients. Crit Care Nurse. 2009 Feb;29(1):59–66. pmid:19182281.
- 19. Hunt A, Wisbeach A, Seers K, Goldman A, Crichton N, Perry L, et al. Development of the Paediatric Pain Profile: Role of video analysis and saliva cortisol in validating a tool to assess pain in children with severe neurological disability. J Pain Symptom Manage. 2007 Mar;33(3):276–89. pmid:17349497.
- 20.
Streiner DL, Norman GR, Cairney J, editors. Health Measurement Scales. A practical guide to their development and use. 5th ed. New York: Oxford University Press; 2015.
- 21. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977 Mar;33 (1):159–74. pmid:843571.
- 22.
Kane MT. Validation. In: Brennan RL (Ed.), Educational Measurement. 4th ed. 2006;17–64. Westport, CT: American Council on Education/Praeger.
- 23. Stinson JN, Kavanagh T, Yamada J, Gill N, Stevens BJ. Systematic review of the psychometric properties, interpretability and feasibility of self-report pain intensity tools for use in clinical trials in children and adolescents. Pain. 2006 Nov;125(1–2):143–57. pmid:16777328.
- 24. Poole GD, Craig KD. Judgments of genuine, suppressed, and faked facial expressions of pain. J Pers Soc Psychol. 1992 Nov;63(5):797–805. pmid:1447693.
- 25. Craig KD, Versloot J, Goubert L, Vervoort T, Crombez G. Perceiving pain in others: automatic and controlled mechanisms. J Pain. 2010 Feb;11(2):101–8. pmid:19962352.
- 26. Barney CC, Stibb SM, Merbler AM, Summers RLS, Deshpande S, Krach LE, et al. Psychometric properties of the brief pain inventory modified for proxy report of pain interference in children with cerebral palsy with and without cognitive impairment. Pain Rep. 2018 Jul;3(4):e666. pmid:30123858