Figures
Abstract
Objectives
Patient perceived benefit of treatment for cleft lip and/or palate is of great importance since it is central to development of cleft care. CLEFT-Q is a cleft-specific questionnaire on health-related quality of life. Test-retest reliability, aspects of responsiveness and interpretability are yet to be established for CLEFT-Q. This study aims to investigate these psychometric characteristics of CLEFT-Q.
Methods
To establish the test-retest reliability of CLEFT-Q, data will be collected repeatedly and independently at approximately 1-week intervals. Inclusion of approximately 50 patients is considered adequate for a test-retest study. To improve the interpretability of CLEFT-Q norm data from a control population of volunteers without a cleft will be collected. A total of approximately 210 participants will be included from schools, high-schools and universities. To test the responsiveness of CLEFT-Q, patients will answer selected subscales of CLEFT-Q, longitudinal anchor questions and perform global ratings of change before and after surgery. To ensure robust results, approximately 50 patients for each type of treatment will be recruited. If CLEFT-Q is found to be responsive, the pre- and postoperative difference in scores of CLEFT-Q will be compared with the change in objective measurements based on assessments by professionals in cleft care obtained in this study. To evaluate interpretability, results will be analysed to investigate the minimal important change using anchor-based, distribution-based and qualitative approach.
Citation: Stiernman M, Klintö K, Wiedel A-P, Schaar-Johansson M, Cornefjord M, Schönmeyr B, et al. (2025) CLEFT-Q SwePsych protocol: A prospective observational study to investigate the psychometric characteristics test-retest reliability, responsiveness, and interpretability of CLEFT-Q. PLoS One 20(5): e0321021. https://doi.org/10.1371/journal.pone.0321021
Editor: James J. Cray Jr., Ohio State University, UNITED STATES OF AMERICA
Received: November 15, 2024; Accepted: February 27, 2025; Published: May 7, 2025
Copyright: © 2025 Stiernman et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: No datasets were generated or analysed during the current study. All relevant data from this study will be made available upon study completion.
Funding: This work was supported by The Foundation for Plastic Surgery Research, Malmö, Sweden (MS) as well as regional research funds from Region Skåne (USVE 2023-068 to MS).
Competing interests: The authors have declared that no competing interests exist.
Introduction
Cleft lip and/or palate (CL/P) is one of the most common congenital anomalies. Primary surgery is performed to repair the lip, palate and alveolus. Secondary treatment includes surgery to improve speech, decrease visibility of scars, increase fullness and symmetry of the upper lip and nose, increase nose tip projection, normalize skeletal relations in the facial skeleton and improve occlusion.
Various aspects of health-related quality of life (HRQOL) can be affected by CL/P and its treatment [1,2]. Consequently, many interventions for patients with CL/P focus on increasing the quality of life and patient self-perception. Patient perspective therefor is central to evaluating results of treatment [3–6]. Patient reported outcome measures (PROMs) measure different aspects of HRQOL and accordingly provide valuable information to overall outcomes measurement and quality improvement.
CLEFT-QTM is a CL/P-specific HRQOL questionnaire developed through a process involving review of previous field surveys [1], qualitative research on patients with CL/P from different countries [7,8], field studies in 12 different countries, and finally, analysis with Rasch measurement theory (RMT) [9]. CLEFT-Q has been translated into Swedish [10]. Normative scores for the CLEFT-Q subscales have been developed for different ages and diagnostic groups based on 2434 individuals with CL/P aged 8–29 years from 12 countries [9]. Construct validity has been appraised for CLEFT-Q by comparing it with the PROMs Cleft Hearing Appearance and Speech Questionnaire (CHASQ) and Child Oral Health Impact Profile (COHIP) [11]. Several CLEFT-Q subscales were selected for inclusion in the standard set of outcomes for cleft care recommended by the International Consortium for Health Outcomes Measurement (ICHOM) [12]. Continuous effort is being made to refine the standard set and minimising the burden on patients by selecting the most meaningful scales for different subpopulations [13,14] and by developing computerised adapting testing for CLEFT-Q [15].
Previous publications have discussed remaining methodological problems within PROMs in the CL/P population [2,16]. Some were, for example, the lack of qualitative research as well as the lack of studies following the development of patient satisfaction longitudinally and change of satisfaction from before to after interventions. It is also important to describe the level of health within the general population. This facilitates the identification of a cut-off level for pathological values in the CL/P population and when further examination or intervention is indicated. Therefore, including a control population in studies on psychosocial health in individuals with CL/P has also been strongly recommended [2,16–18].
Several areas of interest regarding reliability and validity of CLEFT-Q are still unexplored. These include test-retest reliability, responsiveness and interpretability. Test-retest reliability is defined by the COSMIN initiative (COnsesus based Standards for the selection of health Measurement Instruments) as ‘The extent to which a score for patients who have not changed are the same for repeated measurement over time’ [19]. Further, the COSMIN initiative defines responsiveness as ‘The ability of a PROM to detect change over time in the construct to be measured’ [19]. Testing this characteristic of a PROM is crucial to know if it is suitable for longitudinal follow-up and detecting change before and after treatment. Finally, the COSMIN initiative defines interpretability as ‘The degree to which one can assign qualitative meaning – that is, clinical or commonly understood connotations – to an instrument’s quantitative scores or change in scores’ [19]. In investigating the interpretability of a PROM, one may examine the scores in a control population as well as the minimal clinical important change (MIC) [20].
With these concepts in mind, there remains a critical need to more thoroughly evaluate the CLEFT-Q scales. Specifically, this project will appraise the following:
- Test-retest reliability of CLEFT-Q
- Responsiveness of CLEFT-Q
- MIC of CLEFT-Q
- Control population values for CLEFT-Q
Materials and methods
This project includes several studies which address four separate aims related to the validity and reliability of CLEFT-Q. The methods and analysis for each study will be described separately. A SPIRIT Checklist for this protocol is available as Supporting Information. The Swedish Ethical Review Authority approved this project on 2022-02-15 under the ID nr 2021-06993-01. Major amendments to the trial protocol will be approved by the ethical review authority before being carried out. Local healthcare professionals will inform participants verbally, provide written information, and collect signed consent forms before inclusion in the project. Study data will be collected and managed using REDCap electronic data capture tools hosted at Lund University [21,22]. Results will be disseminated via publications in open access scientific journals and at national and international congresses. Recruitment and data collection began in 2022-09-17. Participant recruitment and data collection is predicted to be completed in 2027 and results are expected in 2028.
Participants
The common inclusion criteria for all studies in the project CLEFT-Q SwePsych are participants between 8 and 29 years of age who can understand written or spoken Swedish. This age interval aligns with earlier validation of CLEFT-Q [23]. Sufficient knowledge of Swedish is crucial for inclusion in the studies since participants must be able to answer the PROM independently from the health care provider. Exclusion criteria are age < 8 or > 29 and insufficient knowledge of Swedish. Patients lost to follow-up will be analysed regarding cleft subtype, sex, age, intervention, and additional diagnoses.
Study 1: Test-retest reliability.
Quantitative data from CLEFT-Q will be collected repeatedly and independently at approximately 1-week intervals. In the period between the two measurements, no interventions will take place. Results from the two different measurement points will be compared to see how reliably CLEFT-Q measures HRQOL. The intraclass correlation coefficient (ICC) will be calculated. Since CLEFT-Q scores are continuous data, analysis of variance (ANOVA) models will be used to calculate test-retest reliability. Inclusion of approximately 50 patients is considered adequate for a test-retest study [20,24]. According to a formula for calculating sample size in the study of de Vet et al. (2011), fifty patients are required to reach an ICC of 0.8 with a confidence interval of 95% + /- 0.1 in a design with two repeated measurements. Patients will be recruited consecutively until 50 patients have been included. They will be recruited at routine check ups at 10, 13, 16 and 19 years of age. Results will be presented according to guidelines for test-retest reliability [25]. Smallest Detectable Change (SDC) will also be calculated [20].
Study 2: Responsiveness.
Participants will be recruited from all six CL/P centres in Sweden – Sahlgrenska University Hospital in Gothenburg, Karolinska University Hospital in Stockholm, Linköping University Hospital, Skåne University Hospital in Malmö, Umeå University Hospital and Uppsala University Hospital. Patients will answer selected subscales of CLEFT-Q and global ratings of change (GRCs) before and after secondary CL/P surgery. The participant timeline is shown in Fig 1. Specific subscales of CLEFT-Q and clinical measurements relevant for each group of secondary CL/P surgery are listed in Table 1.
A construct approach to testing responsiveness will be carried out. All hypotheses are made a priori and listed in Table 2. Responsiveness can be considered high if > 75% of hypotheses are confirmed. A level of 50–75% confirmed hypotheses is considered moderate, and < 50% confirmed hypotheses is considered low responsiveness [20]. Hypotheses will also be ranked according to perceived importance for each type of surgery. More important hypotheses will be given greater influence on the analysis of results alongside the numerical, but arbitrary, cut-offs for levels of responsiveness mentioned above. In Table 2 the hypotheses are listed in order of importance.
GRC (Global Rating of Change)
To ensure robust results, approximately 50 patients for each treatment will be recruited [24,26]. Hypotheses for the magnitude of change are based on normative data from the CLEFT-Q field test [9]. For example, on the subscale Nose, the mean score for patients with cleft lip and palate (CLP) was 53.5 points and the mean score for patients with cleft palate only (CP) was 71.7 points [9]. The difference between the two mean scores is 71.7–53.5 = 18.2 points. Patients with CP are hypothesised to have scores close to that of the general population, since their noses are not affected by their cleft. Patients who undergo a surgical intervention to improve their nose are hypothesised to improve their appearance but not erase the underlying deformity completely. They are therefore hypothesised to improve their CLEFT-Q Nose score somewhat less than 18 points. In this study, the magnitude of improvement of approximately15 points was chosen.
Hypotheses regarding differences in magnitude of different CLEFT-Q subscales are based on how well the subscale is thought to capture the change in HRQOL due to the treatment. Correlations between changes in subscale scores, treatment-specific longitudinal anchors and GRC outcomes will also be analysed to test responsiveness. Hypotheses of differences in the strength of correlations are derived from the similarity between the constructs of the treatment-specific longitudinal anchor, GRC, and the CLEFT-Q subscale.
Calculations regarding correlation hypotheses will be carried out with Spearman’s rank correlation. Correlation strength will be judged as high (≥0.5), moderate (0.3–0.5), or weak (<0.3) in accordance with earlier studies [27]. Power calculations have been carried out to ensure that sample sizes are also adequate to detect a statistical difference between pre- and postoperative scores if the hypotheses regarding magnitude are satisfied. The power calculations are based on normative data from the CLEFT-Q field test [9]. For example, in the normative data from CLEFT-Q the mean Nose subscale score for patients with CL/P was 53.5 and the standard deviation was 21.9. With alpha set at 0.05 and power set to 80%, 50 participants is an adequate sample to ensure that 15 points improvement will be statistically significant (a postoperative Nose subscale score of 53.5 + 15 = 68.5 points).
Pre- and postoperative audio recordings, photographs and cephalometric measurements will be collected. The entire population in each study will be judged by the same raters. For example, all the patients who have had secondary lip surgery will be rated by the same panel of plastic surgeons. Photographs, audio recordings and radiographs will be cropped or edited so that assessors cannot identify individuals, at which center they were treated or whether they are pre- or postintervention.
For nose and lip surgery, photographs will be cropped and rated by three plastic surgeons according to the method described by Asher-McDade et al. (1991) [28]. A change in one point out of the five point scale described by Asher-McDade et al will be used as a an indicator for successful surgery. For orthognatic surgery, dental overjet, overbite and sectioned Modified Huddart/Bodenham (MHB) index will be analysed on dental casts by a panel of three orthodontists or craniofacial surgeons. Photographs of front view, profile view and cephalometric measurements on x-rays will be included to analyse the angles between cefphalometric landmarks sella, nasion and point A (SNA), sella, nasion and point B (SNB), point A, nasion and point B (ANB), and nasion, point A and pogonion (NAPg). A change of four points on the MHB will be used as an indicator for successful surgery. Speech will be documented with standardized audio recordings using the Swedish Test for Nasality and Articulation (SVANTE) [29]. The recordings will be rated independently by three blinded speech-language pathologists specialised in cleft palate speech, which is considered as gold standard [30]. Velopharyngeal competence will be rated using the VPC-Sum and the VPC-Rate [31]. Intra and inter reliability of ratings will be reported. A one-point change of the median score on either speech scale will be used as an indicator for successful surgery.
Depending on the change in the clinical outcomes described above, patients will be divided into successful and unsuccessful treatment subpopulations. The hypotheses for responsiveness of CLEFT-Q will be tested in the total population as well as in the subpopulations.
Study 3: Minimal clinical important change.
Patients will complete selected subscales of CLEFT-Q pre- and postoperatively, see Fig 1. Results will be analysed to investigate MIC using an anchor-based approach, a distribution-based approach and a qualitative approach.
Anchor-based approach.
Patients will be divided into subpopulations of ‘changed’ and ‘not changed’ according to their answers to the longitudinal and retrospective anchors (the GRC). The anchors will be specific to each studied intervention and subscale of CLEFT-Q. The GRC will explicitly mention the specific condition under treatment, construct under examination, time anchor points, and an ordinal scale with seven steps defining change in pre- to postoperative results [32]. An example of the nose surgery specific GRC is demonstrated in Fig 2.
Based on the GRC and longitudinal anchor, differences in CLEFT-Q scores will be analysed with a receiver operating characteristic (ROC) method to test CLEFT-Qs ability to distinguish between participants who have changed and those who haven’t changed after the intervention. This study will use items and response options from CHASQ as longitudinal anchors. Sensitivity and 1- specificity will be plotted on the ROC-curve for different values of MIC. The optimal value, depending on preferred levels of sensitivity and specificity, will be determined and presented according to the visual anchor-based MIC distribution method [33]. The mean difference in PROM-scores between ‘changed’ and ‘not changed’ subpopulations will also be presented as an estimate of MIC. Patients will be divided into subpopulations of successful and unsuccessful treatment depending on clinical change (on audio recordings, photographs, x-ray and/or dental casts). The MIC calculations of CLEFT-Q will be tested in both the total population and the subpopulations.
Distribution-based approach.
In agreement with Norman et al. 2003, a distribution based approach will be used to calculate a value of MIC as 0.5*SD of baseline data collected with CLEFT-Q [34].
Qualitative approach.
Qualitative data will be collected through semi-structured interviews pre- and postoperatively. Interview templates specific to treatment including topics based on a literature review will guide the discussion [7,8,35–38]. Examples of topics are: What are the main expectations before surgical intervention? Have the results after surgical interventions met patient expectations? Are there concepts of importance that CLEFT-Q does not examine? The interview template may be adapted during the study depending on patient answers. Interviews will be recorded and transcribed verbatim. Data will be collected until thematic saturation is achieved (when further interviews no longer contribute to new emerging themes). Interview data will be examined with thematic analysis to find common themes in the material [39]. Themes and quotes will be checked with 10 study participants to ensure representativeness and reliability. A clear pathway of referral to a counsellor will be presented to the patient if such a need should become evident during the interview [40]
Study 4: Control population for CLEFT-Q.
Determining scores of CLEFT-Q in a control population without CL/P is one aspect of investigating the interpretability of the PROM [20]. CLEFT-Q results will be collected from volunteers in schools and high schools who were not born with a cleft. A descriptive presentation of the mean scores of the subscales in CLEFT-Q for the control population will be presented. These mean scores can then be compared to individuals and populations with CL/P.
The control population will be divided into six subpopulations (girls, boys and ages 8–11 years, 12–16 years, and 17 years and older). Approximately 35 participants per control subpopulation will be included, resulting in 210 healthy volunteers. Power calculations are based on normative data from the CLEFT-Q field test [9]. For example, on the subscale Lips, the mean score for patients with CLP was 58.2 points and the standard deviation 24.8. The mean score for patients with CP was 74.4 points and the standard deviation was 23.1 [9]. Healthy volunteers without CLP are hypothesised to have scores at least as high as patients with CP. Sample sizes are calculated to detect a difference of 74.4–58.2 = 16.2 points with alpha set at 0.05, power set to 80% and standard deviation set to 24. Sample sizes regarding speech are based on differences in scores between subpopulations of patients with CLP, and cleft lip and alveolus (CLA).
Discussion
There is a general lack of CL/P HRQOL research to generate longitudinal data, data from before and after interventions as well as data from a control population. There is also a specific need to further evaluate the test-retest reliability, responsiveness and MIC of CLEFT-Q with a longitudinal study design. The studies presented in this protocol will specifically evaluate these aspects of reliability and validity of CLEFT-Q with both longitudinal and cross-sectional data.
Responsiveness has previously been tested for CLEFT-Q using a longitudinal study design [41]. The study included CL/P patients undergoing revision surgery of the nose or lip or orthognathic surgery. CLEFT-Q was found to be responsive. Limitations included a small sample size (18–31 patients per intervention), and that many participants were lost to follow-up (26–44% per intervention). Additionally, speech scales could not be evaluated since the study did not include speech-improving interventions [41]. Study 2 in this protocol aims to establish responsiveness with larger sample sizes and includes speech-improving interventions.
MIC has previously been tested for CLEFT-Q using two distribution-based approaches: 0.5*Standard Deviation and 0.5*Effect Size [41]. MIC estimates ranged between 5.9–14.4 points of 100. Limitations were a small sample size (18–31 patients per intervention) and that many participants were lost to follow-up (26–44% per intervention). Additionally, speech scales could not be evaluated since the study design did not include speech-improving interventions. The authors recommend that future studies include a larger population, evaluation of more subscales of CLEFT-Q, and include an anchor-based approach. Study 3 in this protocol will complement current research findings on MIC with larger sample sizes, an anchor-based approach and by including speech-improving interventions.
In summary, patient perceived benefit of treatment is of great importance since it is central to care development. The use of a high quality PROM to capture the patient perspective is a fundament of this type of research. This study will add to the examination of the quality of CLEFT-Q by assessing test-retest reliability, responsiveness and interpretability. Results from this study will immediately translate into improved interpretation of patient reported outcomes and inform treatment of patients with CL/P.
Supporting information
S1 File. CLEFT-Q SwePsych protocol SPIRIT Checklist.
https://doi.org/10.1371/journal.pone.0321021.s001
(DOC)
References
- 1. Klassen AF, Tsangaris E, Forrest CR, Wong KWY, Pusic AL, Cano SJ, et al. Quality of life of children treated for cleft lip and/or palate: a systematic review. J Plast Reconstr Aesthet Surg. 2012;65(5):547–57. pmid:22118856
- 2. Stock NM, Feragen KB. Psychological adjustment to cleft lip and/or palate: A narrative review of the literature. Psychol Health. 2016;31(7):777–813.
- 3. Semb G, Brattström V, Mølsted K, Prahl-Andersen B, Zuurbier P, Rumsey N, et al. The Eurocleft study: intercenter study of treatment outcome in patients with complete cleft lip and palate. Part 4: relationship among treatment outcome, patient/parent satisfaction, and the burden of care. Cleft Palate Craniofac J. 2005;42(1):83–92. pmid:15643921
- 4. Wormald JCR, Rodrigues JN. Outcome measurement in plastic surgery. J Plast Reconstr Aesthet Surg. 2018;71(3):283–9. pmid:29233505
- 5. Dobbs TD, Rodrigues J, Hart AM, Whitaker IS. Improving measurement 1: Harnessing the PROMise of outcome measures. J Plast Reconstr Aesthet Surg. 2019;72(3):363–5. pmid:30655241
- 6. Geoghegan L, Dobbs TD, Rodrigues JN. PART 2: Can improvement in outcome measurement in plastic surgery be achieved in today’s health systems?. J Plast Reconstr Aesthet Surg. 2019;72(3):366–8. pmid:30660465
- 7. Wong Riff KWY, Tsangaris E, Goodacre TEE, Forrest CR, Lawson J, Pusic AL, et al. What Matters to Patients With Cleft Lip and/or Palate: An International Qualitative Study Informing the Development of the CLEFT-Q. Cleft Palate Craniofac J. 2018;55(3):442–50. pmid:29437508
- 8. Tsangaris E, Wong Riff KWY, Goodacre T, Forrest CR, Dreise M, Sykes J, et al. Establishing Content Validity of the CLEFT-Q: A New Patient-reported Outcome Instrument for Cleft Lip/Palate. Plast Reconstr Surg Glob Open. 2017;5(4):e1305. pmid:28507866
- 9. Klassen AF, Riff KWW, Longmire NM, Albert A, Allen GC, Aydin MA, et al. Psychometric findings and normative values for the CLEFT-Q based on 2434 children and young adult patients with cleft lip and/or palate from 12 countries. CMAJ. 2018;190(15):E455–62. pmid:29661814
- 10. Tsangaris E, Wong Riff KWY, Dreise M, Stiernman M, Kaur MN, Piplani B, et al. Translation and cultural adaptation of the CLEFT-Q into Arabic, Dutch, Hindi, Swedish, and Turkish. Eur J Plast Surg. 2018;41(6):643–52.
- 11. Miroshnychenko A, Rae C, Riff KW, Forrest C, Goodacre T, Swan M, et al. Psychometric Validation of the CLEFT-Q Patient Reported Outcome Measure: A Prospective Study to Examine Cross-Sectional Construct Validity. Cleft Palate Craniofac J. 2023;60(3):327–35. pmid:34919453
- 12. Allori AC, Kelley T, Meara JG, Albert A, Bonanthaya K, Chapman K, et al. A Standard Set of Outcome Measures for the Comprehensive Appraisal of Cleft Care. Cleft Palate Craniofac J. 2017;54(5):540–54. pmid:27223626
- 13. Ombashi S, Kurniawan MS, Allori A, Sharif-Askary B, Rogers-Vizena C, Koudstaal M, et al. What is the optimal assessment of speech? A multicentre, international evaluation of speech assessment in 2500 patients with a cleft. BMJ Open. 2023;13(12):e071571. pmid:38154881
- 14. Ombashi S, Kurniawan MSIC, Koudstaal MJ, Allori AC, Jansson K, Rogers-Vizena CR, et al. Most Efficient and Meaningful Patient-Reported Appearance Assessment in Different Cleft Types and Age Groups with CLEFT-Q. Plast Reconstr Surg. 2024;153(1):120e–9e. pmid:37054385
- 15. Harrison CJ, Geerards D, Ottenhof MJ, Klassen AF, Riff KWYW, Swan MC, et al. Computerised adaptive testing accurately predicts CLEFT-Q scores by selecting fewer, more patient-focused questions. J Plast Reconstr Aesthet Surg. 2019;72(11):1819–24. pmid:31358447
- 16. Stock NM, Feragen KB, Moss TP, Rumsey N. Toward a Conceptual and Methodological Shift in Craniofacial Research. Cleft Palate Craniofac J. 2018;55(1):105–11. pmid:34162053
- 17. Ranganathan K, Vercler CJ, Warschausky SA, MacEachern MP, Buchman SR, Waljee JF. Comparative effectiveness studies examining patient-reported outcomes among children with cleft lip and/or palate: a systematic review. Plast Reconstr Surg. 2015;135(1):198–211. pmid:25539306
- 18. Aaronson N, Alonso J, Burnam A, Lohr KN, Patrick DL, Perrin E, et al. Assessing health status and quality-of-life instruments: attributes and review criteria. Qual Life Res. 2002;11(3):193–205. pmid:12074258
- 19. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol 2010;63(7):737–45.
- 20.
de Vet HC, Terwee CB, Mokkink LB, Knol DL. Measurement in Medicine. Cambridge University Press; 2011.
- 21. Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)--a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009;42(2):377–81. pmid:18929686
- 22. Harris PA, Taylor R, Minor BL, Elliott V, Fernandez M, O’Neal L, et al. The REDCap consortium: Building an international community of software platform partners. J Biomed Inform. 2019;95:103208. pmid:31078660
- 23.
CLEFT-Q. CLEFT-Q Users Guide. 2022.
- 24.
Mokkink LB, Prinsen AC, Patrick D, Alonso J, Bouter LM, de Vet HC, et al. COSMIN Study Design checklist for Patient-reported outcome measurement instruments. 2019.
- 25. Kottner J, Audigé L, Brorson S, Donner A, Gajewski BJ, Hróbjartsson A, et al. Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed. J Clin Epidemiol. 2011;64(1):96–106. pmid:21130355
- 26. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res. 2010;19(4):539–49. pmid:20169472
- 27. Lokhorst MM, Horbach SER, Waner M, O TM, van der Vleuten CJM, Spuls PI, et al. Responsiveness of quality of life measures in children with peripheral vascular malformations: The OVAMA project. JPRAS Open. 2020;27:70–9. pmid:33364290
- 28. Asher-McDade C, Roberts C, Shaw WC, Gallager C. Development of a method for rating nasolabial appearance in patients with clefts of the lip and palate. Cleft Palate Craniofac J. 1991;28(4):385–90; discussion 390-1. pmid:1742308
- 29. Lohmander A, Lundeborg I, Persson C. SVANTE - The Swedish Articulation and Nasality Test - Normative data and a minimum standard set for cross-linguistic comparison. Clin Linguist Phon. 2017;31(2):137–54. pmid:27552341
- 30. Sell D. Issues in perceptual speech analysis in cleft palate and related disorders: a review. Int J Lang Commun Disord. 2005;40(2):103–21. pmid:16101269
- 31. Lohmander A, Hagberg E, Persson C, Willadsen E, Lundeborg I, Davies J, et al. Validity of auditory perceptual assessment of velopharyngeal function and dysfunction - the VPC-Sum and the VPC-Rate. Clin Linguist Phon. 2017;31(7–9):589–97. pmid:28362219
- 32. Kamper SJ, Maher CG, Mackay G. Global rating of change scales: a review of strengths and weaknesses and considerations for design. J Man Manip Ther. 2009;17(3):163–70. pmid:20046623
- 33. de Vet HCW, Ostelo RWJG, Terwee CB, van der Roer N, Knol DL, Beckerman H, et al. Minimally important change determined by a visual method integrating an anchor-based and a distribution-based approach. Qual Life Res. 2007;16(1):131–42. pmid:17033901
- 34. Norman GR, Sloan JA, Wyrwich KW. Interpretation of changes in health-related quality of life: the remarkable universality of half a standard deviation. Med Care. 2003;41(5):582–92. pmid:12719681
- 35. Kamran R, Longmire NM, Rae C, Riff K, Forrest CR, O’Hara J, et al. Concepts Important to Patients With Facial Differences: A Qualitative Study Informing a New Module of the FACE-Q for Children and Young Adults. Cleft Palate Craniofac J. 2020:1055665620969589.
- 36. Robinson J, Khan N, Fusco L, Malpass A, Lewis G, Dowrick C. Why are there discrepancies between depressed patients’ Global Rating of Change and scores on the Patient Health Questionnaire depression module? A qualitative study of primary care in England. BMJ Open. 2017;7(4):e014519. pmid:28473513
- 37. Longmire NM, Wong Riff KWY, O’Hara JL, Aggarwala S, Allen GC, Bulstrode NW, et al. Development of a New Module of the FACE-Q for Children and Young Adults with Diverse Conditions Associated with Visible and/or Functional Facial Differences. Facial Plast Surg. 2017;33(5):499–508. pmid:28962056
- 38. Tang X, Tong Q, Li Z, Chen Z, Weng M, Zhang X, et al. Difference and Commonness Among CLP, NON-CLP Patients and Their Parents Seeking for Orthodontic Treatment: A Questionnaire Applying Q Methodology to Investigate Motives. Patient Prefer Adherence. 2019;13:2047–60. pmid:31827321
- 39.
Creswell JW, Poth CN. Qualitative inquiry and research design. 4th edition. SAGE Publications; 2018.
- 40. Stiernman M, Österlind K, Rumsey N, Becker M, Persson M. Parental and health care professional views on psychosocial and educational outcomes in patients with cleft lip and/or cleft palate. Eur J Plast Surg. 2019;42(4):325–36.
- 41. Miroshnychenko A, Rae C, Wong Riff K, Forrest CR, Goodacre T, Swan MC, et al. A Prospective Study to Examine Responsiveness and Minimally Important Differences (MIDs) for the CLEFT-Q Scales Following Three Cleft-Specific Operations. Cleft Palate Craniofac J. 2021:10556656211064479.