Figures
Abstract
Background
The psoas major (PM) plays an important role in population with low back pain (LBP). Its evaluation considers clinical history, which can be confused with other lumbar and lower limb conditions, physical examination, with an inconsistent palpation, and imaging tests that provide inconclusive parameters in various studies. Therefore, developing reliable assessment procedures to evaluate PM elastic properties is necessary to improve diagnosis and follow-up. This study aimed to evaluate the intra and inter-examiner reliability of shear wave elastography (SWE) for calculating the PM stiffness in patients with LBP.
Methods
Longitudinal views of the of the PM muscle using SWE were acquired bilaterally in 52 volunteers experiencing moderate LBP and disability. All measurements were performed twice, involving the assessment of shear wave speed and Young’s modulus as indicators of stiffness, with data collected by an experienced examiner and a novice examiner.
Results
Test-retest reliability showed strong consistency regardless of the examiner’s level of experience, with intraclass correlation coefficients (ICCs) exceeding 0.9 for both metrics. However, experienced examiners achieved smaller minimal detectable changes. Inter-examiner reliability was comparatively lower, with ICCs ranging from 0.854 to 0.925, and notable differences in mean values between examiners were observed (p < 0.01).
Conclusion
Excellent test-retest reliability was observed for the measurement of PM muscle stiffness in chronic LBP patients using SWE by both experienced and novice evaluators, although statistically significant differences were found between the two trials for the novice examiner. Inter-examiner reliability was lower, highlighting variability between assessors. To minimize errors and improve consistency and accuracy, if multiple examiners are involved, averaging measurements is recommended.
Citation: Rabanal-Rodríguez G, Fernández-de-las-Peñas C, Álvarez-González J, Roldán-Ruiz A, Buffet-García J, Valera-Calero JA (2025) Shear Wave Elastography for measuring the elastic properties of the Psoas Major muscle: An intra- and inter-examiner reliability study. PLoS One 20(10): e0330076. https://doi.org/10.1371/journal.pone.0330076
Editor: Masatoshi Nakamura, Nishikyushu University: Nishikyushu Daigaku, JAPAN
Received: May 25, 2025; Accepted: July 27, 2025; Published: October 9, 2025
Copyright: © 2025 Rabanal-Rodríguez et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting information files.
Funding: This study was financially supported by Banco Santander (https://www.bancosantander.es) in the form of a researcher training scholarship award (CT25/24) received by GR-R. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. No additional external funding was received for this study.
Competing interests: The authors have read the journal’s policy and confirm there are no patents, products in development or marketed products associated with this research to declare. The authors also confirm adherence to PLOS ONE policies on sharing data and materials.
Introduction
The psoas major (PM) is a long muscle that originates from the transverse processes, vertebral bodies, and intervertebral discs of T12 to L5 and inserts into the lesser trochanter of the femur. Biomechanically, the PM exerts compression and shear forces on the lumbar spine, tending to produce extension moments in the upper lumbar segments and flexion moments in the lower segments. Despite its influence on lumbar spine movements, its primary function is hip flexion, with a secondary role in lumbar stabilization through force transmission during trunk and hip flexion activities [1].
From a clinical perspective, the PM muscle plays a key role in the pathogenesis of painful musculoskeletal conditions affecting the lumbar spine and the lower limbs (e.g., groin injuries, low back pain (LBP), pelvic pain, hip snapping and femoroacetabular impingement) [1] as described in numerous reports [2–7].
Regarding non-specific LBP, it has been established that increased PM stiffness due to continued local contraction could be a relevant etiologic factor, suggesting that tools for its evaluation, like elastography, can provide significant quantitative information for the diagnosis and assessment of these patients [8]. In this context, other authors have also noted overactivation of the PM in patients with chronic LBP [7].
The evaluation of the PM is based on the medical history, clinical examination and complementary imaging tests. Isolated assessment based on clinical history is complex and not recommended, since it may present compatible symptoms with impairments or injuries of other structures of the lumbopelvic region. In the clinical setting, the evaluation can be complemented by palpation (with evidence advising that soft tissue palpation is inconsistent in patients with LBP [9]) or by performing muscular resistance tests (with isometric contractions) or orthopedic tests including the Thomas or FADIR maneuvers [1]. As for imaging tests, the gold standard is considered to be Magnetic Resonance Imaging, although some studies have also used Computerized Tomography [5]. However, in recent years, ultrasound imaging (US) has begun to be studied as a possible alternative for the evaluation of the muscular characteristics of the PM [10–12].
This tool, in particular through sonoelastography, allows to assess tissue elasticity, functioning as an adjunct to B-mode US and offering significant advantages with reduced cost, examination time and complexity. Shear wave elastography (SWE), which employs mechanical shear waves generated by the compressive acoustic waves used during B-mode imaging to measure the velocity of shear wave propagation (which correlates directly with tissue stiffness), despite competing with other elastography techniques for the assessment of musculoskeletal tissues [13], is regarded as the most reliable approach. Evaluating muscle stiffness could offer clinically significant insights, potentially identifying early disease stages when morphological changes are not yet visible on grayscale imaging, making it more informative than parameters such as size, histology, or shape descriptors.
Taking into account the available evidence on changes in the muscular characteristics of the PM in relation to LBP and disc pathology [6,14–22], the uncertain physical examination with an inconclusive palpation and given the scarce published literature on the usefulness and reliability of US assessment for the determination of its properties, only including the study by Zhou et al. [23] in healthy subjects, whose results may not be extrapolated to individuals with LBP regarding their particular characteristics, the development of a study to establish valid and reliable methods for the assessment of the elastic properties of the PM is rationally justified.
This information could enable researchers to continue with a line of knowledge on the association between SWE-obtained parameters and LBP-related variables as suggested on clinical practice guidelines [24], as well as deepening into SWE precision for the discrimination between LBP and non-LBP populations.
Thereby, the primary objective of this study is to evaluate the intra and inter-examiner reliability of a SWE technique for assessing the stiffness of the PM muscle in a cohort of patients experiencing LBP.
Methods
Study design
Between 29th February and 31st May 2024, a longitudinal observational study was carried out to evaluate the intra and inter-examiner reliability estimates of a SWE procedure. The study was conducted in a physiotherapy laboratory within the Faculty of Health Sciences of the Francisco de Vitoria University. To ensure the quality of the report, the guidelines outlined in the Reporting Reliability and Agreement Studies (GRRAS) [25] and Enhancing the QUAlity and Transparency Of health Research (EQUATOR) frameworks [26] were followed. Furthermore, participants’ rights were safeguarded in compliance with the Declaration of Helsinki, and the study protocol was reviewed and approved by the Ethics Committee of the Francisco de Vitoria University (15/2024) prior to data collection.
Participants
A sample of patients with LBP was recruited through local announcements posted at the campus of the Francisco de Vitoria University. The inclusion criteria required participants to be between 18 and 65 years old and to have experienced at least one clinically significant episode of LBP within the past year, without neurological signs, along with mild-to-moderate pain intensity (minimum 3 out of 10 on the Visual Analogue Scale [27]) and associated disability (at least 12 out of 100 on the Oswestry Disability Index [28]) at the time of data collection.
Individuals were excluded if they were undergoing pharmacological treatment that could influence muscle tone, had a history of spinal surgeries, neuropathies (e.g., radiculopathies or myelopathies), serious medical conditions (e.g., tumors, fractures, neurological disorders, or systemic diseases), clinically relevant asymmetries, or widespread musculoskeletal conditions such as fibromyalgia.
Participants who met these criteria were asked to read, understand, and sign a written informed consent form before being scheduled for data collection.
Sample size calculation
The minimum sample size required for this study was calculated using the methodology outlined by Walter et al. [29] for reliability studies involving intraclass correlation coefficients (ICCs). Since no previous studies had evaluated intra or inter-examiner reliability estimates in clinical populations, data from asymptomatic subjects were used as a reference. Zhou et al. [23] reported test-retest reliability ICCs ranging from 0.89 to 0.92 in a sample of 52 subjects.
Based on these references, the following parameters were applied: the minimum acceptable ICC (ρ0) was set at 0.7 (the lower limit of “good reliability” as defined in the literature [30]), the expected ICC (ρ1) at 0.92, the significance level (α) at 0.05, the statistical power (1 – β) at 0.95, with 2 raters (k) and an anticipated dropout rate of 10% due to the longitudinal nature of the study. Consequently, a total of 29 participants were required to ensure adequate statistical power.
Examiners
The study involved two examiners: one experienced examiner with over 10 years of expertise in musculoskeletal US and clinical management of musculoskeletal conditions, and one novice examiner with less than 1 year of experience in both areas who had completed 20 hours of US training.
To enhance methodological rigor (as assessing the left and right side from the same participant cannot be fully considered as independent datapoints), imaging acquisition procedures were randomized for both the order in which volunteers participated and the initially examined side. For each measurement, the entire positioning process of the participant and the probe was restarted to minimize the potential influence of prior alignment or muscle relaxation. The study was conducted in two separate sessions: the first in the morning, between 9:00 and 11:00, and the second in the afternoon, from 13:00–15:00. A strict isolation protocol was followed, with the two examiners working on alternating days to eliminate any possibility of communication or agreement between them.
Participants were asked to attend twice a day for two consecutive days (24-hour interval). The first two visits were in day one (in the morning with examiner A and in the afternoon with examiner B) and the second two visits were in day two (in the morning with examiner B and in the afternoon with examiner A). A single image was obtained in each session. Subsequently, a third investigator coded all the images. Finally, each examiner measured their own images in a randomized order, ensuring blinding to both the participant’s identity and the evaluated side.
US acquisition protocol
The US device used for image acquisition was a Canon Aplio A, equipped with a convex transducer 8C1 (Canon Medical Corp, 1385 Shimoishigami, Otawara, Tochigi 324–8550, Japan). Standard console settings were applied for all acquisitions (Frequency: 5 MHz, Gain: 80 dB, Dynamic Range: 60, Depth: 12 cm).
The imaging acquisition protocol adhered to the guidelines established by Zhou et al. [23]. Participants were positioned in a lateral decubitus posture to ensure spinal and lower limb neutrality. A wedge-shaped cushion was placed posterior to the upper thoracic region to maintain the torso perpendicular to the examination table. Additionally, a square cushion was inserted between the thighs to maintain neutral hip alignment, and a towel was used when necessary to support a neutral lumbar spine position. Participants were instructed to relax their muscles during the procedure to prevent morphological bias caused by muscle contractions [31].
Acoustic coupling gel was applied to the transducer, which was initially positioned superior to the iliac crest along the mid-axillary line. The cranial end of the transducer was then tilted posteriorly by approximately 20° to center the L4 vertebra in the imaging field using B-mode US (Fig 1A). At this location, the PM muscle was identified as the structure overlying the vertebral bodies (Fig 1B).
To capture images of the PM, the transducer’s orientation was adjusted to align parallel to the muscle fibers’ long axis, ensuring the central portion of the transducer was perpendicular to the muscle fibers. Once aligned, the SWE mode was activated, positioning the region of interest in the center of the muscle and covering at least 50% of its area to capture the image, adapting the dimensions to each participant depending on their characteristics. Finally, images were measured by outlining a freely drawn quantification box, ensuring it did not overlap the muscle edges and excluded the muscle fasciae (Fig 1C). All SWE images were acquired using the built-in software of the Canon Aplio A system, which automatically applies standard post-processing algorithms to optimize image quality and measurement accuracy. These include real-time smoothing filters to reduce noise and enhance the visibility of shear wave propagation, as well as temporal averaging to stabilize the color-coded elastograms. The system also applies quality maps to guide the placement of the region of interest, ensuring that measurements are taken only from areas with sufficient signal stability. No manual or external image processing was performed after acquisition.
After contouring the region of interest, the US device automatically provides the shear wave speed and automatically calculates the Young’s modulus. The relationship between these two parameters is based on the physical equation E = 3ρ·SWS2 where E is Young’s modulus (in kilopascals), ρ is the tissue density (usually assumed to be ~ 1000 kg/m3 for soft tissue), and SWS is the shear wave speed (in meters per second). This formula assumes the tissue behaves as a linear, isotropic, and purely elastic medium. Although biological tissues are more complex, this approximation allows the ultrasound system to express stiffness in both m/s (direct measure of SWS) and kPa (converted Young’s modulus), depending on clinical preference or research requirements.
The InBody 770 bioimpedance device (Biospace, Urbandale, IA, USA) was used to assess body composition parameters, collecting values for water volume, body weight, body mass index (BMI) and muscle and body fat mass.
Statistical analysis
All data processing and analysis were performed using the Statistical Package for the Social Sciences (SPSS) v.29.1.1 (Armonk, NY, USA) for Mac OS. All tests were two-tailed, with a significance threshold set at p < 0.05. The distribution of continuous variables was initially evaluated using histograms and Shapiro-Wilk tests.
The demographic and clinical characteristics of the sample were described using descriptive statistics. For categorical variables, frequencies and percentages were reported. Continuous variables were described using measures of central tendency (mean or median) and dispersion (standard deviation or interquartile range), depending on whether the data followed a normal distribution. Gender differences were analyzed using Student’s T-tests for independent samples, reporting the mean difference, 95% confidence intervals, and p-values.
For the reliability analyses, the following metrics were calculated: 1) The mean and standard deviation for each US metric obtained by both operators; 2) The measurement disagreement between examiners, computed as the difference between trials for intra-examiner reliability and as the difference between examiners for single measures and the average of two attempts for inter-examiner reliability, along with the absolute error to avoid underestimating errors; 3) ICCs (ICC3,1 for intra-examiner reliability and ICC3,2 for inter-examiner reliability) using a two-way mixed-effects model for consistency; 4) The standard error of measurement (SEM), calculated as the standard deviation of the mean multiplied by the square root of 1 minus the ICC; 5) The minimal detectable change (MDC), determined as 1.96 times the square root of 2 times the SEM [30]; and 6) the coefficient of variation, calculated to express the relative variability of the measurement error with respect to the average value of the parameter assessed. It was calculated by dividing the absolute error by the mean of the corresponding variable. The result was then multiplied by 100 to express the coefficient of variation as a percentage.
Classification tresholds for ICCs were established following the general guidelines proposed by Koo et al. [30], assigning poor reliability for values under 0.5, moderate reliability for values between 0.5 and 0.75, good reliability for values between 0.75 and 0.9 and excellent reliability for values exceeding 0.9.
Finally, Bland-Altman plots were generated for both shear wave speed and Young’s modulus across examiners. For each parameter, the difference between Trial 1 and Trial 2 values was plotted against their mean. The plots include the mean bias (solid red line) and the 95% limits of agreement (mean difference ±1.96 × standard deviation of the differences; dashed lines). This approach allows for visual inspection of the consistency and spread of repeated measurements. Additionally, to assess the presence of systematic error, we examined whether the mean difference significantly deviated from zero using paired-sample t-tests. Proportional bias was evaluated by performing linear regression analyses between the differences and the means of the two trials; a significant regression slope would indicate proportional error. These analyses were conducted separately for novice and experienced examiners, and for each measurement parameter, to assess the influence of examiner expertise on measurement agreement. The database is available in Supporting information.
Results
Throughout the recruitment phase, 52 individuals indicated a willingness to participate in the study. All prospective participants met the inclusion criteria, and therefore none were excluded. Every image acquired during data collection was reviewed in terms of content and quality, accepted and subsequently analyzed, ensuring that no data were lost. The final sample comprised 35 women and 17 men, and 103 images of the PM muscle were captured from both sides. Given that each examiner acquired two images per muscle, a total of 416 SWE images were acquired and 412 images (n = 4 were excluded due to a data loss) were analyzed.
Table 1 provides an overview of the demographic and clinical profiles of the participants, including gender-based comparisons. Statistically significant differences in demographic variables were found between male and female participants: males were notably younger (p < 0.001), heavier (p < 0.001), and taller (p < 0.001). Differences were also observed in body composition; males exhibited a higher BMI (p = 0.009) and greater water volume (p < 0.001). Nonetheless, despite the BMI discrepancy, there was no significant difference in body fat percentage between sexes (p = 0.271). Clinically, both genders reported comparable levels of pain intensity, classified as moderate (p = 0.584), and similar degrees of disability (moderate disability, p = 0.209).
Reliability estimates for repeated measures of PM stiffness are shown in Table 2. For the experienced examiner, no significant differences emerged between the first and second trials in measurements of shear wave speed or Young’s modulus (both p > 0.05). Conversely, the novice examiner demonstrated significant trial-to-trial differences in both parameters (shear wave speed: p = 0.041; Young’s modulus: p = 0.048). Despite these discrepancies, both examiners achieved excellent intra-class correlation coefficients (ICCs > 0.9), although the experienced examiner exhibited superior consistency across trials (shear wave speed: ICC = 0.971 vs. ICC = 0.930; Young’s modulus: ICC = 0.977 vs. ICC = 0.904). Furthermore, the experienced examiner showed a lower minimal detectable change (MDC), suggesting greater sensitivity to true changes beyond measurement error (shear wave speed: MDC = 0.15 vs. 0.43; Young’s modulus: MDC = 2.06 vs. 7.24). Bland-Altman plots (Fig 2) visually represent these reliability findings, plotting the mean of both trials on the X-axis and their differences on the Y-axis.
(A) Shear wave speed: mean average for an experienced examiner, (B) Shear wave speed: mean average for a novice examiner, C) Young’s Modulus: mean average for an experienced examiner, (D) Young’s Modulus: mean average for a novice examiner. The solid red line represents the mean difference (bias) between trials, while the dashed lines indicate the 95% limits of agreement (mean difference ± 1.96 × standard deviation of the differences). Note: the dashed lines may appear slightly curved due to the smoothing method used by the plotting software, but they reflect constant limits of agreement across the x-axis.
Table 3 presents inter-examiner reliability metrics. Analysis of mean differences revealed that most of the measurements taken by the two examiners differed significantly (shear wave speed: both single and averaged measurements, p = 0.001; Young’s modulus: single measurement, p = 0.01). However, no statistically significant difference was found when comparing the average of two Young’s modulus measurements (p > 0.05). Reflecting these outcomes, inter-examiner ICC values were generally lower than intra-examiner values. Single measurements showed good reliability between examiners (shear wave speed: ICC = 0.854; Young’s modulus: ICC = 0.858). When the average of two measurements was used, reliability improved to good-to-excellent levels (shear wave speed: ICC = 0.857; Young’s modulus: ICC = 0.925). Correspondingly, MDC values were reduced when using averaged measurements, aligning with the improved ICCs.
Discussion
This investigation is, to our knowledge, the first to examine both intra- and inter-examiner reliability of SWE in evaluating PM muscle stiffness in individuals with LBP. Among the key findings, test-retest reliability demonstrated to be high by both novice and experienced examiners, with all reliability indices yielding near-perfect ICCs (>0.9). Despite these strong ICC values, analysis of mean differences revealed that the measurements taken by the experienced examiner were more consistently reproducible. In contrast, although the novice examiner also achieved excellent ICCs, significant trial-to-trial variability was evident, suggesting reduced repeatability relative to the experienced examiner.
Regarding inter-examiner agreement, significant differences in mean scores were identified between examiners, despite good ICC values for both shear wave speed and Young’s modulus. However, when the mean of two measurements was calculated, no significant differences were observed between examiners for Young’s modulus. This approach improved agreement and accuracy, as evidenced by lower absolute error, SEM, and MDC values. Therefore, conducting a second measurement and calculating the mean average is justified for novel examiners for improving intra-examiner reliability. While experienced examiners obtained almost perfect ICCs obtained for a single measurement (>0.97) with no significant differences between trials, novice examiners have a wider improvement margin (as significant differences between trials were found). Therefore, even if the small improvement margin and the extra time required for multiple measurements is not worthy for experienced examiners, novice examiners may benefit from calculating a mean average of multiple trials.
The variability observed in both intra- and inter-examiner reliability when novice examiners were involved may be attributed to inconsistencies in transducer manipulation, including fluctuations in applied pressure, angulation or alignment. Additionally, differences in the interpretation of SWE images could have contributed to this variability, specially regarding precise delineation of the muscle boundaries [32], which may be hindered by potential histological alterations commonly associated with chronic pain [21,33–37].
Future studies should explore these hypotheses by involving novice and experienced examiners assessing healthy subjects. If reliability differences persist in healthy individuals, this would support the transducer handling hypothesis. Conversely, reduced variability in healthy subjects would point toward the impact of muscle histological changes in patients with LBP.
A comparison with Zhou et al. [23] (who reported ICCs of 0.89–0.92 using an examiner with 16 years of clinical experience and 3 years of SWE experience) suggests that experienced examiners are less affected by potential histological changes, supporting the importance of extensive training and experience in probe handling. The reliability observed in Zhou’s study aligns with the results obtained here, regardless of whether participants had pain or were asymptomatic.
The comparison of stiffness scores from asymptomatic individuals reported by Zhou et al. [23] to those in this study should be interpreted with caution as different devices were used and there are no established reliability values for deep muscles. Nevertheless, this contrast revealed higher Young’s modulus values in LBP patients (13.8 ± 3.7 kPa in asymptomatic individuals versus 15.79 ± 4.91 kPa in LBP patients). This difference may indicate that LBP, particularly through its chronic nature, contributes to increased muscle stiffness, possibly linked to the high prevalence of myofascial trigger points in the PM. These differences in stiffness between healthy and LBP populations should be highlighted considering that the mean BMI of the population in our study exceeded 25 points, indicating overweight or obesity, which could imply a greater presence of fatty infiltration in the analyzed muscles [38]. The presence of fatty infiltrates may reduce tissue stiffness and therefore be associated with a lower Young’s modulus, indicating that, in subjects with normal and comparable BMIs, this contrast could be even more notable [39]. Further research is needed to explore whether SWE can effectively differentiate between LBP patients and asymptomatic individuals, the impact of BMI on SWE errors, and its correlation with clinical severity indicators of LBP.
To enhance measurement accuracy, for longitudinal or follow-up assessments, the participation of an experienced examiner is particularly important. Their lower MDC values allow for more accurate identification of true physiological changes in muscle stiffness, thereby decreasing the risk of mistaking measurement variability for actual clinical change. When posible, a single examiner is recommended to be responsible for both acquiring and interpreting SWE images in both clinical practice and research contexts. Variability in technique across different examiners can introduce inconsistencies, potentially leading to misinterpretation of changes in muscle stiffness. When the involvement of multiple examiners is unavoidable, averaging at least two measurements is advised, as this approach minimizes error and enhances measurement reliability, making it appropriate for both clinical and investigative use.
All these findings reinforce the critical role of examiner expertise and methodological rigor in obtaining reliable SWE measurements, particularly in clinical populations.
Despite the valuable results, this study has some limitations that should be acknowledged. The SWE-obtained parameters have not been compared with other clinical variables like the Thomas test, the range of motion in hip extension or the previous physical activity levels, factors that could enrich the clinical contribution of our research. In addition, the sample size is small and the design does not include a group of healthy subjects, which would allow to contrast the results of individuals with similar characteristics except for the presence or absence of LBP. Additionally, the fact that we only evaluated one portion of the PM at rest can restrict the generalizability of our results. Future studies could improve these aspects in their methodology to complement our findings and facilitate their extrapolation and integration in clinical settings.
Conclusion
SWE test-retest reliability for evaluating PM muscle stiffness in individuals with chronic LBP was excellent for both novice and experienced examiners, with ICCs exceeding 0.9, providing highly consistent measurements when performed by the same assessor. In contrast, inter-examiner reliability was notably lower, reflecting considerable variability in SWE outcomes between examiners with differing levels of experience. The accuracy of SWE largely depends on consistency in the evaluator conducting the assessment. When multiple assessors must be involved, averaging multiple measurements is recommended to improve reliability and reduce the potential for measurement error.
References
- 1. Lifshitz L, Bar Sela S, Gal N, Martin R, Fleitman Klar M. Iliopsoas the Hidden Muscle: Anatomy, Diagnosis, and Treatment. Curr Sports Med Rep. 2020;19(6):235–43. pmid:32516195
- 2. Ogon I, Takashima H, Morita T, Oshigiri T, Terashima Y, Yoshimoto M, et al. Is the Lipid Content of the Psoas Major Correlated with Chronic Low Back Pain and Spinopelvic Alignment? A Magnetic Resonance Spectroscopic Study. Asian Spine J. 2020;14(4):430–7. pmid:32693437
- 3. Jokar F, Hosseini SM, Taheri N. The relationship between the psoas major muscle morphology characteristics with disability index and pain in patients with chronic nonspecific low back pain. J Bodyw Mov Ther. 2024;38:454–8. pmid:38763592
- 4. Stanuszek A, Jędrzejek A, Gancarczyk-Urlik E, Kołodziej I, Pisarska-Adamczyk M, Milczarek O, et al. Preoperative paraspinal and psoas major muscle atrophy and paraspinal muscle fatty degeneration as factors influencing the results of surgical treatment of lumbar disc disease. Arch Orthop Trauma Surg. 2022;142(7):1375–84. pmid:33484312
- 5. Seyedhoseinpoor T, Taghipour M, Dadgoo M, Sanjari MA, Takamjani IE, Kazemnejad A, et al. Alteration of lumbar muscle morphology and composition in relation to low back pain: a systematic review and meta-analysis. Spine J. 2022;22(4):660–76. pmid:34718177
- 6. Cui M, Zhang Q, Chen X, Wu H. Analysis of CT imaging changes of psoas major muscles in patients with lumbar disc herniation mainly based on low back pain and lower limb pain. Front Surg. 2023;9:1022903. pmid:36756659
- 7. Cai C, Gazali NA, Tan XN, Soon B, Lee ATH, Aw QWS, et al. Comparison of psoas major activation during standing hip flexion between chronic low back pain and healthy populations. J Back Musculoskelet Rehabil. 2024;37(5):1299–308. pmid:38578881
- 8. Numano T, Habe T, Ito D, Onishi T, Takamoto K, Mizuhara K, et al. A new technique for motion encoding gradient-less MR elastography of the psoas major muscle: A gradient-echo type multi-echo sequence. Magn Reson Imaging. 2019;63:85–92. pmid:31425804
- 9. Nolet PS, Yu H, Côté P, Meyer A-L, Kristman VL, Sutton D, et al. Reliability and validity of manual palpation for the assessment of patients with low back pain: a systematic and critical review. Chiropr Man Therap. 2021;29(1):33. pmid:34446040
- 10. Yaghoubi Z, Ebrahimi Takamjani I, Sarrafzadeh J, Rezasoltani A, Maroufi N. Intrarater reliability of musculoskeletal ultrasound imaging of psoas major muscle in patients with subacute low back pain and healthy controls. Med J Islam Repub Iran. 2020;34:145. pmid:33437741
- 11. Ito K, Maeshima E, Arai N, Saito K, Koshiba H, Maruyama J, et al. Evaluation of the Anatomical Cross-Sectional Area of Psoas Major Muscle Using an Ultrasound Imaging System Combined With an Inertial Measurement Unit: Improved Reliability in the US Using IMU-Based Positioning Techniques. Transl Sports Med. 2024;2024:7774612. pmid:39502872
- 12. Jeon I, Kwon O, Weon J, Choung S, Hwang U. Comparison of psoas major muscle thickness measured by sonography during active straight leg raising in subjects with and without uncontrolled lumbopelvic rotation. Man Ther. 2016;21:165–9. pmid:26231611
- 13. Winn N, Lalam R, Cassar-Pullicino V. Sonoelastography in the musculoskeletal system: Current role and future directions. World J Radiol. 2016;8(11):868–79. pmid:27928468
- 14. Parkkola R, Rytökoski U, Kormano M. Magnetic resonance imaging of the discs and trunk muscles in patients with chronic low back pain and healthy control subjects. Spine (Phila Pa 1976). 1993;18(7):830–6. pmid:8316880
- 15. Barker KL, Shamley DR, Jackson D. Changes in the cross-sectional area of multifidus and psoas in patients with unilateral back pain: the relationship to pain and disability. Spine (Phila Pa 1976). 2004;29(22):E515-9. pmid:15543053
- 16. Kim WH, Lee S-H, Lee DY. Changes in the cross-sectional area of multifidus and psoas in unilateral sciatica caused by lumbar disc herniation. J Korean Neurosurg Soc. 2011;50(3):201–4. pmid:22102949
- 17. Ploumis A, Michailidis N, Christodoulou P, Kalaitzoglou I, Gouvas G, Beris A. Ipsilateral atrophy of paraspinal and psoas muscle in unilateral back pain patients with monosegmental degenerative disc disease. Br J Radiol. 2011;84(1004):709–13. pmid:21081573
- 18. Xu WB, Chen S, Fan SW, Zhao FD, Yu XJ, Hu ZJ. Facet orientation and tropism: Associations with asymmetric lumbar paraspinal and psoas muscle parameters in patients with chronic low back pain. J Back Musculoskelet Rehabil. 2016;29(3):581–6. pmid:26836843
- 19. Sions JM, Elliott JM, Pohlig RT, Hicks GE. Trunk Muscle Characteristics of the Multifidi, Erector Spinae, Psoas, and Quadratus Lumborum in Older Adults With and Without Chronic Low Back Pain. J Orthop Sports Phys Ther. 2017;47(3):173–9. pmid:28158957
- 20. D’hooge R, Cagnie B, Crombez G, Vanderstraeten G, Dolphens M, Danneels L. Increased intramuscular fatty infiltration without differences in lumbar muscle cross-sectional area during remission of unilateral recurrent low back pain. Man Ther. 2012;17(6):584–8. pmid:22784801
- 21. Wan Q, Lin C, Li X, Zeng W, Ma C. MRI assessment of paraspinal muscles in patients with acute and chronic unilateral low back pain. Br J Radiol. 2015;88(1053):20140546. pmid:26105517
- 22. Dangaria TR, Naesh O. Changes in cross-sectional area of psoas major muscle in unilateral sciatica caused by disc herniation. Spine (Phila Pa 1976). 1998;23(8):928–31. pmid:9580961
- 23. Zhou EFM, Wong AYL, Zheng YP, Lam KHS, Fu SN. Reliability of Ultrasound Shear Wave Elastography for Evaluating Psoas Major and Quadratus Lumborum Stiffness: Gender and Physical Activity Effects. Ultrasound Med Biol. 2024;50(4):564–70. pmid:38272743
- 24. Zaina F, Côté P, Cancelliere C, Di Felice F, Donzelli S, Rauch A, et al. A Systematic Review of Clinical Practice Guidelines for Persons With Non-specific Low Back Pain With and Without Radiculopathy: Identification of Best Evidence for Rehabilitation to Develop the WHO’s Package of Interventions for Rehabilitation. Arch Phys Med Rehabil. 2023;104(11):1913–27. pmid:36963709
- 25. Gerke O, Möller S, Debrabant B, Halekoh U, Odense Agreement Working Group. Experience Applying the Guidelines for Reporting Reliability and Agreement Studies (GRRAS) Indicated Five Questions Should Be Addressed in the Planning Phase from a Statistical Point of View. Diagnostics (Basel). 2018;8(4):69. pmid:30249982
- 26. Faggion CM Jr. EQUATOR reporting guidelines should also be used by clinicians. J Clin Epidemiol. 2020;117:149–50. pmid:31550512
- 27. Hjermstad MJ, Fayers PM, Haugen DF, Caraceni A, Hanks GW, Loge JH, et al. Studies comparing Numerical Rating Scales, Verbal Rating Scales, and Visual Analogue Scales for assessment of pain intensity in adults: a systematic literature review. J Pain Symptom Manage. 2011;41(6):1073–93. pmid:21621130
- 28. Tonosu J, Takeshita K, Hara N, Matsudaira K, Kato S, Masuda K, et al. The normative score and the cut-off value of the Oswestry Disability Index (ODI). Eur Spine J. 2012;21(8):1596–602. pmid:22298236
- 29. Walter SD, Eliasziw M, Donner A. Sample size and optimal designs for reliability studies. Statist Med. 1998;17(1):101–10.
- 30. Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med. 2016;15(2):155–63. pmid:27330520
- 31. Latash ML. Muscle coactivation: definitions, mechanisms, and functions. J Neurophysiol. 2018;120(1):88–104. pmid:29589812
- 32. Valera-Calero JA, Navarro-Santana MJ, Plaza-Manzano G, Fernández-de-Las-Peñas C, Ortega-Santiago R. Identifying Demographic, Clinical, Muscular and Histological Factors Associated with Ultrasound Cervical Multifidus Measurement Errors in a Chronic Neck Pain Population. Sensors (Basel). 2022;22(21):8344. pmid:36366042
- 33. Goubert D, De Pauw R, Meeus M, Willems T, Cagnie B, Schouppe S, et al. Lumbar muscle structure and function in chronic versus recurrent low back pain: a cross-sectional study. Spine J. 2017;17(9):1285–96. pmid:28456669
- 34. Freeman MD, Woodham MA, Woodham AW. The role of the lumbar multifidus in chronic low back pain: a review. PM R. 2010;2(2):142–6; quiz 1 p following 167. pmid:20193941
- 35. Danneels LA, Vanderstraeten GG, Cambier DC, Witvrouw EE, De Cuyper HJ. CT imaging of trunk muscles in chronic low back pain patients and healthy control subjects. Eur Spine J. 2000;9(4):266–72. pmid:11261613
- 36. Noonan AM, Brown SHM. Paraspinal muscle pathophysiology associated with low back pain and spine degenerative disorders. JOR Spine. 2021;4(3):e1171. pmid:34611593
- 37. Hodges PW, Danneels L. Changes in Structure and Function of the Back Muscles in Low Back Pain: Different Time Points, Observations, and Mechanisms. J Orthop Sports Phys Ther. 2019;49(6):464–76. pmid:31151377
- 38. Xie Y, Zhan C, Tao H, Li Q, Hua R, Yao Q, et al. Longitudinal variations of paraspinal muscle fatty infiltration in obese patients after bariatric surgery. Eur Spine J. 2025;:10.1007/s00586-025-08970–2. pmid:40494970
- 39. Shahidi B, Padwal JA, Su JJ, Regev G, Zlomislic V, Allen RT, et al. The effect of fatty infiltration, revision surgery, and sex on lumbar multifidus passive mechanical properties. JOR Spine. 2023;6(3):e1266. pmid:37780825