Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Intra- and inter-rater reliability of joint range of motion tests using tape measure, digital inclinometer and inertial motion capturing

  • Laura Fraeulin ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Writing – original draft

    fraeulin@med.uni-frankfurt.de

    Affiliation Institute for Occupational Medicine, Social Medicine and Environment Medicine, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Fabian Holzgreve,

    Roles Conceptualization, Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Institute for Occupational Medicine, Social Medicine and Environment Medicine, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Mark Brinkbäumer,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Institute of Sports Sciences, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Anna Dziuba,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Institute of Sports Sciences, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • David Friebe,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Institute of Sports Sciences, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Stefanie Klemz,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Institute of Sports Sciences, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Marco Schmitt,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Institute of Sports Sciences, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Anna-Lena Theis A.,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Institute of Sports Sciences, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Sarah Tenberg,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Institute of Sports Sciences, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Anke van Mark,

    Roles Supervision, Writing – review & editing

    Affiliation Institute for Occupational Medicine, Social Medicine and Environment Medicine, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Christian Maurer-Grubinger,

    Roles Data curation, Formal analysis, Writing – review & editing

    Affiliation Institute for Occupational Medicine, Social Medicine and Environment Medicine, Goethe-University Frankfurt, Frankfurt am Main, Germany

  • Daniela Ohlendorf

    Roles Conceptualization, Methodology, Resources, Software, Supervision, Writing – review & editing

    Affiliation Institute for Occupational Medicine, Social Medicine and Environment Medicine, Goethe-University Frankfurt, Frankfurt am Main, Germany

Abstract

Background

In clinical practice range of motion (RoM) is usually assessed with low-cost devices such as a tape measure (TM) or a digital inclinometer (DI). However, the intra- and inter-rater reliability of typical RoM tests differ, which impairs the evaluation of therapy progress. More objective and reliable kinematic data can be obtained with the inertial motion capture system (IMC) by Xsens. The aim of this study was to obtain the intra- and inter-rater reliability of the TM, DI and IMC methods in five RoM tests: modified Thomas test (DI), shoulder test modified after Janda (DI), retroflexion of the trunk modified after Janda (DI), lateral inclination (TM) and fingertip-to-floor test (TM).

Methods

Two raters executed the RoM tests (TM or DI) in a randomized order on 22 healthy individuals while, simultaneously, the IMC data (Xsens MVN) was collected. After 15 warm-up repetitions, each rater recorded five measurements.

Findings

Intra-rater reliabilities were (almost) perfect for tests in all three devices (ICCs 0.886–0.996). Inter-rater reliability was substantial to (almost) perfect in the DI (ICCs 0.71–0.87) and the IMC methods (ICCs 0.61–0.993) and (almost) perfect in the TM methods (ICCs 0.923–0.961). The measurement error (ME) for the tests measured in degree (°) was 0.9–3.3° for the DI methods and 0.5–1.2° for the IMC approaches. In the tests measured in centimeters the ME was 0.5–1.3cm for the TM methods and 0.6–2.7cm for the IMC methods. Pearson correlations between the results of the DI or the TM respectively with the IMC results were significant in all tests except for the shoulder test on the right body side (r = 0.41–0.81).

Interpretation

Measurement repetitions of either one or multiple trained raters can be considered reliable in all three devices.

Introduction

Range of motion (RoM) measurements are often used to assess functional mobility [1, 2]. However, unassisted assessments of RoM, which are still performed in medical assessments, are subjective and, therefore, lack reliability [3, 4]. Nevertheless, standardized and objective RoM tests can be useful tools in choosing and evaluating therapeutic treatments in patients with issues of the musculoskeletal system [5, 6]. However, good intra- and inter-rater reliabilities should also be ensured for the respective test procedures using measurement devices in order to properly evaluate the therapy progress [2]. Both the intra- and inter-rater reliability depend on the experience of the rater, the health status of the participants, the accuracy of the instrument, its exact use and the respective test protocol [2, 7].

In clinical practice goniometers, digital inclinometers (DI) and tape measures (TM) are in frequent use [1, 2, 5, 6]. These pieces of equipment are easy to handle and are also cost and time effective. Recently, the smartphone has also received considerable attention as a measuring device and several applications have been compared to goniometer data, (reviewed by Keogh et al. [8]). The authors showed mostly adequate intra- and inter-rater reliability as well as validity for smartphone devices on all joints assessed. However, the intra-rater reliability of the DI has been shown to be equivalent [9] to the goniometer, while the inter-rater reliability was found to be superior [10, 11], since the exact placement of goniometer can be difficult [7, 12, 13]. The reliabilities of the DI and TM procedures, however, do differ [2, 14]; for example, in evaluating the flexion of the lumbar spine reliability was found to be inconsistent for several (slightly) modified versions of the Schoeber method using a TM [1517] and an inclinometer placed on the spine [18, 19].

A relatively new and frequently applied method is inertial motion capturing (IMC); this allows for precise and objective kinematic measurement in real world environments [3, 20, 21]. So far, IMC has been used by researchers when assessing functional RoM [22, 23] but, as yet, not for assessing the maximal range of motion which is held in static positions. Inertial sensors, like they are used in the MVN system by Xsens, combine signals of accelerometers, gyroscopes and magnetometers to determine the position, acceleration and orientation of body segments in space [24]. When multiple sensors are applied, joint angles can be calculated for all three body dimensions by means of biomechanical models [25].

In motion capturing, the optical approach (OMC) is considered the gold standard of kinematic measurements [26, 27]. However, as OMC is executed in the laboratory with several high precision cameras, it is scarcely available and is also expensive. Although research of the validity of IMC systems relies on small sample sizes [28], concurrent findings suggest that the IMC systems deliver good to excellent data especially in the frontal and sagittal planes [24, 27, 29, 30] and in slower and less complex movements [20, 31]. For example, comparing the Xsens system to the Optotrak system, Robert-Lachaine et al. [31] have shown that in the Xsens system the mean root mean square error (RMSE) on all joints was 1.2° in short functional movements compared to 2.8° in faster and more complex tasks (p≤0.001). They also showed, that the differences between the Xsens and Optotrak systems were significantly more attributed to discrepancies in the biomechanical model, rather than technological issues in estimating the orientation of segments (RMSE <5° in manual handling tasks); this has been supported by the findings of Zhang et al. [28]. The validation studies for the Xsens system have already been published in 2016 [32] and 2013 [28] and since then the calibration procedure and data processing protocols have been further developed.

In the present study, we chose to evaluate five traditional joint RoM tests. In these tests, RoM data are captured in static poses, mostly in the sagittal plane (fingertip-to-floor test, Thomas test, retroflexion of the trunk) and in the frontal plane (lateral inclination) while only for the shoulder test is a combination of planes used (Fig 1).

thumbnail
Fig 1. The range of motion measurements.

a) Shoulder test modified after Janda on the left-hand body side, b) modified Thomas test on the left-hand body side, c) retroflexion of the trunk after Janda in the modified version, d) fingertip-to-floor test and e) lateral inclination on the right-hand body side. The measurements were simultaneously recorded in the Xsens system (subjects wears the measurement suit) and the low cost devices.

https://doi.org/10.1371/journal.pone.0243646.g001

The aim of this study was to collect joint range of motion data using low cost device methods (DI and TM) and more objective kinematic data using the Xsens system. For both approaches the intra- and inter-rater reliability are calculated in order to determine their practicability for medical assessment.

Methods

Study protocol

The aim of this study was to compare data obtained with either the DI [5, 7, 13] or TM [33, 34] with the data derived from the IMC system in order to assess the reliability of five RoM tests. Thus, we chose the following five tests (Fig 1) which depict the flexibility of the trunk in all three dimensions as well as the shoulder and the hip: the shoulder test modified after Janda [35], the modified Thomas test [5], retroflexion of the trunk after Janda in a modified version [35], the fingertip-to-floor test [33] and lateral inclination [12].

The study was approved by the ethics committee of Goethe University (2018–46) Frankfurt am Main, Germany.

Subjects

22 healthy subjects (12f/10m) with an average age of 25 years ± 2 volunteered to participate in this prospective study. The subjects were on average 174.1 cm ± 9.8 tall and weighed 66.6 kg ± 11.3. The average body mass index (BMI) was 21.9 ± 2.0 and all subjects were right-handed. All participants provided written informed consent. Two raters (sports scientists, B.A. and B.Sc., respectively) were intensively trained in the use of the measuring systems. They were responsible for performing the range of motion tests and positioning the DI and the TM.

To take part in the study, subjects had to be aged between 18 and 30 years. Exclusion criteria consisted of relevant operations to, or surgical stiffening, of the musculoskeletal system, relevant artificial joint replacement, severe diseases such as ankylosing spondylitis, chronic destructive joint diseases, multiple sclerosis, myodystrophic or neurodegenerative diseases, congenital malpositions of the musculoskeletal system or an acute heriated disc. In addition, the intake of muscle relaxants or other drugs that influence the elasticity of the musculature and pregnancy were considered as contra indicators.

Materials

Inertial Motion Capture (IMC).

The MVN BIOMECH Link system from Xsens (Enschede, Netherlands) was used for kinematic data collection. This system provides position, orientation and angle information, in addition to acceleration and velocity parameters, of the entire human body by means of 17 motion sensors, each consisting of acceleration and inertial sensors as well as gyroscopes. The sampling rate of the system is 240 Hz and the measurement error is specified by the manufacturer as ± 1%. The corresponding software displays the collected data in real-time using kinematic motion reconstruction as a biomechanical model. Due to the arrangement of the sensors within a full body suit (available in various sizes), the system is comfortable to wear without restricting the freedom of movement. In the study, when the position to be measured was reached, a marker was set in the recording session. The recording was executed in the multi-level scenario, which is recommended by Xsens when measurements are not uniquely performed on a flat floor (for our measurements subjects partly lay on a bench or stood on a step, thus, the multi-level scenario was the best setting applicable for these tests). Where subjects stood on even ground, the single-level scenario was chosen. After the measurements were taken, the "HD reprocessing" filter was applied to all recordings, this is provided in the MVN Analyze software and offers the best possible data quality according to the manufacturer.

Digital Inclinometer (DI).

A digital inclinometer (Model: AcumarTM DIGITAL INCLINOMETER Model ACU002 / Lafayette Instrument Company / Lafayette / USA) was used for the angle measurement in the modified Thomas test, in the retroflexion of the trunk modified after Janda and also in the shoulder test modified after Janda. As the DI displays only integers, the absolute measurement error is 0.3° [36].

Tape Measure (TM).

A commercially available tape measure was used to measure the distance between the finger and the ground and for lateral flexion. The measuring tape had a double measuring scale (inch and cm) with the smallest increments being 0.1 cm. The hard surface and the associated reduced buckling behavior enabled exact measurements to be made. The distances were recorded to the nearest 0.1 cm.

Measurement protocol

Prior to the measurements, subjects dressed in the measurement suits of the IMC system which were then calibrated. The five RoM tests described were carried out by the test persons in a randomized order. Since the modified Thomas test, the lateral inclination and the shoulder test modified after Janda were tested on both body sides, this study included a total of eight tests. For each test, 25 repetitions were performed which were recorded simultaneously by the IMC system and the DI or IMC. The first 20 repetitions were recorded by the first rater but not included in any calculations as they were included as warm-up in order to control for acute effects [37, 38]. For rater one, measurement 21–25 were included in the analysis [39]. Subsequently, the second rater measured another five repetitions. The order of the raters was chosen at random.

The raters were responsible for the correct positioning of the subjects at the beginning of each measurement and the application of either the DI or TM. When the subjects had reached the position to be measured, an additional investigator gave a verbal signal for the simultaneous measurement to be taken for the IMC and DI or TM. In the IMC, markers were set in the software which were checked after the recordings. The raters read out loud the angles or distances of the DI or the TM, respectively. The order of the raters was also randomized. The correct execution of the exercises was monitored by an additional investigator who underwent training at the same time as the raters.

Range of motion measurements

In Fig 1 the performed RoM tests are presented. Further detailed information can be found in the methodology paper of Holzgreve et al. [13].

Shoulder test modified after Janda.

In contrast to the test by Janda, in this study the elbow was stretched (Fig 1A) and the raters placed the inclinometer proximal to the processus styloideus radii on the radius. While the test person was lying on a treatment couch, the shoulder joint was free and the arm was lowered in a controlled manner until the tension of the musculature terminated the movement. For each measurement, the rater moved the subject's arm and decided when the defined position had been reached. The test was performed separately for each arm. The joint centers of the wrist and the humerus head were extracted from the IMC. The arm length (distance between the humerus head and wrist) and the height difference between the humerus head and the wrist joint were calculated. The angle was calculated by the sin-1 from height/length. The recording was executed in the multi-level scenario and subsequently HD reprocessed.

Modified Thomas test.

During the measurement session of the modified Thomas test (Fig 1B), the 0° alignment of the pelvis was checked by the raters after every fifth measurement. The inclinometer was then placed on the thigh, proximal to the patella, to determine the joint angle [40, 41]. The leg remained in the same position during all measurements and while changing the raters; the test was also performed separately for each leg. For the analysis of the IMC data, the flexion/extension angle of the hip joint was used. The recording was executed using the multi-level scenario and subsequently reprocessed (not HD).

Retroflexion of the trunk modified after Janda.

The retroflexion of the trunk test, which was modified after Janda (Fig 1C), was determined by placing the inclinometer on the proximal part of the sternum. At the instructor's command, the test person adopted the position to be measured, maintained it for a few seconds and the angle was measured by the rater. For comparison with the IMC system, the orientation angle (y-axis) of the sternum was analyzed. The recording was executed using the multi-level scenario and subsequently HD reprocessed.

Fingertip-to-floor test.

For the fingertip-to-floor tests (Fig 1D), subjects adopted the standardized position on a 15 cm high stool; this ensured that flexible persons could also execute the test with full RoM. For the execution, it was checked that the knees were always stretched and that the index fingers of both hands were brought together. The distances between the floor and the fingertip were measured using a conventional measuring tape. For the corresponding IMC data, the distance between the hand segments and the foot segments were calculated; to do so, the data of the left and right side were averaged for hand and foot segments.

Lateral inclination.

The lateral inclination was executed in a standardized standing position. Sagittal fluctuations in the lateral inclination were eliminated by leaning the back against a wall. The ipsilateral hand of the body side to be measured had to be guided distally, directly along the body. An investigator confirmed that the test person kept their knees straight, that their heels were not lifted off the ground and that their backs were always leant against the wall. If the position to be measured was actively reached by the test person, the rater measured the lateral fingertip-to-floor distance using a measuring tape. Each body side was measured separately. For comparison with the IMC system, the distance between the hand segment and the floor was calculated. The recording was executed in the single level scenario and afterwards HD reprocessed.

Statistical analysis

Statistical analysis was performed using Matlab R2020a, Microsoft Excel 2016, IBM SPSS Statistics 25 and BIAS (Version 11) software programs. The relevant joint angles were provided in the Xsens software except for the specific angle used in the shoulder test. For this test, Matlab was used to calculate the distances between the segments and the angles. In addition, all necessary IMC data were analyzed in Matlab and exported as Excel-files similar to the DI and the TM data.

For the intra-rater reliability of rater 1 and rater 2, the last five measurements of each test were taken into account. Inter-rater reliability was analyzed by comparing the mean values of the last five measurements of rater 1 and rater 2. Reliabilities were calculated by means of intraclass correlation coefficients (ICC) using the BIAS software.

Intra-rater reliability (ICC) was assessed according to Bland and Altmann [42]: where m is the number of observations per subject, SST the total sum of squares, and SSB the sum of squares between persons. Measurements are considered reliable if differences between the two measurements of the same person are small compared to the differences between the individuals.

The measurement error (ME) is the square root of the average of the subject-specific variances:

Repeatability was calculated as the mean difference between two measurements of the same subject, which can be estimated as √2⸱ME [43].

The coefficient of repeatability (CoR) was calculated as an estimation of the minimum detectable change: 1.96 x √2⸱ME.

For inter-rater reliability for the low cost device methods, the “two-way mixed, single-measure” ICC(3,1) according to Shrout and Fleiss [44] was used, where ICC(3,1) values were estimated via this random effects model: with Yij the j-th observation of the i-th person, μ the fixed effect, si the random effect of the i-th person, iid, N(0, σs2), αj the fixed rater effect of the j-th rater, eij the measurement error, iid, N(0, σe2).

Measurements are considered reliable if differences between the observers are small compared to the differences between the individuals. The intra-class correlation coefficient ICC(3,1) measures the relation of the variance that is attributed to a random factor as

ICC(3,1) = [σ2s- σ2r] /[σ2s+ σ2r+ σ2e], with σ2r being the variance of the rater.

Both intra- and inter-rater reliability coefficients (ICCs) were classified by means of the method suggested by Landis and Koch [45]: ICCs 0–0.20 = “slight”, 0.21–0.40 = “fair”, 0.41–0.60 = “moderate”, 0.61–0.80 = “substantial”, 0.81–1.00 = “(almost) perfect”.

As the vast majority of data was normally distributed, the Pearson correlation coefficient was used to show the relationship between the measurement systems using BIAS. Only data of which the synchronicity could be assured was included. The correlation coefficients were classified according to Evans [46]: r <0.2 = “poor”, 0.2–0.4 = “weak”, 0.4–0.6 = “moderate”, 0.6–0.8 = “strong” and >0.8 = “optimal” correlations.

Results

Intra-rater reliability for all tests using the DI or the TM, respectively, showed (almost) perfect results (Table 1). The same was demonstrated for the IMC measurement system (Table 2). For the RoM tests measured in centimeters, the ME ranged from 0.5–1.3cm in the TM method and 0.6–2.7cm in the IMC method. The repeatability ranged from 0.7–1.9cm in the TM method and from 0.9–3.9cm in the IMC method. In the tests measured in degrees (°) the ME was 0.9–3.3° in the DI method and 0.5–1.2° in the IMC method. The repeatability was 1.2–4.7° in the DI method and 0.5–2.8° in the Xsens system.

thumbnail
Table 1. Intra-rater reliability for both raters using the DI and TM.

https://doi.org/10.1371/journal.pone.0243646.t001

thumbnail
Table 2. Intra-rater reliability for both raters using the IMC.

https://doi.org/10.1371/journal.pone.0243646.t002

Inter-rater reliability (Table 3) for the DI and the TM protocols revealed substantial to (almost) perfect agreement. This can also be described for the IMC testing.

thumbnail
Table 3. Inter-rater reliability for using the TM and DI.

https://doi.org/10.1371/journal.pone.0243646.t003

The relationship between the two methods is illustrated in Fig 2 for each test. The correlations between the low cost device methods and the IMC method were shown to be moderate or better and except for the shoulder test on the right-hand body side statistically significant (Table 4). While the fingertip-to-floor test exhibited an optimal correlation, the analysis of the lateral inclination and the modified shoulder test on the right-hand body side revealed strong correlations. The retroflexion of the trunk, the modified shoulder test of the right-hand body side and the Thomas test were found to correlate in a moderate manner.

thumbnail
Fig 2. The relationship between the low cost device methods and the IMC method.

Here the means of the last five measurements of rater 1 were used for the plot.

https://doi.org/10.1371/journal.pone.0243646.g002

thumbnail
Table 4. Pearson correlations between the DI–IMC or TM–IMC measurement systems.

https://doi.org/10.1371/journal.pone.0243646.t004

The main findings of Tables 14, concerning the intra- and inter-rater reliabilities and the correlations, are summarized in Table 5.

thumbnail
Table 5. Summary of the correlations, intra- and inter-rater reliabilities.

https://doi.org/10.1371/journal.pone.0243646.t005

Discussion

The aim of this study was to provide both intra- and inter-rater reliabilities for five joint range of motion tests measured with low cost devices (TM or DI) and an IMC system. The results show, that intra-rater reliability was (almost) perfect in all tests for all three devices; whilst inter-rater reliability was shown to be substantial to (almost) perfect in the DI and the IMC methods and (almost) perfect in the TM methods (Table 5). Consequently, the measurement repetitions of all three devices with either one or multiple trained raters can be considered reliable.

While the TM and the DI are in frequent use, the relatively new IMC systems have not, as yet, been used for static ROM assessment. This study compliments the field of IMC science, being the first to provide intra- and inter-rater reliabilities on static RoM measurements. Although the IMC approach does not show the exact same angles or distances as a TM or a DI (e.g. distance calculation between the hand and foot sensors or the distance from the fingertip to floor measured via TM), the correlations between the methods were moderate to strong (Table 4, Fig 2). It can therefore be assumed that the same or at least a very similar mobility construct was measured. Typically, IMC systems are used for the kinematic measurement of motions, for which reliabilities have been provided. van der Straaten et al. [24] used an Xsens device (Awinda, 60Hz) for the examination of joint angles of the trunk, pelvis, hip, knee and ankle in all three degrees of freedom in a single leg squat. For the sagittal hip RoM the results showed within-session, between-session and between operator reliabilities ICCs of 0.9, 0.86 and 0.86 respectively. Relative to the total RoM executed in the single leg squat, the proportional standard error of measurement (%SEM) in the hip joint was 18%– 20% and the minimal detectable change (MDC) was 8–11°. In the Thomas test of the current study, we also measured the hip RoM in the sagittal plane and demonstrated a considerably lower ME (0.5–1.1°) and CoR (1.4–3.1°). This is possibly due to the more precise Xsens system MVN Link which has a sampling rate of 240Hz and the static nature of the Thomas test which improves measurement accuracy [20].

The results of the TM and DI methods can be evaluated with regard to the concurrent evidence. Regarding the measurement of shoulder mobility, literature provides a large range of different RoM measurements [14, 4750]; given the fact that the shoulder is a highly mobile joint, from a functional point of view, different degrees of freedom are of interest. In these studies, reproducibility has been shown to differ [14, 50, 51]; for example de Winter et al. [48] report ICCs ranging from 0.28–0.90 for the inter-rater reliability and a MDC of 20–25° for glenohumeral abduction and external rotation measured via DI. However, to our knowledge so far, no study has researched the precise RoM used in the current approach, which can be described as a combination of external rotation and abduction of the shoulder. Based on the presented results, this approach produces reliable results and an acceptable CoR ranging from 5.1–6.1° for the DI method and from 3.9–5.5° for the IMC method.

In the Thomas test the present findings of substantial to (almost) perfect reproducibility are supported by concurrent evidence [5, 11]. In addition, the CoRs of 2.4–3.9° in the DI method and 1.4–3.1° in the IMC method support a precise application of the test protocol in medical assessment. However, the Thomas test must, in general, be applied cautiously in medical assessments in obese people [52]; controlling for hip flexion [53] and testing proximal of the patella may be influenced by the curvature of the thigh and the subcutaneous fat tissue. Nevertheless, in the present study, this was not an issue as the subjects had a rather low BMI (21.9 ± 2.0).

In the retroflexion of the trunk, the available results on intra- and inter-rater reliabilities for measuring the retroflexion on different bony landmarks on the back are rather mixed [2, 5457]. Mellin et. al. [52] have shown that the lying position is the most reliable method available (correlations between repetitions: r 7.2–9.2). However, they applied the inclinometer on the spine, whereas in this study the DI was placed on the sternum as it is easy to palpate and provides a solid base for the DI. Up to the present day, no other study has aimed to evaluate the extension of the spine via taking measurements from the sternum. Although reliability was (almost) perfect, the ME in the DI method ranged from 2.71–3.30° and the CoR was 7.5–9.2°; these values can both be considered relatively high as RoM gains in the extension of the spine may only be small. Therefore, we recommend an even clearer definition of the bony landmarks when using the DI assessment. The IMC method, on the other hand, was shown to be a precise approach (ME (1.1–1.2°) and CoR (2.9–3.1°)).

In the lateral inclination the present results add to those from assessments of lateral spinal flexion in a study using a bubble inclinometer [58] and a measurement table against which the subject had to lean [59]; both studies produced (almost) perfect ICCs for the intra-rater reliability and inter-rater reliability, respectively. In the current study, the low ME and good repeatability values add to the practicability of the TM method in medical assessments. The ME of 0.5–1.1cm for the TM is even smaller than previously reported by Inger et al. [60] (2.6cm); here their SDC for this method was 7.3cm [60] which is considerably greater than the CoR of 1.8–2.2cm found in the present investigation. One explanation for this may be the thorough warm-up procedure used in the present investigation. In the IMC method, the CoR was 4.3cm; this could be due to the fact that the position data calculated by signal processing in IMC systems is not entirely accurate [61].

In the fingertip-to-floor test, current evidence shows mixed results. While three studies [1, 34, 62] confirm the current, very good intra-rater reliabilities (ICCs 0.97–0.99), two studies [4, 16] have reported a low reproducibility, possibly due to differences in the study design. Merrit et al. [4] recorded the measurements for the intra-rater reliability on three different days with only one instructional repetition; this, however, is not sufficient to control for acute effects. In addition, the subjects in this study were 18–65 years old. The lack of a warm-up might also explain the poor repeatability Gill et al. [16] described (coefficient of variation: 14.1), since the subjects maintained the flexed position while a rater took repeated measurements. Better reproducibility is presented by Ekedahl et al. [6] who report a 4.5cm MDC, which is similar to the present finding of 2.9–3.7cm CoR.

When selecting measuring instruments for medical assessment, the current findings, in general, support the easy to use low cost devices since the described MEs, repeatabilities and CoRs support to the good reproducibility results. It is only in the retroflexion of the trunk, that the application of the DI on the sternum might be too imprecise to capture actual changes in mobility. The IMC method also showed satisfying results in the tests measured in degrees, especially in the spinal movement where it could provide precise and reliable data. However, when measuring distances between body segments, it should be kept in mind, that position estimation with inertial sensors is not, as yet, entirely precise [61]. While the MVN Link system by Xsens is expensive and probably too time consuming for use in clinical practice, future research should aim to evaluate cheaper and more practical IMC systems.

References

  1. 1. Gauvin MG, Riddle DL, Rothstein JM. Reliability of clinical measurements of forward bending using the modified fingertip-to-floor method. Physical therapy. 1990;70(7):443–7. pmid:2141425
  2. 2. Clarkson HM. Joint Motion and Function Assessment. A Research-Based Practical Guide: Lippincott Williams & Wilkins; 2005.
  3. 3. Cloete T, Scheffer C. Repeatability of an off-the-shelf, full body inertial motion capture system during clinical gait analysis. Conf Proc IEEE Eng Med Biol Soc. 2010;2010:5125–8. pmid:21095808
  4. 4. Merritt JL, McLean TJ, Erickson RP, Offord KP. Measurement of trunk flexibility in normal subjects: reproducibility of three clinical methods. Mayo Clin Proc. 1986;61(3):192–7. pmid:3945120
  5. 5. Clapis PA, Davis SM, Davis RO. Reliability of inclinometer and goniometric measurements of hip extension flexibility using the modified Thomas test. Physiother Theory Pract. 2008;24(2):135–41. pmid:18432516
  6. 6. Ekedahl H, Jönsson B, Frobell RB. Fingertip-to-Floor Test and Straight Leg Raising Test: Validity, Responsiveness, and Predictive Value in Patients With Acute/Subacute Low Back Pain. Arch Phys Med Rehabil. 2012;93(12):2210–5. pmid:22555005
  7. 7. Antonaci F, Ghirmai S, Bono G, Nappi G. Current methods for cervical spine movement evaluation: a review. Clin Exp Rheumatol. 2000;18(2 Suppl 19):S45–52. pmid:10824287
  8. 8. Keogh JWL, Cox A, Anderson S, Liew B, Olsen A, Schram B, et al. Reliability and validity of clinically accessible smartphone applications to measure joint range of motion: A systematic review. PLoS One. 2019;14(5):e0215806. pmid:31067247
  9. 9. Bierma-Zeinstra SM, Bohnen AM, Ramlal R, Ridderikhoff J, Verhaar JA, Prins A. Comparison between two devices for measuring hip joint motions. Clin Rehabil. 1998;12(6):497–505. pmid:9869253
  10. 10. Boone DC, Azen SP, Lin CM, Spence C, Baron C, Lee L. Reliability of goniometric measurements. Phys Ther. 1978;58(11):1355–60. pmid:704684
  11. 11. Roach S, San Juan JG, Suprak DN, Lyda M. Concurrent validity of digital inclinometer and universal goniometer in assessing passive hip mobility in healthy subjects. Int J Sports Phys Ther. 2013;8(5):680–8. pmid:24175147
  12. 12. Petherick M, Rheault W, Kimble S, Lechner C, Senear V. Concurrent validity and intertester reliability of universal and fluid-based goniometers for active elbow range of motion. Phys Ther. 1988;68(6):966–9. pmid:3375320
  13. 13. Holzgreve F, Maltry L, Lampe J, Schmidt H, Bader A, Rey J, et al. The office work and stretch training (OST) study: an individualized and standardized approach for reducing musculoskeletal disorders in office workers. Journal of Occupational Medicine and Toxicology. 2018;13(1):37.
  14. 14. Hoving JL, Buchbinder R, Green S, Forbes A, Bellamy N, Brand C, et al. How reliably do rheumatologists measure shoulder movement? Ann Rheum Dis. 2002;61(7):612–6. pmid:12079902
  15. 15. Fitzgerald GK, Wynveen KJ, Rheault W, Rothschild B. Objective assessment with establishment of normal values for lumbar spinal range of motion. Phys Ther. 1983;63(11):1776–81. pmid:6634943
  16. 16. Gill K, Krag MH, Johnson GB, Haugh LD, Pope MH. Repeatability of four clinical methods for assessment of lumbar spinal motion. Spine. 1988;13(1):50–3. pmid:3381139
  17. 17. Hyytiainen K, Salminen JJ, Suvitie T, Wickstrom G, Pentti J. Reproducibility of nine tests to measure spinal mobility and trunk muscle strength. Scand J Rehabil Med. 1991;23(1):3–10. pmid:2035000
  18. 18. Portek I, Pearcy MJ, Reader GP, Mowat AG. Correlation between radiographic and clinical measurement of lumbar spine movement. Br J Rheumatol. 1983;22(4):197–205. pmid:6652384
  19. 19. Saur PM, Ensink FB, Frese K, Seeger D, Hildebrandt J. Lumbar range of motion: reliability and validity of the inclinometer technique in the clinical measurement of trunk flexibility. Spine (Phila Pa 1976). 1996;21(11):1332–8. pmid:8725925
  20. 20. Al-Amri M, Nicholas K, Button K, Sparkes V, Sheeran L, Davies JL. Inertial Measurement Units for Clinical Movement Analysis: Reliability and Concurrent Validity. Sensors (Basel). 2018;18(3). pmid:29495600
  21. 21. Nuesch C, Roos E, Pagenstert G, Mundermann A. Measuring joint kinematics of treadmill walking and running: Comparison between an inertial sensor based system and a camera-based system. J Biomech. 2017;57:32–8. pmid:28366438
  22. 22. Doğan M, Koçak M, Onursal Kılınç Ö, Ayvat F, Sütçü G, Ayvat E, et al. Functional range of motion in the upper extremity and trunk joints: Nine functional everyday tasks with inertial sensors. Gait Posture. 2019;70:141–7. pmid:30875600
  23. 23. Rigoni M, Gill S, Babazadeh S, Elsewaisy O, Gillies H, Nguyen N, et al. Assessment of Shoulder Range of Motion Using a Wireless Inertial Motion Capture Device-A Validation Study. Sensors (Basel). 2019;19(8). pmid:31013931
  24. 24. van der Straaten R, Bruijnes AKBD, Vanwanseele B, Jonkers I, De Baets L, Timmermans A. Reliability and Agreement of 3D Trunk and Lower Extremity Movement Analysis by Means of Inertial Sensor Technology for Unipodal and Bipodal Tasks. Sensors (Basel, Switzerland). 2019;19(1):141. pmid:30609808
  25. 25. Mundt M, Thomsen W, David S, Dupré T, Bamer F, Potthast W, et al. Assessment of the measurement accuracy of inertial sensors during different tasks of daily living. J Biomech. 2019;84:81–6. pmid:30585155
  26. 26. Inokuchi H, Tojima M, Mano H, Ishikawa Y, Ogata N, Haga N. Neck range of motion measurements using a new three-dimensional motion analysis system: validity and repeatability. Eur Spine J. 2015;24(12):2807–15. pmid:25847728
  27. 27. Teufl W, Miezal M, Taetz B, Fröhlich M, Bleser G. Validity, Test-Retest Reliability and Long-Term Stability of Magnetometer Free Inertial Sensor Based 3D Joint Kinematics. Sensors (Basel). 2018;18(7).
  28. 28. Zhang JT, Novak AC, Brouwer B, Li Q. Concurrent validation of Xsens MVN measurement of lower limb joint angular kinematics. Physiol Meas. 2013;34(8):N63–9. pmid:23893094
  29. 29. Mavor MP, Ross GB, Clouthier AL, Karakolis T, Graham RB. Validation of an IMU Suit for Military-Based Tasks. Sensors. 2020;20(15). pmid:32751920
  30. 30. Karatsidis A, Jung M, Schepers HM, Bellusci G, de Zee M, Veltink PH, et al. Musculoskeletal model-based inverse dynamic analysis under ambulatory conditions using inertial motion capture. Med Eng Phys. 2019;65:68–77. pmid:30737118
  31. 31. Robert-Lachaine X, Mecheri H, Larue C, Plamondon A. Validation of inertial measurement units with an optoelectronic system for whole-body motion analysis. Med Biol Eng Comput. 2017;55(4):609–19. pmid:27379397
  32. 32. Robert-Lachaine X, Mecheri H, Larue C, Plamondon A. Validation of inertial measurement units with an optoelectronic system for whole-body motion analysis. Med Biol Eng Comput. 2016;55. pmid:27379397
  33. 33. Heikkila S, Viitanen JV, Kautiainen H, Kauppi M. Sensitivity to change of mobility tests; effect of short term intensive physiotherapy and exercise on spinal, hip, and shoulder measurements in spondyloarthropathy. J Rheumatol. 2000;27(5):1251–6. pmid:10813296
  34. 34. Perret C, Poiraudeau S, Fermanian J, Colau MM, Benhamou MA, Revel M. Validity, reliability, and responsiveness of the fingertip-to-floor test. Arch Phys Med Rehabil. 2001;82(11):1566–70. pmid:11689977
  35. 35. Smolenski UC BJ, Beyer L, Harke G, Pahnke J, Seidel W. Janda. Manuelle Muskelfunktionsdiagnostik: Theorie und Praxis. 5 ed. Germany: Elsevier Health Sciences; 2016.
  36. 36. Grabe M. Measurement Uncertainties in Science and Technology. 2nd ed. Berlin: Springer International Publishing; 2014.
  37. 37. Boyce D, Brosky JA Jr. Determining the minimal number of cyclic passive stretch repetitions recommended for an acute increase in an indirect measure of hamstring length. Physiother Theory Pract. 2008;24(2):113–20. pmid:18432514
  38. 38. Hatano G, Suzuki S, Matsuo S, Kataura S, Yokoi K, Fukaya T, et al. Hamstring Stiffness Returns More Rapidly After Static Stretching Than Range of Motion, Stretch Tolerance, and Isometric Peak Torque. J Sport Rehabil. 2019;28(4):325–31. pmid:29252096
  39. 39. Frost M, Stuckey S, Smalley LA, Dorman G. Reliability of measuring trunk motions in centimeters. Phys Ther. 1982;62(10):1431–7. pmid:7122701
  40. 40. Kim WD, Shin D. Correlations Between Hip Extension Range of Motion, Hip Extension Asymmetry, and Compensatory Lumbar Movement in Patients with Nonspecific Chronic Low Back Pain. Med Sci Monit. 2020;26:e925080. pmid:32968039
  41. 41. Roach SM, San Juan JG, Suprak DN, Lyda M, Bies AJ, Boydston CR. Passive hip range of motion is reduced in active subjects with chronic low back pain compared to controls. Int J Sports Phys Ther. 2015;10(1):13–20. pmid:25709858
  42. 42. Bland JM, Altman DG. Measurement error and correlation coefficients. BMJ. 1996;313(7048):41–2. pmid:8664775
  43. 43. Bland JM, Altman DG. Measurement error. Bmj. 1996;313(7059):744. pmid:8819450
  44. 44. Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86(2):420–8. pmid:18839484
  45. 45. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–74. pmid:843571
  46. 46. Evans JD. Straightforward statistics for the behavioral sciences. Pacific Grove: Brooks/Cole Pub. Co.; 1996.
  47. 47. Valentine RE, Lewis JS. Intraobserver reliability of 4 physiologic movements of the shoulder in subjects with and without symptoms. Archives of physical medicine and rehabilitation. 2006;87(9):1242–9. pmid:16935062
  48. 48. de Winter AF, Heemskerk MA, Terwee CB, Jans MP, Devillé W, van Schaardenburg DJ, et al. Inter-observer reproducibility of measurements of range of motion in patients with shoulder pain using a digital inclinometer. BMC Musculoskelet Disord. 2004;5:18. pmid:15196309
  49. 49. Green S, Buchbinder R, Forbes A, Bellamy N. A standardized protocol for measurement of range of movement of the shoulder using the Plurimeter-V inclinometer and assessment of its intrarater and interrater reliability. Arthritis Care Res. 1998;11(1):43–52. pmid:9534493
  50. 50. Kolber MJ, Saltzman SB, Beekhuizen KS, Cheng MS. Reliability and minimal detectable change of inclinometric shoulder mobility measurements. Physiother Theory Pract. 2009;25(8):572–81. pmid:19925265
  51. 51. https://de.statista.com/statistik/daten/studie/249080/umfrage/anteile-der-wirtschaftssektoren-am-bruttoinlandsprodukt-bip-der-eu-laender/ (11.09.2018)[
  52. 52. Mellin G, Kiiski R, Weckstrom A. Effects of subject position on measurements of flexion, extension, and lateral flexion of the spine. Spine (Phila Pa 1976). 1991;16(9):1108–10.
  53. 53. Vigotsky AD, Lehman GJ, Beardsley C, Contreras B, Chung B, Feser EH. The modified Thomas test is not a valid measure of hip extension unless pelvic tilt is controlled. PeerJ. 2016;4:e2325. pmid:27602291
  54. 54. Alaranta H, Hurri H, Heliovaara M, Soukka A, Harju R. Flexibility of the spine: normative values of goniometric and tape measurements. Scand J Rehabil Med. 1994;26(3):147–54. pmid:7801064
  55. 55. Burdett RG, Brown KE, Fall MP. Reliability and validity of four instruments for measuring lumbar spine and pelvic positions. Phys Ther. 1986;66(5):677–84. pmid:3703932
  56. 56. Chen SP, Samo DG, Chen EH, Crampton AR, Conrad KM, Egan L, et al. Reliability of three lumbar sagittal motion measurement methods: surface inclinometers. J Occup Environ Med. 1997;39(3):217–23. pmid:9093973
  57. 57. Ng JK, Kippers V, Richardson CA, Parnianpour M. Range of motion and lordosis of the lumbar spine: reliability of measurement and normative values. Spine (Phila Pa 1976). 2001;26(1):53–60. pmid:11148646
  58. 58. Uswr PT. The Reliability of Bubble Inclinometer and Tape Measure in Determining Lumbar Spine Range of Motion in Healthy Individuals and Patients2015. 137–44 p.
  59. 59. Jonsson E, Ljungkvist I, Hamberg J. Standardized measurement of lateral spinal flexion and its use in evaluation of the effect of treatment of chronic low back pain. Ups J Med Sci. 1990;95(1):75–86. pmid:2145682
  60. 60. Inger L, Anderson B, Hildegunn L, Skouen J, Ostelo R, Magnussen L. Responsiveness to Change of 10 Physical Tests Used for Patients With Back Pain. Phys Ther. 2011;91:404–15. pmid:21273629
  61. 61. Pollock DSG. Handbook of Time Series Analysis, Signal Processing, and Dynamics. London: Academic Press; 1999.
  62. 62. Kippers V, Parker AW. Toe-touch test. A measure of its validity. Phys Ther. 1987;67(11):1680–4. pmid:3671506