Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Between-day reliability of local and global muscle-tendon unit assessments in female athletes whilst standardising menstrual cycle phase

  • Scott Newbould ,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Writing – original draft

    s.newbould@leedsbeckett.ac.uk

    Affiliation Carnegie School of Sport, Leeds Beckett University, Leeds, United Kingdom

  • Josh Walker,

    Roles Conceptualization, Methodology, Visualization, Writing – review & editing

    Affiliation Carnegie School of Sport, Leeds Beckett University, Leeds, United Kingdom

  • Alexander J. Dinsdale,

    Roles Conceptualization, Methodology, Writing – review & editing

    Affiliation Carnegie School of Sport, Leeds Beckett University, Leeds, United Kingdom

  • Sarah Whitehead,

    Roles Conceptualization, Methodology, Writing – review & editing

    Affiliations Carnegie School of Sport, Leeds Beckett University, Leeds, United Kingdom, Leeds Rhinos Netball, Leeds, United Kingdom

  • Gareth Nicholson

    Roles Conceptualization, Methodology, Supervision, Writing – review & editing

    Affiliation Carnegie School of Sport, Leeds Beckett University, Leeds, United Kingdom

Abstract

Muscle-tendon unit (MTU) assessments can be categorised into local (e.g., tendon strain) or global (e.g., jump height) assessments. Although menstrual cycle phase may be a key consideration when implementing these assessments in female athletes, the reliability of many MTU assessments is not well defined within female populations. Therefore, the purpose of this study was to report the test-retest reliability of local and global MTU assessments during the early follicular phase of the menstrual cycle. Seventeen naturally menstruating females (age 28.5 ± 7.3 years) completed local and global MTU assessments during two testing sessions separated over 24–72 hours. Local tests included Achilles’ tendon mechanical testing and isometric strength of ankle plantar flexors and knee extensors, whereas global tests included countermovement, squat, and drop jumps, and the isometric midthigh pull. Based on intraclass correlation coefficient (ICC) statistics, poor to excellent reliability was found for local measures (ICC: 0.096–0.936). Good to excellent reliability was found for all global measures (ICC: 0.788–0.985), excluding the eccentric utilisation ratio (ICC 0.738) and most rate of force development metrics (ICC: 0.635–0.912). Isometric midthigh pull peak force displayed excellent reliability (ICC: 0.966), whereas force-time metrics ranged from moderate to excellent (ICC: 0.635–0.970). Excluding rate of force development (coefficient of variation [CV]: 10.6–35.9%), global measures (CV: 1.6–12.9%) were more reproducible than local measures (CV: 3.6–64.5%). However, local metrics directly measure specific properties of the MTU, and therefore provide valuable information despite lower reproducibility. The novel data reported here provides insight into the natural variability of MTU assessments within female athletes which can be used to enhance the interpretation of other female athlete data, especially that which aims to investigate other aspects of variability, such as the menstrual cycle.

Introduction

Measures of muscle-tendon unit (MTU) function are of interest in sport science because of their relationships with athletic performance tasks [13]. Sport scientists frequently use local and global MTU assessments to gain insight into these characteristics and to inform training practices. Local assessments are those that directly measure a specific MTU property (e.g., tendon mechanical properties [i.e., strain or stiffness]), whilst global tests focus on movement outcomes, such as jump height or reactive strength. Global measures are underpinned by the aspects of MTU properties measured in the local assessments, for example, rate of force development (RFD) and jump height are influenced by tendon stiffness [4]. Thus, gaining further insight into the interrelationships between local MTU measures and global outcomes can help inform training priorities. Reliably assessing both local and global MTU properties is therefore of great importance to sport scientists. However, much of the sport science literature does not reflect the increasing participation in female sport seen in recent years [5] and there is currently limited data on the sources of variability in MTU assessments in females.

The scarcity of female-specific reliability data is a concern because differences in the hormonal environment between males and females might result in different magnitudes of biological variation within and between days [6]. Specifically, it must be considered that females present with comparatively higher and more variable levels of hormones such as oestrogen, which can impact both neuromuscular output [7] and tendon mechanical properties [8]. However, the true effect of changes in the hormonal environment on MTU function remains unknown [9]. Until these effects are better understood, reliability data should not be directly transferred between sexes, and when collecting female-specific data, consideration should be made for the hormonal environment at the time of testing. Understanding the natural variability of MTU assessments without the confounding effects of hormonal changes would allow more objective inferences to be made regarding MTU testing and monitoring data in female athletes, especially when comparing different hormonal profiles (e.g., different phases of the menstrual cycle).

Regarding local MTU assessments, the triceps surae muscle group and the Achilles’ tendon (AT) comprise one of the most profiled muscle-tendon complexes in athletes. This is because of the role that the AT has in overall MTU strategy [10,11], and the relationships that exist between AT mechanical properties, such as elongation and stiffness, and performance in sprint [12], jumping [13], and long-distance running tasks [14]. However, if this field of research is to progress to individualised monitoring and prescription of MTU characteristics [15], it is important to first understand the reliability and sensitivity to change of AT metrics. To this end, several studies have reported test-retest reliability data, with a wide range of intraclass correlation coefficients (ICC) and coefficient of variations (CV) reported for both maximal AT elongation (ICC: 0.56–0.95, CV: 4.1–12.3% [1620]) and AT stiffness (ICC: 0.64–0.98, CV: 4.7–13.5% [13,1618,2023]). This variation in reliability is contributed to by the between-study heterogeneity in methods, such as differences in joint configuration, the exact tendinous structure measured, the region used for the calculation of stiffness, and the loading protocol used. Importantly, to the authors’ knowledge, only one of these studies has included female participants (7 males and 4 females in Hansen et al. [16]), and therefore the reliability of these measures within female populations is not well defined.

For global MTU tests, force plate analyses of the countermovement jump (CMJ), squat jump (SJ), drop jump (DJ), and the isometric mid-thigh pull (IMTP) are frequently used to assess and monitor athletes [e.g., 24,25]. Although recommendations have been made for data collection and analysis procedures for these tests [2628], the interpretation of these metrics is limited without coinciding reliability data. The reliability of both basic (e.g., jump height, peak force) and more complex CMJ metrics (e.g., RFD) have been investigated within female populations in studies by Moir et al. [29] and Keogh et al. [30], with both studies reporting high reliability for nearly all metrics (ICC: 0.80–0.97). However, reliability data has scarcely been reported in adult females for the squat jump, drop jump, or isometric midthigh pull, reducing the ability to interpret these tests in this population. Therefore, research which provides reliability data for a variety of global MTU assessments in a female population will provide novel and valuable information for researchers and practitioners.

Many studies have aimed to measure the variability in performance over the course of the menstrual cycle [9], although true understanding of menstrual cycle variability cannot be complete without first understanding the natural variability that exists within this population. Therefore, the assessment of variability without potentially confounding changes in the hormonal environment will provide meaningful context in which both past and future female athlete research can be interpreted. Thus, the aim of this study was to report the test-retest reliability of local and global MTU assessments in naturally menstruating female athletes, whilst standardising menstrual cycle phase.

Materials and methods

Study design

This study used a within-subject repeated measures design, whereby participants completed two testing sessions separated over 24–72 hours during the early follicular phase of the menstrual cycle (days 1–5). The early follicular phase was chosen because it is the phase with the most stable hormonal profile [31], and the timing of this phase was predicted by calendar-based counting using previous cycle length data and confirmed by the presence of menstrual bleeding [32].

Participants

Seventeen naturally menstruating females volunteered to participate in this study (age 28.5 ± 7.3 years, stature 166.8 ± 5.6 cm, body mass 62.6 ± 8.2 kg). ‘Naturally menstruating’ was self-reported and was defined as having cycle lengths between 21–35 days and having had nine or more periods in the previous year [32]. Participants had an average training experience of 8.0 ± 5.9 years and performed 4.8 ± 2.6 training sessions per week, and therefore aligned with Tier 2 of the Participant Classification Framework [33]. Additionally, participants were required to be free from lower-body injuries for six months prior to participation, and to not have a previous AT or patellar tendon rupture. Participant recruitment took place between 6/10/22 and 20/10/23, and all participants provided their written informed consent to participate. The study was approved by the Leeds Beckett Ethics Committee (approval reference 102193) and conformed to the Declaration of Helsinki [34].

Procedures

All participants completed a familiarisation session approximately seven days prior to the first testing visit, which included submaximal and maximal attempts of each test. For the testing sessions, participants first completed a standardised warm up before completing all tests in the following order: CMJ, SJ, DJ, isometric plantar flexion (including measurements of AT mechanical properties), isometric knee extension, and the IMTP. At least 2 minutes rest was provided between each test. Definitions and calculations for all metrics are provided in S1 and S2 Tables.

Mechanical properties of the muscle-tendon unit.

All measurements of the triceps surae MTU were performed on the left leg to quasi-randomise for limb dominance. First, resting AT length was measured using the techniques of Barfod et al. [35]. This technique accounts for the curvature of the AT by using flexible measuring tape to measure the distance between the AT origin and insertion. The origin of the AT was defined as the gastrocnemius medialis myotendinous junction (which in turn was defined as the junction between the most distal point of the gastrocnemius medialis and the AT), and the insertion of the AT as the posterior and superior corner of the calcaneus [35]. Participants were then seated on an isokinetic dynamometer (CSMI, Cybex Humac Norm, Stoughton, MA, USA), sampling at a frequency of 1,000 Hz, with a hip joint angle of 120° (180° in anatomical position), knee joint angle of 180° (fully extended), and an ankle joint angle of 90° (neutral). Recording at 15 Hz, a 60 mm, 128-element linear array ultrasound probe (LV7.5/60/128Z-2, 5.0–8.0 MHz; EchoBlaster128 CEXT-1Z, Telemed UAB; Vilnius, Lithuania) was placed over the gastrocnemius medialis myotendinous junction to measure the displacement of this landmark throughout all protocols below. Firstly, the AT moment arm was calculated using the tendon excursion method [36]. Next, participants performed four submaximal and one maximal plantar flexion contraction to precondition the AT [37]. Then, participants performed graded isometric contractions of 5 seconds each with 20 seconds of rest between each contraction, starting at 20 N.m and increasing in steps of 10 N.m until a maximum was reached, with 3–5 maximal attempts being performed [19]. The displacement of the myotendinous junction during each contraction was taken as a measure of AT elongation, which was defined as the difference between the location of the myotendinous junction at rest and during the isometric portion of each contraction. Definitions of each metric of Achilles’ tendon mechanical properties are provided in S1 Table. Throughout all dynamometry procedures participants were securely strapped into position to minimise joint rotation. However, a certain amount of joint rotation is inevitable, and this results in misalignment of the ankle joint centre and the axis of rotation of the dynamometer, which causes error in the recorded plantar flexion moments. Whilst this study did not account for this rotation, as reported in the Results section, ankle joint rotation was similar between sessions and thus the error in the plantar flexion moment values will be similar across sessions, leaving the reliability results unaffected. A 2D video camera (Sony RX10 Mk3, Sony Group Corporation, Tokyo, Japan) in video mode (50 Hz) was used to record the lower limb in the sagittal plane, and any observed change in ankle joint angle (i.e., from heel lift and dynamometer movement) was measured. These data were used in conjunction with the tendon excursion data to correct the observed myotendinous junction displacement during the contractions for the displacement due to passive rotation [38].

To investigate the reliability of the manual measurement of AT elongation, a random sample of 10 maximal contractions were selected from the whole dataset. For intra-rater reliability, these images were analysed twice by the lead researcher with an intervening period of 48 hours, and for inter-rater reliability, a second experienced ultrasound operator analysed the same videos. The ICCs and CVs for intra- and inter-rater reliability were 0.994 (95% confidence interval [CI]: 0.976–0.999) and 1.47% (CI: 0.69–2.26%), and 0.936 (CI: 0.762–0.984) and 4.80% (CI: 2.50–7.10%), respectively.

Vertical jumps.

All vertical jumps were performed on a Kistler force plate (Kistler 9286BA, Switzerland) sampling at 1,000 Hz, and all force data were analysed using BioWare software (version 5.3.0.7, Kistler, Switzerland). The contact threshold for take-off and landing was defined as the time point when force fell below or rose above, respectively, the value equal to 5 standard deviations of the residual force during the final 200 ms of the flight phase [39,40]. Participants completed four successful trials of each jump, with the average of the best three trials used for analysis [41,42]. For the CMJ and SJ, the best three trials were defined as the trials with the greatest flight time-derived jump height (calculated as d = vit + ½gt2, where d = jump height, Vi = initial velocity, and g = acceleration due to gravity, and t = half of flight time), whereas for the DJ, the three trials with the greatest reactive strength index (RSI) were used. For the CMJ and SJ, the reliability of two jump height methods is reported; flight time derived jump height and the jump height calculated using take-off velocity (see S2 Table for definitions and calculations of global MTU metrics). Throughout all jumps, participants were required to keep their hands on their hips to remove the effect of arm swing to ensure jump performance was predominantly driven by the lower limbs [43]. A 30 second rest was provided between each jump trial.

For the CMJ, participants were instructed to start standing upright, then perform a countermovement to a self-selected depth immediately into a maximal effort vertical jump, aiming to maximise vertical displacement of the centre of mass [26]. For the SJ, participants were instructed to descend to a self-selected squat depth [44], hold this posture for 3 seconds, then perform a maximum effort concentric-only vertical jump, once again aiming to maximise vertical displacement. Any trials with a visible countermovement or that displayed a force unload of ≥10% of body mass before the initiation of the jump were excluded [45]. The onset of both the CMJ and SJ was defined as the time point 30 ms before the vertical force trace moves beyond ± 5 standard deviations of body weight from the quiet weighing period [46]. Finally, for the DJ, participants were instructed to step forwards off a 30 cm box and to perform a maximal effort vertical jump immediately once they make ground contact, whilst keeping ground contact time to a minimum [27].

Isometric strength tests.

As well as the isometric plantar flexion contractions above, participants also performed unilateral isometric knee extensions and the bilateral IMTP. For both tests, participants performed three submaximal trials (at approximately 25, 50, and 75% of maximum effort) and three maximal effort trials. For the knee extension, participants were seated on the Cybex dynamometer with a knee joint angle of 90° and a hip joint angle of 110°, and all contractions were held for 5 seconds. All participants used their left leg to quasi-randomise for limb dominance. IMTP testing followed the setup and protocol guidelines set out by Comfort et al. [28]. In brief, participants stood in a squat rack on a force plate (Kistler 9253B, Switzerland) with a knee joint angle of 125–145° and hip joint angle of 140–150°, with the bar fixed at mid-thigh level. The participants were then instructed to push their feet into the floor and pull on the bar “as fast and hard as possible” for 3 seconds [47,48]. Onset for the IMTP was defined in the same manner as for the CMJ and SJ [49]. Participants had 30 and 60 seconds of rest between each trial of the KE and IMTP, respectively. Strong verbal encouragement was used throughout all strength tests.

Statistical analysis

All data are presented as mean ± standard deviation (SD). Data were checked for normality using a Shapiro-Wilk test, and any variables that were not normally distributed were logarithmically transformed before the calculation of ICCs. To investigate relative and absolute test-retest reliability, the ICC (two-way mixed effects, mean of k measurements, absolute agreement [50]) and CV were used, respectively, both with 95% CI. The ICC was interpreted as ≤ 0.50 = poor, 0.50 to 0.75 = moderate, 0.75 to 0.90 = good, and ≥ 0.90 = excellent [50]. The CV was calculated for each participant as (within-subject SD / mean) * 100, with the mean of values from all participants used as the test-retest CV (e.g., [24]). Additionally, the standard error of measurement (SEM, calculated as SDpooled * √(1-ICC)) and the minimum detectable change (MDC, calculated as 1.96 * √2 * SEM) were calculated. For intra-rater reliability the ICC form used was a two-way mixed effects, single measurement, absolute agreement, whereas for inter-rater reliability, the two-way random effects, single rater, absolute agreement was used [50]. Finally, a two-tailed paired t-test (or Wilcoxon test in the case of non-normally distributed data) was used to assess systematic bias between testing sessions, with α = 0.05. Microsoft Excel (Microsoft Corporations, Redmond, WA, USA) and SPSS Statistics (IBM Corporations, Version 28.0, Armonk, NY, USA) were used for calculations and statistical analyses. Due to technical issues, data are missing for one participant for AT elongation, strain, and stiffness measures, and for two participants for the IMTP and knee extension tests.

Results

All results, including the ICC, CV, mean change, effect size and p-value of change, SEM and MDC, as well as the relevant ICC-based descriptor are displayed for each variable in Tables 1–5. For local MTU metrics, the plantar flexion and knee extension MVCs displayed excellent reliability, the AT moment arm and elongation measures all displayed good to excellent reliability, whereas all stiffness metrics displayed moderate or poor reliability, except ATkindex which showed good reliability (Table 1). All CMJ variables displayed good to excellent reliability (Table 2). For the SJ, all metrics showed excellent reliability, except RFD time bands which were all good, and the eccentric utilisation ratio, which was moderate (Table 3). All DJ variables displayed excellent reliability (Table 4). Peak force from the IMTP displayed excellent reliability, whereas results for force-time variables were mixed, ranging from moderate to excellent. Finally, the DSI displayed good reliability (Table 5).

thumbnail
Table 1. Test-retest reliability of local muscle tendon unit metrics.

https://doi.org/10.1371/journal.pone.0306587.t001

thumbnail
Table 2. Test-retest reliability of countermovement jump metrics.

https://doi.org/10.1371/journal.pone.0306587.t002

thumbnail
Table 5. Test-retest reliability of isometric midthigh pull metrics.

https://doi.org/10.1371/journal.pone.0306587.t005

Discussion

The aim of this study was to report the test-retest reliability of local and global MTU assessments in naturally menstruating female athletes, whilst standardising menstrual cycle phase at the time of testing. Local measures, which included metrics of AT mechanical properties and single-joint isometric strength tests, displayed good to excellent reliability, except for traditional AT stiffness metrics which were poor to moderate. For global measures, good to excellent reliability was found for all CMJ metrics, all SJ metrics except eccentric utilisation ratio (moderate), and all DJ metrics. IMTP peak force also showed excellent reliability, whereas IMTP force-time metrics ranged from moderate to excellent. An overview of the ICC for each variable is provided in Fig 1. To the authors’ knowledge, this is the first study to investigate the reliability of many of these metrics in a female sample, and the first study to standardise menstrual cycle phase of the participants at the time of testing. Therefore, these results provide highly novel insight into the natural variability of MTU assessments within female athletes, data which can be used to enhance the interpretation of a vast array of past and future female athlete research.

thumbnail
Fig 1. Intraclass correlation coefficients (ICC) for all metrics.

MTU, muscle-tendon unit; ATk, Achilles’ tendon stiffness; NATk normalised Achilles’ tendon stiffness; ATkindex, Achilles’ tendon stiffness index; CMJ, countermovement jump; SJ, squat jump; DJ, drop jump; IMTP, isometric midthigh pull; ISO, isometric; RFD, rate of force development; RPD, rate of power development; RSI, reactive strength index; RSImod, reactive strength index modified; kY, vertical stiffness; DSI, dynamic strength index; TOV, take-off velocity; FT, flight time; MVC, maximum voluntary contraction; PF, plantar flexion; KE, knee extension.

https://doi.org/10.1371/journal.pone.0306587.g001

A wide range of reliability scores were seen across the local and global measures in this study (Fig 1), and local measures generally had higher CVs than global ones (except for single-joint isometric moments and global RFD metrics). When interpreting these findings, it is important to consider that any change between sessions is comprised of a combination of measurement error and biological variation [51]. It is plausible that any biological variation that is present could directly affect local measures, but only indirectly affect global ones. This is because local metrics directly measure some specific aspect of the MTU (e.g., tendon strain) whereas global measures have more degrees of freedom and may employ many different motor strategies to achieve a consistent performance outcome. Therefore, the performance outcome in a global test can still be maintained from session to session despite potential changes to MTU properties that underpin the movement. For example, there is the potential for jump strategies to differ between sessions, but the overall outcome of jump height does not (e.g., [52]). As such, global measures such as jump height may lack the precision to detect biological changes in MTU function, whereas local measures can provide more sensitive insights into specific aspects of the MTU.

Two local MTU metrics, plantar flexion and knee extension MVCs, displayed excellent reliability (Table 1). These single joint isometric actions largely remove the ‘skill’ aspect of strength testing [53] and provide reliable insights into the force-generating capacity of a muscle group. The consistent plantar flexion MVC provides a conducive environment for measuring the mechanical properties of the AT, and this analysis found that all AT metrics displayed good to excellent reliability, except for stiffness measures which all displayed either moderate or poor reliability (Table 1). In contrast, ATkindex, calculated as maximum strain divided by maximum AT force [19], displayed good reliability. In congruence with the present study, Rogers et al. [23] found that a similar variable (stiffness calculated as maximum torque divided by maximum elongation) was more reliable than traditional stiffness variables (ICC: 0.87), and Hunter et al. [19] also reported good reliability for ATkindex in males (ICC: 0.75). This metric could be more reliable than traditional stiffness variables (that calculate stiffness over certain ranges of the force-elongation curve) as it only takes into account maximal contractions, meaning that any differences in the lower and mid-portions of the force-elongation curve between days would not influence the outcome. In traditional stiffness variables, these small differences are combined, and possibly amplified, in the subsequent calculations, which may partially explain their lower reliability.

As traditional stiffness metrics account for AT force-elongation properties over a greater range of forces, these metrics might provide a greater fidelity of information, but at the cost of lower between-day reliability. This discrepancy in reliability between different stiffness metrics might also explain the mixed reliability findings from previous research [13,1618,2023], as there is considerable between-study heterogeneity in stiffness calculation methods, as well as actual measurement methods (e.g., chosen landmark for tracking, contraction protocol). As this is the one of the first studies to assess the reliability of these metrics in female athletes, the menstrual cycle phase of the participants was standardised to account for one potentially confounding source of error. Whilst it is difficult to distinguish between the contribution of biological variation and measurement error to the variability in these metrics, the manual measurement of AT elongation displayed excellent intra- and inter-rater reliability in this study. This eliminates one key aspect of measurement error and increases the likelihood that biological variability is the driver of the observed changes, which means that stiffness metrics are perhaps not unreliable per se, but are highly sensitive to small, biologically real changes (e.g., [54]). Ultimately, these metrics provide direct insight into a specific MTU property, and because previous research has highlighted that AT properties are able to distinguish between cohorts [55] and identify changes due to training interventions [56], these metrics may still be of interest to researchers providing that the reproducibility of the metrics is kept in mind.

Regarding global tests, most variables from the CMJ, SJ, and DJ showed good to excellent reliability, with very little change between sessions (Tables 2–4). Insights into these global measures of lower-body performance (e.g., jump height, force production, or RSI) are useful for measuring adaptations to training interventions, and because of their high reliability, any observed changes are likely meaningful. However, because of their increased degrees of freedom allowing for maintenance of the overall performance outcome, these types of outcomes may not be sensitive enough to detect smaller acute changes such as those expected to occur over menstrual cycle phases [9] or because of fatigue [57]. Furthermore, although MTU properties are often inferred through global tests, it must be acknowledged that only local tests can provide objective evidence of specific MTU properties. For example, RSI and CMJ performance are often used as markers of stretch-shortening cycle function [e.g., 58,59], although evidence has shown that true stretch-shortening cycles do not occur for the triceps surae musculature during the CMJ [10] or DJ [60]. Therefore, as well as considering reliability and sensitivity, practitioners should be mindful of the information that each metric can provide when choosing tests and consider how this might inform training and testing practices. A final consideration for global metrics is the use of composite metrics which are comprised of two other variables, such as the eccentric utilisation ratio being calculated using jump heights from the CMJ and SJ [61] and the dynamic strength index calculated from CMJ and IMTP peak force [62]. Importantly, the results of the present study show that even though the input variables show excellent reliability, these composite metrics displayed moderate and good reliability, respectively. Thus, caution should be taken when considering these types of metrics and what information they are able to provide above their constituent parts.

In this study, although high reliability was found for nearly all vertical jump metrics, systematic bias was present between testing sessions for CMJ and DJ flight time and its associated jump height, DJ RSI, and SJ RFD and rate of power development (RPD). The values for these variables were greater in session two, and despite this bias, all variables displayed excellent ICCs and CVs < 10% (excluding SJ RFD), potentially highlighting that the bias is reflecting a small, but real change between sessions. This change might be caused by changes in exercise technique (e.g., changes in landing posture influencing flight time in the CMJ and DJ), or through participants developing the neuromuscular skill of the movement (i.e., a learning effect). This is particularly salient for the DJ, which because of its technical complexity, has been shown to be less reliable than the CMJ and SJ [63]. Thus, the technique and skill requirements for these global metrics and the potential for bias must be considered when choosing tests. Familiarisation sessions can be used to reduce the risk of bias due to a learning effect, and in the present study a familiarisation session was completed by all participants despite previous research showing that familiarisation is not necessary for these three jump tests [6466]. As the results presented here suggest that more familiarisation sessions are required, future research should explore the effects of multiple familiarisations within a trained female athlete sample. Furthermore, CMJ and SJ depths were not controlled in this study which presents another potential source of variability, and therefore future research should also explore the effects of controlling these variables on the reliability of all jump metrics. Overall, due to their generally excellent reliability, once participants are sufficiently familiarised with the movement (to negate systematic bias), these vertical jump tests can reliably assess lower-body performance capacity.

Much has been written about the importance of muscular strength and RFD for sports performance [1]. Muscular strength, measured either locally via plantar flexion and knee extension MVCs or globally via IMTP peak force, showed excellent reliability, underscoring the high reliability of strength tests with limited degrees of freedom (Tables 1 and 5). The RFD measures from the CMJ, SJ and IMTP resulted in good to excellent ICCs, but also high CVs (10.6–35.9%). In the SJ and IMTP, where RFD was measured at 50, 100, and 250 ms from onset, the 250 ms time band proved to be the most reliable. It is expected that calculating RFD over longer time epochs may result in improved reliability, as the absolute change in force measurement produces a smaller relative error over longer time periods and greater force outputs, a trend which has been reported in previous studies [49,67,68]. Additionally, early RFD is driven largely by neuromuscular contributions [47] and has been shown to be more responsive to training [2], highlighting that it may be more sensitive to change than the later time bands, explaining its poorer reliability scores. Therefore, if RFD is to be used, whilst longer time bands (i.e., 250 ms) provide better reliability, it should be considered which aspects of neuromusculoskeletal function are of interest when choosing metrics. It should also be noted that for the IMTP, poorer reliability was found for RFD than the other IMTP force-time metrics (force and impulse over the same 50, 100, and 250 ms time bands as RFD; Table 5), which is in accordance with previous research [67,69,70]. Similarly, RPD displayed greater reliability than RFD in the CMJ and SJ both in the present study and previous investigations [43,71]. Therefore, these findings caution against the use of RFD and promote the use of other force-time metrics instead, such as RPD or impulse.

This study standardised menstrual cycle phase using best practice recommendations [32] to understand the natural variability of the chosen metrics in female athletes without the potentially confounding influence of menstrual cycle related hormone changes. Although this design provides a suitable level of control, future research should measure serum hormone concentrations and record menstrual cycle symptoms to identify the effects that these factors have on outcomes. In addition, to provide a greater scope of support to female athletes, research should also investigate the variability of measures within individuals with other hormonal profiles, such as hormonal contraceptive users or post-menopausal athletes.

Conclusions

This study investigated the test-retest reliability of local and global assessments in female athletes, with the reliability of many variables reported for the first time. This is the first reliability study investigating MTU, strength, and jump data to standardise the menstrual cycle phase of the participants at the time of testing. In summary, local metrics were generally less reproducible than global metrics. However, these metrics allow measurement of specific MTU characteristics that influence function, and as such, can provide valuable information. Good to excellent reliability was found for single-joint isometric strength and metrics of AT mechanical properties, except for traditional stiffness metrics, which were poor to moderate. On the other hand, global metrics generally displayed high reliability, with good to excellent ICCs reported for almost all CMJ, SJ, DJ, and IMTP metrics. These tests can reliably assess overall performance outcomes; however, their increased degrees of freedom means that they lack the necessary precision to monitor and infer specific aspects of MTU function. The data reported here provides insight into the natural variability of MTU assessments within a female athlete population, and therefore this data can be used to contextualise and enhance the interpretation of other female athlete data, particularly that which investigates other aspects of variability, such as the menstrual cycle.

Supporting information

S1 Table. Definitions and calculations of local muscle-tendon unit metrics.

https://doi.org/10.1371/journal.pone.0306587.s001

(DOCX)

S2 Table. Definitions and calculations of global muscle-tendon unit metrics.

https://doi.org/10.1371/journal.pone.0306587.s002

(DOCX)

Acknowledgments

The authors would like to thank the participants for their time and cooperation throughout the study.

References

  1. 1. Suchomel TJ, Nimphius S, Stone MH. The importance of muscular strength in athletic performance. Sports Med. 2016;46(10):1419–49. pmid:26838985
  2. 2. Luis HDJ, Sabido R. Rate of force development: reliability, improvements and influence on performance. A review. Eur J Hum Movem. 2014;33:46–69.
  3. 3. Jarvis P, Turner A, Read P, Bishop C. Reactive strength index and its associations with measures of physical and sports performance: a systematic review with meta-analysis. Sports Med. 2022;52(2):301–30. pmid:34606061
  4. 4. Bojsen-Møller J, Magnusson SP, Rasmussen LR, Kjaer M, Aagaard P. Muscle performance during maximal isometric and dynamic contractions is influenced by the stiffness of the tendinous structures. J Appl Physiol (1985). 2005;99(3):986–94. pmid:15860680
  5. 5. Cowley ES, Olenick AA, McNulty KL, Ross EZ. “Invisible sportswomen”: the sex data gap in sport and exercise science research. Women Sport Phys Act J. 2021;29(2):146–51.
  6. 6. Sims ST, Heather AK. Myths and methodologies: reducing scientific design ambiguity in studies comparing sexes and/or menstrual cycle phases. Exp Physiol. 2018;103(10):1309–17. pmid:30051938
  7. 7. Ansdell P, Brownstein CG, Škarabot J, Hicks KM, Simoes DCM, Thomas K, et al. Menstrual cycle-associated modulations in neuromuscular function and fatigability of the knee extensors in eumenorrheic women. J Appl Physiol (1985). 2019;126(6):1701–12. pmid:30844334
  8. 8. Leblanc DR, Schneider M, Angele P, Vollmer G, Docheva D. The effect of estrogen on tendon and ligament metabolism and function. J Steroid Biochem Mol Biol. 2017;172:106–16. pmid:28629994
  9. 9. McNulty K, Elliott-Sale K, Dolan E, Swinton P, Ansdell P, Goodall S. The effects of menstrual cycle phase on exercise performance in eumenorrheic women: a systematic review and meta-analysis. Sports Med. 2020;50(10):1813–27.
  10. 10. Kurokawa S, Fukunaga T, Nagano A, Fukashiro S. Interaction between fascicles and tendinous structures during counter movement jumping investigated in vivo. J Appl Physiol (1985). 2003;95(6):2306–14. pmid:12871964
  11. 11. Lichtwark GA, Bougoulias K, Wilson AM. Muscle fascicle and series elastic element length changes along the length of the human gastrocnemius during walking and running. J Biomech. 2007;40(1):157–64. pmid:16364330
  12. 12. Houghton L, Dawson B, Rubenson J. Achilles tendon mechanical properties after both prolonged continuous running and prolonged intermittent shuttle running in cricket batting. J Appl Biomech. 2013;29(4):453–62. pmid:22923384
  13. 13. Burgess KE, Connick MJ, Graham-Smith P, Pearson SJ. Plyometric vs. isometric training influences on tendon properties and muscle output. J Strength Cond Res. 2007;21(3):986–9. pmid:17685695
  14. 14. Kubo K, Tabata T, Ikebukuro T, Igarashi K, Tsunoda N. A longitudinal assessment of running economy and tendon properties in long-distance runners. J Strength Cond Res. 2010;24(7):1724–31. pmid:20543735
  15. 15. Arampatzis A, Mersmann F, Bohm S. Individualized muscle-tendon assessment and training. Front Physiol. 2020;11.
  16. 16. Hansen P, Aagaard P, Kjaer M, Larsson B, Magnusson SP. Effect of habitual running on human Achilles tendon load-deformation properties and cross-sectional area. J Appl Physiol (1985). 2003;95(6):2375–80. pmid:12937029
  17. 17. Fouré A, Nordez A, Cornu C. Plyometric training effects on Achilles tendon stiffness and dissipative properties. J Appl Physiol (1985). 2010;109(3):849–54. pmid:20576842
  18. 18. Wu Y, Lien Y, Lin K, Shih T, Wang T, Wang H. Relationships between three potentiation effects of plyometric training and performance. Scand J Med Sci Sports. 2010;20(1).
  19. 19. Hunter S, Werth J, James D, Lambrianides Y, Smith K, Karamanidis K, et al. Reliability and accuracy of a time-efficient method for the assessment of Achilles tendon mechanical properties by ultrasonography. Sensors (Basel). 2022;22(7):2549. pmid:35408164
  20. 20. Robshaw DC. Gastrocnemius medialis muscle-tendon properties and their relationship with power and reactive strength in elite youth soccer players [dissertation]. Liverpool, UK: Liverpool John Moores University; 2021.
  21. 21. Kubo K, Kanehisa H, Fukunaga T. Effects of resistance and stretching training programmes on the viscoelastic properties of human tendon structures in vivo. J Physiol. 2002;538(Pt 1):219–26. pmid:11773330
  22. 22. Ackermans TMA, Epro G, McCrum C, Oberländer KD, Suhr F, Drost MR. Aging and the effects of a half marathon on Achilles tendon force–elongation relationship. Eur J Appl Physiol. 2016;116(11–12):2281–92.
  23. 23. Rogers SA, Whatman CS, Pearson SN, Kilding AE. Assessments of mechanical stiffness and relationships to performance determinants in middle-distance runners. Int J Sports Physiol Perform. 2017;12(10):1329–34. pmid:28290718
  24. 24. Beattie K, Flanagan EP. Establishing the reliability & meaningful change of the drop-jump reactive-strength index. J Aust Strength Cond. 2015;23(5):12–8.
  25. 25. Stone M, O’Bryant H, Hornsby G, Cunanan A, Mizuguchi S, Suarez D. Using the isometric mid-thigh pull in the monitoring of weightlifters: 25 years of experience. Prof Strength Cond. 2019;54:19–26.
  26. 26. McMahon JJ, Suchomel TJ, Lake JP, Comfort P. Understanding the key phases of the countermovement jump force-time curve. Strength Cond J. 2018;40(4):96–106.
  27. 27. Pedley JS, Lloyd RS, Read P, Moore IS, Oliver JL. Drop jump: a technical model for scientific application. Strength Cond J. 2017;39(5):36–44.
  28. 28. Comfort P, Dos’Santos T, Beckham GK, Stone MH, Guppy SN, Haff GG. Standardization and methodological considerations for the isometric midthigh pull. Strength Cond J. 2019;41(2):57–79.
  29. 29. Moir GL, Garcia A, Dwyer GB. Intersession reliability of kinematic and kinetic variables during vertical jumps in men and women. Int J Sports Physiol Perform. 2009;4(3):317–30. pmid:19953820
  30. 30. Keogh J, Bishop C, Ruder M, Kobsar D. Tracking the reliability of force plate-derived countermovement jump metrics over time in female basketball athletes: a comparison of principal component analysis vs. conventional methods. J Sci Sport Exerc. 2023.
  31. 31. Schmalenberger KM, Tauseef HA, Barone JC, Owens SA, Lieberman L, Jarczok MN, et al. How to study the menstrual cycle: practical tools and recommendations. Psychoneuroendocrinology. 2021;123:104895. pmid:33113391
  32. 32. Elliott-Sale KJ, Minahan CL, de Jonge XAKJ, Ackerman KE, Sipilä S, Constantini NW, et al. Methodological considerations for studies in sport and exercise science with women as participants: a working guide for standards of practice for research on women. Sports Med. 2021;51(5):843–61. pmid:33725341
  33. 33. McKay A, Stellingwerff T, Smith E, Martin D, Mujika I, Goosey-Tolfrey V, et al. Defining training and performance caliber: a participant classification framework. Int J Sports Physiol Perform. 2021;:1–15.
  34. 34. World Medical Association. World Medical Association declaration of Helsinki: ethical principles for medical research involving human subjects. J Am Med Assoc. 2013;310(20):2191–4.
  35. 35. Barfod K, Riecke A, Boesen A, Hansen P, Maier J, Døssing S. Validation of a novel ultrasound measurement of achilles tendon length. Knee Surg Sports Traumatol Arthrosc. 2015;23(11):3398–406.
  36. 36. Fath F, Blazevich AJ, Waugh CM, Miller SC, Korff T. Direct comparison of in vivo Achilles tendon moment arms obtained from ultrasound and MR scans. J Appl Physiol (1985). 2010;109(6):1644–52. pmid:20847130
  37. 37. Maganaris CN. Tendon conditioning: artefact or property? Proc R Soc B: Biol Sci. 2003;270 Suppl 1(Suppl 1):S39–42. pmid:12952631
  38. 38. Arampatzis A, Karamanidis K, Morey-Klapsing G, De Monte G, Stafilidis S. Mechanical properties of the triceps surae tendon and aponeurosis in relation to intensity of sport activity. J Biomech. 2007;40(9):1946–52. pmid:17101142
  39. 39. McMahon JJ, Murphy S, Rej SJE, Comfort P. Countermovement-jump-phase characteristics of senior and academy Rugby league players. Int J Sports Physiol Perform. 2017;12(6):803–11. pmid:27918658
  40. 40. Pérez-Castilla A, Fernandes J, Rojas F, García-Ramos A. Reliability and magnitude of countermovement jump performance variables: influence of the take-off threshold. Meas Phys Educ Exerc Sci. 2021;25(3):227–35.
  41. 41. Venegas-Carro M, Kramer A, Moreno-Villanueva M, Gruber M. Test–retest reliability and sensitivity of common strength and power tests over a period of 9 weeks. Sports. 2022;10(11).
  42. 42. Claudino JG, Cronin J, Mezêncio B, McMaster DT, McGuigan M, Tricoli V. The countermovement jump to monitor neuromuscular status: a meta-analysis. J Sci Med Sport. 2017;20(4):397–402.
  43. 43. Heishman AD, Daub BD, Miller RM, Freitas EDS, Frantz BA, Bemben MG. Countermovement jump reliability performed with and without an arm swing in NCAA Division 1 intercollegiate basketball players. J Strength Cond Res. 2020;34(2):546–58. pmid:30138237
  44. 44. Petronijevic MS, Garcia Ramos A, Mirkov DM, Jaric S, Valdevit Z, Knezevic OM. Self-preferred initial position could be a viable alternative to the standard squat jump testing procedure. J Strength Cond Res. 2018;32(11):3267–75. pmid:30540284
  45. 45. Sheppard JM, Doyle TLA. Increasing compliance to instructions in the squat jump. J Strength Cond Res. 2008;22(2):648–51. pmid:18550987
  46. 46. Owen NJ, Watkins J, Kilduff LP, Bevan HR, Bennett MA. Development of a criterion method to determine peak mechanical power output in a countermovement jump. J Strength Cond Res. 2014;28(6):1552–8. pmid:24276298
  47. 47. Maffiuletti NA, Aagaard P, Blazevich AJ, Folland J, Tillin N, Duchateau J. Rate of force development: physiological and methodological considerations. Eur J Appl Physiol. 2016;116(6):1091–116. pmid:26941023
  48. 48. Rendos NK, Harriell K, Qazi S, Regis RC, Alipio TC, Signorile JF. Variations in verbal encouragement modify isokinetic performance. J Strength Cond Res. 2019;33(3):708–16. pmid:30589722
  49. 49. DosʼSantos T, Jones PA, Comfort P, Thomas C. Effect of different onset thresholds on isometric midthigh pull force-time variables. J Strength Cond Res. 2017;31(12):3463–73. pmid:28002178
  50. 50. Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15(2):155–63. pmid:27330520
  51. 51. Hopkins WG. Measures of reliability in sports medicine and science. Sports Med. 2000;30(1):1–15. pmid:10907753
  52. 52. Bishop C, Jordan M, Torres-Ronda L, Loturco I, Harry J, Virgile A. Selecting metrics that matter: comparing the use of the countermovement jump for performance profiling, neuromuscular fatigue monitoring, and injury rehabilitation testing. Strength Cond J. 2023;45(5):545–53.
  53. 53. Buckner SL, Jessee MB, Mattocks KT, Mouser JG, Counts BR, Dankel SJ. Determining strength: a case for multiple methods of measurement. Sports Med. 2017;47(2):193–5.
  54. 54. Walker J, Bissas A, Wainwright B, Hanley B, Cronin N. Repeatability and sensitivity of passive mechanical stiffness measurements in the triceps surae muscle-tendon complex. Scand J Med Sci Sports. 2022;32(1):83–93.
  55. 55. Wiesinger H-P, Rieder F, Kösters A, Müller E, Seynnes OR. Are sport-specific profiles of tendon stiffness and cross-sectional area determined by structural or functional integrity? PLoS One. 2016;11(6):e0158441. pmid:27362657
  56. 56. Lazarczuk SL, Maniar N, Opar DA, Duhig SJ, Shield A, Barrett RS. Mechanical, material and morphological adaptations of healthy lower limb tendons to mechanical loading: a systematic review and meta-analysis. Sports Med. 2022;52(10):2405–29.
  57. 57. Gathercole R, Sporer B, Stellingwerff T, Sleivert G. Alternative countermovement-jump analysis to quantify acute neuromuscular fatigue. Int J Sports Physiol Perform. 2015;10(1):84–92. pmid:24912201
  58. 58. Southey B, Willshire M, Connick M, Austin D, Spits D, Beckman E. Reactive strength index as a key performance indicator in different athlete populations – a systematic review. Sci Sports. 2023.
  59. 59. Kozinc Ž, Žitnik J, Smajla D, Šarabon N. The difference between squat jump and countermovement jump in 770 male and female participants from different sports. Eur J Sport Sci. 2021;22(7):985–93.
  60. 60. Ishikawa M, Niemelä E, Komi PV. Interaction between fascicle and tendinous tissues in short-contact stretch-shortening cycle exercise with varying eccentric intensities. J Appl Physiol (1985). 2005;99(1):217–23. pmid:15705735
  61. 61. McGuigan MR, Doyle TLA, Newton M, Edwards DJ, Nimphius S, Newton RU. Eccentric utilization ratio: effect of sport and phase of training. J Strength Cond Res. 2006;20(4):992–5. pmid:17194252
  62. 62. Comfort P, Thomas C, Dos’Santos T, Jones PA, Suchomel TJ, McMahon JJ. Comparison of methods of calculating dynamic strength index. Int J Sports Physiol Perform. 2018;13(3):320–5. pmid:28714767
  63. 63. Gathercole RJ, Sporer BC, Stellingwerff T, Sleivert GG. Comparison of the capacity of different jump and sprint field tests to detect neuromuscular fatigue. J Strength Cond Res. 2015;29(9):2522–31. pmid:26308829
  64. 64. Moir G, Sanders R, Button C, Glaister M. The influence of familiarization on the reliability of force variables measured during unloaded and loaded vertical jumps. J Strength Cond Res. 2005;19(1):140–5. pmid:15705024
  65. 65. Nibali ML, Tombleson T, Brady PH, Wagner P. Influence of familiarization and competitive level on the reliability of countermovement vertical jump kinetic and kinematic variables. J Strength Cond Res. 2015;29(10):2827–35. pmid:26398700
  66. 66. Costley L, Wallace E, Johnston M, Kennedy R. Reliability of bounce drop jump parameters within elite male rugby players. J Sports Med Phys Fitness. 2018;58(10):1390–7.
  67. 67. Ishida A, Suarez DG, Travis SK, Slaton JA, White JB, Bazyler CD, et al. Intrasession and intersession reliability of isometric squat, midthigh pull, and squat jump in resistance-trained individuals. J Strength Cond Res. 2023;37(1):18–26. pmid:36515586
  68. 68. Suarez DG, Carroll KM, Slaton JA, Rochau KG, Davis MW, Stone MH. Utility of a shortened isometric midthigh pull protocol for assessing rapid force production in athletes. J Strength Cond Res. 2022;36(7):1819–25. pmid:33044370
  69. 69. Thomas C, Dos’santos T, Comfort P, Jones P. Between-session reliability of common strength-and power-related measures in adolescent athletes. Sports. 2017;5(15):15.
  70. 70. Guppy SN, Brady CJ, Kotani Y, Stone MH, Medic N, Haff GG. The effect of altering body posture and barbell position on the between-session reliability of force-time curve characteristics in the isometric mid-thigh pull. Sports. 2018;6(4):162.
  71. 71. Souza A, Bottaro M, Rocha Junior V, Lage V, Tufano J, Vieira A. Reliability and test-retest agreement of mechanical variables obtained during countermovement jump. Int J Exerc Sci. 2020;13(4):6–17.