Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Assessment of Lower Limb Muscle Strength and Power Using Hand-Held and Fixed Dynamometry: A Reliability and Validity Study

  • Benjamin F. Mentiplay ,

    Contributed equally to this work with: Benjamin F. Mentiplay, Ross A. Clark

    Affiliation School of Exercise Science, Faculty of Health Sciences, Australian Catholic University, Melbourne, Australia

  • Luke G. Perraton ,

    ‡ These authors also contributed equally to this work.

    Affiliation School of Exercise Science, Faculty of Health Sciences, Australian Catholic University, Melbourne, Australia

  • Kelly J. Bower ,

    ‡ These authors also contributed equally to this work.

    Affiliation School of Exercise Science, Faculty of Health Sciences, Australian Catholic University, Melbourne, Australia

  • Brooke Adair ,

    ‡ These authors also contributed equally to this work.

    Affiliation School of Allied Health, Faculty of Health Sciences, Australian Catholic University, Melbourne, Australia

  • Yong-Hao Pua ,

    ‡ These authors also contributed equally to this work.

    Affiliation Department of Physiotherapy, Singapore General Hospital, Singapore, Singapore

  • Gavin P. Williams ,

    ‡ These authors also contributed equally to this work.

    Affiliations Department of Physiotherapy, Epworth Healthcare, Melbourne, Australia, School of Physiotherapy, Faculty of Medicine, Denistry and Health Sciences, The University of Melbourne, Melbourne, Australia

  • Rebekah McGaw ,

    ‡ These authors also contributed equally to this work.

    Affiliation School of Exercise Science, Faculty of Health Sciences, Australian Catholic University, Melbourne, Australia

  • Ross A. Clark

    Contributed equally to this work with: Benjamin F. Mentiplay, Ross A. Clark

    Affiliation School of Exercise Science, Faculty of Health Sciences, Australian Catholic University, Melbourne, Australia



Hand-held dynamometry (HHD) has never previously been used to examine isometric muscle power. Rate of force development (RFD) is often used for muscle power assessment, however no consensus currently exists on the most appropriate method of calculation. The aim of this study was to examine the reliability of different algorithms for RFD calculation and to examine the intra-rater, inter-rater, and inter-device reliability of HHD as well as the concurrent validity of HHD for the assessment of isometric lower limb muscle strength and power.


30 healthy young adults (age: 23±5yrs, male: 15) were assessed on two sessions. Isometric muscle strength and power were measured using peak force and RFD respectively using two HHDs (Lafayette Model-01165 and Hoggan microFET2) and a criterion-reference KinCom dynamometer. Statistical analysis of reliability and validity comprised intraclass correlation coefficients (ICC), Pearson correlations, concordance correlations, standard error of measurement, and minimal detectable change.


Comparison of RFD methods revealed that a peak 200ms moving window algorithm provided optimal reliability results. Intra-rater, inter-rater, and inter-device reliability analysis of peak force and RFD revealed mostly good to excellent reliability (coefficients ≥ 0.70) for all muscle groups. Concurrent validity analysis showed moderate to excellent relationships between HHD and fixed dynamometry for the hip and knee (ICCs ≥ 0.70) for both peak force and RFD, with mostly poor to good results shown for the ankle muscles (ICCs = 0.31–0.79).


Hand-held dynamometry has good to excellent reliability and validity for most measures of isometric lower limb strength and power in a healthy population, particularly for proximal muscle groups. To aid implementation we have created freely available software to extract these variables from data stored on the Lafayette device. Future research should examine the reliability and validity of these variables in clinical populations.


Muscular weakness, as a component of muscle function, is an impairment that is commonly observed in clinical populations and has been widely documented to impact upon physical function [14]. Two important components of muscle function are the peak force that a muscle group can produce (muscle strength) and how rapidly that force can be produced (muscle power) [3, 5]. The latter has previously been quantified by calculating the rate of force development (RFD), which is calculated by measuring the change in force over a certain time period (Δforce/Δtime), usually during an isometric contraction [5, 6]. The measure of RFD has important functional implications; sufficient RFD is necessary to perform quick and forceful muscle contractions, such as those observed during walking [5]. Previous literature indicates that reduced muscle power, often associated with aging, may contribute to reduced physical function and an increased risk of falls in a range of clinical populations [713]. As such, assessments of muscle power may be useful in clinical settings for identifying individuals at risk of falls and functional limitations.

Currently there are varying methods utilised to calculate RFD from isometric contractions. Commonly used methods involve calculating the change in force over the change in time with discrete time intervals from the onset of contraction to 30, 50 or 100ms [5, 14, 15]. However, onset of contraction has been defined in different ways including when the force reading exceeds a set threshold of either absolute values or percentages of a maximal voluntary contraction [5, 14, 1618]. Other methods of calculating RFD involve examining successive time intervals (e.g. 5ms) during the initial rise in force to determine the peak RFD across the trial [1921], or examining the RFD between percentages of the peak force (e.g. between 30 and 60% of peak force) [22]. There is currently no consensus as to which measure of RFD should be used in the assessment of muscle power.

The criterion-reference assessment of muscle strength and power involves fixed laboratory-based dynamometry. A limitation of laboratory-based dynamometers is they are expensive and cumbersome which precludes their use as a clinically-feasible device for routine patient assessment [2325]. Other devices that can be used to assess dynamic muscle power include linear position transducers [2628], the Nottingham power rig [2830], and force plates [31, 32], however the cost, availability, time-consuming nature, and difficulty of implementation of such assessments may limit their use in clinical settings. Clinic-based assessment of muscle power is important to allow widespread access to testing and easily-interpreted results. Commonly used devices that measure isometric lower limb muscle strength include hand-held dynamometers (HHDs). These low-cost and portable devices are an appropriate and convenient method to assess muscle strength in a clinical setting due to their strong reliability and validity when compared with expensive laboratory-based dynamometers [2325, 33, 34]. Previous psychometric literature assessing isometric lower limb strength using HHDs has focused predominantly on the knee extensors, with limited information on the validity of HHDs when assessing the muscle strength of other lower limb muscle groups [25, 34]. Additionally, the reliability and validity of HHDs for the assessment of isometric muscle power is currently unknown and warrants further investigation due to the importance of muscle power [30, 35].

The first aim of this study was to examine the reliability of different algorithms for assessment of RFD using fixed dynamometry. Secondly, this study aimed to determine the concurrent validity of two HHDs (Lafayette and Hoggan manufactured devices) compared to fixed dynamometry (KinCom) to assess isometric lower limb muscle strength and power using measures of peak force and RFD. Additionally, the intra-rater, inter-rater, and inter-device reliability of each device for the assessment of peak force and RFD was assessed. It was hypothesised that the HHDs would demonstrate good validity and reliability for the assessment of both muscle strength and power (intraclass correlation coefficients ≥0.75).

Materials and Methods


The isometric lower limb muscle strength and power of 30 healthy participants over the age of 18 was assessed. Participants were required to have no lower limb injury in the preceding two months or other comorbidities such as cardiovascular or respiratory conditions that could potentially impact on the assessment of muscle strength and power. This study used a concurrent validity, test-retest reliability design whereby participants attended two identical testing sessions. This study had approval from the Australian Catholic University Human Research Ethics Committee, where a convenience sample of participants were recruited. All participants gave written informed consent.


Two HHDs were used to assess lower limb strength and power: a Lafayette Manual Muscle Testing System Model-01165 (Lafayette Instrument Company, Lafayette IN, USA) and a Hoggan microFET2 (Hoggan Scientific, LLC, Salt Lake City UT, USA). The two HHDs were left as purchased from each manufacturer with no additional padding secured to the devices. The approximate retail cost of the Lafayette device is US$1,200, with the Hoggan device costing approximately US$1,095 (plus US$495 for the software package). For determination of the validity of each HHD, participants were also assessed using a fixed, laboratory-based KinCom dynamometer (Chattex Corporation, Chattanooga TN, USA). Laboratory-based dynamometers can often cost in excess of US$50,000. All devices recorded force in kilograms and were calibrated once at the start of the study. Both assessors were male and were experienced at using such devices, with Assessor-A having one year experience using HHDs and Assessor-B having 10 years of clinical physiotherapy experience using HHDs.


Currently there is no consensus on the most appropriate testing positions for HHD use, with a recent systematic review demonstrating a variety of methodologies used for lower limb assessment in previous research [25]. Based on prior research and our own pilot work of assessments in a variety of different positions, we implemented those shown in Fig 1. These testing positions have shown strong reliability for the measurement of isometric strength in previous studies for the hip [36], knee [37], and ankle [37] muscle groups.

Fig 1. Testing positions for strength and power assessment.

Note: Same positions were used on the fixed dynamometer. (A) Hip flexors with the participant seated and hips and knees flexed at 90°. Dynamometer placed on the anterior aspect of the thigh, proximal to the knee joint. (B) Knee extensors with the participant seated and hip and knees flexed at 90°. Dynamometer placed on the anterior aspect of the shank, proximal to the ankle joint. (C) Knee flexors with the participant seated and hips and knees flexed at 90°. Dynamometer placed on the posterior aspect of the shank, proximal to the ankle joint. (D) Ankle plantarflexors with the participant lying supine with the ankle in plantargrade and hips and knees extended. Dynamometer placed over the metatarsal heads on the sole of the foot. (E) Ankle dorsiflexors with the participant lying supine with the ankle relaxed and hips and knees extended. Dynamometer placed over the metatarsal heads on the dorsum of the foot. (F) Hip abductors with the participant lying supine and hips and knees extended. Dynamometer placed on the lateral aspect of the shank, proximal to the ankle joint. (G) Hip adductors with the participant lying supine and hips and knees extended. Dynamometer placed on the medial aspect of the shank, proximal to the ankle joint. (H) Hip extensors with the participant lying prone and hips and knees extended. Dynamometer placed on the posterior aspect of the shank, proximal to the ankle joint.

Assessment of isometric muscle strength and power was performed with the participants in three positions (seated, supine, and prone); hip flexors, knee extensors, and knee flexors were assessed in a seated position; ankle plantarflexors, ankle dorsiflexors, hip abductors, and hip adductors in a supine position; hip extensors in a prone position. These positions were chosen to minimise changes in position by the participant to enhance the feasibility of testing in a clinical setting. All tests involved maximal voluntary isometric contractions. Assessment using the HHDs was conducted first. The order was randomised for assessor and HHD, however the order of the muscle groups tested was kept consistent as shown in Fig 1; for example if HHD1 was randomly assigned first, all seated muscle groups would be assessed, followed by HHD2 assessing seated muscle groups, with the same order of HHDs for supine and then prone muscle groups. Following a rest period of five minutes, the same protocol was repeated by the second assessor. During pilot testing, problems arose in the assessment of very strong muscle groups, namely the knee extensors and ankle plantarflexors. To assist the assessor in overcoming the force produced by the participant, the plinth was placed close to a wall, which aided the assessors in their resistance of the participants’ contractions for these two muscle groups (see Fig 1B and 1D).

Following HHD testing, the isometric strength and power of participants was then assessed using the KinCom dynamometer utilising the positions described for the HHDs. In order to minimise position changes and reduce time requirements, the order of muscles tested was different during the assessment with the KinCom dynamometer. The order for the KinCom was as follows: knee extensors, knee flexors, hip flexors, hip abductors, hip adductors, hip extensors, ankle plantarflexors, and ankle dorsifexors. Instructions provided to participants for all trials were ‘at the count of three, push/pull as hard and as fast as you can and hold that contraction until I say relax’. Each test lasted between three to five seconds and ended after a steady maximal force was produced by the participant. Participants were instructed to hold the side of the plinth for stabilization (see Fig 1). Constant verbal encouragement was provided throughout the testing. Only the right limb of each participant was assessed to reduce fatigue and the time demands of the testing session. A submaximal practice trial was given for each muscle group on both HHDs and the fixed dynamometer to ensure the participant understood the contraction required. Two trials were recorded for each muscle group, again to minimise the time requirements of testing.

Data Analysis

A custom-written software program (LabVIEW 2009 National Instruments, Austin TX, USA) was made to analyse the data from the three devices using the following procedures. A zero-phase shift 10Hz lowpass 4th order Butterworth filter was applied to data from each of the three devices. Due to the differing sampling rates between devices (Lafayette: stable 40Hz; Hoggan: unstable, approximately 100Hz; KinCom: stable 1000Hz), the data for the HHDs were resampled to a constant interval 1000Hz using cubic spline interpolation to allow for consistent and unbiased analysis. Strength was assessed by measuring peak force, which was determined by calculating the highest force value recorded in kilograms during both trials for each muscle group. Whilst normalisation to the length of the lever arm and body mass is crucial if comparing results from HHDs between participants, data from this study were not normalised in this way; our analysis of results was only performed within participants and therefore normalisation was redundant.

There is currently no consensus in the literature on the most appropriate measure of RFD [5, 19, 22]. Thus a comparison of the reliability of differing methodologies on the criterion-reference KinCom dynamometer was included in the current study. The analysis methods used included variants of three methods for the assessment of muscle power: 1) time to peak force, 2) calculating peak RFD between percentages of the peak force (5–95%, 10–90%, 15–85%, 20–80%, 25–75%, 30–70%, 35–65%, 40–60%), and, 3) examining successive time intervals (e.g. sample 1–11, 2–12, 3–13 etc.) during the initial rise in force to determine the peak RFD across the trial for time intervals of 10, 20, 50, 100, and 200ms. The methods differed in that the second method has a fixed position on the force trace but a variable time interval (i.e. it is always between the set force thresholds, but the duration shortens if the RFD is higher), whilst the third method has a fixed time interval but variable force position (i.e. the extracted data always has the same number of samples in it, but it could occur anywhere on the ascending slope of the force trace).

Statistical Analysis

Data were assessed for normality using a Shapiro-Wilk test, with the data conforming to normal distribution. Descriptive statistics (mean and standard deviations) were used to describe participant demographics and anthropometrics and outcome measures of peak force and RFD. The first step in analysis was to calculate the reliability of different RFD algorithms from the fixed dynamometer, which was done through intraclass correlation coefficients (ICC2,1).

Assessment of intra-rater and inter-rater reliability was conducted using ICC2,1, standard error of measurement (SEM), and minimal detectable change (MDC) with 95% confidence intervals. The SEM and MDC were calculated using the formulas provided by Portney and Watkins [38] and expressed as percentages of the mean. The SEM was calculated by multiplying the standard deviation of the first session results by the square root of one minus the ICC () [38]. The MDC was calculated using the following formula , where z = 1.96 (based on 95% confidence) and SEM is the standard error of measurement [38]. The association and agreement between assessors and devices, for inter-rater and inter-device reliability, were also measured using Pearson’s correlation (R) and concordance correlation coefficients (Rc). The Pearson’s correlation coefficient assesses association irrespective of magnitude differences whereas the concordance coefficient assesses both association and deviations from the line of identity (y = x).

Analysis of concurrent validity was conducted by comparing results from the two HHDs to the gold standard laboratory-based KinCom using ICC2,1, R, and Rc. Standard or regression-based (when proportional bias was detected) Bland-Altman plots with 95% limits of agreement [39] were calculated for all variables (see S1S4 Files). Correlations of the difference between scores and the average scores were examined to detect a proportional bias (R>0.50), which indicated use of a regression-based Bland-Altman plot. Point estimates of the correlation and ICC values for reliability and validity analyses were based on those provided by Portney and Watkins [38] interpreted as excellent (≥0.90), good (0.75–0.89), moderate (0.50–0.74), or poor (<0.50).


A convenience sample of thirty participants (age: 22.87±5.08yrs, mass: 68.67±9.15kg, height: 172.85±9.11cm, male: 15) who were recruited through the University attended two testing sessions one week apart (mean: 7±2 days). One participant was unable to attend the second session. Further explanation of missing data is provided in S1S4 Files and the full data set is in S5 File.

Reliability of RFD measures

Measures of time to peak force and RFD that involved calculating the change in force over the change in time between percentages of the peak force (5–95%, 10–90%, 15–85%, 20–80%, 25–75%, 30–70%, 35–65%, 40–60%) revealed poor to moderate test re-test reliability (median ICC<0.85) across the majority of muscle groups on the fixed KinCom dynamometer (Table 1). Examination of the RFD measures calculated across successive time intervals (10, 20, 50, 100, and 200ms) showed good to excellent results for test re-test reliability on the KinCom dynamometer (median ICC≥0.91). The 200ms time interval method displayed the highest median reliability results (median ICC = 0.93) and no results lower than our threshold for good (≥0.75), and was therefore used for further analyses.

Table 1. Test-retest reliability (ICCs) of different rate of force development measures on the fixed KinCom dynamometer.

Intra- and Inter-rater Reliability

The mean (standard deviation (SD)) and intra-rater reliability results for peak force and RFD are shown in Tables 2 and 3 respectively. Intra-rater reliability was good to excellent (ICC≥0.75) for all peak force measures with the exception of a moderate result for the ankle plantarflexors measured by Assessor-B using the Hoggan device (ICC = 0.74). Intra-rater reliability was also good to excellent for all RFD measures with the exception of the knee extensors measured by Assessor-A using the Hoggan device (ICC = 0.71), and measures of ankle dorsiflexors (ICC = 0.49), hip abductors (ICC = 0.74), and knee extensors (ICC = 0.71) using the Lafayette device by Assessor-B.

Table 2. Mean (SD) values and intra-rater reliability for each assessor on each device plus the KinCom for peak force (kg).

Table 3. Mean (SD) values and intra-rater reliability for each assessor on each device plus the KinCom for rate of force development (kg/s).

Inter-rater reliability results are displayed in Table 4. Inter-rater reliability was good to excellent for both peak force and RFD measures (ICC≥0.75) in all muscle groups except for peak force of the ankle dorsiflexors (ICC = 0.68) and ankle plantarflexors (ICC = 0.66) measured on the Hoggan device, and RFD of the ankle dorsiflexors measured on the Lafayette device (ICC = 0.70). The tables also show the intra- and inter-rater reliability results of the SEM and MDC calculations, expressed as a percentage of the mean.

Table 4. Inter-rater and inter-device reliability for the HHDs.

Inter-device Reliability

Analysis of inter-device results showed good to excellent correlations (R≥0.75) between results obtained on the Lafayette and Hoggan devices for all peak force measures (Table 4). Additionally, concordance correlations for peak force also showed good to excellent agreement (Rc≥0.75) with the exception of ankle dorsiflexors (Rc = 0.66) measured by Assessor-A. Inter-device analysis of RFD measures showed good to excellent correlations (R≥0.75) for all muscle groups with the exception of the ankle dorsiflexors measured by Assessor-B (R = 0.73) and the knee extensors for Assessors-A and B (R = 0.41, 0.57 respectively). The majority of RFD concordance correlation results showed moderate to good agreement (see Table 4). Measures of RFD for the knee extensors showed poor agreement and moderate correlations between devices for both assessors.

Concurrent Validity

Results from the validity analysis for peak force and RFD measures are shown in Table 5. Validity of peak force measures were good to excellent (ICC≥0.75) with the exception of most ankle results which demonstrated moderate validity; this included ankle dorsiflexors measured by Assessor-A on the Lafayette device (ICC = 0.62) and the Hoggan device (ICC = 0.51) and ankle plantarflexors measured by Assessor-A and B on the Lafayette device (ICC = 0.51, 0.54 respectively) and the Hoggan device (ICC = 0.47, 0.40 respectively). The validity of RFD measures were mixed, however all measures of the hip musculature demonstrated good to excellent validity (ICC≥0.75) except for the hip abductors measured by Assessor-B using the Lafayette device (ICC = 0.74). Ankle and knee RFD measures displayed mostly moderate to good validity. Results from the Bland-Altman plots are provided in S1S4 Files.

Table 5. Concurrent validity analysis of the HHDs for assessment of peak force and RFD measures compared to the KinCom.


Hand-held dynamometry demonstrated good to excellent intra- and inter-rater reliability for the assessment of isometric lower limb muscle strength and power in a healthy population. Comparison of the HHDs to a laboratory-based dynamometer showed moderate to excellent concurrent validity for both measures of isometric lower limb strength and power. To the authors’ knowledge, this is the first study to evaluate the intra- and inter-rater reliability and validity of HHDs for assessing muscle strength in all major muscles of the lower limbs with a greater than poor sample size based on the COSMIN checklist [40], and the first to use HHDs to assess muscle power. These low-cost, portable, and easy-to-use devices have previously shown excellent results for use as a clinically-feasible alternative to laboratory-based dynamometry for the assessment of isometric muscle strength. The results from the current study indicate promise for HHDs in the assessment of isometric muscle power.

Previous literature has focussed primarily on the assessment and treatment of muscle strength in various clinical populations; however, muscle power is another important consideration. Evidence indicates that in an elderly population, measures of muscle power are more strongly associated with self-reported function and incidence of falls than muscle strength [30, 35]. As such, knowledge of both muscle strength and power may be of use to clinicians when assessing and treating their patients. The HHD results for both peak force and RFD can be obtained from the same trial using the same methodology, adding to the feasibility of using this device in a clinical setting for patient assessment. A potential limiting factor is the lack of widely available software to extract the RFD data. For this reason we have created a freely available software program (available at which allows the user to obtain the 200ms rolling window RFD from data stored on a Lafayette device. A software program for data from a Hoggan device is not available on the website due to the additional cost of purchasing the data recording software for this device and issues experienced during testing with saving recorded data (See S1S5 Files).

The inter-rater reliability was good to excellent for both peak force and RFD using both HHD devices. Nonetheless, agreement between assessors ranged from moderate to excellent for peak force and RFD, suggesting that although results between assessors are comparable, the results are not interchangeable. Previous research has found mixed inter-rater reliability in a range of populations for the assessment of muscle strength [34, 4145]. Both assessors in the current study were male, with differing levels of experience. Prior research has compared reliability results for peak force analysis using a Hoggan microFET2 HHD and found similar reliability between male and female assessors with varying levels of experience, height, and weight [46]. Previous studies have commented on the influence that assessor strength may have on HHD testing [34, 47]. In our experience, sufficient strength levels are required to control the movement of the patient, after which the technique of the assessor is likely to be just as important for obtaining valid results. During testing assessors should have a wide base of support, use their own body mass to lean into the participant and keep arms tucked in towards their body.

Closer examination of the results from each lower limb muscle group revealed that the hip musculature showed the strongest reliability and validity for measures of peak force and RFD. Previous research examining peak force has also found similar results for the assessment of hip strength using HHD in a range of populations [36, 37, 4850]. Assessment of the ankle muscles demonstrated good to excellent reliability however validity was lower than expected. Previous research in a healthy population has also shown poor validity of HHD measures of plantarflexor strength in comparison with the KinCom [51]. Assessment of the ankle muscle groups is important, because the ankle plantarflexors have a primary role in power generation during walking [52] and the dorsiflexors are the lower body muscles most strongly associated with gait speed in people living with stroke [53]. Our mixed validity results in the current study and previous research [51] may have been caused by the ankle plantar/dorsiflexor attachment used on the KinCom. Participants reported difficulty in using the attachment, especially for ankle dorsiflexion, due to the lack of stabilisation that the attachment provides. Moreover, the ankle attachment does not fit tightly within the load cell, which may have resulted in measurement error. Similar comments have been made previously using the ankle inversion/eversion attachment on the KinCom [54]. Assessment of peak force of the knee extensors and flexors demonstrated good to excellent reliability and validity however validity of RFD measures for the knee extensors using both HHDs ranged from moderate to good. This may have been due to the higher levels of force and power generated in the knee extensors, leading to the assessors having difficulty in stabilising the HHD during the initial rapid rise in force, consequently impacting on measures of RFD. Therefore, if the knee extensors are the primary muscle of interest it may be necessary to consider external bracing during power assessment. Analysis of the SEM for intra-rater reliability for each device showed small percentages of the mean indicating low measurement error, with RFD higher than peak force values (<10% for peak force except one measure and <20% for RFD except for one measure). The MDC results were also higher for RFD measures compared with peak force (<50% of the mean for RFD except two measures and <25% for peak force except for two measures). Analysis of the MDC for HHD measures of muscle strength and power may prove more informative in clinical populations than the healthy participants used in the current study.

The measurement of RFD has been widely used previously with a lack of consensus of which method is appropriate. After a comparison of various techniques for assessing RFD that were applicable to a HHD, this study utilised a peak 200ms iterative windowed time period method to determine peak RFD. Previous work has commented on the arbitrary nature of determining onset of contraction for calculation of RFD [55]. As such, this study did not determine the onset of contraction; rather RFD was identified using algorithms calculated from the first sample recorded to determine the peak RFD across the trial. This method ignores any erroneous recordings made by placing the HHD on the lower limb and as such the calculation of RFD will not include these initial recordings. Whilst increasing the duration of the window in the calculation of RFD may produce higher reliability results, a longer time window may include unwanted plateaus. We found the 200ms successive time window analysis technique to be robust to different sources of error during testing, however further research is needed in clinical populations to verify the findings of the current study.

Comparison of the Hoggan and Lafayette HHDs used in this study revealed no apparent differences between the devices in their reliability or validity for either measure of peak force or RFD. The inter-device reliability indicated that peak force results are interchangeable between the two different HHDs. Caution is necessary if interchanging RFD results between devices, as this study demonstrated mixed agreement between HHDs for measures of RFD. Additionally, both HHDs demonstrated mixed agreement with the KinCom for measures of peak force and RFD. The lack of agreement between devices for measures of RFD may be due to the different sampling rates employed by each device. Based on the results of the current study, there can be no recommendation as to which HHD should be used, with both devices demonstrating similar reliability and validity. One consideration for the future development of HHDs is the real-time calculation of RFD. Calculation of RFD on both of the devices chosen for this study currently requires post-testing analysis. The Hoggan device needs to be wirelessly connected to a computer during testing; with the software interface occasionally losing recorded data during collection (see S1S5 Files). The Lafayette device stores raw data within the device, which can be downloaded to a laptop for analysis. After further testing, manufacturers should consider including RFD as an automated output on their device.

Study Limitations

The sample used in this study was a group of young, healthy, and physically active individuals. Even with the assessors bracing against a wall, the assessment of the knee extensors and ankle plantarflexors could not be completed for one male participant. Additionally, as can be seen in Tables 2 and 3, these two muscle groups recorded much higher strength and power values on the fixed dynamometer across all participants. It is likely that the assessment of muscle strength and power would be easier in those with muscle weakness, such as the elderly or those with neurological impairments. The findings of this study may therefore not be directly generalisable to some clinical populations. A recent review demonstrated that the reliability of HHDs is generally lower in healthy populations compared to clinical populations [34]. This could be due to the difficulties when testing stronger participants or lower inter-subject variability in healthy populations, compared to clinical. Nonetheless, the inclusion of healthy individuals does not discount the importance of our study, as normative data is required, albeit not normalised to body mass, to allow comparison with healthy populations and therefore establishing reliability and validity in this group was considered essential.


Hand-held dynamometry is a reliable and valid tool for the assessment of isometric lower limb muscle strength and power, which may be valuable information particularly in clinical populations with gait impairments. Assessment of muscle strength and power in clinical populations using HHDs is warranted to determine the psychometric properties of these devices.

Competing Interests

The authors have declared that no competing interests exist.

Supporting Information

S1 File. Bland-Altman Plots indicating the validity of the Lafayette HHD in comparison with the KinCom for analysis of peak force (kg).


S2 File. Bland-Altman Plots indicating the validity of the Lafayette HHD in comparison with the KinCom for analysis of rate of force development (kg/s).


S3 File. Bland-Altman Plots indicating the validity of the Hoggan HHD in comparison with the KinCom for analysis of peak force (kg).


S4 File. Bland-Altman Plots indicating the validity of the Hoggan HHD in comparison with the KinCom for analysis of rate of force development (kg/s).


Author Contributions

Conceived and designed the experiments: BFM KJB BA YHP GPW RAC. Performed the experiments: BFM LGP RM. Analyzed the data: BFM KJB YHP RM RAC. Contributed reagents/materials/analysis tools: BFM RAC. Wrote the paper: BFM LGP KJB BA YHP GPW RM RAC. Designed the software used in analysis: BFM RAC.


  1. 1. Bohannon RW, Walsh S. Nature, reliability, and predictive value of muscle performance measures in patients with hemiparesis following stroke. Arch Phys Med Rehabil. 1992;73(8):721–5. pmid:1642521
  2. 2. Doherty TJ. Invited review: aging and sarcopenia. J Appl Physiol. 2003;95(4):1717–27. pmid:12970377
  3. 3. Gerrits KH, Beltman MJ, Koppe PA, Konijnenbelt H, Elich PD, de Haan A et al. Isometric muscle function of knee extensors and the relation with functional performance in patients with stroke. Arch Phys Med Rehabil. 2009;90(3):480–7. pmid:19254615
  4. 4. Wiley ME, Damiano DL. Lower-extremity strength profiles in spastic cerebral palsy. Dev Med Child Neurol. 1998;40(2):100–7. pmid:9489498
  5. 5. Aagaard P, Simonsen EB, Andersen JL, Magnusson P, Dyhre-Poulsen P. Increased rate of force development and neural drive of human skeletal muscle following resistance training. J Appl Physiol. 2002;93(4):1318–26. pmid:12235031
  6. 6. Caserotti P, Aagaard P, Buttrup Larsen J, Puggaard L. Explosive heavy-resistance training in old and very old adults: Changes in rapid muscle force, strength and power. Scand J Med Sci Sports. 2008;18(6):773–82. pmid:18248533
  7. 7. Aagaard P, Suetta C, Caserotti P, Magnusson SP, Kjær M. Role of the nervous system in sarcopenia and muscle atrophy with aging: Strength training as a countermeasure. Scand J Med Sci Sports. 2010;20(1):49–64. pmid:20487503
  8. 8. Fimland MS, Moen PMR, Hill T, Gjellesvik TI, Tørhaug T, Helgerud J et al. Neuromuscular performance of paretic versus non-paretic plantar flexors after stroke. Eur J Appl Physiol. 2011;111(12):3041–9. pmid:21455614
  9. 9. Fleming BE, Wilson DR, Pendergast DR. A portable, easily performed muscle power test and its association with falls by elderly persons. Arch Phys Med Rehabil. 1991;72(11):886–9. pmid:1929806
  10. 10. Izquierdo M, Aguado X, Gonzalez R, López JL, Häkkinen K. Maximal and explosive force production capacity and balance performance in men of different ages. Eur J Appl Physiol. 1999;79(3):260–7.
  11. 11. Maffiuletti NA, Bizzini M, Widler K, Munzinger U. Asymmetry in quadriceps rate of force development as a functional outcome measure in TKA. Clin Orthop. 2010;468(1):191–8. pmid:19597897
  12. 12. Moreau NG, Falvo MJ, Damiano DL. Rapid force generation is impaired in cerebral palsy and is related to decreased muscle size and functional mobility. Gait Posture. 2012;35(1):154–8. pmid:21930383
  13. 13. Winters JD, Rudolph KS. Quadriceps rate of force development affects gait and function in people with knee osteoarthritis. Eur J Appl Physiol. 2014;114(2):273–84. pmid:24240535
  14. 14. Andersen LL, Aagaard P. Influence of maximal muscle strength and intrinsic muscle contractile properties on contractile rate of force development. Eur J Appl Physiol. 2006;96(1):46–52. pmid:16249918
  15. 15. Suetta C, Aagaard P, Rosted A, Jakobsen AK, Duus B, Kjaer M et al. Training-induced changes in muscle CSA, muscle strength, EMG, and rate of force development in elderly subjects after long-term unilateral disuse. J Appl Physiol. 2004;97(5):1954–61. pmid:15247162
  16. 16. Andersen LL, Andersen JL, Zebis MK, Aagaard P. Early and late rate of force development: Differential adaptive responses to resistance training? Scand J Med Sci Sports. 2010;20(1):e162–e9. pmid:19793220
  17. 17. Blazevich AJ, Horne S, Cannavan D, Coleman DR, Aagaard P. Effect of contraction mode of slow-speed resistance training on the maximum rate of force development in the human quadriceps. Muscle Nerve. 2008;38(3):1133–46. pmid:18642358
  18. 18. Pijnappels M, van der Burg JCE, Reeves ND, van Dieën JH. Identification of elderly fallers by muscle strength measures. Eur J Appl Physiol. 2008;102(5):585–92. pmid:18071745
  19. 19. Bemben MG, Massey BH, Bemben DA, Misner JE, Boileau RA. Isometric muscle force production as a function of age in healthy 20- to 74-yr-old men. Med Sci Sports Exerc. 1991;23(11):1302–10. pmid:1766348
  20. 20. Korhonen MT, Cristea A, Alén M, Häkkinen K, Sipilä S, Mero A et al. Aging, muscle fiber type, and contractile function in sprint-trained athletes. J Appl Physiol. 2006;101(3):906–17. pmid:16690791
  21. 21. Kyröläinen H, Avela J, McBride JM, Koskinen S, Andersen JL, Sipilä S et al. Effects of power training on muscle structure and neuromuscular performance. Scand J Med Sci Sports. 2005;15(1):58–64. pmid:15679573
  22. 22. Sleivert GG, Wenger HA. Reliability of measuring isometric and isokinetic peak torque, rate of torque development, integrated electromyography, and tibial nerve conduction velocity. Arch Phys Med Rehabil. 1994;75(12):1315–21. pmid:7993170
  23. 23. Kolber MJ, Cleland JA. Strength testing using hand-held dynamometry. Phys Ther Rev. 2005;10(2):99–112.
  24. 24. Moriello C, Mayo NE. Development of a position-specific index of muscle strength to be used in stroke evaluation. Arch Phys Med Rehabil. 2006;87(11):1490–5. pmid:17084125
  25. 25. Stark T, Walker B, Phillips JK, Fejer R, Beck R. Hand-held dynamometry correlation with the gold standard isokinetic dynamometry: A systematic review. PM R. 2011;3(5):472–9. pmid:21570036
  26. 26. Alemany JA, Pandorf CE, Montain SJ, Castellani JW, Tuckow AP, Nindl BC. Reliability assessment of ballistic jump squats and bench throws. J Strength Cond Res. 2005;19(1):33–8. pmid:15705042
  27. 27. Garnacho-Castaño MV, López-Lastra S, Maté-Muñoz JL. Reliability and validity assessment of a linear position transducer. J Sports Sci Med. 2015;14(1):128–36. pmid:25729300
  28. 28. Villadsen A, Roos EM, Overgaard S, Holsgaard-Larsen A. Agreement and reliability of functional performance and muscle power in patients with advanced osteoarthritis of the hip or knee. Am J Phys Med Rehabil. 2012;91(5):401–10. pmid:22311054
  29. 29. Bassey EJ, Fiatarone MA, O'Neill EF, Kelly M, Evans WJ, Lipsitz LA. Leg extensor power and functional performance in very old men and women. Clin Sci. 1992;82(3):321–7. pmid:1312417
  30. 30. Skelton DA, Kennedy J, Rutherford OM. Explosive power and asymmetry in leg muscle function in frequent fallers and non-fallers aged over 65. Age Ageing. 2002;31(2):119–25. pmid:11937474
  31. 31. Davies CTM, White MJ, Young K. Electrically evoked and voluntary maximal isometric tension in relation to dynamic muscle performance in elderly male subjects, aged 69 years. Eur J Appl Physiol. 1983;51(1):37–43.
  32. 32. McMaster DT, Gill N, Cronin J, McGuigan M. A brief review of strength and ballistic assessment methodologies in sport. Sports Med. 2014;44(5):603–23. pmid:24497158
  33. 33. Trudelle-Jackson E, Jackson AW, Frankowski CM, Long KM, Meske NB. Interdevice reliability and validity assessment of the Nicholas Hand-Held Dynamometer. J Orthop Sports Phys Ther. 1994;20(6):302–6. pmid:7849750
  34. 34. Bohannon RW. Hand-held dynamometry: A practicable alternative for obtaining objective measures of muscle strength. Isokinet Exerc Sci. 2012;20(4):301–15.
  35. 35. Foldvari M, Clark M, Laviolette LC, Bernstein MA, Kaliton D, Castaneda C et al. Association of muscle power with functional status in community-dwelling elderly women. J Gerontol A Biol Sci Med Sci. 2000;55(4):M192–M9. pmid:10811148
  36. 36. Thorborg K, Petersen J, Magnusson SP, Hölmich P. Clinical assessment of hip strength using a hand-held dynamometer is reliable. Scand J Med Sci Sports. 2010;20(3):493–501. pmid:19558384
  37. 37. Bohannon RW. Test-retest reliabilty of hand-held dynamometry during a single session of strength assessment. Phys Ther. 1986;66(2):206–8. pmid:3945674
  38. 38. Portney LG, Watkins MP. Foundations of clinical research: applications to practice. 3rd ed ed. Upper Saddle River, New Jersey: Pearson/Prentice-Hall; 2009.
  39. 39. Ludbrook J. Confidence in Altman-Bland plots: a critical review of the method of differences. Clin Exp Pharmacol Physiol. 2010;37(2):143–9. pmid:19719745
  40. 40. Terwee CB, Mokkink LB, Knol DL, Ostelo RWJG, Bouter LM, De Vet HCW. Rating the methodological quality in systematic reviews of studies on measurement properties: A scoring system for the COSMIN checklist. Qual Life Res. 2012;21(4):651–7. pmid:21732199
  41. 41. Bandinelli S, Benvenuti E, Del Lungo I, Baccini M, Benvenuti F, Di Iorio A et al. Measuring muscular strength of the lower limbs by hand-held dynamometer: a standard protocol. Aging Clin Exp Res. 1999;11(5):287–93.
  42. 42. Malliaras P, Hogan A, Nawrocki A, Crossley K, Schache A. Hip flexibility and strength measures: reliability and association with athletic groin pain. Br J Sports Med. 2009;43(10):739–44. pmid:19282303
  43. 43. Poulsen E, Christensen HW, Penny JØ, Overgaard S, Vach W, Hartvigsen J. Reproducibility of range of motion and muscle strength measurements in patients with hip osteoarthritis: an inter-rater study. BMC Musculoskelet Disord. 2012;13:242. pmid:23217149
  44. 44. Vanpee G, Segers J, Van Mechelen H, Wouters P, Van Den Berghe G, Hermans G et al. The interobserver agreement of handheld dynamometry for muscle strength assessment in critically ill patients. Crit Care Med. 2011;39(8):1929–34. pmid:21572324
  45. 45. Richardson J, Stratford P, Cripps D. Assessment of reliability of the hand-held dynamometer for measuring strength in healthy older adults. Physiother Theory Pract. 1998;14(1):49–54.
  46. 46. Kelln BM, McKeon PO, Gontkof LM, Hertel J. Hand-held dynamometry: Reliability of lower extremity muscle testing in healthy, physically active, young adults. J Sport Rehabil. 2008;17(2):160–70. pmid:18515915
  47. 47. Wikholm JB, Bohannon RW. Hand-held dynamometer measurements: Tester strength makes a difference. J Orthop Sports Phys Ther. 1991;13(4):191–8. pmid:18796845
  48. 48. Arnold CM, Warkentin KD, Chilibeck PD, Magnus CRA. The reliability and validity of handheld dynamometry for the measurement of lower-extremity muscle strength in older adults. J Strength Cond Res. 2010;24(3):815–24. pmid:19661831
  49. 49. Fulcher ML, Hanna CM, Raina Elley C. Reliability of handheld dynamometry in assessment of hip strength in adult male football players. J Sci Med Sport. 2010;13(1):80–4. pmid:19376747
  50. 50. Taylor NF, Dodd KJ, Graham HK. Test-retest reliability of hand-held dynamometric strength testing in young people with cerebral palsy. Arch Phys Med Rehabil. 2004;85(1):77–80. pmid:14970972
  51. 51. Marmon AR, Pozzi F, Alnahdi AH, Zeni JA. The validity of plantarflexor strength measures obtained through hand-held dynamometry measurements of force. Int J Sports Phys Ther. 2013;8(6):820–7. pmid:24377068
  52. 52. Winter DA. Energy generation and absorption at the ankle and knee during fast, natural, and slow cadences. Clin Orthop. 1983;175:147–54. pmid:6839580
  53. 53. Mentiplay BF, Adair B, Bower KJ, Williams G, Tole G, Clark RA. Associations between lower limb strength and gait velocity following stroke: A systematic review. Brain Inj. 2015;29(4):409–22. pmid:25549284
  54. 54. Kaminski TW, Perrin DH, Mattacola CG, Szczerba JE, Bernier JN. The reliability and validity of ankle inversion and eversion torque measurements from the Kin Com II isokinetic dynamometer. J Sport Rehabil. 1995;4(3):210–8.
  55. 55. Pua YH, Wrigley TW, Cowan SM, Bennell KL. Intrarater test-retest reliability of hip range of motion and hip muscle strength measurements in persons with hip osteoarthritis. Arch Phys Med Rehabil. 2008;89(6):1146–54. pmid:18503813