Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Prediction of performance in a 100-km run from a simple equation

  • Jeremy B. Coquart

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    jeremy.coquart@univ-lille.fr

    Current address: Unité de Recherche Pluridisciplinaire Sport, Santé, Société, Eurasport, Loos, France

    Affiliation Univ. Lille, Univ. Artois, Univ. Littoral Côte d’Opale, ULR 7369—URePSSS—Unité de Recherche Pluridisciplinaire Sport Santé Société, Lille, France

Abstract

This study aimed to identify predictive variables of performance for a 100-km race (Perf100-km) and develop an equation for predicting this performance using individual data, recent marathon performance (Perfmarathon), and environmental conditions at the start of the 100-km race. All runners who had performed official Perfmarathon and Perf100-km in France, both in 2019, were recruited. For each runner, gender, weight, height, body mass index (BMI), age, the personal marathon record (PRmarathon), date of the Perfmarathon and Perf100-km, and environmental conditions during the 100-km race (i.e., minimal and maximal air temperatures, wind speed, total amount of precipitation, relative humidity and barometric pressure) were collected. Correlations between the data were examined, and prediction equations were then developed using stepwise multiple linear regression analyses. Significant bivariate correlations were found between Perfmarathon (p<0.001, r = 0.838), wind speed (p<0.001, r = -0.545), barometric pressure (p<0.001, r = 0.535), age (p = 0.034, r = 0.246), BMI (p = 0.034, r = 0.245), PRmarathon (p = 0.065, r = 0.204) and Perf100-km in 56 athletes The, 2 prediction equations with larger sample (n = 591) were developed to predict Perf100-km, one including Perfmarathon, wind speed and PRmarathon (model 1, = 0.549; standard errors of the estimate, SEE = 13.2%), and the other including only Perfmarathon and PRmarathon (model 2, = 0.494; SEE = 14.0%). Perf100-km can be predicted with an acceptable level of accuracy from only recent Perfmarathon and PRmarathon, in amateur athletes who want to perform a 100 km for the first time.

Introduction

The popularity of the ultramarathon has increased tremendously over the last decades, with increasingly more organized events every year [1, 2]. An ultramarathon is currently defined as any running event taking longer than 6 hours [3]. Ultramarathon races are generally held as time-limited events (e.g., 24-hour races) or distance-limited events, such as the 100-km races [4].

The ability to predict running performance is of great interest for athletes and coaches, particularly in the ultramarathon. Indeed, it is helpful for prescribing speeds during tempo runs, determining the optimal pace strategy during the race, and even choosing splitting times [57].

Knechtle et al. proposed a simple equation to predict performance in 24-hour [8] and 100-km [9] races. These authors showed that performance in the 100-km race (Perf100-km) is primarily related to training intensity and volume, as well as to the age of the runners, but less so to their anthropometric characteristics [9]. However, this study was based only on data from male ultramarathoners at the Biel ultramarathon in Switzerland, and the particular conditions of this race (start at 10:00 p.m., first weekend of June, no rain…) make it difficult to generalize these results to other races, especially in France where 100-km races are usually held in the daytime in September/October with generally lower temperatures. Indeed, temperature and thus season have been shown to influence marathon performance in [1013].

The authors [9] were also able to obtain an important indicator: the personal best in the marathon. However, this record may have been achieved years earlier (or even decades) and, although it was associated with Perf100-km from bivariate correlation analysis (p < 0.0001, r = 0.65), this potential predictive variable was excluded from stepwise multiple-regression analysis [9]. Indeed, an old record may sometimes no longer be representative of a runner’s current marathon performance potential and thus of 100-km performance. Therefore, a recent marathon performance (within the last 9 months) may be more appropriate.

The aim of the current study was to identify the predictive variables of Perf100-km and develop an equation for predicting performance, using individual data, a recent marathon performance, and the environmental conditions at the start of the 100-km race.

Materials and methods

Procedure

All French official rankings of the French Athletics Federation (FFA for Fédération Française d’Athlétisme) in 2019 for the marathon (n = 88,455) and the 100-km run (n = 1,560) were retrospectively scrutinized. Only French competitions have been selected. From these rankings, all athletes who had competed in both were retained (n = 591). Then, runners who had not self-reported their weight and/or height and/or birth date were removed from the analysis (n = 533). Thus, 58 athletes were included in this stage. Moreover, runners who maintained a higher speed in the 100-km run than in the marathon were also removed (n = 2). Therefore, 56 athletes were ultimately included in the statistical analysis.

For each athlete, gender (i.e., woman vs man), birth date (to calculate the age), weight and height (to calculate the body mass index: BMI) were collected. Moreover, the race times on the 100-km run (i.e., Perf100-km) and the marathon (i.e., Perfmarathon), attaining (or not) of personal record during the marathon (PRmarathon), and the dates of participation in the marathon (i.e., Datemarathon) and 100-km run (i.e., Date100-km) were recorded, these last in order to determine the moment of the performances (i.e., the number of days since January 1, 2019) and to calculate the interval between the performances in the marathon and the 100-km. Last, for each 100-km race, city, Date100-km, minimal and maximal air temperatures, wind speed, total amount of precipitation, relative humidity and barometric pressure were collected.

This study was approved by the National Ethics Committee for Research in Sports Sciences (CERSTAPS 2019-22-02-31). Moreover, the protocol for this study was legally declared, in accordance with the European General Data Protection Regulations.

Statistical analysis

Standard statistical methods were used to calculate the means and standard deviations (SD).

Pearson product-moment correlations were used to evaluate the bivariate associations between dependent (i.e., Perf100-km) and independent variables (i.e., Perfmarathon, gender, age, weight, height, BMI, PRmarathon, Datemarathon, Date100-km, interval between performances, minimal and maximal air temperatures, wind speed, total amount of precipitation, relative humidity and barometric pressure).

Then, two prediction equations were developed from only the significantly correlated variables (p < 0.10) using stepwise multiple linear regression analysis. The first multiple linear regression analysis included all variables correlated with Perf100-km, while the second analysis included the characteristics of the athletes and their marathon performance but excluded the environmental conditions of the 100-km (i.e., minimal and maximal air temperatures, wind speed, total amount of precipitation, relative humidity and barometric pressure), since it is difficult to predict them accurately well in advance of the race.

The variance inflation factor (VIF) was used to detect the severity of multicollinearity among the independent variables in the regression models.

Fisher’s tests were used to examine the contribution of each variable in the two models, and the results were confirmed by the analysis of the standardized β coefficients.

Moreover, the relationship between the Perf100-km estimated by the prediction equation and actual Perf100-km was analyzed with the Bravais-Pearson method and quantified with Pearson’s correlation coefficient. The 95% confidence interval (95%CI) were also calculated.

The standard error of the estimate (SEE) and percentage of SEE were calculated to establish the accuracy of the prediction equations.

Statistical significance was set at p < 0.05 and all analyses were performed with the SPSS package (release 20.0, Chicago, IL, USA).

Results

The characteristics of the 56 runners are presented in Table 1.

thumbnail
Table 1. Characteristics of 56 athletes and their performances in marathon and 100-km.

https://doi.org/10.1371/journal.pone.0279662.t001

From the sample of 56 subjects, significant bivariate correlations were found between Perfmarathon (p < 0.001, r = 0.838, 95%IC between 0.817 and 0.931), wind speed (p < 0.001, r = -0.545, 95%IC between -0.508 and -0.041), barometric pressure (p < 0.001, r = 0.535, 95%IC between 0.195 and 0.615), age (p = 0.034, r = 0.246, 95%IC between 0.277 and 0.666), BMI (p = 0.034, r = 0.245, 95%IC between 0.330 and 0.697), PRmarathon (p = 0.065, r = 0.204, 95%IC between -0.253 and 0.272) and Perf100-km. Therefore, only these variables were included in models, because they were significantly correlated to Perf100-km with p < 0.10.

Nevertheless, the first stepwise multiple linear regression analysis entered only Perfmarathon, wind speed and PRmarathon as the independent variables to yield the prediction equation, considering that the other variables (i.e., barometric pressure, age and BMI) were redundant with each other. Therefore, to improve the quality of the model (from a larger sample size; n = 591), subjects with missing variables were re-injected into the statistical analysis.

For this new statistical analysis, the characteristics of these 591 runners and environmental conditions are presented in Tables 2 and 3, respectively.

thumbnail
Table 2. Characteristics of 591 athletes and their performances in marathon and 100-km.

https://doi.org/10.1371/journal.pone.0279662.t002

thumbnail
Table 3. Characteristics and environmental conditions during the 100-km races.

https://doi.org/10.1371/journal.pone.0279662.t003

From the sample of 591 subjects, significant bivariate correlations were found between Perfmarathon (p < 0.001, r = 0.696, 95% IC between 0.672 and 0.751), wind speed (p < 0.001, r = -0.394, 95%IC between -0.460 and -0.324), PRmarathon (p = 0.001, r = 0.127, 95%IC between 0.054 and 0.212) and Perf100-km.

The proposed prediction equation (i.e., model 1) is: (model 1) with Perf100-km and Perfmarathon in minutes, wind speed in m.s-1 and PRmarathon = 1 when PRmarathon was performed or 0 when no PRmarathon has been performed.

Very low multicollinearity was found (because VIF < 5) for the independent variables (VIF < 1.069). The increase in r² from adding the second (i.e., wind speed) and third predictors (i.e., PRmarathon) to the prediction equation was significant with F(1,588) = 76.225 (p < 0.001) and F(1,587) = 7.222 (p = 0.007), respectively. Moreover, Fisher’s test revealed a p < 0.001. The performance estimated by the prediction equation (including the three independent variables: Perfmarathon, wind speed and PRmarathon) was significantly correlated with the actual Perf100-km (r = 0.741 and r² = 0.549). The standardized β coefficients and p values on Student’s t-test were 0.639 (p < 0.001), -0.241 (p < 0.001), and 0.075 (p = 0.007) for Perfmarathon, wind speed and PRmarathon, respectively. No autocorrelation in the residuals was noted. The 95%CI between actual and predicted Perf100-km was between 0.719 and 0.788. The SEE for the prediction equation was 97 min, i.e., 13.2%.

The second multiple linear regression analysis (excluding environmental conditions) entered Perfmarathon and PRmarathon as the independent variables and yielded the following prediction equation (i.e., model 2): (model 2) with Perf100-km and Perfmarathon in minutes and PRmarathon = 1 when PRmarathon was performed or 0 when no PRmarathon has been performed.

With VIF = 1.001 for both Perfmarathon and PRmarathon, very low multicollinearity was found for the independent variables. The increase in r² from adding PRmarathon to the prediction equation was significant with F(2,588) = 12.335 (p < 0.001). The performance estimated by the prediction equation (including Perfmarathon and PRmarathon) was significantly correlated with the actual Perf100-km (r = 0.703 and r² = 0.494). The standardized β coefficients and p values on Student’s t-test were 0.692 (p < 0.001) and 0.103 (p < 0.001) for Perfmarathon and PRmarathon, respectively. The analyse of the residuals indicated no autocorrelation. The 95%CI between actual and predicted Perf100-km was between 0.680 and 0.758. The SEE for this second prediction equation was 103 min, i.e., 14.0%.

Discussion

The current study aimed to identify the predictive variables of Perf100-km in order to develop a prediction equation. The results showed significant bivariate correlations between Perf100-km and individual data (i.e., age and BMI), recent performance and attaining (or not) of personal record during the marathon (i.e., Perfmarathon and PRmarathon), and certain environmental conditions at the start of the 100-km race (i.e., wind speed and barometric pressure). However, only Perfmarathon, PRmarathon and/or wind speed during the 100-km race were included in the prediction equations.

Age and BMI were significantly correlated with Perf100-km in the bivariate correlation analysis, but these variables were removed from the multiple linear regression analyses because they were also significantly correlated with Perfmarathon (p < 0.001 and r = 0.433 for age, and p = 0.015 and r = 0.290 for BMI) and were thus predictive variables already included in prediction equations. This outcome is not surprising because Knechtle and colleagues showed that when the personal best Perfmarathon is included in the prediction equation, the addition of individual variables (e.g., BMI) does not improve the accuracy of the predicted time [9, 14, 15]. Moreover, age and BMI are known to be correlated with Perfmarathon [16].

In the present study, the barometric pressure during the 100-km races was negatively correlated with the wind, with a very high correlation coefficient (p < 0.001 and r = -0.998). Thus, when wind speed was entered into the stepwise multiple linear regression analysis, barometric pressure in the prediction equation was no longer related to Perf100-km (i.e., model 1). Recent studies have confirmed the influence of wind speed on Perfmarathon [12, 13]. The current results confirm this for 100-km races; yet it would have been interesting to know the wind direction in order to determine whether it was a head wind, side wind or tail wind. In contrast to these studies [12, 13], a decrease in performance was nevertheless not found in races in the rain. Notably, of the four 100-km races included, only one experienced rainfall, and it involved only three athletes.

The prediction equation including wind speed during the 100-km race provided slightly more accurate predictions (SEE = 13.2 vs 14.0% for models 1 and 2, respectively). However, to allow athletes and/or coaches to predict Perf100-km more simply, model 2 (without wind speed) may be sufficient. Indeed, predicting Perf100-km to help the athlete determine the optimal pace strategy during the race and/or choose splitting times can be simple using a single previous Perfmarathon (in the last 9 months), whereas forecasting environmental conditions (often changing and difficult to forecast far in advance), such as wind speed, can be more complicated (for a low gain; i.e., improvement of 0.8% in accuracy). In the literature, about 10% accuracy is generally accepted as tolerable for predicting running performance [5], especially in amateur athletes who want to perform a 100 km for the first time.

It should be noted that, in the future, the accuracy of predictions might be improved by removing the limitations of the present study. For example, one limitation was the self-declaration of body height and weight and thus the calculation of athletes’ BMI. Height and weight were not measured in this study, but self-reported. Thus, the runners may have under- or overestimated these parameters. Nevertheless, it should be noted that runners are known to self-report their anthropometric data accurately [17]. Also, to avoid the possible influence of physical fitness between the marathon and the 100-km race, the two performances had to be performed within a time interval of 9 months. Yet, this time interval may not be negligible, thus allowing for significant changes in physical fitness. Similarly, although environmental conditions during the 100-km races were collected, this information was not available for the marathons. Therefore, it cannot be ruled out that some runners performed in different environmental conditions in the marathon and the 100-km race (e.g., marathon at 30°C in June vs 100-km at 15°C in September). Last but not least, running performance can be affected by a multitude of potential factors as physiological (e.g., maximal oxygen uptake, running economy, anaerobic threshold…), psychological (e.g., motivation, stress) and environmental (e.g., race profile: uphill, downhill…) variables. These variables have not been collected during the current study. However, a potential perceptive could be to include several of these variables to attempt to develop other models more accurate. However, despite these potential limitations (i.e., physical fitness, environmental conditions and psychological states between the marathon and the 100-km race), the proposed equations had an acceptable level of accuracy in amateur athletes. Nevertheless, a future study should confirm the validity of the 2 models presented in this study fom new sample of athletes.

Conclusion

Perf100-km was significantly correlated with individual data (i.e., age and BMI), recent performance and the attaining (or not) of personal record during the marathon (i.e., Perfmarathon and PRmarathon), and certain environmental conditions at the start of the 100-km race (i.e., wind speed and barometric pressure). However, only Perfmarathon, PRmarathon and wind speed during the 100-km race proved useful to predict Perf100-km. Moreover, for simplicity, model 2 including only Perfmarathon and PRmarathon (in the 9 months prior to a 100-km race) seems to be sufficient to predict Perf100-km with an acceptable level of accuracy (SEE = 14.0%), especially in amateur athletes who want to perform a 100 km for the first time.

Acknowledgments

The author is grateful to the French Athletics Feredation (Fédération Française d’Athlétisme) for data diffusion.

References

  1. 1. Hoffman MD, Ong JC, Wang G. Historical analysis of participation in 161 km ultramarathons in North America. International Journal of the History of Sport. 2010;27(11):1877–91. pmid:20684085
  2. 2. Stöhr A, Nikolaidis PT, Villiger E, Sousa CV, Scheer V, Hill L, et al. An analysis of participation and performance of 2067 100-km ultra-marathons worldwide. International Journal of Environmental Research and Public Health. 2021;18(2):362. pmid:33418867
  3. 3. Zaryski C, Smith DJ. Training principles and issues for ultra-endurance athletes. Current Sports Medicine Reports. 2005;4(3):165–70. pmid:15907270
  4. 4. Knechtle B, Scheer V, Nikolaidis PT, Sousa CV. Participation and performance trends in the oldest 100-km ultramarathon in the world. International Journal of Environmental Research and Public Health. 2020;17(5). pmid:32155703
  5. 5. Coquart JB, Alberty M, Bosquet L. Validity of a nomogram to predict long distance running performance. Journal of strength and conditioning research. 2009;23(7):2119–23. pmid:19855340
  6. 6. Lerebourg L, Guignard B, L’Hermette M, Held E, Coquart JB. Predictions of the distance running performances of female runners using different tools. International Journal of Sports Medicine. In press. pmid:35395690
  7. 7. Lerebourg L, Guignard B, Racil G, Jlid MC, Held E, Coquart JB. Prediction of distance running performances of female runners using nomograms. International Journal of Sports Medicine. In press. pmid:34666415
  8. 8. Knechtle B, Knechtle P, Rosemann T, Lepers R. Personal best marathon time and longest training run, not anthropometry, predict performance in recreational 24-hour ultrarunners. Journal of strength and conditioning research. 2011;25(8):2212–8. pmid:21642857
  9. 9. Knechtle B, Knechtle P, Rosemann T, Lepers R. Predictor variables for a 100-km race time in male ultra-marathoners. Perceptual and Motor Skills. 2010;111(3):681–93. pmid:21319608
  10. 10. El Helou N, Tafflet M, Berthelot G, Tolaini J, Marc A, Guillaume M, et al. Impact of environmental parameters on marathon running performance. PloS One. 2012;7(5):e37407. pmid:22649525
  11. 11. Marc A, Sedeaud A, Guillaume M, Rizk M, Schipman J, Antero-Jacquemin J, et al. Marathon progress: demography, morphology and environment. Journal of Sports Sciences. 2014;32(6):524–32. pmid:24191965
  12. 12. Nikolaidis PT, Di Gangi S, Chtourou H, Rüst CA, Rosemann T, Knechtle B. The role of environmental conditions on marathon running rerformance in men competing in Boston marathon from 1897 to 2018. International Journal of Environmental Research and Public Health. 2019;16(4):614.
  13. 13. Knechtle B, Di Gangi S, Rüst CA, Villiger E, Rosemann T, Nikolaidis PT. The role of weather conditions on running performance in the Boston marathon from 1972 to 2018. PloS One. 2019;14(3):e0212797. pmid:30849085
  14. 14. Knechtle B, Wirth A, Knechtle P, Zimmermann K, Kohler G. Personal best marathon performance is associated with performance in a 24-h run and not anthropometry or training volume. British Journal of Sports Medicine. 2009;43(11):836–9. pmid:18385194
  15. 15. Knechtle B, Wirth A, Knechtle P, Rosemann T. Training volume and personal best time in marathon, not anthropometric parameters, are associated with performance in male 100-km ultrarunners. Journal of Strength and Conditioning Research. 2010;24(3):604–9. pmid:20145568.
  16. 16. Vickers AJ, Vertosick EA. An empirical study of race times in recreational endurance runners. BMC Sports Science, Medicine & Rehabilitation. 2016;8(1):26. pmid:27570626
  17. 17. Nikolaidis PT, Knechtle B. Validity of recreational marathon runners’ self-reported anthropometric data. Perceptual and Motor Skills. 2020;127(6):1068–78. pmid:32539530