Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Confirming the attainment of maximal oxygen uptake within special and clinical groups: A systematic review and meta-analysis of cardiopulmonary exercise test and verification phase protocols

  • Victor A. B. Costa,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Project administration, Writing – original draft, Writing – review & editing

    Affiliations Graduate Program in Exercise Science and Sports, University of Rio de Janeiro State, Rio de Janeiro, Brazil, Laboratory of Physical Activity and Health Promotion, University of Rio de Janeiro State, Rio de Janeiro, Brazil

  • Adrian W. Midgley ,

    Roles Conceptualization, Methodology, Supervision, Writing – review & editing

    Midglead@edgehill.ac.uk

    Affiliation Department of Sport and Physical Activity, Edge Hill University, Ormskirk, England, United Kingdom

  • Julia K. Baumgart,

    Roles Methodology, Writing – review & editing

    Affiliation Centre for Elite Sports Research, Department of Neuromedicine and Movement Science, Norway, University of Science and Technology, Trondheim, Norway

  • Sean Carroll,

    Roles Methodology, Writing – review & editing

    Affiliation School of Sport, Exercise and Rehabilitation Sciences, University of Hull, Hull, England, United Kingdom

  • Todd A. Astorino,

    Roles Methodology, Writing – review & editing

    Affiliation Department of Kinesiology, California State University, San Marcos, CA, United States of America

  • Gustavo Z. Schaun,

    Roles Methodology, Writing – review & editing

    Affiliation Centre for Sport Science and University Sports, University of Vienna, Vienna, Austria

  • Guilherme F. Fonseca,

    Roles Methodology, Writing – review & editing

    Affiliations Graduate Program in Exercise Science and Sports, University of Rio de Janeiro State, Rio de Janeiro, Brazil, Laboratory of Physical Activity and Health Promotion, University of Rio de Janeiro State, Rio de Janeiro, Brazil

  • Felipe A. Cunha

    Roles Conceptualization, Methodology, Supervision, Writing – review & editing

    Affiliations Graduate Program in Exercise Science and Sports, University of Rio de Janeiro State, Rio de Janeiro, Brazil, Laboratory of Physical Activity and Health Promotion, University of Rio de Janeiro State, Rio de Janeiro, Brazil

Abstract

Background and aim

A plateau in oxygen uptake () during an incremental cardiopulmonary exercise test (CPET) to volitional exhaustion appears less likely to occur in special and clinical populations. Secondary maximal oxygen uptake () criteria have been shown to commonly underestimate the actual . The verification phase protocol might determine the occurrence of ‘true’ in these populations. The primary aim of the current study was to systematically review and provide a meta-analysis on the suitability of the verification phase for confirming ‘true’ in special and clinical groups. Secondary aims were to explore the applicability of the verification phase according to specific participant characteristics and investigate which test protocols and procedures minimise the differences between the highest values attained in the CPET and verification phase.

Methods

Electronic databases (PubMed, Web of Science, SPORTDiscus, Scopus, and EMBASE) were searched using specific search strategies and relevant data were extracted from primary studies. Studies meeting inclusion criteria were systematically reviewed. Meta-analysis techniques were applied to quantify weighted mean differences (standard deviations) in peak from a CPET and a verification phase within study groups using random-effects models. Subgroup analyses investigated the differences in according to individual characteristics and test protocols. The methodological quality of the included primary studies was assessed using a modified Downs and Black checklist to obtain a level of evidence. Participant-level data were analysed according to the threshold criteria reported by the studies or the inherent measurement error of the metabolic analysers and displayed as Bland-Altman plots.

Results

Forty-three studies were included in the systematic review, whilst 30 presented quantitative information for meta-analysis. Within the 30 studies, the highest mean values attained in the CPET and verification phase protocols were similar (mean difference = -0.00 [95% confidence intervals, CI = -0.03 to 0.03] L·min-1, p = 0.87; level of evidence, LoE: strong). The specific clinical groups with sufficient primary studies to be meta-analysed showed a similar between the CPET and verification phase (p > 0.05, LoE: limited to strong). Across all 30 studies, was not affected by differences in test protocols (p > 0.05; LoE: moderate to strong). Only 23 (53.5%) of the 43 reviewed studies reported how many participants achieved a lower, equal, or higher value in the verification phase versus the CPET or reported or supplied participant-level data for this information to be obtained. The percentage of participants that achieved a lower, equal, or higher value in the verification phase was highly variable across studies (e.g. the percentage that achieved a higher in the verification phase ranged from 0% to 88.9%).

Conclusion

Group-level verification phase data appear useful for confirming a specific CPET protocol likely elicited , or a reproducible , for a given special or clinical group. Participant-level data might be useful for confirming whether specific participants have likely elicited , or a reproducible , however, more research reporting participant-level data is required before evidence-based guidelines can be given.

Trial registration

PROSPERO (CRD42021247658) https://www.crd.york.ac.uk/prospero.

Introduction

Maximal oxygen uptake () represents the upper physiological limit of utilising oxygen to produce energy during volitional exercise to exhaustion [1]. The original concept emerged in the 1920’s in the seminal works of Hill and colleagues [2, 3]. These authors described this phenomenon as the “ceiling” of oxygen uptake () during a discontinuous step-incremented exercise test, beyond which no additional increase in is observed despite an increase in work rate. In special groups such as apparently healthy children and older adults, and clinical groups such as people with chronic respiratory and metabolic conditions, testing is increasingly recommended and typically determined using an incremental cardiopulmonary exercise test (CPET) with concurrent recording of electrocardiography, blood pressure, and oxyhemoglobin saturation [4]. Applications of CPET include facilitating the diagnosis and evaluating the physiological impact of cardiopulmonary disease [5], evaluating fitness for major surgery [4], deciding on the appropriateness of cardiac transplantation [6], investigating unexplained dyspnea and exercise intolerance [4], prescribing exercise to special and clinical groups [7, 8], and evaluating the chronic effects of exercise training programs and other interventions to promote health [9]. Over 40 outcome variables can be derived from a CPET [10], however, is widely regarded as the most important, since it is considered the gold-standard measure of cardiorespiratory fitness [4]. The importance of cardiorespiratory fitness for health and longevity is highlighted by the scientific statement from the American Heart Association, which calls for cardiorespiratory fitness to be recognised as a “clinical vital sign” [11].

Different ergometers, test protocols, procedures, and criteria have been developed and applied to promote accurate determination, which is significant considering the relationship between and health status. Potential reasons for not obtaining valid values include inappropriate test protocols and poor effort from participants [12]. However, there is no consensus on the best approach to establish whether a ‘true’ has been attained. The primary criterion for verifying that has been attained has been based on establishing a plateau at volitional exhaustion, first proposed in 1955 by Taylor, Buskirk and Henschel [13]. They defined the plateau occurrence as an increase in of less than 150 mL·min-1 (or ≤ 2.1 mL·kg-1·min-1) during a discontinuous step-incremented protocol incorporating 3–5 visits to the laboratory. With advances in technology (specifically, gas analysers that record breath-by-breath pulmonary gas exchange data and electronically programmable treadmills and cycle ergometers), discontinuous step-incremented tests protocols have largely been replaced by more time-efficient continuous ramp and pseudo-ramp protocols [1417]. Ramp protocols typically elicit an accelerated or linear response during the final portion of the test, which reduces plateau incidence, especially in low fit individuals [18], although a high level of variability has been observed between studies [1921]. This variability can be explained by the use of different plateau criterion thresholds [22], sampling intervals [23], test protocols [24], and variation in participant characteristics [25]. This inherent dependency on the test protocol and procedures, and participant characteristics, reduces the robustness of the plateau concept for confirming attainment.

Numerous investigators have not applied a plateau criterion or have relied upon so-called secondary criteria to assess whether has been elicited if a plateau threshold has not been met. Examples of secondary criteria include attainment of 90% age-predicted maximal heart rate, maximal respiratory exchange ratio ≥ 1.10, post-exercise blood lactate concentration ≥ 8 mmol·L-1, and ratings of perceived exertion ≥ 18 [12, 26]. However, secondary criteria may result in an underestimation of , given several studies have observed that participants often satisfy the thresholds at submaximal exercise intensities [20, 22, 27]. Moreover, like the plateau, secondary criteria are dependent on the test protocol and procedures and participant characteristics [12]. Some researchers have therefore encouraged the abandonment of traditional primary and secondary criteria due to their lack of sensitivity and specificity in establishing whether a ‘true’ has been attained [20, 27].

The verification phase is an increasingly recognised procedure for confirming ‘true’ and typically involves performing a subsequent constant work rate (square wave), sub- or supra- peak bout of exercise after the CPET has been voluntarily terminated (incorporating a short or more extended rest period in between the tests) [20, 27]. The supra peak verification phase is conceptually like the discontinuous tests that were used in developing plateau criteria, but has the advantage of requiring only one visit to the laboratory [28]. The verification phase has emerged as a potentially valid alternative for establishing whether a ‘true’ has been attained [28, 29]. A recent meta-analysis of studies recruiting apparently healthy participants reported that unlike traditional criteria, the verification phase is not affected by the test protocol or procedures, or participant characteristics such as sex and level of cardiorespiratory fitness [29]. Considering the plateau is less likely to occur in unfit participants, and that secondary criteria commonly underestimate in special and clinical groups [3032], the verification phase might be particularly useful to establish the occurrence of ‘true’ or, when has not been elicited, the highest possible attainable in these groups. Notably, participant and test protocol characteristics may not allow the attainment of during the CPET or verification phase. This has direct clinical applications in establishing functional capacity and the effectiveness of exercise training, and the subsequent evaluation of health risks in clinical populations. However, no systematic review and meta-analysis has investigated the utility of the verification phase across diverse special and clinical groups according to different CPET and verification phase protocols and procedures. The effect of these factors on the utility of the verification phase in special and clinical groups is therefore unclear.

The primary aim of the current study was to systematically review and provide a meta-analysis on the suitability of the verification phase for confirming ‘true’ in special and clinical groups. Secondary aims were to explore the applicability of the verification phase according to specific participant characteristics and investigate which test protocols and procedures minimise the differences between the highest values attained in the CPET and verification phase.

Methods

Protocol and registration

The systematic review and meta-analysis were performed and reported in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) [33]. A full PRISMA checklist is shown in S1 Checklist. The protocol was registered at https://www.crd.york.ac.uk/prospero (CRD42021247658).

Search strategy

MEDLINE (accessed through PubMed), Web of Science, SPORTDiscus, Scopus, and EMBASE were searched for peer-reviewed literature. The search strategy included terms relating to cardiopulmonary exercise test, verification phase, test, and oxygen uptake, using a combination of entry terms and synonyms. Medical subject heading (MeSH) descriptors were also included in the PubMed search. All studies published from the inception of the databases until the search date (14th October 2023) were sought. All references from electronic search results were imported into Endnote bibliographic software (version X9, Bld 12062, Clarivate) and duplicates were removed. The electronic searches were re-run before the final analysis and further studies were retrieved for inclusion. A list containing the full search strategy for each database is available (see S1 Fig). Backward searching for additional relevant studies was conducted by scrutinising the reference lists of the full-text articles of the initial included studies. Forward searching for additional relevant studies was conducted within electronic databases by scrutinising studies that have cited the initial included studies since their publication.

Eligibility criteria for inclusion were studies published in English or Portuguese language involving 1) Population–individuals affected by any disease, disability or clinical condition, apparently healthy children and adolescents (<18 years of age), and older adults (≥ 65 years of age) according to American College of Sports Medicine and American Heart Association definitions [34]; 2) Type of study–any research design that included at least one CPET and at least one verification phase carried out on a cycle ergometer, while walking or running or wheelchair propulsion on a treadmill, or on a ski-/wheelchair-/arm-ergometer; and 3) Outcome determined using expired gas analysis during the maximal CPET (control) and verification phase (comparator). Studies were excluded if: 1) they involved secondary analysis of previously included studies; and 2) they investigated older adults, and it could not be ascertained whether any of the participants were below 65 years old. Two blinded researchers performed the searches and screening procedures.

Study selection

Studies were screened for inclusion using a three-step approach: 1) titles and abstracts were initially screened for potentially eligible articles; 2) the full texts of all potentially eligible articles were obtained; and 3) all inclusion and exclusion criteria were applied to full-text articles for the final decision on eligibility. Two of the authors independently determined whether each study met the eligibility criteria. Disagreements were resolved by discussion. Fig 1 shows the screening and selection phases.

thumbnail
Fig 1. Flowchart for the systematic review and meta-analysis according to PRISMA guidelines.

Abbreviations: : maximal oxygen uptake.

https://doi.org/10.1371/journal.pone.0299563.g001

Data extraction and management

The following data were systematically extracted to a Microsoft Excel (2013) spreadsheet: 1) total sample size; 2) characteristics of study participants (special population or clinical condition, sex, age, body mass index, and cardiorespiratory fitness); 3) exercise modality; 4) type of CPET protocol; 5) verification phase protocol and procedures (work rate, type of recovery, timing in relation to whether performed on the same day or different day to the CPET, and whether or not a verification phase threshold criterion was used); and 6) outcome measures (mean ± standard deviation [SD] test duration and absolute for the CPET and verification phase). Authors of the original articles were contacted to request data when these were not reported. Non-responses from authors were followed up with a second email.

Quality assessment

The quality of the included studies was independently assessed by two of the authors using a modified version of the Downs and Black checklist [35] (see S2 Fig). Modified versions of this checklist have been employed in reviews in the sport and exercise sciences, which also mainly used cross-sectional studies for their data retrieval [36, 37]. The original checklist comprises 27 items, which are distributed over five sub-scales: reporting (items 1–10), external validity (items 11–13), bias (items 14–20), confounding (items 21–26), and power (item 27) [35]. The Downs and Black checklist was originally designed for intervention studies. Since the present review does not focus on intervention studies, items 8, 9, 12–16, 19, and 22–26 were excluded, and the remaining 14 items included. Furthermore, an additional item was added on whether the included studies provided information on the sampling method. The term “patient” was replaced by “participant”, the term “principal confounders” by “participant characteristics”, and, where applicable, the term “treatment” was interpreted in the context of “testing” [36, 37]. All items, except item numbers 4 and 6, were rated as “Yes” (1 point), “No” (0 points), or “Unknown” (0 points).

For item 4, both the CPET and verification phase needed to be described in sufficient detail, i.e., 1) the duration and magnitude of increments in the CPET; 2) total duration of the CPET; 3) whether the verification phase used a sub- or supra-peak work rate/speed during the CPET; 4) total duration of the verification phase; 5) exercise modality used for the CPET and verification phase; and 6) type and duration of recovery between the CPET and verification phase. Two points were given if all six items were described, and one point was given if four or five of the six items were described. For item 6, both simple outcome data for the major findings of the study, as well as values for and its unit of measurement needed to be sufficiently described for this item to be scored a “Yes” with 2 points. Only providing the percentage change or absolute difference in between the CPET to exhaustion and verification phase was not sufficient. If only one was sufficiently described, it was rated as a "Yes” with 1 point. A third author helped reach consensus where there were disagreements between the primary reviewers. Quality assessment cut-off points were decided on retrospectively and studies were regarded as low (0–8 points), moderate (9–14 points), or good (15–17 points) methodological quality, based on the total score achieved on the modified Downs and Black checklist. The level of evidence (LoE) for the results of the main and subgroup analyses was categorised from very limited to strong by combining the quality scores of each of the studies included (Table 1). A figure showing the results of the quality assessments was constructed using the R studio programme (R Core Team. R: A language and environment for statistical computing. R foundation for statistical computing Vienna, Austria; 2013).

thumbnail
Table 1. Criteria for determining the level of evidence based on the quality of studies included in the main analysis and each of the subgroup analyses (adjusted from the criteria provided by Van Tulder, Furlan [38]).

https://doi.org/10.1371/journal.pone.0299563.t001

Statistical analysis

All meta-analyses were performed using the Review Manager (RevMan) software version 5.3 (Copenhagen, The Nordic Cochrane Centre, The Cochrane Collaboration, 2014). Data are presented as the mean ± SD, following Cochrane Handbook guidelines [39]. These guidelines state that to perform a meta-analysis of continuous data, authors should utilise the mean value, standard deviation, and number of participants for whom the outcome was measured in each intervention group or test protocol. The outcome was the mean difference (95% confidence interval [CI]) between the CPET and verification phase for the highest absolute value in L·min-1. Given that absolute are continuous data, the weighted mean difference method was used for combining study effect size estimates. With the weighted mean difference method, the pooled effect estimate represents a weighted mean of all included study group comparisons. The weighting assigned to each individual study group (i.e., the comparison of CPET and verification phase results) in the analysis was inversely proportional to the variance of the absolute . This method typically assigns more weight in the meta-analysis to studies with higher precision (inverse variance) and larger sample sizes. The weighted mean differences were calculated using random-effects models given the study group differences in participants’ characteristics, CPET modalities and protocols, types of recovery, and verification phase protocols. These differences in both participants and protocols characteristics allow that the effect size could vary from study to study. A standardised mean difference method with pooled effect estimate represents a weighted standardised mean of all included study group comparisons and was included as a sensitivity check given the differences in absolute observed across different clinical groups.

Heterogeneity of net group changes in absolute was examined using the Q statistic. Cochran’s Q statistic is computed by summing the squared deviations of each trial’s estimate from the overall meta-analytic estimate and weighting each trial’s contribution in the same manner as in the meta-analysis. The p-values were obtained by comparing the statistic with a χ2 distribution with k-1 degrees of freedom (where k is the number of trials). A p-value of < 0.10 was adopted since the Q statistic tends to suffer from low differential power [40]. The formal Q statistic was used in conjunction with the methods for assessing heterogeneity. The I2 statistic was used to measure the extent of inconsistency among the results of the primary study groups, interpreted approximately as the proportion of total variation in point estimates due to heterogeneity rather than sampling error. Effect sizes with a corresponding I2 value of ≤ 50% were considered to have low heterogeneity. Potential publication bias or studies with outlier data were assessed using a funnel plot.

Subgroup analyses were defined a priori to investigate the magnitude of differences between CPET and verification phases due to variations in group characteristics, exercise modality, CPET protocol design, or how the verification phase was performed. The following subgroups based on medical conditions and participant characteristics were considered: paediatric (obese and non-obese under 18 yrs), geriatric (≥ 65 yrs), wheelchair (elite wheelchair athletes and individuals in a wheelchair without spinal cord injury or spina bifida), respiratory (cystic fibrosis, chronic asthma/airway disorders, and bronchiectasis), metabolic (overweight and obese adults with and without metabolic syndrome or hypertension), oncological, and cardiological. Forest plots were constructed to display values at the 95% confidence level. Effect sizes were calculated by subtracting the highest mean values for absolute observed in the CPET from the verification phase values, based on grouping studies with selected verification phase characteristics for work rate (i.e., sub vs. supra peak work rate) and type of recovery between the CPET and verification phase (i.e., active vs. passive). The studies were also classified according to whether a criterion threshold for was used for the verification phase, involving an absolute or relative differences in between the CPET and verification phase (i.e., yes vs. no), typically characterised as a percentage difference between tests. In addition to the meta-analytical approach, a participant-level analysis was conducted with data directly reported in the reviewed studies or those supplied by the corresponding authors. The analysis of the differences between the highest values elicited in the CPET and verification phase was based on the threshold criteria utilised by the studies or the error of measurement of the metabolic analysers that were used.

Results

The literature search identified 2108 potential studies, of which 2082 were obtained from electronic databases and 26 from manual searches through a wider inspection of reference lists and citations of these articles. Forty-three articles published between 1993 and 2023 met the eligibility criteria and were included in the systematic review, whilst 30 presented relevant quantitative information to be considered for meta-analysis (see Fig 1).

Tables 2 and 3 show, respectively, the sample characteristics for the reviewed studies and the exercise testing protocols used to measure the in all participants. Twenty-two studies (51%) used continuous step-incremented protocols, 19 (44%) used ramp-incremented protocols, one used a discontinuous protocol (2%), and one included both continuous and discontinuous step-incremented protocols (2%). Twenty-three studies (53%) used one or more traditional criteria, of which 17 (39%) used a plateau, 16 (37%) used a heart rate plateau or criteria based on the age-predicted maximal heart rate, 17 (39%) used maximal respiratory exchange ratio, 5 (12%) used post-exercise blood lactate concentration, and 6 (14%) used ratings of perceived exertion cut-off values.

thumbnail
Table 2. Sample characteristics of the reviewed studies (N = 43).

https://doi.org/10.1371/journal.pone.0299563.t002

thumbnail
Table 3. Characteristics of the cardiopulmonary exercise test (CPET) and verification phase protocols used in the reviewed studies (N = 43).

https://doi.org/10.1371/journal.pone.0299563.t003

Regarding respiratory expired gas analysis procedures, smoothing of pulmonary gas exchange data is required during exercise testing for the determination of , especially for data collected from participants on a breath-by-breath basis. The most common approach was based on time averages. Thirty-two studies (74%) reported using time averages of between 10 and 30 s, two (5%) used moving time averages, two (5%) applied 12-breath rolling averages, and three (7%) did not describe which data processing method was applied. Amongst more traditional expired gas collection techniques, four studies (9%) used Douglas bag collections of between 30 and 60 s. No study addressed the effect of different sampling intervals on the difference between the peak values attained in the CPET and verification phase.

Regarding the type of recovery between CPET termination and the start of the verification phase, 10 studies (23%) used active recovery, 11 (26%) used passive recovery, 8 (19%) adopted a combination of passive and active recovery, and 11 (26%) did not report the type of recovery. The verification phase was carried out on a different day as the CPET in three studies (7%). When the verification phase was performed on the same day as the CPET, the recovery period varied from 4 to 25 min. Ten studies (23%) used a 10-min recovery, which was the most common. Two articles (5%) did not state the duration of the recovery period used in the studies.

Twenty-five studies (58%) used square-wave verification phase protocols (i.e., the work rate was immediately increased to the target sub or supra peak work rate), whereas 17 (40%) used multistage verification phase protocols characterised by an initial warm-up stage. Only one study (2%) did not describe the verification phase protocol. The peak work rate used in the verification phase protocols ranged from 80% to 110% of the peak work rate attained in the CPET across studies. Most studies applied a supra peak work rate based on the peak work rate achieved in the CPET (n = 33; 77%). Three studies (7%) applied both sub and supra peak work rates within the same study, two (5%) used peak work rate, four (9%) applied only a sub peak work rate, and one (2%) did not describe the verification phase work rate. The mean times to exhaustion for the CPET and verification phase were 568 s (SD, 143 s) and 127 s (SD, 57 s), respectively.

Twenty-two studies (51%) employed threshold criteria to analyse differences between the highest attained in the CPET and verification phase and were frequently based on the intra-subject coefficient of variation acquired from the researchers’ laboratories or from published literature. Threshold criteria included a difference in (L·min-1) of < 2%, < 3%, < 5% and < 9%, and an absolute difference between measured and predicted from linear extrapolations of / work rate responses during the CPET (such as < 50% of the “expected” increase). Other cut-off points included a typical error of 0.06 L·min-1, 2.1 mL·kg-1·min-1, or when the comparison of the highest group mean value obtained in the CPET versus the verification phase resulted in p > 0.05.

Methodological quality of the included studies

There was 86% agreement between the two authors in ranking the items initially, and full agreement was reached upon discussion with a third author. Two studies (5%) were rated as having low methodological quality, 37 (86%) as moderate, and four (9%) as good (Fig 2). The quality of the studies included in the main and subgroup comparisons were used to underpin the respective LoE.

thumbnail
Fig 2. Quality scores for the 43 included studies.

Grey dots for items scored as ‘no’, yellow dots are for items scored ‘yes’ (1), and blue dots for ‘yes’ (2). Black dots are added where authors retrospectively provided extra information that would have led to a higher quality appraisal score if included in the original publication.

https://doi.org/10.1371/journal.pone.0299563.g002

Group-level quantitative data synthesis: Differences between the highest attained in the CPET and verification phase

Table 4 shows comparisons between the highest values elicited in the CPET and verification phase for each study. Fig 3 displays the forest plot of effect sizes and 95% CIs for the highest values (30 studies) based on the random effects meta-analysis results. The highest was not different between CPET and verification phase (mean difference = -0.00 [95% CI = -0.03 to 0.03] L·min-1, p = 0.87; LoE: strong). Given the potential for large heterogeneity in mean values across the different populations included in the review, the overall and subgroup meta-analysis findings were robust to a sensitivity check using the alternative statistical approach of using standardised mean differences. This method resulted in a different weighting pattern to the individual study group differences between the CPET and verification phase values (data not presented). However, the overall effect was unchanged (standardised mean difference = -0.01 [95% CI = -0.09 to 0.08] L·min-1, p = 0.87, LoE: strong). Pooled data for following the CPET and verification phase showed no significant heterogeneity among all the studies (see Fig 3). Except for one of the included studies judged to be an outlier [44], the meta-analysed studies were judged to have a low-risk of bias as shown by the funnel plot (Fig 4).

thumbnail
Fig 3. Forest plot for all the studies included in the meta-analysis (n = 30) for the highest (L·min-1) attained in the cardiopulmonary exercise test (CPET) and verification phase using a random-effects model.

Data are reported as mean differences adjusted for control data (95% CI).

https://doi.org/10.1371/journal.pone.0299563.g003

thumbnail
Fig 4. Funnel plot assessment of publication bias for the highest (L·min-1) attained in the cardiopulmonary exercise test (CPET) and verification phase.

One outlier identified, which may relate to methodological error within the verification phase protocol (see discussion section).

https://doi.org/10.1371/journal.pone.0299563.g004

thumbnail
Table 4. Overall group-level comparisons for the reviewed studies for the highest values attained in the cardiopulmonary exercise test (CPET) and verification phase (VP) (N = 30).

https://doi.org/10.1371/journal.pone.0299563.t004

There was no statistically significant difference between CPET and verification-derived for the paediatric group that included seven studies with 18 experimental conditions (mean difference = -0.01 [95% CI = -0.05 to 0.03] L·min-1, p = 0.68, LoE: moderate). Additionally, the subgroup analysis of obese and non-obese paediatric participants, composed of two studies with four experimental conditions, revealed no statistically significant difference (mean difference = -0.11 [95% CI = -0.24 to 0.01] L·min-1, p = 0.08, LoE: moderate). The wheelchair group consisted of three studies with eight experimental conditions, and no statistically significant difference was observed between the CPET and verification phase (mean difference = 0.11 [95% CI = -0.02 to 0.23] L·min-1, p = 0.10, LoE: strong). The chronic respiratory group consisted of 6 studies and 11 experimental conditions that demonstrated no statistically significant significance between the CPET and verification phase (mean difference = 0.07 [95% CI = -0.02 to 0.17] L·min-1, p = 0.14, LoE: strong). The 15 girls from the study by Robben et al. [69] were removed from the meta-analysis, since the reported SD was an extreme outlier (e.g., the study would have been weighted at 40.7% in the final meta-analysis). The subgroup of paediatric patients with cystic fibrosis included four studies and seven experimental conditions (mean difference = 0.06 [95% CI = -0.13 to 0.25] L·min-1, p = 0.55 LoE: strong). The geriatric group incorporated four studies with seven experimental conditions, and results showed no statistically significant difference between the CPET and verification phase (mean difference of -0.08 [95% CI = -0.21 to 0.05] L·min-1, p = 0.20 LoE: moderate). Finally, the metabolic group including individuals with overweight or obesity (with and without metabolic disease), comprised six studies and 13 experimental conditions. These studies used ramp-based cycle ergometry and demonstrated no statistically significant difference between CPET and the verification phase (mean difference = -0.06 [95% CI = -0.13 to -0.01] L·min-1, p = 0.09).

Regarding the CPET and verification phase protocols (Fig 5 combines all subgroup analyses into the same category), there were no statistically significant differences for verification phase work rate (mean difference = 0.00 [95% CI = -0.04 to 0.04] L·min-1, p = 0.98, LoE: strong), recovery mode (mean difference = -0.02 [95% CI = -0.07 to 0.02] L·min-1, p = 0.29, LoE: strong), adoption of verification phase criteria (mean difference = -0.02 [95% CI = -0.06 to 0.03] L·min-1, p = 0.43, LoE: strong), verification phase performed on the same or on a separate day as the CPET (mean difference = 0.00 [95% CI = -0.04 to 0.04] L·min-1, p = 0.97, LoE: strong), or verification phase duration (mean difference = 0.01 [95% CI -0.05 to 0.07] L·min-1, p = 0.82, LoE: strong). All combined categories were not statistically significantly different when analysed separately (p = 0.28 to p = 0.93, LoE: strong). The exercise mode used did not influence the results for treadmill (mean difference = -0.02 [95% CI -0.07 to 0.03] L·min-1, p = 0.41, LoE: moderate) or cycling (mean difference = 0.00 [95% CI -0.04 to 0.04 L·min-1, p = 0.93, LoE: strong) across all included studies. The highest attained in the CPET was similar among protocols and their subsequent verification phase (p = 0.30; p = 0.43 and p = 0.56) for ramp, discontinuous, and continuous step-incremented protocols, respectively; LoE: moderate to strong.

thumbnail
Fig 5. Mean differences (95% confidence intervals [CIs]) between the highest values (L·min-1) in the cardiopulmonary exercise test (CPET) and verification phase according to the verification-phase characteristics for work rate (i.e., sub WRpeak vs. supra WRpeak), recovery (i.e., active vs. passive), adoption of a criterion threshold (i.e., yes vs. no), timing (performed on the same day vs. a different day to the CPET), and duration (i.e., no longer than 80 s, from 81 to 120-s and longer than 120-s).

https://doi.org/10.1371/journal.pone.0299563.g005

Participant-level analysis of the highest values attained in the CPET and verification phase

Only 23 (53.5%) of the 43 reviewed studies reported how many participants achieved a lower, equal, or higher value in the verification phase versus the CPET or supplied participant-level data from which this information could be obtained. Table 5 shows the percentages of participants that achieved a lower, equal, or higher value in the verification phase versus the CPET for each study where this information was available. Fig 6 shows participant-level differences between the highest values obtained in the CPET and verification phase for the seven studies where these data were available [31, 4447, 56, 60].

thumbnail
Fig 6. Participant-level differences between the highest values obtained in the cardiopulmonary exercise test (CPET) and verification phase for the seven studies where these data were available.

https://doi.org/10.1371/journal.pone.0299563.g006

thumbnail
Table 5. Frequencies and percentages for whether the verification phase (VP) elicited a lower, similar, or higher than the cardiopulmonary exercise test (CPET) for the 23 studies that mentioned or reported this information.

Participant-level data were supplied by the authors for the first seven studies in the table.

https://doi.org/10.1371/journal.pone.0299563.t005

Discussion

Main findings

To the best of our knowledge, this is the first systematic review and meta-analysis to investigate the utility of the verification phase for confirming in special and clinical groups. The major findings were: a) overall, the highest attained in the CPET was not statistically significant different to that obtained in the verification phase across all primary studies included; b) subgroup analysis showed there were no statistically significant differences in the highest attained in the CPET and verification phase for specific groups; c) across all studies, the difference between the highest attained in the CPET and verification phase was not affected by test protocol characteristics; d) participant-level verification phase data might be useful for providing evidence of whether the CPET likely elicited in a given person; e) a plateau in the CPET does not always confirm since the verification phase is sometimes higher in the presence of a plateau; and f) the included studies did not report any adverse events associated with the verification phase.

The mean absolute difference in the highest attained in the verification phase and CPET was less than 1% for the 30 studies included in the meta-analysis. This difference was similar to that found in our recent meta-analysis that investigated the utility of the verification phase in apparently healthy adults [29]. The global analysis (see Fig 3) revealed a statistically significantly higher mean in the CPET for only one study [44]. This study involved 14 elite para ice hockey players and was the only study that used a ski ergometer [44] involving an upper-body poling technique. The CPET involved an initial WR associated with a value of 11 on the Borg RPE scale, followed by WR increments of 10 W every 30 s. The verification phase was performed at 110% of WRmax achieved in the CPET. The rapid WR increments during the CPET led to a relatively short mean ± SD test duration of 365 ± 58 s (unpublished data) and a higher peak WR than would have been the case for a longer protocol. Consequently, the verification phase WR might have been excessively high and resulted in a mean ± SD test duration of only 90 ± 13 s (unpublished data), likely due to early local fatigue of the upper-body musculature. These findings indicate that a longer CPET duration and/or lower WR for the verification phase are required for achieving a verification phase of sufficient length in athletes with lower-limb impairments performing upper-body exercise modalities.

A plausible explanation for the contrasting findings between this study and the other 29 meta-analysed studies is that the verification phase workload was presumably too high to attain similar values because it induced fatigue of the limited active upper body musculature. These results suggest the adoption of a lower percentage of peak work rate might be a better strategy for this specific population.

Paediatric group

Similar values were observed between the CPET and verification phase in paediatric participants. Accordingly, two original studies found no significant difference in the attained in the CPET and verification phase in children [30, 55]. Conversely, a study involving 128 children and adolescents (76 male and 52 females; aged 9.3–17.4 years with and without overweight [71]), reported a significantly higher in the CPET, although the authors reported a strong correlation (r = 0.94) between the tests. When comparing the highest values obtained in the CPET and verification phase on an participant-level basis, 88% (n = 112) of children and adolescents had their highest in the CPET verified as their ‘true’ (< 5% difference between protocols). For the remaining 12% of participants who did not have their verified, the highest recorded in the verification phase was 6–23% higher than in the CPET. The authors recommended the verification phase be used to confirm attainment in paediatric individuals. In another study [46], eight non-obese children had their confirmed in the verification phase (< 3% difference in ), while 15 achieved a higher (> 3% difference in between protocols) in the verification phase. Consequently, these data suggest the suitability of the verification phase to confirm has been attained in non-obese, overweight, and obese children.

Wheelchair group

The subgroup of people using a wheelchair attained similar highest values between protocols. Of the three studies included in this sub-analysis, two investigated the verification phase as the focus of the study [42, 56]. The first recruited 10 individuals with spinal cord injury, who performed a verification phase at 105% of the peak work rate (WRpeak) attained in the CPET, attained highest values that were not significantly different between the CPET and verification phase. Hence, the authors recommended a verification phase to confirm ‘true’ in people with spinal cord injury. Similarly, 24 trained wheelchair athletes (tetraplegics, paraplegics and non-spinal cord-injured) had their confirmed through a verification phase performed at the same treadmill speed, but a gradient of 0.3% to 0.6% higher than that of the CPET treadmill gradient.

Chronic respiratory group

The verification phase also confirmed in the chronic respiratory group. Fourteen paediatric and 28 adults with cystic fibrosis performed a supra peak verification phase at 110% WRmax. Twelve paediatric (85.7%) and 27 adults (96.4%) had their confirmed through the verification phase by applying a threshold of 9% difference between CPET and verification phase protocols, which represented the within-subject variability in for this population [50]. No exercise-induced hypoxemia or perceived discomfort were reported. The authors concluded that a verification phase should be incorporated to confirm that a has been attained. Sixteen adolescents with mild-to-moderate cystic fibrosis (8 males and 8 females; age 14.6 ± 1.7 years) also achieved similar highest values in the CPET and verification phase [78]. Notably, the verification phase was reported as being well tolerated. These results reinforce the robustness of the verification phase as a tool to confirm in people with chronic respiratory limitations.

Geriatric group

Regarding the geriatric group, our results demonstrated no significance difference in the highest observed in the CPET and verification phase. The lack of difference in between protocols was reported in eight older adults who performed a verification phase at 85% WRmax and 23 older adults who exercised at 105% WRmax. Moreover, the highest values observed in the CPET and verification phase were highly correlated (r = 0.99). The authors emphasised that only minor within-individual differences were observed between the highest elicited during the CPET and verification phase [65]. Another study investigated the applicability of the verification phase performed at 85% (n = 22) and 110% (n = 20) WRmax in older adults [77]. The mean highest across participants was not significantly different between both sub and supra peak verification phases and the associated CPET. Although not included in any of our subgroup analyses, Bowen, Cannon [49] showed that group highest values were similar between CPET and a verification protocol performed at 95% WRpeak (14.5 ± 3.0 vs. 14.7 ± 3.1 mL·kg−1·min−1) within 24 older adults with chronic heart failure. However, within-subject comparisons confirmed in only 14 of 24 patients. Importantly, the verification phase protocol was cited to accurately and reliably confirm (or refute) along with a measurement sensitivity (95% confidence interval) that is specific for each individual. Likewise, using a similar protocol, Manresa-Rocamora, Fuertes-Kenneally [58] showed was confirmed in 11 (52.4%) patients and not confirmed in 10 (47.6%). The authors concluded that a submaximal verification phase performed at 95% WRpeak is a safe and suitable method to determine in patients with heart failure with reduced ejection fraction. However, this primary study was not included in the meta-analysis since the authors reported non-parametric variables related to and verification phases, and a non-parametric 95% CI for the differences were used to compare the CPET and verification phase.

Metabolic group

Our final group sub-analysis within the metabolic group revealed a trend for an attenuated in the CPET compared to the verification phase in individuals with overweight or obesity, with or without metabolic disease or prehypertension. Notably, all six studies in the subgroup analysis used cycle ergometry. In a trial with four verification phases performed at 80, 90, 100 and 105% WRmax, eight of nine men with obesity attained a higher in the verification phase [57]. Although not statistically significant, the highest during the verification phase at 90% WRmax was 0.24 L·min-1 higher than during the CPET, which is a 7% difference. Other data from studies with large samples of individuals who are sedentary, have obesity, and metabolic disease, indicate that the magnitude (3%-9%) and prevalence (40% of people with overweight/obesity) of the underestimation of during a CPET is high. Regardless, the results from the current meta-analysis and these primary studies indicate that the verification phase appears to be a robust method for confirming in individuals with overweight or obesity, with or without metabolic disease or prehypertension.

Verification phase characteristics

Regarding verification phase characteristics, Fig 5 illustrates the combined effects of work rate, recovery mode, use of a verification phase threshold, day of the test, and protocol duration. No significant results were found for the combined analysis, nor when each category was analysed separately. Consequently, a specific verification phase protocol cannot currently be recommended for any of the groups included in the present systematic review. These results agree with those found in apparently healthy adults, where the verification phase was not affected by test protocol characteristics [29]. Although was similar among different protocols, notably, no included study applied a sub WRpeak verification phase on a treadmill. It therefore remains unclear if relative exercise work rate affects the suitability of the verification phase for confirming on a treadmill in special and clinical groups.

Regarding the mode of recovery (i.e., active or passive) between the CPET and verification phase, our meta-analysis demonstrated that the 20 mL·min-1 mean difference between the highest values observed in the CPET and verification phase was not statistically significant. Similar to a recent systematic review that investigated the utility of the verification phase in apparently healthy adults [29], we did not find any study that compared active and passive recoveries. In the present systematic review, the time between the CPET and verification phase ranged from 4 to 25 minutes in studies where these tests were performed on the same day. Although no single study compared the verification phase conducted on the same day versus a different day to the CPET, no statistically significant difference was found in the present review after combining the results. We therefore recommend either an active or passive recovery after the CPET, with no need for an additional visit on a separate day.

Considering the importance of a verification phase being individually analysed to identify those who confirmed their , we suggest the adoption of a threshold criterion to compare the differences between the CPET and verification phase. The threshold criterion value has commonly been based on the reproducibility of during the CPET and specific to the metabolic cart that was used, or an arbitrary 2–3% difference has been used. Alternatively, some studies have calculated the differences between measured and predicted [4547, 61, 62]. However, we did not obtain significant differences between studies that did or did not apply a threshold criterion. We systematically recorded whether authors of primary studies commented specifically on participant level data. In fact, 22 studies (51%) in the present systematic review discussed participant level differences between the highest attained in the CPET and verification phase. We strongly suggest future researchers report participant-level data, since the absence of any significant mean differences in the highest attained in the CPET and verification phase, may mask individuals who attain a practically significant higher in the verification phase.

The combined effects of verification phase duration (i.e., short < 80 s, medium 81–120 s, and long > 120 s) resulted in no statistically significant difference between the highest observed in the CPET and verification phase. Similarly, a previous study did not find any significant correlations between different durations at 110% WRmax for the highest attained in the CPET and verification phase [77]. Furthermore, 12 paediatric participants with cystic fibrosis confirmed their and two showed 9% higher values during a verification phase performed at 110% WRmax (76 ± 22 s). A brief duration may be inadequate to allow sufficient time for oxygen uptake to achieve maximal values, especially in individuals having slow O2 kinetics such as those with metabolic or respiratory disease [8183]. Verification phases shorter than 80 s might elicit , however, we recommend future studies implement strategies to avoid inappropriately short verification phases and early termination, such as the adoption of multistage verification protocols that incorporate submaximal and supra peak (in relation to the CPET WRmax) work rates.

Participant-level data

Whether the verification phase confirmed that was likely elicited during the CPET was highly variable across the 23 studies that either reported this information directly, or where participant-level data were available to obtain this information. For example, the verification phase elicited a higher than the CPET in 0% to 88.9% of participants across studies. This large variability between studies is likely due to differences in the CPET and verification phase protocols that were used for the various special and clinical groups involved.

The verification phase failed to confirm the highest in all participants in one study involving para ice hockey players [44], which included a CPET with rapidly-incrementing work rates and a verification phase performed at 110% of the WRmax achieved in the CPET. A longer CPET duration with a slower ramp rate and/or a lower WR in the verification phase might be more effective in eliciting a higher when using upper-body exercise modalities in athletes with lower-limb impairments. In other studies where participants elicited a higher in the CPET, the percentage ranged from 3.6% to 35%. The study that observed 35% involved 22 apparently healthy older adults that performed verification phases at 85% and 110% WRmax [77]. The authors concluded that 85% WRmax was preferable for this population as it more likely confirmed .

Four out of seven people after stroke achieved a that was > 3% higher in the verification phase versus the CPET (range 4.7% to 24.1%) [60]. Similar to the para ice hockey athletes [44], it is unclear whether the same limiting factors during the CPET also occurred during the verification phase, leading to submaximal values in both protocols. Twelve of 18 (66.7%) children elicited a higher in the verification phase compared to the CPET [45]. A notable finding is five of the seven children that exhibited a plateau during the CPET elicited a higher during the verification phase. One study involving nine men with obesity compared verification phases performed at 80%, 90%, 100% and 105% of WRmax achieved in the CPET [57]. Eight of the men attained a higher during a verification phase, with the submaximal verification phase performed at 90% WRmax eliciting the highest values. These findings are similar to those observed in older adults in that a sub-peak verification phase during testing appears preferable for adults with obesity, at least during cycle ergometry. In another study including adults with obesity [72], a verification phase performed at 100% WRmax elicited a that was 2% to 21% higher than in the CPET in 13 out of 19 participants. A plausible explanation for these findings is that the CPET work rate increment of 30 W·min-1 for men was inappropriately high for this group, resulting in submaximal values during the CPET. Two studies included older adults with hypertension [32, 74]. In the first study [32], 18 out of 33 participants achieved a higher in the verification phase compared to the value obtained in the CPET (ranging from 3 to 22.1%). Notably, 29 out of 33 tests would have been validated as eliciting if the traditional criteria were utilised (i.e. false positives). In the second study [74], 9 out of 12 participants obtained a verification-derived that was 4.9 to 21% higher than the value observed in the CPET at baseline and after a 12 week exercise program. All participants but one would have had their tests validated as eliciting if traditional criteria were applied. Results from the present systematic review therefore question the validity of the plateau as the primary criterion for establishing the occurrence of [13] and highlights the importance of performing a verification phase even in the presence of a plateau in the CPET.

Midgley et al. [12] suggested that it is unlikely that a person would give identical submaximal efforts in the CPET and verification phase, thereby rationalising that similar highest values in the CPET and verification phase likely verifies that has been elicited. Whether this supposition is true is debatable, especially in certain clinical populations where exercise might be terminated due to symptoms related to the clinical condition. Examples are people with pulmonary disease where undue breathlessness typically results in test termination before any cardiovascular limitation [84], severely deconditioned individuals with very poor exercise tolerance, and those conditions associated with chronic pain made worse with exercise. In such people, the verification phase might simply verify the reproducibility of the associated with symptom tolerance (i.e., ). This is likely also true for CPET involving upper body ergometry, where the CPET is often limited by peripheral fatigue [85].

Limitations and recommendations for future research

A limitation of the present systematic review is that 13 (30%) of the 43 studies included in the systematic review were not included in the meta-analysis due to unsuccessful attempts to acquire the required unpublished information from the corresponding authors for these studies. Some authors responded to our correspondence declaring they no longer have access to the requested data. In regards group-level data, the verification phase appears useful for helping verify that, on average, a specific CPET protocol elicited in special and clinical groups. A verification phase therefore seems unnecessary for this purpose if a new study uses the same CPET protocol with the same population as a previous study that confirmed the CPET was appropriate for eliciting using a verification phase.

Another limitation of the present systematic review is that participant-level data were available for only seven of the 43 reviewed studies, limiting inferences that could be made. We recommend that participant-level data are made available for future studies. Further research is needed to optimise verification phase protocols so that evidence-based guidelines can be published in the future. This includes identifying robust verification threshold criteria to establish when the highest values elicited in the CPET and verification phases are sufficiently similar for or to be ‘verified’. Few studies have directly compared test protocols and this is a particularly important area for future research.

Further critical discussion is also needed among researchers regarding how the verification phase should be interpreted when applied to participant-level data. For example, In the first edition of the Canadian Association of Sports Sciences guidelines for the physiological testing of high-performance athletes, Thoden et al. [86] suggested that the highest value elicited in either the CPET or verification phase should be regarded as . In the second edition of the guidelines, however, Thoden et al. [87] recommended that an increase in the highest elicited in the verification phase that is not more than 2% higher than that elicited in the CPET, verifies was elicited in the CPET. At the participant level, the verification phase can therefore be used to verify whether was likely elicited in the CPET or used simply as another opportunity for a person to elicit .

Conclusion

The main finding was that the mean difference in between the CPET and associated verification phase was similar. In other words, the verification phase confirmed that had been attained in the CPET. Moreover, a 10-to-15-minute recovery phase and short verification phase following a CPET appears to be safe, well-tolerated, and time-efficient for a diverse range of special and clinical groups. Unlike traditional criteria, the application of the verification phase was not affected by differences in test protocol or procedures, or participant characteristics, except perhaps for those with overweight or obesity. In those with overweight or obesity, the attained in the CPET was significantly lower than that obtained in the verification phase. For individuals with obesity and apparently healthy older adults, the selection of sub peak work rates above critical power is desirable. For the remaining groups, it remains somewhat unclear which work rate is most suitable. For paediatrics, a verification phase of 100–105% WRmax conducted 15-min after the CPET on a cycle or treadmill has been useful. For the wheelchair group, we can only advise to use a verification phase < 110% WRmax. Regarding individuals with chronic respiratory problems, a cycle ergometer is applicable for a verification phase of 3-min at 20 W, then 110% WRmax, performed after 5-min of active rest and 10-min of passive rest after the CPET. Older adults, including those with chronic heart failure, may perform a verification phase 5–10 min after the CPET, either on a cycle ergometer or treadmill. If cycling is chosen, a work rate equal to 85–95% WRpeak would optimise attainment. On a treadmill, 2-min at 50%, 1-min at 70%, and then one stage higher than WRpeak is recommended. Lastly, adults with obesity or metabolic conditions may perform a verification phase above critical power on a cycle ergometer up to 110% WRpeak on the CPET.

Some researchers might decide not to conduct a verification phase where results from the present meta-analysis indicate that particular CPET test protocols and procedures for certain special and clinical groups appear to elicit ‘true’ values. However, there remains the issue of identifying whether a participant has likely elicited during a CPET or elicited a submaximal value due to early test termination. Considering the limitations associated with traditional criteria, the verification phase remains a reasonable time-efficient alternative in special and clinical populations; however, further research and critical debate are required before robust evidence-based guidelines can be provided.

Supporting information

S1 Fig. Search strategies for all databases.

https://doi.org/10.1371/journal.pone.0299563.s002

(DOCX)

S2 Fig. Modified downs and Black checklist.

https://doi.org/10.1371/journal.pone.0299563.s003

(DOCX)

Acknowledgments

We sincerely thank the following authors for kindly providing additional data: Dr. Tony Babb, Dr. Dharini Bhammar, Prof. James Faulkner, Dr. Christof Leicht, Prof. Melitta McNarry, Dr. Friederike Rosenberger, Dr. Brandon Sawyer, Dr. Zoe Saynor, Justine Schneider, Dr. Owen Tomlinson, and Prof. Craig Williams.

References

  1. 1. Fletcher GF, Ades PA, Kligfield P, Arena R, Balady GJ, Bittner VA, et al. Exercise standards for testing and training: a scientific statement from the American Heart Association. Circulation. 2013;128(8):873–934. pmid:23877260
  2. 2. Hill AV, Lupton H. Muscular exercise, lactic acid, and the supply and utilization of oxygen. QJM: An International Journal of Medicine. 1923;os-16(62):135–171.
  3. 3. Hill AV, Long CNH, Lupton H. Muscular exercise, lactic acid and the supply and utilisation of oxygen.—Parts VII–VIII. Proc R Soc Lond B Biol Sci. 1924;97(682):155–176.
  4. 4. ATS/ACCP. American Thoracic Society/American College of Chest Physicians statement on cardiopulmonary exercise testing. Am J Respir Crit Care Med. 2003;167(2):211–277. pmid:12524257
  5. 5. McMurray RG, Ainsworth BE, Harrell JS, Griggs TR, Williams OD. Is physical activity or aerobic power more influential on reducing cardiovascular disease risk factors? Med Sci Sports Exerc. 1998;30(10):1521–1529. pmid:9789853
  6. 6. Weber KT, Janicki JS. Cardiopulmonary exercise testing for evaluation of chronic cardiac failure. Am J Cardiol. 1985;55(2):A22–A31. pmid:3966407
  7. 7. Riebe D, Ehrman JK, Liguori G, Magal M. ACSM’s guidelines for exercise testing and prescription. 10th ed. Philadelphia: Wolters Kluwer; 2018.
  8. 8. da Cunha FA, Farinatti PTV, Midgley AW. Methodological and practical application issues in exercise prescription using the heart rate reserve and oxygen uptake reserve methods. J Sci Med Sport. 2011;14(1):46–57. pmid:20833587
  9. 9. Weatherwax R, Harris N, Kilding AE, Dalleck L. Time course changes in confirmed ‘true’ VO2max after individualized and standardized training. Sports Med Int Open. 2019;3(02):E32–E39. pmid:31192297
  10. 10. Cooper CB, Storer TW. Exercise testing and interpretation: a practical approach. Cambridge, UK: Cambridge University Press; 2001.
  11. 11. Ross R, Blair SN, Arena R, Church TS, Després JP, Franklin BA, et al. Importance of assessing cardiorespiratory fitness in clinical practice: a case for fitness as a clinical vital sign: a scientific statement from the American Heart Association. Circulation. 2016;134(24):e653–e699. pmid:27881567
  12. 12. Midgley AW, McNaughton LR, Polman R, Marchant D. Criteria for determination of maximal oxygen uptake: a brief critique and recommendations for future research. Sports Med. 2007;37(12):1019–1028. pmid:18027991
  13. 13. Taylor HL, Buskirk E, Henschel A. Maximal oxygen intake as an objective measure of cardio-respiratory performance. J Appl Physiol. 1955;8(1):73–80. pmid:13242493
  14. 14. Davis JA, Whipp BJ, Lamarra N, Huntsman DJ, Frank MH, Wasserman K. Effect of ramp slope on determination of aerobic parameters from the ramp exercise test. Med Sci Sports Exerc. 1982;14(5):339–343. pmid:7154888
  15. 15. Buchfuhrer MJ, Hansen JE, Robinson TE, Sue DY, Wasserman K, Whipp BJ. Optimizing the exercise protocol for cardiopulmonary assessment. J Appl Physiol Respir Environ Exerc Physiol. 1983;55(5):1558–1564. pmid:6643191
  16. 16. Myers J, Buchanan N, Walsh D, Kraemer M, McAuley P, Hamilton-Wessler M, et al. Comparison of the ramp versus standard exercise protocols. J Am Coll Cardiol. 1991;17(6):1334–1342. pmid:2016451
  17. 17. Keir DA, Paterson DH, Kowalchuk JM, Murias JM. Using ramp-incremental VO2 responses for constant-intensity exercise selection. Appl Physiol Nutr Metab. 2018;43(9):882–892. pmid:29570982
  18. 18. Day JR, Rossiter HB, Coats EM, Skasick A, Whipp BJ. The maximally attainable VO2 during exercise in humans: the peak vs. maximum issue. J Appl Physiol. 2003;95(5):1901–1907. pmid:12857763
  19. 19. Myers J, Walsh D, Sullivan M, Froelicher V. Effect of sampling on variability and plateau in oxygen uptake. J Appl Physiol 1990;68(1):404–410. pmid:2312484
  20. 20. Poole DC, Wilkerson DP, Jones AM. Validity of criteria for establishing maximal O2 uptake during ramp exercise tests. Eur J Appl Physiol. 2008;102(4):403–410. Epub 2007/10/31. pmid:17968581.
  21. 21. Rossiter HB, Kowalchuk JM, Whipp BJ. A test to establish maximum O2 uptake despite no plateau in the O2 uptake response to ramp incremental exercise. J Appl Physiol 2006;100(3):764–770. pmid:16282428
  22. 22. Midgley AW, Carroll S, Marchant D, McNaughton LR, Siegler J. Evaluation of true maximal oxygen uptake based on a novel set of standardized criteria. Appl Physiol Nutr Metab. 2009;34(2):115–123. pmid:19370041
  23. 23. Astorino TA. Alterations in VO2max and the VO2 plateau with manipulation of sampling interval. Clin Physiol Funct Imaging. 2009;29(1):60–67. pmid:19125732
  24. 24. Gordon D, Mehter M, Gernigon M, Caddy O, Keiller D, Barnes R. The effects of exercise modality on the incidence of plateau at VO2max. Clin Physiol Funct Imaging. 2012;32(5):394–399. pmid:22856347
  25. 25. Edvardsen E, Hem E, Anderssen SA. End criteria for reaching maximal oxygen uptake must be strict and adjusted to sex and age: a cross-sectional study. PLoS One. 2014;9(1):e85276. pmid:24454832
  26. 26. Howley ET, Bassett DR, Welch HG. Criteria for maximal oxygen uptake: review and commentary. Med Sci Sports Exerc. 1995;27:1292–1301. pmid:8531628
  27. 27. Poole DC, Jones AM. Measurement of the maximum oxygen uptake VO2max: VO2peak is no longer acceptable. J Appl Physiol 2017;122(4):997–1002. pmid:28153947
  28. 28. Midgley AW, Carroll S. Emergence of the verification phase procedure for confirming ‘true’ VO2max. Scand J Med Sci Sports. 2009;19(3):313–322. pmid:19422662
  29. 29. Costa VAB, Midgley AW, Carroll S, Astorino TA, de Paula T, Farinatti P, et al. Is a verification phase useful for confirming maximal oxygen uptake in apparently healthy adults? A systematic review and meta-analysis. PLoS One. 2021;16(2):e0247057. pmid:33596256
  30. 30. Barker AR, Williams CA, Jones AM, Armstrong N. Establishing maximal oxygen uptake in young people during a ramp cycle test to exhaustion. Br J Sports Med. 2011;45(6):498–503. pmid:19679577
  31. 31. Saynor ZL, Barker AR, Oades PJ, Williams CA. A protocol to determine valid VO2max in young cystic fibrosis patients. J Sci Med Sport. 2013;16(6):539–544. pmid:23510652
  32. 32. Schaun GZ, Alberton CL, Gomes MLB, Santos LP, Bamman MM, Mendes GF, et al. Maximal oxygen uptake is underestimated during Incremental testing in hypertensive older adults: Findings from the HAEL study. Med Sci Sports Exerc. 2021;53(7):1452–1459. pmid:33449605
  33. 33. Moher D, Shamseer L, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst Rev. 2015;4(1):1–9. pmid:25554246
  34. 34. Nelson ME, Rejeski WJ, Blair SN, Duncan PW, Judge JO, King AC, et al. Physical activity and public health in older adults: recommendation from the American College of Sports Medicine and the American Heart Association. Circulation. 2007;116(9):1094. pmid:17671236
  35. 35. Downs SH, Black N. The feasibility of creating a checklist for the assessment of the methodological quality both of randomised and non-randomised studies of health care interventions. J Epidemiol Community Health. 1998;52(6):377–384. pmid:9764259
  36. 36. Hébert-Losier K, Supej M, Holmberg HC. Biomechanical factors influencing the performance of elite alpine ski racers. Sports Med. 2014;44(4):519–533. pmid:24374655
  37. 37. Baumgart JK, Brurok B, Sandbakk Ø. Comparison of peak oxygen uptake between upper-body exercise modes: A systematic literature review and meta-analysis. Front Physiol. 2020;11:412. pmid:32508667
  38. 38. Van Tulder M, Furlan A, Bombardier C, Bouter L, Group EBotCCBR. Updated method guidelines for systematic reviews in the cochrane collaboration back review group. Spine. 2003;28(12):1290–1299. pmid:12811274
  39. 39. Higgins JP, Li T, Deeks JJ. Choosing effect measures and computing estimates of effect. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA, editors. Cochrane handbook for systematic reviews of interventions. Oxford, UK: John Wiley & Sons; 2019. pp. 143–176.
  40. 40. Hardy RJ, Thompson SG. Detecting and describing heterogeneity in meta‐analysis. Stat Med. 1998;17(8):841–856. pmid:9595615
  41. 41. Armstrong N, Welsman J, Winsley R. Is peak VO2 a maximal index of children’s aerobic fitness? Int J Sports Med. 1996;17(05):356–359. pmid:8858407
  42. 42. Astorino TA, Bediamol N, Cotoia S, Ines K, Koeu N, Menard N, et al. Verification testing to confirm VO2max attainment in persons with spinal cord injury. J Spinal Cord Med. 2019;42(4):494–501. pmid:29355464
  43. 43. Astorino TA, De La Rosa AB, Clark A, De Revere JL. Verification testing to confirm VO2max attainment in inactive women with obesity. Int J Exerc Sci. 2020;13(4):1448.
  44. 44. Baumgart JK, Moes M, Skovereng K, Ettema G, Sandbakk Ø. Examination of gas exchange and blood lactate thresholds in Paralympic athletes during upper-body poling. PLoS One. 2018;13(10):e0205588. pmid:30379853
  45. 45. Bhammar DM, Stickford JL, Bernhardt V, Babb TG. Verification of maximal oxygen uptake in obese and nonobese children. Med Sci Sports Exerc. 2017;49(4):702–710. pmid:27875494
  46. 46. Bhammar DM, Adams-Huet B, Babb TG. Quantification of cardiorespiratory fitness in children with obesity. Med Sci Sports Exerc. 2019;51(11):2243–2250. pmid:31634291
  47. 47. Bhammar DM, Chien LC. Quantification and verification of cardiorespiratory fitness in adults with prehypertension. Sports. 2021;9(1):9. pmid:33440896
  48. 48. Blazquez A, Guillamo E, Alegre J, Ventura JL, Barbany JR, Javierre C. Cardiorespiratory responses to exercise in female chronic fatigue syndrome patients affect the psychological health of their partners. JEPonline. 2011;14(2):1–16.
  49. 49. Bowen TS, Cannon DT, Begg G, Baliga V, Witte KK, Rossiter HB. A novel cardiopulmonary exercise test protocol and criterion to determine maximal oxygen uptake in chronic heart failure. J Appl Physiol 2012;113(3):451–458. pmid:22653993
  50. 50. Causer AJ, Shute JK, Cummings MH, Shepherd AI, Bright V, Connett G, et al. Cardiopulmonary exercise testing with supramaximal verification produces a safe and valid assessment of VO2max in people with cystic fibrosis: a retrospective analysis. J Appl Physiol 2018;125(4):1277–1283. pmid:30346240
  51. 51. Cockcroft EJ, Bond B, Williams CA, Harris S, Jackman SR, Armstrong N, et al. The effects of two weeks high-intensity interval training on fasting glucose, glucose tolerance and insulin resistance in adolescent boys: a pilot study. BMC Sports Sci Med Rehabil. 2019;11(1):1–9. pmid:31827806
  52. 52. de Groot JF, Takken T, de Graaff S, Gooskens RHJM, Helders PJM, Vanhees L. Treadmill testing of children who have spina bifida and are ambulatory: does peak oxygen uptake reflect maximum oxygen uptake? Phys Ther. 2009;89(7):679–687. pmid:19482903
  53. 53. Goosey-Tolfrey VL, Paulson TAW, Tolfrey K, Eston RG. Prediction of peak oxygen uptake from differentiated ratings of perceived exertion during wheelchair propulsion in trained wheelchair sportspersons. Eur J Appl Physiol. 2014;114(6):1251–1258. pmid:24610244
  54. 54. Lambrick D, Bertelsen H, Eston R, Stoner L, Faulkner J. Prediction of peak oxygen uptake in children using submaximal ratings of perceived exertion during treadmill exercise. Eur J Appl Physiol. 2016;116(6):1189–1195. pmid:27106870
  55. 55. Lambrick D, Jakeman J, Grigg R, Kaufmann S, Faulkner J. The efficacy of a discontinuous graded exercise test in measuring peak oxygen uptake in children aged 8 to 10 years. Biol Sport. 2017;34(1):57. pmid:28416899
  56. 56. Leicht CA, Tolfrey K, Lenton JP, Bishop NC, Goosey-Tolfrey VL. The verification phase and reliability of physiological parameters in peak testing of elite wheelchair athletes. Eur J Appl Physiol. 2013;113(2):337–345. pmid:22718268
  57. 57. Mahoney JM, Baughman BR, Sheard AC, Sawyer BJ. Determining the optimal workrate for cycle ergometer verification phase testing in males with obesity. Sports. 2021;9(2):30. pmid:33672531
  58. 58. Manresa-Rocamora A, Fuertes-Kenneally L, Blasco-Peris C, Sempere-Ruiz N, Sarabia JM, Climent-Paya V. Is the verification phase a suitable criterion for the determination of maximum oxygen uptake in patients with heart failure and reduced ejection fraction? A validation study. Int J Environ Res Public Health. 2023;20(4):2764. pmid:36833461
  59. 59. McCreery JL, Mackintosh KA, Mills-Bennett R, McNarry MA. The effect of a high-intensity PrO2Fit inspiratory muscle training intervention on physiological and psychological health in adults with bronchiectasis: A mixed-methods study. Int J Environ Res Public Health. 2021;18(6):3051. pmid:33809595
  60. 60. Michalski AC, Ferreira AS, Midgley AW, Costa VAB, Fonseca GF, da Silva NSL, et al. Mixed circuit training acutely reduces arterial stiffness in patients with chronic stroke: a crossover randomized controlled trial. Eur J Appl Physiol. 2023;123(1):121–134. pmid:36205814
  61. 61. Moreno-Cabañas A, Ortega JF, Morales‐Palomo F, Ramirez‐Jimenez M, Mora‐Rodriguez R. Importance of a verification test to accurately assess VO2max in unfit individuals with obesity. Scand J Med Sci Sports. 2020;30(3):583–590. pmid:31746500
  62. 62. Moreno-Cabañas A, Ortega JF, Morales-Palomo F, Ramirez-Jimenez M, Alvarez-Jimenez L, Pallares JG, et al. The use of a graded exercise test may be insufficient to quantify true changes in following exercise training in unfit individuals with metabolic syndrome. J Appl Physiol 2020;129(4):760–767. pmid:32881617
  63. 63. Murias JM, Kowalchuk JM, Paterson DH. Mechanisms for increases in VO2max with endurance training in older and young women. Med Sci Sports Exerc. 2010;42(10):1891–1898. pmid:20308926
  64. 64. Murias JM, Kowalchuk JM, Paterson DH. Time course and mechanisms of adaptations in cardiorespiratory fitness with endurance training in older and young men. J Appl Physiol 2010;108(3):621–627. pmid:20056848
  65. 65. Murias JM, Pogliaghi S, Paterson DH. Measurement of a true VO2max during a ramp incremental test is not confirmed by a verification phase. Front Physiol. 2018;9:143. pmid:29535642
  66. 66. Oliveira RS, Barker AR, Debras F, Kranen SH, Williams CA. Effects of exercise intensity on vascular and autonomic components of the baroreflex following glucose ingestion in adolescents. Eur J Appl Physiol. 2019;119(4):867–878. pmid:30730001
  67. 67. Pal A, Schneider J, Schlüter K, Steindorf K, Wiskemann J, Rosenberger F, et al. Different endurance exercises modulate NK cell cytotoxic and inhibiting receptors. Eur J Appl Physiol. 2021;121(12):3379–3387. pmid:34477931
  68. 68. Paulson TAW, Goosey-Tolfrey VL, Lenton JP, Leicht CA, Bishop NC. Spinal cord injury level and the circulating cytokine response to strenuous exercise. Med Sci Sports Exerc. 2013;45(9):1649–1655. pmid:23475168
  69. 69. Robben KE, Poole DC, Harms CA. Maximal oxygen uptake validation in children with expiratory flow limitation. Pediatr Exerc Sci. 2013;25(1):84–100. pmid:23406709
  70. 70. Rowland TW. Does peak VO2 reflect VO2max in children?: Evidence from supramaximal testing. Med Sci Sports Exerc. 1993;25(6):689–693.
  71. 71. Sansum KM, Weston ME, Bond B, Cockcroft EJ, O’Connor A, Tomlinson OW, et al. Validity of the supramaximal test to verify maximal oxygen uptake in children and adolescents. Pediatr Exerc Sci. 2019;31(2):213–222. pmid:30885084
  72. 72. Sawyer BJ, Tucker WJ, Bhammar DM, Gaesser GA. Using a verification test for determination of VO2max in sedentary adults with obesity. J Strength Cond Res. 2015;29(12):3432–3438. pmid:26382135
  73. 73. Saynor ZL, Barker AR, Oades PJ, Williams CA. Reproducibility of maximal cardiopulmonary exercise testing for young cystic fibrosis patients. J Cyst Fibros. 2013;12(6):644–650. pmid:23726255
  74. 74. Schaun GZ, Alberton CL, Brizio Gomes ML, Mendes GF, Häfele MS, Andrade LS, et al. Exercise intervention does not reduce the likelihood of VO2max underestimation in older adults with hypertension. J Sports Sci. 2022;40(12):1399–1405. pmid:35609113
  75. 75. Schneider J, Schlüter K, Wiskemann J, Rosenberger F. Do we underestimate maximal oxygen uptake in cancer survivors? Findings from a supramaximal verification test. Appl Physiol Nutr Metab. 2020;45(5):486–492. pmid:31604021
  76. 76. Tomlinson OW, Barker AR, Chubbock LV, Stevens D, Saynor ZL, Oades PJ, et al. Analysis of oxygen uptake efficiency parameters in young people with cystic fibrosis. Eur J Appl Physiol. 2018;118(10):2055–2063. pmid:30003381
  77. 77. Villanueva IR, Campbell JC, Medina SM, Jorgensen TM, Wilson SL, Angadi SS, et al. Comparison of constant load exercise intensity for verification of maximal oxygen uptake following a graded exercise test in older adults. Physiol Rep. 2021;9(18):e15037. pmid:34558207
  78. 78. Werkman MS, Hulzebos HJ, van de Weert-van PB, Arets HGM, Helders PJM, Takken T. Supramaximal verification of peak oxygen uptake in adolescents with cystic fibrosis. Pediatr Phys Ther. 2011;23(1):15–21. pmid:21304339
  79. 79. Woloschuk A, Hodges GJ, Massarotto RJ, Klentrou P, Falk B. The skin blood flow response to exercise in boys and men and the role of nitric oxide. Eur J Appl Physiol. 2020;120(4):753–762. pmid:31836915
  80. 80. Wood RE, Hills AP, Hunter GR, King NA, Byrne NM. VO2max in overweight and obese adults: do they meet the threshold criteria? Med Sci Sports Exerc. 2010;42(3):470–477. pmid:19952821
  81. 81. Combret Y, Medrinal C, Prieur G, Robledo Quesada A, Gillot T, Gravier FE, et al. Oxygen uptake kinetics during treadmill walking in adolescents with clinically stable cystic fibrosis. Physiother Theory Pract. 2022;38(10):1389–1397. Epub 2021/01/05. pmid:33390080.
  82. 82. O’Connor E, Kiely C, O’Shea D, Green S, Egaña M. Similar level of impairment in exercise performance and oxygen uptake kinetics in middle-aged men and women with type 2 diabetes. Am J Physiol Regul Integr Comp Physiol. 2012;303(1):R70–76. Epub 2012/04/28. pmid:22538515.
  83. 83. Wolpat A, Lima FV, Silva FM, Tochetto M, de Freitas A, Grandi T, et al. Association between inspiratory muscle weakness and slowed oxygen uptake kinetics in patients with chronic obstructive pulmonary disease. Appl Physiol Nutr Metab. 2017;42(12):1239–1246. Epub 2017/07/28. pmid:28750180.
  84. 84. Boutou AK, Zafeiridis A, Pitsiou G, Dipla K, Kioumis I, Stanopoulos I. Cardiopulmonary exercise testing in chronic obstructive pulmonary disease: An update on its clinical value and applications. Clin Physiol Funct Imaging. 2020;40(4):197–206. Epub 20200402. pmid:32176429.
  85. 85. Loughney L, West M, Pintus S, Lythgoe D, Clark E, Jack S, et al. Comparison of oxygen uptake during arm or leg cardiopulmonary exercise testing in vascular surgery patients and control subjects. B J Anaesth. 2014;112(1):57–65. pmid:24193322
  86. 86. Thoden JS, Wilson BA, MacDougall JD. Testing aerobic power. In: MacDougall JD, Wenger HA, Green HJ, editors. Physiological testing of the elite athlete. Ithaca, NY: Mouvement Publications, Inc; 1982. pp. 39–60.
  87. 87. Thoden JS, Wislon BA, MacDougall JD. Testing aerobic power. In: MacDougall JD, Wenger HA, Green HJ, editors. Physiological testing of the high-performance athlete. 2nd ed. Champaign, IL: Human Kinetics; 1991. pp. 107–173.