Figures
Abstract
Background: The Neck Disability Index (NDI) is widely used to assess neck pain-related disability. It is commonly combined with objective measures, particularly range of motion and peak velocity, but these variables usually show weak correlations with patient-reported outcomes. Functional data analysis (FDA) makes it possible to analyze complete kinematic waveforms and provides tools to identify relevant motion features most related to disability.
Objective: This study aimed to examine whether scalar-on-function regression of cervical angular velocity curves can predict NDI scores in patients with nonspecific neck pain and which motion variables and sub-phases contribute most to the prediction.
Methods: We analyzed 56 recordings from 28 patients, each paired with an NDI questionnaire, which were collected over two sessions. Cervical flexion–extension, lateral flexion, and axial rotation velocities were processed using functional principal component analysis to reduce dimensionality while retaining the main modes of variation.
Univariate and bivariate scalar-on-function regression models were estimated. Model selection was based on a procedure that combined the Akaike Information Criterion, Bayesian Information Criterion, and measures of goodness-of-fit, ensuring a balance between model simplicity and predictive accuracy. Comparisons were made using non-functional regressions, including demographic and anthropometric variables.
Results: The bivariate flexion–extension plus lateral flexion model showed the best performance (r = 0.677), which was clearly higher than that of the non-functional regressions (r 0.391). The coefficient functions indicated that the acceleration and deceleration phases were the most informative in explaining variability in NDI scores. The inclusion of anthropometric variables did not improve model performance.
Conclusions: Functional regression of velocity curves improves the prediction of NDI scores and highlights specific phases of movement that are clinically relevant. Larger studies are required to confirm these findings and assess their clinical applicability.
Citation: Aragón-Basanta E, Ayala G, Serra-Añó P, Page Á (2026) Application of functional regression in biomechanical assessment of neck disability: Selection of clinically relevant variables. PLoS One 21(2): e0340428. https://doi.org/10.1371/journal.pone.0340428
Editor: Giulia Pascoletti, University of Perugia: Universita degli Studi di Perugia, ITALY
Received: June 25, 2025; Accepted: December 19, 2025; Published: February 19, 2026
Copyright: © 2026 Aragón-Basanta et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting information files.
Funding: This publication is part of the I+D+i project PGC type B with reference PID2020-117114GBI00 funded by the Spanish Ministry of Science, Innovation and Universities, MCIN/AEI/10.13039/501100011033/ (G. A.) and project CIPROM/2023/066 (G. A.) Also, it is part of the project PID2021-125694OB-I00 funded by MICIU/AEI//10.13039/501100011033 and FEDER/UE (P. S.A. and A.P.) and CIAICO/2021/215 (P.S.A.). There was no additional external funding received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Functional data analysis (FDA) represents a substantial advancement in the clinical application of biomechanics, facilitating the correlation of kinematic and dynamic variables with health conditions and functional changes. Unlike traditional numerical variables, which rely on extreme values or phase durations, functional variables use entire curves to characterize the movement. FDA offers significant advantages over classical statistical methods by preserving all information associated with continuous movement, rather than reducing it to a few discrete values. Furthermore, functional variables do not require the identification of specific points or characteristics of the curve, such as maxima, minima, or specific times, which can be challenging to discern when dealing with abnormal movement patterns associated with injury or disability [1–3].
There is substantial scientific literature on applying FDA in biomechanics [3–5], often focusing on comparing healthy and diseased subjects [6,7]. These studies provide valuable insights into how movement patterns change in individuals with disabilities. However, the differences between healthy and affected individuals do not always directly correlate with continuous changes in injury severity or pain, both of which are crucial for analyzing disease progression and the effectiveness of rehabilitation treatments [8,9]. Alternatively, in patient cohorts, regression models offer a more appropriate strategy by uncovering relationships between biomechanical variables and disability measured via clinical or pain scales [10]. Most current studies rely on correlations between numerical variables and clinical scale scores, which typically demonstrate weaker associations than those derived from healthy–ill comparisons [11–13]. By contrast, functional regression models demonstrated stronger predictive performance. In particular, scalar-on-function regression models allow the identification of relevant functional variables that can predict a patient’s functional state [14]. Although this type of model has received considerable theoretical attention [3,15], its translation into routine clinical assessment remains limited [14,16].
A limitation of scalar-on-function regression models is their inherent high dimensionality, which can lead to overfitting [3,15]. Even with dimensionality reduction via basis expansion, analyses must include numerous kinematic or dynamic variables (e.g., position, velocity, force) across multiple joints or movements, resulting in dozens of potential predictors. Selecting the most relevant variables thus demands methods capable of simultaneously evaluating multiple predictors. It is therefore essential to select an appropriate basis that preserves relevant information while minimizing dimensionality. Traditional criteria based on multiple correlations are insufficient and vulnerable to overfitting. Instead, penalized model selection criteria such as the Akaike Information Criterion (AIC) or Bayesian Information Criterion (BIC)—which balance goodness of fit with model complexity—are well suited to choose both the basis and relevant predictors by achieving a compromise between the simplicity of the model and the goodness of fit [17,18]. Unfortunately, despite their suitability, these criteria are still rarely used in clinical applications of functional regression [14,19].
In this study, we propose and validate a novel approach using functional regression models to identify relevant kinematic variables in the functional assessment of nonspecific neck pain. Our work addresses a significant gap in the literature by directly correlating complete movement curves with clinical disability indices. Neck injuries are second only to lumbar injuries in prevalence, making kinematic analyses of the cervical spine valuable indicators of neck joint function [20–22]. However, most published studies do not use a functional approach, which has the potential to provide deeper insight into the relationships between kinematic variables and levels of disability or pain. Typically, these studies relate scalar summaries such as ranges of motion or angular velocities in the three anatomical planes to pain scales [23] or to disability scales [24], among others [8,12,13,20–22].
This study uses a scalar-on-function regression model with the Neck Disability Index (NDI) as the response variable. Predictor variables include angular position and angular velocity curves from flexion-extension (FE), lateral flexion (LF), and axial rotation (AR), as well as control covariates like sex, age, neck length, head mass, and neck mass. Six functional variables and five numerical variables are used, which require combining quality criteria, in addition to the classic ones based on the correlation coefficient and p-values to select the most predictive and robust models. The quality of the models and the risk of overfitting were analyzed to provide criteria for selecting the base dimension. Finally, we compared multiple functional regression models to identify which kinematic variables and movements are most strongly associated with the NDI.
Materials and methods
Materials
The present analysis used kinematic data from the participants of a previously published randomized controlled trial on cervical spine manipulation in patients with nonspecific neck pain [25] (ID: NCT04059692). In the original trial, conventional numerical variables such as range of motion (RoM) and peak velocity were analyzed. However, the continuous movement recordings obtained during the same sessions were also preserved, and in this study we processed and analyzed those waveforms using Functional data analysis (FDA). Twenty-eight participants contributed to two sessions (pre- and post-treatment), resulting in 56 observations. For the present analysis, both sessions were pooled to provide two data points per subject. The current study focused exclusively on the association between NDI and kinematic variables, and no pre–post comparisons or treatment effects were considered.
Participants and eligibility criteria.
The inclusion criteria were as follows: age between 18 and 65 years, acute or chronic nonspecific neck pain lasting at least one month, pain intensity 3 on a visual analog scale and NDI
5 in the first session. Exclusion criteria included inflammatory or rheumatic disease; vestibular disorders; neurological conditions; recent trunk or shoulder surgery; or the use of opioids, antidepressants, or sedatives. All participants provided written informed consent and the study protocol was approved by the Ethics Committee of the University of Valencia (H1450106985729).
Kinematic assessment.
In each session, the participants performed cyclic cervical movements of flexion–extension (FE), right and left lateral flexion (LF), and right axial rotation (AR) in randomized order. Movements were recorded using the Kinescan-IBV videophotogrammetry system [26] while the participants sat in a custom-designed chair with trunk stabilization (Fig 1). To minimize torso movements during the trials, participants were instructed to avoid compensatory trunk actions. In addition, the torso was stabilized using two straps attached to the chair, securing the shoulders and ensuring that the recorded motion originated solely from the neck. A rigid headband with reflective markers was used to record head movements.
Participants sat with the trunk secured to the backrest to limit torso motion. Head motion was recorded using eight markers attached to a rigid helmet. The local reference frame was centered at the midpoint between both tragus points. The X-axis (lateral flexion) pointed anteriorly through the line connecting the infraorbital landmarks, the Z-axis (flexion extension) connected the tragus points toward the right side, and the Y-axis (axial rotation) completed the right-handed triad. Additional details on the determination of anatomical axes are described in Venegas et al. [27].
Calibration was performed with additional markers placed on the right and left tragus, the nasal bone, and the infraorbital points to define the anatomical reference system; these calibration markers were removed immediately afterward. The experimental procedure and the determination of the anatomical axes followed the methodology described in Venegas et al. [27].
Each participant completed seven continuous cycles per movement during each session (before and after treatment), with the first and last cycles discarded. Time scales were normalized, and the functional mean of the five remaining position–velocity curves was calculated. Additional numerical variables were derived, such as range of motion (RoM) and velocity ranges for movement description [26].
Anthropometric measurements.
Neck length was measured as the straight-line distance between the marker placed on the spinous process of C7 and the midpoint between the tragus markers, which defined the origin of the anatomical reference system in the neutral head position. Head and neck masses were estimated using regression equations proposed by [28], specifically designed to provide anthropometry-based estimates of the inertial parameters of the head–neck complex.
Disability assessment.
The Spanish version of the Neck Disability Index (NDI) was used to assess disability and served as the dependent variable for each observation [29]. The questionnaire, with a score of 0-50, was administered prior to both measurement sessions.
Methods
Functional linear regression model.
Our data consist of where xi contains functional and scalar predictors and yi the corresponding response. The scalar-on-function regression models the conditional distribution of Yi given the (scalar and functional) predictors xi. For illustration, consider the simplest case with one functional predictor xi. The model is
with random errors ’s are independent and identically distributed with
. We assume that
i.e. the square of the function is integrable.
To facilitate estimation, expand and xi in appropriate bases:
and
in such a way that
where is known. The multiple regression model would be
where the bj’s are the coefficients. The coefficients cik are estimated using the least squares method. The problem is reduced to a multiple regression model.
If several functional predictors with
are considered besides scalar predictors
i.e.
then the functional regression model can be formulated as
It can be chosen different basis functions for the different coefficient functions . Additionally, note that the functional predictors have to be expressed in its own basis.
To compare modeling approaches, we fitted standard regression models using scalar predictors. For each movement, the model included the range of motion (RoM), the range of angular velocity (RoV), and their interaction term (RoM × RoV). These models serve as benchmarks against which we compare our functional regression results. Including both RoM and RoV, along with their interaction, is justified by network analysis findings that demonstrate strong statistical interactions between these variables [8].
Dimensionality reduction via functional principal component analysis.
We represented the functional data —each movement curve— using a B-spline basis of 40 functions. To reduce dimensionality, we subsequently applied Functional Principal Component Analysis (FPCA), which generalizes standard PCA to functional data by identifying the dominant modes of variation across the movement curves. FPCA allows any curve to be approximated as the mean function plus a linear combination of these functional principal components [30–34].
Since some analyses involve more than one functional variable, we also employed bivariate FPCA (bFPCA). This technique performs joint FPCA on a pair of curves, capturing both shared and complementary modes of variation across them. Bivariate and multivariate FPCA are increasingly used in biomechanics when dealing with simultaneous functional signals, for example, in sports biomechanics to analyze multivariate movement signatures [35,36]. More information about FPCA or bFPCA can be found in [1].
To select the optimal number of functional principal components, we evaluate multiple model quality metrics:
- Akaike Information Criterion (AIC) [17], computed as
, where
is the maximum likelihood of the model, and k is the number of parameters.
- Bayesian Information Criterion (BIC) [18], computed as
+
, penalizing complexity more heavily for larger sample sizes (n).
- The multiple linear correlation coefficient (r) and the F-statistic with associated p-value to assess model significance.
In addition to AIC, BIC, the correlation coefficient, the F statistic, and the p-value, the cumulative explained variance was also considered, to ensure that a substantial proportion of the information contained in the original curves was preserved. Specifically, we ensured that the number of components retained accounts for at least 95% of the total variance.
Our strategy for selecting the number of components was as follows:
- Increase the number of components until cumulative explained variance reaches 95%.
- Assess each model’s predictive accuracy using high r and significant F-test.
- Among models meeting these criteria, choose the one that minimizes AIC and BIC, striking a balance between goodness of fit and model parsimony.
Seventeen models—ranging from 4 to 20 components—were fitted per functional predictor to identify optimal dimensionality. These criteria also allowed comparison across movement types (flexion-extension, lateral flexion, axial rotation) and data types (angle vs angular velocity). For cases with both angle and velocity curves, bFPCA was used, and component selection followed the same scheme to determine the most predictive movement patterns.
Model comparison and variable selection.
Once the FPCA-based functional models were selected (with optimal number of components per criteria such as cumulative explained variance, AIC, BIC, r, F-statistic and p-value), we proceeded in two steps.
- Comparison of functional models. We compared functional regression models using either angle curves or angular velocity curves for each movement, including flexion–extension, lateral flexion, and axial rotation. For angle–velocity pairs, we also applied bivariate FPCA. Each model’s performance was evaluated using the standard criteria (AIC, BIC, r, F-statistic, p-value) to determine which combination of movement and kinematic variable yields the highest predictive power.
- Demographic and anthropometric covariates. Starting from the best functional model for each movement domain, we tested whether adding scalar covariates (sex, age, neck length, head mass, and neck mass) improved the model performance. For each scalar predictor, a separate extended model was fitted and compared again using AIC, BIC, the correlation coefficient, the F-statistic, and the p-value for the null hypothesis that the added predictor had no effect. A low p-value (e.g. <0.05) indicates significant improvement over the base functional model.
As a baseline for comparison, we also fitted simple scalar-only models for each movement using the range of motion (RoM), range of angular velocity (RoV), and their interaction (RoM × RoV) as the predictors. This enables a direct assessment of whether our functional regression models offer superior predictive performance compared to traditional models built on these scalar movement summaries (RoM and RoV) [14].
All analyses were conducted in R using the fda and fda.usc packages.
Results
Participants and kinematic data
The sample comprised 28 subjects (18 women and 10 men) with non-specific neck pain diagnosed by experienced physiotherapists, measured for flexion-extension, lateral flexion, and axial rotation movements, each assessed twice, before and after neck treatment. Therefore, there were 56 samples. However, only 55 were considered because one has been discarded owing to errors during the measurement process.
Some basic numerical characteristics of the sample are displayed in Table 1, where the mean and standard deviation of some variables of the subjects by sex are shown. The NDI scores ranged from 1 to 19.
In addition, Fig 2 shows the curves for the 55 repetitions (gray) for angle (top) and angular velocity (bottom) in the flexion-extension, lateral flexion, and axial rotation movements, together with the functional mean (black) plus and minus one standard deviation (dotted black).
Angular position of (a) flexion-extension; (b) lateral flexion; (c) axial rotation. Angular velocity of: (d) flexion-extension; (e) lateral flexion; (f) axial rotation. The grey lines are the curves for each subject, with the functional mean (black) plus and minus one functional standard deviation (dotted black).
Table 2 shows the values of range of motion (RoM) and angular velocity (RoV) for each of the three movements analyzed: flexion-extension, lateral flexion and axial rotation.
Non-functional regression models
Three classical (non-functional) linear regression models were implemented for each movement, whose predictors were range of motion (RoM), range of angular velocity (RoV) and their interactions. Table 3 shows values of the multiple linear correlation r, F statistic and p-value. The highest value of the multiple correlation coefficient was for the lateral flexion movement (0.391), followed by flexion-extension (0.356), and axial rotation, with the lowest value (0.296). On the other hand, only the models for the lateral flexion and axial rotation movements had a significant F statistic (0.036 in both cases), while for flexion-extension, the p-value was 0.073.
In all models, the predictors used were range of motion (RoM), range of velocity (RoV), and their interactions.
Functional regression models
Optimal number of functional principal components.
To illustrate the process of selecting the optimal number of basis functions, Fig 3 displays the evolution of key model quality metrics for two representative functional regression models. The first model includes a single functional predictor—lateral flexion velocity—while the second uses two functional predictors—velocities of flexion-extension and lateral flexion—. In both cases, the figure plots AIC (black) and BIC (blue) against the multiple linear correlation coefficient r, shown on the x-axis. Each point on the plot corresponds to a model fitted with a specific number of principal components, ranging from 4 (first point) to 20 (last point), so that the plot captures how AIC, BIC, and r vary as component count increases.
The plot shows the multiple linear correlation coefficient (x-axis) and the AIC and BIC (y-axis) according to different numbers of principal components, varying from four (first point) to 20 (last point). Black and blue represent the AIC and BIC values, respectively. The red point corresponds to the optimal number of principal components chosen (11 for both models). (a) The plot on the left corresponds to the functional regression model, whose predictor is the lateral flexion velocity. (b) The one on the right is for the principal components model concatenating the flexion-extension and lateral flexion velocity curves.
In the left panel, applying the elbow method reveals that both AIC and BIC reach their minimum at 11 principal components, which is highlighted in red as the optimal choice.
However, not all models displayed such a clear minimum. The right panel of Fig 3 provides an illustrative example: the model using both functional predictors shows that while AIC also attains a minimum at 15 components and r continues to improve, the BIC rises sharply. In such a case, choosing 11 components remains preferable due to the more favorable trade-off between fit and parsimony.
Model comparison.
Table 4 shows the values of explained variability, AIC, BIC, multiple correlation coefficient, F statistic, and p-value for each model with the optimum number of principal components chosen for each movement. In all cases, the explained variability was approximately 99%. Among the three movements studied, the optimal number of principal components for angle-based models and for flexion–extension (FE) velocity was nine. In contrast, for both lateral flexion (LF) and axial rotation (AR) velocity, the optimal number was eleven principal components.
Optimal number of functional principal components, proportion of variance explained, Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), multiple correlation coefficient (r), and F statistic with associated p-value for each model.
The predictive performance of the models based on angular displacement was limited. Specifically, the multiple linear correlation coefficients (r) were relatively low: 0.535 for flexion–extension (FE), 0.555 for lateral flexion (LF), and 0.275 for axial rotation (AR). However, these values were higher than those observed for the traditional scalar models based on the range of motion (RoM) and range of velocity (RoV), which yielded r values of 0.356 for FE, 0.391 for LF, and 0.296 for AR (Table 3). Additionally, these angular models produced the highest AIC and BIC values among all functional models. Only the LF model reached statistical significance (p = 0.038).
In contrast, models using angular velocity as a functional predictor demonstrated markedly improved performance. The FE and LF velocity models achieved substantially higher correlation coefficients (r = 0.603 and r = 0.652, respectively) and both were statistically significant (p = 0.010 and p = 0.006). These models also recorded the lowest AIC and BIC values among all single-variable functional models. The AR velocity model, on the other hand, performed worse; it yielded r = 0.463, with a not significant p-value (p = 0.410), and displayed higher AIC and BIC values.
Building on these findings, a combined model was formulated by concatenating the velocity curves of FE and LF and applying bivariate functional principal component analysis (bFPCA). As reported in the last row of Table 4, this bFPCA model—with 11 optimal components—delivered the most favorable metrics of all models evaluated: AIC = 319, BIC = 345, r = 0.677, and a highly significant p-value of 0.002. This model was selected as the basis for the subsequent analysis involving demographic and anthropometric covariates.
Effect of scalar covariates.
To evaluate the impact of demographic and anthropometric covariates, we began with the best-performing functional model as the baseline: the bFPCA model with 11 components combining flexion–extension and lateral flexion velocities. We then added each scalar predictor—sex, age, neck length, head mass, and neck mass—individually to assess whether any would enhance the model’s predictive ability.
Table 5 reports the AIC, BIC, multiple linear correlation coefficient r, F-statistic, and p-value for testing the null hypothesis that the coefficient of the scalar predictor is zero. The first row corresponds to the base functional model, while subsequent rows present the results for each model, extended by a single covariate. In all extended models, both AIC and BIC increased relative to the baseline, indicating that model parsimony did not improve. The multiple correlation coefficient r increased slightly in all cases; the most notable increase occurred with the inclusion of sex, raising r from 0.677 to 0.693 (increase of 0.016). However, none of the added covariates achieved statistical significance, as evidenced by non-significant p-values across all models. Therefore, there is no evidence to reject the null hypothesis, and the addition of any single scalar variable does not meaningfully improve the base functional model.
Different scalar predictors were added to the base model (only functional) to analyze whether their introduction improved the initial functional model.
Interpreting the coefficient function β(t).
Since none of the scalar predictors significantly improved the performance of the initial functional model, our interpretative focus was set on the 11-component bFPCA model that combined flexion–extension and lateral flexion velocities as the baseline. To elucidate the interpretative value of the coefficient function we compared the flexion–extension (FE) and lateral flexion (LF) curves of two participants: one with a low Neck Disability Index (NDI = 4) and the other with a higher NDI (NDI = 15), both exhibiting minimal residual error to ensure a clean comparison.
This comparison is visualized in Fig 4. The top panels correspond to flexion-extension (FE) velocities, and the bottom panels correspond to lateral flexion (LF). On the left, panels (a) and (c) display data for the low-NDI subject; on the right, panels (b) and (d) show the high-NDI subject. Each panel includes the observed velocity curve (dashed black), estimated coefficient function (blue), and their pointwise product
(solid black). The coefficient function was scaled by a factor of 25 to enable a clearer joint visualization on the same plot. The vertical scales have been adjusted to highlight the relative differences between patients.
This plot shows the coefficient and product functions of the chosen bivariate flexion-extension (top) and lateral flexion (bottom) principal component models for 11 principal components. The flexion-extension velocity curves are plotted for a participant with a low NDI of 4 (a) versus one with a high NDI of 15 (b); analogously for lateral flexion (c) and (d). In each plot, the velocity of the subject (dashed black), coefficient function (blue), and product of the velocity by the coefficient function (black) are shown. The negative and positive areas under the product function are shaded in orange and green, respectively. It is important to note that although the coefficient function is continuous from the start of the flexion extension velocity to the end of the lateral flexion velocity, as it is for the bPCA model, it has been separated for each velocity for a better interpretation of the results. The coefficient function is multiplied by 25 for better visualization in the same plot. The vertical scales have been adjusted to highlight the relative differences between patients.
Although the coefficient function is displayed separately for FE and LF for interpretative clarity, both segments belong to a single regression model in which flexion-extension and lateral flexion velocities enter simultaneously. Thus, the model yields a unified coefficient function , which is partitioned into FE- and LF-related intervals solely for visualization purposes.
According to Eq 1, the predicted NDI is obtained by adding to the model intercept the integral of the product function
. This integral represents the area under the curve. In this way, larger positive areas (shaded in green in Fig 4) and smaller negative areas (shaded in orange) increase the predicted NDI value.
This effect is clearly observed in the upper part of Fig 4 during the flexion-extension movement, where the higher-NDI subject shows relatively smaller negative areas, leading to a net increase in predicted NDI compared with the lower-NDI subject. These differences were most evident during the braking phase of extension, immediately after reaching the peak extension velocity, and during the acceleration phase at the onset of flexion, before maximum flexion velocity is attained. In contrast, for LF, the differences between subjects were much less pronounced. The positive and negative areas of the product function are more balanced, and their relative contribution to the estimated NDI is therefore smaller than in the case of FE.
Discussion
The evaluation of disability in patients with neck pain mainly relies on patient-reported outcomes, particularly the Neck Disability Index (NDI). Complementary information can be obtained from biomechanical measures, such as position, velocity, acceleration, or force curves, which describe the dynamics of cervical motion [10,14,20].
Most published studies compare patients with healthy controls, which is informative but does not capture the progression of pathology or the effects of rehabilitation [6,7]. Greater clinical value arises from cohort-based studies, in which biomechanical variables are directly related to indicators of functional status such as disability scales [11–13]. However, such studies remain relatively scarce, particularly those employing functional data analysis (FDA), despite its ability to exploit complete waveforms and its demonstrated potential to reveal associations with disability [14,19]. Among the FDA approaches, scalar-on-function regression models are particularly valuable, as they not only quantify the relationship between disability and complete motion curves but also identify the specific phases of movement that are most strongly associated with disability [14,16].
Functional regression models are affected by the high dimensionality of functional variables, which can lead to overfitting when an excessive number of basis functions are used for the coefficient functions. To mitigate this risk, it is necessary to implement a selection strategy that determines both the most relevant predictors and the appropriate number of basis functions, ensuring a balance between predictive accuracy and model robustness. In this study, principal component analysis was used to reduce dimensionality, and the number of components was chosen according to complementary criteria: the Akaike and Bayesian information criteria (AIC and BIC), the multiple correlation coefficient, the explained variance, and the significance of the F statistic. Increasing the number of components improves the correlation coefficient, but it also worsens the information criteria (Fig 3). Because BIC penalizes complexity more strongly than AIC, it provided a stricter reference when the two diverge. Applying these criteria to our patient cohort—characterized mainly by low to moderate levels of disability—consistently indicated an optimal solution with 9 to 11 components (Table 4).
Based on the optimal number of components selected for each model, the functional regressions consistently outperformed the nonfunctional ones. Correlation values reached up to r = 0.677 for the functional models (Table 4), whereas the non-functional regressions did not exceed r = 0.4 even when including the range of motion, range of velocity, and their interaction (Table 3). Previous work reported this advantage for flexion–extension alone [14]; the present study extends this finding by demonstrating that the superiority of functional models also applies to lateral flexion and axial rotation. The higher predictive values obtained with functional regression emphasize the methodological advantage of analyzing complete waveforms rather than summary measures.
Angular velocity showed a stronger relationship with disability than angular displacement in all three movements. Among the angle functions, only lateral flexion was significantly associated with NDI. These findings are consistent with previous studies using both numerical [8,12,13,20–22] and functional variables [14], which also reported greater predictive capacity for velocity measures.
Flexion-extension and lateral flexion were more strongly related to the NDI than axial rotation, whose position and velocity variables showed weak and non-significant associations. Between the two, lateral flexion yielded slightly higher correlations. As summarized in Table 4, the highest predictive values corresponded to models based on velocity curves of flexion-extension and lateral flexion, indicating that these movements provide the most informative predictors of disability. Although studies based on numerical variables have reported inconsistent results depending on movement, pathology, and severity, our analysis demonstrates that functional regression consistently prioritizes flexion-extension and lateral flexion.
This comparison of variables and movements highlights the strength of the proposed procedure: combining information criteria with classical fit indices makes it possible to identify the predictors that are most strongly associated with disability. In our analyses, anthropometric and demographic variables (sex, age, neck length, head mass, and neck mass) did not significantly enhance the velocity-based model. Although age is known to influence cervical mobility, its effect was likely already captured by the functional predictors, and the relatively young composition of the sample limited our ability to detect age-related changes that typically emerge in older populations [37].
A key advantage of functional regression is that the coefficient functions indicate which phases of motion are most associated with disability. This information can be visualized through the product , where the predicted NDI depends on the balance between positive and negative areas under the curve (Fig 4). Positive areas (green) increase the predicted disability, whereas negative areas (orange) reduce it. Fig 4 shows this for two representative cases: one with a low NDI (4) and one with a higher NDI (15).
For flexion-extension (upper panels), the main contributions are concentrated in three phases: a positive area around the peak velocity when the head passes through the neutral position during extension and two negative areas, one during braking before maximum extension and another during re-acceleration afterwards. In the participant with higher disability, the angular velocity was globally reduced; however, the reduction was not proportional across phases. The decrease in the positive peak was moderate, whereas the negative lobes in the braking and re-acceleration were relatively more attenuated. This disproportionate reduction shifts the balance of the integral toward a net positive value, thereby leading to a higher predicted NDI value. Clinically, this pattern suggests that increased disability is linked to lower absolute velocities during braking and acceleration around end-range extension, which reduces the protective negative contributions and increases disability scores.
For lateral flexion (lower panels), the differences between participants with relatively lower and higher disability were less pronounced. Several positive and negative areas were observed; however, their overall effect was smaller than that of flexion–extension. In the bivariate model, lateral flexion primarily modulated the predicted NDI, acting as a secondary adjustment to the dominant influence of flexion-extension curve rather than determining it independently.
Taken together, these results illustrate how functional regression not only improves predictive performance but also provides interpretable information about the phases of motion most closely associated with disability. As previously noted, clinical assessment typically relies on discrete variables, most commonly range of motion, or in more advanced evaluations, velocity. However, our findings show that isolated measures of range or peak velocity provide insufficient information on how activities of daily living are actually performed. Daily tasks inherently involve phases of deceleration and reacceleration, which allow for appropriate adjustment of cervical motion and contribute to overall coordination. The proposed model enables assessment of velocity throughout the entire movement trajectory, revealing attenuation during these dynamic phases in participants with higher disability. Consequently, intervention programs should not be restricted to increasing joint range of motion alone but should also incorporate dynamic training of acceleration (concentric contraction) and deceleration (eccentric contraction). This therapeutic approach targets the neuromuscular control of movement limits and enhances both efficiency and functional relevance of cervical motion.
This study had some limitations that should be acknowledged. The sample size was relatively modest, although sufficient to yield statistically significant results. Moreover, participants were restricted to patients with nonspecific neck pain and mild to moderate disability. The ability of the method to detect significant associations within such a narrow NDI range may be considered a strength, since correlations are usually stronger when the spectrum of disability is wider and includes severe cases, where kinematic differences are more pronounced [10,12,38]. Even so, larger and more heterogeneous cohorts will be necessary to confirm the stability of the findings and extend their generalizability. Within such cohorts, the potential influence of age and sex should also be examined in more detail, as the present sample was relatively young and unbalanced, which may have limited the detection of demographic effects on the results. In addition, the coefficient functions and their interpretation refer specifically to patients with mild to moderate nonspecific neck pain; other clinical conditions may present different kinematic–disability relationships that will need to be determined in future studies. Despite these limitations, the present work makes a methodological contribution by demonstrating how functional regression can be used to control overfitting, select the most relevant predictors, and identify the motion phases most closely related to disability.
Conclusions
This study provides three methodological contributions to the assessment of neck disability using functional data analysis. First, scalar-on-function regression was applied to patients with nonspecific neck pain using kinematic curves instead of reduced summary measures. Second, a model selection strategy combining information criteria (AIC/BIC) with conventional fit indices was implemented to ensure accuracy and robustness. Third, a bivariate functional analysis was introduced, showing how functional regression can improve prediction and help identify which kinematic features are most closely associated with disability. Together, these contributions strengthen the methodological foundation and enhance the clinical interpretability of functional data analysis.
Our results, obtained in patients with nonspecific neck pain and low to moderate levels of disability, indicate that functional regression outperforms conventional models for assessing disability from cervical kinematics. This approach consistently identified angular velocities of flexion–extension and lateral flexion as the most informative predictors, and revealed that specific phases of motion—particularly braking and re-acceleration during flexion–extension—were associated with greater disability. Beyond predictive performance, these findings demonstrate the capacity of functional data analysis to uncover clinically meaningful patterns.
Supporting information
S1 Data. Kinematic data.
The data is stored in an RData file which contains 4 lists: 1) FE: this list contains the matrices of the angle (FEfi) and angular velocity (FEdfi) of the neck flexion-extension movement. The matrices are 56x101, so each row corresponds to a subject (the total sample is 56). 2) FL: matrices of angle (FLfi) and angular velocity (FLdfi) of lateral flexion. The matrices are 56x101, so each row corresponds to a subject. 3) RA: matrices of angle (RAfi) and angular velocity (RAdfi) of axial rotation. The matrices are 56x101, so each row corresponds to a subject. 4) num: this list contains the scalar data for each subject, so they are vectors of length 56: NDI (NDI value), sex (coded as 0 = male, and 1 = female), age (in years) l_neck (neck length in cm), mass_head (head mass in kg) and mass_neck (neck mass in kg).
https://doi.org/10.1371/journal.pone.0340428.s001
(ZIP)
References
- 1.
Ramsay J, Silverman BW. Functional data analysis. New York: Springer; 2005.
- 2.
Crane E, Childers D, Gerstner G, Rothm E. Functional data analysis for biomechanics. Functional data analysis for biomechanics. InTech; 2011.
- 3.
Gunning E, Warmenhoven J, Harrison AJ, Bargary N. Functional data analysis in biomechanics: a concise review of core techniques, applications and emerging areas. Springer Nature Switzerland; 2024.
- 4. Ullah S, Finch CF. Applications of functional data analysis: a systematic review. BMC Med Res Methodol. 2013;13:43. pmid:23510439
- 5. Dannenmaier J, Kaltenbach C, Kölle T, Krischak G. Application of functional data analysis to explore movements: walking, running and jumping - A systematic review. Gait Posture. 2020;77:182–9. pmid:32058281
- 6. Park J, Seeley MK, Francom D, Reese CS, Hopkins JT. Functional vs. traditional analysis in biomechanical gait data: an alternative statistical approach. J Hum Kinet. 2017;60:39–49. pmid:29339984
- 7. Davis HC, Luc-Harkey BA, Seeley MK, Troy Blackburn J, Pietrosimone B. Sagittal plane walking biomechanics in individuals with knee osteoarthritis after quadriceps strengthening. Osteoarthritis Cartilage. 2019;27(5):771–80. pmid:30660722
- 8. Devecchi V, Alalawi A, Liew B, Falla D. A network analysis reveals the interaction between fear and physical features in people with neck pain. Sci Rep. 2022;12(1):11304. pmid:35787648
- 9. Maly MR, Costigan PA, Olney SJ. Role of knee kinematics and kinetics on performance and disability in people with medial compartment knee osteoarthritis. Clin Biomech (Bristol). 2006;21(10):1051–9. pmid:16956703
- 10. De Rosario H, Vivas MJ, Sinovas MI, Page Á. Relationship between neck motion and self-reported pain in patients with whiplash associated disorders during the acute phase. Musculoskelet Sci Pract. 2018;38:23–9. pmid:30218881
- 11. Ylinen J, Takala E-P, Kautiainen H, Nykänen M, Häkkinen A, Pohjolainen T, et al. Association of neck pain, disability and neck pain during maximal effort with neck muscle strength and range of movement in women with chronic non-specific neck pain. Eur J Pain. 2004;8(5):473–8. pmid:15324778
- 12. Pinheiro CF, Oliveira AS, Will-Lemos T, Florencio LL, Fernández-de-Las-Peñas C, Dach F, et al. Neck active movements assessment in women with episodic and chronic migraine. J Clin Med. 2021;10(17):3805. pmid:34501252
- 13. Tsang SMH, Szeto GPY, So BCL, Lau RWL, Tai JJ. Using cervical movement velocity to assist the prediction of pain and functional recovery for people with chronic mechanical neck pain. Clin Biomech (Bristol). 2022;93:105607. pmid:35245780
- 14. Aragón-Basanta E, Venegas W, Ayala G, Page A, Serra-Añó P. Relationship between neck kinematics and neck dissability index. An approach based on functional regression. Sci Rep. 2024;14(1):215. pmid:38167615
- 15. Reiss PT, Goldsmith J, Shang HL, Ogden RT. Methods for scalar-on-function regression. Int Stat Rev. 2017;85(2):228–49. pmid:28919663
- 16. Matabuena M, Vidal JC, Hayes PR, Saavedra-Garcia M, Trillo FH. Application of functional data analysis for the prediction of maximum heart rate. IEEE Access. 2019;7:121841–52.
- 17. Akaike H. A new look at the statistical model identification. IEEE Trans Automat Contr. 1974;19(6):716–23.
- 18. Schwarz G. Estimating the dimension of a model. Ann Statist. 1978;6(2).
- 19. Liebl D, Willwacher S, Hamill J, Brüggemann G-P. Ankle plantarflexion strength in rearfoot and forefoot runners: a novel clusteranalytic approach. Hum Mov Sci. 2014;35:104–20. pmid:24746605
- 20. Howell ER. The association between neck pain, the Neck Disability Index and cervical ranges of motion: a narrative review. J Can Chiropr Assoc. 2011;55(3):211–21. pmid:21886283
- 21. Treleaven J, Chen X, Sarig Bahat H. Factors associated with cervical kinematic impairments in patients with neck pain. Man Ther. 2016;22:109–15. pmid:26585294
- 22. Franov E, Straub M, Bauer CM, Ernst MJ. Head kinematics in patients with neck pain compared to asymptomatic controls: a systematic review. BMC Musculoskelet Disord. 2022;23(1):156. pmid:35172799
- 23. Langley GB, Sheppeard H. The visual analogue scale: its use in pain measurement. Rheumatol Int. 1985;5(4):145–8. pmid:4048757
- 24. Vernon H. The Neck Disability Index: state-of-the-art 1991 -2008. J Manipulative Physiol Ther. 2008;31(7):491–502. pmid:18803999
- 25. Serra-Añó P, Venegas W, Page A, Inglés de la Torre M, Aguilar-Rodríguez M, Espí-López G. Immediate effects of a single session of cervical spine manipulation on cervical movement patterns in people with nonspecific neck pain: a randomized controlled trial. J Manipulative Physiol Ther. 2023;46(1):17–26. pmid:37422751
- 26. Baydal-Bertomeu JM, Page AF, Belda-Lois JM, Garrido-Jaén D, Prat JM. Neck motion patterns in whiplash-associated disorders: quantifying variability and spontaneity of movement. Clin Biomech (Bristol). 2011;26(1):29–34. pmid:20858573
- 27. Venegas W, Inglés M, Page Á, Serra-Añó P. Paths of the cervical instantaneous axis of rotation during active movements-patterns and reliability. Med Biol Eng Comput. 2020;58(5):1147–57. pmid:32193862
- 28. Venegas Toro Wr, Medina Fabara A, Zambrano Orejuela I, Rosales Acosta A, Page Del Pozo A. Validation of a dynamic model of the neck for applications in ergonomics and functional assessment. DYNAII. 2023;98(4):369–74.
- 29. Andrade Ortega JA, Delgado Martínez AD, Almécija Ruiz R. Validation of the Spanish version of the neck disability index. Spine (Phila Pa 1976). 2010;35(4):E114-8. pmid:20110848
- 30.
Lois JMB, Broseta MJV, del Horno SM, Sanchez-Sanchez ML, Matas M, Viosca E. Functional data analysis for gait analysis after stroke. Functional data analysis. Berlin, Heidelberg: Springer; 2013. p. 775–9.
- 31. Kipp K, Redden J, Sabick M, Harris C. Kinematic and kinetic synergies of the lower extremities during the pull in olympic weightlifting. J Appl Biomech. 2012;28(3):271–8. pmid:21975459
- 32. Richter C, O’Connor NE, Marshall B, Moran K. Comparison of discrete-point vs. dimensionality-reduction techniques for describing performance-related aspects of maximal vertical jumping. J Biomech. 2014;47(12):3012–7. pmid:25059895
- 33. Warmenhoven J, Cobley S, Draper C, Harrison A, Bargary N, Smith R. Considerations for the use of functional principal components analysis in sports biomechanics: examples from on-water rowing. Sports Biomech. 2019;18(3):317–41. pmid:29141500
- 34. Warmenhoven J, Bargary N, Liebl D, Harrison A, Robinson MA, Gunning E, et al. PCA of waveforms and functional PCA: a primer for biomechanics. J Biomech. 2021;116:110106. pmid:33429072
- 35. Warmenhoven J, Cobley S, Draper C, Harrison A, Bargary N, Smith R. Bivariate functional principal components analysis: considerations for use with multivariate movement signatures in sports biomechanics. Sports Biomech. 2019;18(1):10–27. pmid:29125036
- 36. Roach KE, Pedoia V, Lee JJ, Popovic T, Link TM, Majumdar S, et al. Multivariate functional principal component analysis identifies waveform features of gait biomechanics related to early-to-moderate hip osteoarthritis. J Orthop Res. 2021;39(8):1722–31. pmid:33615524
- 37. Pan F, Arshad R, Zander T, Reitmaier S, Schroll A, Schmidt H. The effect of age and sex on the cervical range of motion - a systematic review and meta-analysis. J Biomech. 2018;75:13–27. pmid:29776822
- 38. Magaña LC, Murati S, Riffitts M, Harrison C, Harris A, Sowa G, et al. Subjective and objective measures in assessing neck disability and pain in head and neck cancer. Laryngoscope. 2021;131(9):2015–22. pmid:33656195