Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Predictive models in SMA II natural history trajectories using machine learning: A proof of concept study

  • Giorgia Coratti ,

    Contributed equally to this work with: Giorgia Coratti, Jacopo Lenkowicz

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing

    Affiliations Pediatric Neurology, Università Cattolica del Sacro Cuore, Rome, Italy, Centro Clinico Nemo, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy

  • Jacopo Lenkowicz ,

    Contributed equally to this work with: Giorgia Coratti, Jacopo Lenkowicz

    Roles Data curation, Formal analysis, Methodology, Resources, Software, Validation, Writing – original draft, Writing – review & editing

    Affiliation Fondazione Policlinico Universitario A. Gemelli IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy

  • Stefano Patarnello,

    Roles Conceptualization, Funding acquisition, Validation, Writing – original draft, Writing – review & editing

    Affiliation Fondazione Policlinico Universitario A. Gemelli IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy

  • Consolato Gullì,

    Roles Data curation, Methodology, Validation, Visualization

    Affiliation Department of Radiological and Hematological Sciences Fondazione, Policlinico Universitario A. Gemelli, IRCCS Università Cattolica del Sacro Cuore, Largo A. Gemelli, Rome, Italy

  • Maria Carmela Pera,

    Roles Investigation, Methodology, Visualization

    Affiliations Pediatric Neurology, Università Cattolica del Sacro Cuore, Rome, Italy, Centro Clinico Nemo, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy

  • Carlotta Masciocchi,

    Roles Data curation, Methodology, Visualization

    Affiliation Fondazione Policlinico Universitario A. Gemelli IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy

  • Riccardo Rinaldi,

    Roles Data curation, Methodology, Visualization

    Affiliation Fondazione Policlinico Universitario A. Gemelli IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy

  • Valeria Lovato,

    Roles Funding acquisition, Visualization

    Affiliation Roche S.p.A., Monza, Monza e Brianza, Italy

  • Antonio Leone,

    Roles Data curation, Methodology, Supervision, Visualization

    Affiliation Department of Radiological and Hematological Sciences Fondazione, Policlinico Universitario A. Gemelli, IRCCS Università Cattolica del Sacro Cuore, Largo A. Gemelli, Rome, Italy

  • Alfredo Cesario,

    Roles Funding acquisition, Project administration, Resources, Validation, Visualization

    Affiliation Open Innovation Manager, Scientific Directorate, Fondazione Policlinico Universitario A. Gemelli IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy

  • Eugenio Mercuri

    Roles Conceptualization, Data curation, Funding acquisition, Methodology, Supervision, Writing – original draft, Writing – review & editing

    eugeniomaria.mercuri@unicatt.it

    Affiliations Pediatric Neurology, Università Cattolica del Sacro Cuore, Rome, Italy, Centro Clinico Nemo, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy

Abstract

It is known from previous literature that type II Spinal Muscular Atrophy (SMA) patients generally, after the age of 5 years, presents a steep deterioration until puberty followed by a relative stability, as most abilities have been lost. Although it is possible to identify points of slope indicating early improvement, steep decline and relative stabilizations, there is still a lot of variability within each age group and it’s not always possible to predict individual trajectories of progression from age only. The aim of the study was to develop a predictive model based on machine learning using an XGBoost algorithm for regression and report, explore and quantify, in a single centre longitudinal natural history study, the influence of clinical variables on the 6/12-months Hammersmith Motor Functional Scale Expanded score prediction (HFMSE). This study represents the first approach to artificial intelligence and trained models for the prediction of individualized trajectories of HFMSE disease progression using individual characteristics of the patient. The application of this method to larger cohorts may allow to identify different classes of progression, a crucial information at the time of the new commercially available therapies.

Introduction

Spinal muscular atrophy (SMA) is an autosomal recessive disorder characterized by loss of motor neurons with subsequent progressive muscle weakness and wasting [1]. Classically, SMA is described into subtypes (0-IV) based on age of onset and maximum function achieved, with type II patients achieving the ability to sit but not to walk independently [2, 3]. Several studies have reported natural history longitudinal data in type II SMA, mostly using disease specific outcome measures such as the Hammersmith Functional Motor Scale (HFMSE) or Revised Upper Limb Module (RULM) [49].

Type II patients generally present onset of clinical signs between 6 and 18 months of age, after they have achieved the ability to sit independently [2, 10]. In the first years after diagnosis, there may be some improvement in motor function that however is not constantly observed. Conversely, after the age of 5 years there is often a steep deterioration until puberty, with loss of several functional abilities, followed by a relative stability [5, 8, 9].

Both cross sectional and longitudinal studies have identified points of slope indicating early improvement, steep decline and relative stabilizations occurring at different ages in type II patients, there is still a lot of variability within each age group and it’s not always possible to predict individual trajectories of progression from age only [5, 8, 11]. In the last few years there has been an effort to identify predictors of progression in several diseases, including some neuromuscular disorder such as Duchenne muscular dystrophy [1214]. These studies have suggested that the possibility to predict progression increases by combining different variables in a composite model. To our knowledge, the possibility to predict progression in SMA using a number of clinical variables, such as weight, scoliosis, ventilation or nutritional status has not been systematically explored.

The aim of the study was to develop a predictive model and report, explore and quantify, in a single center longitudinal natural history study, the influence of clinical variables on the 6/12-months HMFSE prediction.

Material and methods

Cohort selection and dataset definition

All patients included had a genetic diagnosis of SMA and a phenotype compatible with type II, i.e. onset between 6 and 18 months and independent standing and walking never achieved. All the patients older than 2.5 years with at least three assessments were included. Assessments performed at the time the patients were treated with investigational or approved disease modifying therapies such as nusinersen or risdiplam were not included. The final dataset was created by taking the complete cases record (i.e. no missing values), retrieved form medical records, of all the available variables: gender, SMN2 copies, age at visit, age at symptom onset, anthropometric measures, Cobb values, vitamin D treatment, SMA specific surgeries (spinal or tendon surgeries), salbutamol treatment, acute hospitalizations, ventilatory and nutritional status. HFMSE total score and functional status (non sitters/sitters) were considered and included as an indicator of motor function. Height, weight and Cobb values were imputed for missing values via linear interpolation between visits with non-missing values.

XGBoost algorithm

XGBoost is a popular and efficient open-source implementation of the gradient boosted trees algorithm. Gradient boosting is a supervised learning algorithm, which attempts to accurately predict a target variable by combining the estimates of a set of simpler, weaker models. When using gradient boosting for regression, the weak learners are regression trees, and each regression tree maps an input data point to one of its leafs that contains a continuous score. XGBoost minimizes a regularized (L1 and L2) objective function that combines a convex loss function (based on the difference between the predicted and target outputs) and a penalty term for model complexity (in other words, the regression tree functions). The training proceeds iteratively, adding new trees that predict the residuals or errors of prior trees that are then combined with previous trees to make the final prediction. It’s called gradient boosting because it uses a gradient descent algorithm to minimize the loss when adding new models. Advantages of using this algorithm include: the non-linearity it introduces in the association among predictor variables and outcome; the ensemble of weak learners approach helps to prevent overfitting by an appropriate tuning of the model’s hyperparameters; it has built-in regularization terms in the loss function which help reduce overfitting and improve generalization; finally, it is designed to be computationally efficient and to support parallel and distributed computing, which is useful to explore wider hyperparameters spaces, and eventually perform incremental training in a multicentric setting without in principle sharing the actual datasets.

No preliminary feature selection was performed, and all the available variables in the dataset were given as input to the training algorithm. The algorithm itself therefore assigns a higher or lower importance score to each variable so that, at the end of the training phase, the different variables are ideally ranked by their importance in an optimal way. The study was approved by the institutional review board (ethics committee) of the Fondazione Policlinico Agostino Gemelli (project code:GEN-SMA01, prot n°0019648/21). Written informed consent was obtained from all participants (or guardians of participants) in the study.

Visit-by-visit analysis

The model was trained to predict the HFMSE value after 6 or 12 months from a given visit, based on the actual visit variables. For each time-point, the visit data were linked to the HFMSE value closest to 6 or 12 months from the visit date, which becomes the outcome of the predictive model. Data were extracted from the Gemelli datamart for Spinal Muscular Atrophy, were filtered by the inclusion/exclusion criteria and were then included in the model. After the construction of the datamart, the data were split into a training set and testing set at a 75%/25% proportion. An XGBoost algorithm for regression was run in 5-fold cross-validation on the training set for hyperparameters optimization, for a total of 5400 different models. The best model was chosen according to lowest cross-validation Root Mean Squared Error (RMSE). The model was then applied on the testing set to measure the RMSE and the Mean Absolute Error (MAE). The complete workflow is depicted in Fig 1.

thumbnail
Fig 1. Workflow of the machine learning analysis for the model.

https://doi.org/10.1371/journal.pone.0267930.g001

Results

6-months prediction

Considering the 6-month interval and applying the filtering criteria, a total of 30 patients and 324 visits were included. Average number of visits per patient was 10.8, with a minimum of 3.0, a maximum of 23.0 and interquartile range 7.2–13.7. According to the machine learning model described above, 22 patients were assigned to the training set (239 visits), and 8 patients (85 visits) were assigned to the testing set. Of the patients included in the analysis, 2 patients (training set) were able to stand with support at truck. Table 1 shows the dataset characteristics summary subdivided by training and testing set.

thumbnail
Table 1. 6-months HMFSE dataset summary.

Numerical values are reported as median and interquartile range.

https://doi.org/10.1371/journal.pone.0267930.t001

The best XGBoost model showed a testing MAE of 1.86 HFMSE points. The five most important variable for the model according to the Shapley Additive Explanations (SHAP) framework are (Fig 2A): HFMSE value at given visit, age at current visit, first recorded HFMSE value, value of Cobb angle at current visit, age at symptoms onset. The prediction for 6-month value of HFMSE are influenced towards higher values by: a higher HFMSE value at current visit, a lower age at visit, a higher value of first recorded HFMSE and a lower Cobb angle (Fig 2B). Fig 3 shows four examples of visit-by-visit trajectory as predicted by the model compared to the actual values.

thumbnail
Fig 2.

Top 20 features importance for 6 months model according to mean SHAP value (A) and SHAP value (B).

https://doi.org/10.1371/journal.pone.0267930.g002

thumbnail
Fig 3. Trajectory predictions on the testing set for 4 patients.

Key to figure: Blu line = actual HFMSE progression, Red line = model-predicted HFMSE progression.

https://doi.org/10.1371/journal.pone.0267930.g003

12-months prediction

Considering the 12-month interval and after applying the new filtering criterion, patients eligible for the analysis were 29, for a total of 316 visits. Average number of visits per patient was 10.9, with a minimum of 3.0, a maximum of 24.0 and interquartile range 7.0–14.0. Of these, 21 patients were assigned to the training set (233 visits), and 8 patients (83 visits) were assigned to the testing set.

Table 2 reports the dataset summary characteristics applying to the 12-months interval.

thumbnail
Table 2. 12-months HMFSE dataset summary.

Numerical values are reported as median and interquartile range.

https://doi.org/10.1371/journal.pone.0267930.t002

The best XGBoost model showed a testing set MAE of 1.97 HFMSE points. The five most important variable for the model according to the SHAP framework are: HFMSE value at current visit, age at current visit, Cobb angle, first recorded HFMSE value, Non-invasive ventilation at current visit (Fig 4A and 4B).

thumbnail
Fig 4.

Top 20 features importance for 6 months model according to mean SHAP value (A) and SHAP value (B).

https://doi.org/10.1371/journal.pone.0267930.g004

A prediction for a sample testing patient and corresponding prediction-by-prediction SHAP values is reported in Fig 5 and the explanation in Fig 5.

thumbnail
Fig 5.

Prediction (A) and corresponding prediction by-prediction (B) for a sample testing patient. Key to Fig = Panel A: comparison between the actual 12 months trajectory (blue) of a testing set patient and the corresponding trajectory predicted by the 12-months model (red). Panel B: a focus on the predicted trajectory to show the variables contributing to the predictions along the trajectory, according to the SHAP framework. The red area represents positive contribution to the prediction, while the blue area represent negative contribution (e.g.: the first prediction (HFMSE equal to 11.14) relies on the positive contribution of age at current visit (5.3 years), and the negative contributions of age at symptoms onset (0.58 years), Cobb angle value (67.9), and previous HFMSE value).

https://doi.org/10.1371/journal.pone.0267930.g005

Discussion

Several international efforts have reported that in SMA type 2 patients motor function progression is not linear and that different slopes of progression can be identified using appropriate functional measures such as the HFMSE [5, 8, 9, 1517] or other measures [7, 1820]. In some of those studies, using multivariate/descriptive analysis per cohort or subgroup of patients, age appears to be an important predictor. Children younger than 5 years appear to have the highest chances of showing an improvement in HFMSE scores while those between 5 and 13 years are in contrast most susceptible to negative changes. Even when using age subgroups based on these cutoff points, there was still a high variability that could not be explained by multivariate analysis including other variables such as gender or SMN2 copies.

The objective of this study was not to describe HFMSE disease progression by cohorts or pre-defined subgroups (e.g. age, phenotype severity), but to train a model able to predict individualized trajectories of HFMSE disease progression on the basis of the individual characteristics of the patient.

Following the suggestion that a composite and individualized model may improve the prognostic accuracy of disease progression [12, 13], we applied a machine learning approach using an XGBoost algorithm for regression. The advantage and peculiarity of this method compared to multivariate analysis is that it provides an estimate of the possible individual trajectory based on the baseline features, each of them assessed to establish their prognostic value. Moreover, unlike predictive models which rely on baseline features only, our approach is able to update the trajectory prediction at each visit time-point, thus capturing the dynamics of each exploratory variable over time.

The results of our analysis confirmed that age is an important prognostic factor but also showed that other variables may contribute to influence the progression of the disease. The analysis allowed to establish that HFMSE value at visit, i.e the first assessment of the two in a given interval, also appears to have a relevant impact on the prediction. Other variables, such as age at symptoms onset, as well as the HFMSE value recorded on the very first visit and BMI, also partially contributed to the prediction. Cobb’s angle and non-invasive ventilation were also very relevant, suggesting that increasing scoliosis and ventilatory status also contribute. In contrast, other variables such as gender and SMN2 copies, did not appear to have a strong influence, as suggested by previous studies using multivariate analysis [5, 9].

These results should be interpreted with caution as the study was meant to be a proof of concept on a relatively small cohort and the results need to be validated in a larger cohort. The principal limitations of this study are that natural history data were drawn from a single center and that, even if there was a relatively large number of 6 month and 12 month follow-up intervals (>300 visits), the number of patients was much smaller (n = 30). Furthermore, it is known that the HFMSE is able to measure progression overtime but that can present floor/ceiling effect depending on age and functional status [15, 16, 19, 21, 22]. To address these concerns, additional work is in progress to establish external validity in separate datasets with a greater sample size. To address the issue of floor/ceiling effect in the prediction of the results, the choice of a predictive model whose underlying algorithm can introduce non-linear effects by partitioning the variables’ space into different sets, helps the predictive accuracy, as the model is not prone to extrapolation as it would be with linear or polynomial models.

In conclusion, our results suggest a possible role of this method that uses different criteria than those used in previous studies. Rather than providing a general rate of progression for a cohort or identify factors contributing to the progression through a multivariate analysis, the model can potentially provide more individualized trajectories. The application of this method to larger cohorts may allow to identify different classes of progression. The need to define more precise trajectories and predict patient outcome is crucial at the time when real world data from the commercially available new therapies are becoming increasingly available and there is the need to measure drug effect or potential treatment effect. This information may also be potentially used for clinical trial design to reduce variability and manage inclusion and criteria stratification.

Acknowledgments

We are thankful to the Gemelli SMA clinical and research group for the support (Marika Pane, Roberto de Sanctis, Elena Mazzone, Simona Lucibello, Sara Carnicella, Laura Antonaci, Gloria Ferrantini, Annalia Frongia, Giulia Stanca, Giulia Norcia, Nicola Forcina, Daniela Leone, Concetta Palermo, Beatrice Berti, Costanza Cutrona, Chiara Bravetti, Diletta Rossi).

References

  1. 1. D’Amico A, Mercuri E, Tiziano FD, Bertini E. Spinal muscular atrophy. Orphanet Journal of Rare Diseases. 2011;6: 71. pmid:22047105
  2. 2. Dubowitz V. Chaos in the classification of SMA: a possible resolution. Neuromuscular disorders: NMD. 1995;5: 3–5. pmid:7719138
  3. 3. Mercuri E, Finkel RS, Muntoni F, Wirth B, Montes J, Main M, et al. Diagnosis and management of spinal muscular atrophy: Part 1: Recommendations for diagnosis, rehabilitation, orthopedic and nutritional care. Neuromuscular disorders: NMD. 2018;28: 103–115. pmid:29290580
  4. 4. Mazzone ES, Mayhew A, Montes J, Ramsey D, Fanelli L, Young SD, et al. Revised upper limb module for spinal muscular atrophy: Development of a new module. Muscle & Nerve. 2017;55: 869–874. pmid:27701745
  5. 5. Coratti G, Pera MC, Lucibello S, Montes J, Pasternak A, Mayhew A, et al. Age and baseline values predict 12 and 24-month functional changes in type 2 SMA. Neuromuscular Disorders. 2020;30: 756–764. pmid:32900576
  6. 6. Coratti G, Lucibello S, Pera MC, Duong T, Muni Lofra R, Civitello M, et al. Gain and loss of abilities in type II SMA: A 12-month natural history study. Neuromuscular Disorders. 2020;30: 765–771. pmid:32893082
  7. 7. Pera MC, Coratti G, Mazzone ES, Montes J, Scoto M, De Sanctis R, et al. Revised upper limb module for spinal muscular atrophy: 12 month changes: 12 month changes on RULM for SMA. Muscle Nerve. 2019;59: 426–430. pmid:30677148
  8. 8. Kaufmann P. Observational Study of Spinal Muscular Atrophy Type 2 and 3: Functional Outcomes Over 1 Year. Arch Neurol. 2011;68: 779. pmid:21320981
  9. 9. Mercuri E, Finkel R, Montes J, Mazzone ES, Sormani MP, Main M, et al. Patterns of disease progression in type 2 and 3 SMA: Implications for clinical trials. Neuromuscular Disorders. 2016;26: 126–131. pmid:26776503
  10. 10. Pera MC, Coratti G, Berti B, D’Amico A, Sframeli M, Albamonte E, et al. Diagnostic journey in Spinal Muscular Atrophy: Is it still an odyssey? Allen MD, editor. PLoS ONE. 2020;15: e0230677. pmid:32203538
  11. 11. Mercuri E, Lucibello S, Pera MC, Carnicella S, Coratti G, de Sanctis R, et al. Long-term progression in type II spinal muscular atrophy: A retrospective observational study. Neurology. 2019;93: e1241–e1247. pmid:31451515
  12. 12. Goemans N, Wong B, Van den Hauwe M, Signorovitch J, Sajeev G, Cox D, et al. Prognostic factors for changes in the timed 4-stair climb in patients with Duchenne muscular dystrophy, and implications for measuring drug efficacy: A multi-institutional collaboration. Li Y, editor. PLoS ONE. 2020;15: e0232870. pmid:32555695
  13. 13. Goemans N, vanden Hauwe M, Signorovitch J, Swallow E, Song J, Collaborative Trajectory Analysis Project (cTAP). Individualized Prediction of Changes in 6-Minute Walk Distance for Patients with Duchenne Muscular Dystrophy. Jefferies JL, editor. PLoS ONE. 2016;11: e0164684. pmid:27737016
  14. 14. Mercuri E, Signorovitch JE, Swallow E, Song J, Ward SJ. Corrigendum to “Categorizing natural history trajectories of ambulatory function measured by the 6-minute walk distance in patients with Duchenne muscular dystrophy” [Neuromuscular Disorders 26/9 (2016) 576–583]. Neuromuscular Disorders. 2017;27: e1. pmid:28284874
  15. 15. Wijngaarde CA, Stam M, Otto LAM, Bartels B, Asselman F-L, van Eijk RPA, et al. Muscle strength and motor function in adolescents and adults with spinal muscular atrophy. Neurology. 2020;95: e1988–e1998. pmid:32732299
  16. 16. Wadman RI, Wijngaarde CA, Stam M, Bartels B, Otto LAM, Lemmink HH, et al. Muscle strength and motor function throughout life in a cross‐sectional cohort of 180 patients with spinal muscular atrophy types 1c–4. Eur J Neurol. 2018;25: 512–518. pmid:29194869
  17. 17. Mendonça RH, Polido GJ, Matsui C, Silva AMS, Solla DJF, Reed UC, et al. Real-World Data from Nusinersen Treatment for Patients with Later-Onset Spinal Muscular Atrophy: A Single Center Experience. J Neuromuscul Dis. 2021;8: 101–108. pmid:33074187
  18. 18. Annoussamy M, Seferian AM, Daron A, Péréon Y, Cances C, Vuillerot C, et al. Natural history of Type 2 and 3 spinal muscular atrophy: 2-year NatHis-SMA study. Ann Clin Transl Neurol. 2021;8: 359–373. pmid:33369268
  19. 19. Coratti G, Pera MC, Montes J, Pasternak A, Scoto M, Baranello G, et al. Different trajectories in upper limb and gross motor function in spinal muscular atrophy. Muscle & Nerve. 2021; mus.27384. pmid:34327716
  20. 20. Pierzchlewicz K, Kępa I, Podogrodzki J, Kotulska K. Spinal Muscular Atrophy: The Use of Functional Motor Scales in the Era of Disease-Modifying Treatment. Child Neurol Open. 2021;8: 2329048X211008725. pmid:33997096
  21. 21. Vázquez-Costa JF, Povedano M, Nascimiento-Osorio AE, Escribano AM, Garcia SK, Dominguez R, et al. Validation of motor and functional scales for the evaluation of adult patients with 5q spinal muscular atrophy. Neurology; 2021 Jun.
  22. 22. Wu JW, Pepler L, Maturi B, Afonso ACF, Sarmiento J, Haldenby R. Systematic review of motor function scales and patient reported outcomes in spinal muscular atrophy. American Journal of Physical Medicine & Rehabilitation. 2021;Publish Ahead of Print. pmid:34483260