Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Evaluation of smartphone-based cough data in amyotrophic lateral sclerosis as a potential predictor of functional disability

  • Pedro S. Rocha ,

    Roles Conceptualization, Data curation, Formal analysis, Writing – original draft

    pedro.rocha@medicina.ulisboa.pt

    Affiliations Institute of Physiology, Lisbon School of Medicine, University of Lisbon, Lisbon, Portugal, Mamede de Carvalho Lab, Institute of Molecular Medicine - João Lobo Antunes, University of Lisbon, Lisbon, Portugal

  • Nuno Bento,

    Roles Formal analysis, Validation, Writing – review & editing

    Affiliation Fraunhofer Portugal AICOS, Lisbon, Portugal

  • Duarte Folgado,

    Roles Data curation, Validation, Writing – review & editing

    Affiliations Fraunhofer Portugal AICOS, Lisbon, Portugal, LiBPhys (Laboratory for Instrumentation Biomedical Engineering and Radiation Physics), NOVA School of Science and Technology, Lisbon, Portugal

  • André V. Carreiro,

    Roles Supervision, Validation, Writing – review & editing

    Affiliation Fraunhofer Portugal AICOS, Lisbon, Portugal

  • Miguel Oliveira Santos,

    Roles Data curation, Validation

    Affiliations Institute of Physiology, Lisbon School of Medicine, University of Lisbon, Lisbon, Portugal, Department of Neurosciences and Mental Health, Hospital de Santa Maria -CHULN, Lisbon, Portugal

  • Mamede de Carvalho,

    Roles Funding acquisition, Project administration, Supervision, Writing – review & editing

    Affiliations Institute of Physiology, Lisbon School of Medicine, University of Lisbon, Lisbon, Portugal, Mamede de Carvalho Lab, Institute of Molecular Medicine - João Lobo Antunes, University of Lisbon, Lisbon, Portugal, Department of Neurosciences and Mental Health, Hospital de Santa Maria -CHULN, Lisbon, Portugal

  • Bruno Miranda

    Roles Funding acquisition, Project administration, Supervision, Validation, Writing – review & editing

    Affiliations Institute of Physiology, Lisbon School of Medicine, University of Lisbon, Lisbon, Portugal, Mamede de Carvalho Lab, Institute of Molecular Medicine - João Lobo Antunes, University of Lisbon, Lisbon, Portugal

Abstract

Objectives

Cough dysfunction is a feature of patients with amyotrophic lateral sclerosis (ALS). The cough sounds carry information about the respiratory system and bulbar involvement. Our goal was to explore the association between cough sound characteristics and the respiratory and bulbar functions in ALS.

Methods

This was a single-center, cross-sectional, and case-control study. On-demand coughs from ALS patients and healthy controls were collected with a smartphone. A total of 31 sound features were extracted for each cough recording using time-frequency signal processing analysis. Logistic regression was applied to test the differences between patients and controls, and in patients with bulbar and respiratory impairment. Support vector machines (SVM) were employed to estimate the accuracy of classifying between patients and controls and between patients with bulbar and respiratory impairment. Multiple linear regressions were applied to examine correlations between cough sound features and clinical variables.

Results

Sixty ALS patients (28 with bulbar dysfunction, and 25 with respiratory dysfunction) and forty age- and gender-matched controls were recruited. Our results revealed clear differences between patients and controls, particularly within the frequency-related group of features (AUC 0.85, CI 0.79–0.91). Similar results were observed when comparing patients with and without bulbar dysfunction. Sound features related to intensity displayed the strongest correlation with disease severity, and were the most significant in distinguishing patients with and without respiratory dysfunction.

Discussion

We found a good relationship between specific cough sound features and clinical variables related to ALS functional disability. The findings relate well with some expected impact from ALS on both respiratory and bulbar contributions to the physiology of cough. Finally, our approach could be relevant for clinical practice, and it also facilitates home-based data collection.

1. Introduction

Amyotrophic Lateral Sclerosis (ALS) is a progressive neurodegenerative disease characterized by the loss of both upper and lower motor neurons [1]. The consequent motor dysfunction leads to symptoms affecting limbs, bulbar and respiratory muscles—and eventual death by respiratory insufficiency or infection [2]. In general, the disease is characterized by significant variability in onset region, as well as in the pattern and rate of progression [37].

The majority of cases show either a spinal phenotype or a bulbar variant, but some patients present initial trunk or respiratory involvement [5, 8, 9]. However, an early respiratory and bulbar impairment are associated with poor quality of life, malnutrition, and early mortality [10, 11]. Non-invasive pulmonary function tests, in particular forced vital capacity (FVC), has long been used for respiratory assessment and monitoring. However, they require cooperative patients, good lips strength, and repeated testing to ensure consistency of measurements [12]. On the other hand, an established and comprehensive clinical scale to objectively monitor bulbar disease and respiratory progression in ALS has yet to be achieved [13].

During the last few years, objective evaluation of cough sounds, in particular evaluating its quantitative characteristics in terms of sound frequency or intensity, has gained popularity for detecting and distinguishing different respiratory dysfunctions [12, 1418]. The increasing evidence concerning the objective evaluation of cough is also grounded by the physiological mechanisms of coughing which require considerable coordination and timing of breathing, thus being sensitive to abnormalities in the respiratory system [19, 20]. Physiologically, cough involves a deep inspiration, followed by vigorous contraction of the expiratory muscles (in particular the abdominal muscles) against a closed glottis. When a certain subglottic pressure is reached, the glottis opens, producing one initial supramaximal expiratory airflow followed by a longer-lasting lower expiratory flow, generating the cough sound at the same time. Importantly, such physiological mechanisms for a normal cough also rely on a normal bulbar function, being especially relevant for the glottis and intrinsic laryngeal muscles’ performance. The latter muscles are the ones responsible for the dimensions of the glottis rhyme (i.e., the tension regulation of the vocal ligaments) and changes in laryngeal opening and closing—which are key properties of the cough sound. In more advanced ALS, cough is generally weak and absent [21, 22]; this causes inability to clear secretions, eases choking and impairs protection of the respiratory system—often leading to aspiration pneumonia. Recent progress has been made to take advantage of sensors to monitor the functional state of ALS patients, including for home-based assessments [2326]. Stegmann et al. [27] used a mobile application (app) installed on the patient’s mobile device to record speech acoustics and to predict their forced vital capacity (FVC). Furthermore, Vashkevich et al. [28] proposed an approach to voice assessment for automatic systems to differentiate healthy individuals from ALS patients (based on sustained phonation of the vowels /a/ and /i/). They used a wide range of acoustic features to achieve high accuracy in this classification. A feasibility study utilizing cough sound to differentiate between healthy individuals and those with ALS was recently conducted by Cebola et al. [29]. The study endorsed the viability of using coughs for remote monitoring; however, the sample size was limited and not gender-matched. Despite previous efforts focused on studying speech and cough acoustics, very few studies have comprehensively explored the potential of cough sound analysis in ALS, making this work novel on its approach.

In this study, we hypothesize that cough sound features obtained by a smartphone and using time- and frequency-domain analysis, could inform about bulbar and respiratory impairments in ALS patients. Thus, the present work aims to: 1) evaluate if the sound features of a voluntary cough in ALS patients are different from age- and gender-matched healthy controls; 2) correlate cough sound features with functional status, respiratory and bulbar impairment in ALS patients; and 3) test the hypothesis that frequency sound features have a stronger association with bulbar dysfunction, while intensity sound features are more closely related to respiratory dysfunction. Furthermore, we aimed to evaluate the usefulness of machine learning for conducting future home-based assessments, by recording audio samples with a commonly available device, in an ecological setting.

2. Materials and methods

2.1 Study design and participants

This was a single-center, cross-sectional, case-control study that was part of a broader ALS project (HomeSenseALS—PTDC/MEC-NEU/6855/2020). We included consecutive ALS patients according to Gold Coast criteria [30]. All patients were followed at our ALS clinic in Lisbon, and had full neurological, neurophysiological, neuroimaging and blood tests to rule out mimicking conditions [31]. Patients with a previous history of lung disorders, with resting dyspnea, severe cognitive involvement impairing the understanding of the voluntary coughing task, and those declining to participate were excluded. In the control group we included healthy age- and gender-matched controls (in general spouses of the ALS patients and people working in the institution). The recruitment started on April 4, 2022, and was concluded on August 31, 2023. The study was approved by the local research ethics committee of the Centro Académico de Medicina de Lisboa (CAML-Ref. 146/21). All participants gave written informed consent, which was in accordance with the declaration of Helsinki.

2.2 Clinical evaluation

For ALS patients, we collected demographic data including age, sex, body mass index (BMI), smoking habits, disease duration, and the region of disease onset. To evaluate the functional disability, we used the revised functional ALS rating scale (ALSFRS-R) [32].

Respiratory symptoms were determined based on the ALSFRS-R respiratory subscore (which consists of questions 10 through 12 pertaining to dyspnea, orthopnea, and respiratory insufficiency). This approach was used because not all patients had spirometry measures within 6 weeks prior of the cough sound recordings, making it not possible to define respiratory dysfunction using standardized metrics such as FVC. Thus, patients with a score less than 12 were considered to have respiratory dysfunction [32]. For those who underwent pulmonary function tests, sitting predicted FVC (FVC%) was measured using a computer-based USB spirometer (microQuark®, Cosmed®), and the best of three reliable maneuvers was used for statistics [11]. In addition to FVC%, the following respiratory measures were also included: maximum expiratory and inspiratory pressures (MIP% and MEP%, respectively) and cough peak flow (CPF). Similarly, bulbar symptoms were evaluated using the ALSFRS-R bulbar subscore (which consists of questions 1 through 3 about speech, salivation, and swallowing). Patients with a score less than 12 were considered having bulbar dysfunction [32]. This data was accessed retrospectively, between August 31, 2023, and October 31, 2023. The authors had no access to information that could identify individual participants during or after data collection.

2.3 Cough sound: Recording, signal processing and feature extraction

All subjects were instructed to perform, while seated in a quiet room, three voluntary coughs (to ensure a repeatable sound relationship). The instructions were straightforward and standardized for all participants: “Please, cough as strong as you can when prepared”–without any additional verbal encouragement. One assessor trained in the protocol conducted the sound recordings using a smartphone (OnePlus, model: BE2013, from OnePlus Technology (Shezhen) Co.Ltd, China) placed approximately 20–25 cm away from the mouth and at an angle of approximately 45° (as described in [12]). These procedures aimed to remove effects of wind noise produced when one rapid expulsion of air directly hits the microphone. For patients, the cough sounds were recorded during a routine patient’s clinical visit, after ensuring that the patient was resting for a period longer than 10 minutes, and comfortable without dyspnea.

After the cough data collection, the raw signal was processed with Librosa—a Python package for audio signal analysis [33]. The analysis was conducted using a frame length of 2048 samples per frame and a hop length of 512. In order to minimize potential biases stemming from the beginning and end of the recordings (and to ensure that the analysis was focused solely on the cough time frames) the split function of Librosa was employed with a cutoff of 20 decibels eliminating the initial and final periods of silence in the cough recordings.

Once the pre-processing was completed, the generated cough sound signals were analyzed to extract audio-based features. For this, we used the Time Series Feature Extraction Library (TSFEL) that automatically extracts over 60 different features on the statistical, temporal, and spectral domains [34].

In light of prior research findings [12, 35, 36] and relevance in general sound analysis, we pre-selected 11 features based on the time domain and 20 features based on the frequency domain (see details in Table 1 and consult [34] for more comprehensive information on each feature). To enhance the interpretation of the results and decrease the effect of multiple comparison, we subsequently categorized these features (both from the time and frequency domains) into three distinct groups, each pertaining to specific underlying information (with potential relevance for the various physiological steps of coughing): 1) a group encompassing sound frequency-related features—the frequency group; 2) another group comprising sound intensity-related features—the intensity group; and 3) a final group that combines features of both frequency and intensity domains—the mixed group. All extracted features were normalized to their maximum value (with a range between -1 and 1) (Fig 1). This division also enables a broader perspective on how different, yet related, features (and their combinations) impact the functional state of the patient (group view), rather than focusing solely on the impact of individual sound features (singular view).

thumbnail
Fig 1. The workflow of feature categorization according to frequency, intensity, and mixed groups.

https://doi.org/10.1371/journal.pone.0301734.g001

thumbnail
Table 1. List of cough sound features subjected to analysis.

https://doi.org/10.1371/journal.pone.0301734.t001

2.4 Machine-learning analysis

After the feature extraction step, a dataset was built for the purpose of a binary classification task with the objective of distinguishing between ALS patients and control subjects, patients with and without bulbar dysfunction, and patients with and without respiratory dysfunction. The dataset was partitioned, with 75% of the data allocated to the training set and the remaining data designated for testing. The process of shuffling resulted in a well-balanced test set in terms of class, age, gender, and dataset distribution. To identify a small subset of relevant features for the objective analysis of bulbar and respiratory ALS dysfunction, the extracted cough sound features underwent feature selection using the sequential feature selection (SFS) algorithm based on a logistic regression (LR) classifier. Through SFS, we selected the cough sound features that were strongly correlated with the class, thus removing the less relevant features from the original dataset. This selection process is independent of the machine-learning classifier used for the final classification. The main classification task was performed by training a support vector machine (SVM) classifier based on the linear kernel. In this process, only the most relevant features, which were selected in the preceding step were considered, in an attempt to reduce the potential for overfitting. For model evaluation, ROC-AUC (area under the curve) scores were calculated over five iterations, each with a distinct random seed, so that it would be possible to estimate the 95% confidence interval. This comprehensive procedure facilitated the assessment of model stability and reliability.

2.5 Statistical analysis

Data analysis was performed using Python version 3.11.2 (Python Software Foundation). For the significance level, α = 0.05 was considered. Descriptive statistics consisted of frequencies (with proportions) for categorical variables and mean values (with standard deviation) for continuous variables. To compare mean values, parametric tests such as the two-sample t-test or the one-way ANOVA were applied. If the normality assumption of the continuous variable was violated (significant Kolmogorov-Smirnov test with an absolute skewness > 2), non-parametric tests such as Mann-Whitney U-test or Kruskal-Wallis test were considered and results reported, if different from parametric analysis.

ROC analyses were performed to identify the ROC-AUC of the SVM, for discriminating between:

  1. controls vs. ALS—with the frequency group of features;
  2. controls vs. ALS—with the intensity group of features;
  3. controls vs. ALS—with the mixed group of features.

Similar analyses were carried out for the comparison between patients with bulbar dysfunction vs. those without; and for patients with respiratory dysfunction vs. those without. Age and gender were added into each set of features, enabling the SVM model to consider these important demographic factors.

Finally, we examined how each of the selected features related to the disability score and pulmonary function tests in ALS patients. For the former, multiple linear regression models were used, having the ALSFRS-R total score as dependent variable and age and gender as confounding variables. On the other hand, simple linear correlations were used to elucidate associations between sound features and pulmonary function measurements, including FVC%, MEP%, MIP% and CPF. Linear correlations were chosen because all pulmonary function metrics had already been normalized for various demographic factors (including age and gender) [37].

3. Results

3.1 Demographics and clinical characteristics

We analyzed 300 cough sounds recordings from a total of 100 subjects (60 ALS patients and 40 controls—3 cough sounds each). The demographic and clinical characteristics of participants are shown in Table 2. Groups had no significant differences in terms of age (p = 0.79) or sex distribution (p = 0.21). There were no statistically significant differences between patients with vs without respiratory dysfunction, as well as between patients with vs without bulbar dysfunction, in terms of age, BMI, disease duration and percentage of smokers (all t-tests with p-values > 0.05). However, the frequency of females was higher in the group with bulbar dysfunction (71% vs 34%, p< 0.001) and in the group of patients with respiratory dysfunction (72% vs 43%, p< 0.05).

thumbnail
Table 2. Baseline characteristics of whole ALS patient’s population (n = 60), and controls (N = 40).

https://doi.org/10.1371/journal.pone.0301734.t002

3.2 Cough sound features in ALS and healthy controls

We started by comparing the frequency group of cough sound features in ALS patients vs. controls. The SFS algorithm selected six features of the thirteen initially proposed (where both age and gender were added), including fundamental frequency, number of the spectrum positive turning points, spectral bandwidth, spectral roll-off, spectral dispersion, and zero-crossing rate (ZCR). Following ROC analysis (Fig 2), the prediction ROC-AUC of the final model with the seven selected features was 0.85 (IC 95%: 0.79–0.91).

thumbnail
Fig 2. Support vector machine (SVM) analysis of all cough sound samples.

Receiver operating characteristic curves (ROC) were calculated with a SVM to differentiate ALS patients and controls for the three different groups of cough sound features. One of the five model’s iterations is demonstrated (settings: k_features = ‘best’; forward = ‘False’; scoring = ‘accuracy’; cv = ‘5’; random_state = ‘41’).

https://doi.org/10.1371/journal.pone.0301734.g002

Similar analyses were performed for the remaining intensity and mixed groups. The SFS applied to the intensity group of features resulted in the selection of four features out of the initial twelve. Specifically, these included the temporal centroid, the mean, and the kurtosis of the signal, and presumed gender. However, the model with the four intensity features exhibited a modest performance of 0.59 (IC 95%: 0.52–0.66). Also, to note that only the temporal centroid and the kurtosis of the signal demonstrated significant discriminative capability between an ALS-related cough and a control cough.

Regarding the mixed group, the model comprised the following selected features: absolute energy, spectral and time entropies, maximum power of the signal, total amount of energy, and presumed gender. However, similar to the intensity group, the overall model performance was only modest, yielding an ROC-AUC of 0.6 (IC 95%: 0.52–0.68). To also note that only the time and spectral entropies exhibited significant predictive capability for distinguishing between ALS patients and healthy controls. Fig 3 shows an example where it is evident that the primary distinctions between an ALS and a control cough lie within the frequency group of features. Table 3 shows all statistical values.

thumbnail
Fig 3. An example of the analysis of sound waves in voluntary coughing of a healthy control (upper row) and an ALS patient (lower row).

Each column of the image depicts different features of the three distinct groups of features. The first column highlights the frequency features associated with the repetition rate of one event, such as the number of times that the signal passes the zero line or the number of positive turning points. The second column emphasizes the intensity features, such as the signal amplitude or peak distance. Lastly, the third column shows features that provide information on both frequency and intensity, such as the signal power and entropy. The main differences were observed in the frequency group. To note that these cough signals did not undergo pre-processing procedures.

https://doi.org/10.1371/journal.pone.0301734.g003

thumbnail
Table 3. F values from regression analyses contributing of control vs. ALS classification to performance on each voice sound variable.

https://doi.org/10.1371/journal.pone.0301734.t003

3.3 Correlations with the overall functional disability

Next, we started to focus more specifically on the ALS patients, and how the cough sound features were related to the functional disability of the disease. We found that the intensity, as well as the mixed group features, exhibited the strongest correlations with ALSFRS-R total score—indicating that patients with more severe symptoms produced cough sounds with a greater relative impact on the intensity domain (Fig 4). Broadly, patients in more advanced functional states produce less intense cough sounds.

thumbnail
Fig 4. Analysis of sound waves in voluntary coughing: Comparison between patients in different disease states.

The left image represents the cough sound of one patient in a better functional state (Female; 60> years old; ALSFRS-R total score of 39) versus the right image, which represents the cough sound of one patient in a worse functional state (Female; 60> years old; ALSFRS-R total score of 20). The main differences are presented in the intensity-related group of features. (Signal without pre-processing).

https://doi.org/10.1371/journal.pone.0301734.g004

Moderate but significant correlations have been found between the ALSFRS-R total score and various intensity features, including the maximum amplitude (beta = 0.43, p = 6.85e-4), the standard deviation of the amplitude of the signal (beta = 0.33, p = 1.55e-5) and the peak-to-peak distance (beta = 0.44, p = 4.07e-4). Moreover, we found moderate to strong negatively significant correlations, with the maximum cough sound power (beta = -0.58, p = 2.54e-10); and moderate positively significant correlations with the area under the curve of the signal and the absolute energy (beta = 0.35, p = 9.76e-6; beta = 0.34, p = 1.67e-6, respectively).

Despite being effective in distinguishing cough sounds from ALS and healthy controls, the frequency group of features showed weaker associations with the functional status of the disease. Our analysis revealed that ZCR (beta = 0.20, p = 3.01e-6), and the number of positive turning points (beta = 0.25, p = 1.10e-5) exhibited weak positively significant correlations with the ALSFRS-R total score. In contrast, the spectral centroid (beta = 0.38, p = 1.97e-4), and spectral bandwidth (beta = 0.32, p = 8.83e-8) demonstrated stronger correlations with the functional state of the disease, making them the best-correlated features in this frequency group (see Table 4).

thumbnail
Table 4. Correlations between the ALSFRS-R total score and different cough sound features.

Results are adjusted for age and gender.

https://doi.org/10.1371/journal.pone.0301734.t004

3.4 Differences between ALS patients with and without respiratory dysfunction

We identified both intensity and mixed groups of features as the most effective in distinguishing patients with respiratory dysfunction from those without (with positive associations). In terms of intensity-related features, the predictors that remained in the final model were the maximum and standard deviation of the signal as well as peak-to-peak distance, yielding a prediction ROC-AUC of 0.67 (IC 95%: 0.56–0.78).

Regarding the mixed group features, the SFS application resulted in a final model comprising maximum power, spectral entropy, and the spectral slope. The prediction ROC-AUC of this model was 0.65 (IC 95%: 0.55–0.75).

Finally, the group of frequency-related features included in the final model the spectral bandwidth and centroid, the spectral skewness, and the ZCR; and yielding a prediction ROC-AUC of 0.59 (IC 95%: 0.49–0.69). Table 5 shows all statistical values.

thumbnail
Table 5. F values from regression analyses contributing of with respiratory vs. without respiratory dysfunction classification to performance on each voice sound variable.

https://doi.org/10.1371/journal.pone.0301734.t005

3.4.1 Correlations between cough sound analysis and respiratory function assessments.

Out of the complete cohort of 60 ALS patients, 47 had respiratory function testing at the time of cough sound data recording (< 6 weeks). In this group, mean ALSFRS-R was 38, and the average FVC%, MIP%, absolute MIP, MEP%, absolute MEP, and CPF were 77% (18.33 SD), 93.6% (32.8 SD), 69.4 (28.6 SD), 79.8% (27 SD), 82.5 (39.1 SD) and 288.8 L/min (109 SD), respectively. Add one table here.

Table 6 shows that the intensity and mixed groups of features exhibited a moderate (negative) significant correlations with FVC%. However, no significant correlations were observed for CPF, MIP%, and MEP%.

thumbnail
Table 6. Correlations between the FVC (%), MIP (%), MEP (%), and CPF (L/min), and different cough sound features.

https://doi.org/10.1371/journal.pone.0301734.t006

3.5 Differences between ALS patients with and without bulbar dysfunction

When comparing ALS patients with and without bulbar dysfunction, we observed that frequency-related features were the best group at this discrimination (Table 7). The frequency features that were retained in the final model included spectral bandwidth, spectral centroid, spectral roll-off, and spectral kurtosis and skewness. Despite being the most significant, the overall ROC-AUC of the model prediction was 0.53 (IC 95%: 0.44–0.61).

thumbnail
Table 7. F values from regression analyses contributing of with bulbar vs. without bulbar dysfunction classification to performance on each voice sound variable.

https://doi.org/10.1371/journal.pone.0301734.t007

As for the intensity-related features, the predictors found in the final model included temporal centroid, maximum, mean, median, standard deviation, variance, and kurtosis of the signal. The final ROC-AUC of the model prediction for the intensity group was 0.63 (IC 95%: 0.51–0.75).

Lastly, for the features related to the mixed group, the ones that remained in the model were the area under the curve of the signal, maximum power, spectral entropy, and gender. The final ROC-AUC of the model was 0.51 (IC 95%: 0.39–0.63).

Table 8 shows the sensitivity and specificity scores of the ROC curves for each ML classification model (the results did not improve when using all the features together compared to when we grouped them into categories).

thumbnail
Table 8. Classification results based on the different groups of cough sound features.

Only the highest sensitivity and specificity scores from the best random seeds are presented.

https://doi.org/10.1371/journal.pone.0301734.t008

4. Discussion

Our study aimed to comprehensive investigate the potential of cough sound features, extracted from both the time and frequency domains, as discriminators for clinical diagnosis of ALS, and predictors of bulbar and respiratory impairments, at the convenience of using a simple smartphone. Based on our hypothesis, significant differences were observed in the frequency group of features between ALS patients and healthy controls, after adjustment for age and gender. This was also the group of features that demonstrated higher correlations with bulbar impairments. Conversely, the intensity and mixed groups of features were found to be highly correlated with the functional status of the disease and were the most significant in detecting respiratory impairments (Table 9).

thumbnail
Table 9. Summary of all correlations undertaken in this study.

The symbol ‘*’ denotes statistical significance, α = 0.05 was considered. The ‘+’ and ‘-‘ symbols indicate positive and negative associations, respectively, between related sound analysis features clinical variable. This pertains to the classification of controls vs. ALS patients; patients with bulbar symptoms vs. without; patients with respiratory symptoms vs. without.

https://doi.org/10.1371/journal.pone.0301734.t009

Firstly, changes in sound frequencies during any type of vocalization are primarily attributed to intrinsic modifications of the vocal cords. These variations in sound tone are intricately linked to the vocal cords’ dimension, tension, and/or thickness [38]. In the present work, we noticed that the disease-related ALS cough is hoarser when compared to the controls—i.e., patients’ cough depicts lower frequencies (more specifically, lower zero-crossing rates and spectral positive turning points). These results suggest that the bulbar region of the glottis in ALS patients potentially exhibits increased tension and reduced flexibility, as higher levels of tension tend to produce lower frequency sounds [3941].

Another finding, closely related to the previous, was that the cough produced by ALS patients displays greater sound entrainment, greater noise, and reduced sound occlusion when compared to controls. Occlusive sounds result from the obstruction or blockage of airflow in the vocal tract, and they are representative of functional cough sounds. In ALS, the adductor muscles of the arytenoid cartilages become dysfunctional [42], the glottis is not rapidly coordinated and fails to close effectively, leading to an abnormal compressive cough phase. Consequently, the typical peak in cough sound amplitude is not succeeded by a period of silence, but it is rather followed by an entrainment of the expiratory airflow. This was broadly represented by higher spectral dispersion and bandwidths. In fact, the cough sound properties of ALS patients resemble the characteristics of a sustainable vowel sound—a monophonic sound characterized by a continuous flow of air through the vocal cords. Moreover, as an alternative, the representation of the cough sound (presented in Fig 3) can also be explained by the varying properties of the medium through which the sound wave travels, such as different pressures and tensions, resulting in differing wave speeds and wave spread.

As a whole, the above-mentioned observations are in line with the evidence reported in cough airflow studies and cough waveforms visual analysis in patients with motor neuron diseases [43, 44]. Chaudri et al. [44] characterized the absence of distinct “peak expiratory spikes” and associated this with reduced cough strength and increased mortality. Recently, Plowman et al. [45] demonstrated that ALS patients showed lower peak expiratory flow rates and a longer time to generate maximum expiratory flow during a voluntary cough. They observed that this less efficient expulsive cough (as indexed by a lower cough volume acceleration) is predictive of poor airway safety during swallowing. Moreover, Korpáš et al. [46] have reported that, in laryngeal inflammation, the cough record consists of a large and long mono sound, where both sound intensity and duration may be increased. Thus, the cough sound in ALS may be associated with a secondary inflammation as well. All the aforementioned attributes collectively rendered this set of features superior in discriminating between cough sounds of patients and controls, resulting in a final model ROC-AUC of 0.85 (IC 95%: 0.79–0.91).

Emphasizing these primary distinctions in cough sounds between patients and controls, it is noteworthy that the observed differences between the two groups also manifested when comparing patients with and without bulbar symptoms. In this particular analysis, even though the machine-learning classifier did not exhibit exceptional robustness (with comparable results among frequency, intensity, and mixed feature groups), the most impactful features in distinguishing the two groups were maximum frequency, spectral bandwidth, and spectral centroid. Notably, these features are easily perceptible to the human ear, as healthy cough sounds typically display a clear quality, even when accounting for variations in age and gender. This enables clinicians to develop early suspicions regarding disease progression.

Furthermore, the intensity and mixed groups of features did not exhibit many significant differences between patients and healthy subjects. It is established that intense or louder sounds are related to higher air volumes in the lungs and consequently higher subglottic pressures. Despite 25 out of the 60 patients presenting respiratory dysfunction, as defined by scores less than 12 in the three respiratory-related questions of the ALSFRS- R (although many patients only presented with one less point) and a moderate to low FVC in the population, we speculate that these characteristics were insufficient to detect changes in intensity features, such as signal amplitude or peak distance, when compared to the cough sound of controls. Additionally, bulbar impairments such as a narrowed glottis are more likely to become clinically symptomatic when respiratory muscles are still strong enough to generate negative airway pressure [47]. Nonetheless, patients exhibited higher sound entropies and spectral temporal centroids, meaning that the cough sounds are more variable and difficult to predict, and the average energy of the sound, occurs later in time (also related to wave spread and power). For these reasons, the sets of intensity and mixed features exhibited the lowest level of ROC-AUC, with the latter outperforming the former (ROC-AUC of 0.62, 95% CI: 0.55–0.69 and 0.70, 95% CI: 0.64–0.76, respectively), primarily due to the presence of features associated with sound frequency.

To verify the relationships between cough sound features and the respiratory system, the same approach was utilized, first to assess correlations with variables from respiratory function tests, and second to evaluate differences in patients with and without respiratory dysfunctions.

In ALS patients, coughing is impaired during both the inspiratory and expiratory phases, with lower volumes of inspired air during a prolonged inspiratory phase and a longer time period to generate a lower peak expiratory airflow during the expulsive phase (as reported by [42]). Additionally, studies have demonstrated that the volume of air achieved at the initiation of the cough has the greatest influence on the volume expelled during cough [48]. In sound analysis, loudness and intense sounds are related to volume. This relationship further reinforces the significance of the intensity-based cough sound features as the most reliable indicators of respiratory impairment in patients. Furthermore, the cough sound pattern exhibited by ALS patients is consistent with that observed in patients with restrictive respiratory disease, characterized by reduced lung elasticity or limitations in chest wall expansion [16]. In these patients, there is a gradual reduction in the intensity of cough attempts over time, leading to a negative slope of the signal amplitude (Fig 4). This is in contrast to obstructive respiratory diseases, where such a phenomenon is not observed. In the machine learning analysis, the model that demonstrated the highest ROC-AUC (although also not particularly robust overall), in distinguishing between patients with and without respiratory symptoms was the one trained with intensity-related features (0.67; IC 95%: 0.56–0.78). However, and despite these findings, the exact role of different respiratory muscles and their association with these cough sound features remains unclear. To understand this relation, we performed linear regressions between the cough sound features and FVC% values. Sharan et al. [12], demonstrated the potential for cough sound analysis to predict spirometry results in patients with different respiratory diseases. In this work, the intensity and mixed group of features, specifically the temporal centroid and absolute energy, exhibited stronger correlations with FVC%. These findings provide support for the association between sound energy, intensity, and lung function. Notably, the correlations between FVC% and energy features are negative. This finding may indicate that patients with respiratory dysfunctions often experienced increased efforts to move air in and out of the lungs and even that, as it becomes difficult to fully exhale air, leading to air trapping, the trapped air during the subsequent cough bouts has contributed to higher sound energies. It was also anticipated that stronger correlations would be observed between cough sound features and MEP%, in comparison to MIP%, given that pulmonary exhalation is the primary source of energy for sound production.

MEP represents the highest achievable pressure during forceful expiration against a closed airway and indicates the strength of the abdominal muscles and other expiratory muscles. Conversely, MIP assesses the strength of inspiratory muscles, primarily the diaphragm, and enables the evaluation of ventilatory insufficiency. Although only the spectral positive turning points showed a significant correlation with MEP%, sound energy exhibited the potential to serve as a valuable distinguishing feature as well. Moreover, no significant associations were observed between cough sound features and CPF. In this study, cough was deemed efficient with an average CPF exceeding 270 L/min [49] (specifically 288 L/min). However, variations in cough sounds were still noted across different disease states. This observation could be due to 1) methodological and technical issues (e.g., CPF involves coughing forcefully into a face mask connected to a small peak flow meter, whereas we captured cough sounds from a distance of 20–25 cm and laterally from the mouth) and 2) possibly different physiological properties of the cough. For instance, although necessary, having volume of air on its own is not sufficient to produce sound. Furthermore, regarding the first point, Recasens et al. [50] aimed to evaluate the reliability of a smartphone app for estimating CPF based on cough sounds in patients with neuromuscular disorders (including 26 ALS patients). They found that the app demonstrated high sensitivity and specificity in detecting abnormal CPF (<270 L/min), when compared to the clinical metric. However, they noted that technical factors, such as the distance (specifically 30 cm) between the recording device and the patient, can affect the reliability of the results. To achieve consistent and robust correlations between CPF and cough sounds, the smartphone’s microphone may need to be positioned closer to the patient. This emphasizes the reason why we considered both CPF and sound analysis independently, but further studies should explore these potential correlations.

One last important aspect to consider is the impact of age and gender on sound features. For instance, the fundamental frequency of the human voice, determined by the resting state of the vocal cords, varies significantly: males have an average frequency of 120 Hz, females 200 Hz, and children 270 Hz [37]. This is relevant as our bulbar and respiratory patient groups included significantly more women. However, while we adjusted the results for age and gender (in both classifiers and multiple regressions), we speculate that this impact is more pronounced in speech than in cough sounds, especially as cough sounds are much shorter in duration, with also different physiological mechanisms.

Some other limitations of this study must be acknowledged. Specifically, voluntary coughs bypass the sensory system and previous research has demonstrated that maximum voluntary cough function tends to overestimate reflexive cough function among healthy volunteers [48, 51]. Moreover, the current study includes patients with mild-moderate disease severity. As a result, the generalization of these findings to airway defense in the event of aspiration as well as to individuals in a more advanced disease state may be limited. Further, given the clinical heterogeneity of ALS, it would be beneficial to document upper versus lower motor neuron involvement, and slow versus fast progress to develop more homogenous groups for comparison. It is also possible that more appropriate features (as well as other machine learning models) may be extracted from the data, even when features that do not contribute to the model prediction ROC-AUC were eliminated. Future research should further explore related topics to enhance our understanding of sound analysis and smartphone use. This includes investigating the relationship between cough peak flow and sound analysis with other clinical outcomes like dysphagia and respiratory infection rates. Conducting longitudinal cough sound analyses, recording sounds in a lying position, correlating clinically with phrenic nerve conduction and cervical muscles strength, and adjusting findings for other motor neuron diseases are promising perspectives that could clarify the results presented in this work. Lastly, comparing the use of these methods by doctors, caregivers, and patients themselves should also be explored.

5. Conclusion

The present study demonstrates that analyzing cough sounds can serve as a valuable technique for evaluating and monitoring ALS patients, particularly those with respiratory and bulbar impairments. However, it is important to note that cough sound analysis should not be the only indicator utilized to evaluate respiratory and bulbar health, as ALS is a multifaceted and intricate disease. Rather, it can be used as an adjunct measure, supplementing commonly used ways of disease progression. It is also noteworthy that the method used in this study was a convenient smartphone-based approach, which facilitates data collection in home-based settings without requiring specialized careers or equipment.

Supporting information

S1 File. Dataset encompassing all the information used for comparing a cough sound associated with ALS and a cough sound from a healthy control.

https://doi.org/10.1371/journal.pone.0301734.s001

(CSV)

S2 File. Dataset encompassing all the information used for comparing cough sound features and clinical variables from ALS patients.

https://doi.org/10.1371/journal.pone.0301734.s002

(CSV)

References

  1. 1. Hardiman O, Al-Chalabi A, Chio A, Corr EM, Logroscino G, Robberecht W, et al. Amyotrophic lateral sclerosis. Vol. 3, Nature Reviews Disease Primers. Nature Publishing Group; 2017.
  2. 2. De Carvalho M, Swash M, Pinto S. Diaphragmatic neurophysiology and respiratory markers in ALS. Vol. 10, Frontiers in Neurology. Frontiers Media S.A.; 2019.
  3. 3. Brown RH, Al-Chalabi A. Amyotrophic Lateral Sclerosis. New England Journal of Medicine. 2017 Jul 13;377(2):162–72. pmid:28700839
  4. 4. Gordon PH, Cheng B, Katz IB, Pinto M, Hays AP, Mitsumoto H, et al. The natural history of primary lateral sclerosis. Neurology. 2006;66(5):647–53. pmid:16534101
  5. 5. Wales S, C Kiernan DSc AM, Cheah MBiostat BC, Burrell MBBS J, Zoing BNurs MC, Kiernan MC, et al. Seminar Amyotrophic lateral sclerosis. Lancet [Internet]. 2011;377:942–55. Available from: www.thelancet.com
  6. 6. Swinnen B, Robberecht W. The phenotypic variability of amyotrophic lateral sclerosis. Vol. 10, Nature Reviews Neurology. Nature Publishing Group; 2014. p. 661–70.
  7. 7. Gromicho M, Figueiral M, Uysal H, Grosskreutz J, Kuzma-Kozakiewicz M, Pinto S, et al. Spreading in ALS: The relative impact of upper and lower motor neuron involvement. Ann Clin Transl Neurol. 2020 Jul 1;7(7):1181–92. pmid:32558369
  8. 8. Darrell Hulisz. Am J Manag Care. 2018. Amyotrophic Lateral Sclerosis: Disease State Overview.
  9. 9. Pinto S, Gromicho M, Oliveira Santos MO, Swash M, De Carvalho M. Respiratory onset in amyotrophic lateral sclerosis: clinical features and spreading pattern. Amyotroph Lateral Scler Frontotemporal Degener. 2023 Jan 2;24(1–2):40–4. pmid:35510537
  10. 10. Kaufmann P, Levy G, Thompson J, DelBene M, Battista V, Gordon P, et al. The ALSFRSr predicts survival time in an ALS clinic population. 2005.
  11. 11. Pinto S, de Carvalho M. Comparison of slow and forced vital capacities on ability to predict survival in ALS. Amyotroph Lateral Scler Frontotemporal Degener. 2017 Oct 2;18(7–8):528–33. pmid:28741375
  12. 12. Sharan R V., Abeyratne UR, Swarnkar VR, Claxton S, Hukins C, Porter P. Predicting spirometry readings using cough sound features and regression. Physiol Meas. 2018 Sep 5;39(9). pmid:30091716
  13. 13. Pattee GL, Plowman EK, (Focht) Garand KL, Costello J, Brooks BR, Berry JD, et al. Provisional best practices guidelines for the evaluation of bulbar dysfunction in amyotrophic lateral sclerosis. Muscle Nerve. 2019 May 1;59(5):531–6. pmid:30620104
  14. 14. Chung Y, Jin J, Jo HI, Lee H, Kim SH, Chung SJ, et al. Diagnosis of pneumonia by cough sounds analyzed with statistical features and ai. Sensors. 2021 Nov 1;21(21). pmid:34770341
  15. 15. Kosasih K, Abeyratne UR, Swarnkar V, Triasih R. Wavelet Augmented Cough Analysis for Rapid Childhood Pneumonia Diagnosis. IEEE Trans Biomed Eng. 2015 Apr 1;62(4):1185–94. pmid:25532164
  16. 16. Rudraraju G, Palreddy SD, Mamidgi B, Sripada NR, Sai YP, Vodnala NK, et al. Cough sound analysis and objective correlation with spirometry and clinical diagnosis. Inform Med Unlocked. 2020 Jan 1;19.
  17. 17. Sharan R V., Abeyratne UR, Swarnkar VR, Porter P. Automatic croup diagnosis using cough sound recognition. IEEE Trans Biomed Eng. 2019 Feb 1;66(2):485–95. pmid:29993458
  18. 18. Toop LJ, Thorpe CW, Frightt R. Cough Sound Analysis: A New Tool for the Diagnosis of Asthma? [Internet]. Vol. 6, Family Practice ©Oxford University Press. 1989. http://fampra.oxfordjournals.org/
  19. 19. Lee KK, Davenport PW, Smith JA, Irwin RS, McGarvey L, Mazzone SB, et al. Global Physiology and Pathophysiology of Cough: Part 1: Cough Phenomenology—CHEST Guideline and Expert Panel Report. Vol. 159, Chest. Elsevier Inc.; 2021. p. 282–93.
  20. 20. McGarvey L, Rubin BK, Ebihara S, Hegland K, Rivet A, Irwin RS, et al. Global Physiology and Pathophysiology of Cough: Part 2. Demographic and Clinical Considerations: CHEST Expert Panel Report. Chest. 2021 Oct 1;160(4):1413–23.
  21. 21. Chatwin M, Simonds AK. Long-term mechanical insufflation-exsufflation cough assistance in neuromuscular disease: Patterns of use and lessons for application. Respir Care. 2020 Feb 1;65(2):135–43. pmid:31690614
  22. 22. Plowman EK, Tabor-Gray L, Rosado KM, Vasilopoulos T, Robison R, Chapin JL, et al. Impact of expiratory strength training in amyotrophic lateral sclerosis: Results of a randomized, sham-controlled trial. Muscle Nerve. 2019 Jan 1;59(1):40–6. pmid:29981250
  23. 23. Berry JD, Paganoni S, Carlson K, Burke K, Weber H, Staples P, et al. Design and results of a smartphone-based digital phenotyping study to quantify ALS progression. Ann Clin Transl Neurol. 2019 May 1;6(5):873–81. pmid:31139685
  24. 24. van Eijk RPA, Bakers JNE, Bunte TM, de Fockert AJ, Eijkemans MJC, van den Berg LH. Accelerometry for remote monitoring of physical activity in amyotrophic lateral sclerosis: a longitudinal cohort study. J Neurol. 2019 Oct 1;266(10):2387–95. pmid:31187191
  25. 25. Garcia-Gancedo L, Kelly ML, Lavrov A, Parr J, Hart R, Marsden R, et al. Objectively monitoring amyotrophic lateral sclerosis patient symptoms during clinical trials with sensors: Observational study. JMIR Mhealth Uhealth. 2019;7(12). pmid:31859676
  26. 26. Haulman A, Geronimo A, Chahwala A, Simmons Z. The Use of Telehealth to Enhance Care in ALS and other Neuromuscular Disorders. Vol. 61, Muscle and Nerve. John Wiley and Sons Inc.; 2020. p. 682–91.
  27. 27. Stegmann GM, Hahn S, Duncan CJ, Rutkove SB, Liss J, Shefner JM, et al. Estimation of forced vital capacity using speech acoustics in patients with ALS. Amyotroph Lateral Scler Frontotemporal Degener. 2021;22(S1):14–21. pmid:34348537
  28. 28. Vashkevich M, Rushkevich Y. Classification of ALS patients based on acoustic analysis of sustained vowel phonations. Biomed Signal Process Control. 2021 Mar 1;65.
  29. 29. Cebola R, Folgado D, Carreiro A, Gamboa H. Speech-Based Supervised Learning Towards the Diagnosis of Amyotrophic Lateral Sclerosis. In INSTICC; 2023. p. 74–85.
  30. 30. Shefner JM, Al-Chalabi A, Baker MR, Cui LY, de Carvalho M, Eisen A, et al. A proposal for new diagnostic criteria for ALS. Vol. 131, Clinical Neurophysiology. Elsevier Ireland Ltd; 2020. p. 1975–8.
  31. 31. Mamede de Carvalho, Reinhard Dengler, Andrew Eisen, John D England, Ryuji Kaji, Jun Kimura, et al. Electrodiagnostic criteria for diagnosis of ALS. Review Clin Neurophysiol. 2008;
  32. 32. Cedarbaum JM, Stambler N, Malta E, Fuller C, Hilt D. The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function [Internet]. Vol. 169, Journal of the Neurological Sciences. 1999. www.elsevier.com/locate/jns
  33. 33. McFee B, McVicar M, Faronbi D, Roman I, Gover M, Balke S, et al. librosa/librosa: 0.10.0.post2. 2023 Mar 17 [cited 2023 Apr 24]; https://zenodo.org/record/7746972
  34. 34. Barandas M, Folgado D, Fernandes L, Santos S, Abreu M, Bota P, et al. TSFEL: Time Series Feature Extraction Library. SoftwareX. 2020 Jan 1;11.
  35. 35. Abaza AA, Day JB, Reynolds JS, Mahmoud AM, Goldsmith WT, McKinney WG, et al. Classification of voluntary cough sound and airflow patterns for detecting abnormal pulmonary function. Cough. 2009;5(1). pmid:19930559
  36. 36. Nemati E, Rahman J, Blackstock E, Nathan V, Rahman M, Vatanparvar K, et al. Estimation of the Lung Function Using Acoustic Features of the Voluntary Cough*. 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). 2020.
  37. 37. Cooper BG, Stocks J, Hall GL, Culver B, Steenbruggen I, Carter KW, et al. The global lung function initiative (GLI) network: Bringing the world’s respiratory reference values together. Breathe. 2017 Sep 1;13(3):e56–64. pmid:28955406
  38. 38. Stranding Susan, Borley Neil R, Collins Patricia, Crossman Alan R, Gatzoulis Michael A, Healy Jeremiah C, et al. Gray’s: Atlas de anatomia. 40th ed. Elsevier; 2010. 592 p.
  39. 39. Fukae J, Kubo SI, Hattori N, Komatsu K, Kato M, Aoki M, et al. Hoarseness due to bilateral vocal cord paralysis as an initial manifestation of familial amyotrophic lateral sclerosis. Amyotrophic Lateral Sclerosis and Other Motor Neuron Disorders. 2005 Jun;6(2):122–4. pmid:16036438
  40. 40. Van Der Graaff MM, Grolman W, Westermann EJ, Boogaardt HC, Koelman H, Anneke;, et al. Vocal Cord Dysfunction in Amyotrophic Lateral Sclerosis Four Cases and a Review of the Literature [Internet]. Vol. 66, Arch Neurol. 2009. http://archneur.jamanetwork.com/
  41. 41. Hillel A, Dray T, Miller R, Yorkston K, Konikow N, Strande E, et al. Presentation of ALS to the otolaryngologist/head and neck surgeon: getting to the neurologist. Neurology. 1999;
  42. 42. Tabor-Gray LC, Gallestagui A, Vasilopoulos T, Plowman EK. Characteristics of impaired voluntary cough function in individuals with amyotrophic lateral sclerosis. Amyotroph Lateral Scler Frontotemporal Degener. 2019 Jan 2;20(1–2):37–42. pmid:30652513
  43. 43. Tabor-Gray L, Vasilopoulos T, Plowman EK. Concordant Validity of a Digital Peak Cough Flow Meter to Assess Voluntary Cough Strength in Individuals with ALS. Dysphagia. 2020 Aug 1;35(4):568–73. pmid:31493068
  44. 44. Chaudri MB, Liu C, Hubbard R, Jefferson D, Kinnear WJ. Relationship between supramaximal flow during cough and mortality in motor neurone disease. European Respiratory Journal. 2002;19(3):434–8. pmid:11936519
  45. 45. Plowman EK, Watts SA, Robison R, Tabor L, Dion C, Gaziano J, et al. Voluntary Cough Airflow Differentiates Safe Versus Unsafe Swallowing in Amyotrophic Lateral Sclerosis. Dysphagia. 2016 Jun 1;31(3):383–90. pmid:26803772
  46. 46. Korpáš J, Sadloň Ová J, Vrabec M. Analysis of the Cough Sound: an Overview. Vol. 9, Pulmonary Pharmacology. 1996.
  47. 47. Hillel AD, Miller R. Bulbar amyotrophic lateral sclerosis: patterns of progression and management. Head Neck. 1989;
  48. 48. Tabor-Gray L, Vasilopoulos T, Plowman EK. Differences in voluntary and reflexive cough strength in individuals with amyotrophic lateral sclerosis and healthy adults. Muscle Nerve. 2020 Nov 1;62(5):597–600. pmid:32776561
  49. 49. Sancho J, Servera E, Bañuls P, Marín J. Effectiveness of assisted and unassisted cough capacity in amyotrophic lateral sclerosis patients. Amyotroph Lateral Scler Frontotemporal Degener. 2017 Oct 2;18(7–8):498–504. pmid:28597697
  50. 50. Recasens BB, Balañá Corberó A, Llorens JMM, Guillen-Sola A, Moreno MV, Escobar GG, et al. Sound-based cough peak flow estimation in patients with neuromuscular disorders. Muscle Nerve. 2024 Feb 1;69(2):213–7. pmid:37860934
  51. 51. Brandimore AE, Troche MS, Huber JE, Hegland KW. Respiratory kinematic and airflow differences between reflex and voluntary cough in healthy young adults. Front Physiol. 2015;6(OCT). pmid:26500560