Deep learning-based differential gut flora for prediction of Parkinson’s

Bo Yu; Hang Zhang; Min Zhang

doi:10.1371/journal.pone.0310005

Abstract

Background

There had been extensive research on the role of the gut microbiota in human health and disease. Increasing evidence suggested that the gut-brain axis played a crucial role in Parkinson’s disease, with changes in the gut microbiota speculated to be involved in the pathogenesis of Parkinson’s disease or interfere with its treatment. However, studies utilizing deep learning methods to predict Parkinson’s disease through the gut microbiota were still limited. Therefore, the goal of this study was to develop an efficient and accurate prediction method based on deep learning by thoroughly analyzing gut microbiota data to achieve the diagnosis of Parkinson’s disease.

Methods

This study proposed a method for predicting Parkinson’s disease using differential gut microbiota, named the Parkinson Gut Prediction Method (PGPM). Initially, differential gut microbiota data were extracted from 39 Parkinson’s disease (PD) patients and their corresponding 39 healthy spouses. Subsequently, a preprocessing method called CRFS (combined ranking using random forest scores and principal component analysis contributions) was introduced for feature selection. Following this, the proposed LSIM (LSTM-penultimate to SVM Input Method) approach was utilized for classifying Parkinson’s patients. Finally, a soft voting mechanism was employed to predict Parkinson’s disease patients.

Results

The research results demonstrated that the Parkinson gut prediction method (PGPM), which utilized differential gut microbiota, performed excellently. The method achieved a mean accuracy (ACC) of 0.85, an area under the curve (AUC) of 0.92, and a receiver operating characteristic (ROC) score of 0.92.

Conclusion

In summary, this method demonstrated excellent performance in predicting Parkinson’s disease, allowing for more accurate predictions of Parkinson’s disease.

Citation: Yu B, Zhang H, Zhang M (2025) Deep learning-based differential gut flora for prediction of Parkinson’s. PLoS ONE 20(1): e0310005. https://doi.org/10.1371/journal.pone.0310005

Editor: Upaka Rathnayake, Atlantic Technological University, IRELAND

Received: March 28, 2024; Accepted: August 23, 2024; Published: January 7, 2025

Copyright: © 2025 Yu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the manuscript and its Supporting Information files.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

1. Contexts

Parkinson’s disease [1] is a common multifunctional dysfunction and neurodegenerative disorder among elderly people, and its prevalence is second only to that of Alzheimer’s disease [2]. With the increase in population size and the intensification of aging trends, the burden of Parkinson’s disease on society and individual health will continue to increase. According to research predictions, by 2040, the global number of diagnosed cases of Parkinson’s disease [3] will exceed 10 million.

Research indicates that the gut microbiota interacts with the autonomic and central nervous systems through various pathways, and dysbiosis of the gut microbiota may affect both the enteric nervous system and the central nervous system. Previous studies have revealed the existence of the brain-gut-microbiota axis, where bidirectional interactions between the gut microbiota and the human nervous system could lead to central nervous system diseases. The gut microbiota, also known as the "second brain," can influence brain activity under both physiological and pathological conditions through the gut-microbiota-brain axis. Changes in the gut microbiota have been linked to several psychiatric and neurological diseases, including schizophrenia [4], depression [5,6], and autism [7]. Recently, numerous studies have shown significant differences in the composition of the gut microbiota between Parkinson’s disease patients and healthy controls, with metagenomic [8] studies further revealing the correlation between Parkinson’s disease and abnormalities in the gut microbiome. However, research on the use of the gut microbiota as a predictive tool for Parkinson’s disease is still relatively scarce, so exploring such a method is highly important. The aim of this study is therefore to develop an efficient and accurate method for predicting Parkinson’s disease based on the gut microbiota, in support of its diagnosis. By incorporating deep learning technology, we aim to capture subtle differences in the gut microbiota, providing new perspectives and tools for predicting Parkinson’s disease and offering scientific support for its diagnosis.

The diagnosis of Parkinson’s disease relies on core clinical features and follows standard clinical criteria to improve accuracy. For example, the UK Parkinson’s Disease Society Brain Bank (UKPDSBB) has established comprehensive standards, including criteria such as bradykinesia and exclusion of other potential causes. However, these standards still have limitations and rely on the expertise of neurologists. With the development of artificial intelligence and the increasing demand for healthcare, AI-based methods have been applied to the automated diagnosis of Parkinson’s disease. Common methods, such as EEG [9], gait analysis [10], voice analysis, and brain imaging, use biomarkers of Parkinson’s disease for automated detection. Traditional machine learning models need to extract features from biomarkers and select significant features for model training. Although AI-based methods have potential in the automated diagnosis of Parkinson’s disease, they have limitations. These methods may be constrained by technical limitations and challenges in data collection during practical applications. Additionally, the accuracy and reliability of biomarkers still have certain limitations. Furthermore, individual differences and the complexity of cases may affect the applicability and generalizability of the models. Moreover, these methods are typically used as auxiliary diagnostic tools and still require the professional judgment and clinical experience of doctors. It is worth noting that there is relatively limited research on the use of the gut microbiota to predict PD. Therefore, this study utilized gut microbiota prediction combined with artificial intelligence methods to predict Parkinson’s disease.

In this article, a Parkinson’s disease prediction method based on differential gut microbiota, the Parkinson Gut Prediction Method (PGPM), is proposed to predict Parkinson’s disease more accurately. First, the PGPM method introduces the CRFS preprocessing method for feature selection, reducing the dimensionality of the features; second, PGPM combines LSTM and SVM rather than relying on an individual classifier, improving prediction accuracy; and finally, the prediction result is obtained through soft voting. Under 10-fold cross-validation, PGPM achieves mean ACC, AUC, and ROC values of 0.85, 0.92, and 0.92, respectively, which are significantly higher than those of existing methods.

2. Materials and methods

2.1 Microbiota datasets

The data for this study were obtained from a cross-sectional study of the gut microbiota of Parkinson’s disease patients in the Central China region [11]. The dataset included 39 Parkinson’s disease patients (PD) with a BMI of 23.15 kg/m2 and their healthy spouses (SP) with a BMI of 24.22 kg/m2. The diagnosis of Parkinson’s disease was based on the 2015 Movement Disorder Society Parkinson’s diagnostic criteria, with the core criterion being the presence of Parkinsonian symptoms. If a patient exhibited bradykinesia along with either resting tremor or rigidity, they were considered to have Parkinson’s syndrome.

2.2 Transcriptome sequencing

The data were collected by sampling the subjects’ feces, which were then stored at -80°C. DNA was extracted from the feces using the MetaHIT protocol, and the DNA concentration was estimated using a Qubit instrument. After DNA extraction, gene libraries were prepared according to the manufacturer’s instructions and sequenced. The raw sequencing data have been deposited under the accession number PRJNA588035. The quality of the raw metagenomic data was checked using the FastQC tool, followed by trimming low-quality data and removing unwanted genomes. Subsequently, taxonomic analysis was performed, and the read abundance was estimated after processing. The relative abundance was calculated by multiplying the sequence count and rounding the result.

2.3 Overall framework of the forecasting methodology

In this study, a method for predicting Parkinson’s disease patients using differential microbiota was implemented. Building upon previous research, improvements were made in data preprocessing, specifically in feature selection and dimensionality reduction, and a method combining neural networks and machine learning was developed. The overall framework of the PGPM method constructed in this article is illustrated in Fig 1. It consists of three modules: the CR layer (CRFS Preprocessing Layer), the LS layer (LSTM-SVM Layer), and the OP layer (Output Layer). The CR layer was responsible for the initial processing and selection of the raw Parkinson’s gut microbiota data to meet the network input requirements. The LS layer used LSTM and SVM, as shown in Fig 1, to construct the network, while the OP layer provided the Parkinson’s prediction results through soft voting. The PGPM network was trained with the Adam optimization algorithm. Unlike traditional methods that train a single classifier, the PGPM method improved model performance by combining classifiers.

Fig 1. PGPM framework diagram.

https://doi.org/10.1371/journal.pone.0310005.g001
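The soft voting step of the OP layer can be illustrated with a small sketch. Soft voting averages the class probabilities produced by several classifiers and picks the class with the highest mean probability. The example below is only an illustration under stated assumptions: it assumes two probability sources (an LSTM softmax output and an SVM probability output) with equal weights, and the function name `soft_vote` and all numbers are hypothetical, not taken from the study.

```python
import numpy as np

def soft_vote(prob_list, weights=None):
    """Average class-probability arrays of shape (n_samples, n_classes)."""
    probs = np.stack(prob_list, axis=0)            # (n_models, n_samples, n_classes)
    if weights is None:
        weights = np.ones(len(prob_list))          # equal weights (an assumption)
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()
    mean_probs = np.tensordot(weights, probs, axes=1)   # (n_samples, n_classes)
    return mean_probs, mean_probs.argmax(axis=1)

# Hypothetical probabilities for three samples; column 1 is the "PD" class.
lstm_probs = np.array([[0.30, 0.70], [0.60, 0.40], [0.55, 0.45]])
svm_probs = np.array([[0.20, 0.80], [0.70, 0.30], [0.25, 0.75]])

mean_probs, labels = soft_vote([lstm_probs, svm_probs])
print(mean_probs)
print(labels)
```

In the third sample the two classifiers disagree; the averaged probability decides the label, which is the practical difference from majority (hard) voting.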

2.4 CRFS preprocessing methods

Previous studies often used a single feature selection method. This approach could simplify the features and, by reducing model complexity, improve model performance to some extent. To enhance the reliability of the selection, the PGPM introduced the CRFS data preprocessing method, as illustrated in Fig 2. Unlike previous research, CRFS combined the advantages of two methods, random forest (RF) [12] and principal component analysis (PCA) [13], for feature dimensionality reduction.

Fig 2. Diagram of the CRFS preprocessing process.

https://doi.org/10.1371/journal.pone.0310005.g002

Parkinson’s gut microbiome data typically contained multiple variables, i.e., different types of microbial populations. One of the advantages of random forest was that it could estimate the importance of each feature and identify the most important features during classification. By selecting important microbes as inputs, the model complexity could be simplified, computational efficiency could be improved, and overfitting risk could be reduced [14].

On the other hand, principal component analysis (PCA) could be used for dimensionality reduction by explaining most of the variance in the variables with a few principal components. This helped to better understand the data and extract the most informative microbial populations. In PCA, covariance played a crucial role. By calculating the covariance matrix between microbial variables, their relationships and correlations could be understood. The covariance matrix represented how different microbial populations increased or decreased together. If two microbial populations had high positive covariance, they had similar patterns of variation across the samples; conversely, high negative covariance indicated opposite trends in variation. By ordering the eigenvectors of the covariance matrix according to the variance they explained and selecting the top principal components, most of the variance in the data could be explained, achieving dimensionality reduction and retaining the most informative microbial populations. In general, feature extraction transformed the feature space through the relationships between attributes and mapped the original feature space to a lower-dimensional one; PCA achieved this by exploiting the variance (inertia) shared among the dimensions of the dataset.
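To make the role of the covariance matrix concrete, the following sketch carries out PCA by hand with NumPy on a small synthetic abundance matrix. It is an illustration only: the data are random, the matrix size is arbitrary, and the sample covariance (denominator n − 1) is an assumption about the convention used.

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical abundance matrix: 30 samples (rows) x 4 taxa (columns).
X = rng.random((30, 4))
X[:, 1] = 2.0 * X[:, 0] + 0.05 * rng.random(30)    # taxon 1 co-varies with taxon 0

Xc = X - X.mean(axis=0)                             # centre each taxon
cov = Xc.T @ Xc / (X.shape[0] - 1)                  # sample covariance matrix

eigvals, eigvecs = np.linalg.eigh(cov)              # eigh returns ascending eigenvalues
order = np.argsort(eigvals)[::-1]                   # reorder by explained variance
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

explained_ratio = eigvals / eigvals.sum()           # variance share of each component
scores = Xc @ eigvecs[:, :2]                        # project onto the first two components

print(np.round(cov, 3))
print(np.round(explained_ratio, 3))
```

The positive entry `cov[0, 1]` reflects the two taxa that rise and fall together, and most of the variance ends up in the first component.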

The preprocessing method for feature selection in CRFS involves the following steps:

Step 1: Extract the differential gut microbiota associated with Parkinson’s disease at the species level. In the original data each column represents a sample, so the matrix is transposed: after this transformation each row represents a sample and each column corresponds to one differential gut microbe.

Step 2: Feature selection is performed on the transposed data using two methods, random forest (RF) and principal component analysis (PCA). The differential gut microbiota are ranked by random forest importance score and the top 20 are retained; they are then ranked by PCA contribution and the top 20 are again retained. The microbiota that appear in both top-20 lists are chosen as input features. The covariance used by PCA is calculated as shown in Eq (2–1), where $x_i$ and $y_i$ are the abundances of two microbes in sample $i$, $\bar{x}$ and $\bar{y}$ are their means, and $n$ is the number of samples:

$$\operatorname{cov}(X, Y) = \frac{1}{n-1}\sum_{i=1}^{n}\left(x_i - \bar{x}\right)\left(y_i - \bar{y}\right) \tag{2–1}$$

Step 3: Extract the corresponding data of the common features from the top 20 features sorted by both methods. The highlighted green portion in Fig 2 represents the identical features.

Step 4: Normalize the extracted data. As the species abundance of the gut microbiota is purely numerical, if the abundance of a certain microorganism is too large, it may lead to an overly significant weight for that microorganism. Therefore, after feature selection and dimensionality reduction, the abundance of each microorganism was normalized to ensure equal weight for each microorganism during the training process, thus ensuring the model’s accuracy. The normalization is calculated with Formula (2–2), where x represents the original data, Min represents the minimum value of the data, Max represents the maximum value of the data, and x′ represents the transformed data:

$$x' = \frac{x - \mathrm{Min}}{\mathrm{Max} - \mathrm{Min}} \tag{2–2}$$

Step 5: Append the corresponding disease status label to the extracted data of each sample.
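The five steps above can be sketched with scikit-learn as follows. This is a hedged illustration rather than the authors' implementation: the function name `crfs_select`, the number of trees, the synthetic data, and in particular the definition of the PCA "contribution" of a feature (squared loadings weighted by explained variance ratio) are assumptions, since the text does not specify them.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier

def crfs_select(X, y, feature_names, top_k=20, random_state=0):
    """Keep features ranked in the top_k by both RF importance and PCA contribution."""
    rf = RandomForestClassifier(n_estimators=200, random_state=random_state).fit(X, y)
    rf_rank = np.argsort(rf.feature_importances_)[::-1][:top_k]

    pca = PCA().fit(X)
    # Assumed contribution score: squared loadings weighted by explained variance ratio.
    contrib = (pca.components_ ** 2 * pca.explained_variance_ratio_[:, None]).sum(axis=0)
    pca_rank = np.argsort(contrib)[::-1][:top_k]

    shared = sorted(set(rf_rank) & set(pca_rank))          # Step 3: common features
    X_sel = X[:, shared]
    x_min, x_max = X_sel.min(axis=0), X_sel.max(axis=0)
    span = np.where(x_max > x_min, x_max - x_min, 1.0)     # guard against constant columns
    X_norm = (X_sel - x_min) / span                        # Step 4: min-max normalization
    return [feature_names[i] for i in shared], X_norm

# Synthetic stand-in: 60 taxa in rows, 78 samples in columns, as in the raw layout.
rng = np.random.default_rng(0)
raw = rng.random((60, 78))
y = np.array([1] * 39 + [0] * 39)                          # 1 = PD, 0 = spouse
raw[:5, :39] += 3.0                                        # five taxa enriched in the PD half
X = raw.T                                                  # Step 1: one row per sample
names = [f"taxon_{i}" for i in range(60)]

selected, X_norm = crfs_select(X, y, names)
dataset = np.column_stack([X_norm, y])                     # Step 5: append labels
print(selected)
```

Taking the intersection of two rankings is stricter than either ranking alone, which is why the number of retained features can be much smaller than 20.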

2.5 LSIM

In this study, the classifier of the PGPM method was based on a combined LSTM-SVM classification strategy [15]. The structure of the LSIM [16] classifier model is illustrated in Fig 3. The two methods were integrated to leverage the ability of LSTM neural networks to store long-term information and the generalization and accuracy of SVM in classification problems. The LSIM method used SVM as the classifier: the output of the second-to-last layer of the LSTM was taken as the input feature vector for the SVM, and the SVM was then trained on these feature vectors. In other words, features were extracted with the LSTM and then classified with the SVM. The combination of LSTM and SVM not only enhanced the precision and effectiveness of feature extraction but also improved the accuracy of the classification results.

Fig 3. LSIM structure.

https://doi.org/10.1371/journal.pone.0310005.g003

The Support Vector Machine (SVM) was a classic machine learning method commonly used for binary classification tasks. Its principle involved constructing an optimal decision hyperplane to separate data samples of different classes. New input data were classified according to the side of the hyperplane on which they fell. In this study, the SVM used the Radial Basis Function (RBF) kernel, one of the most commonly used kernel functions. It measured the similarity of two sample points from the Euclidean distance between them, for example between a sample point and a support vector. The role of the kernel function in the SVM model was to introduce a nonlinear transformation that mapped the data from the input space to a higher-dimensional feature space, making the data more easily separable in the new feature space. The RBF kernel function is given in Formula (2–3), where $x_i$ and $x_j$ are two sample vectors:

$$K(x_i, x_j) = \exp\left(-\gamma \left\lVert x_i - x_j \right\rVert^2\right) \tag{2–3}$$

γ was a parameter of the RBF kernel function that controlled how quickly the similarity between samples decayed with distance: with a larger γ, the similarity between distant samples decreased faster, and with a smaller γ it decreased more slowly. Choosing a suitable γ value was therefore highly important for SVM performance and classification results; too large a γ could lead to overfitting and too small a γ to underfitting. In this study, γ was not fixed by hand but was set automatically by the framework from the input data, as shown in Formula (2–4):

$$\gamma = \frac{1}{\text{n\_features} \times \text{X.var()}} \tag{2–4}$$

Here n_features denoted the number of features, and X.var() denoted the variance of the input data X. In this way, γ was adjusted automatically according to the scale of the input data so as to better fit the data.
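As a brief numerical check of the kernel and the data-dependent γ, the sketch below computes γ as 1 / (n_features × X.var()), which is the rule scikit-learn applies when an SVC is created with `gamma="scale"`, and then evaluates the RBF kernel by hand and with `sklearn.metrics.pairwise.rbf_kernel`. The data are random and the matrix size (10 samples, 8 features) is only an assumption for illustration.

```python
import numpy as np
from sklearn.metrics.pairwise import rbf_kernel

rng = np.random.default_rng(0)
X = rng.random((10, 8))                      # hypothetical: 10 samples, 8 selected taxa

# gamma = 1 / (n_features * X.var()); X.var() is the variance over all entries of X
gamma = 1.0 / (X.shape[1] * X.var())

# RBF kernel by hand: K(x_i, x_j) = exp(-gamma * ||x_i - x_j||^2)
sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
K_manual = np.exp(-gamma * sq_dists)

# Same kernel through scikit-learn
K_sklearn = rbf_kernel(X, X, gamma=gamma)

print(round(gamma, 4))
print(np.allclose(K_manual, K_sklearn))
```

Each sample has similarity 1 with itself, and the similarity falls towards 0 as the squared distance grows, at a rate set by γ.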

However, in some tasks, the "sparse" and "discrete" features in the input data made it difficult to detect relationships between data points, which were often crucial for determining the overall relationships in the input. In contrast, Long Short-Term Memory (LSTM) networks could capture dependencies in input information, and were particularly suitable for handling sequential data. LSTMs excelled at handling long-term dependencies and temporal relationships within sequences.

LSTM was a special type of RNN. LSTM introduced the concepts of memory cells, input gates, output gates, and forget gates, enabling it to capture dependencies in input information. The input gate selected the relevant information with which to update the memory cell. The forget gate determined how much of the previous cell state was passed on: if the result of the forget gate was close to zero, the information was forgotten, while if it was close to one, the information was retained. The output gate controlled how much of the cell state was exposed as the hidden state. This gating helped LSTM alleviate the issues of gradient explosion and vanishing gradients. LSTM overcame the short-term memory limitations of RNNs; when a sequence was long, an RNN struggled to propagate information from earlier time steps to later ones, whereas LSTM could learn long-term dependencies, remember information from earlier time steps, and thus establish context.

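The LSTM is calculated using the following quantities:

$x_t$: the input data at time t.
$h_{t-1}$: the hidden state at time t−1.
$c_{t-1}$: the state of the cell at time t−1.

Given $x_t$, $h_{t-1}$, and $c_{t-1}$, the LSTM first computes the forget gate $f_t$, the input gate $i_t$, the output gate $o_t$, and the candidate cell state $\tilde{c}_t$ with Formulas (2–5) to (2–8), where $\sigma$ is the logistic sigmoid function, $[h_{t-1}, x_t]$ is the concatenation of the two vectors, and $W$ and $b$ are the learned weights and biases of each gate:

$$f_t = \sigma\left(W_f\,[h_{t-1}, x_t] + b_f\right) \tag{2–5}$$

$$i_t = \sigma\left(W_i\,[h_{t-1}, x_t] + b_i\right) \tag{2–6}$$

$$o_t = \sigma\left(W_o\,[h_{t-1}, x_t] + b_o\right) \tag{2–7}$$

$$\tilde{c}_t = \tanh\left(W_c\,[h_{t-1}, x_t] + b_c\right) \tag{2–8}$$

The LSTM then uses $f_t$, $c_{t-1}$, $i_t$, and $\tilde{c}_t$ to calculate the cell state of the current step, $c_t$, as shown in Eq (2–9), where $\odot$ denotes element-wise multiplication:

$$c_t = f_t \odot c_{t-1} + i_t \odot \tilde{c}_t \tag{2–9}$$

The LSTM uses $o_t$ and $c_t$ to compute the hidden state of the current step, as shown in Eq (2–10):

$$h_t = o_t \odot \tanh\left(c_t\right) \tag{2–10}$$

Finally, the hidden state $h_t$ is the output given by the LSTM at time t.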
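The gate computations described above can be traced step by step in a few lines of NumPy. The sketch below implements one LSTM time step in the standard formulation; the stacked weight layout (forget, input, output, candidate), the hidden size, the random weights, and the choice of feeding eight features as eight time steps are assumptions made for illustration only.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One LSTM step; W maps concat([h_prev, x_t]) to four stacked pre-activations."""
    hidden = h_prev.shape[0]
    z = W @ np.concatenate([h_prev, x_t]) + b
    f_t = sigmoid(z[0:hidden])                     # forget gate
    i_t = sigmoid(z[hidden:2 * hidden])            # input gate
    o_t = sigmoid(z[2 * hidden:3 * hidden])        # output gate
    c_tilde = np.tanh(z[3 * hidden:4 * hidden])    # candidate cell state
    c_t = f_t * c_prev + i_t * c_tilde             # new cell state
    h_t = o_t * np.tanh(c_t)                       # new hidden state
    return h_t, c_t

rng = np.random.default_rng(0)
n_in, hidden = 1, 4
W = rng.normal(scale=0.5, size=(4 * hidden, hidden + n_in))
b = np.zeros(4 * hidden)

h, c = np.zeros(hidden), np.zeros(hidden)
sequence = rng.random((8, n_in))                   # eight values fed as eight time steps
for x_t in sequence:
    h, c = lstm_step(x_t, h, c, W, b)

print(np.round(h, 3))
```

Because the output gate lies in (0, 1) and tanh lies in (−1, 1), every component of the hidden state stays strictly between −1 and 1, whatever the input scale.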

LSTM was commonly used for classification tasks, with a softmax layer as the usual classification layer. The output of the softmax layer could be interpreted as the estimated probability of the sample belonging to a certain class. In binary classification tasks, a threshold was applied to convert the probability value into a specific class label: if the probability was greater than the threshold, the sample was predicted to belong to the positive class; otherwise, it was predicted as the negative class. In this experiment, the cross-entropy loss function was applied to the classification layer of the LSTM. When the features of the data were linearly inseparable, combining SVM with LSTM addressed the same classification problem from two perspectives. The features learned by the LSTM may have made the originally inseparable problem separable for the SVM, thereby further improving the classification performance.
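The idea of passing the LSTM's penultimate output to an SVM can be sketched in PyTorch and scikit-learn as below. This is a minimal sketch under several assumptions, not the authors' code: the class name `LSTMNet`, the hidden size of 16, the treatment of each selected feature as one time step, the synthetic data, and the shortened training loop (20 full-batch epochs instead of the 300 epochs with batch size 6 reported later) are all illustrative choices. For brevity the SVM is fitted and evaluated on the same samples, which would not be done in a real evaluation.

```python
import numpy as np
import torch
import torch.nn as nn
from sklearn.svm import SVC

torch.manual_seed(0)

class LSTMNet(nn.Module):
    def __init__(self, hidden_size=16, n_classes=2):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, n_classes)

    def penultimate(self, x):
        # x: (batch, n_features); each feature is treated as one time step (assumption).
        _, (h_n, _) = self.lstm(x.unsqueeze(-1))
        return h_n[-1]                              # (batch, hidden_size)

    def forward(self, x):
        return self.fc(self.penultimate(x))         # logits; the loss applies softmax

# Synthetic stand-in data: 78 samples, 8 normalized features.
rng = np.random.default_rng(0)
X = rng.random((78, 8)).astype(np.float32)
y = np.array([1] * 39 + [0] * 39)
X[y == 1, :3] += 0.5
X_t = torch.from_numpy(X)
y_t = torch.tensor(y, dtype=torch.long)

model = LSTMNet()
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
loss_fn = nn.CrossEntropyLoss()
for epoch in range(20):                             # shortened for illustration
    optimizer.zero_grad()
    loss = loss_fn(model(X_t), y_t)
    loss.backward()
    optimizer.step()

model.eval()
with torch.no_grad():
    feats = model.penultimate(X_t).numpy()          # LSTM features for the SVM
    lstm_probs = torch.softmax(model(X_t), dim=1).numpy()

svm = SVC(kernel="rbf", gamma="scale", probability=True, random_state=0).fit(feats, y)
svm_probs = svm.predict_proba(feats)
print(feats.shape, svm_probs.shape)
```

The two probability arrays, `lstm_probs` and `svm_probs`, are the kind of inputs a soft voting step would average.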

3 Experimental results

3.1 Network training

This study is implemented based on Python (3.9.12) using publicly available standard libraries: pandas (1.5.2), numpy (1.22.4), scikit-learn (1.2.0), torch (1.12), and matplotlib (3.6.2). To avoid underfitting or overfitting, the DataLoader method is used to randomly shuffle the samples in the dataset at the beginning of each epoch. This helps the model better learn the data distribution and improves its generalization ability.

The network training mainly focuses on the hidden layers. In this study, 10-fold cross-validation is used to evaluate the model’s performance. First, the entire dataset is divided into 10 parts; each part is used as the test set in turn, with the remaining nine parts used as the training set. The procedure therefore consists of 10 rounds of training, and within each round the network is trained for multiple epochs. The training set is divided into mini-batches, and the model’s parameters are updated through backpropagation with the Adam optimizer. After training is completed, the penultimate-layer output of the LSTM is extracted as a feature vector. The SVM is trained on the feature vectors of the training set and then used to predict the test set, from which the accuracy is calculated. Finally, the accuracy of each round is stored in a list.
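The cross-validation loop can be outlined as follows. This sketch is illustrative only: it uses synthetic data, substitutes a plain RBF-kernel SVM for the full LSTM-SVM pipeline to stay short, and assumes stratified, shuffled folds, which the text does not specify.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.svm import SVC

# Synthetic stand-in data: 78 samples, 8 features, balanced labels.
rng = np.random.default_rng(0)
X = rng.random((78, 8))
y = np.array([1] * 39 + [0] * 39)
X[y == 1, :3] += 0.5

cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
accuracies, n_tested = [], 0
for train_idx, test_idx in cv.split(X, y):
    # Nine folds train the model; the held-out fold is used only for testing.
    clf = SVC(kernel="rbf", gamma="scale").fit(X[train_idx], y[train_idx])
    accuracies.append(clf.score(X[test_idx], y[test_idx]))
    n_tested += len(test_idx)

mean_acc = float(np.mean(accuracies))
print(len(accuracies), n_tested, round(mean_acc, 3))
```

Every sample is tested exactly once, so the mean of the ten fold accuracies summarizes performance on data the model did not see during the corresponding training round.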

This experiment conducts comparative tests on multiple models with the same hyperparameter settings. The specific settings are as follows: the training epoch is 300, the initial learning rate is 0.001 [17], the batch size is set to 6, and the optimization algorithm used is Adaptive Moment Estimation (Adam) [18]. The GPU used for training is an NVIDIA GeForce GTX1060 laptop GPU, with 16GB of memory and 1280 CUDA cores.

3.2 CRFS preprocessing results

To address the issue of redundant information in the data that may lead to suboptimal classification, the CRFS preprocessing method is used to retain relevant information and eliminate irrelevant information. Table 1 presents partial results of feature selection using the CRFS data preprocessing method.

Table 1. CRFS preprocessing results.

https://doi.org/10.1371/journal.pone.0310005.t001

Table 1 shows that, within the CRFS preprocessing method, the Random Forest (RF) and Principal Component Analysis (PCA) methods share 8 identical microbes among their top 20 features, which are highlighted in bold, including Bacteroides_coprocola and Alistipes_putredinis, among others. Fig 4 illustrates the corresponding importance scores and contribution rates of these 8 shared features. Ultimately, these 8 features are incorporated into the model, indicating their significant role in the prediction process.

Fig 4. CRFS significant scores and contributions.

https://doi.org/10.1371/journal.pone.0310005.g004

3.3 PGPM classifier performance analysis

3.3.1 Evaluation of the performance of different models.

In this study, ACC stands for accuracy, which refers to the proportion of correctly classified instances out of the total number of instances when using the test set to evaluate a model in classification tasks. However, ACC has certain limitations and may not fully reflect the performance of a model. For example, it does not account for class imbalance, where one class has significantly more samples than the others. Because accuracy alone cannot fully assess the model’s performance, the Area Under the Curve (AUC) and the ROC curve are also used. The ROC curve plots the true positive rate (TPR) against the false positive rate (FPR), and the AUC is the area under this curve. The term $n_{\text{correct}}$ represents the number of correctly classified records, while $n_{\text{total}}$ represents the total number of test data. The calculation formulas are shown in Eqs (3–1) to (3–3):

$$\mathrm{ACC} = \frac{n_{\text{correct}}}{n_{\text{total}}} \tag{3–1}$$

$$\mathrm{TPR} = \frac{TP}{TP + FN} \tag{3–2}$$

$$\mathrm{FPR} = \frac{FP}{FP + TN} \tag{3–3}$$

Among them, True Positives (TP) refer to positive samples correctly predicted as positive, representing the number of positive instances correctly predicted; False Positives (FP) refer to negative samples incorrectly predicted as positive, representing the number of negative instances incorrectly predicted; True Negatives (TN) refer to negative samples correctly predicted as negative, representing the number of negative instances correctly predicted; False Negatives (FN) refer to positive samples incorrectly predicted as negative, representing the number of positive instances incorrectly predicted.
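A small worked example may help relate these counts to the reported metrics. The sketch below uses plain Python and six hypothetical samples; it assumes the ROC curve is built from the usual true positive rate and false positive rate, and it computes the AUC as the fraction of positive-negative pairs in which the positive sample receives the higher score. The function names and the scores are illustrative.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (TP, FP, TN, FN) for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return tp, fp, tn, fn

def auc_from_scores(y_true, scores, positive=1):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

y_true = [1, 1, 1, 0, 0, 0]                      # 1 = PD, 0 = healthy spouse
scores = [0.9, 0.8, 0.4, 0.3, 0.2, 0.6]          # hypothetical predicted probabilities
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # threshold at 0.5

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
acc = (tp + tn) / len(y_true)                    # 4 of 6 correct
tpr = tp / (tp + fn)                             # 2 of 3 positives found
fpr = fp / (fp + tn)                             # 1 of 3 negatives misclassified
auc = auc_from_scores(y_true, scores)            # 8 of 9 pairs ranked correctly
print(tp, fp, tn, fn, round(acc, 3), round(tpr, 3), round(fpr, 3), round(auc, 3))
```

Accuracy depends on the chosen threshold, whereas the AUC depends only on how the scores rank the samples, which is why the two metrics can differ for the same model.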

To compare the effectiveness of the PGPM proposed in this study with that of other commonly used neural networks for processing gut microbiota data, training and testing were conducted on this dataset, and the results are presented in Table 2. Table 2 shows that the overall classification performance of the PGPM method surpasses that of other commonly used classification models. For a more intuitive comparison of the differences in Mean ACC, AUC, and ROC among the various models, this study provides bar graphs of the three indicators, as shown in Fig 5A. The ROC curve plot for the PGPM method is illustrated in Fig 5B.

Fig 5. Histograms of the different models and ROC curves of the different models.

https://doi.org/10.1371/journal.pone.0310005.g005

Table 2. Experimental results of different models.

https://doi.org/10.1371/journal.pone.0310005.t002

Fig 5A clearly shows that the PGPM exhibits significant advantages in Mean ACC, ROC, and AUC, and the comprehensive performance across all three indicators is notably high.

3.3.2 PGPM ablation experiments.

In this experiment, to assess the individual impact of each module on the model’s predictive ability, ablation experiments were conducted, as shown in Table 3. By comparing these experiments, we can observe the effects of different modules on the experimental results. The baseline was set as the LSTM model.

Table 3. List of ablation experiments.

https://doi.org/10.1371/journal.pone.0310005.t003

Based on the experimental list in Table 3, the corresponding model structures are constructed using the same hyperparameters, experiments are conducted using the same dataset, and the experimental results of the methods are compared, as shown in Table 4. The comparison line graph is depicted in Fig 6.

Fig 6. Line chart comparison.

https://doi.org/10.1371/journal.pone.0310005.g006

Table 4. Experimental results.

https://doi.org/10.1371/journal.pone.0310005.t004

From Table 4 and the line graph in Fig 6, it can be observed that as the methods continue to improve, the experimental results also show consistent enhancement. Comparing the results between Experiment 1 and Experiment 2, as well as between Experiment 1 and Experiment 3, it is evident that incorporating a single feature selection method improves the LSTM model’s classification performance in terms of mean accuracy, AUC, and ROC. This suggests that feature simplification can reduce model complexity and enhance model performance to a certain extent.

Comparing Experiment 2, Experiment 3, and Experiment 4, it is apparent that the performance of the CRFS module surpasses that of a single feature selection method. Contrasting Experiment 1 with Experiment 5, it is clear that all metrics have improved. By combining the LSTM and SVM classification methods, the model’s performance is further boosted.

Comparing Experiment 2, Experiment 3, Experiment 4, Experiment 5, and Experiment 6, it becomes evident that the contributions of the CRFS module and the PGPM method to the model’s improvement exceed those of the individual methods, leading to superior overall performance. The experimental results consistently indicate that the PGPM method is more effective than the methods used in previous studies.

4 Discussion

The gut microbiota plays a crucial role in predicting Parkinson’s disease [19], and previous studies have clearly indicated a close relationship between the two. For example, the study by Bedarf et al. [20] found significant differences in the gut microbiota composition of Parkinson’s disease patients compared to healthy controls. These differences were mainly reflected in abundance changes of specific microorganisms, which might reveal particular pathophysiological processes of Parkinson’s disease and provide new clues for its diagnosis and prediction. The core objective of this study was to develop an efficient and accurate prediction method for the early diagnosis of Parkinson’s disease through in-depth analysis of gut microbiota data. We proposed a differential gut microbiota-based Parkinson’s prediction method (PGPM) built on deep learning, aiming to capture the subtle differences in the gut microbiome that traditional machine learning [21] methods might miss and to offer new perspectives and tools for Parkinson’s disease prediction.

In this study, we compared the performance of different methods for predicting Parkinson’s disease. Compared to the baseline methods (DNN, LSTM, CNN, and SVM), our proposed method performed better, demonstrating its high capability in Parkinson’s prediction classification. Mean accuracy, AUC, and ROC values were selected as key indicators to evaluate method performance, and the research results showed that the PGPM method achieved the best performance. In the comparison of classification performance after feature selection, it was found that feature dimensionality reduction could simplify the model and improve its performance to a certain extent. Additionally, the combination of preprocessing methods led to more significant improvements in classification performance.

Our PGPM method achieved significant results in the classification prediction of Parkinson’s disease, with a mean accuracy (Mean ACC) of 0.85, and both the area under the curve (AUC) and receiver operating characteristic curve (ROC) reaching 0.92. These results indicated that by deeply analyzing gut microbiota data, we could accurately distinguish Parkinson’s disease patients from healthy individuals, providing strong support for the early diagnosis of Parkinson’s disease.

Given the high-dimensional feature space and high redundancy of medical data, feature selection was necessary in data analysis. In this study, using the CRFS preprocessing method, eight gut microbiota features were selected, resulting in higher prediction accuracy for subsequent classification, with an increase of about 0.2 compared to single feature selection methods. This demonstrated the importance of feature selection for disease prediction. A study on the gut microbiota of diabetic patients also confirmed this, showing that selected gut microbiota features were crucial for the predictive ability of the model [22].

Furthermore, for classifiers such as DNN, LSTM, CNN, and SVM, high-dimensional feature spaces, redundant features, noisy features, and class imbalance in the data posed challenges to classification performance. Therefore, in this study, we combined LSTM with SVM, which improved the accuracy by about 0.3 compared to the other methods (DNN, LSTM, CNN, and SVM). The experimental results also demonstrated the effectiveness of combining feature dimensionality reduction with a combined classification model.

Compared to some methods developed for the microbiome in recent years, our method was simple, robust, and effective. Nevertheless, this study had limitations, notably the small cohort (39 patients and their 39 healthy spouses) and the lack of validation on an independent cohort. Future work would focus on expanding the sample size, improving result stability, and validating the external applicability and generalizability of this prediction model in larger independent validation groups. Additionally, we would further explore the relationship between gut microbiota and Parkinson’s disease to achieve broader applications in personalized medicine.

In conclusion, the Parkinson’s disease prediction model established in this study had achieved significant results, revealing the potential association between gut microbiota and Parkinson’s disease. These findings might provide new ideas and methods for the early diagnosis and treatment of Parkinson’s disease. Further research could deepen the understanding of the relationship between gut microbiota and Parkinson’s disease and explore its potential in personalized medicine.

5 Conclusion

Support Vector Machine (SVM) and other machine learning methods are mainstream approaches for processing gut microbiota data. Such data are not only large in volume but also contain many implicit correlations, and their complex background makes it challenging for traditional machine learning or LSTM alone to extract accurate features. Furthermore, research on deep learning for classification prediction from gut microbiota data is lacking. To address this, the PGPM method provides a complete pipeline from feature selection to classification prediction: it selects relevant features in the preprocessing step and uses a combined LSTM-SVM classification strategy to perform the prediction task. Overall, PGPM outperforms the compared models and can effectively classify Parkinson’s disease from gut microbiota data. Future research will continue to refine feature selection and pursue more precise classification models. This method could also be extended to predict other diseases related to the gut microbiota.

Supporting information

S1 File.

https://doi.org/10.1371/journal.pone.0310005.s001

(RAR)

References

1. Zhang W, Ye Y, Song J, Sang T, Xia T, Xie L, et al. Research Progress of Microbiota-Gut-Brain Axis in Parkinson’s Disease. J Integr Neurosci. 2023 Oct 30;22(6):157. pmid:38176929
2. Verhaar B J H, Hendriksen H M A, De Leeuw F A, et al. Gut Microbiota Composition Is Related to AD Pathology [J]. Frontiers in Immunology, 2022, 12.
3. Waller S, Williams L, Morales-Briceño H, et al. The initial diagnosis and management of Parkinson’s disease [J]. Australian Journal for General Practitioners, 2021, 50: 793–800.
4. Zhu F, Ju Y, Wang W, et al. Metagenome-wide association of gut microbiome features for schizophrenia [J]. Nature Communications, 2020, 11(1): 1612.
5. Bastiaanssen T F S, Cussotto S, Claesson M J, et al. Gutted! Unraveling the Role of the Microbiome in Major Depressive Disorder [J]. Harvard Review of Psychiatry, 2020, 28(1).
6. Lin P, Li Q. Can gut flora changes be new biomarkers for depression? [J]. Frontiers in Laboratory Medicine, 2017, 1(3): 129–34.
7. Chen B, You N, Pan B, et al. Application of Clustering Method to Explore the Correlation Between Dominant Flora and the Autism Spectrum Disorder Clinical Phenotype in Chinese Children [J]. Frontiers in neuroscience, 2021, 15.
8. Zhulin Igor B. Classic Spotlight: 16S rRNA Redefines Microbiology [J]. Journal of Bacteriology, 2016, 198(20): 2764–5.
9. Boutet A, Madhavan R, Elias G J B, et al. Predicting optimal deep brain stimulation parameters for Parkinson’s disease using functional MRI and machine learning [J]. Nature Communications, 2021, 12(1): 3043.
10. Borzì L, Mazzetta I, Zampogna A, et al. Prediction of Freezing of Gait in Parkinson’s Disease Using Wearables and Machine Learning [J/OL] 2021, 21(2): pmid:33477323
11. Mao L, Zhang Y, Tian J, et al. Cross-Sectional Study on the Gut Microbiome of Parkinson’s Disease Patients in Central China [J]. Front Microbiol, 2021, 12: 728479.
12. Yang L, Wu H, Jin X, et al. Study of cardiovascular disease prediction model based on random forest in eastern China [J]. Scientific Reports, 2020, 10(1): 5245.
13. Ahmad F, Dar W M. Classification of Alzheimer’s Disease Stages: An Approach Using PCA-Based Algorithm [J]. American Journal of Alzheimer’s Disease & Other Dementias®, 2018, 33(7): 433–9.
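14. Zhao D. Research on a colorectal cancer prediction model based on feature selection [J]. 2019. (in Chinese)
15. Zheng C. Design and implementation of a system for predicting associations between human gut microbiota structure and disease [D]; Harbin Institute of Technology, 2021. (in Chinese)
16. Hou X, Zhao Y, Yan H, et al. Parkinson’s disease diagnosis method based on a deep LSTM residual network [J]. Chinese Journal of Medical Physics, 2023, 40(05): 609–615. (in Chinese)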
17. Buongiorno R, Germanese D, Colligiani L, et al. Chapter 9—Artificial intelligence for chest imaging against COVID-19: an insight into image segmentation methods [M]//CHATTERJEE P, ESPOSITO M. Artificial Intelligence in Healthcare and COVID-19. Academic Press. 2023: 167–200.
18. Zeke Xie, Xinrui Wang, Huishuai Zhang, Issei Sato, Masashi Sugiyama Proceedings of the 39th International Conference on Machine Learning, PMLR 162:24430–24459, 2022.
19. Kim J, Lee S, Hwang E, et al. Limitations of Deep Learning Attention Mechanisms in Clinical Research: Empirical Case Study Based on the Korean Diabetic Disease Setting [J]. J Med Internet Res, 2020, 22(12): e18418.
20. Romano S, Savva G M, Bedarf J R, et al. Meta-analysis of the Parkinson’s disease gut microbiome suggests alterations linked to intestinal inflammation [J]. npj Parkinson’s Disease, 2021, 7(1): 27.
21. Hernández Medina R, Kutuzova S, Nielsen K N, et al. Machine learning and deep learning applications in microbiome research [J]. ISME Communications, 2022, 2(1): 98.
22. Guo S., Zhang H., Chu Y., Jiang Q., & Ma Y. (2022). A neural network‐based framework to understand the type 2 diabetes‐related alteration of the human gut microbiome. iMeta, 1(2). pmid:38868565
