Figures
Abstract
Alzheimer’s disease (AD) is a neurodegenerative disorder that requires early diagnosis for effective management. However, issues with currently available diagnostic biomarkers preclude early diagnosis, necessitating the development of alternative biomarkers and methods, such as blood-based diagnostics. We propose c-Triadem (constrained triple-input Alzheimer’s disease model), a novel deep neural network to identify potential blood-based biomarkers for AD and predict mild cognitive impairment (MCI) and AD with high accuracy. The model utilizes genotyping data, gene expression data, and clinical information to predict the disease status of participants, i.e., cognitively normal (CN), MCI, or AD. The nodes of the neural network represent genes and their related pathways, and the edges represent known relationships among the genes and pathways. Simulated data validation further highlights the robustness of key features identified by SHapley Additive exPlanations (SHAP). We trained the model with blood genotyping data, microarray, and clinical features from the Alzheimer’s Neuroimaging Disease Initiative (ADNI). We demonstrate that our model’s performance is superior to previous models with an AUC of 97% and accuracy of 89%. We then identified the most influential genes and clinical features for prediction using SHapley Additive exPlanations (SHAP). Our SHAP analysis shows that CASP9, LCK, and SDC3 SNPs and PINK1, ATG5, and ubiquitin (UBB, UBC) expression have a higher impact on model performance. Our model has facilitated the identification of potential blood-based genetic markers of DNA damage response and mitophagy in affected regions of the brain. The model can be used for detection and biomarker identification in other related dementias.
Author summary
C-Triadem, our novel developed deep neural network, accurately predicts moderate cognitive impairment (MCI) and Alzheimer’s disease (AD) while identifying potential blood biomarkers for AD. Current diagnostic methods have limitations, emphasizing the critical need for early AD detection. Our model integrates genetic, gene expression, and clinical data to differentiate among cognitively normal individuals, MCI, and AD cases. Training and validation using Alzheimer’s Disease Neuroimaging Initiative (ADNI) data demonstrate superior performance, with a 97% AUC and 89% accuracy, surpassing previous models. SHapley Additive exPlanations (SHAP) analysis highlights key clinical features (e.g., MMSE scores, brain volume) and genes (e.g., CASP9, LCK, SDC3), revealing potential genetic markers and pathways in blood associated with AD. By incorporating Reactome pathways, our approach enhances interpretability, providing insight into the biological context of predictions. In summary, c-Triadem represents a significant advancement in AD diagnostics, enabling earlier and more accurate diagnoses for improved treatment strategies.
Citation: Jemimah S, Abuhantash F, AlShehhi A (2025) c-Triadem: A constrained, explainable deep learning model to identify novel biomarkers in Alzheimer’s disease. PLoS ONE 20(4): e0320360. https://doi.org/10.1371/journal.pone.0320360
Editor: Jose Gerardo Tamez-Peña, Instituto Tecnológico y de Estudios Superiores de Monterrey: Tecnologico de Monterrey, Researcher, Escuela de Medicina, MEXICO.
Received: April 9, 2024; Accepted: February 17, 2025; Published: April 14, 2025
Copyright: © 2025 Jemimah et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The constrained model for this study is publicly available from the GitHub repository (https://github.com/Sherlyn-J/KU-BMED2/). The dataset analyzed in this study is publicly available from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) repository (https://adni.loni.usc.edu/) with accession number “sa000002.”
Funding: This work is supported by Khalifa University under Award no. FSU-2021-005. This fund acquired by AA. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Alzheimer’s disease (AD), the most common form of dementia, is characterized by a gradual decline in cognition and memory. By 2030, it is estimated to affect approximately 78 million elderly individuals[1]. While the APOE ϵ4 allele and mutations in presenilin-I and APP (amyloid precursor protein) are established genetic markers [2], AD is widely recognized as a multifactorial and complex disease influenced by both genetic and environmental factors [3]. The hallmark pathological features of AD include amyloid-β (Aβ) deposition and the formation of neurofibrillary tangles (NFTs), primarily composed of tau protein aggregates [4]. Early symptoms are associated with accelerated neurodegeneration in the entorhinal cortex and the cornu ammonis 1 region of the hippocampus [5,6], which correspond to increased NFT accumulation in these regions [7].
AD hallmarks are used by the National Institute on Aging and Alzheimer’s Association (NIA-AA) to categorize existing AD biomarkers under Aβ deposition, pathologic tau, and neurodegeneration [AT(N)] research framework, enabling a biological definition of AD [8]. However, the diagnostic modalities for AT(N) biomarkers (such as MRI (magnetic resonance imaging), PET (positron emission tomography), and lumbar puncture) are known to be expensive or invasive -knowing that MRI and PET scans can cost up to US$ 8,000 and US$ 3,400 respectively [9]. In contrast, blood tests are relatively cost-effective (up to US$ 1,250). Therefore, clinical diagnosis through blood sampling has been proposed to make testing more accessible and affordable for the patient population, and enable routine monitoring [9]; as well as recent research which has focused on elucidating blood-based biomarkers, to facilitate less-invasive diagnostic tests for AD [10–13].
Conventional biomarker testing in blood samples faces several significant challenges. A primary issue is the difficulty in quantifying the extremely low levels of Aβ and tau proteins against a backdrop of abundant plasma proteins, such as albumin and immunoglobulin. Additionally, these low levels of Aβ and tau are subject to metabolization and clearance by physiological processes, further complicating detection [14]. While ultrasensitive assays may address this issue [15], the considerable overlap in Aβ levels between AD patients and cognitively normal individuals limits its utility as a standalone blood biomarker [16]. Furthermore, current biomarkers rely on significant Aβ deposition or tau pathology for detection, making early diagnosis—critical for effective treatment—difficult to achieve [17].
The limitations of conventional blood-based biomarkers highlight the urgent need for novel methods and the identification of new biomarkers to enable accurate and early diagnosis. Experimental evidence suggests changes in the blood cells of patients with mild cognitive impairment (MCI), the prodromal stage, and advanced AD, which could serve as diagnostic indicators [18]. Polygenic risk scores, which capture the association of multiple gene loci with plasma biomarkers [19], have been shown to improve the distinction between cognitively normal (CN) individuals and AD cases [20]. Similarly, Reddy et al. [21] demonstrated that plasma mRNA levels from an eQTL-curated gene panel significantly enhance diagnostic accuracy and may help differentiate between types of dementia. Machine learning (ML) approaches have also identified blood-based panels of potential biomarker genes for the early detection of AD [17,22].
With ML and artificial intelligence (AI), accurate and early diagnosis using multimodal, blood-based data is a distinct possibility. Several ML models have been proposed to distinguish between CN, MCI, and advanced AD patients. Stamate et al. [23] utilized plasma metabolites in an XGBoost model to achieve an AUC of 0.89 in distinguishing between AD and CN cases. Logistic regression using plasma levels of inflammatory proteins enabled the differentiation of AD from controls (AUC 0.79) and MCI subjects (AUC 0.74) [24]. A Random forest model of serum protein multiplex biomarker data attained an AUC of 0.91 in predicting AD and CN cases [25]. Qui et al. [26] proposed a deep learning framework that uses neuroimaging data and patient demographic information for binary classification of AD and cognitively normal participants with accuracy comparable to human experts. Advances in deep learning such as the development of the SHapley Additive exPlanations (SHAP) [27] algorithm enable the investigation of multi-omic data with transparent, biologically relevant models, which is important to engender trust and facilitate adoption by clinicians [28].
In this study, we introduce c-Triadem, a constrained deep neural network multiclass-classifier, which incorporates prior biological information in the form of Reactome pathways to accurately classify samples as CN, MCI, or AD. Current diagnostic methods for AD, such as MRI and PET, are costly and invasive, highlighting the need for alternative, blood-based biomarkers. However, challenges like the low levels of Aβ and tau proteins in blood hinder their use. c-Triadem addresses this by utilizing blood genotyping, gene expression, and clinical data. Data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) [29] are used for training, validation, and testing. The significance of genetic and clinical features in the input is assessed by employing SHapley Additive exPlanations (SHAP). Further, we interrogate the intermediate layers to better understand model behavior and gain mechanistic insights into the role of the genes identified as potential biomarkers. The proposed methodology, experimental data, results, and conclusions are presented in the following sections.
Results
Clinical cohort characteristics
The cohort of selected ADNI participants includes 212 CN, 317 MCI, and 97 AD subjects. The clinical features after imputation are summarized in Tables 1 and 2. In general, the CN, MCI, and AD groups differ significantly in terms of mean age (p < 0.001) and proportion of female participants (p = 0.012). We also observe a higher proportion of AD patients with the APOE ϵ4 allele compared to MCI and CN subjects (p < 0.001). As expected, AD patients show significantly worse performance in neuropsychological tests (p < 0.001). Furthermore, measures of brain volume are lower in AD patients (p < 0.001) with the exception of ICV, which remains similar across the groups. Brain functioning measures are also significantly impacted in AD with higher levels of AV45 and CSF tau protein, along with lower FDG uptake (p < 0.001).
Model performance
With c-Triadem, we achieved an accuracy of 89% and an AUC of 97% on the test data. We also developed an unconstrained dense network with similar architecture (accuracy 87% and AUC 96%) for comparison and were able to demonstrate c-Triadem’s superior performance. ROC curves for the unconstrained and constrained models are shown in Fig 1a and Fig 1b, respectively. We also tested whether our model may be refined by additional pathway layers. Training our model with two additional pathway layers in each subnetwork produced comparable results with an AUC of 96% and an accuracy of 88%. The performance of our model compared to other available machine learning classifiers on ADNI data is shown in Table 3.
Model explanation
To examine the importance of specific genes and clinical features in model prediction, we computed SHAP values in the constrained model for the three types of input, i.e., SNPs, gene expression, and clinical features. Beeswarm plots for the top 20 features in each type of input are provided in Fig 2.
SHAP scores colored by feature values are presented for each feature and prediction class. Plots show the impact of top 20 (a, d, g) aggregated SNPs features, (b, e, h) microarray gene expression data, and (c, f, i) clinical data for (a, b, c) CN, (d, e, f) MCI and (g, h, i) AD prediction, respectively.
SNPs.
The SHAP beeswarm plots for SNPs are depicted in Fig 2a for CN prediction, Fig 2d for MCI prediction, and Fig 2g for AD prediction. SNPs found in genes such as CASP9, LCK, and SDC3 have been prioritized by the SHAP scores, showing their influence on model behavior. The SHAP bar plots for SNPs in Fig 3a, Fig 3d, and Fig 3g summarize the average impact of each feature on the model’s output. We performed a network analysis with STRING (Search Tool for Interacting Genes/Proteins) using the selected genes to identify enriched biological pathways, as described in the Methodology section 4.6. The resulting network contains 40 edges (protein-protein interaction (PPI) enrichment p-value 0.0157), which suggests that the genes are biologically connected. More than three-fourths of the interactions have been experimentally validated or extracted from curated databases. Notably enriched pathways include cellular response to stress (network strength (NS) of 0.92 with false discovery rate (FDR) of 0.00018) and CD28 costimulation (NS 1.74 with FDR 0.00055). CD28 is expressed by T-cells and is essential for T-cell proliferation and cytokine production.
SHAP summary bar plot displaying the mean impact of features on the model’s output. Plots show the impact of top 20 (a, d, g) aggregated SNPs features, (b, e, h) microarray gene expression data, and (c, f, i) clinical data for (a, b, c) CN, (d, e, f) MCI and (g, h, i) AD prediction, respectively.
Gene expression.
The SHAP beeswarm plots for gene expression are depicted in Fig 2b for CN prediction, Fig 2e for MCI prediction, and Fig 2h for AD prediction. The expression of CCNE1, ATG5, MAP1LC3B, RB1, UBC, TOMM20, and PINK1 are marked as significant for model prediction with SHAP. The SHAP bar plots for gene expression in Fig 3b, Fig 3e, and Fig 3h summarize the average impact of each feature on the model’s output. We examined the protein-protein interaction network using STRING to identify enriched pathways among the selected genes. The network contains 164 edges (PPI enrichment p-value < 1.0×10−16), with the vast majority being experimentally validated and/or extracted from curated databases, indicating biological relevance. Enriched pathways include aggrephagy (NS 2.12, FDR 0.0082), mitophagy (NS 1.99, FDR 0.0119), autophagosome assembly (NS 1.52, FDR 9.02×10−5) and selective autophagy (NS 1.69, FDR 0.0027). Further, pathways related to DNA damage response (NS 1.78, FDR 7.36×10−7) and nucleotide repair (NS 1.83, FDR 9.27×10−5) have been enriched.
Clinical features.
The SHAP beeswarm plots for clinical features are depicted in Fig 2c for CN prediction, Fig 2f for MCI prediction, and Fig 2i for AD prediction. We observed that the clinical features, such as MMSE, Ecog, FAQ, RAVLT immediate recall, RAVLT learning and CDRSB scores are highly impactful in prediction of AD. Other clinical features with significant influence on model prediction include whole brain volume, presence of APOE ϵ4 allele, age, and uptake levels of FDG and AV45. The SHAP bar plots for clinical features in Fig 3c, Fig 3f, and Fig 3i summarize the average impact of each feature on the model’s output.
Intermediate layer activation
Nodes with significant differences in activation across CN, MCI, and AD inputs represent pathways through which genotyping and gene expression inputs impact c-Triadem predictions. Therefore, we examined node activation in the intermediate layers of the subnetworks. We observed differences in activation at all three hierarchy levels in our model. Fig 4 depicts the kernel density estimation curves for pathway nodes in the intermediate layers of the subnetworks with significant differences in activation. At the lowest level, we observed differences in activation among the nodes representing PLCγ1 (phospholipase-c gamma 1) signaling, neutrophil degranulation, and TRKA (tropomyosin receptor kinase A) activation by NGF (nerve growth factor). Differentially activated pathways at the second level include DNA double-stranded break response, formation of apoptosomes, and caspase activation via death receptors. Third-level pathways include cytochrome-c mediated apoptotic response, caspase activation, and signaling by NOTCH and FLT3.
Kernel density estimation curves are presented for pathways with significant differences in activation (established by the Kruskal-Wallis test) at all three levels of hierarchy for (a) SNPs and (b) gene expression data. The P-value for each pathway is reported adjacent to the graph.
Simulated data validation
To validate the robustness of the c-Triadem model, a simulated dataset was created. We selected three key features identified by SHAP analysis: CCNE1 (SNP data), LCK (gene expression data), and MMSE (clinical data) for modification. CCNE1 and LCK were reduced by 20–30%, while MMSE was decreased by 1–25%, reflecting realistic ranges derived from observed real data and literature [56]. The simulated data led to improved classification performance, with the model achieving accuracy of 0.9037, AUC of 0.9810, and F1-score of 0.9037, compared to the real data, which achieved ACC of 0.893, AUC of 0.972, and F1-score of 0.893. These results confirm that the features identified by the model are important for accurate predictions, as amplifying their associations significantly enhances the model’s ability to distinguish between Alzheimer’s disease progression stages. The distributional shifts for LCK, CCNE1, and MMSE following the simulated adjustments are illustrated in Fig 5a, Fig 5b, and Fig 5c respectively. Additionally, Fig 5d includes the ROC curve for the simulated data validation, highlighting the enhanced predictive performance of the model with amplified feature associations.
(a) CCNE1, (b) LCK, and (c) MMSE distributions show the effects of simulated adjustments. (d) The ROC curve demonstrates the predictive performance, validating the alignment of simulated data with real-world variability.
Discussion
In this study, we have described our model c-Triadem, a constrained multimodal deep neural network multiclassifier that accurately predicts the patient’s disease status as either CN, MCI, or AD on the basis of their genotyping, gene expression, and clinical data. The input layers contain nodes encoding genes and their respective biological pathways, and edges are constrained to reflect known relationships among the genes and pathways. The pathway information was taken from the Reactome pathway database, which systematically associates proteins with their functional role in biological pathways, and connects pathways hierarchically. We have shown that our model has superior performance compared to previous deep neural networks trained with the ADNI dataset.
SHAP scores enable the identification of genetic and clinical features which explain the model predictions. From the STRING interaction networks, it is observed that the genes prioritized by SHAP (such as CASP9, LCK, SDC3, CCNE1, ATG5, UBC, and TOMM20) are associated with each other in AD-related biological pathways. The gene CASP9 (caspase-9) is a cysteine-aspartic protease that serves as an initiator for apoptosis. CASP9 is well-studied in neurodegenerative diseases, with increasing levels of CASP9 implicated in AD brains as well as platelet-rich plasma [33–35]. LCK (lymphocyte-specific protein tyrosine kinase) is located in a well-known linkage region [36] and has been named a risk factor for AD [37]. Mouse studies suggest that LCK inhibition alters spatial learning and impairs long-term memory. [38] SDC3 (Syndecan-3, a proteoglycan) expression in PBMC has been correlated with Aβ and has been suggested as a potential biomarker for the diagnosis of early AD [39]. Several studies have identified dysfunctional mitophagy as an early marker of Alzheimer’s disease, driving amyloid-β aggregation and neurofibrillary tangle formation [40]. Mitophagy and aggrephagy refer to the autophagic clearance of damaged mitochondria and protein aggregates respectively. Reduced levels of mitophagy markers ATG5 and Parkin have been demonstrated in MCI and AD patients [41]. Given that hypometabolism caused by impaired energy metabolism is one of the earliest possible markers of AD, genes related to mitophagy may serve as potential biomarkers for AD diagnosis in the early stages. Additionally, SHAP-selected clinical features (RAVLT scores, specifically immediate recall and percent forgetting) have a strong association with brain structural atrophy [42].
The constrained model also allows the identification of biological pathways which emphasize the differences among the three diagnostic groups. Moreover, the selected pathways with significant differences in intermediate node activation (such as PLCγ1 signaling, TRKA activation by NGF, DNA double-stranded break response, and caspase activation) reinforce SHAP findings. For instance, SHAP-selected CASP-9 works through the caspase activation pathway identified by intermediate node activation. The findings are also supported by previous studies. It is known that PLCγ1 activation is reduced in PBMC samples from AD patients [43]. Deficiencies in the TRKA/NGF axis are implicated in the depletion of cholinergic neurons and cognitive decline in AD [44].
Recent studies indicate that the development of AD is associated with systemic changes in the neuronal environment reflected in other parts of the body. Studies in peripheral blood mononuclear cells (PBMCs) from amnestic MCI and AD patients show differential expression of senescence markers, such as cell cycle blockade (p16 and p53) and DNA damage response (γH2AX) [18]. Garfias et al. [45] have reported significantly higher levels of activated lymphocytes in AD patients. Moreover, a gene expression analysis of PBMC samples in the AddNeuroMed cohort [46] identified DEGs (differentially expressed genes) significantly enriched in pathways related to T-cell and neutrophil activation in immune response, lymphocyte differentiation, protein serine/threonine kinase activity, GTPase and DNA transcription factor binding [47]. In our study, we have identified potential genetic markers of amyloidosis, immune activation, DNA damage response, and dysfunctional mitophagy in blood-derived genetic data. Further, the information from enriched pathways provides mechanistic insights into AD pathogenesis.
To further validate our findings, simulated data were generated by modifying three key features identified through SHAP analysis: CCNE1, LCK, and MMSE. CCNE1, associated with cell cycle regulation and implicated in neurodegenerative diseases [48], was reduced to simulate its dysregulation in AD. LCK, a critical kinase involved in T-cell activation, has been linked to neuroinflammation in AD pathogenesis [49]. MMSE, a cognitive measure, was adjusted to reflect varying stages of cognitive decline in AD. The enhanced model performance on simulated data (AUC = 0.981, ACC = 0.9037) shows the importance of these features in distinguishing AD progression stages, further supporting their role as potential biomarkers for early diagnosis and monitoring of the disease.
While our model has been validated through network analysis and showed consistency with literature reports, we recognize several limitations. One limitation is that multimodal data from additional cohorts would corroborate our findings and further establish the validity of our model on independently curated datasets. Additionally, while the Reactome pathways enhance interpretability, they may limit the discovery of novel interactions not yet reported in current biological databases. Another limitation is the absence of non-coding SNP data and other data modalities, which we plan to incorporate in future work. Non-coding SNPs may affect gene regulation and expression indirectly, potentially revealing more mechanistic insights into AD pathology.
The integration of multimodal data—genotyping, gene expression, and clinical features—enables the model to detect subtle biological and clinical changes, which are critical for identifying prodromal stages of AD. The inclusion of features like CASP9, LCK, and MMSE, highlighted by SHAP analysis, supports early prediction by capturing processes associated with apoptosis, neuroinflammation, and cognitive decline, respectively. Additionally, the model’s high accuracy across all classes, particularly its ability to distinguish MCI as an early stage of AD, highlights its potential for timely interventions and improved patient outcomes.
Additionally, c-Triadem’s performance in distinguishing MCI from AD suggests its potential in assessing and predicting MCI to AD conversion possibly through further sub-classification of converting MCI and non-converting MCI patients. SHAP analysis identified clinically relevant features, such as RAVLT scores and APOE ϵ4 status, that are well-documented predictors of cognitive decline. By modeling these features alongside genetic and molecular data, the model provides insights into pathways and markers that drive progression from MCI to AD. This could facilitate targeted monitoring and interventions for individuals at high risk of conversion.
Our model uses explainable AI to probe multi-omic data for genes and clinical features associated with AD pathogenesis and provides further transparency by exposing the biological pathways that influence its predictions. As a result, we firmly believe that c-Triadem holds the potential to be a tool that facilitates precise and early diagnosis of AD, as well as to assess and predict disease progression, including MCI to AD conversion, enabling tailored treatment strategies and improving patient care.
Materials and methods
Dataset
Alzheimer’s Disease Neuroimaging Initiative.
Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). For up-to-date information, see https://adni.loni.usc.edu/. In addition to MRI and PET neuroimaging of patients at regular intervals, ADNI has collected and analyzed whole blood samples for genotyping and gene expression analysis. Blood gene expression profiling was conducted using Affymetrix Human Genome U219 Array for 744 samples in the ADNI2/ADNI-GO (Grand Opportunity) phase. Blood genotyping and gene expression methods for ADNI have been described in detail by Saykin et al. [29] The ADNI studies were approved by institutional review boards of all participating institutions, and written informed consent was obtained from all participants or authorized representatives [29]. The study protocol obtained approval from the human research committees at each involved institution, and informed consent was provided by all participants or their legal guardian(s)/legally authorized representatives. The research procedures strictly adhered to applicable guidelines and regulations [54,55]. Table 4 presents a summary of the genotyping data provided by ADNI.
We selected sample data from 626 participants that had both SNP data and gene expression data for training, testing, and validation in the deep learning model. Out of 626 participants, 212 were CN, 317 participants had MCI and 97 were diagnosed with dementia due to AD. The criteria for AD and MCI diagnosis in ADNI are detailed by Petersen [50].
Genotype data.
Genotyping data were downloaded in PLINK binary format, consisting of .bed, .bim and .fam files. The .bed file is the primary representation of genotype calls of biallelic variants. The .bim file accompanies the .bed file and provides extended variant information, i.e., SNP IDs, base-pair coordinates, and allele information. The .fam file provides sample information, including parent roster IDs and phenotype. We utilized the .bim file to identify the presence of variants and encoded SNPs using additive representation (i.e., 0 = homozygous dominant, 1 = heterozygous, 2 = homozygous recessive). We used the dbSNP ID as the unique identifier for SNPs. We then mapped SNPs that occur in coding sequences (CDS) to their corresponding gene loci. Input values represent the aggregate additive value of all SNPs mapped to the coding sequences of individual genes. The SNP data along with gene expression data represent two of the inputs for the model.
Clinical data.
Clinical data include demographic information, scores from neuropsychological tests, brain volume measurements, and levels of clinical biomarkers of AD, such as Aβ, tau protein, and FDG uptake. Patient demographic information includes age, gender, ethnicity, racial category, marital status, and years of education. The demographic data were encoded as categorical variables before model training. Genotype information (ie. presence of APOE ϵ4) is also included. Brain functioning as measured by fluorodeoxyglucose (FDG), PIB (Pittsburgh compound B), and amyloid detection ligand (AV45) uptake with PET are reported. Cerebrospinal fluid (CSF) biomarker levels of Aβ, tau, and P-tau protein have also been recorded. Additionally, the clinical data reports patient scores from a battery of neuropsychological tests. A Functional Activities Questionnaire (FAQ) assesses the patient’s level of independence to perform daily tasks. Everyday cognitive evaluations (Ecog) of the patient’s ability to carry out everyday tasks are reported by the patient (self) and a study partner. Reported scores for Mini-Mental State Exam (MMSE), Montreal Cognitive Assessment (MOCA), Rey’s Auditory Verbal Learning Test (RAVLT), Alzheimer’s Disease Assessment Scale (ADAS), Modified Preclinical Alzheimer Cognitive Composite (mPACC), Digit Span memory test (DIGITSCOR), Trail Making test (TRABSCOR) and Logical Memory Delayed Recall Total Number of Story Units Recalled (LDELTOTAL) are used to estimate the severity and progression of cognitive and memory impairment. MRI measurements of hippocampal, intracranial, mid-temporal, fusiform, ventricle, entorhinal, and whole brain volume are also reported.
In the ADNI clinical data, PIB levels, Ecog scores, Aβ levels, tau protein levels, and DIGITSCOR were missing for a substantial proportion of patients. Therefore, we utilized k-nearest neighbors (kNN) imputation for handling missing data. kNN imputation selects k subjects that are similar to the subject with missing values and is preferred for its ability to handle continuous, categorical, and discrete data in our dataset. Moreover, kNN imputation is shown to improve multiclass prediction of disease in ADNI. [32] We performed imputation in R v4.2.1 using DMwR (data mining with R) package v0.0.2, on clinical features with less than 60% missing data. We set k = 5 as the minimum number of neighbors from which the missing values could be inferred. Imputation was performed on the training dataset and then applied to the test and validation datasets. Only clinical features measured at the time of diagnosis were retained, ie., baseline features were removed. The target variable (i.e., diagnosis of CN, MCI, or AD) was not included in the procedure.
The proposed c-Triadem model
The proposed c-Triadem is a constrained artificial neural network multiclass classifier which aims to accurately predict the patient’s status as CN, MCI, or AD using their genotyping, gene expression, and clinical data. The data pre-processing, model development, and interpretation steps involved in c-Triadem are depicted in the block diagram of Fig 6. In particular, in c-Triadem, the genotype and gene expression inputs consist of nodes that represent genes and are fed into constrained subnetworks. The nodes and edges of the hidden layers in the two subnetworks (for genotype and gene expression) represent Reactome pathways and their biological relationships, respectively. We chose to incorporate Reactome into our deep learning model due to its hierarchical organization and a data model which makes pathways computationally accessible [51]. Reactome data was provided by Elmarakeby et al. [52] in their repository (https://zenodo.org/record/5163213#.Y7wZgNVBxPY). By representing the genes, pathways, and connections among them as nodes and edges in the subnetworks, we can better understand the biological connections which are important for the prediction. Thus, the model’s interpretability is enhanced compared to a dense network with similar architecture.
An overview of data preprocessing, model development, and interpretation is presented in the flow diagram.
Each subnetwork consists of one input layer with 10,151 nodes representing the genes and three non-trainable hidden layers with nodes representing pathways. The sub-network layers are non-trainable as they represent biological pathways from the Reactome pathway database, with their structure predefined. These layers encode prior biological knowledge, constraining the model to reflect known relationships among genes and pathways. Due to the constraints on the edges connecting the genes and pathways, the sparsely connected subnetwork has 32,842 parameters. The constraints are encoded as a binary weights matrix which sets all non-existent connections among the genes and pathways to zero.
In addition to the genotyping and gene expression input, a third input of clinical data is provided. The clinical data with 45 nodes are concatenated along with the output of the two subnetworks and passed through a batch normalization layer, followed by two hidden layers. Kernel regularization is applied to both hidden layers. Bayesian hyperparameter optimization was used to configure the hidden layer sizes, initial learning rate, step interval for learning rate schedule, choice of activation function, and kernel regularization of the hidden layers, by monitoring the validation accuracy over 80 epochs. A random state was set beforehand to ensure consistent results. The first hidden layer is configured with 19 nodes and linear activation followed by another hidden layer with 8 nodes and Rectified Linear Units (ReLU) activation. A dropout layer (rate = 0.597) is included in between the hidden layers. Overall, our network contains 44,239 parameters and 15 layers. Excluding the non-trainable parameters in the subnetworks, we have 11,397 trainable parameters for three types of input representing 20,347 features, whereas an unconstrained network with similar architecture contains over 37 million parameters.
Model training
We empirically observed a four-fold reduction in training time for our model (training time of 0 minutes and 18.905 seconds) compared to the unconstrained model (1 minute and 18.862 seconds) on an Intel Core i5 8th Gen CPU (central processing unit) with a clock speed of 1.6-1.8 GHz. Our model was compiled with an adaptive learning rate initially set at 0.008 with exponential decay occurring every 17 steps at a rate of 0.96. Model training was performed with the Adam optimizer to reduce categorical cross-entropy loss. We used Python v3.8 with the Functional API (Application Programming Interface) of keras v2.4.3 to design and train c-Triadem on genotyping, gene expression, and clinical data from ADNI participants. The dataset comprises 212 CN, 317 MCI, and 97 AD samples. We applied a train/validation/test split of 56-14-30%. We used target-based stratification during the test-train split followed by Synthetic Minority Over-sampling Technique (SMOTE) to address the imbalance in the datasets. Min-max scaling was performed on the training data and transferred to the validation and test datasets. The target was one-hot encoded prior to training.
Along with kernel regularization and dropout, early stopping callbacks on validation loss were used to prevent overfitting. We used the default batch size of 32 and set the maximum number of epochs at 120. The output is converted to respective CN, MCI, and AD probabilities by softmax, with the higher probability used for the classification of disease status. A representation of the model architecture is provided in Fig 7.
SnrepresentsthenthsampleandGn represents the nth gene. Dummy values are provided for input data types.
Model performance evaluation
We evaluated the performance of our model on the validation and test datasets. We used the area under the receiver operating characteristic (AUC) curve, accuracy, precision, recall, and the F1 score as performance metrics, and their formulae are listed below. We defined true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN) for each predicted class (CN, MCI, and AD). For instance, for the AD class, TP is the number of AD samples predicted correctly. TN is the number of non-AD samples predicted as MCI or CN. FP is the number of MCI and CN samples predicted as AD. FN is the number of AD samples predicted as MCI or CN.
Model interpretation and identification of potential biomarkers
Model interpretation is essential to gain user trust and overcome the ‘black box’ reputation of deep learning models. Lundberg and Lee [27] proposed SHAP values as a unified measure of feature importance, computed using game theory. To calculate SHAP values, each individual feature’s contribution to the predicted value is estimated by comparing predictions over different combinations of features. The SHAP value for a feature is the average of all the marginal contributions to predictions from all possible feature combinations. To evaluate the importance of specific genes and clinical data, we computed SHAP values on all features using the shap package v0.41.0%.
Validation using simulated data
To validate the model’s robustness and interpretability, we generated a simulated dataset based on the control group (CN) to avoid confounding effects of Alzheimer’s disease-specific trends. The most important features for this multi-class classification task, as identified by the model using SHAP analysis on real data, were selected for modification. This approach aimed to validate whether amplifying feature associations improves classification performance.
The selected features were modified using a random uniform factor within a range tailored to each feature’s observed values in the dataset and supported by findings from the literature. These adjustments were applied to 50% of the subjects [56]. This ensured realistic, data-driven modifications while maintaining biological relevance. The simulated data preserved the statistical properties (mean, variance, and range) of the original dataset, verified using Kolmogorov-Smirnov tests and visual comparisons (e.g., histograms). Randomized changes and noise were introduced to replicate real-world variability without introducing bias. The resulting dataset was evaluated using the same model pipeline.
Interaction network analysis
In order to gain mechanistic insights and validate the reliability of our model, we thoroughly investigated the interaction network among genes that were prioritized by SHAP. Specifically, we extracted the SHAP-prioritized genes from the top 20 features for each group (CN,MCI, and AD). To explore the functional associations between these genes, we employed STRING, a resource that leverages diverse types of evidence, including experimental data, to identify such connections. By examining the interaction network within STRING [53], we aimed to elucidate the intricate relationships and potential collaborative roles among these genes. Moreover, we also conducted an analysis to identify enriched pathways for the selected genes. This allowed us to gain further insights into the biological processes and molecular mechanisms underlying the observed gene interactions. By comprehensively studying the gene interaction network and identifying enriched pathways, we aimed to reinforce the validity and biological relevance of our model’s predictions.
Intermediate layer activation
To investigate and compare the activation values of intermediate layers in our model, we randomly selected inputs from each prediction class, namely cognitively normal (CN), mild cognitive impairment (MCI), and Alzheimer’s disease (AD). In this context, activation refers to the signed outcome of a node based on its inputs. To determine the significance of differences in activation values, we conducted the Kruskal-Wallis statistical test. This analysis allowed us to assess the variations in activations across the different prediction classes in a rigorous and quantitative manner.
Statistical analysis
Significant distinctions in clinical features among cognitively normal (CN), mild cognitive impairment (MCI), and Alzheimer’s disease (AD) subjects were examined through rigorous statistical analyses. The age of onset and years of education were subjected to an analysis of variance (ANOVA) test to assess their significance. Similarly, the proportion of male and female participants, as well as the presence of the APOE ϵ4 allele, were evaluated using the Chi-square contingency test. For the remaining clinical variables, the non-parametric Kruskal-Wallis test was employed. It is important to note that all statistical analyses were conducted with a significance level of 95.
Acknowledgments
Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in the analysis or writing of this report. A complete listing of ADNI investigators can be found at http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf. Reactome data were downloaded from the repository provided by Elmarakeby et al. (2021): https://zenodo. org/record/5163213#.Y7wZgNVBxPY.
References
- 1.
Gauthier S, Rosa-Neto P, Morais J, Webster C. World Alzheimer report 2021: journey through the diagnosis of dementia. Alzheimer’s Disease International; 2021.
- 2. Chintamaneni M, Bhaskar M. Biomarkers in Alzheimer’s disease: a review. ISRN Pharmacol. 2012;2012:984786.
- 3. Reitz C. Genetic diagnosis and prognosis of Alzheimer’s disease: challenges and opportunities. Expert Rev Mol Diagn. 2015;15:339–48.
- 4. Ogomori K, et al. Beta-protein amyloid is widely distributed in the central nervous system of patients with Alzheimer’s disease. Am J Pathol. 1989;134:243–251. pmid:2464938
- 5. Fukutani Y, et al. Neuronal loss and neurofibrillary degeneration in the hippocampal cortex in late-onset sporadic Alzheimer’s disease. Psychiatry Clin Neurosci. 2000;54:523–9.
- 6. Roussarie J-P, et al. Selective neuronal vulnerability in Alzheimer’s disease: a network-based analysis. Neuron. 2020;107:821–835.e12,
- 7. Arriagada P, Growdon J, Hedley-Whyte E, Hyman B. Neurofibrillary tangles but not senile plaques parallel duration and severity of Alzheimer’s disease. Neurology. 1992;42:631–9.
- 8. Jack CR, Jr, et al. NIA-AA research framework: toward a biological definition of Alzheimer’s disease. Alzheimers Dement. 2018;14:535–62.
- 9. Padala SP, Newhouse PA. Blood-based biomarkers in Alzheimer’s disease: a mini-review. Metab Brain Dis. 2023;38:185–193.
- 10. Ovod V, et al. Amyloid β concentrations and stable isotope labeling kinetics of human plasma specific to central nervous system amyloidosis. Alzheimers Dement. 2017;13:841–9.
- 11. Nakamura A. et al. High performance plasma amyloid-β biomarkers for Alzheimer’s disease. Nature. 2018;554:249–54.
- 12. Zetterberg H, Burnham SC. Blood-based molecular biomarkers for Alzheimer’s disease. Mol Brain. 2019;12:26.
- 13. Teunissen CE, et al. Blood-based biomarkers for Alzheimer’s disease: towards clinical implementation. Lancet Neurol. 2022;21:66–77.
- 14. Blennow K, Zetterberg H. Biomarkers for Alzheimer’s disease: current status and prospects for the future. J Intern Med. 2018;284:643–63.
- 15.
Lashley T, et al. Molecular biomarkers of Alzheimer’s disease: progress and prospects. Dis Model Mech. 2018;11:dmm031781. https://doi.org/10.1242/dmm.031781
- 16. Bhagavati S. Commentary: Diagnostic accuracy of blood-based biomarker panels: a systematic review. Front Aging Neurosci. 2022;14:895398.
- 17. Eke CS, et al. Early detection of Alzheimer’s disease with blood plasma proteins using support vector machines. IEEE J Biomed Health Inform. 2021;25:218–26.
- 18. Salech F, et al. Senescence markers in peripheral blood mononuclear cells in amnestic mild cognitive impairment and Alzheimer’s disease. Int J Mol Sci. 2022;23:9387.
- 19. Zettergren A, et al. Association between polygenic risk score of Alzheimer’s disease and plasma phosphorylated tau in individuals from the Alzheimer’s disease neuroimaging initiative. Alzheimers Res Ther. 2021;13:17.
- 20. Stevenson-Hoare J, et al. Plasma biomarkers and genetics in the diagnosis and prediction of Alzheimer’s disease. Brain. 2023;146:690–9.
- 21. Reddy JS, et al. Transcript levels in plasma contribute substantial predictive value as potential Alzheimer’s disease biomarkers in African Americans. EBioMedicine. 2022;78:103929.
- 22. Ji W, An K, Wang C, Wang S. Bioinformatics analysis of diagnostic biomarkers for Alzheimer’s disease in peripheral blood based on sex differences and support vector machine algorithm. Hereditas. 2022;159:38.
- 23.
Stamate D, et al. A metabolite-based machine learning approach to diagnose Alzheimer-type dementia in blood: results from the European Medical information framework for Alzheimer disease biomarker discovery cohort. Alzheimers Dement. (N.Y.) 2019;5:933–8. https://doi.org/10.1016/j.trci.2019.11.001
- 24. Morgan AR, et al. Inflammatory biomarkers in Alzheimer’s disease plasma. Alzheimers Dement. 2019;15:776–87.
- 25. O’Bryant SE, et al. A blood-based algorithm for the detection of Alzheimer’s disease. Dement Geriatr Cogn Disord. 2011;32:55–62.
- 26. Qiu S, et al. Development and validation of an interpretable deep learning framework for Alzheimer’s disease classification. Brain. 2020;143:1920–33.
- 27.
Lundberg S, Lee S-I. A unified approach to interpreting model predictions. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17. Red Hook, NY: Curran Associates Inc.; 2017, pp. 4768–-77.
- 28. Yang CC. Explainable artificial intelligence for predictive modeling in healthcare. J Healthc Inform Res. 2022;6:228–39.
- 29. Saykin AJ, et al. Genetic studies of quantitative MCI and AD phenotypes in ADNI: progress, opportunities, and plans. Alzheimers Dement. 2015;11:792–814.
- 30. Venugopalan J, Tong L, Hassanzadeh HR, Wang MD. Multimodal deep learning models for early detection of Alzheimer’s disease stage. Sci Rep. 2021;11:3254.
- 31. Zhou T, Thung K-H, Zhu X, Shen D. Effective feature learning and fusion of multimodality data using stage-wise deep neural network for dementia diagnosis. Hum. Brain Mapp. 2019;40:1001–16.
- 32. Aghili M, Tabarestani S, Adjouadi M. Addressing the missing data challenge in multi-modal datasets for the diagnosis of Alzheimer’s disease. J Neurosci Methods. 2022;375:109582.
- 33. Avrutsky MI, Troy CM. Caspase-9: a multimodal therapeutic target with diverse cellular expression in human disease. Front Pharmacol. 2021;12:701301.
- 34. Rohn TT, et al. Caspase-9 activation and caspase cleavage of tau in the Alzheimer’s disease brain. Neurobiol Dis. 2002;11:341–54.
- 35. Zhao S, Zhao J, Zhang T, Guo C. Increased apoptosis in the platelets of patients with Alzheimer’s disease and amnestic mild cognitive impairment. Clin Neurol Neurosurg. 2016;143:46–50.
- 36. Blacker D, et al. Results of a high-resolution genome screen of 437 Alzheimer’s disease families. Hum Mol Genet. 2003;12:23–32.
- 37. Zhong W, et al. Lymphocyte-specific protein tyrosine kinase is a novel risk gene for alzheimer disease. J Neurol Sci. 2005;238:53–57.
- 38. Kim E-J, et al. Alzheimer’s disease risk factor lymphocyte-specific protein tyrosine kinase regulates long-term synaptic strengthening, spatial learning and memory. Cell Mol Life Sci. 2013;70:743–59.
- 39. Hudák A, Letoha A, Vizler C, Letoha T. Syndecan-3 as a novel biomarker in Alzheimer’s disease. Int J Mol Sci. 2022;23.
- 40. Zeng K, et al. Defective mitophagy and the etiopathogenesis of Alzheimer’s disease. Transl Neurodegener. 2022;11:32.
- 41. Castellazzi M, et al. Autophagy and mitophagy biomarkers are reduced in sera of patients with Alzheimer’s disease and mild cognitive impairment. Sci Rep. 2019;9:20009.
- 42. Moradi E, Hallikainen I, Hänninen T, Tohka J, Alzheimer’s Disease Neuroimaging Initiative. Rey’s auditory verbal learning test scores can be predicted from whole brain MRI in Alzheimer’s disease. NeuroImage Clin. 2017;13:415–27.
- 43.
Phongpreecha T, et al. Single-cell peripheral immunoprofiling of Alzheimer’s and Parkinson’s diseases. Sci Adv. 2020;6:eabd5575. https://doi.org/10.1126/sciadv.abd5575
- 44.
Josephy-Hernandez S, et al. Pharmacological interrogation of TrkA-mediated mechanisms in hippocampal-dependent memory consolidation. PLoS One. 2019;14:e0218036, doi: https://doi.org/10.1371/journal.pone.0218036
- 45.
Garfias S, et al. Peripheral blood lymphocyte phenotypes in Alzheimer and Parkinson’s diseases. Neurol. (Engl. Ed.) 2022;37:110–21. https://doi.org/10.1016/j.nrleng.2018.10.022
- 46. Lovestone S, et al. AddNeuroMed—the European collaboration for the discovery of novel biomarkers for Alzheimer’s disease. Ann N Y Acad Sci. 2009;1180:36–46.
- 47. Sood S, et al. A novel multi-tissue RNA diagnostic of healthy ageing relates to cognitive health status. Genome Biol. 2015;16:185.
- 48. Absalon S, Kochanek DM, Raghavan V, Krichevsky AM. MiR-26b, upregulated in Alzheimer’s disease, activates cell cycle entry, tau-phosphorylation, and apoptosis in postmitotic neurons. J. Neurosci. 2013;33:14645–59.
- 49. Bommhardt U, Schraven B, Simeoni L. Beyond TCR signaling: emerging functions of LCK in cancer and immunotherapy. Int J Mol Sci. 2019;20:3500.
- 50. Petersen RC, et al. Alzheimer’s disease neuroimaging initiative (ADNI): clinical characterization. Neurology. 2010;74:201–9.
- 51. Sidiropoulos K, et al. Reactome enhanced pathway visualization. Bioinformatics. 2017;33:3461–7.
- 52. Elmarakeby HA, et al. Biologically informed deep neural network for prostate cancer discovery. Nature. 2021;598:348–52.
- 53. Szklarczyk D, et al. The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 2021;49:D605–12.
- 54. van der Haar D, et al. An Alzheimer’s disease category progression sub-grouping analysis using manifold learning on ADNI. Sci Rep. 2023;13:10483.
- 55. Shishegar R. Using imputation to provide harmonized longitudinal measures of cognition across AIBL and ADNI. Sci Rep. 2021;11:23788.
- 56. Bhattarai P, Thakuri DS, Nie Y, Chand GB. Explainable AI-based Deep-SHAP for mapping the multivariate relationships between regional neuroimaging biomarkers and cognition. Eur J Radiol. 2024;174:111403.