Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Explainability of three-dimensional convolutional neural networks for functional magnetic resonance imaging of Alzheimer’s disease classification based on gradient-weighted class activation mapping

  • Boyue Song ,

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft

    258005p@gs.kochi-tech.ac.jp

    Affiliation Graduate School of Engineering, Kochi University of Technology, Kami City, Kochi Prefecture, Japan

  • Shinichi Yoshida,

    Roles Funding acquisition, Resources, Supervision, Writing – review & editing

    Affiliation School of Information, Kochi University of Technology, Kami City, Kochi Prefecture, Japan

  • for the Alzheimer’s Disease Neuroimaging Initiative

    Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wpcontent/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf

Abstract

Currently, numerous studies focus on employing fMRI-based deep neural networks to diagnose neurological disorders such as Alzheimer’s Disease (AD), yet only a handful have provided results regarding explainability. We address this gap by applying several prevalent explainability methods such as gradient-weighted class activation mapping (Grad-CAM) to an fMRI-based 3D-VGG16 network for AD diagnosis to improve the model’s explainability. The aim is to explore the specific Region of Interest (ROI) of brain the model primarily focuses on when making predictions, as well as whether there are differences in these ROIs between AD and normal controls (NCs). First, we utilized multiple resting-state functional activity maps including ALFF, fALFF, ReHo, and VMHC to reduce the complexity of fMRI data, which differed from many studies that utilized raw fMRI data. Compared to methods utilizing raw fMRI data, this manual feature extraction approach may potentially alleviate the model’s burden. Subsequently, 3D-VGG16 were employed for AD classification, where the final fully connected layers were replaced with a Global Average Pooling (GAP) layer, aimed at mitigating overfitting while preserving spatial information within the feature maps. The model achieved a maximum of 96.4% accuracy on the test set. Finally, several 3D CAM methods were employed to interpret the models. In the explainability results of the models with relatively high accuracy, the highlighted ROIs were primarily located in the precuneus and the hippocampus for AD subjects, while the models focused on the entire brain for NC. This supports current research on ROIs involved in AD. We believe that explaining deep learning models would not only provide support for existing research on brain disorders, but also offer important referential recommendations for the study of currently unknown etiologies.

Introduction

Functional magnetic resonance imaging (fMRI) has been widely used for brain mapping research since the 1990s because of its ability to provide detailed functional information about the brain without requiring injections, surgery, or exposure to ionizing radiation [1]. Despite its significant advantages such as non-invasiveness, functional specificity, and high spatial resolution, fMRI is not commonly used for clinical diagnosis because of its susceptibility to noise and the complexity of its data [2, 3]. Since the useful signal variation in fMRI is generally only 2–5% of the signal strength, even a slight amount of noise can significantly affect the quality of data. Moreover, due to the relatively high temporal and spatial resolutions of fMRI, the complexity of its data is extremely high, which is another primary reason why fMRI data is hard to be directly utilized for medical analysis and diagnosis. Therefore, various statistical methods, such as independent components analysis (ICA) [4, 5] and general linear model (GLM) [6, 7], are often used to reduce data complexity and extract useful information from fMRI data. Tang et al. [8] proposed a method in which each brain was registered to MNI standard space and subdivided into 90 regions, and the regional time series were obtained by calculating the average fMRI signal across all voxels in each region. After that, functional connectivity between each pair of regions by calculating the Pearson correlation coefficient (PCC). With this method, features with good discriminative power were extracted and used to finish classification. In addition to general statistical methods, researchers have developed various dimensionality reduction techniques for fMRI data, such as the amplitude of low-frequency fluctuation (ALFF) [9], fractional ALFF (fALFF) [9], regional homogeneity (ReHo) [10], and voxel-mirrored homotopic connectivity (VMHC) [11]. With these methods, temporal information in the fMRI is compressed to generate only one brain volume, which significantly reduces the complexity of the data. In addition, different methods can be used to extract useful information from various perspectives.

However, traditional statistical methods require handcrafted feature extraction, which can result in inefficiencies and occasional errors. Recent advancements in computer hardware processing power and innovations in new graphical software have led researchers to focus increasingly on automation informed by deep learning. The combination of raw fMRI data and convolutional neural networks (CNNs) has recently enabled the automatic classification of various neurological diseases. As mentioned in [12], AD is the most common cause of dementia among older adults, a progressive disorder that starts with mild symptoms and worsens progressively. The progress from NC to AD can be subdivided into four stages, namely significant memory concern (SMC), early mild cognitive impairment (EMCI), mild cognitive impairment (MCI), late mild cognitive impairment (LMCI). Approximately 96.85% accuracy was achieved in classifying fMRI data from patients with AD compared to NC subjects [13]. The fMRI data were preprocessed using a standard pipeline, sliced into two-dimensional images from axial view and time axes, and randomly divided into training and testing datasets. LeNet-5 was used for classification. In a previous study [12], a similar pre-processing pipeline was employed; however, residual neural networks [14] were utilized to classify AD. By fine-tuning transfer learning, an accuracy of 97.88% was achieved. A modified three dimensional (3D) CNN has been applied in resting-state fMRI data [2], in which four-dimensional fMRI data were sliced along the time axis to aid training. This method was approximately 98.96% accurate for AD. [15] developed a robust low-cost neural network classification system for AD and Mild Cognitive Impairment (MCI) against NC using a CNN with input images based on diffusion maps and gray-matter volumes, achieving competitive results of 93.5% for AD/NC classification. In another study relating raw MRI data [16], the authors employed ResNet-50 and LeNet on AD classification based on MRI slices in three views and categories. The study demonstrated that the selecting slices performed better than using entire slices in MRI images for AD classification and the coronal view showed higher accuracy. Although these methods demonstrate high accuracy, their models lack explainability, thereby diminishing the credibility of the model due to the inability to explain its predictions.

Deep learning models extract features from the input data based on labeled data distributions and then make predictions based on these features. The use of raw fMRI data would evidently increase the difficulty in extracting useful information for the model, which would subsequently affect performance. In some studies, statistical methods or other means have been employed to reduce the complexity of raw fMRI data as a preprocessing step to reduce the difficulty of feature extraction by the model. A framework for the early diagnosis of AD has been developed using deep neural networks and various medical information, in which functional brain networks were constructed from resting-state fMRI signal correlations and used as correlation coefficient data to train the neural network [17]. Similarly, [18] proposed a deep learning-based method to realize binary classification between each pair of the different stages of AD. The accuracy exceeded 99%. In addition, [19] reduced the dimensionality of fMRI data by extracting features as 3D spatial maps for classifying resting-state fMRI images using a 3D-CNN. An accuracy of 85.27% was obtained for the binary classification of AD versus NC. Dimensionality reduction method have not only been applied to AD, but also to schizophrenia. In [20], Group ICA was considered as a preprocessing step for extracting ICA components from a schizophrenia dataset, and 3D-CNN was employed to complete the classification. Furthermore, 98.09% ten-fold cross-validated classification accuracy was achieved. In [3], fMRI images were preprocessed, and functional connectivity analysis was used to extract features. Subsequently, 3D-CNN and a long short-term memory recurrent network were utilized to extract spatial and temporal information for classifying functional activity maps. They achieved an accuracy of 92.32% for the Center for Biomedical Research Excellence dataset [21]. [22] investigates the utility of correlated transfer function (CorrTF) as a novel biomarker for extracting crucial features from resting-state fMRI data. Employing a support vector machine (SVM) in hierarchical and flat multi-classification schemes, the research achieved competitive results of 98.2% for distinguishing between various stages of AD. In our opinion, the more detailed the manually extracted features are in the entire classification task, the simpler the process of automatic feature extraction required by the deep learning models; thus, the performance of the models may be better. However, we cannot guarantee that the manually extracted features are precisely the features required by the model for classification. Therefore, there is a trade-off between manually pre-extracting features and allowing the model to automatically extract features.

Although deep learning has made significant achievements in various fields, it is frequently referred to as a black box because of its lack of explainability, which means that the underlying reasons for a given prediction cannot be ascertained. This holds true whether the prediction is accurate or not and can greatly impact the reliability of the model, particularly in clinical diagnosis. A method called class activation mapping (CAM) was proposed in 2016 to visualize the model [23]. In this approach, the final fully connected layer is replaced by a global average pooling (GAP) layer and feature maps from the last convolutional layer are used to visualize the model. A novel method called Gradient-weighted CAM (Grad-CAM), which builds upon CAM by combining the gradients of the gradient descent algorithm with the feature maps from the final convolutional layer was introduced in 2017 [24]. Subsequently, multiple CAM-based methods have been proposed [2531], and the explainability of models has become an increasingly important direction in computer vision.

In recent years, visualization techniques have been employed to explain deep learning models based on MRI images. In 2019, [32] proposed using layer-wise relevance propagation (LRP) to visualize CNN decisions for AD based on structural MRI (sMRI) data. The results showed that a lot of importance is put on areas in the temporal lobe including the hippocampus. [33] also proposed a CNN for the detection of AD based on sMRI. In this work, the association of relevance scores and hippocampus volume were evaluated to validate the clinical utility. A high accuracy (AUC ≥ 0.91) was achieved for AD versus NC. Relevance maps indicated that hippocampal atrophy was found the most informative factor for AD detection. [34] proposed a 3D-CNN framework using a spatial source phase (SSP) maps derived from complex-valued fMRI data to classify schizophrenia patients (SZ) and NC. Grad-CAM was utilized to localized all contributing ROIs with opposite stengths for SZ and NC. [35] employed CNN trained on three orthogonal views of cerebral regions, specifically the hippocampi, amygdalae, and insulae, to stage the AD spectrum, including preclinical AD, MCI, AD, and NC, using patched from structured MRI. The performance is comparable to state-of-art methods, showcasing the potential of patch-based region of interest (ROI) ensembles in providing informative landmarks for MRI analysis. In addition to MRI images, [36] presented a deep learning system designed to automatically identify four visually explainable signs of emphysema in frontal and lateral chest radiographs, providing explainable labels for the detected signs. [37] leverages a neural network model trained on synthetic NaI(Tl) urban search data to assess and adapt explanation methods for gamma-ray spectral data. It highlights the superior accuracy of black box methods, specifically LIME and SHAP, with a preference for SHAP due to its minimal hyperparameter tuning.

Extensive research have explored the application of fMRI based deep neural networks to diagnose neurological disorders. Despite this, only a limited number of studies have provided results regarding explainability. In this study, we applied several CAM methods to two 3D-VGG16 models, which were used to classify patients with AD and NC based on four types of 3D resting-state functional activity maps. AD is a neurological disorder characterized by the degeneration of memory-related neurons in the brain. As the disease progresses, different regions of the brain exhibit varying patterns of blood oxygenation levels [38, 39]. Blood oxygenation level refers to the proportion of oxygen bound hemoglobin, which can be used to infer brain activity, as neural activity induces alterations in local blood flow and oxygenation levels [40]. We hypothesized that fMRI captures blood oxygenation patterns in AD-affected ROIs. Deep learning models classify AD stages using these patterns and generate Grad-CAM heatmaps to identify the affected ROIs. The use of heatmaps to explain the model’s focus on specific ROIs can not only assist and support researchers in studying diseases with known and unknown causes. Besides, analyzing the heatmaps of cases, where the model made prediction errors, can help improve the performance of the model. The main aims of our work are as follows:

  • To utilize several resting-state functional activity maps as dataset instead of raw fMRI data, which can manually assist the model in extracting pertinent information
  • To replace fully connected layers with GAP layer in the model serves to preserve spatial information within feature maps, mitigate overfitting, and enhance model performance
  • To employ 3D CAM methods on an fMRI-based 3D-VGG16 model for AD diagnosis, validating the model’s efficacy, and identifying specific ROIs as the basis for classification, potentially indicating AD lesions

First, we introduce the dataset used, as well as the preprocessing steps, deep learning frameworks, and Grad-CAM method in the Materials and methods section. In the Results section, we present the experimental results, including the model’s performance and 3D Grad-CAM heatmaps of 3D-VGG16 networks. In the Discussion section, the performance of the models and the Grad-CAM results are discussed. The final section concludes with a summary of the main findings and contributions of the study.

Materials and methods

Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset and preprocessing

Here, we introduce the dataset used in our study, as well as the preprocessing steps, deep learning frameworks. The program is publicly available at https://github.com/Neutrino000/3D-VGG-ADNI. Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). All ADNI studies are conducted according to the Good Clinical Practice guidelines, the Declaration of Helsinki, and U.S. 21 CFR Part 50 (Protection of Human Subjects) and Part 56 (Institutional Review Boards). Written informed consent was obtained from all participants before protocol-specific procedures were performed. The Institutional Review Boards approved the ADNI protocol of all participating institutions; for up-to-date information, see www.adni-info.org. Based on multiple scans obtained at various time points for each subject, the dataset for our study consisted of 163 scans of fMRI data from 50 NCs and 105 scans of fMRI data from 34 patients with AD, which implies that some subjects possess multiple sets of scan data. Table 1 presents some characteristics of the ADNI dataset. The small size of datasets is a common issue in medical data. Therefore, the typical practice is to include all samples [19, 4143], which may result in data imbalance. However, larger datasets often offer better generalization performance, leading to a trade-off. A standardized preprocessing pipeline is employed to process the ADNI dataset. The pipeline included various steps to remove noise and improve generality.

thumbnail
Table 1. Characteristics of the ADNI dataset.

https://doi.org/10.1371/journal.pone.0303278.t001

First, the dataset was converted from the Digital Imaging and Communications in Medicine (DICOM) format to Neuroimaging Informatics Technology Initiative (NIFTI) format using the dcm2niix toolbox [44]. Subsequently, Data Processing and Analysis for Brain Imaging (DPABI) [45] on the MATLAB 9.12.0 (2022a) platform were applied to the remaining preprocessing steps. Brain extraction was performed using anatomical and functional images. Subsequently, temporal adjustment was achieved through slice-timing correction, while the influence of head motion on data acquisition was removed by motion correction. In addition, the entire dataset was subjected to intensity normalization to ensure that the mean intensity remained consistent and uniform. Spatial registration was then conducted to align the fMRI images from the participants’ individual spaces to the standard space of the MNI152 template. Finally, a 4-mm full-width at half-maximum (FWHM) cubic Gaussian filter was used for spatial smoothing during the application of deconvolution to the 3D images. The standard preprocessing pipeline is shown in Fig 1A.

thumbnail
Fig 1. Overview of the proposed 3D-VGG16-GAP in AD classification.

A. The standard preprocessing pipeline used in this study. B. Overview of the proposed 3D-VGG16 network and Grad-CAM methods. C. Framework of 3D-VGG16 and 3D-VGG16-GAP.

https://doi.org/10.1371/journal.pone.0303278.g001

After all the preprocessing steps was completed, a data matrix of sized 61 × 73 × 61 × 140 was obtained for each participant, where each volume consisted of 61 × 73 × 61 data points recorded over 140 time points.

Resting-state functional activity maps

fMRI is an imaging technique that can capture information with high spatial resolution and relatively high temporal resolution. It can detect activity in the ROIs within a spatial range of a few millimeters with a temporal resolution of several seconds to tens of seconds. However, the high-dimensional images indicate the high complexity of the data, which limits accurate analysis and description. Although deep learning excels at extracting information from massive amounts of data, highly complex data can still affect the performance of the model.

In this study, four different resting-state functional activity maps-ALFF [9], fALFF [9], ReHo [10], and VMHC [11, 46, 47] were extracted from resting-state fMRI data which describe fMRI data from different aspects but with lower complexity [3]. All resting-state functional activity maps were obtained using DPABI. Fig 2 showed these four kinds of resting-state functional activity maps between NC and AD subjects.

thumbnail
Fig 2. Resting-state functional activity maps.

https://doi.org/10.1371/journal.pone.0303278.g002

ALFF.

After preprocessing, the fMRI data were temporally band-pass filtered (0.01 < f < 0.1Hz) to eliminate low-frequency noise from drift and high-frequency noise from respiratory and cardiac activity. Using the fast Fourier transform (FFT), the time series of each voxel were transformed into the frequency domain: (1)

Since the power of a specific frequency is proportional to the square of the corresponding amplitude, the mean square root of the power spectrum across a frequency range of 0.01 − 0.1Hz for each voxel was computed as follows: (2)

fALFF.

fALFF is a variant form of ALFF, which can further reduce the physiological noise by considering the ratio of each frequency (0.01 < f < 0.1Hz) to the total frequency range. In addition, the application of fALFF enhances both the sensitivity and specificity of detecting spontaneous activity. fALFF is calculated as follows: (3)

ReHo.

ReHo measures the similarity of the time series of a given voxel to those of its nearest neighbors in a voxel-wise manner using Kendall’s coefficient concordance (KCC) [48]. ReHo is calculated as follows: (4) in which Ri is the number rank of the ith time point; is the mean of Ri; K is the number of time series within a measured cluster (K = 27 in our study, which means one given voxel plus the number of its neighbors); and n is the number of ranks.

VMHC.

VMHC is a method employed in the analysis of fMRI scans that enables the assessment of functional similarity between the two hemispheres of the brain. To generate VMHC maps, the fMRI data of each participant were used to compute the PCC between a particular voxel and its corresponding voxel in the opposite hemisphere, followed by the application of Fisher’s z-transform to enhance the normality of the values. It is calculated as follows: (5)

Deep learning framework

After obtaining all the resting-state functional activity maps, 3D VGG was used to finish feature extraction and classification.

Three-dimensional VGG.

VGG16 [49] is a deep CNN model developed by the Visual Geometry Group. It is part of the VGG family of models and was designed for image classification tasks. In this study, a 3D version of the VGG16 model was used, as illustrated in Fig 1B and 1C.

The 3D-VGG16 model consisted of 16 layers, including 13 3D convolutional layers and three fully connected layers. The entire model was divided into five VGG blocks, with the first two VGG blocks containing two convolutional layers each, and the last three VGG blocks containing three convolutional layers each. There was a max-pooling layer with a 2 × 2 × 2 filter and stride of 2 × 2 × 2 voxels at the end of each VGG block. The convolutional layers were composed of 3 × 3 × 3 filters with a stride of one voxel and padding of one voxel, each of which was followed by a batch normalization layer and rectified linear unit (ReLU) as the activation layer. The fully connected layers had 2, 048 and 1, 024 units separately, and the final layer was a softmax layer with two units corresponding to the two AD and NC classes.

Global average pooling layer.

Since the CAM method [23] must expectedly be used for the explainability of 3D-VGG16 in our study, we replaced the fully connected layers at the end of the model with a global average pooling layer (GAP) layer [50], which is a necessary structure for CAM method. It is worth noting that GAP layer is not required for other CAM methods, such as Grad-CAM. The model with the GAP layer can retain its remarkable localization ability until the final layer and easily identify discriminative image regions. In addition, GAP layer can also be employed to prevent overfitting due to the reduced number of parameters [51]. As shown in Fig 1C, the GAP layer computes the average value of each feature map unit in the last convolutional layer, which is then combined using a weighted sum to produce the final output of the model. In our experiment, we employed five CAM methods, including the original CAM [23], Grad-CAM [24], Grad-CAM plus plus [25], Eigen-CAM [26] and Eigen Grad-CAM. However, there was little difference between the heatmaps generated using the different CAM methods. Therefore, owing to space limitations, only the results obtained using the Grad-CAM method are presented in the Results section.

Grad-CAM

Grad-CAM [24] is an improved method based on CAM. The gradient information that flowed into the last convolutional layer of the model was used to determine the significance of each neuron in making predictions. In deep learning models, the features extracted by convolutional layers become increasingly high-level as they progress deeper into the network. Therefore, we chose to utilize the feature maps corresponding to the last convolutional layer. It is worth noting that Grad-CAM can be applied beyond the last convolutional layer in the neural network architecture, allowing for its utilization across multiple layers for visualizing the importance of features.

First, the gradients of the score for a particular class, c, which was denoted by, yc, was computed with respect to the feature maps, Ak, of the last convolutional layer, i.e. ∂yc/∂Ak in the last convolutional layer. Subsequently, global average pooling was employed on the gradients to calculate the neuron importance weights, , as follows: (6)

The weight, , represents the importance of the feature map, k, in predicting the target class, c. Z signifies the number of elements in Ak.

Subsequently, a weighted combination of forward feature maps was applied. Finally, ReLU was employed to remove the negative values as follows: (7)

A coarse heat-map was generated with the same resolutions as the feature maps of the final convolutional layer. Once the coarse Grad-CAM heat map was obtained, bilinear interpolation algorithms are required to match the resolutions of the original images, which enables the visualization and comparison of the results in a more intuitive manner.

Results

Experimental setup

The models were built and trained using Python 3.6 with pytorch 1.9.0. on a Linux machine with 512 GB RAM and 32 GB NVIDIA GPU card. To compare the discriminative power, each functional activity map was trained as a dataset individually, aside from combining maps trained as the dataset. For the combined maps, we combine each individual functional activity maps together in a manner akin to assembling RGB channels in natural images.

The Adam optimizer that has a learning rate of 5 × 10−5 was utilized. Due to the small size of ADNI dataset, a small value of 4 is set as batch size, which could enhance the model’s generalization by introducing more randomness in each batch to help prevent overfitting and improve model’s performance on unseen data. Furthermore, due to the limited exposure to data in each batch, the model is compelled to acquire more generalized features rather than memorizing specific samples from the entire dataset. The decay rate of the weights was set to 5 × 10−4. Given that various train/test splits result in dramatically different rankings of models [52, 53], each model was trained 10 times to obtain the average accuracy for the generality and robustness of the model. For every training session, 80% of the data was randomly selected as the training set, whereas 20% was randomly selected as the testing set for prediction. It is noteworthy that, given the presence of multiple scans for certain subjects, subject-level data split is employed to prevent data leakage. Furthermore, each scan corresponds to only one set of resting state functional activity maps. Finally, for normal 3D-VGG16 without a GAP layer, a dropout layer was applied in each fully connected layer at a rate of 0.5.

Evaluation of deep learning models

As shown in Tables 2 and 3, the accuracies of the combined maps were 91.1% and 96.4% for 3D-VGG16 and 3D-VGG16-GAP, respectively. When a single functional activity map was used as dataset, the ReHo map achieved the highest accuracy of 87.5% on 3D-VGG16, whereas the ALFF map and ReHo maps achieved the highest accuracy of 91.1% on 3D-VGG16-GAP. Combining maps achieved the highest average accuracies of 84.1% and 87.9% for both two models, respectively, which may be due to the most comprehensive information contained in the dataset. For 3D-VGG16, both the ALFF and ReHo maps achieved an average accuracy of approximately 80%, the VMHC map obtained an accuracy of approximately 72%, and the fALFF map had the lowest accuracy at 66.8%. For 3D-VGG16-GAP, the average accuracy of the ALFF map at 84.7% was second only to that of the combined maps. The ReHo map achieved an accuracy of approximately 82%, which was 7% higher than that of the VMHC map. The accuracy of the fALFF map was the lowest among the functional activity maps.

thumbnail
Table 2. Accuracy of 3D-VGG16 with ten training runs.

https://doi.org/10.1371/journal.pone.0303278.t002

thumbnail
Table 3. Accuracy of 3D-VGG16-GAP with ten training runs.

https://doi.org/10.1371/journal.pone.0303278.t003

Overall, 3D-VGG16-GAP performed better than 3D-VGG16, which may be due to the effective reduction in the parameters and suppression of overfitting by replacing the fully connected layers with the GAP layer [52]. In addition, the combined maps achieved the highest accuracy for both models. For the single functional activity map, the performances of the ALFF and ReHo maps were better than that of the VMHC map, whereas the fALFF map had the lowest accuracy. The confusion matrices of the different maps of the two models are shown in Fig 3.

thumbnail
Fig 3. Confusion Matrix of AD and NC for ReHo map on 3D-VGG16 (a), combined maps on 3D-VGG16 (b), ALFF map on 3D-VGG16-GAP (c), and combined maps on 3D-VGG16-GAP (d).

https://doi.org/10.1371/journal.pone.0303278.g003

Explainability of deep learning models based on Grad-CAM

In this section, 3D version of the Grad-CAM heatmaps of the two models are presented. It must be noted that in our experiment, we employed five CAM methods, including the original CAM [23], Grad-CAM [24], Grad-CAM plus plus [25], Eigen-CAM [26] and Eigen Grad-CAM. However, there was little difference between the heatmaps generated using the different CAM methods. Therefore, owing to space limitations, only the results obtained using the Grad-CAM method are presented.

Similar to some studies that proposed CAM methods [2326], we used the heatmaps shown in Figs 4 and 5 to visually demonstrate the results of Grad-CAM. Because the coarse heat map has the same resolution as the feature map of the last convolutional layer, which is only 14 × 14 × 14, it is difficult to compare it visually with the original image. Therefore, bilinear interpolation algorithms were employed to match the resolutions of the Grad-CAM and original brain images.

thumbnail
Fig 4. Average heatmaps based on Grad-CAM method of 3D-VGG16.

The Grad-CAM heatmaps of the ALFF map for NC (a) and AD (b), the ReHo map for NC (e) and AD (f), and the combined maps for NC (i) and AD (j) tend to highlight some specific ROIs such as precuneus and hippocampus, corresponding to the fact that the accuracy of the ALFF map, the ReHo map, and the combined maps is relatively high. However, the heatmaps of the fALFF map for NC (c) and AD (d), and the VMHC map for NC (g) and AD (h) cannot locate any specific regions, which could be the reason for the lower accuracy of both. In addition, due to the need to flatten the final feature maps into vector form in 3D-VGG16, the spatial structure of the feature maps is destroyed, resulting in the poor imaging performance.

https://doi.org/10.1371/journal.pone.0303278.g004

thumbnail
Fig 5. Average heatmaps based on Grad-CAM method of 3D-VGG16-GAP.

Unlike 3D-VGG16, fully connected layers are replaced with GAP layer in 3D-VGG16-GAP, so that the spatial structure of the final feature maps is retained, resulting in the better imaging performance. All the Grad-CAM heatmaps for NC subjects (ALFF (a), fALFF (c), ReHo (e), VMHC (g), and combined maps (i)) tend to focus the whole brain, whereas the the Grad-CAM heatmaps for AD subjects (ALFF (b), fALFF (d), ReHo (f), VMHC (h), and combined maps (j)) tend to highlight some specific ROIs such as precuneus and hippocampus.

https://doi.org/10.1371/journal.pone.0303278.g005

Figs 4 and 5 show the average Grad-CAM images of all the test samples of the two models, respectively. As shown, the left column shows the orthographic projection of the Grad-CAM images for the NC category based on different resting-state functional activity maps, while the right column shows the orthographic projection of the Grad-CAM images for the AD category. In the NC category, the model tended to focus on the entire brain without focusing on any specific local region. However, for AD, the model tended to focus more on specific local regions. Because of the low initial resolution of Grad-CAM images (i.e. 14 × 14 × 14), even after interpolation, they can only provide rough localization of a certain region and cannot achieve precise localization of a specific brain area. Compared with that, the precuneus region was highlighted in all Grad-CAM images, as shown in the left column of Fig 6. In addition, exception for fALFF, which had the lowest accuracy, the heatmaps of all other functional activity maps highlighted the hippocampal region, as shown in the right column of Fig 6.

thumbnail
Fig 6. ROIs highlighted by deep learning models.

As shown in Fig 5, all the Grad-CAM heatmaps cover the precuneus. In some cases, such as the heatmaps of the ReHo, VMHC, and combined maps, the hippocampus is also covered.

https://doi.org/10.1371/journal.pone.0303278.g006

The Grad-CAM heatmaps of the two patients that were incorrectly predicted are shown in Fig 7. The model focused on areas other than the precuneus and hippocampus, resulting in inaccurate feature extraction and ultimate incorrect prediction.

thumbnail
Fig 7. Grad-CAM heatmaps of samples that were predicted incorrectly on 3D-VGG16-GAP.

https://doi.org/10.1371/journal.pone.0303278.g007

Discussion

In this section, we explore the potential of deep learning techniques for distinguishing AD from NC using fMRI data. As mentioned in the Introduction, large quantities of information can be extracted from fMRI data owing to its high temporal and spatial resolutions. Generally, the relevant signal variation accounts for only 2% to 5% of the overall signal strength. Consequently, even a minor amount of noise can significantly affect the data. Additionally, a collection of fMRI data can consist of millions of data points because each voxel is scanned in both space and time, resulting in a relatively high level of complexity in the data. In our study, four different kinds of resting-state functional activity maps were used to extract the useful data from four different aspects.

As shown in Tables 2 and 3, our findings demonstrate that utilizing functional activity maps can result in relatively high accuracy in diagnosing AD from the ADNI dataset. Overall, the classification performance of 3D-VGG16-GAP was superior to that of 3D-VGG16. However, [54] reported a prediction performance loss because of the GAP layer, in which sMRI of ADNI serves as dataset, and 3D-VGG and 3D-ResNet are employed for classification. We believe that the main rationale of the performance loss overturned as a benefit is the difference of dataset. In our method, resting-state fMRI is utilized as the raw data from which several resting-state functional activity maps are extracted to form the dataset. fMRI and sMRI images capture different types of information. While fMRI reflects dynamic changes in brain and may benefit from GAP in capturing global properties due to stronger inter-regional correlations, sMRI depicts the anatomical structure of the brain, with lower inter-regional correlations, potentially leading to information loss with GAP application, resulting in performance loss. The use of the GAP layer also reduces the number of model parameters and effectively mitigating overfitting, which is very common in medical image classification tasks with limited samples.

By comparing the results obtained using a single functional activity map versus combining maps, we found that the classification accuracy increased when the combined maps were applied as dataset. For the single functional activity map, the ALFF map exhibited an accuracy value second only to the combined maps on both models because of its precise reflection of the intensity of neuronal activity. Except for these two methods, the ReHo map achieved relatively high accuracies of 80.0% and 81.8% for 3D-VGG16 and 3D-VGG16-GAP, respectively. The ReHo map describes the local functional connectivity of a voxel to neighboring voxels, which may be an important indicator for diagnosing AD becasue of the strong separation performance of the model. The accuracy of the VMHC map was lower than that of the combined, ALFF, and ReHo maps. The VMHC map measures functional homotopic connectivity between a voxel and its mirrored voxel in the contralateral brain hemisphere. The reason for the low accuracy of the VMHC map may be because there was no strong difference in functional homotopic connectivity between the AD and NC categories. Additionally, the models of VMHC map demonstrate the highest variance across all the maps, indicating its pronounced instability in performance on random datasets. The key distinction of VMHC map from other maps lies in its computation of the functional homotopic connectivity between the left and right hemispheres, resulting in symmetric data. We hypothesize that this symmetry may contribute to the model’s susceptibility to overfitting on random split datasets, leading to unstable performance. Finally, the accuracy of the fALFF map was the lowest. The difference between the fALFF and ALFF maps was that the effect of noise is reduced and suppressed by considering the ratio of each frequency to the total frequency range. However, we believe that, while the influence of noise is reduced, the intensity of some useful information required for model classification, such as neuronal activity signals, may also be suppressed, which may be the reason for the poor performance of the model. Table 4 presents the performance comparison between the proposed method and previous studies, demonstrating that our method is competitive and promising.

thumbnail
Table 4. Comparative analysis with previous research on resting-state fMRI.

https://doi.org/10.1371/journal.pone.0303278.t004

In general, CNNs extract features from input images in a hierarchical manner. Therefore, the convolutional layers closer to the front of the model extract the lower-level features, whereas those closer to the back extract the higher-level features. The final fully connected layers integrate these high-level features and perform classification. In our study, the feature maps from the final convolutional layer of the model captured the high-level features of AD. Therefore, by using Grad-CAM to visualize the feature maps from the last layer, the disease-related features can be visualized, and the ROIs associated with the disease can be identified, which provides informative guidance for disease research and improves the performance of the model.

As shown in Fig 4, the ADNI dataset performed worse on 3D-VGG16 than on 3D-VGG16-GAP, which is likely due to the inappropriate high-level features extracted by the model. As shown in Table 2, the accuracy of the fALFF map was the lowest, corresponding to the fact that the distribution of the highlighted regions was random, and the model did not find any specific ROIs to focus on. By contrast, the accuracy of the VMHC map was also relatively low; however, the model focused on almost the entire brain, as shown in Fig 4. For the ALFF, ReHo, and combined maps, which had higher accuracy, the model tended to highlight a specific ROI. We also believe that the improved imaging performance of Grad-CAM heatmaps on 3D-VGG16-GAP can be attributed to the fact that the GAP layer preserves the spatial structure information to a greater extent, in contrast to the fully connected layers in 3D-VGG16 which tend to destroy such information due to the flatten operation. Fig 8 shows the Precision-Recall curves of 3D-VGG16 and 3D-VGG-GAP. For the well-preforming resting-state functional activity maps, such as ALFF, ReHo and combined maps, the performance of 3D-VGG16-GAP consistently surpasses that of its corresponding 3D-VGG16 counterpart, which can be considered as support for the aforementioned discussion.

thumbnail
Fig 8. Precision-Recall curves of 3D-VGG16 and 3D-VGG16-GAP.

https://doi.org/10.1371/journal.pone.0303278.g008

As shown in Tables 2 and 3, the overall accuracy of 3D-VGG16-GAP was higher than that of 3D-VGG16. Intuitively, the Grad-CAM heatmaps in Fig 5 also show that the effect is better than that shown in Fig 4. For the NC category, 3D-VGG16-GAP focused on the entire brain. Specific regions are highlighted in the AD category for comparison. Compared with that in Fig 6, all the Grad-CAM heatmaps cover the precuneus. In some cases, such as the heatmaps of the ReHo, VMHC, and combined one, the hippocampus is also covered.

The precuneus is located inside the brain between the two cerebral hemispheres in the posterior region between the somatosensory cortex and anterior to the cuneus. It has various cognitive and neural functions, including spatial perception, visual attention, perception and consciousness, memory, self-awareness, and emotion processing. Some studies suggest that atrophy and pathology of the precuneus are the main causes of AD. [57] indicated that patients with early-onset AD exhibit a significant and distinct reduction in precuneus size, which is not observed to the same extent in patients with late-onset AD. In addition, the activity of choline acetyltransferase in the precuneus was found to be significantly lower in individuals with AD than in NC and similar between individuals with mild cognitive impairment and NC [58]. There is novel evidence of a difference in repetitive transcranial magnetic stimulation of the precuneus between patients with AD and NCs [59]. In our study, the highlighted ROIs overlay the precuneus area, suggesting that the model’s classification is probably based on this area. Thus, the primary difference between NC and AD is likely located within the precuneus area. This supports and corroborates prior medical research results.

In both primates and humans, the hippocampus is a relatively small structure located in the medial aspect of the temporal lobe, adjacent to the lateral ventricle, and is typically characterized by its horseshoe-shaped morphology, hence its name “hippocampus”. According to several studies [6063], the hippocampus is one of the earliest regions in the brain that experience damage in various forms of dementia, including AD. This is because the hippocampus is responsible for memory formation and retrieval and plays a critical role in the formation of new memories. As dementia progresses, hippocampal damage worsens, leading to memory impairments and other cognitive deficits.

As previously stated, it is believed that for complex datasets such as fMRI, the more manual feature extraction, the less the model needs to do, and thus the better the model’s performance may be. However, manually extracted features may not be the features required for classification by the model, indicating that they may be not discriminative enough for the model. Therefore, the required features for different fMRI data of diseases may vary, and the same method is likely to be ineffective for different diseases. In our study, several 3D resting state functional activity maps extracted from 4D fMRI data are utilized as the dataset, which differs from using 2D or 3D slices of 4D fMRI data. This approach eliminates data leakage and resting state functional activity maps possess higher-level features than raw slices. This may alleviate the pressure on the model for feature extraction, thus improving the model’s performance. Additionally, using 3D resting-state functional activity maps could also retain more spatial information in the model’s feature maps, which could enhance the explainability of the model. Even for AD, due to the small size of the dataset used in this study, which is a common issue in medical datasets due to the strict data collection conditions and the high cost of acquiring equipment, the generalizability is still limited, and further validation is required to determine its applicability to other AD patients.

The lack of explainability in deep learning models has been a longstanding issue; however, it has been partially addressed by the introduction of CAM methods. In our study, Grad-CAM was utilized to explain the model, which not only contributes to the explainability of the model for the diagnosis and research of the disease, but also helps improve the accuracy of the model by studying the cases that were incorrectly predicted. However, because of the inherent limitations of CAM methods [23, 24], in which the feature maps with the maximum resolution of 14 × 14 × 14 are used, even after interpolation, the imaging scope is coarse. In medical imaging, this can undoubtedly affect the accurate localization of lesions.

As mentioned above, the LRP method [32] was proposed to solve the problem of low resolution of CAM methods. However, the LRP method only takes into account model parameters and neuron activations. By this, the heatmaps are less prone to group effects in the data because they are produced individually. The LRP method is very specific for individuals with high inter-patient variability, which is not conducive to the detection of the common features among patients with the same disease. Unlike LRP method, the generation of Grad-CAM heatmaps not only relies on feature maps individually but also takes into account the shared parameters of gradients and other model-related features. The combination of these two points may result in superior performance when extracting common features among different individuals.

In future research, we hope to develop a new method for extracting high-level features without reducing the resolution of feature maps to address the aforementioned limitations. In addition, the improved CAM method could help fix this limitation by explaining models that are unrestricted by the resolutions of feature maps. In addition, considering that the brain is mutually interconnected, with different ROIs forming a topological map based on the strength of their connections with each other, other forms of deep learning models, such as graph neural networks (GNN), may be used to achieve higher accuracy in disease classification [52, 64]. Additionally, relatively little research has been conducted on explainability methods based on GNNs [65], which may be another promising direction for further exploration.

Conclusion

In this study, we applied Grad-CAM to an fMRI-based 3D-VGG16 network for AD diagnosis to substantiate its validity, thereby achieving the localization of AD-related ROIs. In addition, the use of resting-state functional activity maps as the dataset successfully reduced the complexity of the fMRI data, facilitating more efficient feature extraction by the model. Grad-CAM helped achieve the precise localization of disease lesions and analysis of the reasons for misclassification. The results showed that during the prediction process following training, the ROIs on which the model focused were almost identical to the areas where lesions have been shown in current research on AD. This supports and corroborates current research and facilitates the use of deep learning to study other diseases with unknown etiology. However, issues with localization accuracy are still present. Improving the deep learning models, changing the types of deep learning models, and improving the CAM method may help alleviate this limitation.

Acknowledgments

The data utilized in this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete list of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of California, Los Angeles. We thank Karen Klein MA, ELS (Translational Science Institute, Wake Forest University Health Sciences), for editing the manuscript. We express our gratitude to the anonymous reviewers for their valuable and insightful feedback.

References

  1. 1. Donahue MJ, Strother MK, Lindsey KP, Hocke LM, Tong Y, Frederick Bd. Time delay processing of hypercapnic fMRI allows quantitative parameterization of cerebrovascular reactivity and blood flow delays. Journal of Cerebral Blood Flow & Metabolism. 2016;36(10):1767–1779. pmid:26661192
  2. 2. Parmar H, Nutter B, Long R, Antani S, Mitra S. Spatiotemporal feature extraction and classification of Alzheimer’s disease using deep learning 3D-CNN for fMRI data. Journal of Medical Imaging. 2020;7(5):056001–056001. pmid:37476352
  3. 3. Ghanbari M, Pilevar AH, Bathaeian N. Diagnosis of schizophrenia using brain resting-state fMRI with activity maps based on deep learning. Signal, Image and Video Processing. 2023;17(1):267–275.
  4. 4. Anderson A, Cohen MS. Decreased small-world functional network connectivity and clustering across resting state networks in schizophrenia: an fMRI classification tutorial. Frontiers in human neuroscience. 2013;7:520. pmid:24032010
  5. 5. Oghabian MA, Batouli SAH, Norouzian M, Ziaei M, Sikaroodi H. Using functional magnetic resonance imaging to differentiate between healthy aging subjects, Mild Cognitive Impairment, and Alzheimer’s patients. Journal of research in medical sciences: the official journal of Isfahan University of Medical Sciences. 2010;15(2):84. pmid:21526064
  6. 6. Monti MM. Statistical analysis of fMRI time-series: a critical review of the GLM approach. Frontiers in human neuroscience. 2011;5:28. pmid:21442013
  7. 7. Pernet CR. The general linear model: Theory and practicalities in brain morphometric analyses. Brain Morphometry. 2018; p. 75–85.
  8. 8. Tang Y, Wang L, Cao F, Tan L. Identify schizophrenia using resting-state functional connectivity: an exploratory research and analysis. Biomedical engineering online. 2012;11:1–16. pmid:22898249
  9. 9. Wang JJ, Chen X, Sah S, Zeng C, Li YM, Li N, et al. Amplitude of low-frequency fluctuation (ALFF) and fractional ALFF in migraine patients: a resting-state functional MRI study. Clinical radiology. 2016;71(6):558–564. pmid:27055741
  10. 10. Zang Y, Jiang T, Lu Y, He Y, Tian L. Regional homogeneity approach to fMRI data analysis. Neuroimage. 2004;22(1):394–400. pmid:15110032
  11. 11. Stark DE, Margulies DS, Shehzad ZE, Reiss P, Kelly AC, Uddin LQ, et al. Regional variation in interhemispheric coordination of intrinsic hemodynamic fluctuations. Journal of Neuroscience. 2008;28(51):13754–13764. pmid:19091966
  12. 12. Ramzan F, Khan MUG, Rehmat A, Iqbal S, Saba T, Rehman A, et al. A deep learning approach for automated diagnosis and multi-class classification of Alzheimer’s disease stages using resting-state fMRI and residual neural networks. Journal of medical systems. 2020;44:1–16.
  13. 13. Sarraf S, Tofighi G. Classification of alzheimer’s disease using fmri data and deep learning convolutional neural networks. arXiv preprint arXiv:160308631. 2016;.
  14. 14. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2016. p. 770–778.
  15. 15. Marzban EN, Eldeib AM, Yassine IA, Kadah YM, Initiative ADN. Alzheimer’s disease diagnosis from diffusion tensor images using convolutional neural networks. PloS one. 2020;15(3):e0230409. pmid:32208428
  16. 16. Pusparani Y, Lin CY, Jan YK, Lin FY, Liau BY, Ardhianto P, et al. Diagnosis of Alzheimer’s disease using convolutional neural network with select slices by landmark on Hippocampus in MRI images. IEEE Access. 2023;.
  17. 17. Ju R, Hu C, Li Q, et al. Early diagnosis of Alzheimer’s disease based on resting-state brain networks and deep learning. IEEE/ACM transactions on computational biology and bioinformatics. 2017;16(1):244–257. pmid:29989989
  18. 18. Odusami M, Maskeliūnas R, Damaševičius R, Krilavičius T. Analysis of features of alzheimer’s disease: detection of early stage from functional brain changes in magnetic resonance images using a finetuned ResNet18 network. Diagnostics. 2021;11(6):1071. pmid:34200832
  19. 19. Duc NT, Ryu S, Qureshi MNI, Choi M, Lee KH, Lee B. 3D-deep learning based automatic diagnosis of Alzheimer’s disease with joint MMSE prediction using resting-state fMRI. Neuroinformatics. 2020;18:71–86. pmid:31093956
  20. 20. Qureshi MNI, Oh J, Lee B. 3D-CNN based discrimination of schizophrenia using resting-state fMRI. Artificial intelligence in medicine. 2019;98:10–17. pmid:31521248
  21. 21. Pardoe HR, K Hiess R, Kuzniecky R. Motion and morphometry in clinical and nonclinical populations. Neuroimage. 2016;135:177–185. pmid:27153982
  22. 22. Mousa D, Zayed N, Yassine IA. Alzheimer disease stages identification based on correlation transfer function system using resting-state functional magnetic resonance imaging. Plos one. 2022;17(4):e0264710. pmid:35413053
  23. 23. Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba A. Learning deep features for discriminative localization. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2016. p. 2921–2929.
  24. 24. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-cam: Visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE international conference on computer vision; 2017. p. 618–626.
  25. 25. Chattopadhay A, Sarkar A, Howlader P, Balasubramanian VN. Grad-CAM++: Generalized Gradient-Based Visual Explanations for Deep Convolutional Networks. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV); 2018. p. 839–847.
  26. 26. Muhammad MB, Yeasin M. Eigen-cam: Class activation map using principal components. In: 2020 International Joint Conference on Neural Networks (IJCNN). IEEE; 2020. p. 1–7.
  27. 27. Jiang PT, Zhang CB, Hou Q, Cheng MM, Wei Y. Layercam: Exploring hierarchical class activation maps for localization. IEEE Transactions on Image Processing. 2021;30:5875–5888. pmid:34156941
  28. 28. Patro BN, Lunayach M, Patel S, Namboodiri VP. U-cam: Visual explanation using uncertainty based class activation maps. In: Proceedings of the IEEE/CVF International Conference on Computer Vision; 2019. p. 7444–7453.
  29. 29. Sun K, Shi H, Zhang Z, Huang Y. Ecs-net: Improving weakly supervised semantic segmentation by using connections between class activation maps. In: Proceedings of the IEEE/CVF International Conference on Computer Vision; 2021. p. 7283–7292.
  30. 30. Belharbi S, Sarraf A, Pedersoli M, Ben Ayed I, McCaffrey L, Granger E. F-cam: Full resolution class activation maps via guided parametric upscaling. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision; 2022. p. 3490–3499.
  31. 31. Ma X, Ji Z, Niu S, Leng T, Rubin DL, Chen Q. MS-CAM: Multi-scale class activation maps for weakly-supervised segmentation of geographic atrophy lesions in SD-OCT images. IEEE Journal of Biomedical and Health Informatics. 2020;24(12):3443–3455. pmid:32750923
  32. 32. Böhle M, Eitel F, Weygandt M, Ritter K. Layer-wise relevance propagation for explaining deep neural network decisions in MRI-based Alzheimer’s disease classification. Frontiers in aging neuroscience. 2019;11:194. pmid:31417397
  33. 33. Dyrba M, Hanzig M, Altenstein S, Bader S, Ballarini T, Brosseron F, et al. Improving 3D convolutional neural network comprehensibility via interactive visualization of relevance maps: evaluation in Alzheimer’s disease. Alzheimer’s research & therapy. 2021;13(1):1–18. pmid:34814936
  34. 34. Lin QH, Niu YW, Sui J, Zhao WD, Zhuo C, Calhoun VD. SSPNet: An interpretable 3D-CNN for classification of schizophrenia using phase maps of resting-state complex-valued fMRI data. Medical Image Analysis. 2022;79:102430. pmid:35397470
  35. 35. Ahmed S, Kim BC, Lee KH, Jung HY, Initiative ADN. Ensemble of ROI-based convolutional neural network classifiers for staging the Alzheimer disease spectrum from magnetic resonance imaging. PLoS One. 2020;15(12):e0242712. pmid:33290403
  36. 36. Çallı E, Murphy K, Scholten ET, Schalekamp S, van Ginneken B. Explainable emphysema detection on chest radiographs with deep learning. PLoS One. 2022;17(7):e0267539. pmid:35900979
  37. 37. Bandstra MS, Curtis JC, Ghawaly JM Jr, Jones AC, Joshi TH. Explaining machine-learning models for gamma-ray detection and identification. Plos one. 2023;18(6):e0286829. pmid:37339151
  38. 38. Prince M, Wimo A, Guerchet M, Ali GC, Wu YT, Prina M. World Alzheimer report 2015. The global impact of dementia: an analysis of prevalence, incidence, cost and trends. Alzheimer’s disease international; 2015.
  39. 39. Association A, et al. 2018 Alzheimer’s disease facts and figures. Alzheimer’s & Dementia. 2018;14(3):367–429.
  40. 40. Kim SG, Ogawa S. Biophysical and physiological origins of blood oxygenation level-dependent fMRI signals. Journal of Cerebral Blood Flow & Metabolism. 2012;32(7):1188–1206. pmid:22395207
  41. 41. Gupta S, Rajapakse JC, Welsch RE, Initiative ADN, et al. Ambivert degree identifies crucial brain functional hubs and improves detection of alzheimer’s disease and autism spectrum disorder. NeuroImage: Clinical. 2020;25:102186. pmid:32000101
  42. 42. Jie B, Liu M, Lian C, Shi F, Shen D. Designing weighted correlation kernels in convolutional neural networks for functional connectivity based brain disease diagnosis. Medical image analysis. 2020;63:101709. pmid:32417715
  43. 43. Lu H, Liu S, Wei H, Chen C, Geng X. Deep multi-kernel auto-encoder network for clustering brain functional connectivity data. Neural Networks. 2021;135:148–157. pmid:33388506
  44. 44. Li X, Morgan PS, Ashburner J, Smith J, Rorden C. The first step for neuroimaging data analysis: DICOM to NIfTI conversion. Journal of neuroscience methods. 2016;264:47–56. pmid:26945974
  45. 45. Yan CG, Wang XD, Zuo XN, Zang YF. DPABI: data processing & analysis for (resting-state) brain imaging. Neuroinformatics. 2016;14:339–351. pmid:27075850
  46. 46. Fan H, Yang X, Zhang J, Chen Y, Li T, Ma X. Analysis of voxel-mirrored homotopic connectivity in medication-free, current major depressive disorder. Journal of affective disorders. 2018;240:171–176. pmid:30075387
  47. 47. Zuo XN, Kelly C, Di Martino A, Mennes M, Margulies DS, Bangaru S, et al. Growing together and growing apart: regional and sex differences in the lifespan developmental trajectories of functional homotopy. Journal of Neuroscience. 2010;30(45):15034–15043. pmid:21068309
  48. 48. Kendall MG. Rank correlation methods. Griffin; 1948.
  49. 49. Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:14091556. 2014;.
  50. 50. Oquab M, Bottou L, Laptev I, Sivic J. Is object localization for free?-weakly-supervised learning with convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2015. p. 685–694.
  51. 51. Lin M, Chen Q, Yan S. Network in network. arXiv preprint arXiv:13124400. 2013;.
  52. 52. Chen X, Zhou J, Ke P, Huang J, Xiong D, Huang Y, et al. Classification of schizophrenia patients using a graph convolutional network: A combined functional MRI and connectomics analysis. Biomedical Signal Processing and Control. 2023;80:104293.
  53. 53. Shchur O, Mumme M, Bojchevski A, Günnemann S. Pitfalls of graph neural network evaluation. arXiv preprint arXiv:181105868. 2018;.
  54. 54. Yang C, Rangarajan A, Ranka S. Visual explanations from deep 3D convolutional neural networks for Alzheimer’s disease classification. In: AMIA annual symposium proceedings. vol. 2018. American Medical Informatics Association; 2018. p. 1571.
  55. 55. Qiao J, Lv Y, Cao C, Wang Z, Li A. Multivariate deep learning classification of Alzheimer’s disease based on hierarchical partner matching independent component analysis. Frontiers in aging neuroscience. 2018;10:417. pmid:30618723
  56. 56. He Y, Wu J, Zhou L, Chen Y, Li F, Qian H. Quantification of cognitive function in Alzheimer’s disease based on deep learning. Frontiers in Neuroscience. 2021;15:651920. pmid:33815051
  57. 57. Karas G, Scheltens P, Rombouts S, Van Schijndel R, Klein M, Jones B, et al. Precuneus atrophy in early-onset Alzheimer’s disease: a morphometric structural MRI study. Neuroradiology. 2007;49:967–976. pmid:17955233
  58. 58. Ikonomovic M, Klunk W, Abrahamson E, Wuu J, Mathis C, Scheff S, et al. Precuneus amyloid burden is associated with reduced cholinergic activity in Alzheimer disease. Neurology. 2011;77(1):39–47. pmid:21700583
  59. 59. Koch G, Bonnì S, Pellicciari MC, Casula EP, Mancini M, Esposito R, et al. Transcranial magnetic stimulation of the precuneus enhances memory and neural activity in prodromal Alzheimer’s disease. Neuroimage. 2018;169:302–311. pmid:29277405
  60. 60. Dubois B, Hampel H, Feldman HH, Scheltens P, Aisen P, Andrieu S, et al. Preclinical Alzheimer’s disease: definition, natural history, and diagnostic criteria. Alzheimer’s & Dementia. 2016;12(3):292–323. pmid:27012484
  61. 61. Thompson PM, Hayashi KM, De Zubicaray GI, Janke AL, Rose SE, Semple J, et al. Mapping hippocampal and ventricular change in Alzheimer disease. Neuroimage. 2004;22(4):1754–1766. pmid:15275931
  62. 62. Allen G, Barnard H, McColl R, Hester AL, Fields JA, Weiner MF, et al. Reduced hippocampal functional connectivity in Alzheimer disease. Archives of neurology. 2007;64(10):1482–1487. pmid:17923631
  63. 63. Gosche K, Mortimer J, Smith C, Markesbery W, Snowdon D. Hippocampal volume as an index of Alzheimer neuropathology: findings from the Nun Study. Neurology. 2002;58(10):1476–1482. pmid:12034782
  64. 64. Alorf A, Khan MUG. Multi-label classification of Alzheimer’s disease stages from resting-state fMRI-based correlation connectivity data and deep learning. Computers in Biology and Medicine. 2022;151:106240. pmid:36423532
  65. 65. Zhang Z, Li M, Lin X, Wang Y, He F. Multistep speed prediction on traffic networks: A deep learning approach considering spatio-temporal dependencies. Transportation research part C: emerging technologies. 2019;105:297–322.