Autism spectrum disorder (ASD) is a neuro-developmental disorder associated with social impairments, communication difficulties, and restricted and repetitive behaviors. Yet, there is no confirmed cause identified for ASD. Studying the functional connectivity of the brain is an emerging technique used in diagnosing and understanding ASD. In this study, we obtained the resting state functional MRI data of 283 subjects from the National Database for Autism Research (NDAR). An automated autism diagnosis system was built using the data from NDAR. The proposed system is machine learning based. Power spectral densities (PSDs) of the time courses corresponding to the spatial activation areas are used as input features; the system feeds them to a stacked autoencoder and then builds a classifier using probabilistic support vector machines. Over the used dataset, around 90% sensitivity, specificity, and accuracy were achieved by our machine learning system. Moreover, the system's generalization ability was checked over two different prevalence values, one for the general population and the other for the high-risk population, and the system proved to be very generalizable, especially among the high-risk population. In addition to diagnosing autism and identifying the global differences between ASD and typically developed (TD) subjects, the proposed system generates a full personalized report for each subject. The report shows the impacted areas and the severity of implications. From the clinical aspect, this report is considered very valuable as it helps in both predicting and understanding the behavior of autistic subjects. Moreover, it helps in designing a personalized treatment plan for each individual subject. The proposed work takes a step towards achieving personalized medicine in autism, which is the ultimate goal of our group’s research efforts in this area.
Citation: Dekhil O, Hajjdiab H, Shalaby A, Ali MT, Ayinde B, Switala A, et al. (2018) Using resting state functional MRI to build a personalized autism diagnosis system. PLoS ONE 13(10): e0206351. https://doi.org/10.1371/journal.pone.0206351
Editor: Michelle Hampson, Yale University, UNITED STATES
Received: October 10, 2017; Accepted: October 11, 2018; Published: October 31, 2018
Copyright: © 2018 Dekhil et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Raw (per-subject) results can be found in the supporting information files accompanying this article. Individual subjects are identified by their GUID, an opaque string that links related records in NDAR without exposing any personal identifiable information. Interested researchers can obtain the source materials corresponding to these GUIDs, including fMRI data, ADOS, and DAS-II, from the NIMH Data Archive (https://ndar.nih.gov/access.html).
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Autism spectrum disorder (ASD) is a neuro-developmental disorder associated with three main characteristics : (i) impairments in social functioning, (ii) communication difficulties, and (iii) restricted and repetitive behaviors. The wide variation of clinical and genetic heterogeneity between autistic subjects  has made precision medicine a trending approach for diagnosis and treatment. Precision medicine combines both pathophysiologically based treatments and objective biomarkers to predict the most beneficial treatment for a particular subject. For drugs currently in clinical trials, investigation of the relationships between response and etiologies/biomarkers should be explored to better understand individualized effects for development of subsequent larger trials. The goal would be to optimize targeted treatment for patients with ASD. Existing collaborative approaches of this type have not yet targeted environmental etiologies/risk factors. As our understanding of environmental factors expands, it will be critical to incorporate these factors into experimental approaches. Although ASD is a heterogeneous disorder which varies in both symptoms and severity for each subject, it can be systematically assessed utilizing a data driven approach to split ASD into subgroups. Each subgroup can be explored separately to develop individualized/personalized treatments. The proposed approach will be more efficacious to optimize ASD treatment for each subject rather than conventional methods that are applied broadly for all ASD subjects. Personalized interventions at early ages may show a profound effect on ASD subjects during development. In combination with impactful behavioral therapies (such as early intensive behavioral intervention), this approach will have a significant impact on the overall symptoms of ASD over a lifetime. 
Consequently, this study has two main objectives: (i) design and implement an accurate machine learning system to classify ASD and TD correctly, and (ii) provide a personalized map that shows the affected areas and severity of autism for each ASD subject. Accomplishing these objectives will facilitate the designing of a precise personalized plan for each autistic subject.
Connectivity analysis is a very common way to determine the abnormalities between ASD and TD subjects [3, 4], where three major patterns are analyzed: (i) gray matter structural connectivity, (ii) white matter structural connectivity, and (iii) functional connectivity. Within the gray matter, microstructural abnormalities in autistic subjects reported, for example,  and , are suggestive of corresponding changes in connectivity. There have been few direct studies of synaptic connections in autism using human postmortem tissue , and alterations in grey matter connectivity have mostly been inferred from other findings. The minicolumns, basic anatomical and functional units of the cerebral cortex, have been found to be more narrow and/or more numerous in autism [5, 6]. The reduction in neuropil around these minicolumns in particular has implications for connectivity. It may lead, for example, to a reduction in the inhibitory capacity of GABAergic interneurons . Other evidence of disrupted synaptic connectivity derives from genetic studies, which have linked mutations in proteins involved in synaptic transmission with the incidence of autism. Studies in mouse models have elucidated what impact these mutations might have in the human brain [7, 9]. Another connectivity abnormality which was detected in the white matter of autistic subjects is the reduced long range connectivity and increased short and medium range connectivity . The reduced long range connectivity was expressed in terms of reduced fractional anisotropy in autistic subjects in many recent studies, for example . Functional connectivity analysis is the third type of analysis. Each subject is asked to perform a task or stay at rest without falling asleep in order to apply a functional connectivity analysis on his/her brain scan. . Underconnectivity theory  states that both neurobiological and cognitive disorders are the main causes of ASD. 
Reduced synchronization of brain activity in tasks that demand integrative processing, such as forming a sentence from two or more words, is used to depict the cognitive disorder. More recent studies investigated the brain connectivity associated with different tasks. For example, less activation in the left dorsolateral prefrontal and inferior parietal areas and more activation in the right occipital (visuospatial) areas and bilateral superior parietal regions were reported in a figures task experiment in . In , the response to facial expressions was addressed, where autistic individuals showed higher activation in the amygdala, ventral prefrontal cortex, and striatum specifically for sad facial expressions. Another type of task based experiment is the rewards task, where subjects are given either a monetary or a social reward and their brain activity in response to this reward is recorded [16, 17]. In , less activation in the right nucleus accumbens and more activation in the left midfrontal and anterior cingulate gyrus were reported in autistic subjects than in healthy controls in response to social and monetary rewards. Another study  revealed less connectivity in autistic subjects in response to rewards. Also, in , a machine learning algorithm (multivariate autoregressive model) was used to study the alteration in connectivity between the two groups while subjects tried to find the most logical end to a story shown to them.
Resting state is another method used to study the brain activity without performing any task. Resting state brain connectivity has been discussed in various studies. The underconnectivity hypothesis was supported by , where less functional brain connectivity was found in autistic subjects than in healthy control subjects. This result was supported for autistic males by , while autistic females and autistic children showed hyperconnectivity in . In , autistic children with more severe social dysfunction were found to be functionally hyperconnected. In , decreased connectivity was noticed in local areas in the frontal and temporal cortex, but no global abnormalities were detected. Also, in , reduced connectivity in visuospatial and superior parietal areas was reported in autistic subjects as compared to healthy control subjects. Reduced connectivity was also reported in  in both dentate nucleus and cerebello-thalamo-cortical (CTC) circuits. Building on previous studies,  depicted alterations in connectivity patterns. These alterations appeared in the interhemispheric connectivity analysis, where autistic subjects showed some areas with decreased connectivity and other areas with increased connectivity compared to the same areas in TD subjects. The altered connectivity result was also supported by , where both hypoconnectivity and hyperconnectivity were reported in autistic subjects. Another study  reported dysfunction in the functional networks, and this dysfunction was more obvious in networks related to social information processing.
In addition to its importance in reporting group differences between healthy controls and implicated subjects, studying the resting state connectivity patterns showed promising results in the diagnosis of many diseases such as schizophrenia , Alzheimer’s disease,  and autism. In , a deep neural network was used for whole brain classification of schizophrenia. The approach in  achieved high accuracy in schizophrenia diagnosis. In autism diagnosis, a recent study in  used a deep neural network to build a diagnostic system with the functional connectivity correlation matrix as input to the network.
Many of the mentioned studies, and others , reported findings in different brain areas or lobes among the autistic subjects and healthy controls. This helps in understanding autism causative factors. But to the best of our knowledge, localized abnormalities for each subject haven’t been reported by any of the formerly mentioned studies. Due to the heterogeneity of autism and its various etiology and severity, a more personalized approach is needed to predict and analyze the affected behavior and functionality of each subject; hence, an optimal goal is to achieve an individually designed personalized treatment plan.
In this study, we are expanding our group’s previous work , where a resting state analysis is performed on a dataset with a relatively large number of subjects (283 subjects). The work flow of this analysis is to extract the features of most importance from the time courses corresponding to functional connectivity spatial maps of both autistic and healthy control subjects. Using the extracted features, we build a CAD system that is able to provide a global diagnosis decision for each subject; additionally, it provides a local diagnosis report that shows the most affected areas in the brain, which could help in better understanding and predicting the affected behaviors and functionalities for each individual subject. In addition to increasing the number of subjects, we provide a correlation analysis between the CAD system output and the Autism Diagnostic Observation Schedule (ADOS) behavioral reports. This analysis reflects how well the CAD system is able to predict the affected behaviors, which allows for intervention as early as possible.
Materials and methods
In this study, we obtained fMRI data for 123 ASD and 160 TD children and adolescents (for a total number of 283 subjects) from the National Database for Autism Research (NDAR: http://ndar.nih.gov). Imaging data hosted by NDAR are fully anonymized and linked with other records (diagnostic, behavioral, demographic, etc.) via an opaque identifier, the NDAR globally unique identifier (GUID). GUIDs for all subjects used in this study are provided in the supplemental materials—S1 Table. The data used are obtained from two studies, one done at George Washington University (study ID 2021) and the other at UCLA Autism Center of Excellence (study ID 2026). We selected only subjects who have resting state fMRI, structural MRI, and DTI data available because our planned future work is to expand this work to be a multi-modal personalized diagnosis system. All of the participant subjects have both a high-resolution T1 weighted structural MRI and a resting state functional MRI (fMRI). In addition to the imaging data, many subjects also have (i) cognitive/behavioral data in the form of BRIEF-parent (100 autistic and 140 healthy controls), (ii) child/adolescent symptom inventory (CASI) (67 autistic and 110 healthy controls), (iii) child behavior checklist (CBCL) for ages 6–18 (116 autistic and 160 healthy controls), and (iv) differential ability scales 2nd edition (DAS-II) (105 autistic and 148 healthy controls). Those with a diagnosis of ASD usually had associated scores on the (v) ADOS reports (96 autistic) and (vi) Autism diagnostic interview (ADI-R) (117 autistic). Resting state fMRI and structural MRI data used in this study was acquired using Siemens Magnetom TrioTim with a 3 T magnet. Structural MRI data used an MPRAGE pulse sequence with TR = 2530 ms, TE = 3.31 ms, TI = 1100 ms, and flip angle 7°. Voxel spacing for structural MRI volumes is isotropic with 1 mm. 
For study ID 2021, resting state fMRI scans have TR = 2000 ms, TE = 2000 ms, and flip angle 90° in a two dimensional acquisition sequence that produces images with 3 mm pixel spacing and 4 mm slice spacing. Time to acquire 33 coronal slices spanning the entire brain was 2.01 s, and the resting state data were recorded for approximately 6 min. For study ID 2026, TR = 3000 ms, TE = 28 ms, and flip angle 90° in a two dimensional acquisition sequence that produces images with 3 mm pixel spacing and 4 mm slice spacing. Time to acquire 34 coronal slices spanning the entire brain was 3.01 s, and the resting state data were recorded for approximately 6 min.
Resting state fMRI experiment
The main objective while analyzing a resting state fMRI (R-fMRI) scan is to study the low-frequency fluctuations measured in blood oxygenation level dependent (BOLD) signal, which identify spatial and temporal characteristics of the resting state networks (RSNs) . To localize the individual abnormalities, the RSNs of each subject are then mapped to four resting state standard brain atlases by checking the correlation between each atlas area and each RSN. The package used in this experiment for both analysis and preprocessing is FSL MELODIC (Multivariate Exploratory Linear Optimized Decomposition into Independent Components) .
In this study, we applied multiple preprocessing steps on R-fMRI scans before the analysis takes place:
- Brain Extraction Tool (BET) is used for skull stripping, in which an MRI image is segmented into brain and non-brain tissue. BET uses a deformable model to segment the brain and is considered a robust tool .
- Removing the time differences between acquired 2D slices of fMRI scans using slice timing correction in increasing order.
- To eliminate the effect of subject movement during MRI scan, we apply motion correction using MCFLIRT algorithm [38, 39].
- To increase the signal to noise ratio (SNR) and accommodate individual anatomical variations between subjects, we used Gaussian filtering for spatial smoothing, with a full width at half maximum (FWHM) of 2 mm . The relatively small FWHM was selected to avoid activation cluster merging .
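As a small illustration of the smoothing step, the sketch below converts an FWHM into the standard deviation of the equivalent Gaussian kernel and builds a normalized one-dimensional kernel sampled on the voxel grid. The function names and the 3-voxel kernel radius are hypothetical, and the 3 mm voxel size is taken from the in-plane pixel spacing reported for the fMRI scans; this is not the FSL implementation.

```python
import math

def fwhm_to_sigma(fwhm_mm: float) -> float:
    """Convert a Gaussian FWHM to its standard deviation (same units)."""
    return fwhm_mm / (2.0 * math.sqrt(2.0 * math.log(2.0)))

def gaussian_kernel_1d(fwhm_mm: float, voxel_mm: float, radius_vox: int = 3):
    """Normalized 1D Gaussian kernel sampled at voxel centers (illustrative helper)."""
    sigma_vox = fwhm_to_sigma(fwhm_mm) / voxel_mm
    weights = [math.exp(-0.5 * (k / sigma_vox) ** 2)
               for k in range(-radius_vox, radius_vox + 1)]
    total = sum(weights)
    return [w / total for w in weights]

sigma_mm = fwhm_to_sigma(2.0)            # roughly 0.85 mm for a 2 mm FWHM
kernel = gaussian_kernel_1d(2.0, 3.0)    # 2 mm FWHM on an assumed 3 mm grid
print(round(sigma_mm, 4), [round(w, 4) for w in kernel])
```

With these assumed values almost all of the kernel weight stays on the central voxel, which is consistent with the stated aim of keeping neighboring activation clusters from merging.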
With the preprocessed scans, we apply a two-phase registration on each scan. First, we registered every preprocessed fMRI scan to its corresponding high resolution T1-weighted structural image. Second, we aligned these registered fMRI scans to the MNI152 standard space. For both registration steps, we used a 12 degree of freedom affine transformation. The registration is performed using the FLIRT software in the FSL package.
R-fMRI data analysis and feature extraction.
In R-fMRI, the values of the BOLD signal at every voxel over time represent a signal that comprises spatial locations and their corresponding activation time courses. Functional connectivity is defined as the minimal loss decomposition of the source signal (BOLD signal values) into two independent components (spatial locations and time courses). Our source signal decomposition problem is analogous to the well-known blind source separation (BSS) problem, in which a set of statistically independent source signals must be recovered from a measured signal that comprises a mixture of sources . BSS assumes that there is no prior knowledge about the sources or the mixture structure.
The BSS problem can be formulated as:

$$x_i = A s_i + \eta_i \tag{1}$$

where $x_i$ is the BOLD signal measured over time at voxel $i$, $s_i$ is the non-Gaussian source signal, $\eta_i \sim \mathcal{N}(0, \sigma^2 \Sigma_i)$ is Gaussian noise, and $A$ is the mixing matrix. To solve this BSS problem it is required to find the unmixing matrix $W$ such that

$$\hat{s}_i = W x_i \tag{2}$$

is a close approximation of the original source signal. To solve this BSS problem in the presence of Gaussian noise, a probabilistic independent component analysis (PICA) algorithm is used. When the noise covariance is unknown, the unmixing matrix $W$ is estimated iteratively, by alternating between estimating the mixing matrix and the independent sources and re-estimating the noise covariance from the residuals. For more mathematical details about finding the solution, uniqueness, correctness, and model order, the reader is referred to .
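To make the linear mixing model concrete, the following toy sketch generates non-Gaussian sources, mixes them with additive Gaussian noise, and unmixes them again. It is an idealized illustration only: the mixing matrix is assumed known so that the unmixing matrix can be taken as its pseudoinverse, whereas PICA has to estimate it blindly. All sizes and values are made up for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples = 2000

# Two non-Gaussian sources (uniform and Laplace), one row per source.
S = np.vstack([rng.uniform(-1.0, 1.0, n_samples),
               rng.laplace(0.0, 1.0, n_samples)])

# Fixed, well-conditioned mixing matrix (3 measurements, 2 sources); illustrative values.
A = np.array([[1.0, 0.5],
              [0.3, 1.0],
              [0.8, -0.4]])

noise = 0.01 * rng.normal(size=(3, n_samples))   # isotropic Gaussian noise
X = A @ S + noise                                # measured mixtures

# Idealized unmixing: possible here only because A is known.
W = np.linalg.pinv(A)
S_hat = W @ X

corr = [abs(np.corrcoef(S[k], S_hat[k])[0, 1]) for k in range(S.shape[0])]
print(corr)
```

In the blind setting the sources can only be recovered up to permutation and scaling, which is why component-to-area matching is needed afterwards.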
In this study, 40 subjects (20 ASD and 20 TD) matched in terms of age, gender, and IQ are used for group ICA analysis, where subjects are temporally concatenated. The output of the group PICA is 34 spatial components that represent activation patterns in the 40 subjects. To assess statistical significance between the two groups, permutation testing and Bonferroni correction are applied to the output components. To obtain spatial components and time courses for each individual subject, dual regression is applied. In the first regression phase, the group spatial components are used with the subject 4D volume to obtain subject specific time courses; a second regression phase is then applied to obtain the subject spatial components using the time courses obtained in the first phase. The pipeline is illustrated in Fig 1.
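The two regression phases can be sketched with ordinary least squares. The example below is a minimal illustration on synthetic data, assuming the subject scan has been reshaped to a (time, voxels) matrix; the function name, array sizes, and noise level are hypothetical, and steps such as demeaning and variance normalization that a full implementation would normally include are omitted.

```python
import numpy as np

def dual_regression(Y, group_maps):
    """Y: (time, voxels) subject data; group_maps: (components, voxels)."""
    # Phase 1 (spatial regression): group maps -> subject time courses.
    coeffs, *_ = np.linalg.lstsq(group_maps.T, Y.T, rcond=None)
    time_courses = coeffs.T                      # (time, components)
    # Phase 2 (temporal regression): time courses -> subject spatial maps.
    subject_maps, *_ = np.linalg.lstsq(time_courses, Y, rcond=None)
    return time_courses, subject_maps            # maps: (components, voxels)

# Synthetic check with 3 components, 40 time points, 50 voxels.
rng = np.random.default_rng(0)
group_maps = rng.normal(size=(3, 50))
true_tc = rng.normal(size=(40, 3))
Y = true_tc @ group_maps + 0.01 * rng.normal(size=(40, 50))

time_courses, subject_maps = dual_regression(Y, group_maps)
print(time_courses.shape, subject_maps.shape)
```

In the study's setting the component dimension would be 34 and the time courses returned by the first phase would be the signals whose PSDs serve as features.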
After completing the PICA analysis, we used an atlas of 34 areas as a reference of connectivity networks. We calculated the correlation between every area of that atlas and every extracted spatial map. The areas of interest are then selected based on those correlation values, such that the areas with maximum correlation are selected. The features used in diagnosis are the power spectral densities (PSDs) corresponding to the activation time courses of the 34 areas of interest. PSDs were used as features because they provide a sensitive description of BOLD signal oscillations, which enhances the ability to analyze the network connectivity . PSD has another advantage, which is being time shift invariant. This means that, among different subjects, if the same activation happens but at different shifts in time, the PSD will not be affected. The feature extraction process is illustrated in Fig 2.
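The time shift invariance of the PSD can be checked numerically. The sketch below uses a plain periodogram (an assumption; the PSD estimator used in the study is not specified here) on a synthetic time course sampled with TR = 2 s for about 6 minutes, and compares it with the periodogram of a circularly shifted copy. The invariance is exact for circular shifts; for a plain delay inside a finite window it holds only approximately.

```python
import numpy as np

def periodogram(x, tr_s):
    """One-sided periodogram of a demeaned time course sampled every tr_s seconds."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    spectrum = np.fft.rfft(x)
    psd = (np.abs(spectrum) ** 2) * tr_s / x.size
    freqs = np.fft.rfftfreq(x.size, d=tr_s)
    return freqs, psd

tr_s = 2.0
n_vols = 180                                   # about 6 min at TR = 2 s
t = np.arange(n_vols) * tr_s

# Synthetic low-frequency "activation" with components at 0.05 Hz and 0.025 Hz.
x = np.sin(2 * np.pi * 0.05 * t) + 0.5 * np.sin(2 * np.pi * 0.025 * t)
x_shifted = np.roll(x, 17)                     # same activity, shifted in time

freqs, psd = periodogram(x, tr_s)
_, psd_shifted = periodogram(x_shifted, tr_s)

print(np.allclose(psd, psd_shifted), freqs[np.argmax(psd)])
```

The shift changes only the phase of each Fourier coefficient, so the squared magnitudes that form the PSD are unchanged.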
Resting state functional atlas.
In this study, we used four local atlases to create an atlas that defines the expected activation networks during resting state. These four local atlases describe 34 different cortical areas, and those local atlases are:
- Parietal cortex atlas : In this atlas study, both functional connectivity and anatomical connectivity were studied on humans and macaques. Accordingly, the parietal cortex was divided into 10 components, 5 in the inferior parietal lobule (IPL) and 5 in the superior parietal lobule (SPL). Those components were clustered based on cross correlation in the tractography-based connectivity patterns of parietal voxels.
- Temporoparietal junction (TPJ) atlas [45, 46]: In this atlas study, TPJ was examined to check if it is a single area with a heterogeneous functional connectivity or multiple areas, each with its unique connectivity pattern. Accordingly, TPJ was parcellated into 2 components: (i) anterior TPJ cluster, which showed interaction with ventral prefrontal cortex and anterior insula and (ii) posterior TPJ cluster which showed interaction with the posterior cingulate, temporal pole, and anterior medial prefrontal cortex.
- Dorsal frontal cortex : In this atlas study, both DTI and fMRI were used to compare the dorsal frontal cortex organization between humans and macaques. According to this study, the human dorsal frontal cortex is parcellated into 10 components. They are all between the human inferior frontal sulcus and the cingulate cortex.
- Ventral frontal cortex : In this atlas study, similarities and differences between human and macaques’ ventral frontal cortex were identified. Based on the study outcome, the ventral frontal cortex was divided into 11 components, in addition to one more component from the ventrolateral frontal pole.
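The four atlases together give 10 + 2 + 10 + 12 = 34 areas. The sketch below illustrates, on synthetic data, one way to match extracted spatial maps to atlas areas by correlation. It assumes that each atlas area is paired with the component whose spatial map correlates most strongly with it; the function name, the binary toy masks, and the noise level are hypothetical.

```python
import numpy as np

# Area counts per atlas as described above (ventral frontal: 11 + 1 frontal pole).
AREAS_PER_ATLAS = {"parietal": 10, "TPJ": 2, "dorsal frontal": 10, "ventral frontal": 12}

def match_components_to_atlas(spatial_maps, atlas_masks):
    """spatial_maps: (components, voxels); atlas_masks: (areas, voxels).

    Returns, for each atlas area, the index of the best-correlated component
    and the corresponding Pearson correlation.
    """
    n_comp = spatial_maps.shape[0]
    full = np.corrcoef(np.vstack([spatial_maps, atlas_masks]))
    corr = full[:n_comp, n_comp:]                    # (components, areas)
    best = np.argmax(corr, axis=0)
    return best, corr[best, np.arange(corr.shape[1])]

# Toy example: 4 disjoint areas, components are noisy, reordered copies of the masks.
rng = np.random.default_rng(0)
masks = np.zeros((4, 200))
for a in range(4):
    masks[a, a * 50:(a + 1) * 50] = 1.0
order = [2, 0, 3, 1]                                 # component j resembles area order[j]
maps = masks[order] + 0.1 * rng.normal(size=(4, 200))

best, best_corr = match_components_to_atlas(maps, masks)
print(sum(AREAS_PER_ATLAS.values()), best.tolist())
```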
Local and global classification
To build our diagnostic system, which is expected to (i) classify/diagnose ASD and TD subjects, and (ii) identify local areas with autism related impairments, we used the extracted PSDs as our discriminating features between the ASD and TD groups. However, to enhance the classification process of our diagnostic system, we fed the classifier with a higher level representation of the PSDs.
We used 34 sparse autoencoders (SAEs), such that there is an autoencoder for each functional area, to represent our PSDs in a higher level representation and also to reduce the feature vectors dimensionality [49–52]. We used autoencoders to encode the PSDs through a set of nonlinear filters to a new space. Thus, when decoding them again, they give a reconstructed version of the input with minimal reconstruction error [53, 54]. In the training phase, SAEs weights were updated through error backpropagation with batch gradient descent, where the L-BFGS optimization algorithm  is used for reconstruction error minimization.
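For readers unfamiliar with sparse autoencoders, the sketch below evaluates a commonly used cost for a single-hidden-layer SAE: reconstruction error, an L2 weight penalty, and a KL-divergence sparsity penalty on the mean hidden activations. This is a textbook formulation assumed for illustration; the exact cost, layer sizes, and parameter values used in the study are not given in this section, and the names `rho`, `beta`, and `lam` are placeholders for the sparsity parameter, sparseness control parameter, and L2 regularization.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def kl_bernoulli(rho, rho_hat):
    """KL divergence between Bernoulli(rho) and Bernoulli(rho_hat)."""
    return rho * np.log(rho / rho_hat) + (1 - rho) * np.log((1 - rho) / (1 - rho_hat))

def sae_cost(X, W1, b1, W2, b2, rho=0.05, beta=3.0, lam=1e-4):
    """Cost of a one-hidden-layer sparse autoencoder; also returns the hidden codes."""
    H = sigmoid(X @ W1 + b1)                 # encoder: higher-level representation
    X_hat = sigmoid(H @ W2 + b2)             # decoder: reconstruction of the input
    reconstruction = 0.5 * np.mean(np.sum((X_hat - X) ** 2, axis=1))
    weight_decay = 0.5 * lam * (np.sum(W1 ** 2) + np.sum(W2 ** 2))
    rho_hat = H.mean(axis=0)                 # mean activation of each hidden unit
    sparsity = beta * np.sum(kl_bernoulli(rho, rho_hat))
    return reconstruction + weight_decay + sparsity, H

# Toy input: 20 "PSD" vectors of length 30 scaled to [0, 1], compressed to 10 units.
rng = np.random.default_rng(0)
X = rng.uniform(0.0, 1.0, size=(20, 30))
n_hidden = 10
W1, b1 = 0.1 * rng.normal(size=(30, n_hidden)), np.zeros(n_hidden)
W2, b2 = 0.1 * rng.normal(size=(n_hidden, 30)), np.zeros(30)

cost, codes = sae_cost(X, W1, b1, W2, b2)
print(float(cost), codes.shape)
```

Training would minimize this cost over the weights, for example with L-BFGS as described above, after which the hidden codes replace the raw PSDs as classifier inputs.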
To find the optimal set of hyper-parameters for the SAE network, namely the number of layers, number of nodes in each layer (range: 10 to 100), sparsity parameter (range: 0.05 to 0.9), sparseness control parameter (range: 1 to 20), and L2 regularization (range: 10⁻³ to 10⁻⁶), a grid search algorithm with the reconstruction error as the metric to optimize is used (Supplemental materials, S1 Fig).
To show the effect of hyper-parameter fine tuning, S2 Table in the supplementary materials lists different sets of hyper-parameters and the corresponding accuracies for each area.
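A grid search of this kind can be sketched in a few lines. In the example below the grid values are illustrative points inside the stated ranges (the actual step sizes are not given), and `toy_objective` is a hypothetical placeholder for "train the SAE with these settings and return its reconstruction error".

```python
from itertools import product

def grid_search(objective, grid):
    """Exhaustively evaluate objective(**params) and return the minimizing setting."""
    names = list(grid)
    best_params, best_score = None, float("inf")
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        score = objective(**params)
        if score < best_score:
            best_params, best_score = params, score
    return best_params, best_score

# Illustrative grid inside the ranges quoted above (step sizes are assumptions).
grid = {
    "hidden_nodes": [10, 40, 70, 100],
    "sparsity": [0.05, 0.3, 0.6, 0.9],
    "sparseness_weight": [1, 5, 10, 20],
    "l2": [1e-3, 1e-4, 1e-5, 1e-6],
}

def toy_objective(hidden_nodes, sparsity, sparseness_weight, l2):
    # Placeholder for the reconstruction error of a trained SAE (hypothetical).
    return ((hidden_nodes - 40) ** 2 / 1e3 + (sparsity - 0.05) ** 2
            + (sparseness_weight - 5) ** 2 / 1e2 + l2)

best, score = grid_search(toy_objective, grid)
print(best, score)
```

The same loop applies to the SVM stage by swapping the grid for kernel scale and box constraint and negating accuracy so that it can be minimized.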
After extracting the high level features using SAEs, they are fed into a probabilistic support vector machine (SVM) classifier with an RBF kernel to obtain a posteriori class membership scores, where the class membership is calculated as the sigmoid of the distance between the sample and the classification hyperplane. The hyper-parameters of the SVM, namely the kernel scale (range: 1 to 20) and box constraint (range: 1 to 100), are also selected using grid search  with accuracy as the metric to optimize. The selected kernel scale and box constraint are 5 and 12, respectively. For the global subject diagnosis, we propose a heuristic based on a winner-takes-all approach. The scores of all significant areas are averaged per subject, and the class with the largest average value is considered the final global diagnosis for the corresponding subject.
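The scoring and the global decision rule can be sketched as follows. The example assumes that a positive signed distance lies on the ASD side of the hyperplane and uses hypothetical sigmoid parameters `a` and `b`; in practice such parameters would be fitted on training data, and the distances would come from the per-area SVMs.

```python
import math

def membership_score(distance, a=1.0, b=0.0):
    """Sigmoid of the signed distance to the hyperplane (a, b are placeholder parameters)."""
    return 1.0 / (1.0 + math.exp(-(a * distance + b)))

def global_diagnosis(area_distances):
    """Average per-area ASD scores and pick the class with the larger mean score."""
    asd_scores = [membership_score(d) for d in area_distances]
    mean_asd = sum(asd_scores) / len(asd_scores)
    mean_td = 1.0 - mean_asd
    label = "ASD" if mean_asd > mean_td else "TD"   # ties fall to TD in this sketch
    return label, mean_asd, asd_scores

# Hypothetical signed distances for four significant areas of one subject.
label, mean_asd, scores = global_diagnosis([1.2, 0.4, -0.3, 2.0])
print(label, round(mean_asd, 3), [round(s, 3) for s in scores])
```

The per-area scores are what a personalized map would display, while only their average enters the global decision.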
Statistical significance of classifier accuracy was assessed using bootstrapping. The labels (ASD or TD) of the training data set were randomly shuffled to simulate a completely uninformative data set, and the accuracy of a classifier trained on the artificial data was noted. The process was repeated 99 times.
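The label-shuffling procedure can be sketched as below. Two assumptions are made: the p-value is computed with the common "add one" correction, so that 99 shuffles give a smallest attainable value of 0.01, and the callback `train_and_score` is a hypothetical stand-in for retraining the classifier on shuffled labels and returning its accuracy. The observed accuracy of 0.91 is used purely as an illustrative input.

```python
import random

def permutation_p_value(observed_accuracy, null_accuracies):
    """Fraction of shuffled-label runs at least as accurate as the observed one (add-one corrected)."""
    exceed = sum(1 for acc in null_accuracies if acc >= observed_accuracy)
    return (exceed + 1) / (len(null_accuracies) + 1)

def shuffled_label_accuracies(labels, train_and_score, n_repeats=99, seed=0):
    """Shuffle the labels n_repeats times and record the accuracy reached on each shuffle."""
    rng = random.Random(seed)
    accuracies = []
    for _ in range(n_repeats):
        shuffled = labels[:]
        rng.shuffle(shuffled)
        accuracies.append(train_and_score(shuffled))
    return accuracies

# Stand-in scorer: agreement between shuffled and true labels, which hovers near chance.
true_labels = [1] * 123 + [0] * 160

def chance_agreement(shuffled):
    return sum(a == b for a, b in zip(true_labels, shuffled)) / len(true_labels)

null = shuffled_label_accuracies(true_labels, chance_agreement)
p_value = permutation_p_value(0.91, null)
print(len(null), p_value)
```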
For any new unseen subject, the diagnosis system both makes a global decision indicating whether the subject is autistic or a healthy control and generates a vector of area membership scores indicating how much every area is implicated by autism related impairments. Fig 4 illustrates the diagnosis input, pipeline, and output.
The extracted features are fed to the SVM. The classification output is a global diagnosis decision, indicating whether the subject is autistic or a healthy control, together with a personalized color coded map indicating how much each area in the subject is implicated by autism impairments.
To test the system performance, two different validation techniques are used: (i) cross validation, where 2-fold, 4-fold, 10-fold, and leave one subject out (LOSO) schemes are used, and (ii) hold-out testing, where the data are partitioned into a training dataset (60% of the data), a validation dataset (15% of the data), and a testing dataset (25% of the data). For each of the validation techniques used, accuracy, sensitivity, specificity, and area under the ROC curve are calculated. To show the effect of hyper-parameters on model accuracy, S2 Table in the supplementary materials reports the accuracies per component for different combinations of hyper-parameters.
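The validation schemes and metrics can be sketched with plain Python. The helper names below are hypothetical, the folds are contiguous for simplicity (in practice subjects would normally be shuffled or stratified first), and LOSO is obtained as the special case where the number of folds equals the number of subjects.

```python
def k_fold_indices(n_samples, k):
    """Return k (train_indices, test_indices) pairs over contiguous folds."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds

def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity (true positive rate), and specificity (true negative rate)."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    tn = sum(t != positive and p != positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }

ten_fold = k_fold_indices(283, 10)      # 283 subjects, 10 folds
loso = k_fold_indices(283, 283)         # leave one subject out
metrics = binary_metrics([1, 1, 1, 0, 0, 0, 0, 1], [1, 1, 0, 0, 0, 1, 0, 1])
print([len(test) for _, test in ten_fold], metrics)
```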
Also, to check the scalability and extendibility of the diagnostic system, positive and negative predictive values (PPV and NPV) are estimated:

$$\mathrm{PPV} = \frac{\mathrm{Sens} \times \mathrm{Prev}}{\mathrm{Sens} \times \mathrm{Prev} + (1 - \mathrm{Spec}) \times (1 - \mathrm{Prev})} \tag{3}$$

$$\mathrm{NPV} = \frac{\mathrm{Spec} \times (1 - \mathrm{Prev})}{\mathrm{Spec} \times (1 - \mathrm{Prev}) + (1 - \mathrm{Sens}) \times \mathrm{Prev}} \tag{4}$$

where Sens and Spec are the sensitivity and specificity of the classifier, respectively, and Prev is the prevalence or prior probability of a diagnosis of ASD. In this study, two different prevalence values are used. The first prevalence value is 1 out of 68, which is the ASD ratio in the United States population . The second prevalence value is 18.7%, which is the autism rate among a high-risk population, where an older sibling has been previously diagnosed with ASD .
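The dependence of the predictive values on prevalence can be explored with a short function. The sketch below assumes the standard Bayes-rule definitions of PPV and NPV in terms of sensitivity, specificity, and prevalence; the sensitivity and specificity of 0.9 are illustrative inputs, not the study's measured values, so the printed numbers are not expected to match the reported tables.

```python
def predictive_values(sens, spec, prev):
    """Positive and negative predictive values from sensitivity, specificity, and prevalence."""
    ppv = sens * prev / (sens * prev + (1.0 - spec) * (1.0 - prev))
    npv = spec * (1.0 - prev) / (spec * (1.0 - prev) + (1.0 - sens) * prev)
    return ppv, npv

# The two prevalence settings described above.
general_prev = 1.0 / 68.0
high_risk_prev = 0.187

ppv_general, npv_general = predictive_values(0.9, 0.9, general_prev)
ppv_high, npv_high = predictive_values(0.9, 0.9, high_risk_prev)
print(ppv_general, npv_general, ppv_high, npv_high)
```

For a fixed classifier, PPV rises and NPV falls as prevalence increases, which is why a screening tool tends to look more useful in a high-risk population than in the general population.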
Subjects’ demographics, cognitive and behavioral data
ASD and TD subgroups were well-matched with respect to gender and age. Out of 123 ASD subjects, 56 were female (45.5%), while 85 of the 160 TD subjects were female (53.1%). The gender imbalance was statistically insignificant (χ² = 0.05, p = 0.82). The mean age of ASD subjects was 13.1 years, while the mean age was 12.9 years for the TD group. Again, the difference was statistically insignificant (t = 0.302, p = 0.763). The groups were less well matched with respect to IQ, although the differences in mean scores were less than one standard deviation (Table 2). It was noted that 38 of those with ASD were prescribed medication for behavioral concerns, 19 used prescription medication for reasons other than behavioral, 20 used over-the-counter medication, and 29 took dietary supplements. Medication status data are incomplete, with missing data for six ASD individuals and no available data for any of the healthy control subgroup. Those diagnosed with ASD presented with a wide range of severity on the ADOS (Table 2).
Global and personalized diagnosis results
For the 2-fold cross validation, the accuracy is 0.84, sensitivity is 0.88, specificity is 0.81, and AUC is 0.9165. For the 4-fold cross validation, the four metrics are 0.88, 0.90, 0.87, and 0.9187, respectively, while for the 10-fold cross validation they are 0.91, 0.92, 0.88, and 0.9218. Finally, for LOSO, they are 0.92, 0.93, 0.89, and 0.9250, respectively. Table 3 summarizes the four metrics used for every cross validation technique. Fig 5 shows the ROC curves for the four cross validation experiments.
To make sure that the system is robust enough, each of the k-fold cross validations is repeated 100 times and a summary statistic of the accuracy is reported in Table 4. Minimum accuracy, maximum accuracy, mean accuracy, and accuracy standard deviation are reported. They show homogeneity in the results over 100 runs which gives a good indication of the system robustness. Also, the bootstrap p-value for accuracy of classification was estimated to be 0.01.
In addition to using the cross validation technique for system evaluation, we also use hold-out testing. In this technique, the data are divided into three partitions: training, validation, and testing. This experiment aims to assess the system's generalization ability and to ensure that the obtained results are robust and reproducible. The obtained accuracy from this experiment is 0.91, sensitivity is 0.88, and specificity is 0.92.
To highlight the effect of using both SAE and SVM, the 12 combinations obtained from using 3 dimensionality reduction techniques and 4 different classifiers are reported in Table 5. The three dimensionality reduction techniques are: (i) SAE, (ii) PCA, and (iii) kernel PCA. The four classifiers are: (i) SVM, (ii) random forest, (iii) logistic regression, and (iv) neural network. For all the algorithms used, the hyper-parameters are also selected using a grid search algorithm. All of the 12 combinations are evaluated using hold-out testing, and accuracies are reported with respect to the testing dataset.
All these accuracies are reported using hold-out testing technique. All classifiers and dimensionality reduction technique hyper-parameters are fine tuned using the grid searching algorithm. The highest accuracy obtained is 93%. It is obtained using SAE followed by SVM with RBF kernel.
The positive and negative predictive values are also calculated using the two prevalence values mentioned in the methodology section. The PPV and NPV indicate the probability of match between the actual diagnosis and the system output diagnosis when applying the system to a population with the prevalence used in PPV and NPV calculations. For the general prevalence, the PPV is 0.19 and the NPV is 0.91, while for high-risk prevalence, the PPV is 0.79 and the NPV is 0.9. Table 6 summarizes the PPV and NPV for the two prevalence values used.
In addition to reporting the global diagnosis result, a detailed report is generated showing a personalized local diagnosis for every subject. Using this report, a color coded brain map is generated to show the areas most affected by autism related impairments. Fig 6 shows 10 samples of the color coded brain maps with the associated color code used. To identify the functional areas that are highly related to autism diagnosis between the two groups, the sensitivity and specificity of each individual area are calculated, where the area sensitivity and specificity are the true positive and true negative rates of the diagnosis obtained when using this area alone. A bar graph is provided in Fig 7 to illustrate the most significant regions, those with both sensitivity and specificity above an empirical threshold of 0.65. They are highly correlated with behavioral reports when using 4-fold cross validation. The full personalized results for all subjects used, with the membership scores of the significant areas to the autism class, are given in the supplementary materials (S1 Table).
It is obvious that autistic subjects have more impacted areas than the healthy controls.
Region 1: BA24, 32, 34 R and L, anterior cingulate gyrus, BA9/10, R and L medial frontal gyrus, BA9/10, R middle frontal gyrus, BA8, right superior frontal gyrus, right caudate nucleus; Region 2: BA22 superior temporal gyrus L≥R, BA19 middle temporal gyrus L≥R, BA39 middle temporal gyrus L≥R; Region 3: BA8/BA9/BA10, Left and Right superior frontal gyrus, BA9/BA10, L and R medial frontal gyrus, BA9/BA10, Right middle frontal gyrus, and Region 4: BA10, Left middle frontal gyrus, BA10 Left superior frontal gyrus.
To cross validate the relevance of these regions to ASD, each brain region was correlated with the Total ADOS score and ADOS severity score. The Pearson correlation coefficient varies modestly from -0.28 to 0.27 for Brodmann area/brain regions involved in neurocircuits previously implicated in ASD (Table 7 and Fig 8).
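For reference, the Pearson coefficient used here can be computed as in the sketch below. The function name and the toy vectors are assumptions for illustration; in the study the inputs would be a per-area score and the corresponding ADOS score across subjects.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Toy vectors for illustration only.
    print(pearson_r([1, 2, 3, 4], [1, 3, 2, 4]))
```

The coefficient ranges from -1 to 1, so values between -0.28 and 0.27 indicate weak linear associations.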
Over the last few years, remarkable progress in MRI research has allowed the prospective identification of infants with ASD at 24 months based on structural MRI or fMRI features. The methodology presented in this paper, which uses machine learning algorithms, identifies ASD versus healthy control children and teenagers aged 8-18 years with high accuracy, sensitivity, and specificity (Tables 3–6; Figs 6 and 7). In this study, we introduced an automated autism diagnosis system that uses resting state fMRI to localize the altered connectivity patterns for each subject. In addition, the system showed very promising generalization to all populations, particularly the high-risk population (Table 4). The sample used in this study is well matched in terms of age and gender distribution (Table 2). Although the difference in mean IQ scores between the groups is statistically significant, the means are less than one standard deviation apart. In addition, the range of IQs in healthy controls is fully included inside that of the ASD group.
In general, ASD is thought to be a developmental disconnection syndrome with local functional hyperconnectivity and long range functional underconnectivity. Both task based and resting state fMRI demonstrate a clear separation via unique BOLD patterns between ASD and healthy control children, teenagers, and adults. fMRI is dependent upon a number of maturation factors including gene expression, numbers of synapses, cell numbers, synaptic pruning, myelination, etc., which may limit the developmental context of the signal interpretation [61, 62]. Even in separation of these groups (ASD vs healthy controls), one must remember the developmental context. In high risk infants, R-fMRI networks at 6 months of age correctly predicted those with an ASD diagnosis at 24 months with a sensitivity of 82% (9 out of 11 infants with ASD) and a specificity of 100% (48/48 of those infants without ASD). However, very few R-fMRI networks were correlated with social communication and cognitive ability in high-risk infants, but many more networks were correlated with repetitive behaviors (self injury, stereotypes, sameness, ritualistic behaviors, compulsions). The interpretation suggests a developmental context since striatal and brainstem neural networks tend to mature earlier than cortically based networks. Alternatively, R-fMRI data could correlate more with ASD core symptoms, and anatomical MRI could be more closely correlated with cortically based symptoms (sensory problems, language impairment, etc.).
In the older population, the current functional MRI algorithm identifies regions with altered connectivity similar to those previously noted in ASD, including the pre-motor/supplementary motor cortex, dorsal lateral and medial prefrontal cortex, sensorimotor cortex/superior parietal lobule/supramarginal gyrus, and regions involved in language (angular gyrus, supramarginal gyrus), as being predictive of ASD with high accuracy, sensitivity, and specificity (Tables 3–6 and Figs 6 and 7) across many models.
The total ADOS score and the ADOS severity score modestly correlate with brain regions (lateral frontopolar region and temporal parietal junction) in deficit cognitive circuits previously implicated in ASD (Table 7, Fig 8) according to Research Domain Criteria (RDoC: https://www.nimh.nih.gov/research-priorities/rdoc/constructs). These deficits could impact restricted interest/repetitive behaviors, attention, social, language, and executive function.
This fMRI algorithm may be more predictive in those from high-risk ASD families than in the general population (Table 6). Previous R-fMRI studies have identified some of these regions, such as parts of the Default Mode Network, including the medial prefrontal cortex and the angular gyri, and interhemispheric connectivity networks (sensorimotor cortex and superior parietal lobule), with reduced connectivity in ASD. Thus, the current data suggest that the algorithms, especially when combined in a multi-modal approach, have the potential to identify diagnostic category and clear brain regions involved in classical neural circuits previously implicated in ASD. In addition to providing a highly accurate prediction of whether a subject has ASD, the proposed system provides a complete map explaining what areas are affected and to what extent (Fig 6 and supplementary materials S1 Table). To the best of our knowledge, this is the first work concerned with localizing impairments in each subject’s brain, which is what makes this detailed report important. This approach not only identifies those with ASD but might be of more help in ascertaining specific impairments, and is thus quite useful from the clinical point of view.
Limitations of the approach
Neuroimaging is an attractive intermediary to bridge the gap between genes, environment, and well-defined behavioral phenotypes such as ASD. The idea is to obtain a clinically relevant scan which one can then more closely relate to the neurobiological pathway of risk genes, other biofactors, and/or environmental factors on an individual level. The drawbacks of the current data and MRI-based methods lie in defining the developmental trajectory, the impact of age/gender, the development of clinically applicable techniques for scanning across ages, and the relationships to current clinical psychological methods to diagnose ASD. The current data may only be applicable to high functioning and older ASD patients and may be insensitive at younger ages (Table 2). The fact that the current sample identified areas also implicated in younger children and infants as being predictive of ASD suggests that this approach could scale to larger, more heterogeneous populations. The use of R-fMRI data represents a particular challenge since the field is underdeveloped. The diversity of the subject pool (age/gender), the design of the resting state scan, and the preprocessing/methods of analysis are still variables under study. Most importantly, there is a lack of longitudinal data defining normal functional connectivity from infancy through 8 years of age, and thus defining the abnormal developmental trajectory in ASD is difficult under 8 years of age. The analysis in this study was performed using the FSL package. To eliminate any limitations from the package and to gain more flexibility to try recent approaches at each analysis phase, an in-house package is being developed for use in future studies. It is believed that the generalization and feasibility of the system could be studied further by increasing the number of subjects and the variability between subjects, including age group, multiple-site and multiple-scanner data, and other factors.
This could be achieved by integrating multiple sites and multiple data sources in the dataset used.
Conclusion and future work
The advancement of new research technologies, including sMRI, fMRI, DTI, and genomics, has made significant inroads into the potential identification of biomarkers for ASD. Despite significant efforts, the small size of many studies has made it difficult to generalize findings to larger, more heterogeneous populations. This study demonstrates that data points from R-fMRI and machine learning algorithms could refine diagnostic accuracy, with the potential to predict clinical phenotypes and to develop better individualized treatments. Specific affected networks could be a biomarker for responses to specific types of behavioral interventions (e.g., individual psychotherapy, occupational therapy for sensory impairments, social skills training) or drug trials (e.g., selective serotonin reuptake inhibitors vs. antipsychotic medications). In addition, the fMRI data could identify more genetically homogeneous groups in which specific neuropathological processes (such as decreased axonal pruning leading to increased mini-column width and altered synaptic connectivity) are common in specific networks of those with similar defects in axonal or synaptic gene function. Future research should focus further on using big data technology to combine multiple datasets from larger populations to better delineate clinically relevant neurobiological pathways and determine response to therapies in ASD. In addition, integrating information from multiple data sources, such as behavioral reports and genetic profiles, would give more insight into the areas of interest observed in each individual subject. Future research should also study the preprocessing steps individually and in more detail, as they are reported to have an important role in the diagnosis. For example, the patient’s head movement may cause significant noise and affect the fMRI measures, causing classification bias.
The motion correction in this study was done using the MCFLIRT algorithm, which yielded good results, but more recent motion correction algorithms might be more efficient in future work. The next phase of this study should focus on including multiple-site data from different datasets (ABIDE, for example) to study system robustness and generalization ability.
S1 Table. Personalized result of all 283 subjects used.
In this table, a score between 0 and 1 is used to define the membership score of each area to the autism class, with 0 being normal and 1 being the highest in the autism class membership. The scores in this table are used to generate the color coded visualization in Fig 6.
S2 Table. Sets of hyper-parameters used for both the autoencoder and the SVM for each individual area during the grid search (Fig 7).
Data used in the preparation of this manuscript were obtained from the NIH-supported National Database for Autism Research (NDAR). NDAR is a collaborative informatics system created by the National Institutes of Health to provide a national resource to support and accelerate research in autism. Dataset identifier: 2021 and 2026. This manuscript reflects the views of the authors and may not reflect the opinions or views of the NIH or of the Submitters submitting original data to NDAR.
- 1. American Psychiatric Association, Diagnostic and statistical manual of mental disorders: DSM-5, American Psychiatric Association, Arlington, fifth edition, 2013.
- 2. Happé F, Ronald A, Plomin R. Time to give up on a single explanation for autism. Nature neuroscience. 2006;9(10):1218. pmid:17001340
- 3. Weng SJ, Wiggins JL, Peltier SJ, Carrasco M, Risi S, Lord C, et al. Alterations of resting state functional connectivity in the default network in adolescents with autism spectrum disorders. Brain research. 2010;1313:202–214. pmid:20004180
- 4. Zhang F, Savadjiev P, Cai W, Song Y, Verma R, Westin CF, et al. Fiber clustering based white matter connectivity analysis for prediction of Autism Spectrum Disorder using diffusion tensor imaging. In: Biomedical Imaging (ISBI), 2016 IEEE 13th International Symposium on. IEEE; 2016. p. 564–567.
- 5. Casanova MF, van Kooten IA, Switala AE, van Engeland H, Heinsen H, Steinbusch HW, et al. Minicolumnar abnormalities in autism. Acta neuropathologica. 2006;112(3):287. pmid:16819561
- 6. McKavanagh R, Buckley E, Chance SA. Wider minicolumns in autism: a neural basis for altered processing? Brain. 2015;138(7):2034–2045. pmid:25935724
- 7. Martínez-Cerdeño V. Dendrite and spine modifications in autism and related neurodevelopmental disorders in patients and animal models. Developmental neurobiology. 2017;77(4):393–404. pmid:27390186
- 8. Casanova M, Trippe J. Radial cytoarchitecture and patterns of cortical connectivity in autism. Philosophical Transactions of the Royal Society of London B: Biological Sciences. 2009;364(1522):1433–1436. pmid:19528027
- 9. Cao X, Tabuchi K. Functions of synapse adhesion molecules neurexin/neuroligins and neurodevelopmental disorders. Neuroscience research. 2017;116:3–9. pmid:27664583
- 10. Herbert MR, Ziegler DA, Makris N, Filipek PA, Kemper TL, Normandin JJ, et al. Localization of white matter volume increase in autism and developmental language disorder. Annals of neurology. 2004;55(4):530–540. pmid:15048892
- 11. Thompson A, Murphy D, Dell’Acqua F, Ecker C, McAlonan G, Howells H, et al. Impaired communication between the motor and somatosensory homunculus is associated with poor manual dexterity in autism spectrum disorder. Biological psychiatry. 2017;81(3):211–219. pmid:27639500
- 12. Cherkassky VL, Kana RK, Keller TA, Just MA. Functional connectivity in a baseline resting-state network in autism. Neuroreport. 2006;17(16):1687–1690. pmid:17047454
- 13. Just MA, Cherkassky VL, Keller TA, Minshew NJ. Cortical activation and synchronization during sentence comprehension in high-functioning autism: evidence of underconnectivity. Brain. 2004;127(8):1811–1821. pmid:15215213
- 14. Damarla SR, Keller TA, Kana RK, Cherkassky VL, Williams DL, Minshew NJ, et al. Cortical underconnectivity coupled with preserved visuospatial cognition in autism: Evidence from an fMRI study of an embedded figures task. Autism Research. 2010;3(5):273–279. pmid:20740492
- 15. Weng SJ, Carrasco M, Swartz JR, Wiggins JL, Kurapati N, Liberzon I, et al. Neural activation to emotional faces in adolescents with autism spectrum disorders. Journal of Child Psychology and Psychiatry. 2011;52(3):296–305. pmid:21039484
- 16. Scott-Van Zeeland AA, Dapretto M, Ghahremani DG, Poldrack RA, Bookheimer SY. Reward processing in autism. Autism research. 2010;3(2):53–67. pmid:20437601
- 17. Dekhil O, Ismail M, Shalaby A, Switala A, Elmaghraby A, Keynton R, et al. A novel CAD system for autism diagnosis using structural and functional MRI. In: 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017). IEEE; 2017. p. 995–998.
- 18. Dichter GS, Richey JA, Rittenberg AM, Sabatino A, Bodfish JW. Reward circuitry function in autism during face anticipation and outcomes. Journal of autism and developmental disorders. 2012;42(2):147–160. pmid:22187105
- 19. Cox A, Kohls G, Naples AJ, Mukerji CE, Coffman MC, Rutherford HJ, et al. Diminished social reward anticipation in the broad autism phenotype as revealed by event-related brain potentials. Social cognitive and affective neuroscience. 2015;10(10):1357–1364. pmid:25752905
- 20. Deshpande G, Libero L, Sreenivasan KR, Deshpande H, Kana RK. Identification of neural connectivity signatures of autism using machine learning. Frontiers in human neuroscience. 2013;7:670. pmid:24151458
- 21. Itahashi T, Yamada T, Watanabe H, Nakamura M, Jimbo D, Shioda S, et al. Altered network topologies and hub organization in adults with autism: a resting-state fMRI study. PloS one. 2014;9(4):e94115. pmid:24714805
- 22. Alaerts K, Swinnen SP, Wenderoth N. Sex differences in autism: a resting-state fMRI investigation of functional brain connectivity in males and females. Social cognitive and affective neuroscience. 2016;11(6):1002–1016. pmid:26989195
- 23. Supekar K, Uddin LQ, Khouzam A, Phillips J, Gaillard WD, Kenworthy LE, et al. Brain hyperconnectivity in children with autism and its links to social deficits. Cell reports. 2013;5(3):738–747. pmid:24210821
- 24. Tyszka JM, Kennedy DP, Paul LK, Adolphs R. Largely typical patterns of resting-state functional connectivity in high-functioning adults with autism. Cerebral cortex. 2013;24(7):1894–1905. pmid:23425893
- 25. Rausch A, Zhang W, Haak KV, Mennes M, Hermans EJ, van Oort E, et al. Altered functional connectivity of the amygdaloid input nuclei in adolescents and young adults with autism spectrum disorder: a resting state fMRI study. Molecular autism. 2016;7(1):13. pmid:26823966
- 26. Olivito G, Clausi S, Laghi F, Tedesco AM, Baiocco R, Mastropasqua C, et al. Resting-state functional connectivity changes between dentate nucleus and cortical social brain regions in autism spectrum disorders. The Cerebellum. 2017;16(2):283–292. pmid:27250977
- 27. Hahamy A, Behrmann M, Malach R. The idiosyncratic brain: distortion of spontaneous connectivity patterns in autism spectrum disorder. Nature neuroscience. 2015;18(2):302. pmid:25599222
- 28. Di Martino A, Yan CG, Li Q, Denio E, Castellanos FX, Alaerts K, et al. The autism brain imaging data exchange: towards a large-scale evaluation of the intrinsic brain architecture in autism. Molecular psychiatry. 2014;19(6):659. pmid:23774715
- 29. Plitt M, Barnes KA, Martin A. Functional connectivity classification of autism identifies highly predictive brain features but falls short of biomarker standards. NeuroImage: Clinical. 2015;7:359–366.
- 30. Kim J, Calhoun VD, Shim E, Lee JH. Deep neural network with weight sparsity control and pre-training extracts hierarchical features and enhances classification performance: Evidence from whole-brain resting-state functional connectivity patterns of schizophrenia. Neuroimage. 2016;124:127–146. pmid:25987366
- 31. Suk HI, Lee SW, Shen D, Initiative ADN, et al. Latent feature representation with stacked auto-encoder for AD/MCI diagnosis. Brain Structure and Function. 2015;220(2):841–859. pmid:24363140
- 32. Heinsfeld AS, Franco AR, Craddock RC, Buchweitz A, Meneguzzi F. Identification of autism spectrum disorder using deep learning and the ABIDE dataset. NeuroImage: Clinical. 2018;17:16–23.
- 33. Dvornek NC, Ventola P, Pelphrey KA, Duncan JS. Identifying autism from resting-state fMRI using long short-term memory networks. In: International Workshop on Machine Learning in Medical Imaging. Springer; 2017. p. 362–370.
- 34. Dekhil O, Hajjdiab H, Ayinde B, Shalaby A, Switala A, Sosnin D, et al. Using resting state functional MRI to build a personalized autism diagnosis system. In: Biomedical Imaging (ISBI 2018), 2018 IEEE 15th International Symposium on. IEEE; 2018. p. 1381–1385.
- 35. Beckmann CF, DeLuca M, Devlin JT, Smith SM. Investigations into resting-state connectivity using independent component analysis. Philosophical Transactions of the Royal Society of London B: Biological Sciences. 2005;360(1457):1001–1013. pmid:16087444
- 36. Beckmann CF, Smith SM. Probabilistic independent component analysis for functional magnetic resonance imaging. IEEE transactions on medical imaging. 2004;23(2):137–152. pmid:14964560
- 37. Smith SM. Fast robust automated brain extraction. Human brain mapping. 2002;17(3):143–155. pmid:12391568
- 38. Jenkinson M, Bannister P, Brady M, Smith S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage. 2002;17(2):825–841. pmid:12377157
- 39. Zeng LL, Wang D, Fox MD, Sabuncu M, Hu D, Ge M, et al. Neurobiological basis of head motion in brain imaging. Proceedings of the National Academy of Sciences. 2014; p. 201317424.
- 40. Mikl M, Mareček R, Hluštík P, Pavlicová M, Drastich A, Chlebus P, et al. Effects of spatial smoothing on fMRI group inferences. Magnetic resonance imaging. 2008;26(4):490–503. pmid:18060720
- 41. Sacchet MD, Knutson B. Spatial smoothing systematically biases the localization of reward-related brain activity. Neuroimage. 2013;66:270–277. pmid:23110886
- 42. Zarzoso V, Nandi AK. Blind separation of independent sources for virtually any source probability density function. IEEE transactions on signal processing. 1999;47(9):2419–2432.
- 43. Duff EP, Johnston LA, Xiong J, Fox PT, Mareels I, Egan GF. The power of spectral density analysis for mapping endogenous BOLD signal fluctuations. Human brain mapping. 2008;29(7):778–790. pmid:18454458
- 44. Mars RB, Jbabdi S, Sallet J, O’Reilly JX, Croxson PL, Olivier E, et al. Diffusion-weighted imaging tractography-based parcellation of the human parietal cortex and comparison with human and macaque resting-state functional connectivity. Journal of Neuroscience. 2011;31(11):4087–4100. pmid:21411650
- 45. Connectivity-based subdivisions of the human right “temporoparietal junction area”: evidence for different areas participating in different cortical networks. Cerebral cortex. 2011;22(8):1894–1903. pmid:21955921
- 46. Asplund CL, Todd JJ, Snyder AP, Marois R. A central role for the lateral prefrontal cortex in goal-directed and stimulus-driven attention. Nature neuroscience. 2010;13(4):507. pmid:20208526
- 47. Sallet J, Mars RB, Noonan MP, Neubert FX, Jbabdi S, O’Reilly JX, et al. The organization of dorsal frontal cortex in humans and macaques. Journal of Neuroscience. 2013;33(30):12255–12274. pmid:23884933
- 48. Neubert FX, Mars RB, Thomas AG, Sallet J, Rushworth MF. Comparison of human ventral frontal cortex areas for cognitive control and language with areas in monkey frontal cortex. Neuron. 2014;81(3):700–713. pmid:24485097
- 49. Kassraian-Fard P, Matthis C, Balsters JH, Maathuis MH, Wenderoth N. Promises, pitfalls, and basic guidelines for applying machine learning classifiers to psychiatric imaging data, with autism as an example. Frontiers in psychiatry. 2016;7:177. pmid:27990125
- 50. Zeng LL, Wang H, Hu P, Yang B, Pu W, Shen H, et al. Multi-Site Diagnostic Classification of Schizophrenia Using Discriminant Deep Learning with Functional Connectivity MRI. EBioMedicine. 2018;30:74–85. pmid:29622496
- 51. Plis SM, Hjelm DR, Salakhutdinov R, Allen EA, Bockholt HJ, Long JD, et al. Deep learning for neuroimaging: a validation study. Frontiers in neuroscience. 2014;8:229. pmid:25191215
- 52. Parisot S, Ktena SI, Ferrante E, Lee M, Moreno RG, Glocker B, et al. Spectral graph convolutions for population-based disease prediction. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer; 2017. p. 177–185.
- 53. Wang J, He H, Prokhorov DV. A folded neural network autoencoder for dimensionality reduction. Procedia Computer Science. 2012;13:120–127.
- 54. Xu J, Xiang L, Liu Q, Gilmore H, Wu J, Tang J, et al. Stacked sparse autoencoder (SSAE) for nuclei detection on breast cancer histopathology images. IEEE transactions on medical imaging. 2016;35(1):119–130. pmid:26208307
- 55. Zhu C, Byrd RH, Lu P, Nocedal J. Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization. ACM Transactions on Mathematical Software (TOMS). 1997;23(4):550–560.
- 56. Hsu CW, Chang CC, Lin CJ, et al. A practical guide to support vector classification. Department of Computer Science National Taiwan University. 2003.
- 57. Christensen DL, Bilder DA, Zahorodny W, Pettygrove S, Durkin MS, Fitzgerald RT, et al. Prevalence and characteristics of autism spectrum disorder among 4-year-old children in the autism and developmental disabilities monitoring network. Journal of Developmental & Behavioral Pediatrics. 2016;37(1):1–8.
- 58. Ozonoff S, Young GS, Carter A, Messinger D, Yirmiya N, Zwaigenbaum L, et al. Recurrence risk for autism spectrum disorders: a Baby Siblings Research Consortium study. Pediatrics. 2011; p. peds–2010. pmid:21844053
- 59. Emerson RW, Adams C, Nishino T, Hazlett HC, Wolff JJ, Zwaigenbaum L, et al. Functional neuroimaging of high-risk 6-month-old infants predicts a diagnosis of autism at 24 months of age. Science translational medicine. 2017;9(393):eaag2882. pmid:28592562
- 60. Minshew NJ, Keller TA. The nature of brain dysfunction in autism: functional brain imaging studies. Current opinion in neurology. 2010;23(2):124. pmid:20154614
- 61. Richiardi J, Altmann A, Milazzo AC, Chang C, Chakravarty MM, Banaschewski T, et al. Correlated gene expression supports synchronous activity in brain networks. Science. 2015;348(6240):1241–1244. pmid:26068849
- 62. Fakhoury M. The habenula in psychiatric disorders: more than three decades of translational investigation. Neuroscience & Biobehavioral Reviews. 2017.
- 63. Hull JV, Jacokes ZJ, Torgerson CM, Irimia A, Van Horn JD. Resting-state functional connectivity in autism spectrum disorders: A review. Frontiers in psychiatry. 2017;7:205. pmid:28101064
- 64. Power JD, Schlaggar BL, Petersen SE. Recent progress and outstanding issues in motion correction in resting state fMRI. Neuroimage. 2015;105:536–551. pmid:25462692
- 65. Satterthwaite TD, Wolf DH, Loughead J, Ruparel K, Elliott MA, Hakonarson H, et al. Impact of in-scanner head motion on multiple measures of functional connectivity: relevance for studies of neurodevelopment in youth. Neuroimage. 2012;60(1):623–632. pmid:22233733