Figures
Abstract
Autism spectrum disorder (ASD) is a neurological and developmental disorder that manifests in social and behavioral deficits. The onset of symptoms may begin in early childhood, but diagnosis is often subjective, and scores can vary between specialists. Several studies suggest that diffusion tensor imaging (DTI)-derived indicators of anisotropy in water diffusion at microstructural level could be biomarkers for this disorder. Emerging advances in neuroimaging and machine learning can provide a fast and objective alternative for its early diagnosis. We propose and evaluate a machine-learning (ML)-powered computer-aided diagnosis (CAD) system for the detection of ASD from DTI. For the development and validation of the system we used the ABIDE II database (n = 150). The system involves processing the raw DTI to obtain fractional anisotropy (FA), mean diffusivity (MD), radial diffusivity (RD) and axial diffusivity (AD) in 25 ASD-relevant regions of interest defined in the JHU ICBM-DTI-81 White-Matter Labeled Atlas to train a ML binary classifier. We evaluated the use of support vector machine (SVM) with various kernels and random forest (RF) optimized for computational efficiency. The best configuration, which used RF, had a sensitivity of 100%, accuracy of 95.65%, precision of 91.67%, and a specificity of 91.67%. An external test yielded 94.73% sensitivity, 97.37% accuracy, and 100% in precision and specificity. Results in this small sample show the generalization power of the best model, and the utility of carefully leveraging imaging information with clinical knowledge on relevant white matter regions commonly affected by ASD to design a CAD system for ASD.
Author summary
Autism spectrum disorder (ASD) is a condition that starts manifesting in early years but which is difficult to diagnose due to similarities with other conditions like attention deficit hyperactivity disorder. Early diagnosis can make a real difference in the child’s ability to communicate and behave later. Various studies suggest that a modality of magnetic resonance imaging that allow scientists to study the brain connections and, with it, the tissue micro-structure, named diffusion tensor imaging (DTI) can be a good alternative for early diagnosis of this condition. We propose and evaluate a machine-learning (ML)-powered computer-aided diagnosis (CAD) system for the detection of ASD from DTI. We use DTI data from 150 contributors to the ABIDE II public database. From these data, we generated four indicators of brain microstructural integrity in different brain zones, guided by a relevant brain atlas. We used various configurations of two ML popular schemes for our tasks, tuned their parameters, and evaluated different combinations of the brain health indicators we derived from the DTI. We finally evaluated the system that produced the best results in an unseen data. All our evaluations suggest that the CAD system we propose can be useful and a good candidate to be considered for further application in clinical research and practice to diagnose ASD.
Citation: Cardenas-Hernandez NA, Perez-Diaz M, García-Ramó KB, Valdés Hernández MdC (2025) Autism spectrum disorder detection using diffusion tensor imaging and machine learning. PLOS Digit Health 4(12): e0001155. https://doi.org/10.1371/journal.pdig.0001155
Editor: Phat Kim Huynh, North Carolina A&T State University: North Carolina Agricultural and Technical State University, UNITED STATES OF AMERICA
Received: June 23, 2025; Accepted: December 5, 2025; Published: December 23, 2025
Copyright: © 2025 Cardenas-Hernandez et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: This manuscript uses data from a publicly available repository: ABIDE II, available from https://fcon_1000.projects.nitrc.org/indi/abide/abide_II.html.
Funding: This work was partially supported thanks to the “Agence Universitaire de la Francophonie” in the Caribbean, Project Code 04.06.42.22 (KBGR) and the Agency for Nuclear Energy and Advanced Technology of Cuba (AENTA), Project Code PS211LH02-081 (MPD). MCVH is funded by Row Fogo Charitable Trust (Grant no. BRO-D.FID3668413) and The University of Edinburgh. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Autism spectrum disorder (ASD) is a neurological and developmental disorder that manifests in persistent deficits mainly in the following areas of social communication and interaction: social-emotional reciprocity; developing, understanding, and maintaining relationships; and nonverbal communication [1]. The prevalence of ASD is increasing globally, with a median prevalence of 100/10,000 (ranging from 1.09/10,000 to 436.0/10,000) individuals [2]. Research tracking the early development of children with ASD has revealed distinctive patterns that include difficulties in diverting visual attention from a stimulus, deficits in self-regulation ability, and impairments in the neurocognitive processing of social information in different degrees [3]. Early detection and specialized intervention are crucial for the development of skills and quality of life of people with ASD [4].
Currently, ASD diagnostic methods consist of collecting clinical information about the patient’s development and direct observation of their behaviour [5,6]. But ASD also co-occurs with other behavioral disorders like attention deficit hyperactivity disorder (ADHD) [7], dyslexia, dyspraxia, obsessive compulsive disorder (OCD), anxiety and depression [8], and comorbidities like epilepsy, and gastro-intestinal and sleep disorders among others [9] all which introduce complexity to its diagnosis. Taking into account that medical images offer vital information for accurate diagnoses and treatment monitoring, and that among imaging diagnostic modalities, magnetic resonance imaging (MRI) stands out for providing accurate, comprehensive and multiparametric information on brain anatomy and function non-invasively, it has emerged as a tool to support the earlier diagnosis of multiple diseases such as ASD.
Among the different MRI sequences, diffusion-tensor images (DTI) have been widely used to study the microstructural characteristics of the brain tissue [10] through a set of parameters that characterize the diffusivity of water in the brain. These are the axial diffusivity (AD), the fractional anisotropy (FA), the mean diffusivity (MD), and the radial diffusivity (RD), which have been applied to a large group of brain pathologies [11]. In ASD children and adolescents, most studies have found compromised white matter integrity, manifested in low FA and high MD (and RD) values in widespread areas of the white matter [12,13]. A study found decreased FA specifically in white matter tracts that connect brain areas associated with social functioning (i.e., fusiform gyrus, amygdala, and superior temporal gyrus) even after controlling for age and IQ [14]. It has also been reported increase in diffusivity and decrease in FA values in the corpus callosum [15–17]. Other studies have reported similar patterns in the longitudinal fascicles [15,17–19], the brainstem tracts (cerebral peduncle) [17], and projection fibers (anterior and posterior extremities of the internal capsule; anterior, posterior, and superior corona radiata; and posterior thalamic radiation) [17]. Compared to controls, individuals with ASD, ranging from children and adolescents [17] to adults [16], have been found to have lower AD in the regions of the corpus callosum [16,17], cerebral peduncle, sagittal stratum including the longitudinal fasciculus, cingulum, and several projection fibers (anterior and posterior branches of the internal capsule; retrolenticular internal capsule; anterior and superior corona radiata; external capsule) [17]. A study found the subgenual anterior cingulate cortex with increased MD values in ASD children compared to controls. For ASD detection, tensor asymmetry has also been analyzed [20–23], since it reveals information not captured by other features [24]. However, it is important to note that although most studies have found decreased FA in ASD, some studies have found increased FA [25], or a combination of increased and decreased FA in different white matter areas, in people with ASD compared to typically developing individuals [26]. These aspects are not yet fully clarified, so the research area is open.
Machine learning and more recently deep learning methods have been applied to automatically detect ASD from various indicators, being the most common derived from functional MRI, behavioral tests, behavioral electronic monitoring, and structural MRI [5]. To the best of our knowledge, only few studies have been conducted to propose a fully automatic computer-aided diagnosis (CAD) system for ASD diagnosis using solely DTI [20,21,27–29]. From these, three studies propose a support vector machine (SVM) classifier with a linear kernel as the best of the methods evaluated, with best accuracies being 71% [27], 75% [29], and 99% [20,21], and one [28] proposes a linear regression classifier (81% highest accuracy). Deep learning methods have also been evaluated with suboptimal results [20,21]. But the study reporting 99% accuracy does not report Precision (Sensitivity) and Recall (Specificity) and neither performed external testing, so it is not known if the results are reproducible or not in unseen data acquired in a different imaging system with a different MRI protocol.
Our contributions are two-fold:
- 1) We benchmark and document the performance of different configurations of the two most commonly used ML classifiers for ASD detection solely using DTI in large multicenter data.
- 2) The sensitivity and specificity values obtained by the two best models chosen for validation, this done in an external dataset, exceed 90%, being on the order of or exceeding those obtained by other state-of-the-art CAD systems developed for ASD, reviewed by Song et al. [30] and Bae et al. [31].
Materials and methods
Dataset
We use structural MRI (T1-weighted) and diffusion tensor imaging (DTI) scans of the Autism Brain Imaging Data Exchange (ABIDE-II) [32] publicly available database (https://fcon_1000.projects.nitrc.org/indi/abide/abide_II.html). From the 19 contributing centres, 5 provide DTI data in nifti-1 format from 319 individuals (180 with ASD diagnosis and 139 without ASD labelled as typical controls). We used data from three: New York University Langone Medical Centre (NYU samples 1 and 2), San Diego State University (SDSU), and Trinity Centre for Health Sciences (TCD), which have the bval (i.e., lists the b-value for each volume in the series) and bvec (i.e., lists the gradient direction, with one column per volume) files present. Our dataset comprises image and basic demographic data from 150 individuals, of which 87 have ASD diagnosis and 63 have not and are used as controls. Table 1 summarises the demographic characteristics of the samples included.
All MRI data were acquired using 3T scanners from different manufacturers: SIEMENS MAGNETOM Allegra syngo MR 2004A in NYU, GE 3T MR750 with an 8-channel head coil in SDSU, and Philips Intera Achieva SENSE with head-8 coil in TCD. The DTI acquisition for these samples was: TE/TR = 78/5200 ms and 0.32 ms echo spacing for NYU1, TE/TR = 3.25/2530 and 7.4 ms echo spacing for NYU2, TE/TR = min/8500 ms for SDSU, and 79/20244 ms for TCD. NYU DTI acquisition has 64 directions, with voxel sizes 3.0 × 3.0 × 3.0 mm3 for sample 1 and 1.3 × 1.0 × 1.3 mm3 for sample 2. SDSU and TCD acquired DTI with 61 directions and voxel size 2.0 x 2.0 x 2.0 mm3. More details in the DTI acquisition at each centre can be found in the ABIDE II website (https://fcon_1000.projects.nitrc.org/indi/abide/abide_II.html).
Preprocessing and feature extraction
The neuroimaging preprocessing followed a standard pipeline using MRtrix software package [33] (version 3.0.1, www.mrtrix.org/) and FSL (version 4.1) [34] (Fig 1). DTI data preprocessing included denoising, Gibbs ringing removal, corrections for image distortions, head motion, bias field, and image intensity normalization. Due to unavailability of an unweighted diffusion image with reversed phase-encoding Synb0-DISCO [35] was used to synthesize an unweighted diffusion image without susceptibility-induced distortion from the T1-weighted image.
The maps of axial, radial and diffusivity (AD, RD, MD), and fractional anisotropy (FA) were generated using FSL’s diffusion toolbox. An atlas-based segmentation approach was implemented co-registering the John Hopkins University ICBM-DTI-81 white matter (WM) atlas [36] which 48 white matter areas, to subject’s space (Fig 2). This allowed to represent each subject by four features per 48 areas.
Of the 48 regions defined in ICBM-DTI-81, the 25 that, according to the scientific literature, have shown ASD traits were selected based on the regions of interest analyzed in [15,17–19]. The selected regions are shown in Table 2. For each region defined in each image, we calculated the average values of FA, MD, AD, and RD. In this way, a total of 100 DTI features were calculated for each image. These features were organized into a CSV file to be used as input vector to the classifiers.
Machine learning classification and evaluation of models
During this stage, we divided the data into two parts. Data derivatives from three sets (NYU1, NYU2 and SDSU) were used for training and validating the classifiers. Data derivatives from one set (TCD) were used for external testing. The classifiers were trained to classify patients with ASD and healthy individuals, but the input feature vector was constructed in two different ways: in one, all the calculated features were considered for classification, and in the other, the input only consisted of the FA and MD features, as these in combination have proven very useful for studying and revealing early changes in many neurological diseases including ASD [12,37]. The training set was randomly divided into two groups: 80% for training and 20% for validation. But we also accounted for key variables including ASD/control, age, sex and handedness, to ensure a balanced representation between groups as much as possible and minimise possible biases during training and testing. It is worth noting that ASD and controls were not 1-to-1 age- and sex-matched to utilize all the data available, but overall age distributions were not different between ASD and controls (see section Risk of bias and generalizability). Sensitivity, precision, accuracy, and specificity were used as metrics for model validation.
To implement the SVM classifier, we used the Python function SVC from the Scikit-Learn library. The parameters of this function are: kernel type (linear, radial, and polynomial of degrees 3, 5, and 7), regularization parameter C which controls the balance between the complexity of the decision boundary and the accuracy in classifying the training data, and the randomness parameter random_state. We used the default value C = 1, which implied a balanced configuration in the model implementation, and to ensure consistent behaviour the random_state was set up as null, which implied the model used random seeds in each run. To obtain the best fit for this model, several trial-and-error tests were performed, varying the kernel types used, and the best one was chosen for the test data. The trial-and-error strategy was guided by values reported in the literature, software documentation (https://scikit-learn.org/stable/modules/generated/sklearn.svm.SVC.html), and by empirical observations obtained in preliminary experiments, which facilitated the identification of adequate ranges for the classifiers, while allowing the viability of the study with limited computational resources.
To implement the Random Forest models, we used the RandomForestClassifier function from the Scikit-Learn library. The n_estimators parameter controls the number of trees created in the forest. The default values for the parameters that control tree size (e.g., max_depth, min_samples_leaf) were adjusted to reduce memory consumption, complexity, and tree size. random_state was set to none to obtain a consistent model during tuning. Multiple trial-and-error tests were performed on the n_estimators (i.e., 20, 50, and 100) and max_depth (i.e., none and 10) parameters.
All processing was carried out in a workstation with the following specifications: CPU: Intel Core i7-8700 3.2 GHz, graphic card NVIDIA GeForce RTX 3070 8 GB GDDR6, RAM 32 GB (2 x 16 GB) DDR4 3200MHz, base plaque MSI B365M PRO-VH, storage: SSD 128GB M.2 SATA + HDD 1TB (7200RPM).
Other Python libraries used, in addition to Scikit-learn for the ML algorithms, were Nibabel for the manipulation of nifti-1 files, Numpy for the manipulation of matrices and arrays, and Matplotlib and Seaborn for data visualisation and graphs. Statistical analyses of the results (i.e., descriptive statistics and group comparisons) were performed using the Statistical Package for the Social Sciences (SPSS) software. Post-hoc evaluations were performed using MATLAB R2023a. A significance level of 0.05 or less was considered a statistically significant result from any statistical test.
Results
The SVM classifier for various kernel types yielded the results shown in Figs 3 and 4, first for a feature vector composed of the four computed DTI metrics (FA, MD, RD and AD), and then for two features (FA and MD).
The overall results with this approach were not good. Among the SVM models analysed, the classifier that best balanced the metric values was the linear kernel model, for the two-feature input vector. This model obtained 100% precision, 81.8% sensitivity, 100% specificity, and 91.3% accuracy. Fig 5 shows the FA and MD maps of one of the cases classified as false negative (FN), a true positive (TP) ASD patient, and another true negative (TN) patient with the best model.
Figs 6 and 7 show the results obtained with the RF models for the 4-feature (FA, MD, RD and AD) and 2-feature (FA and MD) input vectors, respectively.
We observed that the performance of this type of classification model was better than that obtained with previous SVM models. The best results were obtained for input vectors from two features. The best-performing model was the one trained by initializing with 100 trees without any depth restrictions. This is the model that presents the highest sensitivity value, which is precisely the most important metric in the opinion of medical specialists, while the rest of the metrics also maintain good results. For this model, the Precision was 91.67%, 100% sensitivity, 91.67% specificity, and 95.65% accuracy. Fig 8 shows the FA and MD maps of one of the cases classified as FP, a TP ASD patient, and another TN with the best model.
Comparing the two best models (one from each classifier), it can be seen that the chosen RF model shows high stability across all metrics (it obtained only one FP in validation) and also has the highest sensitivity for detecting ASD, which is highly desirable in the opinion of medical specialists. The SVM model, on the other hand, had the highest specificity, meaning it detected all healthy individuals very well, but it presented two false negatives in validation. Both models underwent external testing.
Note that, regarding the FA maps, the TPs present fewer bright pixels compared to the TNs, which means lower FA in some brain areas, and a loss of directionality in water diffusion, suggesting white matter involvement. However, in the MD maps, no notable differences are seen between ASD and healthy individuals with the naked eye, but the AI models are able to classify the images in ASD and healthy individuals, demonstrating the potential of ML for classification tasks from DTI images.
The results of the external test for the best performing RF and SVM models are shown in the confusion matrices in Fig 9. Table 3 summarizes the metrics corresponding to the results of this test for both models, which allow us to evaluate whether there is generalization power in these models.
The calculated metrics show, during the external test, similar results to those of the validation tests of each model, so it can be stated that both have generalization power to detect ASD with data from diverse sources. When a model has training biases, the results of the metrics in an external test usually fall below 50% of good classification or are lower than those of validation, evidencing that there is overfitting of the model to the training data.
Overall, the RF model metrics were superior to those of the SVM model, so it was chosen as the best performing model. These results indicate that data preprocessing was effective in reducing class overlap [33,35,38–41]. They also indicate that the choice of features to consider in the classifiers has been adequate [12,42], contributing to achieving good performance with data from different sources, not previously analyzed by the model.
When analyzing the characteristics obtained in each region, alterations in the values corresponding to patients with ASD were observed compared to those corresponding to healthy individuals in most of the selected regions.
The average FA and MD values were calculated for the autistic patients and healthy individuals who were part of the training and validation set in each of the selected areas in the images. For each area, we performed a Student’s t-test for the FA and MD characteristics after analyzing the histograms, to determine whether there were significant differences in the values of these metrics between cases with ASD and healthy individuals. The results of this analysis are shown in Table 4 (see corresponding regions in the JHU ICBM-DTI-81 WM atlas in Table 2).
In all regions analyzed, mean FA values are higher in healthy individuals than in patients with ASD, while mean MD values are higher in autistic patients than in healthy individuals, which is consistent with the scientific literature [12,14]; Shukla et al., 2010).
In healthy individuals, higher FA values indicate that water has greater directionality, while lower MD values indicate that water diffuses less in all directions. However, in patients with ASD, lower FA values indicate that water has less directionality. Increasing MD values indicate that water diffuses more freely in different directions. This suggests that, in patients with ASD, water diffusion was less anisotropic than in healthy individuals. That is, white matter fibers are less organized than in normal subjects. Therefore, water molecules can diffuse more freely and in more directions, unlike in healthy subjects, where these molecules are more directed. This means that, in individuals with ASD, there is greater white matter degeneration and impaired connectivity between different brain areas, while in healthy individuals, white matter is more organized.
Regarding the regions analyzed, FA values presented statistically significant differences between autistic and healthy subjects in the anterior and superior corona radiata, the cingulum, and the superior longitudinal fasciculi. Furthermore, MD values presented statistically significant differences between autistic and healthy subjects in all regions analyzed, except for the corpus callosum and the cingulum.
Therefore, the corpus callosum is the only region analyzed in which no significant differences were observed in the values of either of the two metrics that were key in this experiment. It was therefore concluded that it was not a statistically significantly different region between ASD and healthy subjects, unlike the research presented in [22], where it was one of the most statistically significant regions. The remaining regions showed significant differences in at least one of the two metrics.
The most statistically significant regions in the classification were considered to be those that showed significant differences in both characteristics. These regions are the anterior and superior corona radiata, the cingulum, and the superior longitudinal fasciculi, which is consistent with the research presented in [22] and [28].
Explainability of feature reduction outcome
Models that only used FA and MD yielded superior performance than those that included the four features, namely FA, MD, RD and AD. This can be explained by the high correlation among the diffusion variables as they are derived from the same values of the diffusion tensor. For example, MD is the average of the three eigenvalues of the diffusion tensor while RD is the average of the two smaller eigenvalues. The inclusion of highly correlated features adds redundancy to the identification of discriminative patterns and increases the risk of overfitting, especially in small samples like the one used here. By only using FA and MD our models use the most discriminative information and reduce their complexity, translating in better metrics of sensitivity and specificity. We calculated the variance inflation factor (VIF), computed Belsley collinearity diagnostics, and plotted the correlations between FA, MD, RD and AD to explain these results and demonstrate our rationale. Table 5 shows the average adjusted R2 value (VIF = 1/ (1 - R2)) from the regression of each diffusion variable using the rest as predictors. As R2 approaches 1, the VIF approaches infinity, indicating multicollinearity. Fig 10 shows an example of running the Belsley collinearity diagnostics between the four diffusion features in the ROI 3 for the controls. Similar results were obtained for all the rest of the ROIs in both groups (i.e., control and ASD).
The condition indices (condIdx) identify the number and strength of any near dependencies between the variables. All condition indices are bounded between one and the condition number, which is an overall diagnostic for detecting collinearity. The variance-decomposition proportions identify groups of variates involved in near dependencies, and the extent to which the dependencies degrade the regression. The cross-correlation matrix between the four diffusion variables is shown on the right hand side panel.
Risk of bias and generalizability
We analyzed demographic and clinical variables from all subsets of data from the three contributing centres. Table 6 shows the descriptive statistics (i.e., average age (standard deviation), performance IQ, verbal IQ and full IQ, and the sex and handedness information) for each subgroup. ASD and control groups for the training and validation sets were balanced in terms of age and handedness (proportions) but not in sex splits. In the external test set, all (i.e., ASD and controls) were left handed male. Compared with the training and validation set, the individuals in the external testing set were older. The differences in IQ between ASD and controls in the training and validation set were smaller on average than between ASD and controls in the external test set, where the controls had on average 9 units higher IQ scores than the ASD group. The heterogeneity of the samples from different sites and with different demographic characteristics confers confidence in the generalizability of the proposed model and in the validity of the analyses presented.
Due to the lack of the bval/bvec data or incomplete/unusable images, 55 individuals (42 ASD and 13 controls) from the three contributing centres were excluded. They were much younger on average than those who contributed to the training, validation and testing of the models. Given that only four were from the TCD, used as external set, their putative inclusion would have made wider the age disparity between the individuals who contributed information to develop the model and those who contributed in the external testing of the model. Also, as the differences in IQ between ASD and control groups in this excluded dataset were large, their putative inclusion would have made the training/validation set closer to the external dataset in this regard, which would have diminished confidence in the generalizability of the model. It is worth noting that in the excluded subsamples and in the subsamples used in the development of the models there were more males than females, so the putative inclusion of the excluded data would not have facilitated a more balanced data in terms of sex splits either for the development of the models, and rather would have increased the sex imbalance in the development sample in favor of males, perhaps deeming the testing set (which only has data from males) even less relevant in terms of showing the generalizability capacity of the model proposed.
We also generated box plots of the clinical characteristics of the subgroups of individuals who contributed data to the training, validation, and external testing of the models (Supplementary material). As can be appreciated, although across all tests there are differences between the responses from ASD and controls, the means, medians and distributions, especially from the scores from all tests that formed part of the child behavior checklist (ages 6–18), largely differ between the training/validation and the external test groups (see S3 Fig), strengthening the confidence in the capacity of the model proposed to perform well in an unseen population.
Discussion
Detecting ASD is currently a challenge. Although descriptive screening methods [6,43–45] help detect ASD, this cannot always be achieved early. These methods pose challenges for specialists, as they involve lengthy and time-consuming questionnaires for patients and family members. However, they are the methods that have prevailed for years for clinical detection. In most cases, they are administered by psychologists, psychiatrists, or neurologists and require a lengthy follow-up observation process for definitive evaluation [46].
Furthermore, despite the numerous advantages that morphological MRI images offer in identifying ASD, their quality significantly impacts the diagnostic accuracy of clinicians [47]. Furthermore, MRI data from individuals with ASD are obtained through multiple slices and different protocols. As a result, considerable time is also required to thoroughly examine the images. Professionals must be extremely careful, as eye strain can lead to misdiagnoses [48], often necessitating the need for second opinions, which can delay diagnosis. For all these reasons, early detection of ASD is very complex.
DTI images offer a unique view of brain structure and connectivity at the microscopic level, allowing for an assessment of anatomical connections and evaluating brain connectivity based on the organization and integrity of white matter, which is essential for communication between different brain areas [49].
For these reasons, the potential of the DTI-based RF model obtained is highlighted, as it would allow for faster diagnosis and help prevent eye strain, making it a useful support tool for specialists.
The model obtained classifies better for a two-feature input vector (FA and MD). This may be due to the high heterogeneity of the DB used. FA and MD have been shown to be powerful biomarkers after controlling for age, sex, or IQ [14]. However, RD and AD features have been much less studied, and their values may be biased by some of these elements, aspects that warrant future study.
Regarding computational cost, the model training stage took 5 seconds per model with the computational capabilities used. However, the image preprocessing stage resulted in a computational cost of between 1.5 and 2 hours per image, making this stage unfeasible in clinical practice. Images must be preprocessed under high computational performance conditions, for example, using a computer cluster in an institution with these facilities.
Comparing the results obtained with those of previous studies in the scientific literature is not a simple task. This is because systems trained using different databases, calculating different metrics, and employing diverse classification methods are published. To conduct a proper comparison, a review of two recent studies was conducted, which used the same database as in this work (ABIDE II).
In [39], a logistic regression model was trained to calculate a Euclidean distance-based correlation matrix from five DTI features (FA, MD, AD, RD, and anisotropy mode). In this study, it was observed that features obtained from the anterior limb of the left internal capsule to the right superior corona radiata regions contributed most to ASD classification.
In [49], several models were trained, including SVM, RF, logistic regression, neural networks, among others. These classifiers were trained using statistical features obtained from six DTI features (FA, MD, ,
,
, and tensor skewness). The best trained model was the linear SVM. With respect to these studies, the results of the model obtained in the present work are superior to those reported in [28]. Although the accuracy of the model presented in [20,21] is higher than the value obtained in this study, this model only showed validation results, without external testing, so it is not known whether it has generalization power for data different from those used in training.
Although a comparison between studies conducted under different conditions is not appropriate, the comparative analysis of the results can be considered a reference framework in relation to currently accepted international standards. Thus, for example, in study [29], the DB USC Multimodal Connectivity Database was used, consisting of diffusion images from 51 patients with ASD and 41 healthy controls. Metrics based on connectivity matrices were calculated. The algorithms used were SVM and Linear Discriminant Analysis. The best-performing model showed a sensitivity of 81.94% and a specificity of 70.42%.
On the other hand, in [27], the DB used consisted of two sets of diffusion images obtained from the National Database for Autism Research (NDAR), and the SVM algorithm with a vector was used, obtaining a sensitivity of 72% and a specificity of 70%.
These comparisons confirm that the model obtained in this research is within the range of internationally accepted models and presents superior results to some proposed in previous studies. The relatively small sample size used could be considered a limitation of this work. A recent work [50] has proposed an approach that integrates the power of generative adversarial networks (GANs) and Deep Q-Learning to augment limited datasets (namely ABIDE I and II), and evaluates it through the task of identifying ASD using resting state functional MRI. The best performance across the 10 classifiers evaluated by Zhou and colleagues reported the combination of the proposed approach and InfoGAN achieved an accuracy of 87.30%, a sensitivity of 91.60%, and a specificity of 86.49%, all below the figures reported here, but above those achieved without data augmentation or with other data augmentation methods. Future work could use such approach to generate a large sample and re-evaluate the best configuration proposed here.
The success of the best model proposed in this work shows that the preprocessing steps performed on the images, the classification models selected, and the choice of white matter areas to carry out an analysis based on regions of interest, rather than using the entire brain or a specific region, were superior to the approaches followed to date in research [22,27,28,51]. However, our model still can be improved. The application of more systematic hyperparameter tuning strategies like GridSearchCV or RandomSearchCV in future works could optimize the model performance and the stability of the results. Also, further validation in other samples is now needed, involving larger number of age and sex-matched ASD/control cases, individuals from underrepresented socio-economic groups and diverse ethnicity with images from different sites and acquired with different protocols.
A notable aspect is the practical utility of having employed ML methods that do not perform deep learning, such as those required in convolutional neural networks [51,52]. However, these systems that require large amount of data for training are vulnerable to the different sources of bias in the data available for their development. This is one of the most widely recognized challenges to overcome in the translation of the CAD systems developed using research data to clinical practice [53]. The generalization capacity of a ML model with a reduced number of data is an essential element, especially for the medical sector. When handling limited data sets, it is crucial to extract the most relevant and significant information from them, which allows training more robust and generalizable classification models [54], eliminating noise and redundancy while delivering a fairer response.
Conclusions
This research demonstrated that a ML model incorporating a RF classifier is effective in detecting ASD from DTI, even when trained with a relatively low amount of data.
The proposed model performed well in terms of computational effectiveness and efficiency and demonstrated generalization power, obtaining equivalent results in external testing and validation.
The superior longitudinal fascicles, the anterior and superior corona radiata, and the cingulum were the most significant regions in the classification of ASD cases and healthy individuals, while the corpus callosum was the only region that did not show significant differences.
The sensitivity and specificity values obtained by the two best models chosen for validation exceed 90%, being on the order of or exceeding those obtained by international CAD systems. A prospective validation is now needed.
Supporting information
S1 Fig. Box plots of the raw and T-score results from the Social Responsiveness Scale applied to the individuals that contributed data to the study, grouped by ASD and controls in the training/validation and external test subsets.
Data source: ABIDE II (https://fcon_1000.projects.nitrc.org/indi/abide/abide_II.html).
https://doi.org/10.1371/journal.pdig.0001155.s001
(TIF)
S2 Fig. Box plots of the scores from the Repetitive Behaviors Scale (Revised 6) applied to the individuals that contributed data to the study, grouped by ASD and controls in the training/validation and external test subsets.
Data from the Trinity Centre for Health Sciences (TCD), who was used as external validation set, did not contain these results for controls. Data source: ABIDE II (https://fcon_1000.projects.nitrc.org/indi/abide/abide_II.html).
https://doi.org/10.1371/journal.pdig.0001155.s002
(TIF)
S3 Fig. Box plots of the T-scores from the Child Behavior Checklist for ages 6–18, applied to the individuals that contributed data to the study, grouped by ASD and controls in the training/validation and external test subsets.
Data source: ABIDE II (https://fcon_1000.projects.nitrc.org/indi/abide/abide_II.html).
https://doi.org/10.1371/journal.pdig.0001155.s003
(TIF)
References
- 1. Lordan R, Storni C, De Benedictis CA. Autism Spectrum Disorders: Diagnosis and Treatment. In: Grabrucker A, editor. Autism Spectrum Disorders. Exon Publications; 2021. p. 17–32.
- 2. Zeidan J, Fombonne E, Scorah J, Ibrahim A, Durkin MS, Saxena S, et al. Global prevalence of autism: A systematic review update. Autism Res. 2022;15(5):778–90.
- 3. Pugliese CE, Anthony LG, Strang JF, Dudley K, Wallace GL, Naiman DQ, et al. Longitudinal Examination of Adaptive Behavior in Autism Spectrum Disorders: Influence of Executive Function. J Autism Dev Disord. 2016;46(2):467–77. pmid:26349921
- 4. Kodak T, Bergmann S. Autism Spectrum Disorder: Characteristics, Associated Behaviors, and Early Intervention. Pediatr Clin North Am. 2020;67(3):525–35. pmid:32443991
- 5. Talukdar J, Gogoi DK, Singh TP. A comparative assessment of most widely used machine learning classifiers for analysing and classifying autism spectrum disorder in toddlers and adolescents. Healthcare Analytic. 2023;3:100178.
- 6. Thabtah F, Peebles D. Early Autism Screening: A Comprehensive Review. Int J Environ Res Public Health. 2019;16(18):3502. pmid:31546906
- 7. Hours C, Recasens C, Baleyte J-M. ASD and ADHD Comorbidity: What Are We Talking About? Front Psychiatry. 2022;13:837424. pmid:35295773
- 8. Bougeard C, Picarel-Blanchot F, Schmid R, Campbell R, Buitelaar J. Prevalence of Autism Spectrum Disorder and Co-morbidities in Children and Adolescents: A Systematic Literature Review. Front Psychiatry. 2021;12:744709. pmid:34777048
- 9. Al-Beltagi M. Autism medical comorbidities. World J Clin Pediatr. 2021;10(3):15–28. pmid:33972922
- 10. Thiebaut de Schotten M, Ffytche DH, Bizzi A, Dell’Acqua F, Allin M, Walshe M, et al. Atlasing location, asymmetry and inter-subject variability of white matter tracts in the human brain with MR diffusion tractography. Neuroimage. 2011;54(1):49–59. pmid:20682348
- 11. Alexander AL, Hasan K, Kindlmann G, Parker DL, Tsuruda JS. A geometric analysis of diffusion tensor measurements of the human brain. Magn Reson Med. 2000;44(2):283–91.
- 12. Groen WB, Buitelaar JK, van der Gaag RJ, Zwiers MP. Pervasive microstructural abnormalities in autism: a DTI study. J Psychiatry Neurosci. 2011;36(1):32–40. pmid:20964953
- 13.
Shukla DK, Keehn B, Lincoln AJ, Müller RA. White matter compromise of callosal and subcortical fiber tracts in children with autism spectrum disorder: A diffusion tensor imaging study. Journal of the American Academy of Child and Adolescent Psychiatry 2010;49(12): 1269–1278. https://doi.org/10.1016/j.jaac.2010.08.018
- 14. Jou RJ, Mateljevic N, Kaiser MD, Sugrue DR, Volkmar FR, Pelphrey KA. Structural neural phenotype of autism: preliminary evidence from a diffusion tensor imaging study using tract-based spatial statistics. AJNR Am J Neuroradiol. 2011;32(9):1607–13. pmid:21799040
- 15. Dougherty CC, Evans DW, Myers SM, Moore GJ, Michael AM. A Comparison of Structural Brain Imaging Findings in Autism Spectrum Disorder and Attention-Deficit Hyperactivity Disorder. Neuropsychol Rev. 2016;26(1):25–43. pmid:26482248
- 16. Nickel K, Tebartz van Elst L, Perlov E, Endres D, Müller GT, Riedel A, et al. Altered white matter integrity in adults with autism spectrum disorder and an IQ >100: a diffusion tensor imaging study. Acta Psychiatr Scand. 2017;135(6):573–83. pmid:28407202
- 17. Vogan VM, Morgan BR, Leung RC, Anagnostou E, Doyle-Thomas K, Taylor MJ. Widespread White Matter Differences in Children and Adolescents with Autism Spectrum Disorder. J Autism Dev Disord. 2016;46(6):2138–47. pmid:26899725
- 18. Blanken LME, Muetzel RL, Jaddoe VWV, Verhulst FC, van der Lugt A, Tiemeier H, et al. White matter microstructure in children with autistic traits. Psychiatry Res Neuroimaging. 2017;263:127–34. pmid:28384486
- 19. Reyes-Vaca JG, Villegas-Valdez DMM, Bravo-Oro A, Hernández-Rodríguez HG. Estudio comparativo de los fascículos longitudinal superior e inferior con imágenes de tensor de difusión en pacientes pediátricos autistas y no autistas. Revista Anales Radiol Mexico. 2019;18(3):148–58.
- 20. ElNakieb Y, Ali MT, Elnakib A, Shalaby A, Soliman A, Mahmoud A, et al. The Role of Diffusion Tensor MR Imaging (DTI) of the Brain in Diagnosing Autism Spectrum Disorder: Promising Results. Sensors (Basel). 2021;21(24):8171. pmid:34960265
- 21. ElNakieb Y, Ali MT, Soliman A, Mahmoud A, Shalaby A, Switala A, et al. Identifying brain pathological abnormalities of autism for classification using diffusion tensor imaging. In: Neural Engineering Techniques for Autism Spectrum Disorder: Volume 1: Imaging and Signal Analysis. 2021. p. 361–76.
- 22. Elnakieb YA, Ali MT, Soliman A, Mahmoud AH, Shalaby AM, Alghamdi NS, et al. Computer Aided Autism Diagnosis Using Diffusion Tensor Imaging. IEEE Access. 2020;8:191298–308.
- 23. Virues-Ortega J, McKay NS, McCormack JC, Lopez N, Liu R, Kirk I. A callosal biomarker of behavioral intervention outcomes for autism spectrum disorder? A case-control feasibility study with diffusion tensor imaging. PLoS One. 2022;17(2):e0262563. pmid:35113904
- 24. Basser PJ. New histological and physiological stains derived from diffusion-tensor MR images. Ann N Y Acad Sci. 1997;820:123–38. pmid:9237452
- 25. Ben Bashat D, Kronfeld-Duenias V, Zachor DA, Ekstein PM, Hendler T, Tarrasch R, et al. Accelerated maturation of white matter in young children with autism: a high b value DWI study. Neuroimage. 2007;37(1):40–7. pmid:17566764
- 26. Kumar A, Sundaram SK, Sivaswamy L, Behen ME, Makki MI, Ager J, et al. Alterations in frontal lobe tracts and corpus callosum in young children with autism spectrum disorder. Cereb Cortex. 2010;20(9):2103–13. pmid:20019145
- 27. Elnakieb Y, Soliman A, Mahmoud A, Dekhil O, Shalaby A, Ghazal M, et al. Autism Spectrum Disorder Diagnosis framework using Diffusion Tensor Imaging. In: IST 2019 - IEEE International Conference on Imaging Systems and Techniques, Proceedings, 2019. p. 1–5.
- 28. Ratnaik R, Rakshe C, Kumar M, Agastinose Ronickom JF. Diagnostic Classification of ASD Improves with Structural Connectivity of DTI and Logistic Regression. Stud Health Technol Inform. 2023;305:64–7. pmid:37386959
- 29. Saad M, Rabiul Islam SM. Brain connectivity network analysis and classifications from diffusion tensor imaging. In: 1st International Conference on Robotics, Electrical and Signal Processing Techniques, ICREST 2019, 2019. p. 422–7.
- 30. Song D-Y, Kim SY, Bong G, Kim JM, Yoo HJ. The Use of Artificial Intelligence in Screening and Diagnosis of Autism Spectrum Disorder: A Literature Review. Soa Chongsonyon Chongsin Uihak. 2019;30(4):145–52. pmid:32595335
- 31. Bae S, Hong J, Ha S, Moon J, Yu J, Choi H, et al. Multimodal AI for risk stratification in autism spectrum disorder: integrating voice and screening tools. NPJ Digit Med. 2025;8(1):538. pmid:40841482
- 32. Di Martino A, O’Connor D, Chen B, Alaerts K, Anderson JS, Assaf M, et al. Enhancing studies of the connectome in autism using the autism brain imaging data exchange II. Sci Data. 2017;4:170010. pmid:28291247
- 33. Tournier J-D, Smith R, Raffelt D, Tabbara R, Dhollander T, Pietsch M, et al. MRtrix3: A fast, flexible and open software framework for medical image processing and visualisation. Neuroimage. 2019;202:116137. pmid:31473352
- 34. Jenkinson M, Beckmann CF, Behrens TEJ, Woolrich MW, Smith SM. FSL. Neuroimage. 2012;62(2):782–90. pmid:21979382
- 35. Schilling KG, Blaber J, Hansen C, Cai L, Rogers B, Anderson AW, et al. Distortion correction of diffusion weighted MRI without reverse phase-encoding scans or field-maps. PLoS ONE. 2020;15(7):e0236418.
- 36. Hua K, Zhang J, Wakana S, Jiang H, Li X, Reich DS, et al. Tract probability maps in stereotaxic spaces: analyses of white matter anatomy and tract-specific quantification. Neuroimage. 2008;39(1):336–47. pmid:17931890
- 37. Thibault G, Fertil B, Navarro C. Texture indexes and gray level size zone matrix: application to cell nuclei classification. In: Proceedings of the International Conference on Pattern Recognition and Information Processing (PRIP), 2009. p. 140–5.
- 38. Andersson JLR, Skare S, Ashburner J. How to correct susceptibility distortions in spin-echo echo-planar images: application to diffusion tensor imaging. Neuroimage. 2003;20(2):870–88. pmid:14568458
- 39. Andersson JLR, Sotiropoulos SN. An integrated approach to correction for off-resonance effects and subject movement in diffusion MR imaging. Neuroimage. 2016;125:1063–78. pmid:26481672
- 40. Veraart J, Novikov DS, Christiaens D, Ades-Aron B, Sijbers J, Fieremans E. Denoising of diffusion MRI using random matrix theory. Neuroimage. 2016;142:394–406. pmid:27523449
- 41. Zhang Y, Brady M, Smith S. Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. IEEE Trans Med Imaging. 2001;20(1):45–57. pmid:11293691
- 42. Giannelli M, Cosottini M, Michelassi MC, Lazzarotti G, Belmonte G, Bartolozzi C, et al. Dependence of brain DTI maps of fractional anisotropy and mean diffusivity on the number of diffusion weighting directions. J Appl Clin Med Phys. 2009;11(1):2927. pmid:20160677
- 43. Gotham K, Pickles A, Lord C. Standardizing ADOS scores for a measure of severity in autism spectrum disorders. J Autism Dev Disord. 2009;39(5):693–705. pmid:19082876
- 44. Lord C, Rutter M, Le Couteur A. Autism Diagnostic Interview-Revised: a revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. J Autism Dev Disord. 1994;24(5):659–85. pmid:7814313
- 45. Zwaigenbaum L, Bauman ML, Choueiri R, Fein D, Kasari C, Pierce K, et al. Early Identification and Interventions for Autism Spectrum Disorder: Executive Summary. Pediatrics. 2015;136(Suppl 1):S1–9. pmid:26430167
- 46. Moridian P, Ghassemi N, Jafari M, Salloum-Asfar S, Sadeghi D, Khodatars M, et al. Automatic autism spectrum disorder detection using artificial intelligence methods with MRI neuroimaging: A review. Front Mol Neurosci. 2022;15:999605. pmid:36267703
- 47. Saputra DCE, Maulana Y, Win TA, Phann R, Caesarendra W. Implementation of Machine Learning and Deep Learning Models Based on Structural MRI for Identification Autism Spectrum Disorder. J Ilm Tek Elektro Komput Dan Inform. 2023;9(2):307–18.
- 48. Caranci F, Tedeschi E, Leone G, Reginelli A, Gatta G, Pinto A, et al. Errors in neuroradiology. Radiol Med. 2015;120(9):795–801. pmid:26183038
- 49. Hoppenbrouwers M, Vandermosten M, Boets B. Autism as a disconnection syndrome: A qualitative and quantitative review of diffusion tensor imaging studies. Res Autism Spectrum Dis. 2014;8(4):387–412.
- 50. Zhou Y, Jia G, Ren Y, Ren Y, Xiao Z, Wang Y. Advancing ASD identification with neuroimaging: a novel GARL methodology integrating Deep Q-Learning and generative adversarial networks. BMC Med Imaging. 2024;24(1):186. pmid:39054419
- 51. Herath L, Meedeniya D, Marasingha MAJC, Weerasinghe V. Autism spectrum disorder diagnosis support model using Inception V3. In: Proceedings - International Research Conference on Smart Computing and Systems Engineering, SCSE 2021. 2021. p. 1–7.
- 52. Wei L, Liu B, He J, Zhang M, Huang Y. Autistic Spectrum Disorders Diagnose with Graph Neural Networks. In: MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia. 2023. p. 8819–27.
- 53. Leming MJ, Bron EE, Bruffaerts R, Ou Y, Iglesias JE, Gollub RL, et al. Challenges of implementing computer-aided diagnostic models for neuroimages in a clinical setting. NPJ Digit Med. 2023;6(1):129. pmid:37443276
- 54. Varoquaux G, Cheplygina V. Machine learning for medical imaging: methodological failures and recommendations for the future. NPJ Digit Med. 2022;5(1):48. pmid:35413988