Figures
Abstract
Quantification of dopaminergic neurons in the substantia nigra pars compacta (SNc) of animal models is important for understanding the pathogenesis of Parkinson’s disease (PD). However, conventional manual cell counting method requires the time and effort, and has limited reproducibility due to inter- and intra-examiner variability. Here, we demonstrate that a commercially available convolutional neural network–based artificial intelligence (AI) counting method (TruAI, OLYMPUS, Tokyo, Japan) enables robust and reproducible quantification of TH-positive dopaminergic neurons in mouse, marmoset, and human SNc samples when compared with conventional manual counting. AI-based counting showed a strong correlation with manual counting across mouse, marmoset, and human samples. Good agreement between AI-based and manual counting was observed in mouse and marmoset samples, supporting the applicability of this approach for cross-species quantification of dopaminergic neurons. In the mouse model treated with α-syn preformed fibrils (PFFs), AI-based counting detected a significant reduction in TH-positive neurons consistent with expert manual counting. Non-experts exhibited greater intra-examiner variability than an expert, indicating that the reliability of manual counting depends on experience. Overall, AI-based quantification provides a robust and objective approach for TH-positive cell counting and may improve reproducibility in dopaminergic neuron analysis, particularly for non-expert users and cross-species studies of PD.
Citation: Kunugitani K, Sawamura M, Taguchi T, Hirato T, Uemura N, Ayaki T, et al. (2026) Evaluation of a commercial AI-assisted cell counting software for dopaminergic neurons across species. PLoS One 21(3): e0344621. https://doi.org/10.1371/journal.pone.0344621
Editor: Míriam R. García, Spanish Council for Scientific Research (CSIC), SPAIN
Received: September 29, 2025; Accepted: February 23, 2026; Published: March 17, 2026
Copyright: © 2026 Kunugitani et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Parkinson’s disease (PD) is a neurodegenerative disorder characterized by a decrease in dopaminergic neurons in the substantia nigra pars compacta (SNc) and pathological accumulation of α-synuclein (α-syn) [1]. The detailed mechanisms of PD pathogenesis remain unclear; thus, for a better understanding of the condition, animal models of PD have been actively studied [2–4]. Since the loss of dopaminergic neurons in the SNc is a major pathophysiology of PD [1], accurate enumeration of dopaminergic neurons is essential for the development of models and therapeutics for PD pathology. Tyrosine hydroxylase (TH) is an enzyme that synthesizes the precursor of dopamine [5,6], L-3,4-dihydroxyphenylalanine, and quantification of TH-positive cells in the SNc is considered the gold standard for evaluating dopaminergic neurons in rodent models of PD [7]. However, it is difficult to automate the enumeration of TH-positive cells using classical image processing because of their irregular size and shape, unclear boundaries, and high cell density; thus, most studies on PD quantify TH-positive cells manually [8–10]. However, cell counting performed by human examiners limits the objectivity and reproducibility of results and imposes a burden on the experimenter.
In recent years, the development of machine learning technology in the field of artificial intelligence (AI) has led to the automation of tasks traditionally performed by humans [11]. The same is true for biological experiments. Recently, computerized target detection algorithms [12–17] and image analysis tools [18–20] have emerged, offering advantages such as automated measurements, elimination of experimenter bias, data reproducibility, elimination of human error, time and human resource savings, and high-speed high-volume analyses. In particular, deep learning methods using convolutional neural networks (CNNs) have attracted considerable attention. By learning features directly from a set of data, it is now possible to make measurements that were previously difficult with classical methods [21,22]. To our knowledge, multiple CNN-based AIs have been previously reported for enumerating TH-positive cells in the SNc of neurotoxic PD mouse models [23–27]; however, they are yet to be recognized as the gold standard. Moreover, while non-human primate PD models have been reported [3,28], no method has been established for automatically counting TH-positive cells in non-human primates. Here we investigated whether a commercially available CNN-based AI (TruAI, OLYMPUS, Tokyo, Japan) could serve as an alternative to conventional manual counting for enumerating TH-positive cells in mouse and marmoset PD models treated with α-syn preformed fibrils (PFFs), as well as in human PD patients.
Results
Phosphorylated-α-synuclein (p-α-syn) staining confirmed PFFs-treated mice as Lewy body diseases (LBD) model
As previously reported [29], phosphate-buffered saline (PBS) or PFFs were administered to the unilateral striatum of bacterial artificial chromosome (BAC) transgenic mice expressing familial PD-linked A53T mutant human α-syn (A53T BAC-SNCA transgenic mice), and dopaminergic neurons in the ipsilateral substantia nigra were evaluated with immunohistochemistry using anti p-α-syn. Compared with the PBS group (Fig 1a, b), the PFFs group exhibited p-α-syn aggregates in the SNc (Fig 1c, d).
(a-d) Representative immunohistochemical staining images for p-α-syn in PBS (a and b) and PFFs (c and d)-injected heterozygous A53T BAC-SNCA transgenic mice. Low-power images (a and c) and high-power images focused on the SNc (b and d). Scale bar; a and c, 500 µm; b and d, 100 µm. (e-h) Representative images of AI-based counting (e and f) and manual counting (g and h) in mouse SNcd. Scale bar; e and g, 100 µm; f and h, 20 µm. (i) AI-based counting shows a very strong correlation with manual counting (r = 0.997, ***p < 0.001, Pearson correlation). Each data point represents the total number of TH-positive cells per individual animal, obtained by summing counts across all analyzed sections for that animal. The x-axis shows the AI-based counting results, and the y-axis shows the corresponding manual counting results. Black dots represent PBS-injected mice, red dots represent PFFs-injected mice and the blue shaded area indicates the 95% confidence interval. (j) Bland–Altman plot comparing AI-based and manual TH-positive cell counts in mouse SNcd samples. Differences (Manual − AI) of TH-positive cell counts per animal are plotted against the mean of the two methods. The solid line represents the mean difference (bias), and dashed lines represent the 95% limits of agreement (±1.96 SD). Points are color-coded by intervention (PBS and PFFs). (k) Both AI-based and manual counting detect significant decrease in TH-positive cells in the PFFs group compared to the PBS group. Mean TH-positive cell counts per animal obtained by AI-based counting are 267 ± 26 for PBS mice and 111 ± 36 for PFFs mice, while those obtained by manual counting are 250 ± 24 for PBS mice and 98 ± 29 for PFFs mice. Each data point represents the total TH-positive cell count per individual animal, with bars indicating the mean and error bars representing the standard deviation (SD). ***p < 0.001, Welch’s t-test. (l and m) Specimens with weak staining show increased false negatives (m, arrows). Scale bar; l, 50 µm; m, 20 µm.
Validation of AI-based counting against manual counting in mouse SNcd samples
For mouse and marmoset sample counting, SNc was divided into four segments according to the morphology, size, and distribution of the cell as per a previous study [30]: dorsal (SNcd), medial (SNcm), ventral (SNcv), and lateral (SNcl). In this study we focused on the SNcd, where neuronal loss is reported to be the most severe in PD [10].
To evaluate the applicability of AI-based counting for TH-positive cell quantification, we compared its performance with conventional manual counting by an expert with more than five years of experience (Fig 1e–h). AI-based counting showed a very strong correlation with expert manual counting (r = 0.997, p < 0.001, Pearson correlation, Fig 1i), supporting the feasibility of AI-based quantification in mouse SNcd samples. Bland–Altman analysis further demonstrated good agreement between AI-based and manual counting, revealing a small systematic bias in which AI-based counting slightly overestimated TH-positive cell numbers compared with manual counting, with narrow limits of agreement (−29.4 to −1.1 cells; Fig 1j). Importantly, no trend toward increased differences at higher or lower cell counts was observed, indicating the absence of proportional bias. Together, these results support the quantitative reliability of AI-based counting in mouse SNcd samples. Consistent with manual counting, AI-based counting successfully identified a significant reduction in TH-positive cells in the PFFs group (p < 0.001, Welch’s t-test, Fig 1k), demonstrating that AI-based quantification preserves biologically meaningful group differences. The performance metrics yielded a precision of 81.4%, a recall of 88.2%, and an F1-score of 84.7% (Table 1). S1 Table in S1 File provides raw counting data, including true positive (TP), false positive (FP), and false negative (FN) values based on manual counting.
Additionally, the accuracy of AI-based counting was influenced by staining quality. When a different TH antibody (Millipore, #AB152, rabbit, 1:800 dilution) was used [10], the lower staining intensity increased false negatives (Fig 1l, m), highlighting the importance of contrast in AI-based quantification.
Manual counting showed intra-examiner variability over time in repeated counting experiment for selected mouse samples
To examine over time variability of AI-based and manual counting, we performed repeated counting experiment using one mouse each from the PBS and the PFFs groups. AI-based counting showed consistent results over time, whereas manual counting exhibited both inter- and intra-examiner variability. Non-experts (less than 5 years of manual counting experience) exhibited greater intra-examiner variability than an expert, as indicated by the higher coefficients of variation (CV) in Table 2. These results suggest that, within this limited repeated-counting experiment, the variability of manual counting can be influenced by the examiner’s experience.
Validation of AI-based counting against manual counting in marmoset SNcd samples
Next, we revealed that our neural network is applicable for another PD model, using PFFs-treated marmoset samples from the previous report [3]. Since there is no expert in counting marmoset TH-positive cells, three non-experts (examiners 1–3) independently counted TH-positive cells and compared the results with AI-based counting. AI-based counting could identify TH-positive cells comparable to manual counting (Fig 2a–2d). AI-based counting showed strong correlations with each of manual counting (r = 0.96, 0.95, 0.95; p < 0.001, Pearson correlation; Fig 2e–g), with performance metrics of 91.3% precision, 87.8% recall, and 89.5% F1-score (Table 1). Bland–Altman analysis demonstrated good agreement between AI-based counting and the averaged manual counts obtained from three independent examiners, with a small systematic bias and narrow limits of agreement (−23.8 to +8.8 cells; Fig 2h). Differences between the two methods were largely consistent across the range of cell counts, although a minor trend toward more negative differences at higher values was noted. These results further support the quantitative reliability of AI-based counting for TH-positive cell quantification in marmoset samples. The original data are presented in S2 Table in S1 File.
(a-d) Representative images of AI-based counting (a and b) and manual counting (c and d) results in marmoset SNcd. Scale bar; a and c, 200 µm; b and d, 50 µm. (e-g) AI-based counting shows strong correlations with each of the three manual counting (r = 0.96, 0.95, 0.95; ***p < 0.001, Pearson correlation). Each data point represents the TH-positive cell count for an individual unilateral SNc obtained from a single analyzed section. The x-axis shows the AI-based counting results, and the y-axis shows the corresponding manual counting results. The blue shaded area indicates 95% confidence interval. (h) Bland–Altman plot comparing AI-based and manual TH-positive cell counts in marmoset SNcd samples. Differences (Manual − AI) between AI-based counts and the averaged manual counts for an individual unilateral SNc from three independent examiners are plotted against their mean. The solid line indicates the mean difference (bias), and dashed lines indicate the 95% limits of agreement (±1.96 SD).
When we counted TH-positive cells in the marmoset samples using the mouse neural network, the results showed an increase of false negatives (S1a Fig in S1 File). These results indicate that species-specific neural network models are required to achieve optimal detection performance across different species.
Applicability of AI-based counting to human SNc samples
We further examined whether our AI-based counting method is applicable for quantifying human dopaminergic neurons by analyzing human PD patient samples and age-matched control samples from our previous report [31]. Representative images of AI-based counting could identify TH-positive cells comparable to manual counting (Fig 3a–d). AI-based counting showed strong correlations with manual counting performed by three independent examiners (r = 0.96, 0.97, 0.98; p < 0.001, Pearson correlation; Fig 3e–g). Given the limited number of available human sections and the absence of an established reference standard, agreement analysis was not used as a primary quantitative validation for human samples. Performance metrics from human samples are summarized in Table 1 and indicate moderate detection performance under the tested conditions. Consistent with manual counting, AI-based counting detected a trend toward a decrease in TH-positive cell density in PD patient samples compared to controls; however, the difference did not reach statistical significance (n.s., Welch’s t-test; Fig 3h). The analysis was based on a limited number of sections (three sections from 3 PD patients and six sections from 5 control subjects). The original data are presented in S3 Table in S1 File.
(a-d) Representative image of AI-based counting (a and b) and manual counting (c and d) results in the human SNc. Scale bar; a and c, 500 µm, b and d, 100 µm. (e-g) AI-based counting shows a strong correlation with each of the three manual counting (r = 0.96, 0.97, 0.98; ***p < 0.001, Pearson correlation). Each data point represents the TH-positive cell count for an individual unilateral SNc from a single analyzed section. The x-axis and y-axis show the AI-based and corresponding manual counting results, respectively. Black dots indicate control sections, red dots indicate PD patient sections, and the blue shaded area represents the 95% confidence interval. (h) Both AI-based and manual counting detect a trend toward a decrease in TH-positive cell density in PD patient samples compared to controls, though the difference is not statistically significant. The mean TH-positive cell density in the SNc (cells/mm2) per section obtained by AI-based counting is 20 ± 11 for controls and 11 ± 8 for PD patients, while that obtained by manual counting is 26 ± 11 for controls and 12 ± 9 for PD patients. Each data point represents the TH-positive cell density measured in an individual unilateral SNc from a single section, with bars representing the mean and error bars representing the SD. n.s., Welch’s t-test.
Discussion
Here, we demonstrated that our AI-based counting method enables robust and reproducible quantification of TH-positive cells in comparison with conventional manual counting across mouse, marmoset, and human samples, within the scope of the datasets analyzed. Notably, while no established method exists for automatically counting TH-positive cells in non-human primates, our study provides the first demonstration of the applicability of AI-based counting to a marmoset PD model, supported by independent manual counting by multiple examiners. These results underscore the robustness of the method in a biologically and technically challenging primate model. In addition, repeated counting experiments indicated that AI-based counting yielded stable results over time, whereas manual counting exhibited examiner-dependent variability under the tested conditions. These findings suggest that AI-based approaches may contribute to improved objectivity and reproducibility in dopaminergic neuron quantification.
To our knowledge, multiple deep-learning–based approaches have been reported for automated quantification of TH-positive dopaminergic neurons in the mouse SNc [23–27]. The performance metrics of these models are summarized in Table 3. One of these models exhibited 69.0% precision, 79.3% recall, and 73.0% F1-score, which were slightly lower than those of the present study [25]. The remaining models showed high performance, with precision ranging from 86.0–95.3%, recall from 87.8–95.5%, and F1-scores from 88.0–95.3% [23,24,26,27]. These results indicate that CNN-based AI enables automated TH-positive cell counting, which is difficult to achieve using classical image-processing approaches. Most previously reported models were primarily evaluated using neurotoxic PD models, such as 6-hydroxydopamine (6-OHDA) [23] or 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine (MPTP) models [24,25], which are known to induce large, dose-dependent reductions in TH-positive neurons (approximately 50–90%) [32,33]. More recently, Chen et al. demonstrated a deep-learning–based quantification pipeline validated in both neurotoxic (MPTP) and PFF models [27], indicating that CNN-based approaches can be applied across distinct PD paradigms. In addition, Haghighi et al. also reported a CNN-based quantification framework applied to PD model mice [26], although the specific PD induction method was not explicitly categorized. Notably, α-synuclein PFF models are generally characterized by more gradual and modest reductions in TH-positive neurons compared with neurotoxic models. The present study focused on the α-syn PFFs model, which has been reported to show an approximately 38% decrease in TH-positive cells at 2 months post-injection [34]. Our results demonstrate that the proposed AI-based approach can reliably quantify TH-positive neurons even under conditions of relatively small neuronal loss. Taken together, these findings indicate that the method implemented in this study achieves performance comparable to previously reported CNN-based approaches while being applicable to PD models with varying degrees of neuronal loss. In addition, by leveraging a commercially available, user-friendly AI platform, our approach facilitates efficient training and application of deep-learning–based neuron quantification without requiring specialized programming expertise, thereby lowering the technical barrier for routine use in experimental neuroscience and histopathological analysis. Recent advances in generalizable segmentation frameworks, such as Cellpose [35], have improved the accuracy and robustness of automated cell detection across diverse imaging conditions. Integration of such advanced segmentation approaches may further enhance the accuracy and scalability of AI-based quantification of TH-positive neurons in future applications.
Optimized immunohistochemical processes remain critical for reliable AI-based quantification, as staining intensity directly affects image contrast during whole-slide processing and leads to variability in AI counting results, even though the training dataset incorporated reference images spanning a wide range of staining intensities. The main factors that decrease the signal-to-noise ratio (S/N ratio) of a specimen are antibody and diaminobenzidine (DAB) staining. We used two different primary antibodies for TH staining of specimens [10], however one showed a weaker staining and a lower S/N ratio than the other, making accurate measurement difficult. Therefore, it is important to compare multiple antibodies, select the antibody with the highest S/N ratio. Although DAB staining [36,37] is considered sensitive [38], it should be kept in mind that the staining intensity is strongly influenced by horseradish peroxidase (HRP) enzyme activity [39,40], reaction time, temperature, and HRP substrate concentration [41].
In conclusion, our AI-based counting method provides a reliable and objective approach for quantifying TH-positive cells in PD model animals. While further validation using larger and more diverse human datasets will be required, this method represents a practical alternative to conventional manual counting and has the potential to facilitate quantitative and reproducible analyses of dopaminergic neuron populations in experimental studies of PD.
Limitations of the study
A limitation of the present study is the restricted number of human sections available for analysis, which resulted in limited sampling coverage along the rostro–caudal axis of the SNc. This constraint likely reduced the statistical power to detect neuronal loss in PD samples, despite the well-established pathology. Future studies using larger cohorts and anatomically systematic sampling will be required to fully validate the applicability of AI-based counting in human PD tissue.
Another limitation relates to the performance of TruAI in regions with extreme cell density or severe overlap of TH-positive neurons. TruAI relies on pattern recognition learned from reference images rather than explicit rule-based separation of densely packed cells; therefore, detection accuracy in such regions depends on whether similar patterns were sufficiently represented in the training dataset. In the present study, no regions exhibiting extreme cellular crowding that would compromise reliable cell identification were observed. Nevertheless, under conditions of unusually high cell density, detection accuracy may be reduced, and careful visual validation remains important.
Materials and methods
Ethical approval
All experimental procedures were performed in accordance with the national guidelines and the ARRIVE (Animal Research: Reporting of In Vivo Experiments) guidelines. This study was approved by the Animal Research Committee of Kyoto University (mouse: MedKyo 24322; marmoset: MedKyo 16651) and the local ethics committee (human: R1038).
Mouse samples
Seventeen heterozygous A53T BAC-SNCA transgenic mice [10] were used in this study (9 PBS-treated and 8 PFF-treated mice). To confirm the genotype of the A53T BAC-SNCA transgenic mice, polymerase chain reaction was performed as previously described [10]. From these animals, a total of 130 sections were obtained. Of these sections, 4 sections from 1 PBS-treated and 6 sections from 2 PFF-treated mice were used as reference images, and the remaining 72 sections from 8 PBS-treated and 48 sections from 6 PFF-treated mice were used for cell counting. The ipsilateral SNcd corresponding to the PBS- or PFF-treated side was analyzed in each section.
Preparation of recombinant mouse α-syn monomers and PFFs
Mouse α-syn monomers and PFFs were synthesized as the previous reports [10,34,42–44], with minor modifications. Briefly, α-syn was expressed in Escherichia coli BL21 (DE3) (BioDynamics Laboratory) and purified by boiling and subsequent ion exchange chromatography using a Q Sepharose Fast Flow (GE Healthcare) system. After dialyzed against a dialysis buffer (150 mM KCl, 50 mM Tris-HCl, pH 7.5), the α-syn solution (5 mg/ml) was agitated at 37°C at 1000 rpm for 7 days and stored at −80°C. PFF was thawed and sonicated for 60 min (30 sec on/30 sec off) before injection.
As previously reported [34], 3-month-old mice were anesthetized with isoflurane and then stereotaxically inoculated with 2 µl of α-syn PFFs or PBS into the unilateral dorsal striatum (coordinates: Anterior-Posterior +0.2 mm relative to the bregma, Medial-Lateral +2.0 mm from the midline, Dorsal-Ventral −2.6 mm from the skull surface) using a 33-gauge Neuros syringe (Hamilton company).
Immunohistochemistry for mouse samples
Two months after the injection of PFFs or PBS, mice were anesthetized with sevoflurane and transcardially perfused with cold PBS followed by 4% (w/v) paraformaldehyde (PFA) in PBS at 4°C overnight. Paraffinized brains were sectioned coronally in 8 µm thick on a microtome (EG1150, Leica Microsystems). For TH-positive cell counting, the entire SNc was sampled every 10th coronal section (80 µm apart) across its full rostro–caudal extent.
Immunohistochemical staining was performed as described previously [10,34], with minor modifications. The following primary antibodies were used; anti-phosphorylated-α-syn (p-α-syn) (Abcam, #ab51253, Rabbit, 1:20,000 dilution) and anti-TH (Millipore, #MAB318, Mouse, 1:5,000 dilution). The sections were incubated at 4°C with the primary antibodies for 2 days, following which appropriate secondary antibodies were added for p-α-syn (Nichirei Biosciences, # 414341) and TH (Nichirei Biosciences, #414322). The samples were processed for visualization using a peroxidase stain DAB kit (Nacalai tesque).
Marmoset samples
Twenty-four sections from a two-year-old female marmoset (I5937; born at CLEA Japan, Inc.) that had received unilateral olfactory bulb injections of α-Syn PFFs were re-used from our previous study [10]. Of these sections, 5 were used as reference images, and the remaining 19 sections were used for cell counting. The bilateral SNcd were analyzed in each section.
Human samples
Human samples were re-used from our previous study [31] and consisted of 4 sections from 4 patients with PD and 11 sections from 5 age-matched control subjects. Of these sections, 1 section from 1 PD patient and 5 sections from 3 controls were used as reference images, and the remaining 3 sections from 3 PD patients and 6 sections from 5 controls were used for cell counting. The unilateral SNc was analyzed in each section.
Image acquisition of mouse, marmoset, and human samples
Whole-slide images of TH-immunostained coronal sections were acquired using a slide scanner (SLIDEVIEW VS200, OLYMPUS, Tokyo, Japan).
AI-based counting of TH-positive cells in mouse, marmoset, and human samples
AI-based enumeration of TH-positive cells was performed using a commercially available CNN-based platform (TruAI, OLYMPUS, Tokyo, Japan). The SNcd in mouse and marmoset samples, or the SNc of human samples, was defined as the region of interest (ROI). ROI segmentation was manually performed by an expert for mouse samples and by three non-experts for marmoset and human.
For reference images, SNcd of mouse and marmoset, or SNc of human were labeled as the background (in green), including regions outside SNcd or SNc that lacked TH-positive cells. Intact TH-positive cells with visible nuclei in SNcd or SNc were labeled as positive data (in red). In total, 1,353 mouse, 1,313 marmoset, and 1,398 human dopaminergic neurons were annotated and incorporated into the training datasets.
Reference images were intentionally selected to cover a wide range of TH staining intensities, from weak to strong signals, to ensure robust performance under realistic histopathological conditions. Representative low-intensity regions of the reference images are shown in S1b-d Fig in S1 File. In addition, images from PFF-treated mice and marmosets and PD patients were included to allow the model to learn morphological variations and distortions of TH-positive neurons associated with pathological conditions. These reference images were used exclusively for neural network training and were not included in the cell counting analysis to ensure an unbiased assessment of neural network performance and to prevent data leakage [45].
Neural networks were trained independently for each species for 25,000 iterations. The resulting similarity scores were 0.71 for mouse, 0.70 for marmoset, and 0.56 for human datasets. Following training, TH-positive cells within each ROI were automatically counted using the trained neural networks. Detailed parameter settings used for network training are summarized in the Supplementary Methods. The trained TruAI neural network files are provided as supplementary datasets entitled “neuralnetwork_THcell_mouse.nn,” “neuralnetwork_THcell_marmoset.nn,” and “neuralnetwork_THcell_human.nn.” The workflows for AI-based and manual counting are shown in S2a Fig in S1 File (mouse), S3 Fig in S1 File (marmoset), and S4 Fig in S1 File (human).
Manual counting of TH-positive cells in mouse, marmoset, and human samples
Intact TH-positive cells with visible nuclei in each ROI were counted using the multipoint tool in Fiji (ImageJ, Version 1.54j, National Institutes of Health, USA) [46]. Manual counting was performed by an expert (T.T.) for mouse samples and by three non-experts (K.K., M.S., and T.H.) for marmoset and human samples, and compared with AI counting. The expert tester (T.T.) is a board-certified neurologist specializing in Parkinson’s disease, with 13 years of clinical experience and over eight years of research experience in Parkinson’s disease model mice, including neuropathological evaluation and accurate quantification of dopaminergic neuronal loss [10].
Repeated counting experiment for selected mouse samples
One mouse from each of the PBS and the PFFs groups was selected, and the images were randomly reordered to avoid counting bias influenced by memory. Counting was repeated three times at 1-week intervals by an expert (T.T.), two non-experts (K.K. and M.S.), and AI (S2b Fig in S1 File). The multipoint tool in Fiji (ImageJ, Version 1.54i, National Institutes of Health, USA) [46] was used for manual counting.
Performance metrics of AI-based counting
The corresponding images of AI-based and manual counting were opened in Fiji (ImageJ, Version 1.54j, National Institutes of Health, USA) [46]. The SIFT algorithm was used to extract feature points [14] an affine transformation was performed using a plugin, and both counting images were aligned to create a stack image. Using manual counting as a standard (for mouse samples, counting by T.T.; for marmoset and human samples, counting by K.K.), TP, FP, and FN AI counting results were determined visually on the digital stacked images, and the performance of AI-based counting was evaluated by calculating the precision, recall, and F1-score [47].
Statistical analysis
All statistical analysis were conducted using R (version 4.4.1) [48]. All data are shown as mean ± standard deviation (SD). SD was used differently depending on the experimental context. The population standard deviation was used to assess intra-examiner variability within the same sample through repeated counting. The sample standard deviation was used to compare between different samples, such as between manual and AI-based counting methods or between disease model and control groups. For human samples, TH-positive cell density was calculated by normalizing cell counts to the area of the corresponding ROI and is expressed as cells/mm². Pearson correlation coefficient was used to analyze the correlation between AI-based and manual counting. Agreement between AI-based and manual counting was further evaluated using Bland–Altman analysis, in which the difference between the two methods (Manual − AI) was plotted against their mean. The mean difference (bias) and the 95% limits of agreement were calculated as the mean difference ± 1.96 standard deviations. Welch’s t-test was used to compare TH-positive cell counts between the mouse PBS and PFFs groups, and TH-positive cell density in the SNc between human control and PD patients. Statistical significance was set at 5% for all analyses. Asterisks represent level of significance: *p < 0.05; **p < 0.01; ***p < 0.001.
Supporting information
S1 File. Supplementary material including supplementary methods, legends for Supplementary Figures S1–S4, and S1–S3 Tables.
https://doi.org/10.1371/journal.pone.0344621.s001
(ZIP)
Acknowledgments
We gratefully acknowledge Rie Hikawa, Yukako Hatate, Jun Ueda, Miki Oono, Masashi Ikuno, Satomi Toda, Junichiro Ohira, Yuji Narumiya, Jiarui Chang for technical assistance and Masateru Taniguchi for technical advice.
References
- 1. Morris HR, Spillantini MG, Sue CM, Williams-Gray CH. The pathogenesis of Parkinson’s disease. Lancet. 2024;403(10423):293–304. pmid:38245249
- 2. Koprich JB, Kalia LV, Brotchie JM. Animal models of α-synucleinopathy for Parkinson disease drug development. Nat Rev Neurosci. 2017;18(9):515–29. pmid:28747776
- 3. Sawamura M, Onoe H, Tsukada H, Isa K, Yamakado H, Okuda S, et al. Lewy body disease primate model with α-synuclein propagation from the olfactory bulb. Mov Disord. 2022;37(10):2033–44. pmid:35989519
- 4. Sawamura M, Ohira J, Hikawa R, Ishimoto T, Nakanishi E, Yamakado H, et al. Single-dose intranasal administration of α-syn PFFs induce lewy neurite-like pathology in olfactory bulbs. Parkinsonism Relat Disord. 2023;112:105440. pmid:37267818
- 5. Nagatsu T, Nakashima A, Ichinose H, Kobayashi K. Human tyrosine hydroxylase in Parkinson’s disease and in related disorders. J Neural Transm (Vienna). 2019;126(4):397–409. pmid:29995172
- 6. Rausch W-D, Wang F, Radad K. From the tyrosine hydroxylase hypothesis of Parkinson’s disease to modern strategies: a short historical overview. J Neural Transm (Vienna). 2022;129(5–6):487–95. pmid:35460433
- 7. White RB, Thomas MG. Moving beyond tyrosine hydroxylase to define dopaminergic neurons for use in cell replacement therapies for Parkinson’s disease. CNS Neurol Disord Drug Targets. 2012;11(4):340–9. pmid:22483315
- 8. Luk KC, Kehm V, Carroll J, Zhang B, O’Brien P, Trojanowski JQ, et al. Pathological α-synuclein transmission initiates Parkinson-like neurodegeneration in nontransgenic mice. Science. 2012;338(6109):949–53. pmid:23161999
- 9. Tran HT, Chung CH-Y, Iba M, Zhang B, Trojanowski JQ, Luk KC, et al. Α-synuclein immunotherapy blocks uptake and templated propagation of misfolded α-synuclein and neurodegeneration. Cell Rep. 2014;7(6):2054–65. pmid:24931606
- 10. Taguchi T, Ikuno M, Hondo M, Parajuli LK, Taguchi K, Ueda J, et al. α-Synuclein BAC transgenic mice exhibit RBD-like behaviour and hyposmia: a prodromal Parkinson’s disease model. Brain. 2020;143(1):249–65. pmid:31816026
- 11. Jordan MI, Mitchell TM. Machine learning: trends, perspectives, and prospects. Science. 2015;349(6245):255–60. pmid:26185243
- 12. Falk T, Mai D, Bensch R, Çiçek Ö, Abdulkadir A, Marrakchi Y, et al. U-Net: deep learning for cell counting, detection, and morphometry. Nat Methods. 2019;16(1):67–70. pmid:30559429
- 13. Lindeberg T. Feature detection with automatic scale selection. Int J Comput Vis. 1998;30:79–116.
- 14. Lowe DG. Distinctive image features from scale-invariant keypoints. Int J Comput Vis. 2004;60:91–110.
- 15. Mikolajczyk K, Schmid C. Scale & affine invariant interest point detectors. Int J Comput Vis. 2004;60:63–86.
- 16. Mikolajczyk K, Schmid C. Performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell. 2005;27(10):1615–30. pmid:16237996
- 17. Bay H, Ess A, Tuytelaars T, Van Gool L. Speeded-up robust features (SURF). Comput Vis Image Underst. 2008;110(3):346–59.
- 18. Sanka I, Bartkova S, Pata P, Smolander O-P, Scheler O. Investigation of different free image analysis software for high-throughput droplet detection. ACS Omega. 2021;6(35):22625–34. pmid:34514234
- 19. Bonaccorso Marinelli MP, Baiardi G, Valdez SR, Cabrera RJ. Automated quantification of dopaminergic immunostained neurons in substantia nigra using freely available software. Med Biol Eng Comput. 2022;60(10):2995–3007. pmid:36018532
- 20. Xu J, Wang A, Wang Y, Li J, Xu R, Shi H. AICellCounter: a machine learning-based automated cell counting tool requiring only one image for training. Neurosci Bull. 2023;39(1):83–8. pmid:35704210
- 21. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436–44. pmid:26017442
- 22. Li Z, Liu F, Yang W, Peng S, Zhou J. A survey of convolutional neural networks: analysis, applications, and prospects. IEEE Trans Neural Netw Learn Syst. 2022;33(12):6999–7019. pmid:34111009
- 23. Penttinen A-M, Parkkinen I, Blom S, Kopra J, Andressoo J-O, Pitkänen K, et al. Implementation of deep neural networks to count dopamine neurons in substantia nigra. Eur J Neurosci. 2018;48(6):2354–61. pmid:30144349
- 24. Zhao S, Wu K, Gu C, Pu X, Guan X. SNc Neuron Detection Method Based on Deep Learning for Efficacy Evaluation of Anti-PD Drugs. In: 2018 Annual American Control Conference (ACC); 2018 Jun 27-29. Milwaukee, WI. Piscataway (NJ): IEEE; 2018. 1981–6.
- 25. Kim D, Bak MS, Park H, Baek IS, Chung G, Park JH, et al. An Automated Cell Detection Method for TH-positive Dopaminergic Neurons in a Mouse Model of Parkinson’s Disease Using Convolutional Neural Networks. Exp Neurobiol. 2023;32(3):181–94. pmid:37403226
- 26. Haghighi F, Ghosh S, Ngu H, Chu S, Lin H, Hejrati M, et al. Self-supervised learning for segmentation and quantification of dopamine neurons in Parkinson’s disease. Trans Mach Learn Res. 2023.
- 27. Chen J, Meng Q, Zhang Y, Liang Y, Ding J, Xia X, et al. TrueTH: A user-friendly deep learning approach for robust dopaminergic neuron detection. Neurosci Lett. 2024;836:137871. pmid:38857698
- 28. Shimozawa A, Ono M, Takahara D, Tarutani A, Imura S, Masuda-Suzukake M, et al. Propagation of pathological α-synuclein in marmoset brain. Acta Neuropathol Commun. 2017;5(1):12. pmid:28148299
- 29. Uemura N, Marotta NP, Ara J, Meymand ES, Zhang B, Kameda H, et al. α-Synuclein aggregates amplified from patient-derived Lewy bodies recapitulate Lewy body diseases in mice. Nat Commun. 2023;14(1):6892. pmid:37898614
- 30. Fu Y, Yuan Y, Halliday G, Rusznák Z, Watson C, Paxinos G. A cytoarchitectonic and chemoarchitectonic analysis of the dopamine cell groups in the substantia nigra, ventral tegmental area, and retrorubral field in the mouse. Brain Struct Funct. 2012;217(2):591–612. pmid:21935672
- 31. Li F, Ayaki T, Maki T, Sawamoto N, Takahashi R. NLRP3 inflammasome-related proteins are upregulated in the putamen of patients with multiple system atrophy. J Neuropathol Exp Neurol. 2018;77(11):1055–65. pmid:30295793
- 32. Hernandez-Baltazar D, Mendoza-Garrido ME, Martinez-Fong D. Activation of GSK-3β and caspase-3 occurs in Nigral dopamine neurons during the development of apoptosis activated by a striatal injection of 6-hydroxydopamine. PLoS One. 2013;8(8):e70951. pmid:23940672
- 33. Dovonou A, Bolduc C, Soto Linan V, Gora C, Peralta Iii MR, Lévesque M. Animal models of Parkinson’s disease: bridging the gap between disease hallmarks and research questions. Transl Neurodegener. 2023;12(1):36. pmid:37468944
- 34. Okuda S, Uemura N, Sawamura M, Taguchi T, Ikuno M, Uemura MT, et al. Rapid induction of dopaminergic neuron loss accompanied by lewy body-like inclusions in A53T BAC-SNCA transgenic mice. Neurotherapeutics. 2022;19(1):289–304. pmid:34935120
- 35. Stringer C, Wang T, Michaelos M, Pachitariu M. Cellpose: a generalist algorithm for cellular segmentation. Nat Methods. 2021;18(1):100–6. pmid:33318659
- 36. Brey EM, Lalani Z, Johnston C, Wong M, McIntire LV, Duke PJ, et al. Automated selection of DAB-labeled tissue for immunohistochemical quantification. J Histochem Cytochem. 2003;51(5):575–84. pmid:12704205
- 37. Ip CW, Cheong D, Volkmann J. Stereological estimation of dopaminergic neuron number in the mouse substantia nigra using the optical fractionator and standard microscopy equipment. J Vis Exp. 2017. pmid:28892024
- 38. Coventry BJ, Neoh SH, Mantzioris BX, Skinner JM, Zola H, Bradley J. A comparison of the sensitivity of immunoperoxidase staining methods with high-sensitivity fluorescence flow cytometry-antibody quantitation on the cell surface. J Histochem Cytochem. 1994;42(8):1143–7. pmid:8027533
- 39. Morales Urrea DA, Haure PM, García Einschlag FS, Contreras EM. Horseradish peroxidase-mediated decolourization of Orange II: modelling hydrogen peroxide utilization efficiency at different pH values. Environ Sci Pollut Res Int. 2018;25(20):19989–20002. pmid:29744778
- 40. Morales-Urrea D, López-Córdoba A, Contreras EM. Inactivation kinetics of horseradish peroxidase (HRP) by hydrogen peroxide. Sci Rep. 2023;13(1):13363. pmid:37591893
- 41. Gonda K, Watanabe M, Tada H, Miyashita M, Takahashi-Aoyama Y, Kamei T, et al. Quantitative diagnostic imaging of cancer tissues by using phosphor-integrated dots with ultra-high brightness. Sci Rep. 2017;7(1):7509. pmid:28790306
- 42. Masuda-Suzukake M, Nonaka T, Hosokawa M, Kubo M, Shimozawa A, Akiyama H, et al. Pathological alpha-synuclein propagates through neural networks. Acta Neuropathol Commun. 2014;2:88. pmid:25095794
- 43. Ihse E, Yamakado H, van Wijk XM, Lawrence R, Esko JD, Masliah E. Cellular internalization of alpha-synuclein aggregates by cell surface heparan sulfate depends on aggregate conformation and cell type. Sci Rep. 2017;7(1):9008. pmid:28827536
- 44. Uemura N, Yagi H, Uemura MT, Hatanaka Y, Yamakado H, Takahashi R. Inoculation of α-synuclein preformed fibrils into the mouse gastrointestinal tract induces Lewy body-like aggregates in the brainstem via the vagus nerve. Mol Neurodegener. 2018;13(1):21. pmid:29751824
- 45. Lones MA. Avoiding common machine learning pitfalls. Patterns (N Y). 2024;5(10):101046. pmid:39569205
- 46. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, et al. Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012;9(7):676–82. pmid:22743772
- 47. Rainio O, Teuho J, Klén R. Evaluation metrics and statistical tests for machine learning. Sci Rep. 2024;14(1):6086. pmid:38480847
- 48.
R Core Team. R: A Language and Environment for Statistical Computing. Vienna (Austria): R Foundation for Statistical Computing; 2024. Available from: https://www.R-project.org/