Figures
Abstract
Reconstructing causality routes from genetic effects to complex phenotypes in particular cell types is crucial for understanding biological mechanisms underlying the brain-associated phenotypes including imaging-derived phenotypes (IDPs), and brain disorders and behaviors (DBs). Here, we develop a single-cell Mendelian randomization framework to infer cell type-specific causal relationships between gene expression and diverse brain-associated complex phenotypes by integrating single-cell expression quantitative trait loci (cis-eQTLs) and genome-wide association study findings. We identifiy a set of 254 and 217 cis-eQTL target genes (eGenes) that may have causal effects on 112 IDPs and 26 DBs in eight cell types, respectively. These causal eGenes exhibit strong cell type specificity and varied pleiotropy among different types of brain-associated phenotypes. Further integrative analysis reveals putative causality routes among cell type-specific causal eGenes and brain-associated complex phenotypes. Finally, we characterize the spatiotemporal expression patterns of these causal eGenes, and highlight the coordinated associations of the brain-associated phenotypes based on the expression of their causal eGenes. Overall, our study presents a large-scale analysis of the genetic effects of brain structures, disorders and behaviors, providing a catalog of cell type-specific causal eGenes.
Author summary
Our understanding of how genes influence brain phenotypes at the cell type level remains limited. Although genome-wide association studies (GWASs) have identified numerous genetic variants associated with brain structure and disease, the majority reside in non-coding regions and do not directly implicate specific genes or cell types. Here, we integrate GWAS summary statistics with cis-expression quantitative trait loci (cis-eQTL) data from eight brain cell types to identify genes whose expression levels may causally influence 149 brain phenotypes spanning structural features, disorders, and behaviors. Using Mendelian randomization, we detect hundreds of such genes with cell type–specific effects. These eGene–phenotype associations are highly cell type–specific, and many putatively causal genes are shared across phenotype categories. Integrating these findings, we reconstruct putative routes linking cell type–specific causal genes to brain structure and disease. We further reveal spatiotemporal expression patterns of these genes across brain phenotypes and developmental stages. Together, these findings provide a cell type–specific portrait of the genetic contributions to brain phenotypes.
Citation: Yang A, Zhao X, Zhao X-M, Yang YT (2026) Deciphering cell type-specific causal genetic effects on brain imaging-derived phenotypes and disorders with single-cell Mendelian randomization. PLoS Comput Biol 22(6): e1014422. https://doi.org/10.1371/journal.pcbi.1014422
Editor: Daifeng Wang, University of Wisconsin Madison, UNITED STATES OF AMERICA
Received: May 16, 2025; Accepted: June 7, 2026; Published: June 17, 2026
Copyright: © 2026 Yang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The scripts in the study are publicly available via GitHub at https://github.com/sldrcyang/ExpMR. The predicted cell type-specific causal eGenes, and their associated IDPs and DBs can be freely downloaded in our supplementary website (https://mai.fudan.edu.cn/expmr). The cell type-specific cis-eQTL summary statistics dataset was downloaded from https://zenodo.org/record/6104982#.Y1a2WbZBxPY. The GWAS summary statistics datasets are available from the URLs: CI from https://www.nhlbi.nih.gov/grasp/downloads/ResultsFebruary2017/2014/2014_Benyamin_CHIC; EA from https://zenodo.org/records/18584527/files/GWAS_EA_excl23andMe.zip?download=1; INT from https://ctg.cncr.nl/documents/p1651/SavageJansen_IntMeta_sumstats.zip; NEU from https://ctg.cncr.nl/documents/p1651/sumstats_neuroticism_ctg_format.txt.gz; ASP from https://zenodo.org/records/18584527/files/AUTOMOBILE_SPEEDING_PROPENSITY_GWAS.zip?download=1; DPW from https://conservancy.umn.edu/handle/11299/201564; GRT from https://zenodo.org/records/18584527/files/RISK_behavior.RISK_GWAS_MA_UKB+replication.zip?download=1; SC from https://conservancy.umn.edu/handle/11299/201564; INS from https://ctg.cncr.nl/documents/p1651/insomnia_ukb2b_EUR_sumstats_20190311_with_chrX_mac_100.txt.gz; EPI from https://www.epigad.org/gwas_ilae2018_16loci/all_epilepsy_METAL.gz; ICH from https://personal.broadinstitute.org/ryank/3980413.Woo.2014.zip; IS from http://megastroke.org/; AD from https://ctg.cncr.nl/documents/p1651/AD_sumstats_Jansenetal.txt.gz; ALS from https://zenodo.org/records/18584527/files/alsMetaSummaryStats_march21st2018.tab.zip?download=1; MS from https://imsgc.net/data/discovery_metav3.0.meta.gz; PD from https://zenodo.org/records/18584527/files/nallsEtAl2019_excluding23andMe_allVariants.tab.zip?download=1; OCD from https://doi.org/10.6084/m9.figshare.14672103; TS from https://doi.org/10.6084/m9.figshare.14672232; ANX from https://zenodo.org/records/18584527/files/TotAnx_OR_sumstats.zip?download=1; MDD from https://doi.org/10.6084/m9.figshare.14672085; ADHD from https://doi.org/10.6084/m9.figshare.14671965; ASD from https://doi.org/10.6084/m9.figshare.14671989; AUD from https://doi.org/10.6084/m9.figshare.14672187; PTSD from https://ipsych.dk/fileadmin/ipsych.dk/Downloads/daner_woautism_ad_sd8-sd6_woautismstress_cleaned.gz; BIP from https://doi.org/10.6084/m9.figshare.14102594; SCZ from https://doi.org/10.6084/m9.figshare.19426775; IDPs from https://www.med.unc.edu/bigs2/data/gwas-summary-statistics/. The pLI scores are downloaded from ExAc (https://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg19&g=gnomadPLI; version v2.1.1). The prediction weights and covariance used for TWAS analysis are downloaded from PredictDB Data Repository (https://predictdb.org/), and CMC-derived DLPFC prediction models (https://github.com/laurahuckins/CMC_DLPFC_prediXcan). The cis-eQTL summary statistics of the BrainMeta portal are downloaded from https://yanglab.westlake.edu.cn/data/brainmeta/cis_eqtl/. The documented associations between genes and complex phenotypes of the NHGRI-EBI GWAS Catalog are downloaded from https://www.ebi.ac.uk/gwas/api/search/downloads/associations/v1.0.2?split=false. The cell type-specific eQTL summary statistics from the brainSCOPE resource are downloaded from https://brainscope.gersteinlab.org/integrative_files.html. The differentially expressed genes associated with AD from Mathys et al. 2023 are downloaded from https://github.com/mathyslab7/ROSMAP_snRNAseq_PFC/tree/main/Results/Differential_gene_expression_analysis.
Funding: X.-M.Z. was supported by National Natural Science Foundation of China (T2225015, 62433008), Shanghai Science and Technology Commission Program (23JS1410100), Major Project of Guangzhou National Laboratory (GZNL2024A01003), National Key R&D Program of China (2023YFF1204800, 2025YFA1309200). Y.T.Y. was supported by Shanghai Science and Technology Commission Program (24JS2810100), Shanghai Municipal Education Commission (24KXZNA11), National Key R&D Program of China (2025YFC3409300). The computations in this research were performed using the CFFF platform of Fudan University. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Recent large-scale genome-wide association studies (GWASs) have identified thousands of genetic variants associated with brain-related complex phenotypes [1,2]. However, interpreting the regulatory roles of these loci remains challenging, as the majority reside in non-coding regions and do not directly implicate specific genes [3]. Expression quantitative trait loci (cis-eQTLs), which link genetic variants to the expression levels of target genes, provide a powerful means of bridging this gap [4,5]. By integrating cis-eQTL data with GWAS findings, Mendelian randomization (MR)—a framework for genetic causal inference—can pinpoint genes whose expression causally influences complex neurological, psychological, and behavioral phenotypes [6–9].
Previous MR studies have largely relied on bulk-tissue cis-eQTLs to identify causal genes for complex phenotypes [7,10,11]. Yet cell type–level cis-eQTLs exhibit larger effect sizes and target more evolutionarily constrained genes than their tissue-level counterparts [12]. suggesting that cellular resolution is critical for accurate causal inference. Recent single-cell eQTL atlases—including OneK1K [13], Bryois et al [12], and MetaBrain [14]—have revealed substantial cell type–specific effects of genetic variants on gene expression regulation. Moreover, cell type–level analyses have uncovered disorder-specific genetic controls that remain invisible to bulk tissue approaches [12,13,15–17]. For instance, most genetic variants associated with Alzheimer’s disease (AD) risk appear to act through cis-eQTLs in microglia, whereas schizophrenia (SCZ) exhibits a more polygenic architecture with signals spanning multiple cell types [12]. These observations indicate that cellular-level regulation is more closely coupled to disease pathogenesis than bulk-level regulation. Nevertheless, how genetic variants contribute to causal gene–phenotype relationships at the cell type level remains poorly understood, motivating the cell type–specific causal inference pursued in this study.
An additional layer of complexity lies in the genetic and cellular links between brain imaging–derived phenotypes (IDPs) and brain disorders and behaviors (DBs). Extensive genetic correlations between imaging features and complex phenotypes have been documented [18–23], and recent large-scale GWASs have identified overlapping loci and genes associated with both brain imaging phenotypes and diverse disorders [19,22,24,25]. Notably, causal eGenes shared across psychiatric, neurodegenerative, and structural brain phenotypes often exhibit cell type–specific expression patterns. For example, AKT3—which is implicated in cortical surface area, Parkinson’s disease (PD), and major depressive disorder—shows astrocyte-specific expression [18], whereas DIP2B, shared between SCZ and brain stem volume, is enriched in excitatory neurons [18]. Consistent with these gene-level observations, the genetic heritability of both IDPs and DBs is significantly enriched in common cell types: oligodendrocytes for white matter microstructure and depression [22,26], and excitatory neurons for total surface area [27] and schizophrenia [28]. Collectively, these findings suggest that shared cellular-level genetic architectures may mediate the relationships between brain structural phenotypes and brain disorders. Characterizing these putative pathways could therefore advance the development of mechanistically informed therapeutic targets.
In this study, we systematically inferred cell type–specific causal relationships between gene expression and diverse brain-associated complex phenotypes. We developed a single-cell MR framework that integrates a recently published cis-eQTL atlas from eight brain cell types with large-scale GWAS findings for 26 DBs (8 behavioral-cognitive phenotypes, 10 psychiatric disorders, and 8 neurological disorders) and 123 IDPs (101 brain regional volumes and 22 white matter tracts). For the identified causal cis-eQTL target genes (eGenes), we examined their cell type specificity and pleiotropic patterns across different phenotype categories. We further reconstructed putative causality routes linking cell type–specific causal eGenes to brain-associated complex phenotypes, and characterized the spatiotemporal expression dynamics of these genes using external single-cell data. Our findings illuminate how cell type–specific gene expression shapes brain structure, disorders, and behaviors through genetically driven regulatory mechanisms, offering insights into both healthy and pathological brain states.
Results
Overview of the study
Dysregulation of gene expression in specific brain cell types is a hallmark of psychiatric and neurological disorders [12,13,15,16,29]. To systematically infer cell type–specific causal relationships between gene expression and brain-associated complex phenotypes, we performed two-sample Mendelian randomization (MR) by integrating GWAS summary statistics for 26 disorders and behaviors (DBs) and 123 imaging-derived phenotypes (IDPs) with a recently published cell type–specific cis-eQTL atlas encompassing eight brain cell types [12] (S1–S3 Tables). The eight cell types were astrocytes, endothelial cells, excitatory neurons, inhibitory neurons, microglia, oligodendrocytes, oligodendrocyte precursor cells (OPCs), and pericytes. The 26 DBs comprised three groups: 8 behavioral-cognitive phenotypes, 8 neurological disorders, and 10 psychiatric disorders (mean GWAS sample size: 284,168; S1 Table). The 123 IDPs comprised two groups: 101 brain regional volumes and 22 white matter microstructure phenotypes measured by mean fractional anisotropy (mean GWAS sample size: 22,072; S2 Table). The overall study design is depicted in Fig 1a.
(a) Workflow. The cis-eQTL data from eight brain cell types were integrated with GWAS summary statistics for 123 IDPs and 26 DBs. TWAS pre-screening was followed by cell type–specific Mendelian randomization to infer putatively causal eGenes for 149 phenotypes, classified into two classes (IDPs and DBs) and five groups (psychiatric disorders, neurological disorders, behavioral–cognitive phenotypes, brain regional volumes, and white matter tracts). Downstream analyses included shared eGene characterization, single-cell expression profiling, and causal route reconstruction. Brain region schematics adapted from the Desikan-Killiany Atlas [34]; white matter tract schematics from the Atlas Track [35]. (b) Heatmap of cell type–specific causal eGene counts across IDPs (only IDPs with >3 eGenes are shown). (c) Heatmap of cell type–specific causal eGene counts across DBs.
Given evidence of bidirectional causal effects between brain disorders and imaging phenotypes [20,30], we first screened for potential eGene–phenotype associations using transcriptome-wide association study (TWAS) [31], retaining 43,494 eGene–DB pairs and 284,927 eGene–IDP pairs at nominal significance (P < 0.05) for subsequent MR analysis. To ensure instrument validity, we selected as instrumental variables (IVs) only those single nucleotide polymorphisms (SNPs) that achieved genome-wide significance (P < 5 × 10-8) in the cis-eQTL data. We manually verified the sample descriptions of all datasets and confirmed the absence of participant overlap between the cis-eQTL catalog and the GWAS cohorts.
To enhance the robustness of causal inference, we applied four complementary MR methods: SMR [6], Wald ratio [32], PMR-Egger [8] and GSMR [33]. These methods differ in their IV selection strategies (single-SNP versus multi-SNP; independent versus correlated SNPs), tolerance for horizontal pleiotropy, and analytical assumptions (S1 Fig and Methods), thereby providing convergent evidence for each inferred association. We then assessed the reliability of the predicted cell type–specific causal eGenes using external functional genomic datasets from the human brain. Next, we characterized the pleiotropic landscape of these causal eGenes across phenotype classes and groups, reconstructed putative causality routes linking cell type–specific eGenes to DBs and IDPs, and investigated the spatiotemporal expression dynamics of these genes using independent single-cell data.
Putative causal effects of eGenes on IDPs and DBs
We identified 254 eGenes with putative causal effects on 112 IDPs across eight cell types (FDR < 0.05, corrected across all eGene–cell type–IDP tests), yielding 760 significant eGene–cell type–IDP combinations (Fig 1b and S4 Table). Among these, 184 and 112 eGenes were implicated in 90 brain regional volumes and 22 white matter microstructure phenotypes, respectively. To assess reproducibility, we replicated the MR analyses using cortical eQTLs from BrainMeta [36] and cell type–specific eQTLs from brainSCOPE [37]. Of the eGene–IDP pairs, 64.9% (430/663) and 49.2% (326/663) were replicated in BrainMeta at nominal and FDR < 0.05 levels, respectively (both P < 10-16, hypergeometric test; S5 Tableand S2 Fig). For eGene–cell type–IDP triples, 41.9% (315/751) and 33.7% (253/751) were replicated in brainSCOPE at nominal and FDR < 0.05 levels, respectively (both P < 10-16, hypergeometric test; S5 Table and S3 Fig).
These associations were strongly cell type–specific: 91.4% (605/662) of eGene–IDP pairs were significant in only one cell type, consistent with the notion that brain structure is shaped by distinct cellular programs [38–40]. This specificity persisted after accounting for eQTL availability—among genes with valid instrumental cis-eQTLs in two or more cell types, 80.6% (237/294) remained significant in exactly one cell type (S4 Fig). White matter microstructure phenotypes harbored more causal eGenes in oligodendrocytes and astrocytes than brain regional volumes did (Fig 1b), mirroring prior evidence that white matter heritability is enriched in glial cells [22]. The strongest enrichment occurred between white matter volume and oligodendrocytes (12 eGenes), eight of which (ANKRD44, ZCWPW1, SLC16A8, PILRB, RUNX2, LARP6, C7orf61 and SUPT3H) have been linked to neuroimaging phenotypes in the GeneCards database [41]. Collectively, these results nominate candidate cell type–specific regulators of brain structural phenotypes. Nevertheless, because the cis-eQTL data were derived from the prefrontal cortex, temporal cortex, and deep white matter—not all anatomical regions represented by the imaging outcomes—these findings should not be interpreted as pinpointing the precise brain region where each regulatory effect operates [42–44].
Turning to DBs, we identified 217 eGenes with putative causal effects on 26 DBs across eight cell types (FDR < 0.05, corrected across all eGene–cell type–DB tests), comprising 298 eGene–cell type–DB combinations (Fig 1c and S6 Table). Of these, 109, 61, and 85 eGenes were associated with 10 psychiatric disorders, 8 neurological disorders, and 8 behavioral-cognitive phenotypes, respectively. We validated these findings using three independent resources: BrainMeta cortical eQTLs [36], PsychENCODE differential expression data [29], and the GWAS Catalog [1]. Replication rates were 35.7% (87/244) and 23.0% (56/244) in BrainMeta at nominal and FDR < 0.05 levels, respectively; 29.5% (23/78) in PsychENCODE; and 23.8% (57/240) in the GWAS Catalog (all P < 10-4, hypergeometric test; S7 Table and S2 Fig). For eGene–cell type–DB triples, replication in brainSCOPE [37] was 41.9% (315/751) nominally and 33.7% (253/751) at FDR < 0.05 (both with P < 10-16, hypergeometric test; S7 Table and S3 Fig). Notably, 12 of the 14 AD-associated cell type–specific causal eGenes identified here were corroborated by Mathys et al. [45].
As with IDPs, the eGene–DB associations were highly cell type–specific: 82.1% (197/240) of pairs were restricted to a single cell type, and among genes with valid instruments in multiple cell types, 69.2% (92/133) were significant in exactly one (S4 Fig). These patterns support the view that most genes influencing brain disorders and behaviors exert their effects through discrete cell types [12,46]. Excitatory neurons harbored the largest number of causal eGenes across nearly all phenotype groups, notably in SCZ, bipolar disorder (BIP), and insomnia (INS). Inhibitory neurons contained fewer causal eGenes overall but remained prominently involved in SCZ. The SCZ–excitatory neuron pair yielded the greatest number of causal eGenes (20), four of which (ANKRD27, LIN28B, UROS and CCHCR1) have been linked to SCZ in the GeneCards database [41]. Moreover, five of the six eGenes predicted to influence AD in excitatory neurons—including MSH3 [47], ICA1L [48], RGS14 [49], C17orf97 [50] and ZSCAN31 [51]—have established roles in AD and other neurodegenerative disorders, suggesting their convergent contributions to neurodegenerative pathogenesis.
Pervasive phenotype pleiotropy of causal eGenes
To characterize the pleiotropic landscape of the identified causal eGenes, we systematically compared their sharing patterns across phenotype classes (IDPs versus DBs) and phenotype groups (brain regional volumes and white matter microstructure phenotypes within IDPs; behavioral-cognitive phenotypes, neurological disorders, and psychiatric disorders within DBs). Significant eGene overlap was observed between every pair of phenotype groups (all with P < 10-10, hypergeometric test; S5 Fig). Within the IDP class, the strongest overlap occurred between brain regional volumes and white matter microstructure phenotypes (42 shared eGenes, 37.5%), with the inferior fronto-occipital fasciculus (IFO) contributing the largest fraction of white matter phenotypes to this overlap (22.81%; Figs 2a and S6). Notably, this pleiotropy was not fully explained by genetic correlations among IDPs. Across 7,503 IDP pairs, the absolute genetic correlation was only weakly associated with eGene overlap (Spearman’s rank correlation coefficient r = 0.092, P = 1.95 × 10-15; S7b Fig), and the association remained weak for between-group pairs (Spearman’s rank correlation coefficient r = 0.073, P = 6.10 × 10-4). Indeed, even the most strongly correlated between-group pair (PLIC_FA and right lateral ventricle, rg = 0.49) shared no causal eGenes. Within the DB class, four eGenes exhibited pleiotropic effects across all three DB groups—DDHD2 and LIN28B in excitatory neurons, and MAP2K5 and XKR6 in inhibitory neurons (S8 Table)—all of which have been implicated in multiple psychiatric disorders [52–55].
(a) Upset plot of cell type–specific causal eGenes shared across phenotype classes and groups. (b) pLI scores of pairwise shared causal eGenes. (c) Top two enriched gene sets for pairwise shared eGenes. Within-group: shared by ≥2 phenotypes within a group. Within-class: shared across groups of the same class. Between-class: shared between the DB and IDP classes. BRV: Brain regional volume; WMMP: White matter microstructure phenotype; BCP: Behavioral-cognitive phenotype; NEUD: Neurological disorder; PSYD: Psychiatric disorder.
Between-class pleiotropy was similarly extensive. The brain regional volume group overlapped most strongly with the behavioral-cognitive phenotype group (21 shared eGenes, 24.7%), driven primarily by drinks per week (DPW) (10.45%; Figs 2a and S6); and with the psychiatric disorder group (26 shared eGenes, 23.85%), driven primarily by attention-deficit/hyperactivity disorder (ADHD) (15.38%; Figs 2a and S6). More broadly, 17 eGenes were shared between two IDP groups and one DB group, nine between one IDP group and two DB groups, four across four groups (XKR6 in inhibitory neurons, MAPT in astrocytes, ZSCAN31 in astrocytes, and MSH3 in excitatory neurons), and one eGene—XKR6 in inhibitory neurons—was shared across all five phenotype groups (Fig 2a and S8 Table). Collectively, these findings indicate that cell type–specific genetic susceptibilities are shared across diverse categories of brain-associated phenotypes.
We next examined the evolutionary constraint and functional profiles of shared causal eGenes using probability of loss-of-function intolerance (pLI) scores from ExAc [56] and gene-set enrichment analysis [57]. Among genes shared within individual phenotype groups, those in the behavioral-cognitive phenotype group exhibited the highest constraint (μ = 0.302, σ2 = 0.203), followed by the brain regional volume group (μ = 0.20, σ2 = 0.14; Fig 2b, left panel). Functionally, the behavioral-cognitive shared eGenes were enriched in pathways governing memory, cognition, and neurotransmission (Fig 2c and S9 Table), consistent with their established roles in learning and cognitive performance [58,59]. By contrast, brain regional volume–shared eGenes were enriched in protein-binding functions, notably Hsp90 and MHC class I protein binding (Fig 2c and S9 Table), which regulate synapse formation and [60,61].
For eGenes shared within the same phenotype class but across groups, the neurological disorder–psychiatric disorder overlap exhibited the highest pLI scores (μ = 0.409, σ2 = 0.235), whereas the brain regional volume–white matter microstructure overlap showed the lowest (μ = 0.153, σ2 = 0.125; Fig 2b, middle panel). Concordantly, the neuropsychiatric shared eGenes were enriched in cell maintenance and signaling pathways such as plasma membrane organization (Fig 2c and S9 Table), which have been linked to the etiology of neuropsychiatric disorders [62,63].
Finally, for between-class shared eGenes, those bridging brain regional volumes and DB groups (behavioral-cognitive, neurological, and psychiatric) exhibited higher pLI scores than those bridging white matter microstructure and DB groups (Fig 2b, right panel). This suggests that volumetric measures may capture more fundamental and evolutionarily conserved genetic determinants of brain disorders than microstructural measures—a pattern consistent with evidence that microstructural changes typically precede volumetric changes during disease progression [64,65], and that the genetic drivers of such early microstructural alterations are correspondingly less constrained. Notably, the brain regional volume–neurological disorder shared eGenes, which displayed the highest between-class pLI scores, were enriched in four pathways related to extracellular matrix dynamics (Fig 2c and S9 Table). Given the central role of the extracellular matrix in brain volume regulation, neural development, and disease pathophysiology [66–68], these results underscore the importance of extracellular matrix biology in the shared genetic architecture of brain structure and neurological disorders.
Shared cell type specificity of causal eGenes between different phenotype classes
Having established the pervasive sharing of causal eGenes across phenotype classes, we next examined the specific cell types that mediate these shared associations between IDPs and DBs. The cell type landscape differed markedly across DB–IDP pairs (Fig 3a). SCZ and drinks per week (DPW) each shared causal eGenes with brain regional volumes across a broad spectrum of cell types—including astrocytes, excitatory neurons, microglia, oligodendrocytes, and OPCs for SCZ, and astrocytes, excitatory neurons, microglia, inhibitory neurons, and OPCs for DPW—consistent with their polygenic architecture [69,70]. By contrast, neurodevelopmental disorders such as autism spectrum disorder (ASD), post-traumatic stress disorder (PTSD), and ADHD showed restricted sharing confined to inhibitory neurons (Fig 3a), supporting the hypothesis that disrupted inhibitory neurotransmission underlies both the structural and functional abnormalities seen in these conditions [71–73]. Notably, several risky behaviors—general risk tolerance (GRT), automobile speeding propensity (ASP), and DPW—also shared inhibitory neuron eGenes with both brain regional volumes and white matter microstructure phenotypes (Fig 3a). Furthermore, OPC-specific sharing with brain regional volumes was observed exclusively for SCZ and the behavioral-cognitive phenotypes DPW, ASP, and educational attainment (Fig 3a), in line with evidence that OPC gene expression shapes neural circuit development and cognitive outcomes [74,75].
(a) Brain cell types showing significant sharing of causal eGenes between IDP groups and individual DB phenotypes. Cell types were tested by hypergeometric test within each cell type and selected at FDR < 0.05. (b) Causal eGenes shared between brain regional volumes and psychiatric disorders.
These patterns point to a common cell type–specific genetic susceptibility linking brain structure to disorders and behaviors. Among psychiatric disorders, the MAPT gene in astrocytes emerged as a prominent shared signal, implicated in SCZ alongside 26 brain regional volumes and 15 white matter microstructure phenotypes (Figs 3b and S8). Increased astrocytic MAPT expression was associated with elevated SCZ risk, reduced mean fractional anisotropy in 14 white matter tracts, and decreased volume in 17 brain regions. Located at 17q21, MAPT is pivotal for human brain size and patterning, with established links to intracranial volume, white matter microstructure, and cortical architecture [24]. MAPT mutations drive brain atrophy [76], and MAPT dysregulation has been tied to SCZ [77], while astrocytic dysfunction itself is increasingly recognized as a causal contributor to SCZ pathophysiology [78]. The MAPT gene encodes tau, a protein essential for microtubule stabilization and axonal transport [79]. Given tau’s central role across both neuronal and glial compartments, and the established modulation of tau pathology in neurodegenerative disease [80], our findings raise the prospect of targeting astrocytic MAPT expression therapeutically to address SCZ-related structural brain abnormalities.
In the context of neurological disorders, three excitatory neuron eGenes—ZSCAN31 [51,81], SYT14 [82,83] and ICA1L [48,84]—were shared between AD and multiple IDPs (S8 and S9 Figs), converging with prior evidence that variants in these genes influence both brain imaging measurements and AD susceptibility [48,51,81–84].
Finally, among behavioral-cognitive phenotypes, XKR6 in inhibitory neurons was shared across ASP, GRT, 15 brain regional volumes (including right lateral orbitofrontal cortex and left isthmus cingulate), and 5 white matter microstructure phenotypes (including fornix–stria terminalis [FXST], IFO, and uncinate fasciculus [UNC]; S8 and S9 Figs). XKR6 belongs to the Kell blood group complex subunit–related family [85]. Large-scale GWAS of risky behavior have identified XKR6-related variants and implicated inhibitory neurotransmission in shaping individual variation in risk tolerance [86]. Separately, astrocytic SRR expression was causally linked to smoking cessation (SC), DPW, neuroticism (NEU), 5 brain regional volumes, and one white matter microstructure phenotype. Encoding serine racemase, SRR is expressed in excitatory glutamatergic neurons of the human forebrain [87] and plays a key role in glutamatergic synaptic signaling [88]. Its expression has been associated with substance use phenotypes, and SRR represents a promising pharmacological target for smoking cessation [88].
Cell type-informed clustering of phenotypes reveals biologically meaningful patterns
To identify phenotypes with shared cell type–specific regulatory architectures, we computed a phenotype × phenotype Jaccard similarity matrix based on shared causal eGenes and their corresponding cell types (S1 Text and S10 Fig). Oligodendrocytes, excitatory neurons, and astrocytes were the dominant contributors to the global similarity structure (S11 Fig), reflecting their greater overall abundance of phenotype-associated causal eGenes (Fig 1b-1c). Hierarchical clustering identified 12 robust phenotype clusters (S10 Fig and S10 Table), of which eight (C2–C9 and C12) contained both IDP and DB phenotypes (S12 Fig). Clusters C6 and C7 were the most heterogeneous, each spanning all five phenotype groups, with inhibitory neurons as the principal cell type driving shared molecular regulation across these diverse phenotypic categories. Cluster C4 was significantly enriched for white matter microstructure phenotypes (FDR = 0.0014, hypergeometric test) and nominally enriched for psychiatric disorders (P = 0.03), with microglia as the predominant cell type—implicating microglia-mediated mechanisms in the shared genetic basis of white matter integrity and psychiatric vulnerability.
We next asked whether these molecularly defined clusters mirror clinically observed comorbidity patterns. Focusing on disorder–disorder pairs that fell within the same cluster, we surveyed the literature for documented comorbidities and epidemiological associations (S11 Table). All nine psychiatric–psychiatric pairs identified showed established comorbidity support; for example, BIP clustered with both SCZ and anxiety disorder (ANX) [89,90]. Cross-category pairs also displayed compelling clinical parallels. Epilepsy (EPI) and PTSD co-occurred in cluster 6, consistent with documented clinical comorbidity [91]. IS and obsessive-compulsive disorder (OCD) shared cluster 7, supported by longitudinal evidence of elevated stroke risk in OCD patients [92]. PD and alcohol use disorder (AUD) clustered together in C12, aligning with epidemiological findings of increased AUD prevalence among PD patients [93]. Together, these observations indicate that phenotype clusters derived from shared cell type–specific causal eGenes capture not only molecular convergence but also clinically meaningful disease relationships.
Putative causality routes among cell type-specific causal eGenes and brain-associated complex phenotypes
Genetic risk for brain disorders may operate partly through intermediate imaging phenotypes—altering brain structure to influence disease risk, or conversely, manifesting as structural changes downstream of disease processes [20,94,95]. To dissect these causal chains at cellular resolution, we performed bidirectional Mendelian randomization between IDPs and DBs (Methods). This identified six causal effects from IDPs to DBs and three from DBs to IDPs. By overlaying these bidirectional phenotype relationships with the cell type–specific eGene–phenotype associations identified above, we reconstructed four eGene–IDP–DB routes and seven eGene–DB–IDP routes in which the eGene was causal for both the imaging phenotype and the disorder (Fig 4 and S12 Table).
(a) eGene–IDP–DB routes: the eGene is causal for both the IDP and the DB, and the IDP is causal for the DB. (b) eGene–DB–IDP routes: the eGene is causal for both the IDP and the DB, and the DB is causal for the IDP.
One such route links ICA1L expression in excitatory neurons to ACR microstructure and MS (Fig 4a and S12 Table). ICA1L encodes a BAR-domain protein family member implicated in excitatory synaptic signaling [96,97]. Its dysregulation has been tied to MS pathogenesis [98], and variants in this gene are associated with ACR microstructure [19], a tract critical for executive control [99]. Diffusion tensor imaging (DTI) abnormalities of the corona radiata are also documented in MS [100]. Our results position ICA1L in excitatory neurons as a upstream regulator: it negatively affects ACR microstructure (SMR: β = -0.16, FDR = 5.42 × 10-4; Wald ratio: β = -0.17, FDR = 1.16 × 10-36; GSMR: β = -0.15, FDR = 1.91 × 10-4), which in turn elevates MS risk (PMR-Egger: β = 0.11, FDR = 0.002; GSMR: β = 1.06, FDR = 2.16 × 10-15). ICA1L also exerts a direct negative causal effect on MS (SMR: β = -0.20, FDR = 0.011; Wald ratio: β = -0.19, FDR = 1.1 × 10-6; GSMR: β = -0.19, FDR = 9.63 × 10-3), suggesting both indirect and potentially pleiotropic influences on disease risk.
Conversely, an eGene–DB–IDP route connects ZSCAN31 expression in astrocytes to SCZ and posterior thalamic radiation (PTR) microstructure (Fig 4b and S12 Table). ZSCAN31 encodes a C2H2-type zinc finger protein [101] with established links to both SCZ risk and neuroimaging phenotypes [81]. Thalamic radiation abnormalities are well documented in SCZ [102], and thalamocortical dysconnectivity appears to worsen progressively during psychosis onset [103]. Our results indicate that astrocytic ZSCAN31 expression contributes to the effect of SCZ progression on PTR microstructure (PMR-Egger: β = 0.087, FDR = 1.54 × 10-5; GSMR: β = 0.033, FDR = 1.45 × 10-5). ZSCAN31 is itself positively associated with SCZ (Wald ratio: β = 0.17, FDR = 2.38 × 10-26; GSMR: β = 0.14, FDR = 1.53 × 10-5; SMR: β = 0.16, FDR = 6.89 × 10-5) and with PTR microstructure (Wald ratio: β = 0.08, FDR = 2.81 × 10-8; GSMR: β = 0.08, FDR = 2.57 × 10-3; SMR: β = 0.08, FDR = 0.03), positioning it as a shared astrocytic regulator of both disease risk and structural brain changes.
Together, these integrated causality routes nominate cell type–specific genes through which brain structure and disorder may be causally coupled, offering testable entry points for mechanistic and therapeutic follow-up.
To identify regulatory variants underlying the 11 causality routes (Fig 4), we further performed SNP-level fine-mapping, colocalization, and regulatory annotation (S2 Text). This yielded 56 SNP–cell type–eGene–phenotype chains, each linking a regulatory variant to its cell type–specific eGene and two associated phenotypes. For example, the chain rs3130455–CCHCR1 (excitatory neurons)–SCZ–PTR (effect direction: + , + , +) places this variant within an active promoter chromatin state (ChromHMM 1_TssA) across neural lineages, where it disrupts neurodevelopmental transcription factor motifs including Klf4 and Klf7. This supports a model in which increased rs3130455 dosage elevates CCHCR1 expression in excitatory neurons, raising SCZ risk and enhancing white matter connectivity in the PTR. Another chain, rs55938136–MAPT (astrocytes)–GCC–SCZ (effect direction: + , − , +), reveals perfect colocalization of the intronic SNP with both astrocytic MAPT expression and genu of corpus callosum (GCC) fractional anisotropy (PP.H4 = 1; PP.H4.snp = 1). This suggests that higher rs55938136 dosage increases astrocytic MAPT expression, diminishes GCC structural connectivity, and thereby elevates SCZ susceptibility [104,105]. All 56 chains are catalogued in S13 Table.
Spatiotemporal expression patterns of causal eGenes for IDPs and DBs
We next examined the expression patterns of causal eGenes in human brain single-cell data (Methods). For DBs, the causal eGenes were markedly enriched in glial cells relative to neurons (P < 2.2 × 10-16, Wilcoxon test), with the strongest enrichment in astrocytes (S13 Fig). Psychiatric and behavioral-cognitive causal eGenes showed higher neuronal expression during the prenatal period than postnatally, whereas glial cells displayed the inverse pattern (P < 0.001, Wilcoxon test; Figs 5a and S14a). Most neurological disorder eGenes were elevated in neural cells after birth; notably, amyotrophic lateral sclerosis (ALS) eGenes were exclusively neuronal and showed postnatal upregulation (S14b Fig), consistent with prior observations [106–108].
(a) Prenatal versus postnatal expression of psychiatric disorder causal eGenes in glial and neuronal cells. (b-c) Lifespan trajectories of psychiatric disorder causal eGenes in (b) glial and (c) neuronal cells; shaded regions, 95% confidence intervals. (d) Postnatal expression of white matter microstructure causal eGenes in glial and neuronal cells. Developmental stages: P4, 13 PCW ≤ Age < 16 PCW; P8, Birth ≤ Age < 6 Months; P12, 12 ≤ Age < 20 Years. ANX, anxiety disorder; ASD, autism spectrum disorder; BIP, bipolar disorder; MDD, major depressive disorder; SCZ, schizophrenia; TS, Tourette syndrome. Glia: microglia, astrocytes, oligodendrocytes, and OPCs. Neurons: inhibitory and excitatory neurons.
Across the lifespan, psychiatric disorder eGenes in glial cells declined through late mid-fetal development (19–24 PCW) before rising again in early childhood (1–6 years), while their neuronal expression decreased monotonically (Figs 5b-5c and S15). Behavioral-cognitive phenotypes followed a similar trajectory (S16 Fig), suggesting convergent developmental programs for these related phenotype categories.
For IDPs, causal eGenes for white matter microstructure phenotypes were strongly enriched in adult glial cells—particularly astrocytes and microglia (Fig 5d)—mirroring the predominance of glial cells in white matter tissue [109]. These spatiotemporal expression profiles provide an independent biological layer that complements the genetically inferred causal relationships. Whereas the MR framework captures cis-eQTL–driven regulatory architecture, the observed cell type–specific and developmental stage–specific transcriptional dynamics offer convergent evidence of biological plausibility, pointing to coordinated regulatory and transcriptional programs underlying brain-associated complex phenotypes.
Expression of causal eGenes indicates associations of brain-associated complex phenotypes
Brain function relies on the interplay between regional structure and white matter connectivity [110,111], yet how these relationships are manifest at the cellular level remains unclear. To address this, we examined the associations between brain regional volumes and white matter microstructure phenotypes using cell type–specific expression data from the STAB2 [112]. Regional volumes and white matter phenotypes formed largely distinct clusters (Fig 6a and Methods). Notably, causal eGene expression tracked cerebral hemisphere specificity: homologous regions from opposite hemispheres were assigned to different modules (Spearman’s rank correlation coefficient r > 0.7), indicating that genetically driven cell type–specific expression underlies hemispheric structural divergence.
(a) Correlation heatmap among imaging-derived phenotypes (IDPs), with phenotype group membership indicated by color bars (brain regional volumes; white matter microstructure phenotypes). (b) Correlation heatmap between white matter microstructure phenotypes and psychiatric disorders. Values are Spearman’s rank correlation coefficients of mean causal eGene expression across eight cell types (Methods). *, FDR < 0.05; **, FDR < 0.01; ***, FDR < 0.001.
A similar analysis of DBs revealed that causal eGene expression profiles carried functional associations with brain disorders and phenotypes (S17 Fig). For instance, the expression signatures of educational attainment and intelligence correlated strongly with those of AD (Spearman’s rank correlation coefficient r > 0.9), echoing epidemiological evidence that higher cognitive reserve lowers AD risk [113,114].
These expression patterns also corroborate the causal routes linking IDPs to DBs identified above (Fig 4). Fractional anisotropy in the PTR and the inferior fronto-occipital fasciculus (IFO) was significantly associated with SCZ and general risk tolerance (GRT), respectively (Figs 6b and S18), consistent with the bidirectional causal relationships inferred by MR. Beyond these known links, we detected significant expression associations between several regional volumes and psychiatric disorders (FDR < 0.05, Spearman’s rank correlation test). The strongest was between major depressive disorder (MDD) and the left rostral middle frontal cortex (FDR = 4.01 × 10-45, Spearman’s rank correlation test; S19 Fig), implicating this prefrontal region in the transcriptional architecture of depression.
Discussion
In this study, we employed a single-cell Mendelian randomization framework to dissect potential causal relationships between gene expression and brain-associated complex phenotypes, leveraging complementary analytical methods to enhance reliability. We identified 254 and 217 eGenes with putative causal effects on 112 IDPs and 26 DBs, respectively, across eight brain cell types. These causal eGenes exhibited strong cell type specificity: over 90% of eGene–IDP associations and approximately 80% of eGene–DB associations were restricted to a single cell type. At the same time, the causal eGenes showed widespread phenotype pleiotropy, exemplified by DDHD2 in excitatory neurons, XKR6 in inhibitory neurons, and MAPT and ZSCAN31 in astrocytes. Genes shared across different categories of brain-associated phenotypes displayed distinct patterns of evolutionary constraint and cell type enrichment, and were overrepresented in biological processes including memory and cognition, neurotransmitter regulation, and cellular maintenance and signaling. We further characterized putative causality routes linking cell type–specific causal eGenes to IDPs and DBs, and, by examining the spatiotemporal expression dynamics of these genes in independent single-cell data, uncovered coordinated transcriptional programs that underpin the relationships among brain-related complex phenotypes.
Regulation by cell type-specific causal eGenes
Previous studies have pointed to an important yet underrecognized role for excitatory neurons in AD etiology [115,116]. In our analysis, six eGenes—SYT14, MSH3, ICA1L, RGS14, C17orf97 and ZSCAN31—were causally associated with AD in excitatory neurons (S6 Table). Among these, five (MSH3, ICA1L, RGS14, C17orf97 and ZSCAN31) showed a consistent positive causal effect. Several of these genes have established links to excitatory neuron or synaptic function: SYT14 mediates calcium-dependent neurotransmitter release and synaptic vesicle trafficking [117,118]; ICA1L and RGS14 modulate synaptic signaling pathways in excitatory neurons [97,119]; and ZSCAN31, a zinc finger protein, regulates neuronal excitability and intrinsic circuitry within this subtype [120,121]. This convergence suggests that these genes may influence AD progression through shared mechanisms affecting neuronal function. Moreover, we observed significant overlap between AD and brain structural phenotypes with respect to cell type–specific causal eGenes in excitatory neurons (Fig 3), including ZSCAN31, SYT14, and ICA1L. These findings collectively strengthen the mechanistic links among genes, excitatory neurons, AD, and brain structure.
ZSCAN31 is among the most pervasively pleiotropic genes in our catalog. In astrocytes, it showed a positive causal effect on all 11 associated phenotypes spanning four groups: MDD and SCZ (psychiatric disorders), GRT (behavioral-cognitive phenotypes), right supramarginal volume (brain regional volumes), and seven white matter microstructure phenotypes—mean fractional anisotropy of the GCC (genu of corpus callosum), PTR (posterior thalamic radiation), SLF (superior longitudinal fasciculus), SCC (splenium of corpus callosum), BCC (body of corpus callosum), ACR (anterior corona radiata), and the average across all tracts (Fig 3b and S4 Table). By contrast, ZSCAN31 in excitatory neurons exerted a negative causal effect on right hippocampal and total white matter volume. This divergence in effect direction across cell types indicates that ZSCAN31 is subject to cell type–specific regulatory control over brain-associated phenotypes, extending prior findings [101]. We accordingly inferred three putative routes linking ZSCAN31 in astrocytes to IDPs and DBs (Fig 4 and S12 Table). For example, the route ZSCAN31 in astrocytes → SCZ → PTR suggests that astrocytic ZSCAN31 expression may mediate the influence of SCZ disease progression on PTR microstructural change. Consistent with this, external single-cell data indicated that causal eGenes in multiple cell types participate in the association between SCZ and PTR (Fig 6b), illuminating how cell type–specific eGene regulation shapes brain-associated phenotype relationships.
Shared genetic signals between risky behaviors and IDPs
Our analysis supports genetically driven links between risky behaviors and multiple IDPs at the cellular level. We observed significant sharing of causal eGenes within inhibitory neurons between risky behaviors (GRT and ASP) and both brain regional volumes and white matter microstructure phenotypes (Fig 3a). Among these shared signals, XKR6 in inhibitory neurons was implicated in both risky behaviors (GRT and ASP) and several IDPs, including cortical volumetric phenotypes and white matter phenotypes such as the FXST, IFO and UNC. Using external expression data across cell types, we also detected significant expression-profile correlations between GRT and IFO, and between GRT and FXST (S18 Fig), pointing to a mediating role for cell type–specific causal eGenes in these associations. These results suggest a shared cell type–specific genetic architecture linking risky behaviors to selected IDPs. They should not, however, be taken as evidence for the precise brain regions in which these effects operate; that anatomical context awaits further validation.
Interpretation of cell type-specific associations
The high cell type specificity observed here (91.4% for eGene–IDP pairs and 82.1% for eGene–DB pairs) should be interpreted cautiously. These rates reflect the specificity of significant eGene–phenotype associations after TWAS and MR filtering, not the inherent specificity of eQTL effects. This definitional distinction may partly explain why our estimates exceed those reported in studies focused on cell type–specific eGenes or eQTLs [13,122]. In addition, methodological factors—such as significance filtering (TWAS + MR) and uneven cis-eQTL availability across cell types (e.g., the comparatively small number in endothelial cells and pericytes)—may inflate the apparent specificity.
It is also important to note that the cis-eQTL data used here were derived from a limited set of brain regions (prefrontal cortex, temporal cortex, and deep white matter) rather than the full complement of anatomical regions represented by the imaging phenotypes. Given the context dependence of molecular eQTL effects and known transcriptomic divergence across brain regions—especially between cortical and subcortical structures [44]—associations involving unmatched regions should be treated with caution. More broadly, MR captures the effect of a genetically proxied expression perturbation and does not identify the exact biological context or site of action [42]. Our results are therefore best regarded as prioritizing putative cell type–specific genes for brain phenotypes, rather than pinpointing the precise brain regions in which each regulatory effect occurs.
Limitations
Several avenues remain open for future work. First, the comparatively small number of cis-eQTLs in endothelial cells and pericytes (S3 Table) [12], likely constrained the number of causal genes identified in these cell types (Fig 1b). Larger single-cell eQTL datasets with greater sample sizes and finer cell type resolution should improve power to detect eGene–phenotype associations in these populations. Second, when performing MR between IDPs and DBs, half of the DB GWASs were excluded because they included participants from UK Biobank [123], limiting the completeness of the inferred causality routes among eGenes, IDPs, and DBs (Fig 4). In addition, shared causal eGenes identified across IDPs and/or DBs may be influenced by unavoidable sample overlap between some GWAS datasets. Future studies should assemble more fully independent GWAS resources for brain-associated phenotypes to strengthen the robustness and coverage of these causal inferences. Third, although our findings highlight putative causal relationships, they do not establish definitive causality and require confirmation by randomized controlled trials, the gold standard for causal inference. Considerable work remains ahead in treatment development, and our results should be interpreted accordingly.
Outlook
In summary, our study provides a systematic investigation of cell type–specific causal genes for brain-associated complex phenotypes, together with evidence that brain structure and disorders/behaviors share genetic foundations operating at the cellular level. We hope that the connections identified among genes, cell types, and complex phenotypes will serve as starting points for developing therapeutic strategies targeting brain structural abnormalities, disorders, and behaviors.
Methods
Cell type-specific cis-eQTL summary statistics
We obtained published cell type–specific cis-eQTL summary statistics comprising effect sizes for 67,815,924 SNP–gene pairs across 4,690,822 SNPs and 17,881 gene transcripts [12]. The data cover eight major brain cell types: astrocytes, excitatory neurons, inhibitory neurons, oligodendrocytes, OPCs, microglia, endothelial cells, and pericytes (Fig 1a and S3 Table). The cis-eQTLs were identified by integrating high-quality single-nuclei RNA sequencing data from the prefrontal cortex, temporal cortex, and white matter with corresponding genotype data from 192 individuals. All individuals fell within three standard deviations of the mean for the first and second principal components of European-ancestry populations from the 1000 Genomes Project [12].
GWAS summary statistics for brain disorders and behaviors
We obtained publicly available GWAS summary statistics for 26 brain DBs, including eight behavioral-cognitive phenotypes, eight neurological disorders, and 10 psychiatric disorders (S1 Table). Most were published recently and derived from meta-analyses with large sample sizes (mean ~280,000 individuals). All GWAS summary statistics were of European ancestry, and there was no participant overlap with the cell type–specific cis-eQTL dataset.
The sample size for each GWAS summary statistics dataset is listed below.
- Behavioral-cognitive phenotypes
- (1) Automobile speeding propensity (ASP): 404,291 individuals (heritability: 7.9%) [86]
- (2) Childhood intelligence (CI): 12,441 individuals (heritability: 22%-46%) [124]
- (3) Drinks per week (DPW): 941,280 individuals (heritability: 4%) [125]
- (4) Educational attainment (EA): 766,345 individuals (heritability: 12.2%) [126]
- (5) General risk tolerance (GRT): 466,571 individuals (heritability: 4.6%) [86]
- (6) Intelligence (INT): 269,867 individuals (heritability: 19%) [127]
- (7) Neuroticism (NEU): 390,278 individuals (heritability: 10%) [128]
- (8) Smoking cessation (SC): 547,219 individuals (heritability: 5%) [125]
- Neurological disorders
- (1) Alzheimer’s disease (AD): 71,880 cases and 383,378 controls (heritability: 5.5%) [129]
- (2) Amyotrophic lateral sclerosis (ALS): 20,806 cases and 59,804 controls (heritability: not reported in the original GWAS publication) [130]
- (3) Epilepsy (EPI): 15,212 cases and 29,677 controls (heritability: 9.7%) [131]
- (4) Intracerebral hemorrhage (ICH): 1,545 cases and 1,481 controls (heritability: not reported in the original GWAS publication) [132]
- (5) Insomnia (INS): 109,548 cases and 277,440 controls (heritability: 7.2%) [133]
- (6) Ischemic stroke (IS): 67,162 cases and 454,450 controls (heritability: not reported in the original GWAS publication) [134]
- (7) Multiple sclerosis (MS): 47,429 cases and 68,374 controls (heritability: 19.2%) [135]
- (8) Parkinson’s disease (PD): 33,674 cases and 449,056 controls (heritability: 22%-27%) [136]
- Psychiatric disorders
- (1) Attention-deficit/hyperactivity disorder (ADHD): 20,183 cases and 35,191 controls (heritability: 21.6%) [137]
- (2) Anxiety disorder (ANX): 25,453 cases and 58,113 controls (heritability: 26%) [138]
- (3) Autism spectrum disorder (ASD): 18,381 cases and 27,969 controls (heritability: 11.8%) [139]
- (4) Alcohol use disorder (AUD): 11,569 cases and 34,999 controls (heritability: 9.0%) [140]
- (5) Bipolar disorder (BIP): 41,917 cases and 371,549 controls (heritability: 18.6%) [141]
- (6) Major depressive disorder (MDD): 135,458 cases and 344,901 controls (heritability: 8.7%) [142]
- (7) Obsessive-compulsive disorder (OCD): 2,688 cases and 7,037 controls (heritability: 28%) [143]
- (8) Post-traumatic stress disorder (PTSD): 9,831 cases and 19,225 controls (heritability: 28%) [144]
- (9) Schizophrenia (SCZ): 76,755 cases and 243,649 controls (heritability: 24%) [145]
- (10) Tourette syndrome (TS): 4,819 cases and 9,488 controls (heritability: 21%) [146]
GWAS summary statistics for imaging-derived phenotypes
We obtained publicly available GWAS summary statistics for 123 brain IDPs, including 101 brain regional (and total) volume phenotypes and 22 white matter microstructure phenotypes (S2 Table). The brain volume GWAS summary statistics were processed by Zhao et al. [147] from 19,629 UK Biobank participants of European ancestry (reported mean heritability ~40%). Volumes included total brain volume, gray matter, white matter, and cerebrospinal fluid, labeled with the Mindboggle-101 atlas [148]. The white matter tract GWAS summary statistics were processed by Zhao et al. [22] from 33,292 UK Biobank participants of European ancestry (reported mean heritability ~46.3%). Tracts were segmented with the JHU ICBM-DTI-81 white-matter atlas [149–151], and microstructure was quantified by mean fractional anisotropy (FA).
Transcriptome-wide association study
To investigate potential links between gene expression and brain-associated complex phenotypes, we conducted a TWAS for the GWAS summary statistics of 123 IDPs and 26 DBs [31]. Prediction weights and covariance were obtained from two resources: (1) PredictDB Data Repository (https://predictdb.org/), where prediction models were built from GTEx V8 expression data [152] across 13 brain regions (amygdala, anterior cingulate cortex [BA24], caudate, cerebellar hemisphere, cerebellum, cortex, frontal cortex [BA9], hippocampus, hypothalamus, nucleus accumbens, putamen, spinal cord [cervical C-1] and substantia nigra); (2) CMC-derived DLPFC prediction models (https://github.com/laurahuckins/CMC_DLPFC_prediXcan), built from CommonMind Consortium expression data [153] from dorsolateral prefrontal cortex (DLPFC) tissue. TWAS was implemented with MetaXcan [31]. eGene–phenotype reaching nominal significance (P < 0.05) in at least one brain tissue were retained for subsequent MR analysis.
Primary Mendelian randomization analysis
To investigate the causal effects of cell type–specific gene expression on brain-associated complex phenotypes, we performed two-sample MR using four complementary methods: two single-SNP methods (SMR [6] and Wald ratio [32]) and two multi-SNP methods (PMR-Egger [8] and GSMR [33]). Details on each method are provided below (S1 Fig).
- (1) The SMR method tests whether the expression level of a gene is associated with a phenotype under the assumption of either causality or pleiotropy [6]. It is designed specifically for gene expression as the exposure and requires only one SNP as the instrument. The top cis-eQTL associated with each gene expression in a given cell type at genome-wide significance (P < 5 × 10-8) was selected as the instrument. SMR incorporates the HEIDI test to detect linkage [6]; results with HEIDI P < 0.05 were discarded. SMR and the HEIDI test were implemented with SMR software (version 1.3.1, https://yanglab.westlake.edu.cn/software/smr).
- (2) The Wald ratio method computes the change in disease risk per standard deviation change in gene expression, using the instrumental cis-eQTL for the target gene. It is applicable to any exposure and requires only one SNP as the instrument under the standard MR assumptions (relevance, independence, and exclusion restriction). For instrument selection, LD clumping (r2 < 0.1, window = 250 kb, 1000 Genomes EUR reference panel) was first performed for cis-eQTLs associated with each gene expression at genome-wide significance (P < 5 × 10-8) using PLINK v1.90 [154]. The independent cis-eQTL(s) for each gene in the given cell type were then selected as instrument(s). After instrument selection, 86.6% of cell type–specific genes had one valid instrumental cis-eQTL, so the Wald ratio could be applied in most analyses. For the remaining 13.4% of genes with more than one valid instrument, the inverse-variance weighted (IVW, fixed-effects) method was used as a supplementary approach [155], which is commonly paired with the Wald ratio in the literature [20,156]. Both methods were implemented with the R package TwoSampleMR (version 0.5.10) [157]. The harmonise_data() function in TwoSampleMR was used to harmonize effect alleles and SNP effects between exposure and outcome. Heterogeneity (function mr_heterogeneity, P < 0.05) and pleiotropy (function mr_pleiotropy_test, P < 0.05) tests were conducted for IVW results.
- (3) The PMR-Egger method estimates the causal effect of gene expression on a phenotype in the presence of horizontal pleiotropy within an MR likelihood framework [8]. It is designed specifically for gene expression as the exposure, can handle multiple correlated variants as instruments, and accounts for linkage disequilibrium (LD) among them. The cis-eQTLs (clumping r2 < 0.9, window = 250 kb, 1000 Genomes EUR reference panel) associated with each gene expression in a given cell type at genome-wide significance (P < 5 × 10-8) were selected as instruments. PMR-Egger was implemented with the R package PMR (version 1.0). The LD correlation matrix was derived from the 1000 Genomes EUR reference panel, and the maximum number of iterations was set to 5000.
- (4) The GSMR method extends SMR to estimate the causal effect of an exposure on an outcome, leveraging multiple variants while accounting for LD among them [33]. It is applicable to any exposure under the standard MR assumptions. The cis-eQTLs (clumping r2 < 0.9, 1000 Genomes EUR reference panel) associated with each gene expression in a given cell type at genome-wide significance (P < 5 × 10-8) were selected as instruments. GSMR incorporates the HEIDI-outlier test to identify and exclude instruments with significant pleiotropic effects on the outcome (P < 0.05). Both GSMR and the HEIDI-outlier test were implemented with the R package gsmr (version 1.1.0).
A gene was deemed putatively causal for a phenotype within a given cell type (cell type–specific causal eGene) if it met the following criteria: (1) significant in at least one single-SNP method after FDR correction (FDR < 0.05) [158]; (2) significant in at least one multi-SNP method after FDR correction (FDR < 0.05); (3) consistent causal effect direction across all four methods, reducing the likelihood of false positives.
We also calculated the F-statistic for each instrument using the Cragg-Donald statistic to assess instrument strength [159] by , where
represents the proportion of exposure variance explained by the instrument, N is the sample size from the cis-eQTL summary statistics, β is the effect size, se is the standard error, and k is the number of instruments used in the MR estimate (k = 1 for a single instrument). Genes for which the averaged F-statistic across all instruments exceeded 10 were retained. A brief introduction to MR for non-expert readers is provided in S3 Text.
Significance of cell type-specific causal eGenes sharing
We applied a hypergeometric test to assess whether cell type–specific causal eGenes were significantly shared between any two brain-associated complex phenotypes. The background gene set was derived by multiplying the total number of genes (only those with the same biotypes as the causal eGenes) by the total number of cell types. When estimating overlap within a particular cell type, the background comprised the total number of genes with the same biotypes as the causal eGenes. An FDR threshold of 0.05 was used to select significant cell types. Gene annotation was obtained from Ensembl (GRCh37 release 87) [160].
Gene set enrichment analysis
We utilized all gene sets from the Biological Process (BP), Molecular Function (MF), and Cellular Component (CC) sub-ontologies in the Gene Ontology (GO) database for gene set enrichment analysis [161,162]. GO enrichment was implemented with the enrichGO() function in the R package clusterProfiler (version 4.10.0) [57].
Replication of predicted associations
We performed replication analysis on the predicted causal associations between eGenes and brain-associated complex phenotypes identified in our MR analysis. To validate eGene–phenotype pairs, three external datasets were used. (1) The cis-eQTL summary statistics from the BrainMeta portal (https://yanglab.westlake.edu.cn/data/brainmeta/cis_eqtl/), derived from RNA sequencing of 2,865 brain cortex samples from 2,443 unrelated European-ancestry individuals [36]. The same MR analysis procedure was applied to infer causal relationships between cortical gene expression and 149 brain-associated complex phenotypes. An eGene–phenotype pair was considered replicated if it achieved nominal significance in at least one MR method and showed consistent effect directions across all MR methods and with the discovery analysis. (2) The differentially expressed genes and transcripts (at 5% FDR) in ASD, SCZ and BIP from PsychENCODE [29], which used in vivo gene expression profiles to identify transcriptome-wide isoform alterations across the three major psychiatric disorders. (3) Published eGene–phenotype associations from the NHGRI-EBI GWAS Catalog (version 2023-06-09, www.ebi.ac.uk/gwas/) [1].
To validate eGene–cell type–phenotype triples, two external datasets were used: (1) cell type–specific eQTL summary statistics from the brainSCOPE resource (https://brainscope.gersteinlab.org/integrative_files.html), generated from single-cell resolution data of 388 adult DLPFC samples [37]. Six major cell types were considered: astrocytes, excitatory neurons, inhibitory neurons, oligodendrocytes, OPCs, and microglia. The same MR analysis procedure was applied to infer causal relationships between cell type–specific gene expression and 149 brain-associated complex phenotypes. An eGene–cell type–phenotype triple was considered replicated if it achieved nominal significance in at least one MR method and showed consistent effect directions across all MR methods and with the discovery analysis. (2) Differentially expressed genes associated with AD pathology in each cell type from Mathys et al. [45]. For each external dataset, the replication rate was calculated as the number of replicated associations divided by the total number tested, and significance was assessed by hypergeometric test. Replication was evaluated only for phenotypes present in both the discovery and replication studies.
Mendelian randomization between brain IDPs and DBs
To characterize putative causality routes among cell type–specific genes, IDPs, and DBs, we performed bidirectional MR between DBs and IDPs. Because the IDP GWAS was conducted in UK Biobank participants, we selected DB GWAS datasets that did not include UK Biobank samples to minimize sample overlap. Thirteen DB GWASs were retained: CI, EPI, ICH, IS, ALS, MS, OCD, TS, ADHD, AUD, ASD, PTSD, and SCZ. Unlike MR with gene expression as the exposure—where a single independent genome-wide significant cis-eQTL typically serves as the instrument—analyses with DBs or IDPs as exposures often involve multiple independent instruments. We therefore primarily employed multi-SNP MR methods: PMR-Egger, GSMR, and methods from the R package TwoSampleMR (IVW [155], MR-Egger [163], Weighted median [164], Simple mode [165], Weighted mode [166]). or instrument selection, variants were selected at genome-wide significance (P < 5 × 10-8) with clumping r2 < 0.6 for PMR-Egger and r2 < 0.1 for GSMR and TwoSampleMR methods. Heterogeneity (function mr_heterogeneity, P < 0.05) and pleiotropy (function mr_pleiotropy_test, P < 0.05) tests were conducted for TwoSampleMR results. A phenotype was deemed putatively causal for another if it met the following criteria: (1) significant in at least two MR methods after FDR correction (FDR < 0.05); (2) consistent causal effect direction across all methods.
Expression analysis of causal eGenes using single-cell data
External normalized single-cell gene expression data were obtained from STAB2 [112]. We extracted cell type–specific expression data of the causal eGenes for the eight cell types used in this study: microglia, astrocytes, oligodendrocytes, OPCs, inhibitory neurons, excitatory neurons, endothelial cells, and pericytes. To assess associations between and within IDPs and DBs at the cell type level, we first compiled gene expression data of the causal eGenes for each IDP or DB in each cell type. For each IDP and DB, we then created an eight-dimensional expression vector, in which each value represented the mean expression of the causal eGenes in a given cell type. Spearman’s rank correlation coefficient was used to evaluate associations between IDPs and DBs based on the expression profiles of the corresponding causal eGenes across cell types. Additionally, hierarchical clustering (complete linkage) was employed to investigate associations of IDPs and DBs based on gene expression correlation coefficients.
Supporting information
S1 Text. Phenotype similarity matrix construction and clustering analysis.
https://doi.org/10.1371/journal.pcbi.1014422.s001
(DOCX)
S2 Text. SNP-level colocalization and regulatory annotation.
https://doi.org/10.1371/journal.pcbi.1014422.s002
(DOCX)
S3 Text. Overview of Mendelian randomization and causal inference framework in this study.
https://doi.org/10.1371/journal.pcbi.1014422.s003
(DOCX)
S1 Fig. Study workflow for the primary Mendelian randomization analysis.
IV: instrumental variables.
https://doi.org/10.1371/journal.pcbi.1014422.s004
(TIF)
S2 Fig. Replication results for predicted eGene–IDP and eGene–DB associations.
Reproducibility of eGene–IDP pairs was assessed using cortical cis-eQTL summary statistics from BrainMeta [36]. Reproducibility of eGene–DB pairs was assessed using cortical cis-eQTLs from BrainMeta [36], differentially expressed genes from PsychENCODE [29], and eGene–phenotype associations from the GWAS Catalog [1]. Significance of overlap was evaluated by hypergeometric test.
https://doi.org/10.1371/journal.pcbi.1014422.s005
(TIF)
S3 Fig. Replication results for predicted eGene–cell type–IDP and eGene–cell type–DB associations.
Reproducibility of eGene–cell type–IDP triples was assessed using cell type–specific eQTL summary statistics from brainSCOPE [37]. Reproducibility of eGene–cell type–DB triples was assessed using cell type–specific eQTLs from brainSCOPE [37] and differentially expressed genes from Mathys et al. [45] (only AD-associated causal eGenes were tested). Significance of overlap was evaluated by hypergeometric test.
https://doi.org/10.1371/journal.pcbi.1014422.s006
(TIF)
S4 Fig. Distribution of cell type specificity for predicted eGene–phenotype associations.
Bar plots show the number of eGene–phenotype associations predicted in exactly k cell types (x-axis), restricted to eGenes with valid instrumental cis-eQTLs in two or more cell types. The y-axis indicates the number of predicted eGene–phenotype associations detected in exactly k cell types.
https://doi.org/10.1371/journal.pcbi.1014422.s007
(TIF)
S5 Fig. Overlap of causal eGenes between each pair of phenotype groups.
Overlaps were assessed by hypergeometric test. BRV: brain regional volume; WMMP: white matter microstructure phenotype; BCP: behavioral–cognitive phenotype; NEUD: neurological disorder; PSYD: psychiatric disorder.
https://doi.org/10.1371/journal.pcbi.1014422.s008
(TIF)
S6 Fig. Percentage and count of shared cell type–specific causal eGenes between different phenotype groups.
(a) Percentage and count of shared cell type–specific causal eGenes between the brain regional volume group and white matter microstructure phenotypes. (b) Percentage and count of shared cell type–specific causal eGenes between the brain regional volume group and DBs.
https://doi.org/10.1371/journal.pcbi.1014422.s009
(TIF)
S7 Fig. Genetic correlation and causal eGene overlap among IDPs.
(a) Pairwise genetic correlations (rg) were estimated for all 123 IDPs using LD Score Regression based on GWAS summary statistics [167]. Correlations range from −0.83 to 1. The strongest correlation between IDPs from different groups was observed between the fractional anisotropy of the posterior limb of the internal capsule (PLIC_FA) and the volume of the right lateral ventricle (rg = 0.49). (b) Relationship between pairwise genetic correlation and overlap of cell type–specific causal eGenes among IDPs. The x-axis shows absolute pairwise genetic correlation (|rg|), and the y-axis shows the number of overlapping cell type–specific causal eGenes identified by MR analysis. Each point represents one IDP pair, colored by pair type: between-group (one brain regional volume and one white matter microstructure phenotype), within-BRV (both brain regional volumes), or within-WMMP (both white matter microstructure phenotypes). The black curve shows a LOESS-smoothed trend, with the shaded area indicating the 95% confidence interval.
https://doi.org/10.1371/journal.pcbi.1014422.s010
(TIF)
S8 Fig. Cell type–specific causal eGenes shared between white matter microstructure phenotypes and behavioral–cognitive phenotypes (a), neurological disorders (b), and psychiatric disorders (c).
https://doi.org/10.1371/journal.pcbi.1014422.s011
(TIF)
S9 Fig. Cell type–specific causal eGenes shared between brain regional volumes and behavioral–cognitive phenotypes (a), as well as neurological disorders (b).
https://doi.org/10.1371/journal.pcbi.1014422.s012
(TIF)
S10 Fig. Hierarchical clustering of the phenotype similarity evaluated based on the shared cell type–specific causal eGenes.
For each cell type, we constructed a phenotype × phenotype Jaccard similarity matrix based on shared causal eGenes. These were averaged to generate a phenotype similarity matrix, followed by hierarchical clustering. The resulting 12 phenotype clusters (C1–C12) are annotated on the heatmap.
https://doi.org/10.1371/journal.pcbi.1014422.s013
(TIF)
S11 Fig. Cell type contributions to the phenotype similarity.
Bar plot showing the Pearson correlation between each cell type–specific phenotype similarity matrix and the phenotype similarity matrix. Higher correlations indicate greater contributions of the corresponding cell type to the global phenotype similarity pattern.
https://doi.org/10.1371/journal.pcbi.1014422.s014
(TIF)
S12 Fig. Phenotype group enrichment and dominant cell type per cluster.
Bar plot showing the enrichment ratio of each phenotype group within the 12 identified phenotype clusters. Significance level is indicated (* FDR < 0.05, # P < 0.05; hypergeometric test). The dominant cell type contributing to each cluster’s similarity structure (defined as having the highest correlation with the phenotype similarity matrix within that cluster) is shown on the left.
https://doi.org/10.1371/journal.pcbi.1014422.s015
(TIF)
S13 Fig. Expression levels of the causal eGenes of 26 DBs from glial and neural cells.
(a) Expression differences in glial and neural cell types for various DBs, with significance indicated as ***, P < 0.001. (b) Detailed expression profiles of causal eGenes for 26 DBs in glial and neural cell types. Glia: glial cells, including astrocytes, microglia, oligodendrocytes, and OPCs. Neuro: neuronal cells, including inhibitory and excitatory neurons.
https://doi.org/10.1371/journal.pcbi.1014422.s016
(TIF)
S14 Fig. Spatiotemporal expression levels of the causal eGenes of behavioral–cognitive phenotypes (a) and neurological disorders (b).
Glia: glial cells, including astrocytes, microglia, oligodendrocytes, and OPCs. Neuro: neuronal cells, including inhibitory and excitatory neurons. * P < 0.05; ** P < 0.01; **** P < 0.001.
https://doi.org/10.1371/journal.pcbi.1014422.s017
(TIF)
S15 Fig. Expression dynamics of the causal eGenes of psychiatric disorders in astrocytes (a), oligodendrocytes (b), microglia (c), OPCs (d), inhibitory neurons (e) and excitatory neurons (f).
Locally estimated scatterplot smoothing (LOESS) curves are shown with 95% confidence intervals. P1, 4 ≤ age < 8 PCW; P2, 8 ≤ age < 10 PCW; P3, 10 ≤ age < 13 PCW; P4, 13 ≤ age < 16 PCW; P5, 16 ≤ age < 19 PCW; P6, 19 ≤ age < 24 PCW; P7, 24 ≤ age < 38 PCW; P8, 0 ≤ age < 6 months; P9, 6 ≤ age < 12 months; P10, 1 ≤ age < 6 years; P11, 6 ≤ age < 12 years; P12, 12 ≤ age < 20 years; P13, 20 ≤ age < 40 years; P14, 40 ≤ age < 60 years; P15, > 60 years.
https://doi.org/10.1371/journal.pcbi.1014422.s018
(TIF)
S16 Fig. Expression dynamics of the causal eGenes of behavioral–cognitive phenotypes in astrocytes (a), oligodendrocytes (b), microglia (c), OPCs (d), inhibitory neurons (e) and excitatory neurons (f).
For the locally estimated scatterplot smoothing (LOESS) plots, smooth curves are shown with 95% confidence intervals. P1, 4 ≤ Age < 8 PCW; P2, 8 ≤ Age < 10 PCW; P3, 10 ≤ Age < 13 PCW; P4, 13 ≤ Age < 16 PCW; P5, 16 ≤ Age < 19 PCW; P6, 19 ≤ Age < 24 PCW; P7, 24 ≤ Age < 38 PCW; P8, 0 ≤ Age < 6 Months; P9, 6 ≤ Age < 12 Months; P10, 1 ≤ Age < 6 Years; P11, 6 ≤ Age < 12 Years; P12, 12 ≤ Age < 20 Years; P13, 20 ≤ Age < 40 Years; P14, 40 ≤ Age < 60 Years; P15, > 60 Years.
https://doi.org/10.1371/journal.pcbi.1014422.s019
(TIF)
S17 Fig. Heatmap showing the expression associations of the eGenes for different groups of DBs.
Associations were evaluated based on expression correlation (Spearman’s rank correlation coefficient) of causal eGenes across eight cell types (Methods), and clusters were derived using hierarchical clustering (complete linkage).
https://doi.org/10.1371/journal.pcbi.1014422.s020
(TIF)
S18 Fig. Heatmap showing the expression associations of the eGenes for DBs and white matter microstructure phenotypes.
Associations were evaluated based on expression correlation (Spearman’s rank correlation coefficient) of causal eGenes across eight cell types (Methods). * FDR < 0.05; ** FDR < 0.01; *** FDR < 0.001.
https://doi.org/10.1371/journal.pcbi.1014422.s021
(TIF)
S19 Fig. Heatmap showing the expression associations of the eGenes for DBs and brain regional volumes.
Associations were evaluated based on expression correlation (Spearman’s rank correlation coefficient) of causal eGenes across eight cell types (Methods). * FDR < 0.05; ** FDR < 0.01; *** FDR < 0.001.
https://doi.org/10.1371/journal.pcbi.1014422.s022
(TIF)
S1 Table. GWAS summary statistics for disorders and behaviors (DBs) used in this study.
https://doi.org/10.1371/journal.pcbi.1014422.s023
(XLSX)
S2 Table. GWAS summary statistics for imaging-derived phenotypes (IDPs) used in this study.
https://doi.org/10.1371/journal.pcbi.1014422.s024
(XLSX)
S3 Table. Summary of the cell type–specific cis-eQTL dataset used in this study.
https://doi.org/10.1371/journal.pcbi.1014422.s025
(XLSX)
S4 Table. Summary of significant MR results for cell type–specific causal eGenes on imaging-derived phenotypes (IDPs).
https://doi.org/10.1371/journal.pcbi.1014422.s026
(XLSX)
S5 Table. Replication results for predicted eGene–IDP and eGene–cell type–IDP associations.
https://doi.org/10.1371/journal.pcbi.1014422.s027
(XLSX)
S6 Table. Summary of significant MR results for cell type–specific causal eGenes on disorders and behaviors (DBs).
https://doi.org/10.1371/journal.pcbi.1014422.s028
(XLSX)
S7 Table. Replication results for predicted eGene–DB and eGene–cell type–DB associations.
https://doi.org/10.1371/journal.pcbi.1014422.s029
(XLSX)
S8 Table. Cell type–specific causal eGenes shared across different phenotype groups.
https://doi.org/10.1371/journal.pcbi.1014422.s030
(XLSX)
S9 Table. Enriched gene sets of genes shared across phenotype groups and classes (pairwise sharing).
https://doi.org/10.1371/journal.pcbi.1014422.s031
(XLSX)
S10 Table. Phenotype composition of each of the 12 clusters derived from hierarchical clustering of phenotype similarity.
https://doi.org/10.1371/journal.pcbi.1014422.s032
(XLSX)
S11 Table. Literature support for disorder–disorder pairs co-clustered based on shared cell type–specific causal eGenes.
https://doi.org/10.1371/journal.pcbi.1014422.s033
(XLSX)
S12 Table. Putative routes among cell type–specific causal eGenes, IDPs, and DBs.
https://doi.org/10.1371/journal.pcbi.1014422.s034
(XLSX)
S13 Table. Putative SNP–cell type–specific eGene–phenotype1 (IDP or DB)–phenotype2 (DB or IDP) chains.
https://doi.org/10.1371/journal.pcbi.1014422.s035
(XLSX)
S1 STROBE Checklist. STROBE-MR checklist of recommended items to address in reports of Mendelian randomization studies [168,169].
https://doi.org/10.1371/journal.pcbi.1014422.s036
(DOCX)
Acknowledgments
The computations in this research were performed using the CFFF platform of Fudan University.
References
- 1. Sollis E, Mosaku A, Abid A, Buniello A, Cerezo M, Gil L, et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 2023;51(D1):D977–85. pmid:36350656
- 2. Uffelmann E, Huang QQ, Munung NS, de Vries J, Okada Y, Martin AR, et al. Genome-wide association studies. Nat Rev Methods Primers. 2021;1(1).
- 3. MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, et al. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res. 2017;45(D1):D896–901. pmid:27899670
- 4. Fehrmann RSN, Jansen RC, Veldink JH, Westra H-J, Arends D, Bonder MJ, et al. Trans-eQTLs reveal that independent genetic variants associated with a complex phenotype converge on intermediate genes, with a major role for the HLA. PLoS Genet. 2011;7(8):e1002197. pmid:21829388
- 5. Nicolae DL, Gamazon E, Zhang W, Duan S, Dolan ME, Cox NJ. Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet. 2010;6(4):e1000888. pmid:20369019
- 6. Zhu Z, Zhang F, Hu H, Bakshi A, Robinson MR, Powell JE, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. 2016;48(5):481–7. pmid:27019110
- 7. Baird DA, Liu JZ, Zheng J, Sieberts SK, Perumal T, Elsworth B, et al. Identifying drug targets for neurological and psychiatric disease via genetics and the brain transcriptome. PLoS Genet. 2021;17(1):e1009224. pmid:33417599
- 8. Yuan Z, Zhu H, Zeng P, Yang S, Sun S, Yang C, et al. Testing and controlling for horizontal pleiotropy with probabilistic Mendelian randomization in transcriptome-wide association studies. Nat Commun. 2020;11(1):3861. pmid:32737316
- 9. Porcu E, Rüeger S, Lepik K, eQTLGen Consortium, BIOS Consortium, Santoni FA, et al. Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits. Nat Commun. 2019;10(1):3300. pmid:31341166
- 10. Kibinge NK, Relton CL, Gaunt TR, Richardson TG. Characterizing the causal pathway for genetic variants associated with neurological phenotypes using human brain-derived proteome data. Am J Hum Genet. 2020;106(6):885–92. pmid:32413284
- 11. Yang C, Fagan AM, Perrin RJ, Rhinn H, Harari O, Cruchaga C. Mendelian randomization and genetic colocalization infer the effects of the multi-tissue proteome on 211 complex disease-related phenotypes. Genome Med. 2022;14(1):140. pmid:36510323
- 12. Bryois J, Calini D, Macnair W, Foo L, Urich E, Ortmann W, et al. Cell-type-specific cis-eQTLs in eight human brain cell types identify novel risk genes for psychiatric and neurological disorders. Nat Neurosci. 2022;25(8):1104–12. pmid:35915177
- 13. Yazar S, Alquicira-Hernandez J, Wing K, Senabouth A, Gordon MG, Andersen S, et al. Single-cell eQTL mapping identifies cell type-specific genetic control of autoimmune disease. Science. 2022;376(6589):eabf3041. pmid:35389779
- 14. de Klein N, Tsai EA, Vochteloo M, Baird D, Huang Y, Chen C-Y, et al. Brain expression quantitative trait locus and network analyses reveal downstream effects and putative drivers for brain-related diseases. Nat Genet. 2023;55(3):377–88. pmid:36823318
- 15. Kim-Hellmuth S, Aguet F, Oliva M, Muñoz-Aguirre M, Kasela S, Wucher V, et al. Cell type-specific genetic regulation of gene expression across human tissues. Science. 2020;369(6509):eaaz8528. pmid:32913075
- 16. Johansen N, Somasundaram S, Travaglini KJ, Yanny AM, Shumyatcher M, Casper T, et al. Interindividual variation in human cortical cell type abundance and expression. Science. 2023;382(6667):eadf2359. pmid:37824649
- 17. Cuomo ASE, Nathan A, Raychaudhuri S, MacArthur DG, Powell JE. Single-cell genomics meets human genetics. Nat Rev Genet. 2023;24(8):535–49. pmid:37085594
- 18. Wingo TS, Liu Y, Gerasimov ES, Vattathil SM, Wynne ME, Liu J, et al. Shared mechanisms across the major psychiatric and neurodegenerative diseases. Nat Commun. 2022;13(1):4314. pmid:35882878
- 19. Zhao B, Zhang J, Ibrahim JG, Luo T, Santelli RC, Li Y, et al. Large-scale GWAS reveals genetic architecture of brain white matter microstructure and genetic overlap with cognitive and mental health traits (n = 17,706). Mol Psychiatry. 2021;26(8):3943–55. pmid:31666681
- 20. Guo J, Yu K, Dong S-S, Yao S, Rong Y, Wu H, et al. Mendelian randomization analyses support causal relationships between brain imaging-derived phenotypes and risk of psychiatric disorders. Nat Neurosci. 2022;25(11):1519–27. pmid:36216997
- 21. Zhao B, Li T, Smith SM, Xiong D, Wang X, Yang Y, et al. Common variants contribute to intrinsic human brain functional networks. Nat Genet. 2022;54(4):508–17. pmid:35393594
- 22. Zhao B, Li T, Yang Y, Wang X, Luo T, Shan Y, et al. Common genetic variation influencing human white matter microstructure. Science. 2021;372(6548):eabf3736. pmid:34140357
- 23. Makowski C, van der Meer D, Dong W, Wang H, Wu Y, Zou J, et al. Discovery of genomic loci of the human cerebral cortex using genetically informed brain atlases. Science. 2022;375(6580):522–8. pmid:35113692
- 24. Zhao X, Yang A, Zhang Z-C, Yang YT, Zhao X-M. Deciphering the genetic architecture of human brain structure and function: a brief survey on recent advances of neuroimaging genomics. Brief Bioinform. 2023;24(2):bbad060. pmid:36847697
- 25. Thompson PM, Jahanshad N, Ching CRK, Salminen LE, Thomopoulos SI, Bright J, et al. ENIGMA and global neuroscience: a decade of large-scale studies of the brain in health and disease across more than 40 countries. Transl Psychiatry. 2020;10(1):100. pmid:32198361
- 26. Als TD, Kurki MI, Grove J, Voloudakis G, Therrien K, Tasanko E, et al. Depression pathophysiology, risk prediction of recurrence and comorbid psychiatric disorders using genome-wide analyses. Nat Med. 2023;29(7):1832–44. pmid:37464041
- 27. Grasby KL, Jahanshad N, Painter JN, Colodro-Conde L, Bralten J, Hibar DP, et al. The genetic architecture of the human cerebral cortex. Science. 2020;367(6484):eaay6690. pmid:32193296
- 28. Finucane HK, Reshef YA, Anttila V, Slowikowski K, Gusev A, Byrnes A, et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat Genet. 2018;50(4):621–9. pmid:29632380
- 29. Gandal MJ, Zhang P, Hadjimichael E, Walker RL, Chen C, Liu S, et al. Transcriptome-wide isoform-level dysregulation in ASD, schizophrenia, and bipolar disorder. Science. 2018;362(6420):eaat8127. pmid:30545856
- 30. Mu C, Dang X, Luo X-J. Mendelian randomization analyses reveal causal relationships between brain functional networks and risk of psychiatric disorders. Nat Hum Behav. 2024;8(7):1417–28. pmid:38724650
- 31. Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, Wheeler HE, Torres JM, et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9(1):1825. pmid:29739930
- 32. Wald A. The fitting of straight lines if both variables are subject to error. Ann Math Statist. 1940;11(3):284–300.
- 33. Zhu Z, Zheng Z, Zhang F, Wu Y, Trzaskowski M, Maier R, et al. Causal associations between risk factors and common diseases inferred from GWAS summary data. Nat Commun. 2018;9(1):224. pmid:29335400
- 34. Desikan RS, Ségonne F, Fischl B, Quinn BT, Dickerson BC, Blacker D, et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage. 2006;31(3):968–80. pmid:16530430
- 35. Hagler DJ Jr, Ahmadi ME, Kuperman J, Holland D, McDonald CR, Halgren E, et al. Automated white-matter tractography using a probabilistic diffusion tensor atlas: application to temporal lobe epilepsy. Hum Brain Mapp. 2009;30(5):1535–47. pmid:18671230
- 36. Qi T, Wu Y, Fang H, Zhang F, Liu S, Zeng J, et al. Genetic control of RNA splicing and its distinct role in complex trait variation. Nat Genet. 2022;54(9):1355–63. pmid:35982161
- 37. Emani PS, Liu JJ, Clarke D, Jensen M, Warrell J, Gupta C, et al. Single-cell genomics and regulatory networks for 388 human brains. Science. 2024;384(6698):eadi5199. pmid:38781369
- 38. Wu Q, Jia Z. Wiring the brain by clustered protocadherin neural codes. Neurosci Bull. 2021;37(1):117–31. pmid:32939695
- 39. Shen J, Yang B, Xie Z, Wu H, Zheng Z, Wang J, et al. Cell-type-specific gene modules related to the regional homogeneity of spontaneous brain activity and their associations with common brain disorders. Front Neurosci. 2021;15:639527. pmid:33958982
- 40. Jiang P, Scarpa JR, Fitzpatrick K, Losic B, Gao VD, Hao K, et al. A systems approach identifies networks and genes linking sleep and stress: implications for neuropsychiatric disorders. Cell Rep. 2015;11(5):835–48. pmid:25921536
- 41. Stelzer G, Rosen N, Plaschkes I, Zimmerman S, Twik M, Fishilevich S, et al. The GeneCards suite: from gene data mining to disease genome sequence analyses. Curr Protoc Bioinformatics. 2016;54:1.30.1–1.30.33. pmid:27322403
- 42. Gill D, Dib M-J, Cronjé HT, Karhunen V, Woolf B, Gagnon E, et al. Common pitfalls in drug target Mendelian randomization and how to avoid them. BMC Med. 2024;22(1):473. pmid:39407214
- 43. Dong P, Song L, Bendl J, Misir R, Shao Z, Edelstien J, et al. A multi-regional human brain atlas of chromatin accessibility and gene expression facilitates promoter-isoform resolution genetic fine-mapping. Nat Commun. 2024;15(1):10113. pmid:39578476
- 44. Siletti K, Hodge R, Mossi Albiach A, Lee KW, Ding S-L, Hu L, et al. Transcriptomic diversity of cell types across the adult human brain. Science. 2023;382(6667):eadd7046. pmid:37824663
- 45. Mathys H, Peng Z, Boix CA, Victor MB, Leary N, Babu S, et al. Single-cell atlas reveals correlates of high cognitive function, dementia, and resilience to Alzheimer’s disease pathology. Cell. 2023;186(20):4365–4385.e27. pmid:37774677
- 46. Haglund A, Zuber V, Yang Y, Abouzeid M, Feleke R, Ko JH, et al. Single-cell Mendelian randomisation identifies cell-type specific genetic effects on human brain disease and behaviour. bioRxiv. 2022.
- 47. O’Reilly D, Belgrad J, Ferguson C, Summers A, Sapp E, McHugh C, et al. Di-valent siRNA-mediated silencing of MSH3 blocks somatic repeat expansion in mouse models of Huntington’s disease. Mol Ther. 2023;31(6):1661–74. pmid:37177784
- 48. Ou Y-N, Yang Y-X, Deng Y-T, Zhang C, Hu H, Wu B-S, et al. Identification of novel drug targets for Alzheimer’s disease by integrating genetics and proteomes from brain and blood. Mol Psychiatry. 2021;26(10):6065–73. pmid:34381170
- 49. Masmudi-Martín M, Navarro-Lobato I, López-Aranda MF, Browning PGF, Simón A-M, López-Téllez JF, et al. Reversal of object recognition memory deficit in perirhinal cortex-lesioned rats and primates and in rodent models of aging and Alzheimer’s diseases. Neuroscience. 2020;448:287–98. pmid:32905841
- 50. Mahadevan J, Pathak AK, Vemula A, Nadella RK, Viswanath B, Jain S, et al. Analysis of whole exome sequencing in severe mental illness hints at selection of brain development and immune related genes. Sci Rep. 2021;11(1):21088. pmid:34702870
- 51. Adewuyi EO, O’Brien EK, Nyholt DR, Porter T, Laws SM. A large-scale genome-wide cross-trait analysis reveals shared genetic architecture between Alzheimer’s disease and gastrointestinal tract disorders. Commun Biol. 2022;5(1):691. pmid:35851147
- 52. Dang X, Liu J, Zhang Z, Luo X-J. Mendelian randomization study using dopaminergic neuron-specific eQTL identifies novel risk genes for schizophrenia. Mol Neurobiol. 2023;60(3):1537–46. pmid:36517655
- 53. Nishioka M, Kazuno A-A, Nakamura T, Sakai N, Hayama T, Fujii K, et al. Systematic analysis of exonic germline and postzygotic de novo mutations in bipolar disorder. Nat Commun. 2021;12(1):3750. pmid:34145229
- 54. Mulvey B, Dougherty JD. Transcriptional-regulatory convergence across functional MDD risk variants identified by massively parallel reporter assays. Transl Psychiatry. 2021;11(1):403. pmid:34294677
- 55.
Feng Y, Shen J. Psychiatric disorders and dietary habits: an investigation using Mendelian randomization analysis and insights into gene expression regulatory mechanisms. 2023.
- 56. Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536(7616):285–91. pmid:27535533
- 57. Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z, et al. ClusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innovation (Camb). 2021;2(3):100141. pmid:34557778
- 58. Zhao X, Song L, Yang A, Zhang Z, Zhang J, Yang YT, et al. Prioritizing genes associated with brain disorders by leveraging enhancer-promoter interactions in diverse neural cells and tissues. Genome Med. 2023;15(1):56. pmid:37488639
- 59. Sarter M, Bruno JP, Parikh V. Abnormal neurotransmitter release underlying behavioral and cognitive disorders: toward concepts of dynamic and function-specific dysregulation. Neuropsychopharmacology. 2007;32(7):1452–61. pmid:17164812
- 60. Elmer BM, McAllister AK. Major histocompatibility complex class I proteins in brain development and plasticity. Trends Neurosci. 2012;35(11):660–70. pmid:22939644
- 61. Wang K, Shang Y, Dou F. Brain aging: Hsp90 and neurodegenerative diseases. Adv Exp Med Biol. 2018;1086:93–103. pmid:30232754
- 62. Petit-Pedrol M, Groc L. Regulation of membrane NMDA receptors by dynamics and protein interactions. J Cell Biol. 2021;220(1):e202006101. pmid:33337489
- 63. Al-Harthi L. Wnt/β-catenin and its diverse physiological cell signaling pathways in neurodegenerative and neuropsychiatric disorders. J Neuroimmune Pharmacol. 2012;7(4):725–30. pmid:23114888
- 64. Andica C, Kamagata K, Hatano T, Saito A, Uchida W, Ogawa T, et al. Free-water imaging in white and gray matter in Parkinson’s disease. Cells. 2019;8(8):839. pmid:31387313
- 65. Radhakrishnan H, Ubele MF, Krumholz SM, Boaz K, Mefford JL, Jones ED, et al. Tacrolimus protects against age-associated microstructural changes in the Beagle brain. J Neurosci. 2021;41(23):5124–33. pmid:33952632
- 66. Laham BJ, Gould E. How stress influences the dynamic plasticity of the brain’s extracellular matrix. Front Cell Neurosci. 2022;15:814287. pmid:35145379
- 67. Reichardt LF, Prokop A. Introduction: the role of extracellular matrix in nervous system development and maintenance. Dev Neurobiol. 2011;71(11):883–8. pmid:21898856
- 68. Soles A, Selimovic A, Sbrocco K, Ghannoum F, Hamel K, Moncada EL, et al. Extracellular matrix regulation in physiology and in brain disease. Int J Mol Sci. 2023;24(8):7049. pmid:37108212
- 69. Smeland OB, Frei O, Dale AM, Andreassen OA. The polygenic architecture of schizophrenia - rethinking pathogenesis and nosology. Nat Rev Neurol. 2020;16(7):366–79. pmid:32528109
- 70. Johnson EC, Sanchez-Roige S, Acion L, Adams MJ, Bucholz KK, Chan G, et al. Polygenic contributions to alcohol use and alcohol use disorders across population-based and clinically ascertained samples. Psychol Med. 2021;51(7):1147–56. pmid:31955720
- 71. Zikopoulos B, Barbas H. Altered neural connectivity in excitatory and inhibitory cortical circuits in autism. Front Hum Neurosci. 2013;7:609. pmid:24098278
- 72. Morello F, Voikar V, Parkkinen P, Panhelainen A, Rosenholm M, Makkonen A, et al. ADHD-like behaviors caused by inactivation of a transcription factor controlling the balance of inhibitory and excitatory neuron development in the mouse anterior brainstem. Transl Psychiatry. 2020;10(1):357. pmid:33087695
- 73. Zhang K, Yu F, Zhu J, Han S, Chen J, Wu X, et al. Imbalance of excitatory/inhibitory neuron differentiation in neurodevelopmental disorders with an NR2F1 point mutation. Cell Rep. 2020;31(3):107521. pmid:32320667
- 74. Pepper RE, Pitman KA, Cullen CL, Young KM. How do cells of the oligodendrocyte lineage affect neuronal circuits to influence motor function, memory and mood? Front Cell Neurosci. 2018;12:399. pmid:30524235
- 75. Fang L-P, Zhao N, Caudal LC, Chang H-F, Zhao R, Lin C-H, et al. Impaired bidirectional communication between interneurons and oligodendrocyte precursor cells affects social cognitive behavior. Nat Commun. 2022;13(1):1394. pmid:35296664
- 76. Dobson-Stone C, Polly P, Korgaonkar MS, Williams LM, Gordon E, Schofield PR, et al. GSK3B and MAPT polymorphisms are associated with grey matter and intracranial volume in healthy individuals. PLoS One. 2013;8(8):e71750. pmid:23951236
- 77. Lencz T, Lambert C, DeRosse P, Burdick KE, Morgan TV, Kane JM, et al. Runs of homozygosity reveal highly penetrant recessive loci in schizophrenia. Proc Natl Acad Sci U S A. 2007;104(50):19942–7. pmid:18077426
- 78. Windrem MS, Osipovitch M, Liu Z, Bates J, Chandler-Militello D, Zou L, et al. Human iPSC glial mouse chimeras reveal glial contributions to schizophrenia. Cell Stem Cell. 2017;21(2):195–208.e6. pmid:28736215
- 79. Butler VJ, Salazar DA, Soriano-Castell D, Alves-Ferreira M, Dennissen FJA, Vohra M, et al. Tau/MAPT disease-associated variant A152T alters tau function and toxicity via impaired retrograde axonal transport. Hum Mol Genet. 2019;28(9):1498–514. pmid:30590647
- 80. Caillet-Boudin M-L, Buée L, Sergeant N, Lefebvre B. Regulation of human MAPT gene expression. Mol Neurodegener. 2015;10:28. pmid:26170022
- 81. van der Meer D, Frei O, Kaufmann T, Shadrin AA, Devor A, Smeland OB, et al. Understanding the genetic determinants of the brain with MOSTest. Nat Commun. 2020;11(1):3512. pmid:32665545
- 82. Jun G, Ibrahim-Verbaas CA, Vronskaya M, Lambert J-C, Chung J, Naj AC, et al. A novel Alzheimer disease locus located near the gene encoding tau protein. Mol Psychiatry. 2016;21(1):108–17. pmid:25778476
- 83. van der Meer D, Kaufmann T, Shadrin AA, Makowski C, Frei O, Roelfs D, et al. The genetic architecture of human cortical folding. Sci Adv. 2021;7(51):eabj9446. pmid:34910505
- 84. Persyn E, Hanscombe KB, Howson JMM, Lewis CM, Traylor M, Markus HS. Genome-wide association study of MRI markers of cerebral small vessel disease in 42,310 participants. Nat Commun. 2020;11(1):2175. pmid:32358547
- 85. Geurtsen ML, Jaddoe VWV, Gaillard R, Felix JF. Associations of maternal early-pregnancy blood glucose and insulin concentrations with DNA methylation in newborns. Clin Epigenetics. 2020;12(1):134. pmid:32894192
- 86. Karlsson Linnér R, Biroli P, Kong E, Meddens SFW, Wedow R, Fontana MA, et al. Genome-wide association analyses of risk tolerance and risky behaviors in over 1 million individuals identify hundreds of loci and shared genetic influences. Nat Genet. 2019;51(2):245–57. pmid:30643258
- 87. Balu DT, Takagi S, Puhl MD, Benneyworth MA, Coyle JT. D-serine and serine racemase are localized to neurons in the adult mouse and human forebrain. Cell Mol Neurobiol. 2014;34(3):419–35. pmid:24436034
- 88. Toikumo S, Xu H, Gelernter J, Kember RL, Kranzler HR. Integrating human brain proteomic data with genome-wide association study findings identifies novel brain proteins in substance use traits. Neuropsychopharmacology. 2022;47(13):2292–9. pmid:35941285
- 89. Dong G, Feng J, Sun F, Chen J, Zhao X-M. A global overview of genetically interpretable multimorbidities among common diseases in the UK Biobank. Genome Med. 2021;13(1):110. pmid:34225788
- 90. Hossain MM, Khan N, Sultana A, Ma P, McKyer ELJ, Ahmed HU, et al. Prevalence of comorbid psychiatric disorders among people with autism spectrum disorder: an umbrella review of systematic reviews and meta-analyses. Psychiatry Res. 2020;287:112922. pmid:32203749
- 91. Ertan D, Tarrada A, El-Hage W, Sanchez S, Four E, Mezouar N, et al. Prevalence of posttraumatic stress disorder in adults with epilepsy: a meta-analysis. Seizure. 2025;126:32–42. pmid:39919539
- 92. Chen M-H, Tsai S-J, Su T-P, Li C-T, Lin W-C, Chen T-J, et al. Increased risk of stroke in patients with obsessive-compulsive disorder: a nationwide longitudinal study. Stroke. 2021;52(8):2601–8. pmid:34039028
- 93. Kaur J, Sandhu RK, Kubra KT, Canenguez Benitez JS, Onyeaka HK, Akter S, et al. Substance use disorders in patients with Parkinson’s disease and adverse hospitalization outcomes: a national inpatient study. Cureus. 2021;13(6):e16033. pmid:34336520
- 94. Sprooten E, Franke B, Greven CU. The P-factor and its genomic and neural equivalents: an integrated perspective. Mol Psychiatry. 2022;27(1):38–48. pmid:33526822
- 95. Yang A, Rolls ET, Dong G, Du J, Li Y, Feng J, et al. Longer screen time utilization is associated with the polygenic risk for attention-deficit/hyperactivity disorder with mediation by brain white matter microstructure. EBioMedicine. 2022;80:104039. pmid:35509143
- 96. Suurmeijer AJ, Dickson BC, Swanson D, Sung Y-S, Zhang L, Antonescu CR. Novel SRF-ICA1L fusions in cellular myoid neoplasms with potential for malignant behavior. Am J Surg Pathol. 2020;44(1):55–60. pmid:31478943
- 97. Zhang C, Qin F, Li X, Du X, Li T. Identification of novel proteins for lacunar stroke by integrating genome-wide association data and human brain proteomes. BMC Med. 2022;20(1):211. pmid:35733147
- 98. Jia T, Ma Y, Qin F, Han F, Zhang C. Brain proteome-wide association study linking-genes in multiple sclerosis pathogenesis. Ann Clin Transl Neurol. 2023;10(1):58–69. pmid:36475386
- 99. Vataja R, Pohjasvaara T, Mäntylä R, Ylikoski R, Leppävuori A, Leskelä M, et al. MRI correlates of executive dysfunction in patients with ischaemic stroke. Eur J Neurol. 2003;10(6):625–31. pmid:14641506
- 100. Bisecco A, Caiazzo G, d’Ambrosio A, Sacco R, Bonavita S, Docimo R, et al. Fatigue in multiple sclerosis: The contribution of occult white matter damage. Mult Scler. 2016;22(13):1676–84. pmid:26846989
- 101. Luo X-J, Mattheisen M, Li M, Huang L, Rietschel M, Børglum AD, et al. Systematic integration of brain eQTL and GWAS identifies ZNF323 as a novel schizophrenia risk gene and suggests recent positive selection based on compensatory advantage on pulmonary function. Schizophr Bull. 2015;41(6):1294–308. pmid:25759474
- 102. Woodward ND, Karbasforoushan H, Heckers S. Thalamocortical dysconnectivity in schizophrenia. Am J Psychiatry. 2012;169(10):1092–9. pmid:23032387
- 103. Anticevic A, Haut K, Murray JD, Repovs G, Yang GJ, Diehl C, et al. Association of thalamic dysconnectivity and conversion to psychosis in youth and young adults at elevated clinical risk. JAMA Psychiatry. 2015;72(9):882–91. pmid:26267151
- 104. Patel S, Mahon K, Wellington R, Zhang J, Chaplin W, Szeszko PR. A meta-analysis of diffusion tensor imaging studies of the corpus callosum in schizophrenia. Schizophr Res. 2011;129(2–3):149–55. pmid:21530178
- 105. Sivagnanasundaram S, Crossett B, Dedova I, Cordwell S, Matsumoto I. Abnormal pathways in the genu of the corpus callosum in schizophrenia pathogenesis: a proteome study. Proteomics Clin Appl. 2007;1(10):1291–305. pmid:21136626
- 106. Saez-Atienzar S, Bandres-Ciga S, Langston RG, Kim JJ, Choi SW, Reynolds RH, et al. Genetic analysis of amyotrophic lateral sclerosis identifies contributing pathways and cell types. Sci Adv. 2021;7(3):eabd9036. pmid:33523907
- 107. van Rheenen W, van der Spek RAA, Bakker MK, van Vugt JJFA, Hop PJ, Zwamborn RAJ, et al. Common and rare variant association analyses in amyotrophic lateral sclerosis identify 15 risk loci with distinct genetic architectures and neuron-specific biology. Nat Genet. 2021;53(12):1636–48. pmid:34873335
- 108. Longinetti E, Fang F. Epidemiology of amyotrophic lateral sclerosis: an update of recent literature. Curr Opin Neurol. 2019;32(5):771–6. pmid:31361627
- 109. Walhovd KB, Johansen-Berg H, Káradóttir R. T. Unraveling the secrets of white matter--bridging the gap between cellular, animal and human imaging studies. Neuroscience 2014;276:2–13,
- 110. Sporns O. Structure and function of complex brain networks. Dialogues Clin Neurosci. 2013;15(3):247–62. pmid:24174898
- 111. Betzel RF, Bassett DS. Specificity and robustness of long-distance connections in weighted, interareal connectomes. Proc Natl Acad Sci U S A. 2018;115(21):E4880–9. pmid:29739890
- 112. Yang YT, Gan Z, Zhang J, Zhao X, Yang Y, Han S, et al. STAB2: an updated spatio-temporal cell atlas of the human and mouse brain. Nucleic Acids Res. 2024;52(D1):D1033–41. pmid:37904591
- 113. Stern Y. Cognitive reserve in ageing and Alzheimer’s disease. Lancet Neurol. 2012;11(11):1006–12. pmid:23079557
- 114. Rentz DM, Mormino EC, Papp KV, Betensky RA, Sperling RA, Johnson KA. Cognitive resilience in clinical and preclinical Alzheimer’s disease: the Association of Amyloid and Tau Burden on cognitive performance. Brain Imaging Behav. 2017;11(2):383–90. pmid:27738998
- 115. Fu H, Rodriguez GA, Herman M, Emrani S, Nahmani E, Barrett G, et al. Tau pathology induces excitatory neuron loss, grid cell dysfunction, and spatial memory deficits reminiscent of early Alzheimer’s disease. Neuron. 2017;93(3):533-541.e5. pmid:28111080
- 116. Palop JJ, Chin J, Roberson ED, Wang J, Thwin MT, Bien-Ly N, et al. Aberrant excitatory neuronal activity and compensatory remodeling of inhibitory hippocampal circuits in mouse models of Alzheimer’s disease. Neuron. 2007;55(5):697–711. pmid:17785178
- 117. Quintero-Rivera F, Chan A, Donovan DJ, Gusella JF, Ligon AH. Disruption of a synaptotagmin (SYT14) associated with neurodevelopmental abnormalities. Am J Med Genet A. 2007;143A(6):558–63. pmid:17304550
- 118. Meyer A, Ku C, Hatleberg WL, Telmer CA, Hinman V. New hypotheses of cell type diversity and novelty from orthology-driven comparative single cell and nuclei transcriptomics in echinoderms. Elife. 2023;12:e80090. pmid:37470227
- 119. Lee SE, Simons SB, Heldt SA, Zhao M, Schroeder JP, Vellano CP, et al. RGS14 is a natural suppressor of both synaptic plasticity in CA2 neurons and hippocampal-based learning and memory. Proc Natl Acad Sci U S A. 2010;107(39):16994–8. pmid:20837545
- 120. Ren A, Zhang H, Xie Z, Ma X, Ji W, He DZZ, et al. Regulation of hippocampus-dependent memory by the zinc finger protein Zbtb20 in mature CA1 neurons. J Physiol. 2012;590(19):4917–32. pmid:22777671
- 121. Jeub M, Emrich M, Pradier B, Taha O, Gailus-Durner V, Fuchs H, et al. The transcription factor Smad-interacting protein 1 controls pain sensitivity via modulation of DRG neuron excitability. Pain. 2011;152(10):2384–98. pmid:21862221
- 122. Bryois J, Calini D, Macnair W, Foo L, Urich E, Ortmann W, et al. Cell-type-specific cis-eQTLs in eight human brain cell types identify novel risk genes for psychiatric and neurological disorders. Nat Neurosci. 2022;25(8):1104–12. pmid:35915177
- 123. Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562(7726):203–9. pmid:30305743
- 124. Benyamin B, Pourcain B, Davis OS, Davies G, Hansell NK, Brion M-JA, et al. Childhood intelligence is heritable, highly polygenic and associated with FNBP1L. Mol Psychiatry. 2014;19(2):253–8. pmid:23358156
- 125. Liu M, Jiang Y, Wedow R, Li Y, Brazel DM, Chen F, et al. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat Genet. 2019;51(2):237–44. pmid:30643251
- 126. Lee JJ, Wedow R, Okbay A, Kong E, Maghzian O, Zacher M, et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet. 2018;50(8):1112–21. pmid:30038396
- 127. Savage JE, Jansen PR, Stringer S, Watanabe K, Bryois J, de Leeuw CA, et al. Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nat Genet. 2018;50(7):912–9. pmid:29942086
- 128. Nagel M, Jansen PR, Stringer S, Watanabe K, de Leeuw CA, Bryois J, et al. Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways. Nat Genet. 2018;50(7):920–7. pmid:29942085
- 129. Jansen IE, Savage JE, Watanabe K, Bryois J, Williams DM, Steinberg S, et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nat Genet. 2019;51(3):404–13. pmid:30617256
- 130. Nicolas A, Kenna KP, Renton AE, Ticozzi N, Faghri F, Chia R, et al. Genome-wide analyses identify KIF5A as a Novel ALS Gene. Neuron. 2018;97(6):1267–88. pmid:29566793
- 131. International League Against Epilepsy Consortium on Complex Epilepsies. Genome-wide mega-analysis identifies 16 loci and highlights diverse biological mechanisms in the common epilepsies. Nat Commun. 2018;9(1):5269. pmid:30531953
- 132. Woo D, Falcone GJ, Devan WJ, Brown WM, Biffi A, Howard TD, et al. Meta-analysis of genome-wide association studies identifies 1q22 as a susceptibility locus for intracerebral hemorrhage. Am J Hum Genet. 2014;94(4):511–21. pmid:24656865
- 133. Watanabe K, Jansen PR, Savage JE, Nandakumar P, Wang X, 23andMe Research Team, et al. Genome-wide meta-analysis of insomnia prioritizes genes associated with metabolic and psychiatric pathways. Nat Genet. 2022;54(8):1125–32. pmid:35835914
- 134. Malik R, Chauhan G, Traylor M, Sargurupremraj M, Okada Y, Mishra A, et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat Genet. 2018;50(4):524–37. pmid:29531354
- 135. International Multiple Sclerosis Genetics Consortium. Multiple sclerosis genomic map implicates peripheral immune cells and microglia in susceptibility. Science. 2019;365(6460):eaav7188. pmid:31604244
- 136. Nalls MA, Blauwendraat C, Vallerga CL, Heilbron K, Bandres-Ciga S, Chang D, et al. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studies. Lancet Neurol. 2019;18(12):1091–102. pmid:31701892
- 137. Demontis D, Walters RK, Martin J, Mattheisen M, Als TD, Agerbo E, et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nat Genet. 2019;51(1):63–75. pmid:30478444
- 138. Purves KL, Coleman JRI, Meier SM, Rayner C, Davis KAS, Cheesman R, et al. A major role for common genetic variation in anxiety disorders. Mol Psychiatry. 2020;25(12):3292–303. pmid:31748690
- 139. Grove J, Ripke S, Als TD, Mattheisen M, Walters RK, Won H, et al. Identification of common genetic risk variants for autism spectrum disorder. Nat Genet. 2019;51(3):431–44. pmid:30804558
- 140. Walters RK, Polimanti R, Johnson EC, McClintick JN, Adams MJ, Adkins AE, et al. Transancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. Nat Neurosci. 2018;21(12):1656–69. pmid:30482948
- 141. Mullins N, Forstner AJ, O’Connell KS, Coombes B, Coleman JRI, Qiao Z, et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat Genet. 2021;53(6):817–29. pmid:34002096
- 142. Wray NR, Ripke S, Mattheisen M, Trzaskowski M, Byrne EM, Abdellaoui A, et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nat Genet. 2018;50(5):668–81. pmid:29700475
- 143. International Obsessive Compulsive Disorder Foundation Genetics Collaborative (IOCDF-GC) and OCD Collaborative Genetics Association Studies (OCGAS). Revealing the complex genetic architecture of obsessive-compulsive disorder using meta-analysis. Mol Psychiatry. 2018;23(5):1181–8. pmid:28761083
- 144. Meier SM, Trontti K, Purves KL, Als TD, Grove J, Laine M, et al. Genetic variants associated with anxiety and stress-related disorders: a genome-wide association study and mouse-model study. JAMA Psychiatry. 2019;76(9):924–32. pmid:31116379
- 145. Trubetskoy V, Pardiñas AF, Qi T, Panagiotaropoulou G, Awasthi S, Bigdeli TB, et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature. 2022;604(7906):502–8. pmid:35396580
- 146. Yu D, Sul JH, Tsetsos F, Nawaz MS, Huang AY, Zelaya I, et al. Interrogating the genetic determinants of tourette’s syndrome and other tic disorders through genome-wide association studies. Am J Psychiatry. 2019;176(3):217–27. pmid:30818990
- 147. Zhao B, Luo T, Li T, Li Y, Zhang J, Shan Y, et al. Genome-wide association analysis of 19,629 individuals identifies variants influencing regional brain volumes and refines their genetic co-architecture with cognitive and mental health traits. Nat Genet. 2019;51(11):1637–44. pmid:31676860
- 148. Klein A, Tourville J. 101 labeled brain images and a consistent human cortical labeling protocol. Front Neurosci. 2012;6:171. pmid:23227001
- 149.
Mori S, Wakana S, Van Zijl PC, Nagae-Poetscher L. MRI atlas of human white matter. Elsevier; 2005.
- 150. Wakana S, Caprihan A, Panzenboeck MM, Fallon JH, Perry M, Gollub RL, et al. Reproducibility of quantitative tractography methods applied to cerebral white matter. Neuroimage. 2007;36(3):630–44. pmid:17481925
- 151. Hua K, Zhang J, Wakana S, Jiang H, Li X, Reich DS, et al. Tract probability maps in stereotaxic spaces: analyses of white matter anatomy and tract-specific quantification. Neuroimage. 2008;39(1):336–47. pmid:17931890
- 152. Barbeira AN, Bonazzola R, Gamazon ER, Liang Y, Park Y, Kim-Hellmuth S, et al. Exploiting the GTEx resources to decipher the mechanisms at GWAS loci. Genome Biol. 2021;22(1):49. pmid:33499903
- 153. Huckins LM, Dobbyn A, Ruderfer DM, Hoffman G, Wang W, Pardiñas AF, et al. Gene expression imputation across multiple brain regions provides insights into schizophrenia risk. Nat Genet. 2019;51(4):659–74. pmid:30911161
- 154. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. pmid:17701901
- 155. Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37(7):658–65. pmid:24114802
- 156. Kurilshikov A, Medina-Gomez C, Bacigalupe R, Radjabzadeh D, Wang J, Demirkan A, et al. Large-scale association analyses identify host factors influencing human gut microbiome composition. Nat Genet. 2021;53(2):156–65. pmid:33462485
- 157. Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-base platform supports systematic causal inference across the human phenome. Elife. 2018;7:e34408. pmid:29846171
- 158. Benjamini Y, Hochberg Y. Controlling the false discovery rate - a practical and powerful approach to multiple testing. J R Stat Soc B. 1995;57(1):289–300. pmid:WOS:A1995QE45300017
- 159. Cragg JG, Donald SG. Testing identifiability and specification in instrumental variable models. Econom Theory. 1993;9(2):222–40.
- 160. Flicek P, Amode MR, Barrell D, Beal K, Billis K, Brent S, et al. Ensembl 2014. Nucleic Acids Res. 2014;42:D749–55. pmid:24316576
- 161. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25–9. pmid:10802651
- 162. Aleksander SA, Balhoff J, Carbon S, Cherry JM, Drabkin HJ, Ebert D, et al. The Gene Ontology knowledgebase in 2023. Genetics. 2023;224(1). pmid:36866529
- 163. Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44(2):512–25. pmid:26050253
- 164. Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet Epidemiol. 2016;40(4):304–14. pmid:27061298
- 165. Milne RL, Kuchenbaecker KB, Michailidou K, Beesley J, Kar S, Lindström S, et al. Identification of ten variants associated with risk of estrogen-receptor-negative breast cancer. Nat Genet. 2017;49(12):1767–78. pmid:29058716
- 166. Hartwig FP, Davey Smith G, Bowden J. Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption. Int J Epidemiol. 2017;46(6):1985–98. pmid:29040600
- 167. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh P-R, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47(11):1236–41. pmid:26414676
- 168. Skrivankova VW, Richmond RC, Woolf BAR, Yarmolinsky J, Davies NM, Swanson SA. Strengthening the Reporting of Observational Studies in Epidemiology using Mendelian Randomization (STROBE-MR) Statement. JAMA. 2021.
- 169. Skrivankova VW, Richmond RC, Woolf BAR, Davies NM, Swanson SA, VanderWeele TJ, et al. Strengthening the reporting of observational studies in epidemiology using mendelian randomisation (STROBE-MR): explanation and elaboration. BMJ. 2021;375:n2233. pmid:34702754