Genetic variants that modify brain gene expression may also influence risk for human diseases. We measured expression levels of 24,526 transcripts in brain samples from the cerebellum and temporal cortex of autopsied subjects with Alzheimer's disease (AD, cerebellar n = 197, temporal cortex n = 202) and with other brain pathologies (non–AD, cerebellar n = 177, temporal cortex n = 197). We conducted an expression genome-wide association study (eGWAS) using 213,528 cisSNPs within ±100 kb of the tested transcripts. We identified 2,980 cerebellar cisSNP/transcript level associations (2,596 unique cisSNPs) significant in both ADs and non–ADs (q<0.05, p = 7.70×10−5–1.67×10−82). Of these, 2,089 were also significant in the temporal cortex (p = 1.85×10−5–1.70×10−141). The top cerebellar cisSNPs had 2.4-fold enrichment for human disease-associated variants (p<10−6). We identified novel cisSNP/transcript associations for human disease-associated variants, including progressive supranuclear palsy SLCO1A2/rs11568563, Parkinson's disease (PD) MMRN1/rs6532197, Paget's disease OPTN/rs1561570; and we confirmed others, including PD MAPT/rs242557, systemic lupus erythematosus and ulcerative colitis IRF5/rs4728142, and type 1 diabetes mellitus RPS26/rs1701704. In our eGWAS, there was 2.9–3.3 fold enrichment (p<10−6) of significant cisSNPs with suggestive AD–risk association (p<10−3) in the Alzheimer's Disease Genetics Consortium GWAS. These results demonstrate the significant contributions of genetic factors to human brain gene expression, which are reliably detected across different brain regions and pathologies. The significant enrichment of brain cisSNPs among disease-associated variants advocates gene expression changes as a mechanism for many central nervous system (CNS) and non–CNS diseases. Combined assessment of expression and disease GWAS may provide complementary information in discovery of human disease variants with functional implications. Our findings have implications for the design and interpretation of eGWAS in general and the use of brain expression quantitative trait loci in the study of human disease genetics.
Genetic variants that regulate gene expression levels can also influence human disease risk. Discovery of genomic loci that alter brain gene expression levels (brain expression quantitative trait loci = eQTLs) can be instrumental in the identification of genetic risk underlying both central nervous system (CNS) and non–CNS diseases. To systematically assess the role of brain eQTLs in human disease and to evaluate the influence of brain region and pathology in eQTL mapping, we performed an expression genome-wide association study (eGWAS) in 773 brain samples from the cerebellum and temporal cortex of ~200 autopsied subjects with Alzheimer's disease (AD) and ~200 with other brain pathologies (non–AD). We identified ~3,000 significant associations between cisSNPs near ~700 genes and their cerebellar transcript levels, which replicate in ADs and non–ADs. More than 2,000 of these associations were reproducible in the temporal cortex. The top cisSNPs are enriched for both CNS and non–CNS disease-associated variants. We identified novel and confirmed previous cisSNP/transcript associations for many disease loci, suggesting gene expression regulation as their mechanism of action. These findings demonstrate the reproducibility of the eQTL approach across different brain regions and pathologies, and advocate the combined use of gene expression and disease GWAS for identification and functional characterization of human disease-associated variants.
Citation: Zou F, Chai HS, Younkin CS, Allen M, Crook J, et al. (2012) Brain Expression Genome-Wide Association Study (eGWAS) Identifies Human Disease-Associated Variants. PLoS Genet 8(6): e1002707. doi:10.1371/journal.pgen.1002707
Editor: Greg Gibson, Georgia Institute of Technology, United States of America
Received: October 14, 2011; Accepted: March 27, 2012; Published: June 7, 2012
Copyright: © 2012 Zou et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Support for this research was provided by the National Institutes of Health grants: National Institute on Aging (R01 032990 to NET and R01 AG018023 to NR Graff-Radford and SG Younkin); Mayo Alzheimer's Disease Research Center: (P50 AG016574 to RC Petersen, DW Dickson, NR Graff-Radford, SG Younkin, and N Ertekin-Taner); Mayo Alzheimer's Disease Patient Registry: (U01 AG006576 to RC Petersen); National Institute on Aging (AG025711, AG017216, AG003949 to DW Dickson). This project was also generously supported by the Robert and Clarice Smith and Abigail Van Buren Alzheimer's Disease Research Program (to RC Petersen, DW Dickson, NR Graff-Radford, and SG Younkin), and by the Palumbo Professorship in Alzheimer's Disease Research (to SG Younkin). N Ertekin-Taner is the recipient of National Institutes of Health (KL2 RR024151) and Siragusa Foundation grants. ADGC Authors: The National Institutes of Health, National Institute on Aging (NIH-NIA) supported this work through the following grants: ADGC, U01 AG032984, RC2 AG036528; NACC, U01 AG016976; NCRAD, U24 AG021886; NIA LOAD, U24 AG026395, U24 AG026390; Banner Sun Health Research Institute, U24 NS072026, P30 AG19610 Arizona Department of Health Services contract 211002, Arizona Biomedical Research Commission contracts 4001, 0011, 05-901 and 100, Michael J. Fox Foundation for Parkinson's Research; Boston University, P30 AG013846, R01 HG02213, K24 AG027841, U01 AG10483, R01 CA129769, R01 MH080295, R01 AG009029, R01 AG017173, R01 AG025259; Columbia University, P50 AG008702, R37 AG015473; Duke University, P30 AG028377; Emory University, AG025688; Group Health UO1 AG06781; Indiana University, P30 AG10133; Johns Hopkins University, P50 AG005146, R01 AG020688; Massachusetts General Hospital, P50 AG005134; Mayo Clinic, P50 AG016574; Mount Sinai School of Medicine, P50 AG005138, P01 AG002219; New York University, P30 AG08051, MO1RR00096, and UL1 RR029893; Northwestern University, P30 AG013854; Oregon Health and Science University, P30 AG008017, R01 AG026916; Rush University, P30 AG010161, R01 AG019085, R01 AG15819, R01 AG17917, R01 AG30146; University of Alabama at Birmingham, P50 AG016582, UL1RR02777; University of Arizona/TGEN, P30 AG019610, R01 AG031581, R01 NS059873; University of California, Davis, P30 AG010129; University of California, Irvine, P50 AG016573, P50, P50 AG016575, P50 AG016576, P50 AG016577; University of California, Los Angeles, P50 AG016570; University of California, San Diego, P50 AG005131; University of California, San Francisco, P50 AG023501, P01 AG019724; University of Kentucky, P30 AG028383; University of Michigan, P50 AG008671; University of Pennsylvania, P30 AG010124, AG17586; University of Pittsburgh, P50 AG005133, AG030653; University of Southern California, P50 AG005142; University of Texas Southwestern, P30 AG012300; University of Miami, R01 AG027944, AG010491, AG027944, AG021547, AG019757; University of Washington, P50 AG005136, UO1 HG004610; Vanderbilt University, R01 AG019085; and Washington University, P50 AG005681, P01 AG03991. ADNI Funding for ADNI is through the Northern California Institute for Research and Education by grants from Abbott, AstraZeneca AB, Bayer Schering Pharma AG, Bristol-Myers Squibb, Eisai Global Clinical Development, Elan, Genentech, GE Healthcare, GlaxoSmithKline, Innogenetics, Johnson and Johnson, Eli Lilly, Medpace, Merck, Novartis AG, Pfizer Inc, F. Hoffman-La Roche, Schering-Plough, Synarc, Alzheimer's Association, Alzheimer's Drug Discovery Foundation, the Dana Foundation, and by the National Institute of Biomedical Imaging and Bioengineering and NIA grants U01 AG024904, RC2 AG036535, K01 AG030514. We thank Creighton Phelps, Marcelle Morrison-Bogorad, and Marilyn Miller from NIA who are ex-officio ADGC members. Support was also from the Alzheimer's Association (LAF, IIRG-08-89720; MP-V, IIRG-05-14147) and the Veterans Affairs Administration. P.S.G.-H. is supported by Wellcome Trust, Howard Hughes Medical Institute, and the Canadian Institute of Health Research. Additional acknowledgements for TGen data: Many data and biomaterials were collected from several National Institute on Aging (NIA) and National Alzheimer's Coordinating Center (NACC, grant #U01 AG016976) funded sites. Amanda J. Myers, PhD (University of Miami, Department of Psychiatry) and John A. Hardy, PhD (Reta Lila Weston Institute, University College London) collected and prepared the series. Marcelle Morrison-Bogorad, PhD., Tony Phelps, PhD, and Walter Kukull PhD are thanked for helping to co-ordinate this collection. The directors, pathologist and technicians involved include: National Institute on Aging: Ruth Seemann, John Hopkins Alzheimer's Disease Research Center (NIA grant # AG05146): Juan C. Troncoso, MD, Dr. Olga Pletnikova, University of California, Los Angeles (NIA grant # P50 AG16570): Harry Vinters, MD, Justine Pomakian, The Kathleen Price Bryan Brain Bank, Duke University Medical Center (NIA grant #AG05128, NINDS grant # NS39764, NIMH MH60451 also funded by Glaxo Smith Kline): Christine Hulette, MD, Director, John F. Ervin, Stanford University: Dikran Horoupian, MD, Ahmad Salehi, MD, PhD, New York Brain Bank, Taub Institute, Columbia University (NYBB): Jean Paul Vonsattel, MD, Katerina Mancevska, Massachusetts Alzheimer's Disease Research Center (P50 AG005134): E. Tessa Hedley-Whyte, MD, MP Frosch, MD, Karlotta Fitch, University of Michigan (NIH grant P50-AG08671): Dr. Roger Albin, Lisa Bain, Eszter Gombosi, University of Kentucky (NIH #AG05144): William Markesbery, MD, Sonya Anderson, Mayo Clinic, Jacksonville: Dennis W. Dickson, MD, Natalie Thomas, University of Southern California: Caroll A. Miller, MD, Jenny Tang, M.S., Dimitri Diaz, Washington University, St Louis Alzheimer's Disease Research Center (NIH #P50AG05681): Dan McKeel, MD, John C. Morris, MD, Eugene Johnson, Jr., PhD, Virginia Buckles, PhD, Deborah Carter, University of Washington, Seattle (NIH #P50 AG05136): Thomas Montine, MD, PhD, Aimee Schantz, MEd., University of Pennsylvania School of Medicine, Alzheimer's Disease Research Center: John Q. Trojanowski, MD, Virginia M Lee, MD, Vivianna Van Deerlin, MD, Terry Schuck, Boston University Alzheimer's Disease Research Center (NIH grant P30-AG13846): Ann C. McKee, Carol Kubilus Sun Health Research Institute, Arizona (NIA #P30 AG19610): Joseph Rogers, PhD, Thomas G. Beach, MD, PhD, Lucia I. Sue Emory University: Bruce H. Wainer, MD, PhD, Marla Gearing, PhD, University of Texas, Southwestern Medical School: Charles L. White, III, M.D., Roger Rosenberg, Marilyn Howell, Joan Reisch, University of California, Davis: William Ellis, MD, Mary Ann Jarvis, Rush University Medical Center, Rush Alzheimer's Disease Center (NIH #AG10161): David A. Bennett, M.D. Julie A. Schneider, MD, MS, Karen Skish, MS, PA (ASCP)MT, Wayne T Longman, University of Miami/NPF Brain Endowment Bank: Deborah C. Mash, MD, Margaret J Basile, Mitsuko Tanaka Oregon Health and Science University: Randy Wotljer, PhD. Additional tissues include samples from the following sites: Newcastle Brain Tissue Resource (funding via the Medical Research Council, local NHS trusts and Newcastle University): C. M. Morris, MD, Ian G. McKeith, Robert H. Perry MRC London Brain Bank for Neurodegenerative Diseases (funding via the Medical Research Council): Simon Lovestone, Md PhD, Safa Al-Sarraj. MD, Claire Troakes, South West Dementia Brain Bank (funding via numerous sources including the Higher Education Funding Council for England (HEFCE), Alzheimer's Research Trust (ART), BRACE as well as North Bristol NHS Trust Research and Innovation Department and DeNDRoN): Seth Love, MD, Patrick Kehoe, PhD, Laura Palmer, The Netherlands Brain Bank (funding via numerous sources including Stichting MS Research, Brain Net Europe, Hersenstichting Nederland Breinbrekend Werk, International Parkinson Fonds, Internationale Stiching Alzheimer Onderzoek): Inge Huitinga, MD, Marleen Rademaker, Michiel Kooreman, Institut de Neuropatologia, Servei Anatomia Patologica, Universitat de Barcelona: Isidre Ferrer Abizanda, MD, PhD, Susana Casas Boluda.
Competing interests: NR Graff-Radford has served as a consultant to Codman and received grant support from Elan Pharmaceutical Research, Pfizer Pharmaceuticals, Medivation, and Forrest. RC Petersen has been a consultant to GE Healthcare and Elan Pharmaceuticals, has served on a data safety monitoring committee for Pfizer and Janssen Alzheimer Immunotherapy, and has provided a CME lecture for Novartis. ADGC Authors: T.D.B. received licensing fees from and is on the speaker's bureau of Athena Diagnostics. M.R.F. receives research funding from BristolMyersSquibb Company, Danone Research, Elan Pharmaceuticals, Eli Lilly, Novartis Pharmaceuticals, OctaPharma AG, Pfizer, and Sonexa Therapeutics; receives honoraria as scientific consultant from Accera, Astellas Pharma US Inc., Baxter, Bayer Pharmaceuticals Corporation, BristolMyersSquibb, Eisai Medical Research, GE Healthcare, Medavante, Medivation, Merck, Novartis Pharmaceuticals, Pfizer, Prana Biotechnology, QR Pharma, The sanofi-aventis Group, and Toyama Chemical; and is speaker for Eisai Medical Research, Forest Laboratories, Pfizer, and Novartis Pharmaceuticals. A.M.G. has research funding from AstraZeneca, Pfizer and Genentech, and has received remuneration for giving talks at Pfizer and Genentech. R.C.P. is on the Safety Monitory Committee of Pfizer (Wyeth) and a consultant to the Safety Monitoring Committee at Janssen Alzheimer's Immunotherapy Program (Elan), to Elan Pharmaceuticals, and to GE Healthcare. R.E.T. is a consultant to Eisai, Japan, in the area of Alzheimer's genetics and a shareholder in, and consultant to, Pathway Genomics, San Diego, CA.
Expression quantitative trait loci (eQTL) are genomic loci that influence levels of gene transcripts and can be mapped by genetic linkage in families or eGWAS in unrelated populations . eQTLs are distinct from other complex trait loci, because they directly identify the target gene, since the transcript trait is a reflection of the mRNA level from a single gene. Furthermore, eQTLs imply regulation of gene expression as the mechanism of action for the underlying variants. Recently, few studies identified an enrichment of eQTLs from lymphocytes  and lymphoblasts  amongst human complex disease and trait GWAS loci, suggesting that eQTLs may be useful in mapping human disease-associated variants.
Most human eQTL mapping studies to date assessed immortalized lymphoblastoid cell lines , , , , , , , ,  and family-based samples from the CEPH , , , , ,  (Centre d'Etude du polymorphisme humain) or HapMap , , ,  repositories. Multiple other small and large scale eQTL studies investigated other tissues and populations including lymphocytes , monocytes , T-cells , fibroblasts , skin , subcutaneous and omental adipose tissue , , bone , liver  and brain , .
Despite the assumption that brain eQTLs would also influence human diseases and traits, there are no systematic gene mapping studies for human diseases that utilize brain gene expression phenotypes. Furthermore, the brain region most relevant for such studies and the influence of brain pathology on eQTL mapping studies are largely unknown. To address these issues, we performed an eQTL using cerebellar tissue from 197 subjects with Alzheimer's disease (AD) neuropathology and 177 with other pathologies (non–AD). We validated the results in a different brain region using temporal cortex samples from 202 ADs and 197 non–ADs (Supplementary Tables 1 and 2 in Dataset S1), 85% of whom overlapped with the cerebellar group. We evaluated significant cisSNPs from our study for association with human diseases/traits using a GWAS catalog . We also assessed our significant eGWAS cisSNPs for association with two central nervous system (CNS) diseases, progressive supranuclear palsy (PSP)  and AD risk , using two recent GWAS for these diseases.
Our results demonstrate the power of the brain eQTL approach in the identification and characterization of many human CNS and non–CNS disease-associated variants. This study also highlights the remarkable reproducibility of human eQTLs across different brain regions and pathologies, which has implications for the design of eGWAS in general. Combined assessment of eQTLs and disease risk loci can be instrumental in mapping disease genes with regulatory variants.
Levels of 24,526 transcripts for 18,401 genes were measured in 773 brain samples from the cerebellum and temporal cortex of ~200 ADs and ~200 non–ADs, using WG-DASL assays. Nearly 70% of all probes could be detected in >75% of the samples tested. All autopsied subjects were genotyped for 313,330 single nucleotide polymorphisms (SNPs) from Illumina HumanHap300-Duo Genotyping BeadChips, as part of the Mayo AD GWAS . An eGWAS testing association of transcript levels with cisSNPs was performed using multivariable linear regression correcting for APOE ε4 dosage, age at death, gender and multiple technical variables. False discovery rate (FDR)-based q values  (q) were used for corrections of multiple testing.
To achieve internal replication, we first analyzed the ADs and non–ADs separately. In our cerebellar eGWAS, at q<0.05, there were 5,271 significant cisSNP/transcript associations (1,156 unique genes) in the AD, 4,450 (1,022 unique genes) in the non–AD and 10,281 (1,875 unique genes) in the combined datasets. Q-Q plots suggested a clear excess of significant results (Figure 1, Figure S1a–S1d). 2,980 cisSNP/transcript associations (2,596 unique cisSNPs, 686 unique genes) were significant at q<0.05 in both ADs and non–ADs (Table 1, Supplementary Table 3 in Dataset S1, Figure S2). The direction and magnitude of associations in both groups demonstrate remarkable similarities (Pearson's correlation coefficient = 0.98, p<0.0001). The box plots depicted for some of these top associations (Figure S3a–S3c) demonstrate this replication in ADs and non–ADs. Most associations have an additive or dominant pattern with respect to the minor allele.
To assess the genetic component contributing to gene expression variability, we estimated intraclass correlation coefficients (ICC)  in the 15 samples measured in replicate on 5–6 different plates and 2–3 different days. Between-subject variance accounted for a median of 60% of total probe expression variance (Supplementary Table 4 in Dataset S1; Figure S4). The 746 probes for the top 2,980 cerebellar cisSNP associations had higher between-subject variance (median = 78%).
Using multivariable linear regression, we next estimated the percent variation in cerebellar probe expression levels due to the “best” cisSNP for each transcript after accounting for technical and biological covariates. We found that the “best” cisSNP explained a median of ~3% of the expression variation. For the top 746 probes, the “best” cisSNPs accounted for a median of 18% of the expression variance (Table 2, Supplementary Table 5 in Dataset S1).
The top 2,980 cerebellar eGWAS associations were followed up in the temporal cortex validation study. We found that 2,685 top cerebellar cisSNP/transcript associations could be tested in the temporal cortex (2,387 unique cisSNPs, 677 unique probes and 625 unique genes) (Figure 1, Table 3, Supplementary Table 6 in Dataset S1). A total of 2,089 of these (1,888 unique cisSNPs, 502 unique probes and 471 unique genes) were significant after study-wide Bonferroni corrections, many of which had effect sizes showing remarkable similarity to those from the cerebellar eGWAS (Pearson's correlation coefficient = 0.94, p<0.0001).
The top cerebellar eGWAS results were also compared to published liver  and brain ,  eGWAS and overlap was identified for 4–11% of the top transcripts from these published studies (Text S1) Using HapMap2 genotypes, all transcripts and association threshold p<1.0E-4 in our eGWAS, we determined that 24–32% of the top transcripts from the published eGWAS overlapped with ours.
We used the cerebellar eGWAS as the discovery analysis and the temporal cortex eGWAS as the validation; since our goal is to identify significant cisSNP associations while minimizing any confounding factors due to pathology and given the fact that half of our subjects had pathologic AD, in which cerebellum is relatively unaffected whereas temporal cortex is one of the first affected brain regions. Nonetheless, we have also used temporal cortex as the discovery set and cerebellum as the validation, with remarkably similar results (Text S1, Supplementary Tables 7 and 8 in Dataset S1).
Enrichment of brain cisSNPs among human disease-associated SNPs
To examine whether the brain eGWAS approach identified variants implicated in human diseases/traits, we linked the 2,596 top cerebellar eGWAS cisSNPs to the “Catalog of Published GWAS” , which compiles weekly search results from all published GWAS of ≥100,000 SNPs where associations of p≤1.0E-05 are reported. We identified 47 cisSNPs that were also associated with 36 diseases/traits (Table 4, Supplementary Table 9 in Dataset S1). This represents a 2.4-fold enrichment of significant cerebellar cisSNPs amongst disease/trait associated SNPs, which is significant (p<10−6) based on simulations adjusted for minor allele frequencies  (Text S1).
Among the 36 diseases/traits associating with top cerebellar cisSNPs were central nervous system (CNS)-related conditions including Parkinson's disease (PD), Moyamoya disease, cognitive performance and attention-deficit hyperactivity disorder (ADHD). We both identified novel cisSNP/transcript associations and confirmed some previously reported ones. We found novel associations between rs6532197, which confers increased risk of PD , and higher brain levels of MMRN1 (cerebellar eGWAS p = pCer = 4.86×10−12; temporal cortex eGWAS p = pTCx = 4.57×10−9). MMRN1 encodes for multimerin and was found to be in a region of duplication/triplication with SNCA (encoding α-synuclein), a well-established risk gene in PD . We found no significant cisSNP/SNCA level associations. These results suggest that MMRN1 may deserve further investigations as an additional PD risk gene.
Another example of a cisSNP which associates with human disease risk is rs8070723, the minor allele of which is associated with reduced risk of PD  and reduced brain MAPT levels (pCer = 3.36×10−7–7.02×10−69; pTCx = 9.03×10−4–8.61×10−44). Rs11012 minor allele, which confers increased risk of PD , showed association with lower brain LRRC37A4 levels (pCer = 1.69×10−33; pTCx = 3.378E−20). MAPT region variants were previously identified to associate with brain levels of MAPT and LRRC37A4 in neurologically normal subjects , , in a MAPT haplotype H1/H2-dependent manner . Indeed, rs8070723 is in tight linkage disequilibrium with rs1052553 (r2 = 0.95, D′ = 0.97), the major allele of which marks the MAPT-H1 haplotype and associates with higher brain MAPT levels .
Many top cerebellar cisSNPs also associate with non–CNS diseases/traits (Supplementary Table 9 in Dataset S1). IRF5 cisSNP rs4728142 is associated with both cerebellar IRF5 levels and risk of systemic lupus erythematosus (SLE) . Previously, IRF variants were shown to influence IRF splicing and expression as well as SLE risk , . Interestingly, rs4728142 is also associated with ulcerative colitis (UC)  where both IRF5 and TNPO3 are reported as candidate genes. Given its influence on IRF5, but not TNPO3 expression levels, rs4728142 most likely marks IRF5, but not TNPO3 as the candidate UC risk gene.
Our approach to identify human disease-associated SNPs amongst the 2,596 top cerebellar eGWAS cisSNPs may be overly conservative, given our selection criteria to only include transcripts that are detectable in >75% of the subjects and only those cisSNPs that are significant in both independent cohorts (ADs and non–ADs). Furthermore, given that our eGWAS genotyping platform consisted of ~300 K SNPs, it is plausible that transcript associations with SNPs from the “Catalog of Published GWAS”  may be missed if those SNPs did not exist in our platform. To address these issues, we repeated the cerebellar and temporal cortex eGWAS, without restrictions for transcript detection rates and using genotypes imputed to HapMap2 (>2 million SNPs). Comparison of the eGWAS associations with p<1.0E-4 to the “Catalog of Published GWAS” identified 392 unique cerebellar cisSNPs that also associate with 189 human diseases/traits; and 339 such temporal cortex cisSNPs associating with 167 diseases/traits (Text S1, Supplementary Tables 10 and 11 in Dataset S1). Amongst the associations identified by this less stringent approach were those for brain levels of CLU , , CR1  and GAB2  which were identified as risk loci in GWAS of Alzheimer's disease.
We also performed comparisons of the eGWAS results from the ADs and non–ADs separately to determine whether there were any results unique to these diagnostic groups (Text S1, Supplementary Tables 12, 13, 14, 15 in Dataset S1). Although 13–25% of the disease/trait associations were with cisSNPs that were unique to ADs or non–ADs, all but a few of these could also be identified in the combined analysis of all subjects. There were only 2–7 human diseases/traits with cisSNP associations that were detectable just in ADs or non–ADs, but not the combined group.
Of these unique cisSNP, those that associate with cerebellar levels of C9orf72 in non–ADs are interesting, as these variants were previously identified in GWAS of amyotrophic lateral sclerosis (ALS), where C9orf72 was one of the candidate genes at the disease locus , . This gene was recently identified as the most common cause of familial ALS, with a repeat expansion leading to loss of an alternatively spliced transcript , . These results further support the utility of the combined eGWAS and disease GWAS approaches in the potential identification of disease genes with modified transcript levels as the plausible disease mechanism.
Identification of brain cisSNPs among PSP GWAS loci
In a recent PSP GWAS , four loci near MAPT, STX6, EIF2AK3, and MOBP conferred significant risk, in addition to three suggestive loci at 1q41 intergenic locus, BMS1 and SLCO1A2. We assessed these seven strongest PSP risk loci in our eGWAS in the ADs, non–ADs and combined datasets, as well as the PSP subset of non–ADs (Table 5, Supplementary Table 16 in Dataset S1). We found novel, significant rs11568563 minor allele associations with reduced brain SLCO1A2 levels (pCer = 2.33×10−8; pTCx = 4.36×10−2–9.14×10−18), which confers increased PSP risk . SLCO1A2 encodes solute carrier organic anion transporter family member 1a2 and is a drug transporter into the CNS . Fine-mapping of the SLCO1A2 region revealed rs11568563 to be the strongest cisSNP influencing brain levels of this gene (Figure S5). This SNP was also identified as the top PSP-associating variant at this locus . All other cisSNPs that associate with brain SLCO1A2 levels have weaker effects that appear to be due to their LD with s11568563, which is a missense coding mutation within SLCO1A2. Whether rs11568563 is merely tagging the functional variant(s) regulating levels of SLCO1A2 or coding changes also influence expressed transcript levels require further investigations. Additionally, MAPT/rs242557 minor allele increased PSP risk  and brain MAPT levels (pCer = 9.78×10−3–8.8×10−13, pTCx = 1.1×10−8). MAPT/rs8070723 minor allele associated with lower brain MAPT levels in our eGWAS, decreased PSP risk , similar to a PD GWAS . We also found nominally significant increases in brain MOBP levels (pCer = 2.13×10−2–1.71×10−7; pTCx = 1.55×10−2–1.57×10−6) with rs1768208, which increases PSP risk .
The recent PSP GWAS by Hoglinger et al.  included eQTL analysis for the significant loci using brain expression levels from 387 subjects without clinical neurologic diseases. In addition to associations between MAPT locus cisSNPs with brain MAPT and LRRC37A4 levels, they also detected signals for the nearby ARL17A and PLEKHM1 genes, neither of which were detectable in our eGWAS. They also identified cisSNP associations with brain MOBP levels but even stronger influence on the nearby SLC25A38 levels. We did not identify significant cisSNP/SLC25A38 brain expression associations. Although some of the significant probes for MOBP and MAPT harbor variants within their probe sequence, which may potentially confound associations with expression levels, these genes had other significant probes without any sequence variants (Text S1).
Most non–AD subjects in our study had pathologic diagnosis of PSP (nCer = 98, nTCx = 107, Supplementary Table 2 in Dataset S1). We assessed the 2,980 top cerebellar cisSNP/transcript associations in this subset, and found that most results were consistent with the ADs (Supplementary Tables 17 and 18 in Dataset S1).
Enrichment of brain cisSNPs in the AD GWAS from ADGC
To investigate whether any of the significant brain cisSNPs may influence risk of AD, we compared our eGWAS results to the AD risk associations from the large AD GWAS conducted by ADGC . We obtained results of meta-analyses for the ADGC Stage 1+2 cohort (11,840 LOAD vs. 10,931 controls)  and investigated those SNPs with suggestive AD risk association in this dataset (pmeta<10−3). To ensure uniform comparison between our eGWAS and the ADGC GWAS, we assessed results from >2 million SNPs for each study using SNPs genome-wide imputed to HapMap phase 2 (release 22). There were 77,126 cerebellar (63,652 unique SNPs, 2,338 unique genes) and 68,172 temporal cortex (57,922 unique SNPs and 2,201 unique genes) cisSNP/transcript associations significant at q<0.05 representing a clear excess (Figure S6). There were 380 cisSNPs that were significant for the cerebellar transcript associations and also had suggestive AD risk associations (2.9-fold enrichment), 432 such temporal cortex cisSNPs (3.3-fold enrichment) and 356 cisSNPs significant in both the cerebellum and temporal cortex (2.7-fold enrichment, p<10−6 for all three analyses) (Figure 1, Supplementary Tables 19 and 20 in Dataset S1).
MAPT and LRRC37A4 cisSNPs, implicated in PSP  and PD  GWAS and which significantly influenced brain levels of these genes also had suggestive AD risk associations (pmeta = 8.82×10−4–1.53×10−5). CisSNP alleles associating with lower brain MAPT levels were associated with lower AD risk, similar to PD  and PSP  GWAS, which may suggest a common mechanism for these neurodegenerative diseases. ABCA7, identified recently as a novel LOAD risk locus , , had significant cerebellar cisSNPs. Further investigations of the other genes with evidence of brain transcript and AD risk association is warranted to understand their role in AD (Text S1).
To ensure that we did not miss any associations due to the stringent eGWAS criteria that we applied, we repeated the analyses using no restrictions for transcript detection rates and eGWAS p value threshold of p<1.0E-4. We also investigated cisSNPs identified in AD and non–AD brains, both separately, and jointly, given that some cisSNP associations may be unique to one group. We compared these eGWAS results to the ADGC GWAS as described above (Supplementary Tables 21, 22, 23, 24, 25, 26 in Dataset S1). Using cerebellar and temporal cortex eGWAS from all subjects, 561 and 488 unique transcripts with cisSNPs that yield suggestive AD risk associations were identified, respectively. There were 259–312 such transcripts identified in each AD or non–AD eGWAS, with >50% overlap between the two diagnostic groups' results, although many of these results could be identified in the eGWAS of combined samples. About 7–10% of the transcripts could only be identified in just ADs or non–ADs, but not the combined eGWAS. Amongst such unique transcripts were CLU and BIN1, which reside at the LOAD GWAS loci , ,  and associate with cisSNPs in the cerebellum of non–ADs. Detailed analyses of the CLU locus cisSNP/transcript associations are in-press .
In a large eQTL study on 773 brain samples from ~400 autopsied subjects, we demonstrate significant contribution of genetic factors to human brain gene expression, reliably detected across different brain regions and pathologies. There is significant enrichment of brain cisSNPs amongst disease-associated variants, advocating gene expression changes as a mechanism for the first time for certain genes implicated in human diseases, including PSP (SLCO1A2), PD (MMRN1), Paget's disease (OPTN) while replicating others (e.g. PD/MAPT, SLE/UC/IRF5). MAPT cisSNPs associating with PSP, PD and AD risk highlight potential common mechanisms for these neurodegenerative diseases.
The reported results have several important implications for the genetics of human brain gene expression: First, despite technical challenges of gene expression measurements in post-mortem brain tissue , ~70% of the transcriptome can be reliably detected in >75% of the subjects across two brain regions and different disease pathologies. Second, although there is significant contribution from technical covariates, genetic factors account for a substantial proportion of the variance in brain gene expression levels. We estimate that genetic factors explain an average 3% (range: 0–85%) of the variance in human cerebellar gene expression overall, and 18% (range: 8–85%) of the variance for the top cis-regulated transcripts. These estimates show remarkable similarity to those from other eQTL studies, such as a large, family-based lymphocyte eQTL, where cis eQTLs had an overall median effect size of 1.8% and significant eQTLs accounted for 24.6% of the variance in expression . Similarly, significant cisSNPs explained 2–90% of expression variance in a liver eGWAS .
Third, there is remarkable replication of significant cisSNP associations across different brain regions and underlying tissue pathologies. Indeed, the 2,980 top cerebellar cisSNP/transcript associations represent 58% and 68%of all significant associations in the ADs and non–ADs. Since >50% of the non–ADs were comprised of subjects with PSP, we also conducted a separate analysis of this pathologically distinct group of non–ADs and again determined that many of the top cisSNPs were also significant in the PSPs despite the small sample size (n = 98). Importantly, most of the cisSNPs had highly similar effect sizes in the ADs, non–ADs and PSP subset of non–ADs. Furthermore, 78% of the top cerebellar cis-associations were also significant in the temporal cortex. Cerebellum is a relatively unaffected region in AD, whereas temporal cortex is typically one of the first areas to harbor neuropathology . It is not inherently evident whether the unaffected or affected tissue regions would be most suitable for eQTL studies. Whereas unaffected regions would have the advantage of minimizing confounding on expression measurements from pathology (such as inflammation and cell death), affected regions may be more relevant for disease-associated eQTL mapping. The substantial overlap in significant cisSNP associations between different brain regions and disease types in our study implies that sample size may be the most critical element of successful eQTL mapping. In other words, analysis of expression data collected in different tissue regions and diseases, provided there is careful statistical control, could greatly enhance power to detect eQTLs. Nevertheless, there may be important eQTLs that are specific to brain region and disease.
It is not obvious whether the cisSNP that display similar effects in different brain regions and different disease types would have relevance to human disease. The top 2,596 cerebellar cisSNPs that are significant in both ADs and non–ADs, and many of which are also significant in the temporal cortex are also enriched for variants implicated in human disease, including CNS disease, such as PD and PSP. Thus, the fourth implication of our study is that it may be possible to map disease-associated variants using eQTL studies conducted in unaffected tissue or unaffected subjects. In addition to providing a general characterization of the genetics of brain gene expression, this study successfully replicated many previously published cisSNP associations, such as rs8070723/ MAPT level, rs11012/ LRRC37A4 level associations, both of which were implicated in PD. We found novel brain expression level associations for transcripts implicated in disease, including rs11568563 association with SLCO1A2, recently identified in a PSP GWAS. The disease-associating cisSNP associations identified in this study were not restricted to CNS diseases, but also included non–CNS diseases, such as SLE, where we replicated the previously published rs4728142/ IRF5 level associations.
These findings imply that many disease-associated cisSNPs can influence gene expression idependently of tissue/region/pathology, and be mapped reliably in tissue which is unaffected, not disease-related or from unaffected subjects. Indeed, our findings are consistent with a study of lymphoblastoid cell lines from subjects affected and unaffected with asthma, where Dixon et al.  found no differences between asthmatics and non-asthmatics. Furthermore, they detected significant transcript level associations with SNPs that also associate with asthma. Emilsson et al.  performed eQTL mapping in both blood and adipose tissue and determined that >50% of significant adipose tissue cisSNPs were also significant in blood. This is similar to the overlap we detected for cerebellum and temporal cortex, though two brain regions are more likely to have similar eQTL profiles than two different tissues.
Although many cisSNP effects can be detected in many different tissue types and disease conditions as shown here and by others , , there conceivably exist expression variants which exert their effects in a tissue or disease-specific manner. For example in the eQTL comparing blood and adipose tissue, Emilsson et al.  also found that more transcripts from adipose tissue had significant correlations with obesity-related traits. In reality, both scenarios may be at play, such that some expression variants have more ubiquitous effects, whereas others may need tissue/cell/region/disease specific factors to exert their influence on gene expression. Indeed, many of the CNS disease related cisSNP associations in our brain eGWAS could not be identified in our comparison to a liver eGWAS  or an existing database for a LCL eGWAS , suggesting that disease-relevant tissue may be necessary to detect effects of certain cisSNPs, and highlighting the value of this brain eGWAS for CNS traits/conditions.
Despite the enrichment of our samples with tissue from AD subjects and our use of both cerebellar and temporal cortex tissue, we did not identify strong transcript associations for some of the top genes recently implicated in AD risk in large LOAD GWAS studies , , , , . This could be because the AD risk variants in these genes exert their effects via mechanisms other than influencing transcript levels, namely changes in protein conformation. If so, even the negative results from an eGWAS could be informative in guiding the future deep-sequencing efforts which should focus on coding rather than non-coding, functional regions. Alternative explanations include technical shortcomings, such as inability to measure all transcript species, measurements of global rather than cell-specific gene expression, not including all tested disease-associated variants in our genotyping platform. We also need to consider that the top genes nearest the strongest variants from the LOAD GWAS may not be actual disease genes. These loci require further investigations to account for this possibility. Additionally, our criteria for selection of the top cisSNPs, requiring significance in both ADs and non–ADs, might be too stringent, thereby leading to some false negative results. Finally, it may be possible to identify additional disease-related expression variants by focusing on those that have differential influence in disease vs. non-disease tissue, although this was not a focus of analysis in this study. Given that our non–AD tissue also consisted of subjects with other neurodegenerative diseases, there may be more similarities with the AD tissue, making it more difficult to detect variants with differential disease-related expression-associations in our current study. Nevertheless, we did find associations with cisSNPs for ABCA7, a novel AD risk locus gene ,  and MAPT , , ,  implicated in AD.
It is important to emphasize that although the identification of transcript level associations provides another layer of confidence for disease-associating variants and genes, it is entirely possible that a variant in an LD region encompassing multiple genes, could be marking a functional disease variant in one gene and an expression variant in another gene. Thus, although highly useful in conjunction with disease association studies, eGWAS should be seen as a guide rather than ultimate evidence in disease-mapping efforts. Similarly, absence of eGWAS associations for a disease-associated variant should not be seen as contradictory evidence, but rather raise the possibility of alternative functional mechanisms for that variant.
Despite the wealth of information our study provides, we acknowledge several shortcomings. First, our non–ADs were not normal controls but often had other brain pathologies. It will be necessary to seek replication of these findings or novel cisSNP/transcript associations in normal brain tissue, as well. Second, we only focused on single SNP associations. The preliminary observations from our eGWAS findings suggest that multiple independent variants may affect brain expression levels of some genes, whereas others might be under the influence of a single strong variant. Finally, like any association study, it is not clear whether the cisSNPs identified in our eGWAS are themselves the functional SNPs or simply in LD with un-genotyped regulatory variants. Future studies focusing on analysis of haplotypes, SNPxSNP interactions, novel variant discovery and functional in-vitro studies testing effects of multiple variants are required to dissect the genetic variation underlying brain gene expression levels.
In summary, this cerebellar eGWAS study and the temporal cortex validations provide insight about the genetics of brain gene expression, a framework to guide future studies with respect to tissue/region/disease choice in eQTL studies, examples about the utility of this approach in gene mapping, replication of some known transcript associations and evidence for novel transcript associations in human disease. Combined eGWAS-disease GWAS approach may provide complementary information in mapping human disease and enable identification of functional variants that may not be possible by either approach alone.
The complete set of results from the brain eGWAS can be accessed at the National Institute on Aging Genetics of Alzheimer's Disease Data Storage (NIAGADS) website at http://alois.med.upenn.edu/niagads/. Questions about the dataset can be addressed to the corresponding author of this manuscript (email@example.com).
All subjects were participants in the published Mayo LOAD GWAS  as part of the autopsy-based series (AUT_60–80). All subjects had neuropathologic evaluation by DWD. All ADs had definite diagnosis according to the NINCDS-ADRDA criteria  and had Braak scores of ≥4.0. All non–ADs had Braak scores of ≤2.5, and many had brain pathology unrelated to AD (Supplementary Tables 1 and 2 in Dataset S1). Three-hundred forty subjects had measurements in both cerebellum and temporal cortex. This study was approved by the appropriate institutional review board.
Expression genome-wide association study (eGWAS)
RNA extraction and gene expression measurements.
Total RNA was extracted from frozen brain samples using the Ambion RNAqueous kit according to the manufacturer's instructions. The quantity and quality of the RNA samples were determined by the Agilent 2100 Bioanalyzer using the Agilent RNA 6000 Nano Chip. Transcript levels were measured using the Whole Genome DASL assay (Illumina, San Diego, CA). Probe annotations were done based on NCBI Ref Seq, Build 36.2. The RNA samples were randomized across the chips and plates using a stratified approach to ensure balance with respect to diagnosis, age, gender, RINs and APOE genotype. Replicate samples were utilized for QC and also for intra-class coefficient (ICC) estimations. Raw probe level mRNA expression data were exported from GenomeStudio software (Illumina Inc.) for preprocessing with background correction, variance stabilizing transformation, quantile normalization and probe filtering using the lumi package of BioConductor ,  (Text S1). A probe with detectable signal in >75% of the samples was regarded as informative and used in subsequent analyses, although we also did supplementary analyses without imposing any restrictions based on probe detection levels. The number of informative probes differed slightly between the AD, non–AD and combined groups (Figure S7).
Genotypes were generated using Illumina's HumanHap300-Duo Genotyping BeadChips and analyzed with an Illumina BeadLab Station (Illumina, San Diego, CA) at the Mayo Clinic Genotyping Shared Resource according to the manufacturer's protocols. The LOAD GWAS QC methods were previously published  (Text S1).
Statistical methods for eGWAS.
Linear regression analysis to test for cisSNP/transcript associations were done in PLINK . Preprocessed probe transcript levels were utilized as endophenotypes. Each probe was assessed separately, even though one gene may have multiple probes. CisSNPs localized to ±100 kb flanking region of the gene targeted by the probe of interest, mapped according to NCBI Build 36, were assessed for transcript level associations, using an additive model, with the minor allele dosage (0, 1, 2) as the independent variable, and APOE ε4 dosage (0, 1, 2), age at death, gender, PCR plate, RIN, (RIN-RINmean)2 as covariates. The cerebellum and temporal cortex expression levels were analyzed separately. The ADs and non–ADs were analyzed both separately and jointly. The joint analyses included diagnosis as an additional covariate (AD = 1, non–AD = 0). We also ran analyses including the top 10 eigenvectors from EIGENSTRAT, and compared eGWAS results to those excluding the eigenvectors (Text S1, Figure S8, Supplementary Table 27 in Dataset S1) .
Q values used for multiple testing corrections are based on false-discovery rates  and were corrected for genomic inflation of significance (Text S1). In addition, permutations (pperm-WY) and Bonferroni adjustment were used for comparison of correction strategies. Permutation p values were obtained by shuffling the endophenotype, while maintaining the covariate structure, 10,000 times and applying the Westfall and Young  resampling-style stepdown approach to account for correlations between probes.
Variance of gene expression
To assess the genetic contribution to the variance in human cerebellar gene expression, we first determined between-subject variance, as a percentage of the total variance in probe expression, using ICC  for 15 samples measured in replicate on 5–6 different plates and 2–3 different days.
Using multivariable linear regression models, we then calculated the proportion of variance in cerebellar gene expression levels that were explained by technical effects (PCR plate, RIN, (RIN-RINmean)2), biological covariates (APOE ε4 dosage, age at death, gender) and the “best” cisSNP for each probe. These analyses were carried out on the combined dataset consisting of cerebellar expression measurements from 374 subjects and 15,283 probes with at least one cisSNP (Text S1).
Replication of top cerebellar eGWAS hits in the temporal cortex
We identified 2,980 cisSNP/transcript associations (2,596 unique SNPs, 746 unique probes and 686 unique genes) that achieved genome-wide significance within both the ADs and non–ADs analyses with q values<0.05. All 2,980 cisSNP/transcript associations achieved genome-wide significance with q<0.05 and pBonf<0.05 in the combined ADs+non–ADs analysis. We sought validation of these hits in the temporal cortex of 399 subjects who had WG-DASL whole transcriptome measurements and whole-genome genotypes. RNA extractions, QC, WG-DASL measurements, transcript level detections and association analyses were performed for these temporal cortex samples, in the same manner as that for the cerebellar samples. After appropriate QC, 2,685 of the 2,980 top cerebellar cisSNP/transcript associations remained detectable among the temporal cortex results (2,387 unique SNPs, 677 unique probes and 625 unique genes).
Comparison of cerebellar eGWAS results with other published complex disease and trait GWAS
To determine whether the cerebellar eGWAS captured variants implicated in complex diseases/traits, we compared the top 2,980 cerebellar eGWAS cisSNPs with the top disease/trait associated SNPs in the “Catalog of Published GWAS” , curated by the National Human Genome Research Institute (www.genome.gov/gwastudies). This catalog compiles weekly search results from all published GWAS of ≥100,000 SNPs where associations of p≤1.0E-05 are reported. The catalog accessed on 04/23/2011 had 5,272 entries. We restricted our search to those entries where the “SNPs” column had only one SNP with an rs number. Thus, haplotypes and variants without rs numbers were excluded. There were 5,101 entries after this exclusion, comprised of 4,248 unique SNPs and 433 unique diseases. One SNP may associate with >1 disease/trait and each disease/trait may have ≥1 associating SNP. This list was linked to the 2,980 top cerebellar cisSNPs by common rs numbers.
To assess whether the number of observed cisSNPs that have both significant cerebellar eGWAS and disease/trait associations represent a significant enrichment, we performed simulations while adjusting for cisSNP minor allele frequencies, as previously reported . We performed 1 million simulations and adjusted for the minor allele frequencies of all the tested cisSNPs in 10 bins from 0–0.05 to 0.45–0.50. Using the total number of cisSNPs that are both transcript and disease/trait associating for each simulation, we obtained an empirical p value and an estimate of fold-enrichment.
Alzheimer's Disease Genetics Consortium (ADGC) meta-analyses
To determine whether any of the cisSNPs significant at q<0.05 influenced risk of AD, we obtained meta-analyses results from the ADGC . The cohorts that are assessed by ADGC, as well as the methodological details of the meta-analyses are described in detail in a recent publication . Briefly, the meta-analyses of the ADGC dataset results reported here (Supplementary Tables 17 and 18 in Dataset S1) are generated from the combined analyses of stage 1 and stage 2 cohorts (Text S1), with detailed descriptions provided elsewhere . Stage 1 cohorts are comprised of 8,309 LOAD cases and 7,366 cognitively normal elder controls. Stage 2 has 3,531 LOAD vs. 3,565 control subjects. Each cohort was tested for AD risk association using a logistic regression approach, assuming an additive model and adjusting for age, sex, APOE ε4 dosage and principal components from EIGENSTRAT . The meta-analyses results were generated using the inverse variance method implemented in the software package METAL .
This file includes Supplementary Tables 1–27. The individual supplementary table legends are included in the first tab of this file.
Q-Q-Plots: Q-Q plots of observed (y-axis) versus expected (X-axis) −log(p) values of association for all cisSNP/transcript associations in the combined cerebellar 374 samples obtained before (a,b) and after (c,d) inflation-adjustments. Q-Q plots for all data points (a, c), as well as those that are in the lower, left hand corner (b,d) are shown. The data in b and d account reflect the association results, where there should be no deviations from the expected (i.e. null hypothesis of no association).
Venn diagram of significant cerebellar cisSNP/transcript associations: Q values<0.05 in the ADs, non–ADs and combined (All) analyses. Notably, 2,980 cis-SNP/transcript associations are significant both in the ADs and non–ADs.
Box Plots of some top cisSNP/transcript associations in the non–AD (a), AD (b) and combined groups (c): The SNP genotypes are shown on the X-axis with the genotype counts in parentheses. Variance stabilizing transformed (VST) expression levels are on the Y-axis. The bottom and top of a box represent the lower and upper quartiles, respectively. The band near the middle of the box is the median. The ends of the whiskers depict the most extreme observations still within 1.5 inter quartile range of the corresponding quartile. Any data not included between the whiskers are plotted as dots.
Histogram of intra-class coefficients (ICC) for the cerebellar probe expressions. Using 15 replicate samples, ICC, which is the between-subject variance, as a percentage of the total variance in probe expression, was estimated for 17,121 probes.
Data plots of SNPs tested for association with expression levels of SLCO1A2 in the Temporal Cortex and Cerebellum. Forty-six SNPs were tested for association of SLCO1A2 levels in the Cerebellum (Blue lines) and Temporal Cortex (Pink lines). P-values were transformed using −log10 and are plotted against the position of each SNP along the chromosome (Kbp). Genes found within the locus boundaries are shown from the UCSC genome browser (http://genome.ucsc.edu/). The LD across the locus is represented by a plot generated with Haploview, using data from the Mayo GWAS. The top eSNP in this study, rs11568563, is highlighted on the p-value plot by red squares and a red box around the SNP in the list of rs numbers. This is also the top PSP-associating SNP at this locus in Hoglinger et al. (Nat Genet, 2011) .
Q-Q-Plots for cerebellar and temporal cortex cisSNP/transcript associations with the HapMap phase 2 imputed genotypes: Q-Q plots of observed (y-axis) versus expected (X-axis) −log(p) values of association for all cisSNP/transcript associations in the combined dataset obtained before (a) and after (b) genomic inflation-adjustments, as discussed in the text. Also shown are the Q-Q plots for the temporal cortex associations in the combined dataset obtained obtained before (c) and after (d) inflation-adjustments.
Venn diagram of detectable cerebellar probes. Venn diagram of cerebellar probes detectable in ≥75% of subjects in the AD (AD), non–AD (CON) and combined (All) analyses. Notably, 13,349 probes were detectable in all 374 subjects.
Scatterplots of −log10 p values for eGWAS associations with and without inclusion of eigenvectors. Transformed P-values of a) Cerebellar and b) Temporal Cortex eGWAS cisSNP/transcript associations from models including (y-axis) and excluding (x-axis) the top 10 eigenvectors are plotted. A linear line demonstrating the null hypothesis of no deviation of the results between the two datasets is also shown. The results are displayed for those SNPs with a Hardy-Weinberg P-value>1.0E-07 and a probe detection threshold >75%.
Supplementary Results, Methods and References.
Liana G. Apostolova, MD,1 Steven E. Arnold, MD,2 Clinton T. Baldwin, PhD,3 Robert Barber, PhD,4 Michael M. Barmada, PhD,5 Thomas Beach, MD, PhD,6 Gary W. Beecham, PhD,7,8 Duane Beekly, BS,9 David A. Bennett, MD,10,11 Eileen H. Bigio, MD,12 Thomas D. Bird, MD,13 Deborah Blacker, MD,14,15 Bradley F. Boeve, MD,16 James D. Bowen, MD,17 Adam Boxer, MD, PhD,18 James R. Burke, MD, PhD,19 Jacqueline Buros, BS,3 Joseph D. Buxbaum, PhD,20,21,22 Nigel J. Cairns, PhD, FRCPath,23 Laura B. Cantwell, MPH,24 Chuanhai Cao, PhD,25 Chris S. Carlson, PhD,26 Regina M. Carney, MD,27 Steven L. Carroll, MD, PhD,28 Helena C. Chui, MD,29 David G. Clark, MD,30 Jason Corneveaux, BS,31 Carl W. Cotman, PhD,32 Paul K. Crane, MD, MPH,33 Carlos Cruchaga, PhD,34 Jeffrey L. Cummings, MD,1 Philip L. De Jager, MD, PhD,35,36 Charles DeCarli, MD,37 Steven T. DeKosky, MD,38 F. Yesim Demirci, MD,5 Ramon Diaz-Arrastia, MD, PhD,39 Malcolm Dick, PhD,32 Beth A. Dombroski, PhD,24 Ranjan Duara, MD,40 William G. Ellis, MD,41 Denis Evans, MD,42 Kelley M. Faber, MS,114 Kenneth B. Fallon, MD,28 Martin R. Farlow, MD,43 Steven Ferris, PhD,44 Tatiana M. Foroud, PhD,114 Matthew P. Frosch, MD, PhD,45 Douglas R. Galasko, MD,46 Paul J. Gallins, MS,7 Mary Ganguli, MD,47 Marla Gearing, PhD,48,49 Daniel H. Geschwind, MD, PhD,50 Bernardino Ghetti, MD,51 John R. Gilbert, PhD,7,8 Sid Gilman, MD, FRCP,52 Bruno Giordani, PhD,53 Jonathan D. Glass, MD,54 Alison M. Goate, D.Phil,34 Robert C. Green, MD,3,55,56 John H. Growdon, MD,57 Hakon Hakonarson, MD, PhD,58 Ronald L. Hamilton, MD,59 John Hardy, PhD,60 Lindy E. Harrell, MD, PhD,30 Elizabeth Head, PhD,61 Lawrence S. Honig, MD, PhD,62 Matthew J. Huentelman, PhD,31 Christine M. Hulette , MD,63 Bradley T. Hyman, MD, PhD,57 Gail P. Jarvik, MD, PhD,64,65 Gregory A. Jicha, MD, PhD,66 Lee-Way Jin, MD, PhD,41 Nancy Johnson, PhD,67 Gyungah Jun, PhD,68,3,69 M. Ilyas Kamboh, PhD,5,70 Jason Karlawish, MD,71 Anna Karydas, BA,18 John S.K. Kauwe, PhD,72 Jeffrey A. Kaye, MD,73,74 Ronald Kim, MD,75 Edward H. Koo, MD,46 Neil W. Kowall, MD,55,76 Patricia Kramer, PhD,77,73 Walter A. Kukull, PhD,78 James J. Lah, MD, PhD,54 Eric B. Larson, MD, MPH,79 Allan I. Levey, MD, PhD,54 Andrew P. Lieberman, MD, PhD,80 Oscar L. Lopez, MD,70 Kathryn L. Lunetta, PhD,68 Wendy J. Mack, PhD,81 Daniel C. Marson, JD, PhD,30 Eden R. Martin, PhD,7,8 Frank Martiniuk, PhD,82 Deborah C. Mash, PhD,83 Eliezer Masliah, MD,46,84 Wayne C. McCormick, MD, MPH,33 Susan M .McCurry, PhD,85 Andrew N. McDavid, BA26 Ann C. McKee, MD,55,76 Marsel Mesulam, MD,86 ,87 Bruce L. Miller, MD,18 Carol A. Miller, MD,88 Joshua W. Miller, PhD,41 Thomas J. Montine, MD, PhD,89 John C. Morris, MD,23,90 Amanda J. Myers, PhD,91 Adam C. Naj, PhD,7 Petra Nowotny, PhD,34 Joseph E. Parisi, MD,92,93 Daniel P. Perl, MD,94 Elaine Peskind, MD,95 Wayne W. Poon, PhD,32 Huntington Potter, PhD,25 Joseph F. Quinn , MD,73 Ashok Raj, MD,25 Ruchita A. Rajbhandary, MPH,7 Murray Raskind, MD,95 Eric M. Reiman, MD,31,96,97,98 Barry Reisberg, MD,44,99 Christiane Reitz, MD, PhD,100,101,102 John M. Ringman, MD,1 Erik D. Roberson, MD, PhD,30 Ekaterina Rogaeva, PhD,103 Roger N. Rosenberg, MD,39 Mary Sano, PhD,21 Andrew J. Saykin, PsyD,33,104 Julie A. Schneider, MD,105,10 Lon S. Schneider, MD,29,106 William Seeley, MD,18 Michael L. Shelanski, MD, PhD,107 Michael A. Slifer, MD, PhD,7,8 Charles D. Smith, MD,66 Joshua A. Sonnen, MD,89 Salvatore Spina, MD,51 Peter St George-Hyslop, MD, FRCP,103,108 Robert A. Stern, PhD,55 Rudolph E. Tanzi, PhD,57 John Q. Trojanowski, MD, PhD,24 Juan C. Troncoso, MD,109 Debby W. Tsuang, MD,95 Vivianna M. Van Deerlin, MD, PhD,24 Badri Narayan Vardarajan , MS,3 Harry V. Vinters, MD,1,110 Jean Paul Vonsattel, MD,111 Li-San Wang, PhD,24 Sandra Weintraub, PhD,86,87 Kathleen A. Welsh-Bohmer, PhD,19,112 Jennifer Williamson, MS,62 Randall L. Woltjer, MD, PhD113
ADGC Author Affiliations
1Department of Neurology, University of California Los Angeles, Los Angeles, California; 2Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania; 3Department of Medicine (Genetics Program), Boston University, Boston, Massachusetts; 4Department of Pharmacology and Neuroscience, University of Texas Southwestern, Fort Worth, Texas; 5Department of Human Genetics, University of Pittsburgh, Pittsburgh, Pennsylvania; 6Civin Laboratory for Neuropathology, Banner Sun Health Research Institute, Phoenix, Arizona; 7The John P. Hussman Institute for Human Genomics, University of Miami, Miami, Florida; 8Dr. John T. Macdonald Foundation Department of Human Genetics, University of Miami, Miami, Florida; 9National Alzheimer's Coordinating Center, University of Washington, Seattle, Washington; 10Department of Neurological Sciences, Rush University Medical Center, Chicago, Illinois; 11Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, Illinois; 12Department of Pathology, Northwestern University, Chicago, Illinois; 13Department of Neurology, University of Washington, Seattle, Washington; 14Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts; 15Department of Psychiatry, Massachusetts General Hospital/Harvard Medical School, Boston, Massachusetts; 16Department of Neurology, Mayo Clinic, Rochester, Minnesota; 17Swedish Medical Center, Seattle, Washington; 18Department of Neurology, University of California San Francisco, San Fransisco, California; 19Department of Medicine, Duke University, Durham, North Carolina; 20Department of Neuroscience, Mount Sinai School of Medicine, New York, New York; 21Department of Psychiatry, Mount Sinai School of Medicine, New York, New York; 22Departments of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, New York; 23Department of Pathology and Immunology, Washington University , St. Louis, Missouri; 24Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania; 25Byrd Alzheimer Institute, University of Southern Florida Health, Tampa, Florida; 26Fred Hutchinson Cancer Research Center, Seattle, Washington; 27Department of Psychiatry, Vanderbilt University, Nashville, Tennessee; 28Department of Pathology, University of Alabama at Birmingham, Birmingham, Alabama; 29Department of Neurology, University of Southern California, Los Angeles, California; 30Department of Neurology, University of Alabama at Birmingham, Birmingham, Alabama; 31Neurogenomics Division, Translational Genomics Research Institute, Phoenix, Arizona; 32Institute for Memory Impairments and Neurological Disorders, University of California Irvine, Irvine, California; 33Department of Medicine, University of Washington, Seattle, Washington; 34Department of Psychiatry and Hope Center Program on Protein Aggregation and Neurodegeneration, Washington University School of Medicine, St. Louis, Missouri; 35Program in Translational NeuroPsychiatric Genomics, Department of Neurology, Brigham and Women's Hospital, Boston, Massachusetts; 36Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts; 37Department of Neurology, University of California Davis , Sacramento, California; 38University of Virginia School of Medicine, Charlottesville, Virginia; 39Department of Neurology, University of Texas Southwestern, Dallas, Texas; 40Wien Center for Alzheimer's Disease and Memory Disorders, Mount Sinai Medical Center, Miami Beach, Florida; 41Department of Pathology and Laboratory Medicine, University of California Davis , Sacramento, California; 42Rush Institute for Healthy Aging, Department of Internal Medicine, Rush University Medical Center, Chicago, Illinois; 43Department of Neurology, Indiana University, Indianapolis, Indiana; 44Department of Psychiatry, New York University, New York, New York; 45C.S. Kubik Laboratory for Neuropathology, Massachusetts General Hospital, Charlestown, Massachusetts; 46Department of Neurosciences, University of California San Diego, La Jolla, California; 47Department of Psychiatry, University of Pittsburgh, Pittsburgh, Pennsylvania; 48Department of Pathology and Laboratory Medicine, Emory University, Atlanta, Georgia; 49Emory Alzheimer's Disease Center, Emory University, Atlanta, Georgia; 50Neurogenetics Program, University of California Los Angeles, Los Angeles, California; 51Department of Pathology and Laboratory Medicine, Indiana University, Indianapolis, Indiana; 52Department of Neurology, University of Michigan , Ann Arbor, Michigan; 53Department of Psychiatry, University of Michigan , Ann Arbor, Michigan; 54Department of Neurology, Emory University, Atlanta, Georgia; 55Department of Neurology, Boston University, Boston, Massachusetts; 56Department of Epidemiology, Boston University, Boston, Massachusetts; 57Department of Neurology, Massachusetts General Hospital/Harvard Medical School, Boston, Massachusetts; 58Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania; 59Department of Pathology (Neuropathology), University of Pittsburgh, Pittsburgh, Pennsylvania; 60Institute of Neurology, University College London, Queen Square, London; 61Department of Molecular and Biomedical Pharmacology, University of California Irvine, Irvine, California; 62Taub Institute on Alzheimer's Disease and the Aging Brain, Department of Neurology, Columbia University , New York, New York; 63Department of Pathology, Duke University, Durham, North Carolina; 64Department of Genome Sciences, University of Washington, Seattle, Washington; 65Department of Medicine (Medical Genetics), University of Washington, Seattle, Washington; 66Department of Neurology, University of Kentucky , Lexington, Kentucky; 67Department of Psychiatry and Behavioral Sciences, Northwestern University, Chicago, Illinois; 68Department of Biostatistics, Boston University, Boston, Massachusetts; 69Department of Ophthalmology, Boston University, Boston, Massachusetts; 70University of Pittsburgh Alheimer's Disease Research Center, Pittsburgh, Pennsylvania; 71Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania; 72Department of Biology, Brigham Young University, Provo, Utah; 73Department of Neurology, Oregon Health & Science University, Portland, Oregon; 74Department of Biomedical Engineering , Oregon Health & Science University, Portland, Oregon; 75Department of Pathology and Laboratory Medicine, University of California Irvine, Irvine, California; 76Department of Pathology, Boston University, Boston, Massachusetts; 77Department of Molecular & Medical Genetics, Oregon Health & Science University, Portland, Oregon; 78Department of Epidemiology, University of Washington, Seattle, Washington; 79Group Health Research Institute, Seattle, Washington; 80Department of Pathology, University of Michigan , Ann Arbor, Michigan; 81Department of Preventive Medicine, University of Southern California, Los Angeles, California; 82Department of Medicine - Pulmonary, New York University, New York, New York; 83Department of Neurology, University of Miami, Miami, Florida; 84Department of Pathology, University of California San Diego, La Jolla, California; 85School of Nursing Northwest Research Group on Aging, University of Washington, Seattle, Washington; 86Alzheimer's Disease Center, Northwestern University, Chicago, Illinois; 87Cognitive Neurology, Northwestern University, Chicago, Illinois; 88Department of Pathology, University of Southern California, Los Angeles, California; 89Department of Pathology, University of Washington, Seattle, Washington; 90Department of Neurology, Washington University , St. Louis, Missouri; 91Department of Psychiatry & Behavioral Sciences, University of Miami, Miami, Florida; 92Department of Anatomic Pathology, Mayo Clinic, Rochester, Minnesota; 93Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota; 94Department of Pathology, Mount Sinai School of Medicine, New York, New York; 95Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, Washington; 96Department of Psychiatry, University of Arizona, Phoenix, Arizona; 97Arizona Alzheimer's Consortium, Phoenix, Arizona; 98Banner Alzheimer's Institute, Phoenix, Arizona; 99Alzheimer's Disease Center, New York University, New York, New York; 100Taub Institute on Alzheimer's Disease and the Aging Brain, Columbia University , New York, New York; 101Gertrude H. Sergievsky Center, Columbia University , New York, New York; 102Department of Neurology, Columbia University , New York, New York; 103Tanz Centre for Research in Neurodegenerative Disease, University of Toronto, Toronto, Ontario; 104Department of Radiology and Imaging Sciences, Indiana University, Indianapolis, Indiana; 105Department of Pathology (Neuropathology), Rush University Medical Center, Chicago, Illinois; 106Department of Psychiatry, University of Southern California, Los Angeles, California; 107Department of Pathology, Columbia University , New York, New York; 108Cambridge Institute for Medical Research and Department of Clinical Neurosciences, University of Cambridge, Cambridge, Massachusetts; 109Department of Pathology, Johns Hopkins University, Baltimore, Maryland; 110Department of Pathology & Laboratory Medicine, University of California Los Angeles, Los Angeles, California; 111Taub Institute on Alzheimer's Disease and the Aging Brain, Department of Pathology, Columbia University , New York, New York; 112Department of Psychiatry & Behavioral Sciences, Duke University, Durham, North Carolina; 113Department of Pathology , Oregon Health & Science University, Portland, Oregon, 114Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, Indiana.
Conceived and designed the experiments: SG Younkin, N Ertekin-Taner. Performed the experiments: F Zou, MM Carrasquillo, CN Rowley, T Nguyen, L Ma, KG Malphrus, R Palusak, S Lincoln, G Bisceglio, N Kouri, CP Kolbert, J Jen. Analyzed the data: F Zou, HS Chai, CS Younkin, J Crook, VS Pankratz, M Allen, MM Carrasquillo, AA Nair, S Middha, S Maharjan, C Gorgescu, SG Younkin, N Ertekin-Taner. Contributed reagents/materials/analysis tools: Alzheimer's Disease Genetics Consortium, JL Haines, R Mayeux, MA Pericak-Vance, LA Farrer, GD Schellenberg, RC Petersen, NR Graff-Radford, DW Dickson, SG Younkin. Wrote the paper: F Zou, HS Chai, CS Younkin, J Crook, VS Pankratz, M Allen, SG Younkin, N Ertekin-Taner.
- 1. Ertekin-Taner N (2011) Gene expression endophenotypes: a novel approach for gene discovery in Alzheimer's disease. Mol Neurodegener 6: 31.
- 2. Murphy A, Chu JH, Xu M, Carey VJ, Lazarus R, et al. (2010) Mapping of numerous disease-associated expression polymorphisms in primary peripheral blood CD4+ lymphocytes. Hum Mol Genet 19: 4745–4757.
- 3. Nicolae DL, Gamazon E, Zhang W, Duan S, Dolan ME, et al. (2010) Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet 6: e1000888. doi:10.1371/journal.pgen.1000888.
- 4. Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, et al. (2003) Genetics of gene expression surveyed in maize, mouse and man. Nature 422: 297–302.
- 5. Cheung VG, Conlin LK, Weber TM, Arcaro M, Jen KY, et al. (2003) Natural variation in human gene expression assessed in lymphoblastoid cells. Nat Genet 33: 422–425.
- 6. Morley M, Molony CM, Weber TM, Devlin JL, Ewens KG, et al. (2004) Genetic analysis of genome-wide variation in human gene expression. Nature 430: 743–747.
- 7. Monks SA, Leonardson A, Zhu H, Cundiff P, Pietrusiak P, et al. (2004) Genetic inheritance of gene expression in human cell lines. Am J Hum Genet 75: 1094–1105.
- 8. Cheung VG, Spielman RS, Ewens KG, Weber TM, Morley M, et al. (2005) Mapping determinants of human gene expression by regional and genome-wide association. Nature 437: 1365–1369.
- 9. Stranger BE, Forrest MS, Clark AG, Minichiello MJ, Deutsch S, et al. (2005) Genome-wide associations of gene expression variation in humans. PLoS Genet 1: e78. doi:10.1371/journal.pgen.0010078.
- 10. Stranger BE, Nica AC, Forrest MS, Dimas A, Bird CP, et al. (2007) Population genomics of human gene expression. Nat Genet 39: 1217–1224.
- 11. Stranger BE, Forrest MS, Dunning M, Ingle CE, Beazley C, et al. (2007) Relative impact of nucleotide and copy number variation on gene expression phenotypes. Science 315: 848–853.
- 12. Dixon AL, Liang L, Moffatt MF, Chen W, Heath S, et al. (2007) A genome-wide association study of global gene expression. Nat Genet 39: 1202–1207.
- 13. Dausset J, Cann H, Cohen D, Lathrop M, Lalouel JM, et al. (1990) Centre d'etude du polymorphisme humain (CEPH): collaborative genetic mapping of the human genome. Genomics 6: 575–577.
- 14. (2005) A haplotype map of the human genome. Nature 437: 1299–1320.
- 15. Stranger BE, Forrest MS, Clark AG, Minichiello MJ, Deutsch S, et al. (2005) Genome-wide associations of gene expression variation in humans. PLoS Genet 1: e78. doi:10.1371/journal.pgen.0010078.
- 16. Goring HH, Curran JE, Johnson MP, Dyer TD, Charlesworth J, et al. (2007) Discovery of expression QTLs using large-scale transcriptional profiling in human lymphocytes. Nat Genet 39: 1208–1216.
- 17. Rotival M, Zeller T, Wild PS, Maouche S, Szymczak S, et al. (2011) Integrating genome-wide genetic variations and monocyte expression data reveals trans-regulated gene modules in humans. PLoS Genet 7: e1002367. doi:10.1371/journal.pgen.1002367.
- 18. Dimas AS, Deutsch S, Stranger BE, Montgomery SB, Borel C, et al. (2009) Common regulatory variation impacts gene expression in a cell type-dependent manner. Science 325: 1246–1250.
- 19. Ding J, Gudjonsson JE, Liang L, Stuart PE, Li Y, et al. (2010) Gene expression in skin and lymphoblastoid cells: Refined statistical method reveals extensive overlap in cis-eQTL signals. Am J Hum Genet 87: 779–789.
- 20. Emilsson V, Thorleifsson G, Zhang B, Leonardson AS, Zink F, et al. (2008) Genetics of gene expression and its effect on disease. Nature 452: 423–428.
- 21. Zhong H, Beaulaurier J, Lum PY, Molony C, Yang X, et al. (2010) Liver and adipose expression associated SNPs are enriched for association to type 2 diabetes. PLoS Genet 6: e1000932. doi:10.1371/journal.pgen.1000932.
- 22. Grundberg E, Adoue V, Kwan T, Ge B, Duan QL, et al. (2011) Global analysis of the impact of environmental perturbation on cis-regulation of gene expression. PLoS Genet 7: e1001279. doi:10.1371/journal.ppat.1001279.
- 23. Schadt EE, Molony C, Chudin E, Hao K, Yang X, et al. (2008) Mapping the genetic architecture of gene expression in human liver. PLoS Biol 6: e107. doi:10.1371/journal.pbio.0060107.
- 24. Myers AJ, Gibbs JR, Webster JA, Rohrer K, Zhao A, et al. (2007) A survey of genetic human cortical gene expression. Nat Genet 39: 1494–1499.
- 25. Webster JA, Gibbs JR, Clarke J, Ray M, Zhang W, et al. (2009) Genetic control of human brain transcript expression in Alzheimer disease. Am J Hum Genet 84: 445–458.
- 26. Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, et al. (2009) Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A 106: 9362–9367.
- 27. Hoglinger GU, Melhem NM, Dickson DW, Sleiman PM, Wang LS, et al. (2011) Identification of common variants influencing risk of the tauopathy progressive supranuclear palsy. Nat Genet 43: 699–705.
- 28. Naj AC, Jun G, Beecham GW, Wang LS, Vardarajan BN, et al. (2011) Common variants at MS4A4/MS4A6E, CD2AP, CD33 and EPHA1 are associated with late-onset Alzheimer's disease. Nat Genet 43: 436–441.
- 29. Carrasquillo MM, Zou F, Pankratz VS, Wilcox SL, Ma L, et al. (2009) Genetic variation in PCDH11X is associated with susceptibility to late-onset Alzheimer's disease. Nat Genet 41: 192–198.
- 30. Storey JD, Tibshirani R (2003) Statistical significance for genomewide studies. Proc Natl Acad Sci U S A 100: 9440–9445.
- 31. Barry WT, Kernagis DN, Dressman HK, Griffis RJ, Hunter JD, et al. (2010) Intratumor heterogeneity and precision of microarray-based predictors of breast cancer biology and clinical outcome. J Clin Oncol 28: 2198–2206.
- 32. Simon-Sanchez J, Schulte C, Bras JM, Sharma M, Gibbs JR, et al. (2009) Genome-wide association study reveals genetic risk underlying Parkinson's disease. Nat Genet 41: 1308–1312.
- 33. Fuchs J, Nilsson C, Kachergus J, Munz M, Larsson EM, et al. (2007) Phenotypic variation in a large Swedish pedigree due to SNCA duplication and triplication. Neurology 68: 916–922.
- 34. Edwards TL, Scott WK, Almonte C, Burt A, Powell EH, et al. (2010) Genome-wide association study confirms SNPs in SNCA and the MAPT region as common risk factors for Parkinson disease. Ann Hum Genet 74: 97–109.
- 35. Han JW, Zheng HF, Cui Y, Sun LD, Ye DQ, et al. (2009) Genome-wide association study in a Chinese Han population identifies nine new susceptibility loci for systemic lupus erythematosus. Nat Genet 41: 1234–1237.
- 36. Graham RR, Kozyrev SV, Baechler EC, Reddy MV, Plenge RM, et al. (2006) A common haplotype of interferon regulatory factor 5 (IRF5) regulates splicing and expression and is associated with increased risk of systemic lupus erythematosus. Nat Genet 38: 550–555.
- 37. Graham RR, Kyogoku C, Sigurdsson S, Vlasova IA, Davies LR, et al. (2007) Three functional variants of IFN regulatory factor 5 (IRF5) define risk and protective haplotypes for human lupus. Proc Natl Acad Sci U S A 104: 6758–6763.
- 38. Anderson CA, Boucher G, Lees CW, Franke A, D'Amato M, et al. (2011) Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47. Nat Genet 43: 246–252.
- 39. Harold D, Abraham R, Hollingworth P, Sims R, Gerrish A, et al. (2009) Genome-wide association study identifies variants at CLU and PICALM associated with Alzheimer's disease. Nat Genet.
- 40. Lambert JC, Heath S, Even G, Campion D, Sleegers K, et al. (2009) Genome-wide association study identifies variants at CLU and CR1 associated with Alzheimer's disease. Nat Genet 41: 1094–1099.
- 41. Reiman EM, Webster JA, Myers AJ, Hardy J, Dunckley T, et al. (2007) GAB2 alleles modify Alzheimer's risk in APOE epsilon4 carriers. Neuron 54: 713–720.
- 42. van Es MA, Veldink JH, Saris CG, Blauw HM, van Vught PW, et al. (2009) Genome-wide association study identifies 19p13.3 (UNC13A) and 9p21.2 as susceptibility loci for sporadic amyotrophic lateral sclerosis. Nat Genet 41: 1083–1087.
- 43. Laaksovirta H, Peuralinna T, Schymick JC, Scholz SW, Lai SL, et al. (2010) Chromosome 9p21 in amyotrophic lateral sclerosis in Finland: a genome-wide association study. Lancet Neurol 9: 978–985.
- 44. Dejesus-Hernandez M, Mackenzie IR, Boeve BF, Boxer AL, Baker M, et al. (2011) Expanded GGGGCC Hexanucleotide Repeat in Noncoding Region of C9ORF72 Causes Chromosome 9p-Linked FTD and ALS. Neuron 72: 245–256.
- 45. Renton AE, Majounie E, Waite A, Simon-Sanchez J, Rollinson S, et al. (2011) A hexanucleotide repeat expansion in C9ORF72 is the cause of chromosome 9p21-linked ALS-FTD. Neuron 72: 257–268.
- 46. Lee W, Glaeser H, Smith LH, Roberts RL, Moeckel GW, et al. (2005) Polymorphisms in human organic anion-transporting polypeptide 1A2 (OATP1A2): implications for altered drug disposition and central nervous system drug entry. J Biol Chem 280: 9610–9617.
- 47. Hollingworth P, Harold D, Sims R, Gerrish A, Lambert JC, et al. (2011) Common variants at ABCA7, MS4A6A/MS4A4E, EPHA1, CD33 and CD2AP are associated with Alzheimer's disease. Nat Genet 43: 429–435.
- 48. Seshadri S, Fitzpatrick AL, Ikram MA, DeStefano AL, Gudnason V, et al. (2010) Genome-wide analysis of genetic loci associated with Alzheimer disease. JAMA 303: 1832–1840.
- 49. Allen M, Zou F, Chai HS, Younkin CS, Crook J, et al. (2012) Novel late-onset Alzheimer's disease loci variants associate with brain gene expression. Neurology. in-press.
- 50. Atz M, Walsh D, Cartagena P, Li J, Evans S, et al. (2007) Methodological considerations for gene expression profiling of human brain. J Neurosci Methods 163: 295–309.
- 51. Braak H, Braak E (1991) Neuropathological stageing of Alzheimer-related changes. Acta Neuropathol (Berl) 82: 239–259.
- 52. Caffrey TM, Joachim C, Paracchini S, Esiri MM, Wade-Martins R (2006) Haplotype-specific expression of exon 10 at the human MAPT locus. Hum Mol Genet 15: 3529–3537.
- 53. Caffrey TM, Joachim C, Wade-Martins R (2008) Haplotype-specific expression of the N-terminal exons 2 and 3 at the human MAPT locus. Neurobiol Aging 29: 1923–1929.
- 54. Myers AJ, Pittman AM, Zhao AS, Rohrer K, Kaleem M, et al. (2007) The MAPT H1c risk haplotype is associated with increased expression of tau and especially of 4 repeat containing transcripts. Neurobiol Dis 25: 561–570.
- 55. McKhann G, Drachman D, Folstein M, Katzman R, Price D, et al. (1984) Clinical diagnosis of Alzheimer's disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer's Disease. Neurology 34: 939–944.
- 56. Du P, Kibbe WA, Lin SM (2008) lumi: a pipeline for processing Illumina microarray. Bioinformatics 24: 1547–1548.
- 57. Lin SM, Du P, Huber W, Kibbe WA (2008) Model-based variance-stabilizing transformation for Illumina microarray data. Nucleic Acids Res 36: e11.
- 58. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81: 559–575.
- 59. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, et al. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38: 904–909.
- 60. Westfall PH, Zaykin DV, Young SS (2002) Multiple tests for genetic effects in association studies. Methods Mol Biol 184: 143–168.
- 61. Willer CJ, Li Y, Abecasis GR (2010) METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26: 2190–2191.