Diabetic kidney disease, or diabetic nephropathy (DN), is a major complication of diabetes and the leading cause of end-stage renal disease (ESRD) that requires dialysis treatment or kidney transplantation. In addition to the decrease in the quality of life, DN accounts for a large proportion of the excess mortality associated with type 1 diabetes (T1D). Whereas the degree of glycemia plays a pivotal role in DN, a subset of individuals with poorly controlled T1D do not develop DN. Furthermore, strong familial aggregation supports genetic susceptibility to DN. However, the genes and the molecular mechanisms behind the disease remain poorly understood, and current therapeutic strategies rarely result in reversal of DN. In the GEnetics of Nephropathy: an International Effort (GENIE) consortium, we have undertaken a meta-analysis of genome-wide association studies (GWAS) of T1D DN comprising ∼2.4 million single nucleotide polymorphisms (SNPs) imputed in 6,691 individuals. After additional genotyping of 41 top ranked SNPs representing 24 independent signals in 5,873 individuals, combined meta-analysis revealed association of two SNPs with ESRD: rs7583877 in the AFF3 gene (P = 1.2×10−8) and an intergenic SNP on chromosome 15q26 between the genes RGMA and MCTP2, rs12437854 (P = 2.0×10−9). Functional data suggest that AFF3 influences renal tubule fibrosis via the transforming growth factor-beta (TGF-β1) pathway. The strongest association with DN as a primary phenotype was seen for an intronic SNP in the ERBB4 gene (rs7588550, P = 2.1×10−7), a gene with type 2 diabetes DN differential expression and in the same intron as a variant with cis-eQTL expression of ERBB4. All these detected associations represent new signals in the pathogenesis of DN.
The global prevalence of diabetes has reached epidemic proportions, constituting a major health care problem worldwide. Diabetic kidney disease, or diabetic nephropathy (DN)—the major long term microvascular complication of diabetes—is associated with excess mortality among patients with type 1 diabetes. Even though DN has been shown to cluster in families, the underlying genetic and molecular pathways remain poorly defined. We have undertaken the largest genome-wide association study and meta-analysis to date on DN and on its most severe form of kidney disease, end-stage renal disease (ESRD). We identified new loci significantly associated with diabetic ESRD: AFF3 and an intergenic locus on chromosome 15q26 residing between RGMA and MCTP2. Our functional analyses suggest that AFF3 influences renal tubule fibrosis, a pathological hallmark of severe DN. Another locus in ERBB4 was suggestively associated with DN and resides in the same intronic region as a variant affecting the expression of ERBB4. Subsequent pathway analysis of the genes co-expressed with ERBB4 indicated involvement of fibrosis.
Citation: Sandholm N, Salem RM, McKnight AJ, Brennan EP, Forsblom C, Isakova T, et al. (2012) New Susceptibility Loci Associated with Kidney Disease in Type 1 Diabetes. PLoS Genet 8(9): e1002921. https://doi.org/10.1371/journal.pgen.1002921
Editor: Carsten A. Böger, University Hospital Regensburg, Germany
Received: February 14, 2012; Accepted: July 12, 2012; Published: September 20, 2012
Copyright: © Sandholm et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: United Kingdom: The Warren 3/UK GoKinD Study Group was jointly funded by Diabetes UK and the Juvenile Diabetes Research Foundation. EJ Swan is supported by Diabetes UK Studentship. The Golden Years cohort, established by GV Gill, AH Barnett, SC Bain, was funded by Diabetes UK. FinnDiane: The FinnDiane Study was supported by grants from the Folkhälsan Research Foundation, the Wilhelm and Else Stockmann Foundation, Liv och Hälsa Foundation, Helsinki University Central Hospital Research Funds (EVO), the Sigrid Juselius Foundation, the Signe and Arne Gyllenberg Foundation, Finska Läkaresällskapet, TEKES, the European Union's Seventh Framework Program (FP7/2007–2013) for the Innovative Medicine Initiative under grant agreement IMI/115006 (the SUMMIT consortium). V-P Mäkinen is supported by Orion-Farmos Research Foundation. Boston: The GENIE Consortium is supported by a US Ireland R&D partnership award funded by Science Foundation Ireland under Grant No. SFI/08/US/B1517, The Northern Ireland Research Development office, and NIH NIDDK R01 DK081923 to JN Hirschhorn, JC Florez, and P-H Groop. RM Salem was supported by a Juvenile Diabetes Research Foundation post doctoral fellowship (JDRF # 3-2011-70). Dublin: The ROI collection was supported by funding from the Health Research Board Ireland to Hugh R. Brady. Sweden (Stockholm and Umeå samples): Supported by Family Erling-Persson Foundation and Swedish Medical Research Council. Karolinska Institutet: Supported by Knut and Alice Wallengberg Foundation, Swedish Diabetes Society Foundation, Swedish Medical Research Council, and Novo Nordisk Foundation. SDR (Scania Diabetes Registry): The research at Lund University diabetes Centre was supported by the European Union's Seventh Framework Program (FP7/2007–2013) for the Innovative Medicine Initiative under grant agreement IMI/115006 (the SUMMIT consortium). E Ahlqvist was funded by a grant from ENGAGE. Romania: The RomDiane study was supported by Projects CNCSIS-PNCDI-II-RU-PD 133/2010, CNCSIS-PNCDI-II-RU-TD 66/2007, and CNCSIS–PNCDI-II-RU-PI-CE-ID 1194/2009. Steno: The study was made possible through support from the European Commission through contract QLG2-CT-2001-01669, the Danish Diabetes Association, and the Sehested Hansen Foundation. DCCT/EDIC: The DCCCT/EDIC Research Group is sponsored through research contracts from the National Institute of Diabetes, Endocrinology, and Metabolic Diseases of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) and the National Institutes of Health. The Diabetes Control and Complications Trial (DCCT) and its follow-up the Epidemiology of Diabetes Interventions and Complications (EDIC) study were conducted by the DCCT/EDIC Research Group and supported by National Institute of Health grants and contracts and by the General Clinical Research Center Program, NCRR. SB Bull held a Canadian Institutes of Health Research (CIHR) Senior Investigator award (2002–7). AD Patterson holds a Canada Research Chair in the Genetics of Complex Diseases. This work has received support from National Institute of Diabetes and Digestive and Kidney Diseases Contract N01-DK-6-2204, National Institute of Diabetes and Digestive and Kidney Diseases Grant R01-DK-077510, and support from the Canadian Network of Centres of Excellence in Mathematics and from Genome Canada through the Ontario Genomics Institute. eQTL analysis in Pima Indians: This work was supported, in part, by the Intramural Research Program of the National Institute of Diabetes and Digestive and Kidney Diseases. France: GENEDIAB/GENESIS studies were supported by grants from the French Ministry of Health (Programmes Hospitaliers de Recherche Clinique), from the Société Francophone du Diabète, and from the Association Française des Diabétiques. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: JC Florez has received consulting honoraria from Novartis, Lilly, and Pfizer. M Kretzler received grant support from Hoffman La Roche and Fibrotech. P-H Groop has received lecture honorariums from Abbot, Boehringer Ingelheim, Cebix, Eli Lilly, Genzyme, Novartis, Novo Nordisk, MSD, and research grants from Eli Lilly and Roche. P-H Groop is also an advisory board member of Boehringer Ingelheim and Novartis.
Diabetic kidney disease, or diabetic nephropathy (DN), is the leading cause of end-stage renal disease (ESRD) worldwide . It affects approximately 30% of patients with long-standing type 1 and type 2 diabetes , , and confers added risks of cardiovascular disease and mortality. DN is a progressive disorder that is characterized by proteinuria (abnormal loss of protein from the blood compartment into the urine) and gradual loss of kidney function. Early in its course, the kidneys are hypertrophic, and glomerular filtration is increased. However, with progression over several years, proteinuria and decline in kidney function set in, and may result in fibrosis and terminal kidney failure, necessitating costly renal replacement therapies, such as dialysis and renal transplantation. While current treatments that decrease proteinuria will moderately abate DN progression, recent studies show that even with delivery of optimal care, high risks of cardiovascular disease, ESRD and mortality persist , . Therefore, discovery of genetic factors that influence development and susceptibility to DN is a critical step towards the identification of novel pathophysiologic mechanisms that may be targeted for interventions to improve the adverse clinical outcomes in diabetic patients.
Whereas the degree of glycemia plays a pivotal role in DN, a subset of individuals with poorly controlled type 1 diabetes (T1D) do not develop DN. Furthermore, strong familial aggregation supports genetic susceptibility to DN. The sibling risk of DN has been estimated to be 2.3-fold . While prior studies of individuals with T1D have reported on the possible existence of genetic associations for DN, results have been inconclusive. In GENIE, we leveraged three existing collections for T1D nephropathy (All Ireland Warren 3 Genetics of Kidneys in Diabetes UK Collection [UK-ROI], Finnish Diabetic Nephropathy Study [FinnDiane], and Genetics of Kidneys in Diabetes US Study [GoKinD US]) comprising 6,691 individuals to perform the most comprehensive and well powered DN susceptibility genome-wide association study (GWAS) and meta-analysis to date, with the aim to identify genetic markers associated with DN by meta-analyzing independent GWAS, imputed to HapMap CEU II (Table 1, Figure 1). As a result, we here present two new loci associated with ESRD and a locus suggestively associated with DN.
We applied a two stage study design, where the top signals from the meta-analysis of three GENIE studies (UK-ROI, FinnDiane and GoKinD US) were followed up in phase two analysis, consisting of nine T1D cohorts. After combined meta-analysis, two signals reached genome-wide significance in the analysis of ESRD (P<5×10−8). For DN phenotype no loci reached this threshold, but the strongest association was observed for ERBB4. These signals were followed up with eQTL studies and functional analysis. The number of patients (N) refers to the number of samples after genotype quality control; either the total number of samples or divided into cases/controls.
The primary phenotype of interest was DN, defined by the presence of persistent macroalbuminuria or ESRD in individuals aged over 18 who had T1D for at least 10-year duration. Controls were defined as individuals with T1D for at least 15 years but without any clinical evidence of kidney disease (see Methods for more detailed definitions). Meta-analysis of the DN results from each cohort resulted in five independent signals with P<10−5 (Table S1, Figure S1A). In a parallel analysis of ESRD versus non-ESRD (n cases = 1,399, n controls = 5,253; referred to as “ESRD” analysis throughout the manuscript, unless otherwise stated), SNP rs7583877 on chromosome 2q11.2-q12 achieved genome-wide significance (P = 4.8×10−9), primarily driven by FinnDiane and the UK-ROI samples, along with six other independent signals reaching P<10−5 (Figure 2A, Table S1, Figure S1C).
Panels represent independent signals for the primary DN and ESRD analysis. The color of the SNP symbol indicates the linkage disequilibrium (r2) with the index SNP which is colored purple. Blue and red gene colors in the lower part of each figure panel indicate up and down regulation in tubulointerstitial or glomerular DN kidney biopsies, respectively. Genes with no change in expression are indicated with black; no data on gene expression with gray color. (A) Association of rs7583877 with ESRD. (B) Association of rs12437854 with ESRD. (C) Association of rs7588550 with DN.
We invited investigators responsible for available collections with similar phenotypes to participate in the secondary genotyping phase of the top ranked SNPs (n = 41 including proxies, representing 24 independent signals) from the initial meta-analysis. Nine independent cohorts contributed 5,873 individuals with comparable phenotypic inclusion criteria (Table S2). After the combined meta-analysis of the first and second phase cohorts, the association of the intronic SNP rs7583877 in AFF3 with ESRD retained genome-wide significance (odds ratio [OR] = 1.29, 95% confidence interval [CI]: 1.18–1.40, P = 1.2×10−8; Figure 3A), with the bulk of the association evidence still provided by the FinnDiane and UK-ROI cohorts. The population attributable risk [PAR] for the causal variant underlying the observed association at rs7583877 was estimated to be 3.5%–10.5%. AFF3 belongs to the AFF (AF4/FMR2) family and encodes a transcriptional activator, with DNA-binding activity, initially found to be fused with MLL in some acute lymphoblastic leukemia patients , . Recent evidence points to a role for AFF3 as an RNA-binding protein, with overexpression affecting organization of nuclear speckles and splice machinery integrity . Variants near AFF3 have been associated with acute lymphoblastic leukemia , rheumatoid arthritis ,  and recently T1D , . Another locus between the RGMA (RGM domain family, member A) and MCTP2 (multiple C2 domains, transmembrane 2) genes on chromosome 15q26 also reached genome-wide significance for association with ESRD (rs12437854, OR 1.80, 95% CI: 1.48–2.17, P = 2.0×10−9; Table 2, Figure 3B). PAR estimates for this locus varied from 0.5% to 4.1%. For the primary DN phenotype, an intronic SNP in the ERBB4 gene demonstrated consistent protective effects in the replication samples and was the top associated SNP identified from the combined discovery and second stage analysis; however, this did not reach genome-wide statistical significance (rs7588550, OR 0.66, 95% CI: 0.56–0.77, P = 2.1×10−7, PAR 28.3%–32.5% for removal of the major risk allele; Table 2, Figure 3C). ERBB4 encodes an epidermal growth factor receptor subfamily member, and has been implicated in cardiac, mammary gland and neural development , . Mutations in ERBB4 have previously been reported in cancer . Several studies using Madin-Darby canine kidney (MDCK) cells and conditional ERBB4 overexpression/knock-out mice, suggest a crucial role for ERBB4 in renal development and tubulogenesis , .
Plots show the study-specific association estimates (OR) and 95% confidence intervals for the discovery and second phase studies. (A) Association of rs7583877 with ESRD; heterogeneity P = 0.037. (B) Association of rs12437854 with ESRD; heterogeneity P = 0.046. (C) Association of rs7588550 with DN; heterogeneity P = 0.467. The association estimate and confidence interval for the meta-analysis combining the discovery and second-stage results are denoted by the diamond.
It is possible that our observed signal is in linkage disequilibrium with an untyped SNP, or exerts functional effects over an extended genomic region. To explore a putative biological signature we identified, for the top three SNPs, all genes within a 2 Mb window (1 Mb upstream and downstream). Gene ontology analysis revealed no significant enrichment of biological terms or pathways within this subset of flanking genes (Table S3). We determined whether any of these genes were differentially expressed in microarray data derived from tubulointerstitial (n = 49) or glomerular (n = 70) human early DN renal biopsy material versus pre-transplant renal biopsies from living kidney donors (n = 32) . Around rs7583877 (AFF3), we noted upregulation of LIPT1 and TXNDC9, while TSGA10 was downregulated in both tubulointerstitial and glomerular enriched kidney biopsies (Figure 2 and Table S4). NPAS2, which flanks rs7583877 (AFF3), and FAM174B and CHD2, which flank rs12437854 (15q26), were downregulated in glomerular enriched biopsies of DN patients versus control, but remained unchanged in tubulointerstitial biopsies (Figure 2 and Table S4). NPAS2 (neuronal PAS domain protein 2), has been implicated in circadian rhythms in the distal nephron segments, acting as a regulator of kidney function . Interestingly, mutations in chromodomain helicase DNA binding protein 2 (CHD2), encoding a chromatin-remodeling enzyme, result in impaired glomerular function in mice . Furthermore, at the rs7588550 (ERBB4) locus expression of ERBB4 was down, and SPAG16 upregulated in tubulointerstitial enriched kidney biopsy tissue of DN versus control subjects (Figure 2 and Table S4).
We also examined whether any of the top three SNPs modulated expression of neighboring genes in cis in a dataset of glomerular and tubulointerstitial kidney biopsies of Pima Indians with type 2 diabetes and DN who had been genotyped on the Affymetrix 6.0 array . In Pima Indians, no adequate proxies (haplotype-based D′≥0.8) for the Affymetrix 6.0 SNPs that were strongly correlated with GWAS findings (r2≥0.8) could be found for rs12437854, and expression of AFF3 was below detectable thresholds in this dataset; however, two SNPs in the same intron of ERBB4 as rs7588550 (rs17418640 and rs17418814) were associated with genotype-specific expression of ERBB4 in tubulointerstitial but not in glomerular tissue in the Pima cohort (P<0.05; Figure S2). Follow-up work is required to investigate the DN associated and eQTL signals in this ERBB4 intron.
To explore the potential functional role of these ERBB4 SNPs, we looked for other genes whose expression is correlated with that of ERBB4. A total of 388 ERBB4-correlated genes were found in the Pima population (Benjamini-Hochberg Q-value<0.1). Pathway analysis of these genes indicates coexpression of ERBB4 with collagen-related genes, which have been implicated in renal fibrosis ,  (Genomatix Pathway System; Table S5).
Because the low expression level of AFF3 limited exploration of this gene using expression data, we pursued additional functional experiments in an in vitro model of renal fibrosis, namely human tubular epithelia exposed to transforming growth factor-β1 (TGF-β1). Low-level basal expression of the AFF3 mouse homologue (LAF4) has been reported in kidney tubules during embryonic development  suggesting proximal renal tubule epithelial cells may be suitable for detection and functional interrogation of AFF3. TGF-β1 is implicated in the development of diabetic glomerulosclerosis, and there is recent appreciation of its role as a key driver of tubulointerstitial fibrosis. TGF-β1 induces epithelial cell de-differentiation into a more mesenchymal-like phenotype, characterized by a switch in predominant cadherins from E-cadherin (epithelial) to N-cadherin (mesenchymal), and increased vimentin, α-smooth muscle actin, connective tissue growth factor (CTGF) and Jagged 1 , . TGF-β1-mediated loss of E-cadherin in renal epithelia, is believed to be mediated through loss of miR-192 expression . We and others have previously shown that Jagged 1, a ligand for multiple Notch receptors, is up-regulated in human diabetic kidney disease , , with the Notch signaling pathway implicated in driving renal fibrosis , . CTGF is a member of the CCN protein family, with biological roles in differentiation and tissue repair. CTGF is induced by TGF-β1 and enhances expression of multiple extracellular matrix proteins observed in DN, including collagens and fibronectin, and CTGF expression is elevated in the glomeruli of STZ (streptozotocin) - treated rats, an in vivo model of T1D . Basal AFF3 expression was detectable in HK-2 cells, and expression levels were upregulated upon stimulation with TGF-β1 (5 ng/ml; 48 h), as measured at protein and RNA level (Figure 4A–4B). Inhibition of AFF3 by siRNA attenuated the expression of TGF-β1-driven markers of fibrosis - CTGF and N-cadherin (Figure 4C–4E). Taken together, these data suggest that AFF3 may play a role in TGF-β1-induced fibrotic responses of renal epithelial cells.
(A) Western blot of AFF3 protein expression in HK-2 cells stimulated with TGF-β1 (5 ng/ml; 24–48 h). (B) TaqMan quantitative PCR analysis of AFF3 mRNA expression in HK-2 cells stimulated with TGF-β1 (5 ng/ml; 48 h) and (C) AFF3 mRNA expression in HK-2 cells transfected with AFF3 siRNA in the presence (black bar)/absence (grey bar) of TGF-β1 (5 ng/ml; 48 h). (D) TaqMan quantitative PCR analysis of N-cadherin, CTGF, Jagged1 and E-cadherin expression in HK-2 cells transfected with AFF3 siRNA in the presence (black bar)/absence (grey bar) of TGF-β1 (5 ng/ml; 48 h). (E) Representative Western blot of N-cadherin, CTGF, Jagged1 and E-cadherin protein responses in HK-2 cells transfected with AFF3 siRNA in the presence/absence of TGF-β1 (5 ng/ml; 48 h). HK-2 cells transfected with control siRNA were selected as a control. For TaqMan PCR, expression was normalized to GAPDH. Data are plotted as mean ± SE (n = 3; *P<0.05, **P<0.01).
Traditionally, DN has been viewed as a continuous trait with onset at microalbuminuria, progression to macroalbuminuria, loss of GFR, and culmination in ESRD. Recent studies have called this paradigm into question, suggesting that the syndrome may perhaps be composed of varying phenotypes , . Association of rs7583877 (AFF3) and rs12437854 (RGMA – MCTP2) with the different stages of DN was tested on a time-to-event analysis of relevant endpoints using longitudinal data for participants in the FinnDiane discovery collection. Consistent with our case-control GWAS analyses, the strongest association for rs7583877 was observed for the time from T1D diagnosis to development of ESRD (hazard ratio [HR] 1.33, 95% CI: 1.18–1.49, P = 1.9×10−6), but also the time from T1D diagnosis to development of macroalbuminuria (HR 1.15, 95% CI: 1.04–1.27, P = 0.006) and the time from macroalbuminuria to ESRD (HR 1.16, 95% CI: 1.01–1.36, P = 0.04) reached nominal significance. Similarly, rs12437854 was associated with time from T1D diagnosis to development of macroalbuminuria (HR 1.31, 95% CI: 1.03–1.67, P = 0.03) and ESRD (HR 1.35, 95% CI: 1.02–1.77, P = 0.03) (Text S1, Table S6, Figure S3). When we studied these SNPs and their association with various DN-related phenotypes in the case-control setting of the discovery cohorts, similar observations were made supporting the role of these SNPs in the development of ESRD: Whereas we found evidence of association between rs7583877 (AFF3) and all the examined phenotypes with ESRD as the case definition, only moderate association was observed for the DN phenotype (OR = 1.14, P = 0.002) and no association when patients with macroalbuminuria were compared to controls with normoalbuminuria (OR = 1.00, P = 0.95). rs12437854 (RGMA – MCTP2) had the strongest association with the original ESRD phenotype (controls defined as all non-ESRD subjects) and with the ESRD vs. normoalbuminuria phenotype, and moderate association with the DN phenotype and comparison of ESRD vs. macroalbuminuric patients (Table S7).
An alternative explanation for our ESRD findings may be that the associated variants in AFF3 gene and on chromosome 15q26 might be markers of survival. Mortality rates are extremely high in patients with kidney disease and macroalbuminuria, with at least 25% of macroalbuminuric patients dying before they reach ESRD . Thus, the selection of patients with ESRD may be biased towards selection of severe kidney disease survival. To address this question, we used the time until death as the final end point in the longitudinal analysis. Neither of the loci associated with ESRD was also associated with mortality (Text S1, Table S6, Figure S3), suggesting that these loci are associated with ESRD per se.
To explore whether these SNPs contribute to DN via related intermediate phenotypes, such as adiposity, fasting lipid levels, or blood pressure we performed in silico searching of publicly available GWAS datasets for our top SNPs –. We found nominal, directionally consistent associations of rs12437854 with fasting glucose (P = 0.03)  and of rs7583877 with waist-hip ratio (P = 0.04)  (Table S8). We also considered if previously published T1D and CKD SNP associations were associated with DN or ESRD in our GWAS meta analyses. Eight of 80 SNPs at T1D-associated loci showed nominal significance with DN or ESRD (including three at AFF3 that are in weak LD [r2 0.030–0.046 in CEU] with the SNPs described here), while no CKD SNPs were nominally significant (Table S9) –. The lack of association with DN for CKD-associated SNPs suggests that the genetic risk factors for DN may differ from the genetic risk factors for CKD in a nondiabetic population.
Finally, to generate further biological hypotheses based on our GWAS results, we employed MAGENTA  gene set enrichment analysis software integrating Gene Ontology (GO) terms, KEGG and Ingenuity pathways and PANTHER database entries (Table S10). In the analysis of DN as a case phenotype, enriched gene sets included “sugar binding” (P = 0.0006), “double stranded DNA binding” (P = 0.001) and “nucleic acid binding” (P = 0.004). In the analysis of ESRD significantly enriched gene sets (P<0.01) included an enrichment of terms associated with DNA binding, including “sequence-specific DNA binding” (P = 0.003), “positive regulation of transcription” (P = 0.003), and “homeobox transcription factor” (P = 0.004). Taken together, the principal biological signal found within GWAS data suggests an enrichment of transcriptional regulators.
In this largest meta-analysis to date of DN from individuals with T1D, we found two genome-wide significant associations with ESRD. Variants in AFF3 have been shown to be associated with juvenile idiopathic rheumatoid arthritis, Graves' disease, celiac disease and T1D, indicating this may be a pan-autoimmune disease gene. It is possible that the AFF3 signal represents an association with T1D and/or is a false positive finding, as it was not seen in the follow-up cohorts. However, we note the following: 1) both FinnDiane and UK-ROI yielded very similar association results, 2) the number of ESRD cases in the replication cohorts is small (n = 363), indicating that statistical power to replicate the original association is limiting, 3) the association result in the second stage, while non-significant, trends in a consistent direction (OR 1.11), 4) after evaluating >12,000 individuals the AFF3 signal remained genome-wide significant (P = 1.2×10−8), and 5) we have provided supportive functional evidence that suggests AFF3 may be a relevant contributor to renal disease. Although survival bias is a possibility in the analyses of ESRD, longitudinal analysis revealed the association of the AFF3 and chromosome 15q26 loci with renal end-points and not with death. Experimental models provide independent evidence of AFF3 involvement in renal fibrosis and support an association of this locus with a renal phenotype. Importantly, despite our large sample size, we did not achieve genome-wide statistical significance for DN using a combined proteinuria/ESRD phenotype, suggesting that this phenotype may have been too heterogeneous to detect significant associations with a sample of this size. For example, lifelong glycemic control, a known risk factor for DN, is not well captured in most existing cohorts. Nevertheless, this study is the largest, well powered GWAS on DN to date. We demonstrated a suggestive signal of association at ERBB4 that is supported by experimental data showing haplotype specific mRNA expression in DN biopsies. Our findings reinforce the need for additional studies of patients with T1D and a homogeneous renal phenotype, in whom additional GWAS, fine-mapping and sequencing to uncover rare variants could be performed. Integration of our findings with ongoing GWAS in both type 1 and type 2 diabetes DN may also lead to discovery of additional genetic determinants of DN. The traditional phenotypic definition of DN for individuals with type 2 diabetes may be even more challenging for genetic studies given the heterogeneity of vascular complications and differential renal diagnoses. Several larger-scale GWAS have now been conducted for renal phenotypes –, however in most cases the true disease-causing variant and functional impact for specific phenotypes remains to be established. Encouraging reports include the association of uromodulin with CKD , MYH9/APOL1 with non-diabetic ESRD , , and PLA2R1 with membranous nephropathy, where anti-PLA2R antibodies appear to predict activity of the disease as well as response to therapy .
Our findings point to two transcriptional networks centered around AFF3 and ERBB4 that may be operational in the pathogenesis of kidney disease in diabetes.
All human research was approved by the relevant institutional review boards, and conducted according to the Declaration of Helsinki.
We implemented a two stage analysis, in which a GWAS was performed using a set of three discovery cohorts in the GENIE consortium, and top signals for the DN and ESRD analyses were analyzed further in the second phase in a set of nine independent cohorts (described below) with 5,873 patients in total. The patient numbers in the individual studies are given in Table S11. Additional details are provided in the online material Text S1.
All Ireland, Warren 3, Genetics of Kidneys in Diabetes UK (UK-ROI) Collection 
Inclusion criteria included white individuals with T1D, diagnosed before 31 years of age, whose parents and grandparents were born in the UK and Ireland. The case group comprised 903 individuals with persistent proteinuria (>500 mg/24 h) developing more than 10 years after the diagnosis of diabetes, hypertension (>135/85 mmHg and/or treatment with antihypertensive medication), and retinopathy; ESRD (27.2%) was defined as individuals requiring renal replacement therapy or having received a kidney transplant. Absence of DN was defined as persistent normal urine albumin excretion rate (AER; 2 out of 3 urine albumin to creatinine ratio [ACR] measurements <20 µg of albumin/mg of creatinine) despite duration of T1D for at least 15 years, while not taking an antihypertensive medication, and having no history of treatment with ACE inhibitors; 1,001 individuals formed the control group. After exclusion of patients with low quality DNA samples, 914 DN/ESRD cases and 956 controls remained for the GWAS.
Finnish Diabetic Nephropathy Study (FinnDiane) 
The FinnDiane study is a Finnish cohort of more than 4,800 adult ethnic Finns with T1D, recruited from across Finland, diagnosed prior to age 35 and insulin treatment begun within 1 year. This study comprises 1,721 patients with normal AER, 516 with microalbuminuria, 733 with macroalbuminuria and 682 with ESRD. The disease status was defined by urine AER or urine ACR in at least two out of three consecutive urine collections at local centers: Microalbuminuria was defined as AER≥20<200 µg min−1 or ≥30<300 mg/24 h or an ACR of 2.5–25 mg mmol−1 for men and 3.5–35 mg mmol−1 for women in overnight, 24-hour or spot urine collections, respectively. Similarly, the limit for macroalbuminuria was AER≥200 µg min−1 or ≥300 mg/24 h or ACR≥25 mg mmol−1 for men and ≥35 mg mmol−1 for women. ESRD was defined as ongoing dialysis treatment or transplanted kidney. Control patients with normal AER were required to have T1D duration of at least 15 years. 558 of these patients were included from an independent Finnish cohort collected by the National Institute of Health and Welfare. These patients met the FinnDiane diagnosis and selection criteria, and were analyzed together with the FinnDiane cohort.
Genetics of Kidneys in Diabetes US Study (GoKinD US) 
The GoKinD US study consists of a DN case-control cohort of individuals diagnosed with T1D prior to 31 years of age who began insulin treatment within 1 year of T1D diagnosis. Controls were 18–59 years of age, with T1D for at least 15 years but without DN, n = 889. DN definition includes individuals with ESRD, dialysis or kidney transplant and persistent macroalbuminuria (at least 2 out of 3 tests positive for albuminuria by dipstick ≥1+, or ACR>300 µg albumin/mg of urine creatinine). Cases were defined as people 18–54 years of age, with T1D for at least 10 years and DN, n = 903. Individuals recruited to the control group employed the same inclusion criteria as UK-ROI. Individuals were recruited at two study centers, George Washington University (GWU) and the Joslin Diabetes Centre (JDC) using differing methods of ascertainment and recruitment . Analysis of the GoKinD US cohort was limited to individuals whose primary ethnicity was Caucasian.
Collections genotyped in Phase 2
DNA was sought from worldwide case-control collections of individuals with T1D and known renal status. A total of 5,873 individuals from nine independent collections were genotyped or imputed for the top-ranked SNPs (n = 41 including 17 proxies), with the exception of the DCCT/EDIC cohort where GWAS data was imputed. All the patients included in the phase two analysis were adults of European descent and had T1D diagnosed before 35 years of age. Controls with normal AER had duration of T1D at least 15 years, and cases with DN had minimum T1D duration of 10 years. If a collection included patients with microalbuminuria, they were excluded from the primary analysis of DN, but included as controls in the analysis of ESRD versus non-ESRD. The main clinical characteristics of all the replication cohorts are shown in the Table S2 and the cohorts are described in Text S1.
The primary phenotype of interest was DN, defined as individuals aged over 18, with T1D for at least 10 years and diabetic kidney disease. DN includes ESRD or persistent macroalbuminuria as defined in the cohort descriptions above. Controls were defined as individuals with T1D for at least 15 years but without any clinical evidence of kidney disease. Individuals with microalbuminuria were excluded from the primary DN analysis in all cohorts. Disease status definitions were consistent across all the study cohorts. Details of clinical characteristics for each cohort are defined in Table 1 and Table S2. We evaluated a second phenotype to gain further insights into the genetic basis of the most severe form of DN (leading to ESRD), and compared ESRD cases to all those without ESRD. This phenotype is referred to as the “ESRD” or “ESRD vs. non-ESRD” phenotype throughout the manuscript. We also considered individuals with ESRD compared to T1D controls with no clinical evidence of DN. Results for this comparison are given in the online supporting material (Tables S1, S6, S7, S9, S10), where this contrast is called “ESRD vs. normoalbuminuria” or “ESRD vs. normo”.
DNA from individuals in the UK-ROI collection were genotyped using the Omni1-Quad array (Illumina, San Diego, CA, USA) while FinnDiane samples employed Illumina's BeadArray 610-Quad array. Samples in UK-ROI and FinnDiane were excluded if they had insufficient DNA quality, quantity or poor genotype concordance with previous genotypes during the fingerprint evaluation stage. Existing genotype data for the GoKinD US genotype data was downloaded from dbGAP (phs000018.v2.p1, retrieved June 2010), containing updated genotype data from Affymetrix 500 K set (Affymetrix, Santa Clara, CA, USA).
Genotype quality control
Samples for UK-ROI and FinnDiane were excluded for insufficient DNA quality, quantity or poor genotype concordance with previous genotypes during a fingerprint evaluation stage. In the UK-ROI sample, 1,830 unique case (n = 872) and control (n = 958) individuals were submitted for genotyping on the Omni1-Quad. For FinnDiane, 3,651 individuals (cases, n = 1,934; controls n = 1,721) were submitted for genotyping on the 610-Quad. For all three discovery datasets (UK-ROI, FinnDiane, GoKinD US), uniform and extensive genotype quality control procedures were applied: SNPs were filtered for those with call rates greater than 90%, minor allele frequency (MAF) exceeding 1%, and concordance with Hardy Weinberg Equilibrium (HWE, P<10−7). Sample filters included individual call rates greater than 95%, no extreme heterozygosity and cryptic relatedness as determined using identity by descent (first degree relatives, estimated identity by descent >0.4), and admixture assessment using principal components (plotted with HapMap reference panel, Figure S4). Additional quality control measures included test of missing by haplotype (P<10−8), missing by phenotype (P>10−8) and plate effects (P<10−7). These quality control steps were performed using PLINK  with custom Perl and R analysis scripts. Known copy number variation and mitochondrial SNPs were excluded from analyses. Detailed results of each QC step are reported in Table S12 for each study population.
A HapMap control sample was included on all genotyping plates for UK-ROI; average call rate was 99.9% with HapMap concordance equaling 99.7%. The average sample call rate was 99.5% in UK-ROI with sample heterozygosity 22.1%. Concordance with internal control for FinnDiane was 99.996% with an average sample call rate of 99.8%.
Principal Component Analysis (PCA) was performed separately for each of the three studies with the EIGENSTRAT program  in order to detect genetic outliers and to adjust the analyses for population structure. Genetic outliers were defined as more than six standard deviations away from the center of distribution along any of the ten first principal components and the procedure was repeated until no outliers were detected. After filtering, PCA were calculated for each study cohort combined with unrelated individuals from three original HapMap populations (www.hapmap.org), and plotted to identify additional admixed individuals. The first ten principal components were employed to adjust the association analysis for any residual population structure from the cleaned datasets.
In total, directly genotyped results for 823 cases and 903 controls in 791,687 SNPs passed QC procedure in UK-ROI. Similarly, 549,530 SNPs with average genotyping rate of 99.9% passed the QC filters in 1,319 cases, 1,591 controls and 460 individuals with microalbuminuria for FinnDiane. 360,899 SNPs in 774 cases and 821 controls for GoKinD US passed quality control and were included in the analysis.
Imputation was performed after the quality control employing MACH 1.0 software (http://www.sph.umich.edu/csg/abecasis/MACH) with HapMap phase II CEU population as a reference, resulting in ∼2.4 million SNPs for each cohort. The cross-over and error rates were estimated with 50 iteration rounds in roughly 300 randomly selected samples. The imputation was run with the greedy algorithm and the maximum likelihood method in order to obtain expected allele dosages rather than integer allele counts. SNPs with low imputation quality (r2<0.6) are not reported.
PLINK v1.07  was employed to conduct association tests for the allele dosage data with logistic regression adjusted for sex, age, the duration of diabetes and the ten first components of the study specific principal component analysis. UK-ROI and GoKinD US were adjusted for study center, but in the primary DN phenotype the two GoKinD US centers; GWU and JDC, were analyzed separately. Results from individual studies were adjusted for study specific genomic inflation factor and then combined by fixed effect meta-analysis model using METAL , to estimate the combined effect sizes and significances from beta values and standard error. Regional association plots were generated using hg18 in LocusZoom . Quantile-Quantile plots were generated to evaluate the number and magnitude of observed associations compared with those expected under the null hypothesis (Figure S1).
Second-phase SNP selection and genotyping
All SNPs observed with P<10−5 were selected for further analysis. These SNPs were reviewed and a top SNP (with a proxy) was selected for each independent signal (SNPs more than 500 kb distant or LD r2<0.3 in HapMap II CEU) using the LD-based clumping procedure implemented in PLINK. De novo genotyping was performed for all phase two cohorts except for DCCT/EDIC using identical designs of Sequenom IPLEX assays (Sequenom Inc, San Diego, US). The DCCT/EDIC samples were imputed from their GWAS results that had undergone their respective quality control procedure. The statistical analysis was similar to the discovery cohorts with the difference that the models were not adjusted for principal components. All results were then combined by meta-analysis using METAL software as previously described.
Time to event analyses were performed on longitudinal data from the FinnDiane discovery cohort using Kaplan-Meier and Cox proportional hazards regression with the aim to evaluate the genetic association of rs7583877 and rs12437854 with time from the diagnosis of T1D to the onset of the following end points: microalbuminuria, macroalbuminuria or ESRD. Additionally, we analyzed time from onset of macroalbuminuria to development of ESRD. The most recent kidney status data were utilized for each patient. We also examined if the two main association loci, rs7583877 and rs12437854, were associated with mortality using data from the Finnish Death Registry (as per 30.9.2010). As DN (defined as macroalbuminuria or ESRD) is strongly associated with mortality, the time to death was separately analyzed for patients without DN (time from T1D onset to death; patients who developed DN were censored at the time of the onset of DN) and for those with DN (time from onset of DN to death and time from onset of ESRD to death). Analyses were performed using the ‘survival’ package in R software (version 2.36-10, http://cran.r-project.org/web/packages/survival). (See Text S1.)
SNPs were annotated with associated genes and function using dbSNP build 132, human build 37.1. Cytogenetic locations for genes were sourced from Entrez gene; locations for SNPs that were not associated with genes were recorded from NCBI MapView. In silico analyses included gene set enrichment using MAGENTA . To explore functional implications of AFF3, human kidney epithelial cells (HK-2) were cultured and evaluated (Figure 4).
Renal biopsy populations
Gene expression was measured in renal tissue compartments micro-dissected from renal biopsies from Pima Indians with type 2 diabetes and early stage DN (n = 77), as well as from Caucasian living kidney transplant donors (n = 20). Pima Indian subjects are 25–68 in age, with measured ACR in the range 5.23–7162, and GFR in the range 40.45–274.80.
Renal biopsies were micro-dissected into glomeruli and either tubulointerstitial or cortical compartments, and gene expression measured using the Affymetrix HGU-133A and HGU-133 Plus 2 platforms . Background adjustment, quantile normalization and probe-set summarization were performed with in a GenePattern (www.genepattern.org) pipeline using Robust Multichip Analysis  with batch correction using Combat . The differential expression data sets were processed with the Entrez Gene Custom CDF v.10, and the eQTL data sets were processed with the RefSeq Custom CDF v.12  for probe-sets common to both expression platforms.
The Affymetrix 6.0 genotyping platform was used to genotype Pima Indians with glomerular expression (n = 65), a subset of which (n = 54) also had tubulointerstitial/cortical expression. The cis region of each gene was defined as 150 kb upstream of the transcript start site and 50 kb downstream of the transcription end site.
Manhattan and QQ-plots for DN and ESRD phenotypes. Manhattan plots (panels A and C) highlighting P values from the discovery meta-analysis where dotted horizontal lines represent the threshold for follow up, P<1×10−5, and the solid horizontal lines indicate the threshold for genome-wide significance, P<5×10−8. The nearest genes are indicated above regions of interest. SNPs that reached threshold P<1×10−5 and were selected for follow up are denoted as black diamonds, SNPs in linkage disequilibrium (R>0.6) with top SNP are denoted with blue dots, and final meta analysis P values (discovery+phase 2 results) as red triangles. Q-Q plots (panels B and D) evaluated inflation of the GWAS results and show the expected versus observed P values; the diagonal line is the line of identity. The inflation factor λ for the genomic control is indicated in the Q-Q plots.
Box and whisker plots of normalized ERBB4 expression intensities in glomerulus (A,B) and tubulointerstitium (C,D) by genotype showing eQTL associations in tubulointerstitium. Both SNPs show significant eQTL associations in tubulointerstitial kidney biopsies of Pima Indians with type 2 diabetes and DN (P = 0.018 for rs1718640, P = 0.024 for rs17418814; linear regression using additive model). Association remained significant for rs17418640 when the subject with homozygous minor allele was excluded (P = 0.043). Associations with glomerular expression are not significant. Gene expression in kidneys was evaluated with Affy HGU-133A custom CDF probesets annotated to RefSeq transcripts NM_005235 and NM_001042599, and SNPs were genotyped with Affy 6.0 genotyping platform. Conditional analysis indicates rs17418814 is dependent on rs1718640 (P = 0.95 conditioned on rs1718640, versus rs1718640 P = 0.48 conditioned on rs17418814). Both SNPs lie within the same intron of the ERBB4 gene as rs7588550 that was suggestively associated with DN.
Longitudinal analyses in FinnDiane for rs7583877 (AFF3) and rs12437854 (chromosome 15q26). Analyses assume an additive model of the SNP effects. The plotted survival curves have been truncated at the point at which fewer than five participants remained with the corresponding genotype. The genotype legend in each figure indicates the number of samples with the corresponding genotype, shown in parentheses. The P-value is indicated for the nominally significant associations (P<0.05). ns = not significant. The bottom part of each figure indicates the number of samples at risk at ten-year intervals.
Rooted Principal Component Analysis of the discovery cohorts. Two first principal components (PC1 and PC2) are shown for (A) UK-ROI, (B) FinnDiane and (C) GoKinD US. Principal Component Analysis was calculated with EIGENSTRAT software including CEU, YRI and CBT from HapMap II as reference samples.
Top ranked SNPs selected for DN, ESRD vs. non ESRD, and ESRD vs. normoalbuminuria phenotypes.
Clinical characteristics and information on genotyping of the phase two cohorts.
Gene ontology analysis of all genes within ±1 Mbp of top GWAS signals: rs7583877/AFF3; rs12437854/15q26; rs7588550/ERBB4.
Gene expression in early DN versus living donor kidney biopsies. All genes within a 2 Mb window (1 Mb upstream and downstream) of the three main signals (rs7583877/AFF3, rs12437854/15q26, rs7588550/ERBB4) were studied.
Significantly enriched pathways (Genomatix Pathway System) for the ERBB4-correlated genes in early diabetic nephropathy.
Cross-sectional and longitudinal analyses in FinnDiane for rs7583877 (AFF3) and rs12437854 (chromosome 15q26).
Additional kidney phenotype analysis results for the three main loci.
P-value for association with DN related traits for the main signals after combined meta-analysis of DN and ESRD phenotypes. A1 is associated with increasing risk of ESRD/DN.
GENIE GWAS associations for SNPs that have been previously associated with T1D or chronic kidney disease.
Gene set enrichment analysis with MAGENTA. Gene sets with nominal P-value<0.01 for the three analyzed phenotypes.
Number of patients included in the study.
Quality control and filtering for the discovery GWAS data.
Physicians and nurses participating in the collection of the FinnDiane study subjects.
The DCCT/EDIC Study Research Group.
Supplementary Methods: Detailed explanation of employed methods.
We thank Zoe Paschkis for excellent editorial assistance in the preparation of this manuscript.
We acknowledge all the subjects for their participation.
United Kingdom: The Warren 3/UK GoKinD Study Group includes, Belfast: A. P. Maxwell, A. J. McKnight, D. A. Savage; Edinburgh: J. Walker; London: S. Thomas, G. C. Viberti; Manchester: A. J. M. Boulton; Newcastle: S. Marshall; Plymouth: A. G. Demaine, B. A. Millward; Swansea: S. C. Bain. We are grateful to all those who participated in the collections, including clinicians, research nurses and Jill Kilner for excellent technical support.
FinnDiane: We acknowledge the technical assistance of Anna Sandelin, Jaana Tuomikangas, Anna-Reetta Salonen and Tuula Soppela, and the physicians and nurses at each center participating in the collection of patients (Table S13).
Dublin: We are grateful to our colleagues at the Renal Unit, Mater Misericordiae University Hospital, Dublin, Ireland.
France-Belgium: S Brihoual (INSERM CIC0802, Poitiers), N Emery (INSERM U 695, Paris Bichat) for organization help with study samples. N Hager-Vionnet (CNG, Evry) & PJ Saulnier (INSERM CIC 802, Poitiers) for their intellectual input in data analysis. All patients and participating centers of the GENESIS and GENEDIAB studies are greatly acknowledged for their fantastic commitment in these studies (see list in Text S1).
DCCT/EDIC: The authors are grateful to the subjects in the DCCT/EDIC cohort for their long-term participation. Clinical data and DNA from the DCCT/EDIC study will be made available through the National Institute of Diabetes and Digestive and Kidney Diseases repository at https://www.niddkrepository.org/niddk/home.do. The data and samples from the DCCT/EDIC study were supplied by the NIDDK Central Repositories. This manuscript was not prepared under the auspices of and does not represent analyses or conclusions of the NIDDK Central Repositories, or the NIH. List of the persons and Institutes that participated in the DCCT/EDIC Study Research Group is provided in Table S14.
Conceived and designed the experiments: AP Maxwell, JC Florez, JN Hirschhorn, P-H Groop, C Godson, F Martin, G Zerbini, M Marre, S Hadjadj, HF Gu, K Brismar, L Groop, NM Panduru, SC Bain, M Lajer, HM Colhoun, AD Paterson, D Waggott, SB Bull, M Kretzler. Performed the experiments: DA Savage, AJ McKnight, EJ Swan, C Guiducci, DB Mirel, A Taylor, C Godson, N Sandholm, M Parkkonen, J Söderlund, A Syreeni, P Lahermo, K Tryggvason, B He, A-M Österholm, AD Paterson, S Maestroni, S Hadjadj, M Marre, V Rigalleau, R Roussel, F Alhenc-Gelas, P Lefebvre, HF Gu, A Möllsten, T Gu, E Ahlqvist, M Ioana, M Stavarachi, D Cimponeriu, SM Hosseini, M Kretzler, RL Hanson. Analyzed the data: AJ McKnight, GJ McKay, EJ Swan, JC Florez, JN Hirschhorn, C Palmer, RM Salem, N Sandholm, C Forsblom, V-P Mäkinen, EP Brennan, F Martin, S Hadjadj, M Marre, V Rigalleau, R Roussel, F Alhenc-Gelas, P Lefebvre, D-A Tregouet, C Ladenvall, HA Deshmukh, AD Paterson, AP Boright, D Waggott, BJ Keller, H Huang, E Ahlqvist. Contributed reagents/materials/analysis tools: S Prior, SB Bull, GV Gill, AJ McKnight, AP Maxwell, T Isakova, WW Williams, N Sandholm, C Forsblom, V Harjutsalo, AJ Ahola, E Fagerholm, D Gordin, O Heikkilä, K Hietala, J Kytö, M Lehto, V-P Mäkinen, M Rosengård-Bärlund, LM Thorn, N Tolonen, M Saraheimo, A Soro-Paavonen, J Wadén, H Tikkanen, J Tuomilehto, P Lahermo, P-H Groop, J Pitkäniemi, C Sarti, K Tryggvason, B He, A-M Österholm, AD Paterson, DM Sadlier, G Zerbini, A Maestroni, HF Gu, A Möllsten, H Falhammar, E Ahlqvist, M Mota, EM, SC Bain, GV Gill, S Prior, M Lehto, L Tarnow, P Rossing, H-H Parving, HM Colhoun, HA Deshmukh, M Kretzler, RG Nelson, RL Hanson, R Lithovius, K Brismar. Wrote the paper: N Sandholm, RM Salem, AJ McKnight, EP Brennan, C Forsblom, T Isakova, GJ McKay, WW Williams, DM Sadlier, F Martin, JN Hirschhorn, C Godson, JC Florez, P-H Groop, AP Maxwell.
- 1. U S Renal Data System (1-12-0011) USRDS 2011 Annual Data Report. Atlas of Chronic Kidney Disease and End-Stage Renal Disease in the United States, National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD, 2011.
- 2. Ritz E, Zeng XX, Rychlik I (2011) Clinical manifestation and natural history of diabetic nephropathy. Contrib Nephrol 170: 19–27.
- 3. Krolewski AS, Warram JH, Christlieb AR, Busick EJ, Kahn CR (1985) The changing natural history of nephropathy in type I diabetes. Am J Med 78: 785–794.
- 4. de Boer IH, Rue TC, Hall YN, Heagerty PJ, Weiss NS, et al. (2011) Temporal trends in the prevalence of diabetic kidney disease in the United States. JAMA 305: 2532–2539.
- 5. Rosolowsky ET, Skupien J, Smiles AM, Niewczas M, Roshan B, et al. (2011) Risk for ESRD in type 1 diabetes remains high despite renoprotection. J Am Soc Nephrol 22: 545–553.
- 6. Harjutsalo V, Katoh S, Sarti C, Tajima N, Tuomilehto J (2004) Population-based assessment of familial clustering of diabetic nephropathy in type 1 diabetes. Diabetes 53: 2449–2454.
- 7. von Bergh AR, Beverloo HB, Rombout P, van Wering ER, van Weel MH, et al. (2002) LAF4, an AF4-related gene, is fused to MLL in infant acute lymphoblastic leukemia. Genes Chromosomes Cancer 35: 92–96.
- 8. Ma C, Staudt LM (1996) LAF-4 encodes a lymphoid nuclear protein with transactivation potential that is homologous to AF-4, the gene fused to MLL in t(4;11) leukemias. Blood 87: 734–745.
- 9. Melko M, Douguet D, Bensaid M, Zongaro S, Verheggen C, et al. (2011) Functional characterization of the AFF (AF4/FMR2) family of RNA-binding proteins: insights into the molecular pathology of FRAXE intellectual disability. Hum Mol Genet 20: 1873–1885.
- 10. von Bergh AR, Beverloo HB, Rombout P, van Wering ER, van Weel MH, et al. (2002) LAF4, an AF4-related gene, is fused to MLL in infant acute lymphoblastic leukemia. Genes Chromosomes Cancer 35: 92–96.
- 11. Barton A, Eyre S, Ke X, Hinks A, Bowes J, et al. (2009) Identification of AF4/FMR2 family, member 3 (AFF3) as a novel rheumatoid arthritis susceptibility locus and confirmation of two further pan-autoimmune susceptibility genes. Hum Mol Genet 18: 2518–2522.
- 12. Plant D, Flynn E, Mbarek H, Dieude P, Cornelis F, et al. (2010) Investigation of potential non-HLA rheumatoid arthritis susceptibility loci in a European cohort increases the evidence for nine markers. Ann Rheum Dis 69: 1548–1553.
- 13. Barrett JC, Clayton DG, Concannon P, Akolkar B, Cooper JD, et al. (2009) Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes. Nat Genet 41: 703–707.
- 14. Wallace C, Rotival M, Cooper JD, Rice CM, Yang JH, et al. (2012) Statistical colocalization of monocyte gene expression and genetic risk variants for type 1 diabetes. Hum Mol Genet
- 15. Gassmann M, Casagranda F, Orioli D, Simon H, Lai C, et al. (1995) Aberrant neural and cardiac development in mice lacking the ErbB4 neuregulin receptor. Nature 378: 390–394.
- 16. Tidcombe H, Jackson-Fisher A, Mathers K, Stern DF, Gassmann M, et al. (2003) Neural and mammary gland defects in ErbB4 knockout mice genetically rescued from embryonic lethality. Proc Natl Acad Sci U S A 100: 8281–8286.
- 17. Prickett TD, Agrawal NS, Wei X, Yates KE, Lin JC, et al. (2009) Analysis of the tyrosine kinome in melanoma reveals recurrent mutations in ERBB4. Nat Genet 41: 1127–1132.
- 18. Veikkolainen V, Naillat F, Railo A, Chi L, Manninen A, et al. (2012) ErbB4 modulates tubular cell polarity and lumen diameter during kidney development. J Am Soc Nephrol 23: 112–122.
- 19. Zeng F, Zhang MZ, Singh AB, Zent R, Harris RC (2007) ErbB4 isoforms selectively regulate growth factor induced Madin-Darby canine kidney cell tubulogenesis. Mol Biol Cell 18: 4446–4456.
- 20. Berthier CC, Zhang H, Schin M, Henger A, Nelson RG, et al. (2009) Enhanced expression of Janus kinase-signal transducer and activator of transcription pathway members in human diabetic nephropathy. Diabetes 58: 469–477.
- 21. Zuber AM, Centeno G, Pradervand S, Nikolaeva S, Maquelin L, et al. (2009) Molecular clock is involved in predictive circadian adjustment of renal function. Proc Natl Acad Sci U S A 106: 16523–16528.
- 22. Marfella CG, Henninger N, LeBlanc SE, Krishnan N, Garlick DS, et al. (2008) A mutation in the mouse Chd2 chromatin remodeling enzyme results in a complex renal phenotype. Kidney Blood Press Res 31: 421–432.
- 23. Malhotra A, Kobes S, Knowler WC, Baier LJ, Bogardus C, et al. (2011) A genome-wide association study of BMI in American Indians. Obesity (Silver Spring) 19: 2102–2106.
- 24. Kim Y, Kleppel MM, Butkowski R, Mauer SM, Wieslander J, et al. (1991) Differential expression of basement membrane collagen chains in diabetic nephropathy. Am J Pathol 138: 413–420.
- 25. Yagame M, Kim Y, Zhu D, Suzuki D, Eguchi K, et al. (1995) Differential distribution of type IV collagen chains in patients with diabetic nephropathy in non-insulin-dependent diabetes mellitus. Nephron 70: 42–48.
- 26. Britanova O, Lukyanov S, Gruss P, Tarabykin V (2002) The mouse Laf4 gene: exon/intron organization, cDNA sequence, alternative splicing, and expression during central nervous system development. Genomics 80: 31–37.
- 27. Brennan EP, Morine MJ, Walsh DW, Roxburgh SA, Lindenmeyer MT, et al. (2012) Next-generation sequencing identifies TGF-beta1-associated gene expression profiles in renal epithelial cells reiterated in human diabetic nephropathy. Biochim Biophys Acta 1822: 589–599.
- 28. Kalluri R, Weinberg RA (2009) The basics of epithelial-mesenchymal transition. J Clin Invest 119: 1420–1428.
- 29. Krupa A, Jenkins R, Luo DD, Lewis A, Phillips A, et al. (2010) Loss of MicroRNA-192 promotes fibrogenesis in diabetic nephropathy. J Am Soc Nephrol 21: 438–447.
- 30. Morrissey J, Guo G, Moridaira K, Fitzgerald M, McCracken R, et al. (2002) Transforming growth factor-beta induces renal epithelial jagged-1 expression in fibrotic disease. J Am Soc Nephrol 13: 1499–1508.
- 31. Walsh DW, Roxburgh SA, McGettigan P, Berthier CC, Higgins DG, et al. (2008) Co-regulation of Gremlin and Notch signalling in diabetic nephropathy. Biochim Biophys Acta 1782: 10–21.
- 32. Bielesz B, Sirin Y, Si H, Niranjan T, Gruenwald A, et al. (2010) Epithelial Notch signaling regulates interstitial fibrosis development in the kidneys of mice and humans. J Clin Invest 120: 4040–4054.
- 33. Dressler GR (2008) Another niche for Notch. Kidney Int 73: 1207–1209.
- 34. Murphy M, Godson C, Cannon S, Kato S, Mackenzie HS, et al. (1999) Suppression subtractive hybridization identifies high glucose levels as a stimulus for expression of connective tissue growth factor and other genes in human mesangial cells. J Biol Chem 274: 5830–5834.
- 35. Kramer HJ, Nguyen QD, Curhan G, Hsu CY (2003) Renal insufficiency in the absence of albuminuria and retinopathy among adults with type 2 diabetes mellitus. JAMA 289: 3273–3277.
- 36. Perkins BA, Nelson RG, Ostrander BE, Blouch KL, Krolewski AS, et al. (2005) Detection of renal function decline in patients with diabetes and normal or elevated GFR by serial measurements of serum cystatin C concentration: results of a 4-year follow-up study. J Am Soc Nephrol 16: 1404–1412.
- 37. Forsblom C, Harjutsalo V, Thorn LM, Waden J, Tolonen N, et al. (2011) Competing-risk analysis of ESRD and death among patients with type 1 diabetes and macroalbuminuria. J Am Soc Nephrol 22: 537–544.
- 38. Dupuis J, Langenberg C, Prokopenko I, Saxena R, Soranzo N, et al. (2010) New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat Genet 42: 105–116.
- 39. Ehret GB, Munroe PB, Rice KM, Bochud M, Johnson AD, et al. (2011) Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature 478: 103–109.
- 40. Heid IM, Jackson AU, Randall JC, Winkler TW, Qi L, et al. (2010) Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat Genet 42: 949–960.
- 41. Willer CJ, Sanna S, Jackson AU, Scuteri A, Bonnycastle LL, et al. (2008) Newly identified loci that influence lipid concentrations and risk of coronary artery disease. Nat Genet 40: 161–169.
- 42. Dupuis J, Langenberg C, Prokopenko I, Saxena R, Soranzo N, et al. (2010) New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat Genet 42: 105–116.
- 43. Heid IM, Jackson AU, Randall JC, Winkler TW, Qi L, et al. (2010) Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat Genet 42: 949–960.
- 44. Kottgen A, Glazer NL, Dehghan A, Hwang SJ, Katz R, et al. (2009) Multiple loci associated with indices of renal function and chronic kidney disease. Nat Genet 41: 712–717.
- 45. Chambers JC, Zhang W, Lord GM, van der HP, Lawlor DA, et al. (2010) Genetic loci influencing kidney function and chronic kidney disease. Nat Genet 42: 373–375.
- 46. Burren OS, Adlem EC, Achuthan P, Christensen M, Coulson RM, et al. (2011) T1DBase: update 2011, organization and presentation of large-scale data sets for type 1 diabetes research. Nucleic Acids Res 39: D997–1001.
- 47. Kottgen A, Pattaro C, Boger CA, Fuchsberger C, Olden M, et al. (2010) New loci associated with kidney function and chronic kidney disease. Nat Genet 42: 376–384.
- 48. Segre AV, Groop L, Mootha VK, Daly MJ, Altshuler D (2010) Common inherited variation in mitochondrial genes is not enriched for associations with type 2 diabetes or related glycemic traits. PLoS Genet 6 doi:10.1371/journal.pgen.1001058.
- 49. Chambers JC, Zhang W, Lord GM, van der HP, Lawlor DA, et al. (2010) Genetic loci influencing kidney function and chronic kidney disease. Nat Genet 42: 373–375.
- 50. Feehally J, Farrall M, Boland A, Gale DP, Gut I, et al. (2010) HLA has strongest association with IgA nephropathy in genome-wide analysis. J Am Soc Nephrol 21: 1791–1797.
- 51. Rampoldi L, Scolari F, Amoroso A, Ghiggeri G, Devuyst O (2011) The rediscovery of uromodulin (Tamm-Horsfall protein): from tubulointerstitial nephropathy to chronic kidney disease. Kidney Int 80: 338–347.
- 52. Kao WH, Klag MJ, Meoni LA, Reich D, Berthier-Schaad Y, et al. (2008) MYH9 is associated with nondiabetic end-stage renal disease in African Americans. Nat Genet 40: 1185–1192.
- 53. Rosset S, Tzur S, Behar DM, Wasser WG, Skorecki K (2011) The population genetics of chronic kidney disease: insights from the MYH9-APOL1 locus. Nat Rev Nephrol 7: 313–326.
- 54. Stanescu HC, Arcos-Burgos M, Medlar A, Bockenhauer D, Kottgen A, et al. (2011) Risk HLA-DQA1 and PLA(2)R1 alleles in idiopathic membranous nephropathy. N Engl J Med 364: 616–626.
- 55. Kottgen A, Glazer NL, Dehghan A, Hwang SJ, Katz R, et al. (2009) Multiple loci associated with indices of renal function and chronic kidney disease. Nat Genet 41: 712–717.
- 56. Kottgen A, Pattaro C, Boger CA, Fuchsberger C, Olden M, et al. (2010) New loci associated with kidney function and chronic kidney disease. Nat Genet 42: 376–384.
- 57. Rampoldi L, Scolari F, Amoroso A, Ghiggeri G, Devuyst O (2011) The rediscovery of uromodulin (Tamm-Horsfall protein): from tubulointerstitial nephropathy to chronic kidney disease. Kidney Int 80: 338–347.
- 58. Kao WH, Klag MJ, Meoni LA, Reich D, Berthier-Schaad Y, et al. (2008) MYH9 is associated with nondiabetic end-stage renal disease in African Americans. Nat Genet 40: 1185–1192.
- 59. Rosset S, Tzur S, Behar DM, Wasser WG, Skorecki K (2011) The population genetics of chronic kidney disease: insights from the MYH9-APOL1 locus. Nat Rev Nephrol 7: 313–326.
- 60. Stanescu HC, Arcos-Burgos M, Medlar A, Bockenhauer D, Kottgen A, et al. (2011) Risk HLA-DQA1 and PLA(2)R1 alleles in idiopathic membranous nephropathy. N Engl J Med 364: 616–626.
- 61. McKnight AJ, Patterson CC, Pettigrew KA, Savage DA, Kilner J, et al. (2010) A GREM1 gene variant associates with diabetic nephropathy. J Am Soc Nephrol 21: 773–781.
- 62. Syreeni A, El Osta A, Forsblom C, Sandholm N, Parkkonen M, et al. (2011) Genetic examination of SETD7 and SUV39H1/H2 methyltransferases and the risk of diabetes complications in patients with type 1 diabetes. Diabetes 60: 3073–3080.
- 63. Pezzolesi MG, Poznik GD, Mychaleckyj JC, Paterson AD, Barati MT, et al. (2009) Genome-wide association scan for diabetic nephropathy susceptibility genes in type 1 diabetes. Diabetes 58: 1403–1410.
- 64. Pezzolesi MG, Poznik GD, Mychaleckyj JC, Paterson AD, Barati MT, et al. (2009) Genome-wide association scan for diabetic nephropathy susceptibility genes in type 1 diabetes. Diabetes 58: 1403–1410.
- 65. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81: 559–575.
- 66. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, et al. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38: 904–909.
- 67. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81: 559–575.
- 68. Willer CJ, Li Y, Abecasis GR (2010) METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26: 2190–2191.
- 69. Pruim RJ, Welch RP, Sanna S, Teslovich TM, Chines PS, et al. (2010) LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26: 2336–2337.
- 70. Cohen CD, Kretzler M (2002) Gene expression analysis in microdissected renal tissue. Current challenges and strategies. Nephron 92: 522–528.
- 71. Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, et al. (2003) Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res 31: e15.
- 72. Johnson WE, Li C, Rabinovic A (2007) Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8: 118–127.
- 73. Dai M, Wang P, Boyd AD, Kostov G, Athey B, et al. (2005) Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res 33: e175.