Skip to main content
  • Loading metrics

A high throughput, functional screen of human Body Mass Index GWAS loci using tissue-specific RNAi Drosophila melanogaster crosses

  • Thomas J. Baranski ,

    Roles Conceptualization, Data curation, Funding acquisition, Investigation, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing (TJB); (MAP)

    Affiliation Department of Internal Medicine, Division of Endocrinology, Metabolism and Lipid Research, Washington University School of Medicine, St. Louis, Missouri, United States of America

  • Aldi T. Kraja,

    Roles Data curation, Formal analysis, Resources, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Genetics and Center for Genome Sciences and Systems Biology, Division of Statistical Genomics, Washington University School of Medicine, St. Louis, Missouri, United States of America

  • Jill L. Fink,

    Roles Data curation, Resources, Validation, Writing – review & editing

    Affiliation Department of Internal Medicine, Division of Endocrinology, Metabolism and Lipid Research, Washington University School of Medicine, St. Louis, Missouri, United States of America

  • Mary Feitosa,

    Roles Writing – review & editing

    Affiliation Department of Genetics and Center for Genome Sciences and Systems Biology, Division of Statistical Genomics, Washington University School of Medicine, St. Louis, Missouri, United States of America

  • Petra A. Lenzini,

    Roles Data curation, Formal analysis, Software, Writing – original draft, Writing – review & editing

    Affiliation Department of Genetics and Center for Genome Sciences and Systems Biology, Division of Statistical Genomics, Washington University School of Medicine, St. Louis, Missouri, United States of America

  • Ingrid B. Borecki,

    Roles Conceptualization, Funding acquisition, Investigation, Project administration, Supervision, Writing – review & editing

    Affiliation Department of Biostatistics, University of Washington, Seattle, Washington, United States of America

  • Ching-Ti Liu,

    Roles Funding acquisition, Writing – review & editing

    Affiliation Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts, United States of America

  • L. Adrienne Cupples,

    Roles Funding acquisition, Writing – review & editing

    Affiliation Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts, United States of America

  • Kari E. North,

    Roles Funding acquisition, Writing – review & editing

    Affiliation Department of Epidemiology, University of North Carolina, Chapel Hill, North Carolina, United States of America

  • Michael A. Province

    Roles Investigation, Methodology, Project administration, Supervision, Writing – original draft (TJB); (MAP)

    Affiliation Department of Genetics and Center for Genome Sciences and Systems Biology, Division of Statistical Genomics, Washington University School of Medicine, St. Louis, Missouri, United States of America


Human GWAS of obesity have been successful in identifying loci associated with adiposity, but for the most part, these are non-coding SNPs whose function, or even whose gene of action, is unknown. To help identify the genes on which these human BMI loci may be operating, we conducted a high throughput screen in Drosophila melanogaster. Starting with 78 BMI loci from two recently published GWAS meta-analyses, we identified fly orthologs of all nearby genes (± 250KB). We crossed RNAi knockdown lines of each gene with flies containing tissue-specific drivers to knock down (KD) the expression of the genes only in the brain and the fat body. We then raised the flies on a control diet and compared the amount of fat/triglyceride in the tissue-specific KD group compared to the driver-only control flies. 16 of the 78 BMI GWAS loci could not be screened with this approach, as no gene in the 500-kb region had a fly ortholog. Of the remaining 62 GWAS loci testable in the fly, we found a significant fat phenotype in the KD flies for at least one gene for 26 loci (42%) even after correcting for multiple comparisons. By contrast, the rate of significant fat phenotypes in RNAi KD found in a recent genome-wide Drosophila screen (Pospisilik et al. (2010) is ~5%. More interestingly, for 10 of the 26 positive regions, we found that the nearest gene was not the one that showed a significant phenotype in the fly. Specifically, our screen suggests that for the 10 human BMI SNPs rs11057405, rs205262, rs9925964, rs9914578, rs2287019, rs11688816, rs13107325, rs7164727, rs17724992, and rs299412, the functional genes may NOT be the nearest ones (CLIP1, C6orf106, KAT8, SMG6, QPCTL, EHBP1, SLC39A8, ADPGK /ADPGK-AS1, PGPEP1, KCTD15, respectively), but instead, the specific nearby cis genes are the functional target (namely: ZCCHC8, VPS33A, RSRC2; SPDEF, NUDT3; PAGR1; SETD1, VKORC1; SGSM2, SRR; VASP, SIX5; OTX1; BANK1; ARIH1; ELL; CHST8, respectively). The study also suggests further functional experiments to elucidate mechanism of action for genes evolutionarily conserved for fat storage.

Author summary

Human Genome Wide Association Studies have successfully found thousands of novel genetic variants associated with many diseases. While these undoubtedly point to new biology, the field has been slowed in exploiting these new findings to reach a better understanding of exactly how they confer increased risk. Many, if not most, appear to be regulatory not coding variants, so their immediate consequence is not obvious. A real rate limiting step is even identifying which gene these variants might be regulating, and in what tissues they are operating to increase disease risk. In the absence of any other information, a first order assumption is that they may be more likely to be regulating a nearby gene, and such variants are often initially annotated by the “nearest” gene until their function is more definitively validated. Exploiting the idea that many genes may have conserved function across species, we conducted a high-throughput screen of fruit-fly orthologs of human genes nearby 78 well validated GWAS variants for human obesity, in order to more precisely identify the gene(s) of action. We systematically knocked down the function of each of these nearby genes in the brain and fat-body of the flies, raised them on a standard diet, and compared their percent body fat with control flies, in order to validate which genes showed a fat response. 43% of the time when fly orthologs existed in the region, we were able to identify the causal gene. Interestingly, nearly half the time (46%), it was not the nearest gene but another nearby one that regulated fat.


Human Genome-Wide Association Scans (GWAS) have been successful in discovering many genetic loci that are significantly associated with Body Mass Index (BMI). These associations have been replicated across consortia consisting of many large and independent studies, sometimes numbering into hundreds of thousands of subjects. From a statistical point of view, the evidence that single nucleotide polymorphisms (SNPs) are tagging something real is overwhelming. However, progress has been slowed in moving beyond the discovery phase to a deeper understanding the biological significance of these findings, due to difficulties isolating the driving causal variants or even identifying the acting genes tagged by these GWAS variants. Most of the findings are not in gene coding regions. Indeed, many are intergenic, suggesting that much of the underlying modes of action of these loci may be regulatory. Unfortunately, our limited biological understanding of the regulome has hampered further progress.

While the field has begun serious annotation of the regulatory regions of the genome with initiatives and resources such as ENCODE, RoadMap and GTEx, the annotation is still far from complete and the answers that are emerging are complex. As a result, many publications annotate the statistically significant SNPs simply with the “closest” gene, even though trans-acting regulatory sequences certainly exist[1], and some enhancers have been shown to regulate multiple genes[2]. Recently, it was reported that rs1421085 T-to-C intron of the well-known obesity-associated FTO gene, disrupts a conserved motif for the ARID5B repressor, which leads to derepression of a potent preadipocyte enhancer and a doubling of the transcription factors IRX3 and IRX5 expression during early adipocyte differentiation. Irx3-deficient mice showed a 25–30% reduction of body weight, primarily through the loss of fat mass and increase in basal metabolic rate. Hypothalamic expression of a dominant-negative form of Irx3 reproduces the metabolic phenotypes of Irx3-deficient mice. Thus, IRX3 has been suggested as a functional long-range target of obesity-associated variants within FTO and represents a novel determinant of body mass and composition, by regulating the process of thermogenesis as they can prevent the process in which energy is turned into heat, thus stored as fat [3, 4].

The above research supports the idea that an intronic location of an associated SNP does not even establish that the genetic effect is on that gene. Carrying out functional mapping of these GWAS-associated regions can provide valuable information to sort out which genes are causal to adiposity, and possibly provide biological insight into their action. While mouse models of obesity can serve as powerful platforms to functionally probe a small number of candidate genes, this approach is expensive and time consuming, limiting the number of genes that can be readily assessed. However, many important biochemical pathways involved in growth, metabolism, fat storage and retrieval are ancient and are therefore well conserved across the animal kingdom from C. elegans and Drosophila to rodents and humans. For example, forward genetic screens in C. elegans and Drosophila have identified conserved genes that regulate triglyceride storage[5, 6]. Readily available genetic tools in Drosophila, including mutations and inducible RNA interference (RNAi), coupled with the short life span, offer the opportunity for high-throughput functional screening of candidate genes, such as those proximal to GWAS putatively regulatory variants. A Drosophila genetic approach was recently used to follow up a small-scale GWAS for Alzheimer pathology[7] and type 2 diabetes mellitus and related metabolic disorders[8]. Capitalizing on this approach, we conducted a high throughput functional screen in Drosophila of all nearby genes to 78 BMI SNPs from two recently published GWAS meta-analyses to see if we could make progress in identifying the possible genes of action for these novel loci, as outlined in Fig 1.

Fig 1. Experimental design of the Drosophila BMI loci functional screen.

WT = Corresponding RNAi Wild Type (as detailed in Fig 3).


In Table 1, we show the specific qualitative results of the Drosophila screen for each of the 78 index BMI SNPs (detailed results for each gene are given in S1 Table). For 16 (21%) of the BMI loci, the 500-kb region around the locus could not be interrogated with our Drosophila screen, as none of those genes had a fly ortholog. Details for these unscreened loci and regions are listed in Table 1. For the remaining 62 (79%) loci, at least one gene in the 500-kb region had a fly ortholog which could be KD in our screen. Details of the results for these 62 loci are given in Table 1. In Table 1, we show the lead SNP identified by GWAS, its BMI association P-value (as previously reported), the first author of the reference paper identifying that SNP association, its position (per db147), the nearest gene, the annotation of the role of the SNP (if within the nearest gene), and finally we list all other genes within 250 kb radius of the lead SNP. From Table 1, a total of 439 genes were found within the queried intervals of these 78 SNPs. 224 (51%) of those genes had fly orthologs with transgenic RNAi stocks available. Additionally, 30 (13%) of the RNAi KD crosses were lethal (or pupal lethal), and thus could not be evaluated for %BF phenotype. Overall, 36 RNAi gene KDs showed significantly higher or lower %BF at adulthood compared to controls, using Dunnett’s Multiple Comparison test, which accounts for multiple comparisons with common controls.

Table 1. Results of the functional drosophila screen for all genes nearby (± 250 kb) the 78 BMI-associated GWAS loci from speliotes et al. 2010[30] and locke et al. 2015[31].

The results for the screened loci from Table 1 are summarized in Fig 2. In Fig 2A, we show that 26 of the 62 screened loci (42%), have at least one significant KD phenotype gene in the nearby region (± 250 kb). For 18 of these loci, there was only one gene significant, whereas we found 2 significant genes producing phenotypes in 6 regions and 3 significant genes producing phenotype in 2 of the regions (for a total of 36 significant genes).

Fig 2. Summary of Drosophila functional scan for the 61 BMI GWAS loci that could be tested in the fly.

Number of GWAS BMI loci for which nearby genes were validated in RNAi KDs in Drosophila. (a) Distribution of Number of Significant Fly KD Genes per BMI Locus Region. Significance determined by Dunnett’s Multiple Comparisons Test. (b) Number of Significant Fly KD Genes per BMI Locus by Proximity to SNP. Significance determined by Dunnett’s Multiple Comparisons Test. CNBT = Could Not Be Tested in Fly (either No Fly Ortholog or KD lethal).

Since the nearest gene is commonly used to annotate a significantly associated SNP, we show the yield from our screen for proximal vs. more distal genes in Fig 2B. Considering only the nearest genes: 45 (74% of the original 78 loci) had fly orthologs that could be phenotyped, while 16 (26%) either had no fly ortholog, or the KD was lethal. Of the 45 nearest genes that could be assessed in the fly, 11 (24%), tested as having significantly different %BF than controls by Dunnett’s Multiple Comparison test. For 9 of these 11 significant loci, no other gene tested in the 500kb region showed a significant phenotype, suggesting that the nearest gene may indeed represent the functional target of the original GWAS SNP. These 9 genes are: ELAVL4, ERBB4, FOXO3, PARK2, RALYL, NT5C2, HSD17B12, NRXN3 and SBK1. For 3 of the 11 significant nearest genes, one (TCF7L2) had an additional nearby gene (within 250kb radius of the BMI SNP) with a significant phenotype (VTI1A); another significant nearest gene (TAL1) had two neighboring genes (within 250KB radius of the BMI lead-SNP) with significant phenotypes (CYP4A22, FOXD2/FOXD2-AS1); and the third significant nearest gene (HSD17B2) had no additional nearby candidates that had a fly ortholog.

Interestingly, for 5 BMI-associated SNPs, the nearest gene was testable in the fly, but did not show a significant difference in %BF from control when knocked down. However exactly one other gene in the 500-kb region of the lead SNP did show significant %BF differences from control. These 5 significant genes are OTX1, BANK1, ARIH1, ELL, and CHST8. Our experiment would suggest that the corresponding lead SNPs may be tagging one or more regulatory elements functionally operating on these more distal genes and not the most proximal ones. Additionally, there were 5 SNPs where the closest gene could not be screened in the fly (either because there is no ortholog RNAi line available, or the KD is lethal), but one or more genes in the 500-kB region showed a significant %BF change from control. These 6 significant genes from the 5 regions are YPEL3, PAGR1, RACGAP1, PDK4, ZNF704, CLUAP1. These may also be regions where the more distal gene is the target (but the evidence is less clear, since the nearest gene could not be tested in the fly). Finally, there were 8 SNPs for which multiple genes in the 500-kb region showed a significant KD phenotype, suggesting that perhaps the original SNP finding may be reflecting combined signals from multiple obesity genes.

Quantitative details of the results for the 36 significant genes are shown in Fig 3. We display the mean and standard deviation of %BF at adulthood for each gene KD, along with box-plots depicting the distributions of the data. For each of the KD gene lines, we give the corresponding human gene, and identify which fly control background (GD or KK) was used to assess significance, under the same feeding and environmental lab conditions. The largest significant changes were found for the SPDEF ortholog (Ets98B) KD, which showed an average 2.46-fold increase in %BF, and for the ZNF704 ortholog (Glut4EF) KD, which showed a 5-fold decrease in %BF. As can be seen, the experiment was able to detect significant increases of at least 1.5 fold and decreases of at least 1.8 fold in specific KD lines.

Fig 3. Percent body fat distributions for the 36 RNAi knock down Drosophila gene crosses, testing significantly different from corresponding Wild Type control, by Dunnett’s multiple comparisons test.

In Fig 3, we also annotate the major classes of genes demonstrating significant %BF changes. Three of the 36 significant genes are DNA methyltransferases (SETD1A, PAGR1, and YPEL3), 9 are transcription factor genes (ZNF704, TAL1, TCF7L2, FOXO3, SIX5, ZCCHC8, OTX1, FOXD2, and SPDEF), 7 are metabolism enzymes (NT5C2, CYP4A22, CHST8, VKORC1, NUDT3, HSD17B12, and SRR), and 7 vesicular transport genes (VTI1A, CLAP1, VASP, RACGAP1, SGSM2 and NRXN3), and the remaining 9 have different functions (RSRC2, SBK1, PARK2, BANK1, RALYL, ELAVL4, PDK4, ARIH1, and ERBB4) shown in Fig 3.

Functional validation in the mouse

To further validate the Drosophila functional results, we queried several bioinformatic resources, including the Mouse Genome Informatics (MGI) website (, which catalogues publication results of mouse experiments, as well as the International Mouse Phenotyping Consortium (IMPC) website (, which also contains unpublished as well as published extensive phenotype characterizations screens of knock out (KO) experiments (the results are summarized in S2 Table). At the IMPC website, at this time, there are results of whole organism knockdown experiments for only 8 of our 36 human-fly obesity genes (many more are planned in the future). One of these, SBK1, was pre-weening lethal as a whole organism knockdown, and thus could not be evaluated. For the other seven, there were extensive phenotypic characterizations of adult mice, including Dexa fat mass evaluations. Three of these genes showed highly statistically significant fat mass differences between the KO and WT mice: SETD1A (P = 2.46E-08), TCF7L2 (P = 5.97E-10) and FOXO3 (P = 3.41E-05) while two were nearly statistically significant different from WT: ZNF704 (P = 0.086) and YPEL3 (P = 0.082) and two showed no differences from WT (CLUAP1 and PDK4). Thus, 5 of the 8 of our genes that have thus far been interrogated by whole body KO in mice in the IMPC confirm our results. Obesity-related phenotypes have been previously published for two of the three most significant genes. TCF7L2 KO mice were shown to be leaner and have improved glucose tolerance[9], and further, TCF7L2 was shown to negatively regulate adipocyte differentiation[10]. FOXO3 has been shown to be downregulated in the brains of high-fat diet induced obese mice[11] and mRNA of FOXO3 levels were associated with chicken growth traits, including fat body weight[12]. The MGI website listed confirmatory mouse-obesity evidence for several of the same genes as the IMPC, but no additional ones. In literature search of PubMed, we found suggestive or validation evidence for 5 more of the 36 human-fly obesity genes. Whole body siRNA of NTSC2 in mice showed increased lipolysis [13] and VTI1A was shown to interact with GLUT4 in adipocytes in mice[14]. More directly, PARK2 KO mice show decreased fat absorption and are leaner on high-fat diets[15]. VASP KO mice have reduced body weight and increased brown adipocytes[16]. They also show increased triglyceride accumulation in liver[17]. Finally, mouse expression and fly KDs confirm our findings for NUDT3 by Williams et al. 2015[18]. NUDT3 was significantly up regulated in the reward and feeding related regions of the hypothalamus and amygdala of the mouse brain. In all, we found validation evidence supporting obesity phenotypes in mice for 10 of the 36 genes found in our fly validation screen of cis genes near human BMI loci (ZNF704, SETD1A, VTI1A, TCF7L2, FOXO3, NUDT2, PARK2, VASP, and YPEL3).

However, it should be noted that few of the published or unpublished catalogued experiments, if any, actually reproduce the exact conditions of our fly screen. The IMPC KO screen, as well as most of the published mouse data available are on whole organism knock outs, whereas in our fly screen, we used tissue-specific drivers to confine the knock down only to the brain and to adipose tissue. This is an extremely important difference. Whole organism knock out animals typically experience a wide range of phenotypes across many systems and organs, and in some cases, very severe defects (some are even lethal). Thus, the lack of concordance in obesity phenotypes in such experiments does not mean that our fly experiment has been properly tested for replication in the mouse and it failed. Rather, the fact that we already have found suggestive or strong evidence for 10 of the 36 of the fly genes in the mouse, demonstrates the utility of using the fly as a high-throughput functional screen to help us identify which genes the non-coding human GWAS statistical loci might be regulating, as an important step to moving from statistical association to mechanism of action.

eQTL validation in human tissues

We used the GTEX resource to see if there is any evidence that the 78 BMI loci are eQTLs for any of the 36 human-fly obesity genes, in the relevant human tissues (S2 Table). We considered 6 human tissues available in GTEX: Adipose Subcutaneous, Adipose Omentum, and Liver (corresponding to the fly fat body), and Brain Hypothalamus, Brain Hippocampus and Pancreas (corresponding to the fly brain—the last because clusters of cells in the brain of flies secrete insulin). We find 19 of the 36 gene-loci pairs show significant eQTL evidence in at least one relevant human tissue.


There is a vast literature on the genetics of fat in Drosophila, using many different experimental approaches [1921]. In fact, we find 1,198 references to [“Drosophila” AND “fat” AND “genetics”] in Pubmed. Some of this literature shows some of the same genes regulating fat storage as we find in our human-to-fly screen (e.g. FOXO3[22]). However, our primary goal here is not to add to knowledge about Drosophila biology per se, but rather to use the fly model as an efficient screen to help the human genetics field move from GWAS statistical “loci” of largely unknown function, to identification of the gene of action, in cases where the current human annotation is unclear or ambiguous.

In a similar experiment to the one we report here, in which the goal was to elucidate GWAS findings for diabetes, Baranski et al. studied 38 human genomic regions in which SNPs were associated with type 2 diabetes and other related metabolic disorders [8]. Knock-Down of 34 candidate genes resulted in sugar-dependent lethality, including HHEX, THADA, PPARG, KCNJ11. For 23 regions, the KD of at least one candidate gene resulted in sugar-dependent toxicity [23]. Moreover, at several loci more than a single candidate gene demonstrated phenotypes when knocked down, suggesting that the SNP marked a region where several genes with similar function reside. These analyses demonstrated the utility and feasibility of using Drosophila melanogaster KD as an experimental model for testing functionality of orthologous human genes. Revising these methodologies to investigate adiposity, we have evaluated the functional effects of KD of cis-candidate genes based on BMI-associated SNPs robustly identified by GWAS.

For 26 of the 62 (42%) BMI loci that could be screened in the fly, we found at least one gene within the +250KB radius that showed a significant change in %BF in the fly ortholog KD compared to control (for a total of 36 significant genes). By contrast, large screens of metabolism phenotypes by RNAi screens of Drosophila typically identify 5–10% candidate genes, not correcting for multiple comparisons. For example, Ugrankar et al (2015)[24] found 61 of about 650 (9%) random RNAi transgenes resulted in significant glucose elevations (S1 Table) by student t-test, p < 0.05. More relevant to our human BMI candidate region screen, in an RNAi adiposity genome-wide screen of the fly, Pospisilik et al. (2010)[6], found ~500 functional genes for adiposity from screening 10,500 open reading frames. (~5%), also using student t-test, p < 0.05 for significance (without correcting for multiple comparisons). Thus, our findings (even after multiple comparisons corrections) are considerably enriched compared to background by screening the regions containing of BMI loci. Further, the GTEx resource ( confirms that all 36 of the human genes for which the corresponding fly ortholog KD resulted in a significant phenotype (as shown in Fig 3), are expressed in either the brain or adipose tissue in humans. The large number of positive findings suggest that much of the biological machinery to store and retrieve fat is conserved deep in the evolutionary tree, which makes a strong argument for the utility of this kind of high throughput functional genetic screening strategies for evolutionarily conserved genes in simpler model organisms.

Some loci regions contained more than one functional gene for adiposity. For two of the index BMI SNPs, our experiment identified three nearby genes that showed %BF KD changes in the fly. The first is rs977747, for which the closest human gene is TAL1 (the SNP is in the 3’-utr of TAL1—Table 1, locus 1). The KD of the fly ortholog of TAL1 showed a significant decrease in %BF compared to control, as did the KD for the ortholog of the nearby CYP4A22 gene. In contrast, the KD for the nearby FOXD2 gene showed a significant increase in %BF. In MCF7 cells, CYP4A22-AS1 expression has been shown to stimulate TAL1 gene expression[25]. If this holds in either the brain or fat body of the fly (as well as in the human), it could be that the KD of CYP4A22 (ortholog) results in reduced TAL1 (ortholog) expression, which would result in a similar phenotype to the TAL1 KD. By contrast, FOXD2 KO mice have been shown to have decreased PKA expression[26]. Functional studies implicate cAMP–PKA in initiation of vascular and hematopoietic differentiation of embryonic stem cells via recruitment of the transcriptional activator cAMP response element–binding protein (CREB) to the Etv2 promoter, resulting in up-regulation of among other genes, TALl1[27]. This would be consistent with the idea that KD of FOXD2 might in turn reduce PKA expression, which would in turn overexpress TAL1, resulting in an opposite phenotype to the KD for TAL1 (increase rather than decrease in %BF). On the other hand, both TAL1 and FOXD2 are transcription factor genes, and FOXD2 is listed as one of the many targets of TAL1, so the direction of causality may be from TAL1 to FOXD2. Another explanation, would be that all 3 of these genes operate independently on %BF, and in the human, the proximity of the genes in the genome means that the lead SNP is tagging a common haplotype with critical regulatory variants for each. These are testable hypotheses to follow up in future experiments.

Many of the positive findings (11 of the 26 regions with at least one significant KD gene, 42%) implicate the influence of the nearest gene to the BMI lead SNP on adiposity in the fly. As our KDs were specific only to brain and adipose tissue, these results provide powerful preliminary data for further detailed experiments on the mechanism of action for these genes as the drivers of the human BMI findings.

But perhaps the most scientifically useful outcome from our screen is the identification of 10 human BMI regions where the closest gene to the BMI locus was testable in the fly and did not show a fat phenotype, but one or more other nearby genes did show a significant fat phenotype (Table 1). This “non-nearest gene” case represents 16% of the human BMI loci that could be screened in the fly. Furthermore, our screen identifies one or more strong, alternative candidates for additional functional study. Specifically, as shown in Table 1, our screen would suggest that for human BMI SNPs rs11057405, rs205262, rs9925964, rs9914578, rs2287019, rs11688816, rs13107325, rs7164727, rs17724992, and rs299412, the functional genes may NOT be the nearest ones (CLIP1, C6orf106, KAT8, SMG6, QPCTL, EHBP1, SLC39A8, ADPGK /ADPGK-AS1, PGPEP1, KCTD15, respectively), as these were all negative in our Drosophila screen. Instead, our screen suggests that different nearby cis gene may be more fruitful for further functional follow up for obesity, namely ZCCHC8, VPS33A, RSRC2; SPDEF, NUDT3; SETD1, VKORC1; SGSM2, SRR; VASP, SIX5; OTX1; BANK1; ARIH1; ELL; CHST8, respectively. Furthermore, there are an additional 5 BMI SNP regions, for which the nearest gene could not be tested in the fly (no ortholog or no available KO RNAi Fly line), but one or more nearby gene(s) did show a significant KD phenotype in the fly. These findings would suggest that for these 5 BMI SNPs rs4787491, rs7138803, rs6465468, rs16907751, and rs758747, it might be worth investigating the nearby genes YPEL3/PAGR1, RACGAP1, PDK4, ZNF704 and CLUAP1 (respectively) instead of (only) the nearest genes INO80E, BCDIN3D/BCDIN3-AS1, ASB4, ZBTB10 and NLRC3 (respectively).

For two of the 10 cases where a distal gene was significant rather than the nearest gene, and one of the 5 for which the nearest gene could not be tested in the fly, bioinformatic databases confirm that the SNP is an eQTL for the significant distal gene in humans. BMI SNP rs9914578 is intronic to gene SMG6, but that gene’s fly ortholog did not show a significant fly phenotype. However, that SNP is an eQTL for the distal gene SRR in adipose tissue (P = 8.1e-7) according to GTEX, and this is one of our 36 significant genes. Similarly, BMI SNP rs4787491 is intronic to gene INO80E (which had no available KO RNAi Fly Line), but the SNP is an eQTL for YPEL3 in both adipose tissue (P = 3.0e-26) as well as in the human brain (P = 1.6e-7) according to GTEX, and that gene is significant in our fly screen. Finally, BMI SNP rs11688816 is intronic to EHBP1, but this gene’s ortholog KD did not show a significant change in %BF (1.28+0.19 vs.1.35±0.45 for control). However, the nearby OTX1 ortholog KD did show a significant %BF phenotype compared to control (2.70±0.17 vs. 1.35±0.45, respectively). The NESDA NTR Conditional eQTL Catalog ( which provides eQTL results from human whole blood expression studies, confirms that this BMI SNP is a significant cis eQTL for OTX1 (False Discovery Rate<1.3e-5), but is not a significant eQTL for EHBP1 itself[28]. Also, Westra et al (2013)[29] found that rs11688816 regulates OTX1 with a P = 1.8E-24 in whole blood. This evidence in humans combined with our RNAi tissue specific KD results in the fly, strongly suggest that the nearby distal genes, not the ones in which the SNP actually resides, are the genes which are conferring obesity risk in humans.

In summary, our Drosophila screen has demonstrated evidence that a high percentage of human obesity loci may be evolutionary conserved down to the insect (33%: 26 out of 78). For more than a third of these (N = 10), we found that the nearest gene to the BMI lead SNP did not seem to be the one that was functional in the fly, but one or more nearby genes were functional in the fly. Furthermore, functions of the genes identified in the fly affecting %BF are important in relation to brain, glucose and fat metabolism, cell proliferation and growth and contributing to transcription regulation. We have therefore identified specific, novel, better motivated biological targets for further study in the study of the genetic architecture of obesity. For those genes that are conserved in the insect, our study points the way towards further experimental approaches to more clearly define the mechanisms of action for loci already demonstrated to be relevant for humans.


The interpretation of the effect of a candidate gene is straightforward when a human gene—single fly ortholog exist. But ~30% of human genes do not have fly orthologs and therefore cannot be evaluated. The gene ontology matches between human and Drosophila genes in many cases is difficult, especially when many human homologs exist for a single Drosophila gene or there are many Drosophila orthologs (many-to-many). For these cases, we selected the best ortholog for KD; but assigning observed functions to specific genes was more difficult. The sensitivity of the model system might be limited due to the how accurately Drosophila models human obesity as measured via BMI. Also, our tissue-specific KDs interrogated gene effects only in the brain and the fat body, so we would miss effects that operated through other organs or tissues. While many fundamental processes in energy regulation are likely to be conserved, there will be complexities of human physiology that are not modeled well in insects. For selected genes, future studies will require validation in mammalian models.

Materials and methods

We show in Fig 1 the outline of our experimental design. In Step 1, we began with a list of 78 BMI-associated loci seen in preliminary analyses of our own studies which were ultimately published in two comprehensive GWAS meta-analysis publications: Speliotes et al. 2010[30] and Locke et al. 2015[31]. All 78 loci, marked by the most significantly associated “lead” SNP, had reached genome-wide significance levels of P<10−8 in meta-analyses of multiple cohorts. In Step 2, we identified all (human) genes within 250kb (in either direction) of the 78 significant BMI associated lead SNPs, according to dbSNP build 147 of assembly GRCh37/HG19. In Step 3, using NCBI’s Entrez gene for identifying human gene symbols (, we identified the closest Drosophila orthologs to each of these human genes according to DIOPT—DRSC Integrative Ortholog Prediction Tool ([32], as well as pairwise alignments generated using BLAST of NCBI (for example: In Step 4, we obtained available RNAi knockdown lines for each of the fly orthologs from Step 3, and crossed these with flies containing tissue-specific drivers to knock down the expression of the genes only in the brain and the fat body. Finally, in Step 5, we raised the flies on a control diet, and compared the amount of fat/triglyceride in the tissue-specific KD group compared to the driver-only control flies.

Drosophila melanogaster model

Fly stocks.

RNAi stocks (listed in S1 Table) were acquired from the Vienna Drosophila Resource Center, as well as genetic background controls w1118 (for GD lines, VDRC #60000) and yw1118; P[attP, y+, w3’]VIE-260B (for KK lines, VDRC #60100) [24], Cg-GAL4 (BDSC #7011). Preference was given to KK lines, in which UAS-RNAi hairpins have been targeted to a landing site in Chromosome 2. The GD lines insert the UAS-RNAi at random positions in chromosomes X, 2, or 3. For three candidate genes (ARIH1, VIL1, and ZC3H4), the only lines available were GD in which the UAS-RNAi mapped to the X-chromosome. For these three lines, only female offspring carried the UAS-RNAi hairpin, therefore females were analyzed for metabolic studies. To increase the extent of KD, we crossed Cg-GAL4 with a UAS-Dcr stock to generate UAS-Dcr2; cgGAL4 stock that was used in all crosses to transgenic RNAi lines. Crosses were allowed to lay eggs for set periods to control for larval numbers. Flies were housed in temperature and humidity regulated incubators and kept in the dark for the entire experiment.

Fly media.

We modified a commonly used Drosophila semi-defined medium as previously described [10]. Briefly, we replaced all added sugars in the recipe (glucose and sucrose) with 51.3 g/L sucrose to yield 0.15 M sucrose.

Metabolic studies.

Triglycerides were measured using the Infinity Triglycerides Reagent kit (Thermo Fisher #TR22321) on whole-insect homogenates. Ten animals were homogenized in PBS + 0.1% Tween and heated for 5 minutes at 65°C to inactivate lipases. 2 ml of this homogenate was mixed with 198 ml of Thermo Infinity Triglyceride Reagent and analyzed as per the manufacturer’s instructions. Non-esterified fatty acids were extracted with chloroform and methanol (Marshall et al., 1999), and analyzed as per the manufacturers’ instructions [NEFA-HR(2), Wako Chemicals, Richmond, VA]. Per-animal mass was measured by weighing groups of 10 animals. Each value represents at least 10 independent determinations.

Tissue specific KD: In pilot studies, we performed KD of 18 candidate genes using tissue specific drivers (Cg-GAL4; expresses in fat body and brain, R4-GAL4; expresses in fat body and midgut) or ubiquitous drivers (actin-GAL4 and da-GAL4). We validated expression of the drivers by crossing to UAS-lacZ (S1 Fig, demonstrates brain expression by Cg-GAL4). Triglyceride content was determined in wandering male or female L3 larvae and three-day old adult male or female flies. The largest effects of KD were observed in male adult flies using the Cg-GAL4 tissue specific drivers. Therefore, whenever possible, we used this driver KD of all candidate genes in the brain and fat body, and measured the effects of gene KD adult male flies. Given the known roles of adipose tissue and hypothalamus in regulation of appetite and metabolism, it is perhaps not surprising that the Cg-GAL4 driver produced the most marked phenotypes on percent body fat for KD candidate genes. For 3 target genes (ARIH1, VIL1, and ZC3H4), RNAi gene lines for males were not available, so we tested female flies against female controls.

Statistical analyses

The quantitative percent body fat (%BF) distributions in the adult flies were tested against the corresponding controls using an analysis of variance model as implemented in PROC GLM (SAS v 9.4.), with Dunnett’s Multiple Comparisons test, which corrects for multiple comparisons when common control sets are used for multiple experimental conditions, to provide a 5% experiment-wise error rate.

Supporting information

S1 Table. An excel spreadsheet showing all of the data for this experiment, including the initial GWAS BMI SNPs, all nearby genes, their fly orthologs, and the results of the KDs for each othorlog.


S2 Table. An excel spreadsheet showing validation of 36 significant BMI SNP-Gene pairs in human eQTL expression, in mouse KD experiments, and in previous literature.


S1 Fig. Confirmation of Cg-GAL4 expression in Drosophila brain, using Cg-GAL4 crossed to UAS-lacZ.

Two biological replicates are shown for each of larval and adult brains.



We would like to thank Laura Musselman (Binghamton University, State University of New York) for beta-galactosidase staining of larval and adult brains.


  1. 1. Visel A RE, Pennacchio LA. Genomic views of distant-acting enhancers. Nature. 2009:199–205. pmid:19741700
  2. 2. Mohrs M BC, Wang ZE, Loots GG, Afzal V, Hadeiba H, Shinkai K,. Deletion of a coordinate regulator of type 2 cytokine. Nat Immunol. 2001:842–7.
  3. 3. Smemo S, Tena JJ, Kim K-H, Gamazon ER, Sakabe NJ, Gomez-Marin C, et al. Obesity-associated variants within FTO form long-range functional connections with IRX3. Nature. 2014;507(7492):371–5. pmid:24646999
  4. 4. Claussnitzer M, Dankel SN, Kim KH, Quon G, Meuleman W, Haugen C, et al. FTO Obesity Variant Circuitry and Adipocyte Browning in Humans. N Engl J Med. 2015;373(10):895–907. pmid:26287746; PubMed Central PMCID: PMCPMC4959911.
  5. 5. Ashrafi K, Chang FY, Watts JL, Fraser AG, Kamath RS, Ahringer J, et al. Genome-wide RNAi analysis of Caenorhabditis elegans fat regulatory genes. Nature. 2003;421(6920):268–72. pmid:12529643.
  6. 6. Pospisilik JA, Schramek D, Schnidar H, Cronin SJ, Nehme NT, Zhang X, et al. Drosophila genome-wide obesity screen reveals hedgehog as a determinant of brown versus white adipose cell fate. Cell. 2010;140(1):148–60. pmid:20074523.
  7. 7. Shulman JM, Chipendo P, Chibnik LB, Aubin C, Tran D, Keenan BT, et al. Functional screening of Alzheimer pathology genome-wide association signals in Drosophila. American journal of human genetics. 2011;88(2):232–8. pmid:21295279; PubMed Central PMCID: PMC3035702.
  8. 8. Pendse J, Ramachandran PV, Na J, Narisu N, Fink JL, Cagan RL, et al. A Drosophila functional evaluation of candidates from human genome-wide association studies of type 2 diabetes and related metabolic traits identifies tissue-specific roles for dHHEX. BMC genomics. 2013;14:136. pmid:23445342; PubMed Central PMCID: PMC3608171.
  9. 9. Yang H, Li Q, Lee JH, Shu Y. Reduction in Tcf7l2 expression decreases diabetic susceptibility in mice. Int J Biol Sci. 2012;8(6):791–801. Epub 2012/06/22. pmid:22719219; PubMed Central PMCID: PMCPMC3372883.
  10. 10. Tian L, Song Z, Shao W, Du WW, Zhao LR, Zeng K, et al. Curcumin represses mouse 3T3-L1 cell adipogenic differentiation via inhibiting miR-17-5p and stimulating the Wnt signalling pathway effector Tcf7l2. Cell Death Dis. 2017;8(1):e2559. Epub 2017/01/20. pmid:28102847; PubMed Central PMCID: PMCPMC5386366.
  11. 11. Zemva J, Schilbach K, Stohr O, Moll L, Franko A, Krone W, et al. Central FoxO3a and FoxO6 expression is down-regulated in obesity induced diabetes but not in aging. Exp Clin Endocrinol Diabetes. 2012;120(6):340–50. Epub 2011/12/22. pmid:22187289.
  12. 12. Chen B, Xu J, He X, Xu H, Li G, Du H, et al. A Genome-Wide mRNA Screen and Functional Analysis Reveal FOXO3 as a Candidate Gene for Chicken Growth. PloS one. 2015;10(9):e0137087. Epub 2015/09/15. pmid:26366565; PubMed Central PMCID: PMCPMC4569328.
  13. 13. Kulkarni SS, Karlsson HK, Szekeres F, Chibalin AV, Krook A, Zierath JR. Suppression of 5'-nucleotidase enzymes promotes AMP-activated protein kinase (AMPK) phosphorylation and metabolism in human and mouse skeletal muscle. J Biol Chem. 2011;286(40):34567–74. Epub 2011/08/30. pmid:21873433; PubMed Central PMCID: PMCPMC3186409.
  14. 14. Bose A, Guilherme A, Huang S, Hubbard AC, Lane CR, Soriano NA, et al. The v-SNARE Vti1a regulates insulin-stimulated glucose transport and Acrp30 secretion in 3T3-L1 adipocytes. J Biol Chem. 2005;280(44):36946–51. Epub 2005/09/01. pmid:16131485.
  15. 15. Costa DK, Huckestein BR, Edmunds LR, Petersen MC, Nasiri A, Butrico GM, et al. Reduced intestinal lipid absorption and body weight-independent improvements in insulin sensitivity in high-fat diet-fed Park2 knockout mice. Am J Physiol Endocrinol Metab. 2016;311(1):E105–16. Epub 2016/05/12. pmid:27166280; PubMed Central PMCID: PMCPMC4967148.
  16. 16. Jennissen K, Siegel F, Liebig-Gonglach M, Hermann MR, Kipschull S, van Dooren S, et al. A VASP-Rac-soluble guanylyl cyclase pathway controls cGMP production in adipocytes. Sci Signal. 2012;5(239):ra62. Epub 2012/08/31. pmid:22932701.
  17. 17. Tateya S, Rizzo NO, Handa P, Cheng AM, Morgan-Stevenson V, Daum G, et al. Endothelial NO/cGMP/VASP signaling attenuates Kupffer cell activation and hepatic insulin resistance induced by high-fat feeding. Diabetes. 2011;60(11):2792–801. Epub 2011/09/14. pmid:21911751; PubMed Central PMCID: PMCPMC3198085.
  18. 18. Williams MJ, Eriksson A, Shaik M, Voisin S, Yamskova O, Paulsson J, et al. The Obesity-Linked Gene Nudt3 Drosophila Homolog Aps Is Associated With Insulin Signaling. Mol Endocrinol. 2015;29(9):1303–19. Epub 2015/07/15. pmid:26168034; PubMed Central PMCID: PMCPMC5414682.
  19. 19. Zheng H, Yang X, Xi Y. Fat body remodeling and homeostasis control in Drosophila. Life Sci. 2016;167:22–31. Epub 2016/10/25. pmid:27773719.
  20. 20. Gronke S, Mildner A, Fellert S, Tennagels N, Petry S, Muller G, et al. Brummer lipase is an evolutionary conserved fat storage regulator in Drosophila. Cell Metab. 2005;1(5):323–30. Epub 2005/08/02. pmid:16054079.
  21. 21. Arrese EL, Soulages JL. Insect fat body: energy, metabolism, and regulation. Annu Rev Entomol. 2010;55:207–25. Epub 2009/09/04. pmid:19725772; PubMed Central PMCID: PMCPMC3075550.
  22. 22. DiAngelo JR, Birnbaum MJ. Regulation of fat cell mass by insulin in Drosophila melanogaster. Mol Cell Biol. 2009;29(24):6341–52. Epub 2009/10/14. pmid:19822665; PubMed Central PMCID: PMCPMC2786867.
  23. 23. Musselman LP, Fink JL, Narzinski K, Ramachandran PV, Hathiramani SS, Cagan RL, et al. A high-sugar diet produces obesity and insulin resistance in wild-type Drosophila. Disease models & mechanisms. 2011;4(6):842–9. pmid:21719444; PubMed Central PMCID: PMC3209653.
  24. 24. Ugrankar R, Berglund E, Akdemir F, Tran C, Kim MS, Noh J, et al. Drosophila glucome screening identifies Ck1alpha as a regulator of mammalian glucose metabolism. Nat Commun. 2015;6:7102. pmid:25994086; PubMed Central PMCID: PMCPMC4455130.
  25. 25. Orom UA, Derrien T, Beringer M, Gumireddy K, Gardini A, Bussotti G, et al. Long noncoding RNAs with enhancer-like function in human cells. Cell. 2010;143(1):46–58. pmid:20887892; PubMed Central PMCID: PMCPMC4108080.
  26. 26. Johansson CC, Dahle MK, Blomqvist SR, Gronning LM, Aandahl EM, Enerback S, et al. A winged helix forkhead (FOXD2) tunes sensitivity to cAMP in T lymphocytes through regulation of cAMP-dependent protein kinase RIalpha. J Biol Chem. 2003;278(19):17573–9. Epub 2003/03/07. pmid:12621056.
  27. 27. Yamamizu K, Matsunaga T, Katayama S, Kataoka H, Takayama N, Eto K, et al. PKA/CREB signaling triggers initiation of endothelial and hematopoietic cell differentiation via Etv2 induction. Stem Cells. 2012;30(4):687–96. pmid:22267325.
  28. 28. Jansen R, Hottenga JJ, Nivard MG, Abdellaoui A, Laport B, de Geus EJ, et al. Conditional eQTL analysis reveals allelic heterogeneity of gene expression. Hum Mol Genet. 2017;26(8):1444–51. pmid:28165122.
  29. 29. Westra HJ, Peters MJ, Esko T, Yaghootkar H, Schurmann C, Kettunen J, et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat Genet. 2013;45(10):1238–43. pmid:24013639; PubMed Central PMCID: PMCPMC3991562.
  30. 30. Speliotes EK, Willer CJ, Berndt SI, Monda KL, Thorleifsson G, Jackson AU, et al. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nature genetics. 2010;42(11):937–48. Epub 2010/10/12. pmid:20935630; PubMed Central PMCID: PMC3014648.
  31. 31. Locke AE, Kahali B, Berndt SI, Justice AE, Pers TH, Day FR, et al. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518(7538):197–206. Epub 2015/02/13. pmid:25673413; PubMed Central PMCID: PMC4382211.
  32. 32. Hu Y, Flockhart I, Vinayagam A, Bergwitz C, Berger B, Perrimon N, et al. An integrative approach to ortholog prediction for disease-focused and other functional studies. BMC Bioinformatics. 2011;12:357. pmid:21880147.