The adult Drosophila melanogaster body develops from imaginal discs, groups of cells set-aside during embryogenesis and expanded in number during larval stages. Specification and development of Drosophila imaginal discs have been studied for many years as models of morphogenesis. These studies are often based on mutations with large developmental effects, mutations that are often lethal in embryos when homozygous. Such forward genetic screens can be limited by factors such as early lethality and genetic redundancy. To identify additional genes and genetic pathways involved in leg imaginal disc development, we employed a Genome Wide Association Study utilizing the natural genetic variation in leg proportionality found in the Drosophila Genetic Reference Panel fly lines. In addition to identifying genes already known to be involved in leg development, we identified several genes involved in pathways that had not previously been linked with leg development. Several of the genes appear to be involved in signaling activities, while others have no known roles at this time. Many of these uncharacterized genes are conserved in mammals, so we can now begin to place these genes into developmental contexts. Interestingly, we identified five genes which, when their function is reduced by RNAi, cause an antenna-to-leg transformation. Our results demonstrate the utility of this approach, integrating the tools of quantitative and molecular genetics to study developmental processes, and provide new insights into the pathways and networks involved in Drosophila leg development.
Citation: Grubbs N, Leach M, Su X, Petrisko T, Rosario JB, Mahaffey JW (2013) New Components of Drosophila Leg Development Identified through Genome Wide Association Studies. PLoS ONE 8(4): e60261. doi:10.1371/journal.pone.0060261
Editor: Edward E. Schmidt, Montana State University, United States of America
Received: December 4, 2012; Accepted: February 24, 2013; Published: April 1, 2013
Copyright: © 2013 Grubbs et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work is supported by a grant (IOS-0842201) from the Developmental Mechanisms Program of the National Science Foundation (www.nsf.gov) to J.W.M. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
In the fruit fly, Drosophila melanogaster, development of the adult body begins when small clusters of cells are set-aside during embryogenesis to form the imaginal discs –. The leg primordia arise in the thoracic segments at positions where segmentally repeated expression of the signaling factor Wingless (Wg) activates expression of Distal-less (Dll) . Abdominal Hox genes block Dll activation posterior to the thoracic segments , while Decapentaplegic (Dpp) and the Epidermal Growth Factor Receptor pathways limit Dll expression dorsally and ventrally, respectively .
After the disc primordia are established they receive patterning instructions via a series of signaling molecules, morphogens and transcription factors expressed during the larval stages that establish the proximal-distal (P-D) pattern of the legs (Fig. 1A). The hedgehog (hh) gene is activated in the posterior compartment of the disc through the action of the transcription factor Engrailed, and while Engrailed defines cells of the posterior compartment, the Hh signal is transmitted to cells of the anterior compartment where it initiates P-D development . On the anterior side of the anterior-posterior border, where Hh signal is strongest, wg and dpp are activated; Dpp is responsible for dorsal fate and Wg for ventral. In addition to defining the dorsal-ventral orientation, these morphogens also work together to define the P-D axis of the leg. In the center of the disc where cells experience high levels of both Wg and Dpp, Dll is activated to establish the distal portions of the leg (mid-tibia through tarsus) –. As the mutual presence of Wg and Dpp decreases, a threshold is reached that permits the activation of dachshund (dac), which is responsible for patterning the middle leg regions (femur and tibia) . Near the edge of the disc, where the mutual concentration of Wg and Dpp is lowest, homothorax (hth) is expressed. Here, the Hth protein imports Extradenticle (Exd) into the nucleus, specifying the proximal portion of the leg and its connection with the body wall , , . Recently, an alternative model has been suggested proposing different molecular controls activating medial fate , , .
A) Leg imaginal disc patterning begins (left) when En, present in the posterior (yellow) portion of the disc, activates hh expression. The Hh ligand diffuses into the anterior portion of the disc. There it activates the expression of dpp (purple) dorsally and wg (green) ventrally. Dpp and Wg, in turn, diffuse throughout the dorsal and ventral portions, respectively, of the disc, creating a gradient of their mutual presence. This gradient is responsible for the pattern of expression of transcription factors (middle) that establish the proximal-distal axis of the leg (right). In the center of the disc, mutual Dpp and Wg is highest, activating the expression of Dll, which is responsible for patterning the most distal structures of the leg, including the terminal structures (claw and pulvillus). Reduced mutual Dpp and Wg results in the activation of dac, responsible for the patterning of the middle portions of the leg. Where there is almost no mutual Dpp or Wg, Hth and Exd are active, patterning the proximal leg portions and the junction with the rest of the body. B–C illustrate how proportions might change without affecting over-all length. B) Enhanced Hh signal (dark yellow) could result in an expansion of Wg and Dpp, causing a broader domain of Dll expression, at the expense of dac (middle). This could, in turn, cause the tarsal segments to make up a larger portion of the leg without changing leg length (right). C) Similarly, the proportion of the femur could be expanded (right) if dac expression were expanded (middle). This might result from a decrease in expression or distribution of Dpp and Wg (left), which could, in turn, be caused by reduced Hh signal (light yellow).
Many of the homologs of genes and genetic pathways that pattern Drosophila legs have been shown to be important in the development and growth of appendages and appendage-like structures in other metazoans , , . Further, misexpression of many of these genes contributes to developmental disorders and cancer , –.
Though we know many of the factors that establish or pattern appendages during Drosophila development, gaps still remain in our knowledge, gaps that, when filled, will solidify our understanding of how molecular networks establish appendage and organ development. For example, many of the genes known to be involved encode transcription factors, but their targets remain largely a mystery , and even some major players in leg development were missed in directed screens for appendage factors . It is critical to fill these gaps to fully understand the genetic architecture of appendage development.
Like any characteristic of the adult fly, appendage morphology is a multigenic trait. Variation in expression, coding ability or mRNA stability of allelic combinations within a population will cause variation in the final morphology of the appendage, within tolerances of morphological constraints. We have taken advantage of this natural variation and employed a genome-wide association study (GWAS) using the wild-derived Drosophila melanogaster lines of the Drosophila Genetic Reference Panel (DGRP, ) to identify genes contributing to leg development. We identified single-nucleotide polymorphisms (SNPs) associated with variation in proportion of the leg segments relative to total leg length. Candidate genes were selected based on proximity to the SNPs associated with this variation and were further tested using in situ hybridization and RNA interference (RNAi) to determine whether or not the genes had a role in leg development. We identified genes from known pathways that had not been previously associated with leg development as well as previously uncharacterized genes, and demonstrated their role in this process. In addition, we identified five genes that, when expression is reduced, cause an antenna-to-leg transformation. Our results will help provide a more comprehensive view of the processes involved in appendage development in Drosophila.
Materials and Methods
Growth of DGRP lines
The DGRP lines were provided by the Mackay lab (NCSU, Raleigh, NC) and raised in standard environmental conditions: plastic vials with cornmeal-agar-molasses media placed in 25°C incubator. We initially screened the set of 40 DGRP lines that were to be the first sequenced. Males and females were randomly selected from each line and seven of each placed in four individual vials for egg laying. Parents were removed after three to four days depending on the amount of larval activity. We extended our screen to 100 lines (45 and 55 each) that were later sequenced to improve the power of association mapping. For these lines, 20 to 100 adults were placed in a collection cup and allowed to lay eggs on grape juice agar plates that were changed daily. After allowing an additional 24 hours for larvae to hatch, fifty randomly selected larvae were transferred to each of four fresh vials. Adult flies were preserved in ethanol 3 to 5 days after eclosing. Analysis of variance (ANOVA) indicated that there was no significant difference between flies collected in these two methods (data not shown).
Dissection of thoracic legs and measurement
First (T1) and second (T2) thoracic legs were dissected and mounted in 70% glycerol on glass slides. Images of the legs were taken with a MicroPublisher camera (QImaging, Surrey, BC, Canada) mounted on a Zeiss AxioScope microscope (Thornwood, NY, USA) at 10× magnification. Three flies of each sex from each vial were randomly selected for leg dissection. Measurements of the legs (Fig. S1) were recorded using Adobe Photoshop and ImageJ . Measurements of the femur, tibia, and tarsal segments were recorded separately (Table S1). Since the proximal end of femurs could not be seen clearly in some images, femur length was measured as the distance of the central line between the most proximal bristle socket to the distal end of the femur. Tibia length was measured as the distance between the proximal and distal ends. Each tarsal segment length was measured as the shortest distance between the ends and joint centers where the tarsus is noticeably thinner. The tarsal segments were added together to get total tarsal length, and the lengths of all segments were summed to arrive at “total” leg length.
Factorial, mixed model ANOVAs of form Y = μ + S + L + V(L) + SxL + SxV(L) + ε were used to partition variation in leg segment proportion between sexes (S, fixed), DGRP lines (L, random), vial from which the measured fly came (V, random), the SxL interaction (random), the SxV interaction (random), and the error variance (ε) (Fig. S2 and Table S2). Association between variability in leg segment proportionality and SNP markers was tested using SAS software (Cary, NC, USA). Markers were tested for association using a simple ANOVA model: phenotype = SNP, assayed independently by position, by sex, and by leg type (i.e. T1, T2). Sites with a minor allele frequency of less than five were removed from analysis. We used data from Freeze One of the DGRP  for our analysis. We employed Fisher's method for combining P-values to merge data from both legs and from both sexes. We sorted by test statistics as a ranking method only; we did not convert the number to a P-value.
Note on our use of Fisher's method for combining P-values to sort our data.
Our goal was to identify SNPs that affect general appendage development, and not those that might be sex or thoracic segment specific. Therefore, we required that SNPs affect leg proportionality in both thoracic segments and both sexes. If this were the case they would have lower P-values in all categories, both sexes and legs both thoracic segments. To sort and identify such SNPs, we calculated the summary statistic using Fisher's method of combining P-values. Fisher's method relies on independence of the traits measured, and this is not the case with our leg data. Therefore, the P-values obtained would be anti-conservative. We sorted by the summary statistic as a ranking method only, and did not utilize the P-value for any analyses. For this reason, we only report the summary statistic in our table (Table S6).
Identifying candidate genes
From association mapping, SNP positions that had the largest test statistic were identified independently for the femur, tibia, and tarsus. During early rounds of experiments we identified a few candidates with leg imaginal disc expression (see below). We continued on with these candidates though some had somewhat lower significance as determined in later rounds of association. The GBrowse program on FlyBase version 2012_3 (www.flybase.org, ) was used to identify candidate genes by proximity to SNP positions. If a SNP was found between two neighboring genes, both genes were analyzed as candidates. Genes near SNPs of interest with a known function in leg development were not examined further. SNP positions greater than 8 Kbp from an annotated gene were discarded.
Cloning and in situ hybridization
Genes were cloned using primers (Table S3) designed from exon sequences on FlyBase . Vector NTI (Life Technologies Corporation, Grand Island, NY, USA) was used to help design primers for a coding region of each candidate gene. These regions were amplified using PCR of Drosophila genomic DNA extracted from Ore-R flies. PCR products were then cloned into a pGEM vector using the pGEM-T Easy Vector System (Promega, Madison WI, USA).
In situ hybridization was used to determine expression patterns of candidate genes in imaginal discs and embryos. Drosophila imaginal discs from third instar larvae were dissected in PBS and placed into standard in situ fixative. Digoxigenin-labeled antisense RNAs were prepared and in situ hybridization was done essentially as in .
RNAi lines were obtained from the Vienna Drosophila RNAi Center (VDRC)  directly, and from the Transgenic RNAi Project (TRiP) from Harvard Medical School through the Bloomington Stock Center (Table S4). Males from RNAi lines were crossed to virgin females from five Gal4 lines: rotund (rn) , decapentaplegic (dpp) , armadillo , Distal-less (Dll) and teashirt . Flies were grown at room temperature, though some were repeated at 25 degrees. Progeny from crosses were examined to determine whether or not there was an effect of silencing candidate genes in Gal4 driver domains. Efficacy of RNAi was confirmed by examining expression of the candidate gene in the RNAi larvae using in situ hybridization (data not shown).
Quantitative variation in leg proportions of the DGRP lines
Our goal was to assess the use of the tools of quantitative genetics for identifying genes involved in Drosophila development. Variation in leg patterning is a multigenic trait, and though there will be constraints on the degree of variation, if measureable, we could use statistical analyses to map candidate loci contributing to that variation. Though previous forward genetic studies have revealed a wealth of information on appendage development in Drosophila, the use of natural variation offered a unique perspective with different sensitivities. We chose to examine variation in proportion of leg segments to bring into focus current models of leg patterning (Fig. 1). For example, a small change in the strength of Hh, Dpp or Wg signals could alter the expression of determinant genes (hth, dac, or Dll), which, in turn, could shift the boundaries of leg segmentation and, hence, change proportioning without necessarily affecting over-all length. Therefore, variation in proportion of the segments (femur, tibia, tarsus) seemed the more informative trait to study if we wanted to identify genes that might affect early appendage patterning. Further, this would remove or reduce other influences, such as variation in metabolism or growth rate.
We dissected legs from first (T1) and second (T2) thoracic segments measuring twelve individuals of both sexes from 137 DGRP lines , . Of these, 117 were included in the Freeze 1 sequences of the DGRP (Fig. S1). The proportion of each segment was calculated as compared to the total as defined in Materials and Methods. Statistical analyses indicated strong correlations between leg segments of both thoracic segments and both sexes within a line, as we would expect if the variation were due to alleles of general appendage-patterning genes (Table S5). Further, there were significant correlations between most of the segments within a leg (femur, tibia and tarsus, Table S5), though the correlation between proportion of tibia and of femur was relatively low (not significant in male T1). In addition, the correlation between the tarsus and other segments was negative, which might be expected if expanding the proportion of one segment would require a simultaneous reduction in another. The variation in proportionality for the leg segments for males and females of the 117 DGRP lines is shown in Fig. S2.
GWAS and candidate gene selection
The top 20 SNP positions associated with variation in each of the three leg segments, along with the genes identified as closest to the SNP are listed in Table S6. (FlyBase numbers for most candidate genes are included in the Table S6. Those not in the table are included in the text.) Although some SNPs were not located near any known or predicted genes, many SNP positions were found in introns, exons, and protein coding regions, as well as just upstream or downstream of genes. Several of the genes neighboring or harboring the SNPs were already known to be involved in leg development, including Moesin (Moe) , Enhancer of zeste (E(z)) ,  and myospheroid (mys; FBgn0004657) – or suspected to be, disco-related (disco-r) . Identification of these genes helped to validate our methods. Some candidates were named genes with known functions, but their role in leg development had not been examined. These were selected for further study to assess whether they also had roles in leg development. Some candidate genes had not been previously characterized.
Expression of candidates during Drosophila development
To determine if our candidate genes played a role in leg development, we first examined distribution of mRNAs encoded by these genes using in situ hybridization, looking for expression in the leg imaginal discs of larvae or their primordia in embryos (Fig. 2 and Table S7). Of the 37 genes examined, we did not detect staining in the leg imaginal discs for seven (CG5549, M-spondin (mspo), dpr8, CG7949, CR33218, CG42353, CG42354). Of these, only mspo and CG7949 had embryonic expression, but neither was expressed in the disc primordia. These seven genes were excluded from further analysis.
Wild-type third instar imaginal discs stained to detect mRNAs from several of our candidate genes. Examples with several different mRNA distribution patterns are shown. There are several that are of note, the cross-like pattern of boi in the wing disc that borders wg and dpp expression, the narrow tarsal staining of eIF6. We also note that disco-r is expressed in the ventral edge of the wing which had not been reported previously.
Transcripts from several genes accumulated in broad regions of the leg discs, such as brother of ihog (boi) and eIF6, while others were enhanced in the presumptive tarsal region, like CG43173 and PI31 (FBgn0033669). We also observed staining for many of these genes in the developing antennae, where some were also patterned. This is not unexpected as the antennae and legs are serially homologous structures –. Transcripts for some genes were observed in the wing, haltere and eye as well. Of particular note was the Hedgehog pathway member boi, which, in the wing discs, had enhanced staining in the wing blade region outlining the expression domains of wg and dpp (Fig. 2). We observed staining in embryos with 17 of the 30 probes that had expression in the imaginal discs; however, with the exception of disco-r , none of these appeared to have staining specifically in the developing disc primordia (data not shown).
RNAi phenocopies in appendages
To determine whether any of our candidate genes were necessary for normal leg development, we used RNAi to reduce gene function of those candidates with mRNAs detected in the leg discs. We used transgenic RNA interference lines from the Vienna Drosophila RNAi Center (VDRC)  and Transgenic RNAi Project (TRiP) with several Gal4 drivers (see Materials and Methods). However, the Dll-Gal4 driver , expressing Gal4 in the medial-to-distal portion of the leg discs, was most informative, so results described below (and summarized in Table 1) are primarily from the use of this driver (referred to as Dll-RNAi below). RNAi lines were not available for four candidate genes (yantar, GG43173, CG43174 and Hexosaminidase 1 [FBgn0041630]).
We observed altered leg morphology with six candidate genes. At room temperature, Dll-RNAi with COP9 complex homolog subunit 1b (CSN1b) resulted in smaller legs relative to overall body size (Fig. 3B) and occasional loss of the fifth tarsal segment. These flies also had reduced aristae in the antennae (Fig. 4B), and males had reduced sex-combs (data not shown). At 25 degrees reduction of CSN1b caused loss of multiple tarsal segments. Dll-RNAi of PI31 reduced the relative size of tarsal segments, occasionally deleting one of the middle tarsal segments (Fig. 3C), and also deleted the aristae in the antennae (Fig. 4C).
Second thoracic legs from female flies with Dll-Gal4 driving UAS-RNAi insertions. All flies were grown at room temperature. A) Wild-type. B) RNAi of CSN1b resulted in reduced leg size, but did not alter shape. C) RNAi of PI31 reduced length of tarsal segments, with occasional loss of a single distal segment. D) RNAi of eIF6 caused deletions and fusions of tarsal segments. E) RNAi of CG6841 deleted most of the tarsal segments and terminal structures. F) RNAi of Cka was similar to that of CG6841, but more severe. Abbreviations: fe, femur; ti, tibia; ta, tarsal segments; cl, claw.
Heads from flies with both Dll-Gal4 and UAS-RNAi insertions that disrupted antennal development. Arrows with numbers indicate the antennal segments. A) Wild-type. B) RNAi of CSN1b resulted in loss of arista, and appearance of sclerotized structure C). RNAi of PI31 resulted in loss of arista while the remaining structures appear normal. D) RNAi of CG6841 resulted in loss of arista, and drastic changes to the appearance of antennal segment 3, and some reduction of A2. E) RNAi of Cka resulted in loss of all but proximal structures. F) RNAi of CG9129 is characteristic of the five lines that resulted in transformation of antennae toward leg identity beginning in distal A2, though slight differences were noted between the five lines that caused these transformations.
Dll-RNAi of eIF6, CG6841 and Connector of kinase to AP-1 (Cka) each resulted in fused and deleted tarsi, with the later two also deleting terminal structures, the claws and pulvilli (Fig. 3D–F). In addition, Dll-RNAi of eIF6 caused reductions in the sex-comb tooth number (data not shown). RNAi of CG6841 and Cka also altered or deleted distal antennal segments (Fig. 4D, E).
In some cases Dll-RNAi was lethal. For example, Dll-RNAi with sickie (sick) was lethal prior to pupal formation, so we could not determine if there was an effect on leg development with that driver. However, with the rn driver (rn-RNAi, expressing Gal4 in the distal portion of the leg and in the wing blade region) reducing sick had no apparent effect in the legs, except with sex-comb tooth numbers in males, which were either greatly decreased or increased (data not shown). Reduced expression of bnl (FBgn0014135) and Chronologically inappropriate morphogenesis (chinmo) caused early death with multiple drivers.
RNAi of many candidates, driven by either rn- or Dll-Gal4, also resulted in altered wing morphology, ranging from a hooded phenocopy with slight reduction in size accompanied by a downward curvature, to near total loss of wing blade (data not shown).
Somewhat surprisingly, Dll-RNAi with five candidates (CG9129, CG32333, CG30371, Piezo [FBgn0031993] and disco-r) caused antenna-to-leg transformations (for example see CG9129 in Fig. 4F), yet no overt disruption of leg morphology was observed with these candidates. Since all five RNAi lines were from the VDRC we were suspicious that this could be due to a background effect with some of the VDRC RNAi lines. To test this, we generated flies with the RNAi insertions and Dll-Gal4, with and without tubulin-Gal80 on the X chromosome. Presence of Gal80 would block activation of the RNAi construct, so if the antenna-to-leg transformations were due to a genetic background effect in some of the VDRC lines, then we would expect to observe the transformation regardless of whether Gal80 was present. However, presence of Gal80 completely blocked the antenna-to-leg transformation indicating that the transformations were due to activation of the RNAi constructs, likely through reduction of the candidates.
General considerations of the GWAS approach for identifying genes important for limb development
Studies of appendage development in Drosophila have been extremely valuable in understanding the genetic networks governing many aspects of organ and structure development. Further, comparative studies in other organisms have opened the door for examining how genetic changes lead to diversification of structures and, thereby, evolution. GWAS takes advantage of the natural variation in a trait or phenotype to identify genes that contribute to the development of that trait. It is most important to understand that it is the combination of many genes with small effect that gives rise to the natural variation of a trait. Hence, a large effect is not expected from any one SNP. By associating a particular SNP with variation in the trait of interest, we can map the positions of genes that contribute to the trait. Thus GWAS offers a different perspective on gene identification and can circumvent issues affecting more standard forward genetic screens, such as early lethality and redundancy. Although GWAS has been used in the past to locate genetic causes of variation in behavioral and quantitative traits and causes of heritable diseases , , it has not, to our knowledge, been employed to identify genes involved in developmental processes. Our study demonstrates that integrating the tools of Quantitative and Developmental Genetics can be very productive, addressing issues of complex genetic interactions governing development.
The presence of functionally redundant paralogs in the genome could mask a leg development role in most forward genetic analyses. From our GWAS, we identified several candidates where a paralog is known or suspected (Table S8). Of these, boi, in particular, stands out. boi, and its paralog, interference hedgehog (ihog, FBgn0031872) have recently been shown to encode proteins critical for Hedgehog (Hh) signaling during Drosophila wing development –. Currently, neither has been implicated in leg development. However, since Hh signaling is critical in the legs, it is reasonable that these genes could play a part in leg development and that SNPs in these genes could contribute to quantitative variation in leg phenotypes. Therefore, it is not surprising they would show up in our GWA Study. Boi and Ihog bind Hh, strengthening the activation of the Hh signaling pathway, while limiting the range of the ligand , . In the leg discs, any change that broadens or narrows the range of Hh could, in turn, broaden or narrow the expression domains of the downstream responders (wg or dpp), thereby altering the domains of expression of the transcription factors responding to these signals (Fig. 1). While this potential connection to the leg development model makes boi an intriguing candidate to pursue, reduction of boi (or ihog) alone (through Dll-RNAi) caused no obvious malformation of legs or antennae. We suspect that the redundancy of these genes prevented any gross abnormalities in leg development. Presumably, subtle tweaks to boi expression or function could have effects on leg patterning within the realm of a multigenic characteristic of a population.
Redundancy might also have an impact on the antenna-to-leg phenocopy we observed with five candidate genes (CG9129, CG32333, CG30371, Piezo and disco-r, see Table S8). We suspect that redundancy is the reason that we did not observe malformation of the legs simultaneously with the antenna-to-leg transformations, and that these candidates actually do have roles in leg development. This is certainly true for disco-r, where the known paralog, disconnected (disco) masks the role of loss of disco-r alone during leg development , . Perhaps, antennae are more sensitive to RNAi of one particular paralog versus the other, or that the antennae are more sensitive than the legs to the reduced Dll function of the Dll-Gal4 mutation in combination with the RNAi (see below). Indeed, we have noted that the RNAi process appears to be more robust in the antennae than in the legs (data not shown). Because of redundancy, it is possible that neither boi nor the antenna-to-leg candidates would easily have been discovered from standard forward genetic screens. This demonstrates the power of GWAS as a tool to overcome some of the limitations of more traditional methods of identifying genes involved in developmental pathways.
As a final comment about target gene identification from our GWAS study, we do expect some false positives in our analysis. An estimated False Discovery Rate (FDR) indicates that approximately 20 SNPs could have arisen by chance using a cutoff of 1e-5 (Table S9). However, because this FDR estimation assumes all tests are independent, it is an overestimate. Lack of independence arises because of linkage disequilibrium between neighboring SNPs, and this would mean that the actual number of analyzed SNPs is somewhat lower than that used for the FDR estimation. Hence, the false discovery estimate is inflated.
New insights into the pathways of appendage development
In Table 2 we summarize what is known about the proteins and their functions of our candidates. Though a few of these encode known proteins, many are computed genes (CGs) at this point, so the data obtained from this study will help in defining pathways associated with these genes.
Three candidates potentially could be involved in expanding or restricting the signals or morphogens involved in leg development (boi, CSN1b, mys). From the model shown in Fig. 1, it is easy to understand how altering components of Hh signaling could contribute to changes in leg proportionality. As described above, boi is known to function in the Hh pathway. CSN1b encodes a subunit of the COP9 signalosome/ubiquitin-proteasome system, which has recently been implicated in Hh signaling in the wing . However, the role of this highly conserved signalosome ,  could be broader, as a potential function in transcription regulation . The Drosophila mys gene encodes a homolog of vertebrate beta PS integrin , , , which is involved in cell adhesion and signaling. Loss of mys leads to loss of portions or all of the leg .
PI31 might also be involved in signal modulation. PI31 is a highly conserved regulator of proteasome function , . It has both inhibitory and activation properties, depending upon the test system. In Drosophila, PI31 binds to the F-box factor, Nutcracker to activate proteasome and caspase during sperm maturation. It is also active in cell cycle regulation. Perhaps, like CSN1b, PI31 might be involved in a signal modulation that involves the proteasome.
Several genes identified in our GWAS encode proteins with transcription factor domains (Table 2); particularly prevalent were genes encoding Zn-finger transcription factors (CG43444, disco-r, chinmo). At present, we do not know whether these factors might function upstream of the Hh signal or are integrated into the response. We do know that ectopic expression of disco-r or its paralog disco can induce ectopic ventral appendage structures , and that cells require at least one of these genes to form ventral appendages (J.R and J.W.M. unpublished). Further work will be required to discern where these other genes fit into the appendage network.
The classic antenna-to-leg transformation arises from ectopic expression of the homeotic gene Antennapedia , . Hth is central to antennal specification and, therefore, the transformation. When Hth and Exd are present in the same cells, Hth helps recruit Exd to the nucleus, inducing antennal fate. If Hth is reduced, Exd remains cytoplasmic and the antennae develop leg-like characteristics.
Only a few genes are known where recessive mutations cause this transformation, Dll, hth , , exd  and spineless . Ectopic Antennapedia or reduction of either Dll or Spineless reduces expression of hth, thereby decreasing import of Exd into the nucleus. Given this rather straightforward model for antennal specification, we were quite surprised to find that five different RNAi lines caused an antenna-to-leg transformation when driven with Dll-Gal4. However, since ectopic expression of disco-r can induce ectopic antennae in the eyes (J. W. Mahaffey, unpublished), it is perhaps not surprising that reduction of disco-r can cause this transformation.
The only other gene of our antenna-to-leg candidates with a postulated function is Piezo, which encodes a mechanically gated ion channel thought to be involved in reception of mechanical stimuli , . How this and the other uncharacterized genes integrate into antennal, and possibly leg, development remains unknown. Perhaps it is worth noting that recent data indicate that Piezo is involved in tissue homeostasis, functioning to extrude cells from epithelia under crowded conditions .
An important point to consider with the antenna-to-leg transformations is that the Gal4 insertion of Dll-Gal4 creates a hypomorphic allele of Dll, Dllmd13 . While Dll-Gal4 alone does not cause an antenna-to-leg transformation, this mutation could generate a sensitized background for our candidate RNAi, so it is possible that the transformations we observe are the result of a synergistic effect between reduction of the candidate and reduced Dll. Though we do not know what, if any, contribution Dll-Gal4 has on the transformation, inclusion of Gal80 in our experiments demonstrates that the RNAi activation was required, suggesting that these genes are likely involved in the “antenna or leg” genetic decision.
The genes we have identified in this study might be expected to participate in some way with other genes known in the appendage developmental network (for example, hh, wg, dpp, Dll, dac, and hth), and we thought perhaps that we could identify possible network links using programs such as DroID . These programs provide a mechanism to search for potential interactions discerned from prior genetic research and genome wide studies of protein interactions. We used DroID and Cytoscape  to search for known and predicted protein-protein or genetic interactions that could link our candidates to the canonical appendage development network. There was little information for many of our candidates, which was not surprising since they are CG's; however, three networks predicted for Cka, PI31 and CG6841 were informative and are shown in (Fig. 5). These three genes were two steps or less from the canonical genes mentioned above, though dac was an exception sharing few, if any, connections. Not surprisingly, a network of proteasome-related genes connected PI31 to the canonical pathway (Fig. 5A); this may support our hypothesis that PI31 could play a role in regulating signaling processes via protein degradation. PI31 also shared connections with other of our candidates, eIF6, Cka and disco-r. Cka (Fig. 5B), the RNAi of which caused a strong phenocopy in the leg, was linked to the canonical leg pathway through a number of genes encoding kinases, including members of the JNK pathway. The role of Cka in the JNK pathway has previously been shown to be important in the proper expression of dpp during dorsal closure , so similar mechanisms may play an important role in Dpp function during leg development. Another candidate with a strong RNAi phenocopy, CG6841 (Fig. 5C), has not been extensively studied, yet this network analysis identified several connections to the pathways of known leg development genes, including disco and engrailed, an early component of the cascade . These links support the usefulness of our method, and perhaps can be used to identify other novel leg development genes.
Genes in orange connect directly to the candidate (Green), and may be linked directly or through genes in blue to the canonical appendage cascade genes (red). Yellow identifies those of our candidate genes that are linked to the test gene. A) Analysis of PI31. Links shared with Cka are shown in dark blue. B) Analysis of Cka. Links shared with PI31 are shown in dark orange and dark blue. C) Analysis of CG6841, including a link to disco in pink.
We have shown that Genome Wide Association Studies provide a useful method for identifying genes contributing to a developmental process. Further, we have evidence that this method can overcome some of the limitations of traditional developmental genetic screens, such as redundancy. We demonstrate this by successfully utilizing a GWAS screen to identify several new components of signaling, transcription and unknown pathways that contribute to Drosophila appendage development. Thus, we believe these results show the applicability of this method to any developmental trait. All that is required is measurable variation in morphology of a relevant trait arising from the developmental process targeted. This method should prove beneficial for extending what is known about even well studied developmental genetic pathways.
Measuring legs. An example DGRP female, T2 leg marked to show the measuring method. In the femur, a black circle marks each of the two most proximal bristles. Measurement begins in the middle of the line (black) between these two bristles and ends at the end of the segment (red line). The tibia is measured from its beginning, in the joint with the femur, and ends where it forms a joint with the first tarsal segment (blue line). Each tarsal segment was measured separately (green lines), beginning with the joint to the more proximal segment and ending with the joint to the more distal segment (green arrow heads). In the case of the first tarsal segment, measurement began at the joint with the tibia. Measuring the final tarsal segment ended at the tip of the leg, not including terminal structures.
Variation in proportions of leg segments. Mean leg segment proportion for each measured DGRP line is graphed on the Y-axis, organized from smallest to largest based on female values (red) for each trait. Males are shown in blue. The P-value for line effect is also shown, and was very significant in all cases.
ANOVA of leg proportions.
Primers for gene cloning.
Source of RNAi Lines.
Correlations between traits. Degree of significance from the lowest possible value (0) and the highest possible value (1) were also calculated (recorded in columns ‘t’ and ‘P-value’). Fe, femur proportion; Ta, tarsus proportion; Ti, tibia proportion; F, female, M, Male; A, both sexes; T1, first thoracic leg; T2, second thoracic leg.
Top candidate genes. Candidates were sorted using Fisher's method for combining P-values to identify SNPs with the highest summary statistic (Fishers) in all four measurements (T1 and T2 and Males and Females). Candidate genes were identified by proximity to SNPs In cases where multiple SNPs identified a candidate, only the most significant SNP position is shown. FlyBase IDs are included in this table.
Observed expression of analyzed candidates. * = selected early, † = previously published, P = patterned expression, U = ubiquitous expression.
Possibility of paralogs. Probable = Genes with potential paralogs. Genes with known redundant paralogs are labeled as “Yes” along with the name of the paralog. The five candidates that gave antenna-to-leg transformation with the Dll driver are marked with an asterisk.
False Discovery Rate Estimate.
We thank Drs. Trudy Mackay, Michael Magwire, Julian Ayroles, Dahlia Nelson and Eric Stone for helping us with the statistical aspects of this project. We thank the TRiP at Harvard Medical School (NIH/NIGMS R01-GM084947) and the Bloomington Drosophila Stock Center for providing the several of the transgenic RNAi fly lines used in this study.
Conceived and designed the experiments: JWM XS NG ML JBR TP. Performed the experiments: JWM XS NG ML JBR TP. Analyzed the data: JWM NG ML JBR XS. Wrote the paper: JWM NG.
- 1. Kojima T (2004) The mechanism of Drosophila leg development along the proximodistal axis. Dev Growth Differ 46: 115–129.
- 2. Morata G (2001) How Drosophila appendages develop. Nat Rev Mol Cell Biol 2: 89–97.
- 3. Pueyo JI, Couso JP (2005) Parallels between the proximal-distal development of vertebrate and arthropod appendages: homology without an ancestor? Curr Opin Genet Dev 15: 439–446.
- 4. Estella C, Voutev R, Mann RS (2012) A dynamic network of morphogens and transcription factors patterns the fly leg. Curr Top Dev Biol 98: 173–198.
- 5. Panganiban G (2000) Distal-less function during Drosophila appendage and sense organ development. Dev Dyn 218: 554–562.
- 6. Cohen SM (1990) Specification of limb development in the Drosophila embryo by positional cues from segmentation genes. Nature 343: 173–177.
- 7. Vachon G, Cohen B, Pfeifle C, McGuffin ME, Botas J, et al. (1992) Homeotic genes of the Bithorax complex repress limb development in the abdomen of the Drosophila embryo through the target gene Distal-less. Cell 71: 437–450.
- 8. Goto S, Hayashi S (1997) Specification of the embryonic limb primordium by graded activity of Decapentaplegic. Development 124: 125–132.
- 9. Diaz-Benjumea FJ, Cohen B, Cohen SM (1994) Cell interaction between compartments establishes the proximal-distal axis of Drosophila legs. Nature 372: 175–179.
- 10. Lecuit T, Cohen SM (1997) Proximal-distal axis formation in the Drosophila leg. Nature 388: 139–145.
- 11. Cohen SM, Jurgens G (1989) Proximal-distal pattern formation in Drosophila: cell autonomous requirement for Distal-less gene activity in limb development. EMBO J 8: 2045–2055.
- 12. Wu J, Cohen SM (1999) Proximodistal axis formation in the Drosophila leg: subdivision into proximal and distal domains by Homothorax and Distal-less. Development 126: 109–117.
- 13. Abu-Shaar M, Mann RS (1998) Generation of multiple antagonistic domains along the proximodistal axis during Drosophila leg development. Development 125: 3821–3830.
- 14. Panganiban G, Rubenstein JL (2002) Developmental functions of the Distal-less/Dlx homeobox genes. Development 129: 4371–4386.
- 15. Mardon G, Solomon NM, Rubin GM (1994) dachshund encodes a nuclear protein required for normal eye and leg development in Drosophila. Development 120: 3473–3486.
- 16. Gonzalez-Crespo S, Abu-Shaar M, Torres M, Martinez AC, Mann RS, et al. (1998) Antagonism between extradenticle function and Hedgehog signalling in the developing limb. Nature 394: 196–200.
- 17. Wu J, Cohen SM (2000) Proximal distal axis formation in the Drosophila leg: distinct functions of teashirt and homothorax in the proximal leg. Mech Dev 94: 47–56.
- 18. McKay DJ, Estella C, Mann RS (2009) The origins of the Drosophila leg revealed by the cis-regulatory architecture of the Distalless gene. Development 136: 61–71.
- 19. Giorgianni MW, Mann RS (2011) Establishment of medial fates along the proximodistal axis of the Drosophila leg through direct activation of dachshund by Distalless. Dev Cell 20: 455–468.
- 20. Panganiban G, Irvine SM, Lowe C, Roehl H, Corley LS, et al. (1997) The origin and evolution of animal appendages. Proc Natl Acad Sci U S A 94: 5162–5166.
- 21. Briscoe J, Therond P (2005) Hedgehog signaling: from the Drosophila cuticle to anti-cancer drugs. Dev Cell 8: 143–151.
- 22. Morini M, Astigiano S, Gitton Y, Emionite L, Mirisola V, et al. (2010) Mutually exclusive expression of DLX2 and DLX5/6 is associated with the metastatic potential of the human breast cancer cell line MDA-MB-231. BMC Cancer 10: 649.
- 23. Polakis P (2007) The many ways of Wnt in cancer. Curr Opin Genet Dev 17: 45–51.
- 24. Popov VM, Wu K, Zhou J, Powell MJ, Mardon G, et al. (2010) The Dachshund gene in development and hormone-responsive tumorigenesis. Trends Endocrinol Metab 21: 41–49.
- 25. Patel M, Farzana L, Robertson LK, Hutchinson J, Grubbs N, et al. (2007) The appendage role of insect disco genes and possible implications on the evolution of the maggot larval form. Dev Biol 309: 56–69.
- 26. Mackay TF, Richards S, Stone EA, Barbadilla A, Ayroles JF, et al. (2012) The Drosophila melanogaster Genetic Reference Panel. Nature 482: 173–178.
- 27. Schneider CA, Rasband WS, Eliceiri KW (2012) NIH Image to ImageJ: 25 years of image analysis. Nat Methods 9: 671–675.
- 28. McQuilton P, St Pierre SE, Thurmond J (2012) FlyBase 101–the basics of navigating FlyBase. Nucleic Acids Res 40: D706–714.
- 29. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215: 403–410.
- 30. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402.
- 31. Tautz D, Pfeifle C (1989) A non-radioactive in situ hybridization method for the localization of specific RNAs in Drosophila embryos reveals translational control of the segmentation gene hunchback. Chromosoma 98: 81–85.
- 32. Dietzl G, Chen D, Schnorrer F, Su KC, Barinova Y, et al. (2007) A genome-wide transgenic RNAi library for conditional gene inactivation in Drosophila. Nature 448: 151–156.
- 33. St Pierre SE, Galindo MI, Couso JP, Thor S (2002) Control of Drosophila imaginal disc development by rotund and roughened eye: differentially expressed transcripts of the same gene encoding functionally distinct zinc finger proteins. Development 129: 1273–1281.
- 34. Staehling-Hampton K, Jackson PD, Clark MJ, Brand AH, Hoffmann FM (1994) Specificity of bone morphogenetic protein-related factors: cell fate and gene expression changes in Drosophila embryos induced by decapentaplegic but not 60A. Cell Growth Differ 5: 585–593.
- 35. Sanson B, White P, Vincent JP (1996) Uncoupling cadherin-based adhesion from wingless signalling in Drosophila. Nature 383: 627–630.
- 36. Calleja M, Moreno E, Pelaz S, Morata G (1996) Visualization of gene expression in living adult Drosophila. Science 274: 252–255.
- 37. Ayroles JF, Carbone MA, Stone EA, Jordan KW, Lyman RF, et al. (2009) Systems genetics of complex traits in Drosophila melanogaster. Nat Genet 41: 299–307.
- 38. Speck O, Hughes SC, Noren NK, Kulikauskas RM, Fehon RG (2003) Moesin functions antagonistically to the Rho pathway to maintain epithelial integrity. Nature 421: 83–87.
- 39. Shearn A, Hersperger G, Hersperger E, Pentz ES, Denker P (1978) Multiple Allele Approach to the Study of Genes in DROSOPHILA MELANOGASTER That Are Involved in Imaginal Disc Development. Genetics 89: 355–370.
- 40. Phillips MD, Shearn A (1990) Mutations in polycombeotic, a Drosophila polycomb-group gene, cause a wide range of maternal and zygotic phenotypes. Genetics 125: 91–101.
- 41. Zusman S, Patel-King RS, Ffrench-Constant C, Hynes RO (1990) Requirements for integrins during Drosophila development. Development 108: 391–402.
- 42. MacKrell AJ, Blumberg B, Haynes SR, Fessler JH (1988) The lethal myospheroid gene of Drosophila encodes a membrane protein homologous to vertebrate integrin beta subunits. Proc Natl Acad Sci U S A 85: 2633–2637.
- 43. Wilcox M, DiAntonio A, Leptin M (1989) The function of PS integrins in Drosophila wing morphogenesis. Development 107: 891–897.
- 44. Casares F, Mann RS (2001) The ground state of the ventral appendage in Drosophila. Science 293: 1477–1480.
- 45. Casares F, Mann RS (1998) Control of antennal versus leg development in Drosophila. Nature 392: 723–726.
- 46. Struhl G (1982) Genes controlling segmental specification in the Drosophila thorax. Proc Natl Acad Sci U S A 79: 7380–7384.
- 47. Snodgrass RE (1935) Principles of insect morphology. New York ; London: McGraw-Hill book company inc. ix,667 p.
- 48. Snodgrass RE (1958) Evolution of arthropod mechanisms. Washington,: Smithsonian Institution. 77 p. p.
- 49. Mahaffey JW, Griswold CM, Cao QM (2001) The Drosophila genes disconnected and disco-related are redundant with respect to larval head development and accumulation of mRNAs from deformed target genes. Genetics 157: 225–236.
- 50. Mackay TF, Stone EA, Ayroles JF (2009) The genetics of quantitative traits: challenges and prospects. Nat Rev Genet 10: 565–577.
- 51. Visscher PM, Brown MA, McCarthy MI, Yang J (2012) Five years of GWAS discovery. Am J Hum Genet 90: 7–24.
- 52. Lum L, Yao S, Mozer B, Rovescalli A, Von Kessler D, et al. (2003) Identification of Hedgehog pathway components by RNAi in Drosophila cultured cells. Science 299: 2039–2045.
- 53. Yao S, Lum L, Beachy P (2006) The ihog cell-surface proteins bind Hedgehog and mediate pathway activation. Cell 125: 343–357.
- 54. Camp D, Currie K, Labbe A, van Meyel DJ, Charron F (2010) Ihog and Boi are essential for Hedgehog signaling in Drosophila. Neural Dev 5: 28.
- 55. Yan D, Wu Y, Yang Y, Belenkaya TY, Tang X, et al. (2010) The cell-surface proteins Dally-like and Ihog differentially regulate Hedgehog signaling strength and range during development. Development 137: 2033–2044.
- 56. Zheng X, Mann RK, Sever N, Beachy PA (2010) Genetic and biochemical definition of the Hedgehog receptor. Genes Dev 24: 57–71.
- 57. McLellan JS, Yao S, Zheng X, Geisbrecht BV, Ghirlando R, et al. (2006) Structure of a heparin-dependent complex of Hedgehog and Ihog. Proc Natl Acad Sci U S A 103: 17208–17213.
- 58. Zhang F, McLellan JS, Ayala AM, Leahy DJ, Linhardt RJ (2007) Kinetic and structural studies on interactions between heparin or heparan sulfate and proteins of the hedgehog signaling pathway. Biochemistry 46: 3933–3941.
- 59. McLellan JS, Zheng X, Hauk G, Ghirlando R, Beachy PA, et al. (2008) The mode of Hedgehog binding to Ihog homologues is not conserved across different phyla. Nature 455: 979–983.
- 60. Wu JT, Lin WH, Chen WY, Huang YC, Tang CY, et al. (2011) CSN-mediated deneddylation differentially modulates Ci(155) proteolysis to promote Hedgehog signalling responses. Nat Commun 2: 182.
- 61. Wei N, Serino G, Deng XW (2008) The COP9 signalosome: more than a protease. Trends Biochem Sci 33: 592–600.
- 62. Schwechheimer C, Isono E (2010) The COP9 signalosome and its role in plant development. Eur J Cell Biol 89: 157–162.
- 63. Tsuge T, Matsui M, Wei N (2001) The subunit 1 of the COP9 signalosome suppresses gene expression through its N-terminal domain and incorporates into the complex through the PCI domain. J Mol Biol 305: 1–9.
- 64. Leptin M, Bogaert T, Lehmann R, Wilcox M (1989) The function of PS integrins during Drosophila embryogenesis. Cell 56: 401–408.
- 65. Bader M, Benjamin S, Wapinski OL, Smith DM, Goldberg AL, et al. (2011) A conserved F box regulatory complex controls proteasome activity in Drosophila. Cell 145: 371–382.
- 66. Zaiss DM, Standera S, Kloetzel PM, Sijts AJ (2002) PI31 is a modulator of proteasome formation and antigen processing. Proc Natl Acad Sci U S A 99: 14344–14349.
- 67. Schneuwly S, Kuroiwa A, Gehring WJ (1987) Molecular analysis of the dominant homeotic Antennapedia phenotype. EMBO J 6: 201–206.
- 68. Sunkel CE, Whittle JRS (1987) Brista: a gene involved in the specification and differentiation of distal cephalic and thoracic structures in Drosophila melanogaster. Development Genes and Evolution 196: 124–132.
- 69. Rauskolb C, Smith KM, Peifer M, Wieschaus E (1995) extradenticle determines segmental identities throughout Drosophila development. Development 121: 3663–3673.
- 70. Dong PD, Dicks JS, Panganiban G (2002) Distal-less and homothorax regulate multiple targets to pattern the Drosophila antenna. Development 129: 1967–1974.
- 71. Coste B, Xiao B, Santos JS, Syeda R, Grandl J, et al. (2012) Piezo proteins are pore-forming subunits of mechanically activated channels. Nature 483: 176–181.
- 72. Kim SE, Coste B, Chadha A, Cook B, Patapoutian A (2012) The role of Drosophila Piezo in mechanical nociception. Nature 483: 209–212.
- 73. Eisenhoffer GT, Loftus PD, Yoshigi M, Otsuna H, Chien CB, et al. (2012) Crowding induces live cell extrusion to maintain homeostatic cell numbers in epithelia. Nature 484: 546–549.
- 74. Murali T, Pacifico S, Yu J, Guest S, Roberts GG 3rd, et al. (2011) DroID 2011: a comprehensive, integrated resource for protein, transcription factor, RNA and gene interactions for Drosophila. Nucleic Acids Res 39: D736–743.
- 75. Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T (2011) Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27: 431–432.
- 76. Chen HW, Marinissen MJ, Oh SW, Chen X, Melnick M, et al. (2002) CKA, a novel multidomain protein, regulates the JUN N-terminal kinase signal transduction pathway in Drosophila. Mol Cell Biol 22: 1792–1803.