|Home | About | Journals | Submit | Contact Us | Français|
Conceived and designed the experiments: JWM XS NG ML JBR TP. Performed the experiments: JWM XS NG ML JBR TP. Analyzed the data: JWM NG ML JBR XS. Wrote the paper: JWM NG.
The adult Drosophila melanogaster body develops from imaginal discs, groups of cells set-aside during embryogenesis and expanded in number during larval stages. Specification and development of Drosophila imaginal discs have been studied for many years as models of morphogenesis. These studies are often based on mutations with large developmental effects, mutations that are often lethal in embryos when homozygous. Such forward genetic screens can be limited by factors such as early lethality and genetic redundancy. To identify additional genes and genetic pathways involved in leg imaginal disc development, we employed a Genome Wide Association Study utilizing the natural genetic variation in leg proportionality found in the Drosophila Genetic Reference Panel fly lines. In addition to identifying genes already known to be involved in leg development, we identified several genes involved in pathways that had not previously been linked with leg development. Several of the genes appear to be involved in signaling activities, while others have no known roles at this time. Many of these uncharacterized genes are conserved in mammals, so we can now begin to place these genes into developmental contexts. Interestingly, we identified five genes which, when their function is reduced by RNAi, cause an antenna-to-leg transformation. Our results demonstrate the utility of this approach, integrating the tools of quantitative and molecular genetics to study developmental processes, and provide new insights into the pathways and networks involved in Drosophila leg development.
In the fruit fly, Drosophila melanogaster, development of the adult body begins when small clusters of cells are set-aside during embryogenesis to form the imaginal discs –. The leg primordia arise in the thoracic segments at positions where segmentally repeated expression of the signaling factor Wingless (Wg) activates expression of Distal-less (Dll) . Abdominal Hox genes block Dll activation posterior to the thoracic segments , while Decapentaplegic (Dpp) and the Epidermal Growth Factor Receptor pathways limit Dll expression dorsally and ventrally, respectively .
After the disc primordia are established they receive patterning instructions via a series of signaling molecules, morphogens and transcription factors expressed during the larval stages that establish the proximal-distal (P-D) pattern of the legs (Fig. 1A). The hedgehog (hh) gene is activated in the posterior compartment of the disc through the action of the transcription factor Engrailed, and while Engrailed defines cells of the posterior compartment, the Hh signal is transmitted to cells of the anterior compartment where it initiates P-D development . On the anterior side of the anterior-posterior border, where Hh signal is strongest, wg and dpp are activated; Dpp is responsible for dorsal fate and Wg for ventral. In addition to defining the dorsal-ventral orientation, these morphogens also work together to define the P-D axis of the leg. In the center of the disc where cells experience high levels of both Wg and Dpp, Dll is activated to establish the distal portions of the leg (mid-tibia through tarsus) –. As the mutual presence of Wg and Dpp decreases, a threshold is reached that permits the activation of dachshund (dac), which is responsible for patterning the middle leg regions (femur and tibia) . Near the edge of the disc, where the mutual concentration of Wg and Dpp is lowest, homothorax (hth) is expressed. Here, the Hth protein imports Extradenticle (Exd) into the nucleus, specifying the proximal portion of the leg and its connection with the body wall , , . Recently, an alternative model has been suggested proposing different molecular controls activating medial fate , , .
Many of the homologs of genes and genetic pathways that pattern Drosophila legs have been shown to be important in the development and growth of appendages and appendage-like structures in other metazoans , , . Further, misexpression of many of these genes contributes to developmental disorders and cancer , –.
Though we know many of the factors that establish or pattern appendages during Drosophila development, gaps still remain in our knowledge, gaps that, when filled, will solidify our understanding of how molecular networks establish appendage and organ development. For example, many of the genes known to be involved encode transcription factors, but their targets remain largely a mystery , and even some major players in leg development were missed in directed screens for appendage factors . It is critical to fill these gaps to fully understand the genetic architecture of appendage development.
Like any characteristic of the adult fly, appendage morphology is a multigenic trait. Variation in expression, coding ability or mRNA stability of allelic combinations within a population will cause variation in the final morphology of the appendage, within tolerances of morphological constraints. We have taken advantage of this natural variation and employed a genome-wide association study (GWAS) using the wild-derived Drosophila melanogaster lines of the Drosophila Genetic Reference Panel (DGRP, ) to identify genes contributing to leg development. We identified single-nucleotide polymorphisms (SNPs) associated with variation in proportion of the leg segments relative to total leg length. Candidate genes were selected based on proximity to the SNPs associated with this variation and were further tested using in situ hybridization and RNA interference (RNAi) to determine whether or not the genes had a role in leg development. We identified genes from known pathways that had not been previously associated with leg development as well as previously uncharacterized genes, and demonstrated their role in this process. In addition, we identified five genes that, when expression is reduced, cause an antenna-to-leg transformation. Our results will help provide a more comprehensive view of the processes involved in appendage development in Drosophila.
The DGRP lines were provided by the Mackay lab (NCSU, Raleigh, NC) and raised in standard environmental conditions: plastic vials with cornmeal-agar-molasses media placed in 25°C incubator. We initially screened the set of 40 DGRP lines that were to be the first sequenced. Males and females were randomly selected from each line and seven of each placed in four individual vials for egg laying. Parents were removed after three to four days depending on the amount of larval activity. We extended our screen to 100 lines (45 and 55 each) that were later sequenced to improve the power of association mapping. For these lines, 20 to 100 adults were placed in a collection cup and allowed to lay eggs on grape juice agar plates that were changed daily. After allowing an additional 24 hours for larvae to hatch, fifty randomly selected larvae were transferred to each of four fresh vials. Adult flies were preserved in ethanol 3 to 5 days after eclosing. Analysis of variance (ANOVA) indicated that there was no significant difference between flies collected in these two methods (data not shown).
First (T1) and second (T2) thoracic legs were dissected and mounted in 70% glycerol on glass slides. Images of the legs were taken with a MicroPublisher camera (QImaging, Surrey, BC, Canada) mounted on a Zeiss AxioScope microscope (Thornwood, NY, USA) at 10× magnification. Three flies of each sex from each vial were randomly selected for leg dissection. Measurements of the legs (Fig. S1) were recorded using Adobe Photoshop and ImageJ . Measurements of the femur, tibia, and tarsal segments were recorded separately (Table S1). Since the proximal end of femurs could not be seen clearly in some images, femur length was measured as the distance of the central line between the most proximal bristle socket to the distal end of the femur. Tibia length was measured as the distance between the proximal and distal ends. Each tarsal segment length was measured as the shortest distance between the ends and joint centers where the tarsus is noticeably thinner. The tarsal segments were added together to get total tarsal length, and the lengths of all segments were summed to arrive at “total” leg length.
Factorial, mixed model ANOVAs of form Y=μ + S + L + V(L) + SxL + SxV(L) + ε were used to partition variation in leg segment proportion between sexes (S, fixed), DGRP lines (L, random), vial from which the measured fly came (V, random), the SxL interaction (random), the SxV interaction (random), and the error variance (ε) (Fig. S2 and Table S2). Association between variability in leg segment proportionality and SNP markers was tested using SAS software (Cary, NC, USA). Markers were tested for association using a simple ANOVA model: phenotype=SNP, assayed independently by position, by sex, and by leg type (i.e. T1, T2). Sites with a minor allele frequency of less than five were removed from analysis. We used data from Freeze One of the DGRP  for our analysis. We employed Fisher's method for combining P-values to merge data from both legs and from both sexes. We sorted by test statistics as a ranking method only; we did not convert the number to a P-value.
Our goal was to identify SNPs that affect general appendage development, and not those that might be sex or thoracic segment specific. Therefore, we required that SNPs affect leg proportionality in both thoracic segments and both sexes. If this were the case they would have lower P-values in all categories, both sexes and legs both thoracic segments. To sort and identify such SNPs, we calculated the summary statistic using Fisher's method of combining P-values. Fisher's method relies on independence of the traits measured, and this is not the case with our leg data. Therefore, the P-values obtained would be anti-conservative. We sorted by the summary statistic as a ranking method only, and did not utilize the P-value for any analyses. For this reason, we only report the summary statistic in our table (Table S6).
From association mapping, SNP positions that had the largest test statistic were identified independently for the femur, tibia, and tarsus. During early rounds of experiments we identified a few candidates with leg imaginal disc expression (see below). We continued on with these candidates though some had somewhat lower significance as determined in later rounds of association. The GBrowse program on FlyBase version 2012_3 (www.flybase.org, ) was used to identify candidate genes by proximity to SNP positions. If a SNP was found between two neighboring genes, both genes were analyzed as candidates. Genes near SNPs of interest with a known function in leg development were not examined further. SNP positions greater than 8 Kbp from an annotated gene were discarded.
Genes were cloned using primers (Table S3) designed from exon sequences on FlyBase . Vector NTI (Life Technologies Corporation, Grand Island, NY, USA) was used to help design primers for a coding region of each candidate gene. These regions were amplified using PCR of Drosophila genomic DNA extracted from Ore-R flies. PCR products were then cloned into a pGEM vector using the pGEM-T Easy Vector System (Promega, Madison WI, USA).
In situ hybridization was used to determine expression patterns of candidate genes in imaginal discs and embryos. Drosophila imaginal discs from third instar larvae were dissected in PBS and placed into standard in situ fixative. Digoxigenin-labeled antisense RNAs were prepared and in situ hybridization was done essentially as in .
RNAi lines were obtained from the Vienna Drosophila RNAi Center (VDRC)  directly, and from the Transgenic RNAi Project (TRiP) from Harvard Medical School through the Bloomington Stock Center (Table S4). Males from RNAi lines were crossed to virgin females from five Gal4 lines: rotund (rn) , decapentaplegic (dpp) , armadillo , Distal-less (Dll) and teashirt . Flies were grown at room temperature, though some were repeated at 25 degrees. Progeny from crosses were examined to determine whether or not there was an effect of silencing candidate genes in Gal4 driver domains. Efficacy of RNAi was confirmed by examining expression of the candidate gene in the RNAi larvae using in situ hybridization (data not shown).
Our goal was to assess the use of the tools of quantitative genetics for identifying genes involved in Drosophila development. Variation in leg patterning is a multigenic trait, and though there will be constraints on the degree of variation, if measureable, we could use statistical analyses to map candidate loci contributing to that variation. Though previous forward genetic studies have revealed a wealth of information on appendage development in Drosophila, the use of natural variation offered a unique perspective with different sensitivities. We chose to examine variation in proportion of leg segments to bring into focus current models of leg patterning (Fig. 1). For example, a small change in the strength of Hh, Dpp or Wg signals could alter the expression of determinant genes (hth, dac, or Dll), which, in turn, could shift the boundaries of leg segmentation and, hence, change proportioning without necessarily affecting over-all length. Therefore, variation in proportion of the segments (femur, tibia, tarsus) seemed the more informative trait to study if we wanted to identify genes that might affect early appendage patterning. Further, this would remove or reduce other influences, such as variation in metabolism or growth rate.
We dissected legs from first (T1) and second (T2) thoracic segments measuring twelve individuals of both sexes from 137 DGRP lines , . Of these, 117 were included in the Freeze 1 sequences of the DGRP (Fig. S1). The proportion of each segment was calculated as compared to the total as defined in Materials and Methods. Statistical analyses indicated strong correlations between leg segments of both thoracic segments and both sexes within a line, as we would expect if the variation were due to alleles of general appendage-patterning genes (Table S5). Further, there were significant correlations between most of the segments within a leg (femur, tibia and tarsus, Table S5), though the correlation between proportion of tibia and of femur was relatively low (not significant in male T1). In addition, the correlation between the tarsus and other segments was negative, which might be expected if expanding the proportion of one segment would require a simultaneous reduction in another. The variation in proportionality for the leg segments for males and females of the 117 DGRP lines is shown in Fig. S2.
The top 20 SNP positions associated with variation in each of the three leg segments, along with the genes identified as closest to the SNP are listed in Table S6. (FlyBase numbers for most candidate genes are included in the Table S6. Those not in the table are included in the text.) Although some SNPs were not located near any known or predicted genes, many SNP positions were found in introns, exons, and protein coding regions, as well as just upstream or downstream of genes. Several of the genes neighboring or harboring the SNPs were already known to be involved in leg development, including Moesin (Moe) , Enhancer of zeste (E(z)) ,  and myospheroid (mys; FBgn0004657) – or suspected to be, disco-related (disco-r) . Identification of these genes helped to validate our methods. Some candidates were named genes with known functions, but their role in leg development had not been examined. These were selected for further study to assess whether they also had roles in leg development. Some candidate genes had not been previously characterized.
To determine if our candidate genes played a role in leg development, we first examined distribution of mRNAs encoded by these genes using in situ hybridization, looking for expression in the leg imaginal discs of larvae or their primordia in embryos (Fig. 2 and Table S7). Of the 37 genes examined, we did not detect staining in the leg imaginal discs for seven (CG5549, M-spondin (mspo), dpr8, CG7949, CR33218, CG42353, CG42354). Of these, only mspo and CG7949 had embryonic expression, but neither was expressed in the disc primordia. These seven genes were excluded from further analysis.
Transcripts from several genes accumulated in broad regions of the leg discs, such as brother of ihog (boi) and eIF6, while others were enhanced in the presumptive tarsal region, like CG43173 and PI31 (FBgn0033669). We also observed staining for many of these genes in the developing antennae, where some were also patterned. This is not unexpected as the antennae and legs are serially homologous structures –. Transcripts for some genes were observed in the wing, haltere and eye as well. Of particular note was the Hedgehog pathway member boi, which, in the wing discs, had enhanced staining in the wing blade region outlining the expression domains of wg and dpp (Fig. 2). We observed staining in embryos with 17 of the 30 probes that had expression in the imaginal discs; however, with the exception of disco-r , none of these appeared to have staining specifically in the developing disc primordia (data not shown).
To determine whether any of our candidate genes were necessary for normal leg development, we used RNAi to reduce gene function of those candidates with mRNAs detected in the leg discs. We used transgenic RNA interference lines from the Vienna Drosophila RNAi Center (VDRC)  and Transgenic RNAi Project (TRiP) with several Gal4 drivers (see Materials and Methods). However, the Dll-Gal4 driver , expressing Gal4 in the medial-to-distal portion of the leg discs, was most informative, so results described below (and summarized in Table 1) are primarily from the use of this driver (referred to as Dll-RNAi below). RNAi lines were not available for four candidate genes (yantar, GG43173, CG43174 and Hexosaminidase 1 [FBgn0041630]).
We observed altered leg morphology with six candidate genes. At room temperature, Dll-RNAi with COP9 complex homolog subunit 1b (CSN1b) resulted in smaller legs relative to overall body size (Fig. 3B) and occasional loss of the fifth tarsal segment. These flies also had reduced aristae in the antennae (Fig. 4B), and males had reduced sex-combs (data not shown). At 25 degrees reduction of CSN1b caused loss of multiple tarsal segments. Dll-RNAi of PI31 reduced the relative size of tarsal segments, occasionally deleting one of the middle tarsal segments (Fig. 3C), and also deleted the aristae in the antennae (Fig. 4C).
Dll-RNAi of eIF6, CG6841 and Connector of kinase to AP-1 (Cka) each resulted in fused and deleted tarsi, with the later two also deleting terminal structures, the claws and pulvilli (Fig. 3D–F). In addition, Dll-RNAi of eIF6 caused reductions in the sex-comb tooth number (data not shown). RNAi of CG6841 and Cka also altered or deleted distal antennal segments (Fig. 4D, E).
In some cases Dll-RNAi was lethal. For example, Dll-RNAi with sickie (sick) was lethal prior to pupal formation, so we could not determine if there was an effect on leg development with that driver. However, with the rn driver (rn-RNAi, expressing Gal4 in the distal portion of the leg and in the wing blade region) reducing sick had no apparent effect in the legs, except with sex-comb tooth numbers in males, which were either greatly decreased or increased (data not shown). Reduced expression of bnl (FBgn0014135) and Chronologically inappropriate morphogenesis (chinmo) caused early death with multiple drivers.
RNAi of many candidates, driven by either rn- or Dll-Gal4, also resulted in altered wing morphology, ranging from a hooded phenocopy with slight reduction in size accompanied by a downward curvature, to near total loss of wing blade (data not shown).
Somewhat surprisingly, Dll-RNAi with five candidates (CG9129, CG32333, CG30371, Piezo [FBgn0031993] and disco-r) caused antenna-to-leg transformations (for example see CG9129 in Fig. 4F), yet no overt disruption of leg morphology was observed with these candidates. Since all five RNAi lines were from the VDRC we were suspicious that this could be due to a background effect with some of the VDRC RNAi lines. To test this, we generated flies with the RNAi insertions and Dll-Gal4, with and without tubulin-Gal80 on the X chromosome. Presence of Gal80 would block activation of the RNAi construct, so if the antenna-to-leg transformations were due to a genetic background effect in some of the VDRC lines, then we would expect to observe the transformation regardless of whether Gal80 was present. However, presence of Gal80 completely blocked the antenna-to-leg transformation indicating that the transformations were due to activation of the RNAi constructs, likely through reduction of the candidates.
Studies of appendage development in Drosophila have been extremely valuable in understanding the genetic networks governing many aspects of organ and structure development. Further, comparative studies in other organisms have opened the door for examining how genetic changes lead to diversification of structures and, thereby, evolution. GWAS takes advantage of the natural variation in a trait or phenotype to identify genes that contribute to the development of that trait. It is most important to understand that it is the combination of many genes with small effect that gives rise to the natural variation of a trait. Hence, a large effect is not expected from any one SNP. By associating a particular SNP with variation in the trait of interest, we can map the positions of genes that contribute to the trait. Thus GWAS offers a different perspective on gene identification and can circumvent issues affecting more standard forward genetic screens, such as early lethality and redundancy. Although GWAS has been used in the past to locate genetic causes of variation in behavioral and quantitative traits and causes of heritable diseases , , it has not, to our knowledge, been employed to identify genes involved in developmental processes. Our study demonstrates that integrating the tools of Quantitative and Developmental Genetics can be very productive, addressing issues of complex genetic interactions governing development.
The presence of functionally redundant paralogs in the genome could mask a leg development role in most forward genetic analyses. From our GWAS, we identified several candidates where a paralog is known or suspected (Table S8). Of these, boi, in particular, stands out. boi, and its paralog, interference hedgehog (ihog, FBgn0031872) have recently been shown to encode proteins critical for Hedgehog (Hh) signaling during Drosophila wing development –. Currently, neither has been implicated in leg development. However, since Hh signaling is critical in the legs, it is reasonable that these genes could play a part in leg development and that SNPs in these genes could contribute to quantitative variation in leg phenotypes. Therefore, it is not surprising they would show up in our GWA Study. Boi and Ihog bind Hh, strengthening the activation of the Hh signaling pathway, while limiting the range of the ligand , . In the leg discs, any change that broadens or narrows the range of Hh could, in turn, broaden or narrow the expression domains of the downstream responders (wg or dpp), thereby altering the domains of expression of the transcription factors responding to these signals (Fig. 1). While this potential connection to the leg development model makes boi an intriguing candidate to pursue, reduction of boi (or ihog) alone (through Dll-RNAi) caused no obvious malformation of legs or antennae. We suspect that the redundancy of these genes prevented any gross abnormalities in leg development. Presumably, subtle tweaks to boi expression or function could have effects on leg patterning within the realm of a multigenic characteristic of a population.
Redundancy might also have an impact on the antenna-to-leg phenocopy we observed with five candidate genes (CG9129, CG32333, CG30371, Piezo and disco-r, see Table S8). We suspect that redundancy is the reason that we did not observe malformation of the legs simultaneously with the antenna-to-leg transformations, and that these candidates actually do have roles in leg development. This is certainly true for disco-r, where the known paralog, disconnected (disco) masks the role of loss of disco-r alone during leg development , . Perhaps, antennae are more sensitive to RNAi of one particular paralog versus the other, or that the antennae are more sensitive than the legs to the reduced Dll function of the Dll-Gal4 mutation in combination with the RNAi (see below). Indeed, we have noted that the RNAi process appears to be more robust in the antennae than in the legs (data not shown). Because of redundancy, it is possible that neither boi nor the antenna-to-leg candidates would easily have been discovered from standard forward genetic screens. This demonstrates the power of GWAS as a tool to overcome some of the limitations of more traditional methods of identifying genes involved in developmental pathways.
As a final comment about target gene identification from our GWAS study, we do expect some false positives in our analysis. An estimated False Discovery Rate (FDR) indicates that approximately 20 SNPs could have arisen by chance using a cutoff of 1e-5 (Table S9). However, because this FDR estimation assumes all tests are independent, it is an overestimate. Lack of independence arises because of linkage disequilibrium between neighboring SNPs, and this would mean that the actual number of analyzed SNPs is somewhat lower than that used for the FDR estimation. Hence, the false discovery estimate is inflated.
In Table 2 we summarize what is known about the proteins and their functions of our candidates. Though a few of these encode known proteins, many are computed genes (CGs) at this point, so the data obtained from this study will help in defining pathways associated with these genes.
Three candidates potentially could be involved in expanding or restricting the signals or morphogens involved in leg development (boi, CSN1b, mys). From the model shown in Fig. 1, it is easy to understand how altering components of Hh signaling could contribute to changes in leg proportionality. As described above, boi is known to function in the Hh pathway. CSN1b encodes a subunit of the COP9 signalosome/ubiquitin-proteasome system, which has recently been implicated in Hh signaling in the wing . However, the role of this highly conserved signalosome ,  could be broader, as a potential function in transcription regulation . The Drosophila mys gene encodes a homolog of vertebrate beta PS integrin , , , which is involved in cell adhesion and signaling. Loss of mys leads to loss of portions or all of the leg .
PI31 might also be involved in signal modulation. PI31 is a highly conserved regulator of proteasome function , . It has both inhibitory and activation properties, depending upon the test system. In Drosophila, PI31 binds to the F-box factor, Nutcracker to activate proteasome and caspase during sperm maturation. It is also active in cell cycle regulation. Perhaps, like CSN1b, PI31 might be involved in a signal modulation that involves the proteasome.
Several genes identified in our GWAS encode proteins with transcription factor domains (Table 2); particularly prevalent were genes encoding Zn-finger transcription factors (CG43444, disco-r, chinmo). At present, we do not know whether these factors might function upstream of the Hh signal or are integrated into the response. We do know that ectopic expression of disco-r or its paralog disco can induce ectopic ventral appendage structures , and that cells require at least one of these genes to form ventral appendages (J.R and J.W.M. unpublished). Further work will be required to discern where these other genes fit into the appendage network.
The classic antenna-to-leg transformation arises from ectopic expression of the homeotic gene Antennapedia , . Hth is central to antennal specification and, therefore, the transformation. When Hth and Exd are present in the same cells, Hth helps recruit Exd to the nucleus, inducing antennal fate. If Hth is reduced, Exd remains cytoplasmic and the antennae develop leg-like characteristics.
Only a few genes are known where recessive mutations cause this transformation, Dll, hth , , exd  and spineless . Ectopic Antennapedia or reduction of either Dll or Spineless reduces expression of hth, thereby decreasing import of Exd into the nucleus. Given this rather straightforward model for antennal specification, we were quite surprised to find that five different RNAi lines caused an antenna-to-leg transformation when driven with Dll-Gal4. However, since ectopic expression of disco-r can induce ectopic antennae in the eyes (J. W. Mahaffey, unpublished), it is perhaps not surprising that reduction of disco-r can cause this transformation.
The only other gene of our antenna-to-leg candidates with a postulated function is Piezo, which encodes a mechanically gated ion channel thought to be involved in reception of mechanical stimuli , . How this and the other uncharacterized genes integrate into antennal, and possibly leg, development remains unknown. Perhaps it is worth noting that recent data indicate that Piezo is involved in tissue homeostasis, functioning to extrude cells from epithelia under crowded conditions .
An important point to consider with the antenna-to-leg transformations is that the Gal4 insertion of Dll-Gal4 creates a hypomorphic allele of Dll, Dllmd13 . While Dll-Gal4 alone does not cause an antenna-to-leg transformation, this mutation could generate a sensitized background for our candidate RNAi, so it is possible that the transformations we observe are the result of a synergistic effect between reduction of the candidate and reduced Dll. Though we do not know what, if any, contribution Dll-Gal4 has on the transformation, inclusion of Gal80 in our experiments demonstrates that the RNAi activation was required, suggesting that these genes are likely involved in the “antenna or leg” genetic decision.
The genes we have identified in this study might be expected to participate in some way with other genes known in the appendage developmental network (for example, hh, wg, dpp, Dll, dac, and hth), and we thought perhaps that we could identify possible network links using programs such as DroID . These programs provide a mechanism to search for potential interactions discerned from prior genetic research and genome wide studies of protein interactions. We used DroID and Cytoscape  to search for known and predicted protein-protein or genetic interactions that could link our candidates to the canonical appendage development network. There was little information for many of our candidates, which was not surprising since they are CG's; however, three networks predicted for Cka, PI31 and CG6841 were informative and are shown in (Fig. 5). These three genes were two steps or less from the canonical genes mentioned above, though dac was an exception sharing few, if any, connections. Not surprisingly, a network of proteasome-related genes connected PI31 to the canonical pathway (Fig. 5A); this may support our hypothesis that PI31 could play a role in regulating signaling processes via protein degradation. PI31 also shared connections with other of our candidates, eIF6, Cka and disco-r. Cka (Fig. 5B), the RNAi of which caused a strong phenocopy in the leg, was linked to the canonical leg pathway through a number of genes encoding kinases, including members of the JNK pathway. The role of Cka in the JNK pathway has previously been shown to be important in the proper expression of dpp during dorsal closure , so similar mechanisms may play an important role in Dpp function during leg development. Another candidate with a strong RNAi phenocopy, CG6841 (Fig. 5C), has not been extensively studied, yet this network analysis identified several connections to the pathways of known leg development genes, including disco and engrailed, an early component of the cascade . These links support the usefulness of our method, and perhaps can be used to identify other novel leg development genes.
We have shown that Genome Wide Association Studies provide a useful method for identifying genes contributing to a developmental process. Further, we have evidence that this method can overcome some of the limitations of traditional developmental genetic screens, such as redundancy. We demonstrate this by successfully utilizing a GWAS screen to identify several new components of signaling, transcription and unknown pathways that contribute to Drosophila appendage development. Thus, we believe these results show the applicability of this method to any developmental trait. All that is required is measurable variation in morphology of a relevant trait arising from the developmental process targeted. This method should prove beneficial for extending what is known about even well studied developmental genetic pathways.
Measuring legs. An example DGRP female, T2 leg marked to show the measuring method. In the femur, a black circle marks each of the two most proximal bristles. Measurement begins in the middle of the line (black) between these two bristles and ends at the end of the segment (red line). The tibia is measured from its beginning, in the joint with the femur, and ends where it forms a joint with the first tarsal segment (blue line). Each tarsal segment was measured separately (green lines), beginning with the joint to the more proximal segment and ending with the joint to the more distal segment (green arrow heads). In the case of the first tarsal segment, measurement began at the joint with the tibia. Measuring the final tarsal segment ended at the tip of the leg, not including terminal structures.
Variation in proportions of leg segments. Mean leg segment proportion for each measured DGRP line is graphed on the Y-axis, organized from smallest to largest based on female values (red) for each trait. Males are shown in blue. The P-value for line effect is also shown, and was very significant in all cases.
ANOVA of leg proportions.
Primers for gene cloning.
Source of RNAi Lines.
Correlations between traits. Degree of significance from the lowest possible value (0) and the highest possible value (1) were also calculated (recorded in columns ‘t’ and ‘P-value’). Fe, femur proportion; Ta, tarsus proportion; Ti, tibia proportion; F, female, M, Male; A, both sexes; T1, first thoracic leg; T2, second thoracic leg.
Top candidate genes. Candidates were sorted using Fisher's method for combining P-values to identify SNPs with the highest summary statistic (Fishers) in all four measurements (T1 and T2 and Males and Females). Candidate genes were identified by proximity to SNPs In cases where multiple SNPs identified a candidate, only the most significant SNP position is shown. FlyBase IDs are included in this table.
Observed expression of analyzed candidates. *=selected early, †=previously published, P=patterned expression, U=ubiquitous expression.
Possibility of paralogs. Probable=Genes with potential paralogs. Genes with known redundant paralogs are labeled as “Yes” along with the name of the paralog. The five candidates that gave antenna-to-leg transformation with the Dll driver are marked with an asterisk.
False Discovery Rate Estimate.
We thank Drs. Trudy Mackay, Michael Magwire, Julian Ayroles, Dahlia Nelson and Eric Stone for helping us with the statistical aspects of this project. We thank the TRiP at Harvard Medical School (NIH/NIGMS R01-GM084947) and the Bloomington Drosophila Stock Center for providing the several of the transgenic RNAi fly lines used in this study.