• Loading metrics

An Indel Polymorphism in the MtnA 3' Untranslated Region Is Associated with Gene Expression Variation and Local Adaptation in Drosophila melanogaster


Insertions and deletions (indels) are a major source of genetic variation within species and may result in functional changes to coding or regulatory sequences. In this study we report that an indel polymorphism in the 3’ untranslated region (UTR) of the metallothionein gene MtnA is associated with gene expression variation in natural populations of Drosophila melanogaster. A derived allele of MtnA with a 49-bp deletion in the 3' UTR segregates at high frequency in populations outside of sub-Saharan Africa. The frequency of the deletion increases with latitude across multiple continents and approaches 100% in northern Europe. Flies with the deletion have more than 4-fold higher MtnA expression than flies with the ancestral sequence. Using reporter gene constructs in transgenic flies, we show that the 3' UTR deletion significantly contributes to the observed expression difference. Population genetic analyses uncovered signatures of a selective sweep in the MtnA region within populations from northern Europe. We also find that the 3’ UTR deletion is associated with increased oxidative stress tolerance. These results suggest that the 3' UTR deletion has been a target of selection for its ability to confer increased levels of MtnA expression in northern European populations, likely due to a local adaptive advantage of increased oxidative stress tolerance.

Author Summary

Although molecular variation is abundant in natural populations, understanding how this variation affects organismal phenotypes that are subject to natural selection remains a major challenge in the field of evolutionary genetics. Here we show that a deletion mutation in a noncoding region of the Drosophila melanogaster Metallothionein A gene leads to a significant increase in gene expression and increases survival under oxidative stress. The deletion is in high frequency in three distinct geographic regions: in northern European populations, in northern populations along the east coast of North America, and in southern populations along the east coast of Australia. In northern European populations the deletion shows population genetic signatures of recent positive selection. Thus, we provide evidence for a regulatory polymorphism that underlies local adaptation in natural populations.


Natural populations adapt constantly to their changing environments, with alterations in protein sequences and gene expression providing the main sources of variation upon which natural selection can act. At present, understanding how changes in gene expression contribute to adaptation is one of the major challenges in evolutionary genetics. The fruit fly Drosophila melanogaster has populations distributed throughout the world, with environments ranging from tropical to temperate. On the basis of biogeographical, anatomical and population genetic studies, the center of origin of D. melanogaster has been inferred to be in sub-Saharan Africa [13]. Several genomic studies concluded that D. melanogaster underwent a population expansion around 60,000 years ago within Africa that set the ground for an out-of-Africa expansion 13,000–19,000 years ago and the subsequent colonization of Europe and Asia 2,000–5,000 years ago [46]. Because the colonization of new habitats requires that species adapt to new environmental conditions, there has been considerable interest in identifying the genetic and phenotypic changes that occurred during the out-of-Africa expansion of D. melanogaster [79].

In order to identify genes that differed in expression between a D. melanogaster population from Europe (the Netherlands) and one from sub-Saharan Africa (Zimbabwe), whole-transcriptome comparisons were carried out using adult males and females [10,11], as well as the dissected brains and Malpighian tubules of each sex [12,13]. These studies identified several hundred genes that were differentially expressed between the two populations and which represent candidates for adaptive regulatory evolution. One of the candidate genes that showed a large difference in expression between populations in the brains of both sexes was the metallothionein (MT) gene Metallothionein A (MtnA). MtnA lies on chromosome arm 3R (Fig 1) and belongs to a gene family of five members that also includes MtnB, MtnC, MtnD and MtnE [14,15]. Metallothioneins are present in all eukaryotes and have also been identified in some prokaryotes [16]. In general, MTs are cysteine-rich proteins, a feature that makes them thermostable, and have a strong affinity to metal ions, especially zinc and copper ions [17]. Some of the biological functions that have been described for MTs include: sequestration and dispersion of metal ions; zinc and copper homeostasis; regulation of the biosynthesis of zinc metalloproteins, enzymes and zinc dependent transcription factors; and protection against reactive oxygen species, ionizing radiation and metals [18]. In natural isolates of D. melanogaster, increased MtnA expression has been linked to copy number and insertion and deletion (indel) variation and is associated with increased tolerance to heavy metals [19,20].

Fig 1. Structure of the MtnA locus.

Two transcripts that differ only in their 3’ UTRs have been annotated for MtnA (MtnA-RA and MtnA-RB). Dark blue boxes represent the UTRs with the arrowheads indicating the direction of transcription. Orange boxes represent the coding exons. The thin lines joining the coding exons represent introns. The location of the polymorphic indel, which is shared by both transcripts, is indicated by the red triangle. For the coding genes flanking MtnA (CG12947 and CG8500), only the whole gene model is shown.

In this paper we show that the expression difference of MtnA between a European and a sub-Saharan African population is not associated with copy number variation, but is associated with a derived 49-bp deletion in the MtnA 3’ untranslated region (UTR). Outside of sub-Saharan Africa, the deletion shows a latitudinal cline in frequency across multiple continents, reaching very high frequencies in northern Europe. Using transgenic reporter genes, we show that the indel polymorphism in the 3’ UTR contributes to the expression difference observed between populations. Furthermore, we use hydrogen peroxide tolerance assays to show that the deletion is associated with increased oxidative stress tolerance. Population genetic analyses indicate that MtnA has been the target of positive selection in non-African populations. Taken together, these results suggest that a cis-regulatory polymorphism in the MtnA 3’ UTR has undergone recent positive selection to increase MtnA expression and oxidative stress tolerance in derived northern populations of D. melanogaster.


Differential expression of MtnA between an African and a European population of D. melanogaster

A previous RNA-seq study of gene expression in the brain found MtnA to have four times higher expression in a European population (the Netherlands) than in a sub-Saharan African population (Zimbabwe) [12]. Of the members of the Mtn gene family, only MtnA showed high levels of expression and a significant difference in expression between populations (Fig 2A). To confirm this expression difference, we performed qRT-PCR on RNA extracted from dissected brains of flies from each population following the same pooling strategy used previously [12]. With this approach, we found MtnA to have 5-fold higher expression in the European population than in the African population (Fig 2B).

Fig 2. Expression of metallothionein genes in the brain in two populations of D. melanogaster.

(A) Expression level of Mtn paralogs in the brain from RNA-seq data. Expression is reported in reads per kilobase per million mapped reads (RPKM). Only MtnA showed a significant difference in expression between a European (the Netherlands), shown in blue, and an African (Zimbabwe), shown in green, population (adjusted P < 10−7 in the RNA-seq analysis [12]). Expression of MtnC was not detected. (B) MtnA expression in the brains of European and African flies, as determined by qRT-PCR. The expression difference between populations is highly significant (t-test, P = 5x10-5). In both panels, the error bars indicate the standard error of the mean.

The RNA-seq and qRT-PCR analyses were performed on a "per gene" basis and did not discriminate between the two annotated transcripts of MtnA, which differ only in the length of their 3' UTR (Fig 1). The MtnA-RA transcript completely overlaps with that of MtnA-RB and contains no unique sequence. The MtnA-RB transcript, however, contains an extra 371 bp at the 3' end that can be used to assess isoform-specific expression. Using RNA-seq data [12], we found that the MtnA-RB isoform represents only a small proportion of total MtnA expression (1.50% in the European population and 0.13% in the African population). Thus, almost all of the observed expression difference in MtnA can be attributed to the MtnA-RA isoform. Although present at very low levels, the MtnA-RB transcript showed much higher expression (50-fold) in Europe than in Africa (S1 Table).

Absence of MtnA copy number variation

Previous studies found copy number variation (CNV) for MtnA in natural isolates of D. melanogaster and showed that an increase in copy number was associated with higher MtnA expression [19,20]. To determine if CNV could explain the observed expression difference between the European and the African populations, we assayed MtnA copy number in flies of both populations by quantitative PCR. We found no evidence for CNV within or between the populations (Fig 3). In both populations, MtnA copy number was equal to that of the control single-copy gene RpL32 and was about half that of the nearly-identical paralogs AttA and AttB [21], which can be co-amplified by the same PCR primers and serve as a positive control. These results indicate that CNV cannot account for the observed variation in MtnA gene expression.

Fig 3. Results of CNV assays.

Flies from Africa (Zimbabwe), shown in green, and Europe (the Netherlands), shown in blue, were tested for MtnA CNV. The close paralogs AttA and AttB were used as a positive control for multiple gene copies, while RpL32 was used as a single-copy reference.

An indel polymorphism in the MtnA 3' UTR is associated with expression variation

To identify cis-regulatory variants that might be responsible for the difference in MtnA expression between European and African flies, we sequenced a 6-kb region encompassing the MtnA transcriptional unit (Fig 1) in 12 lines from the Netherlands (NL) and 11 lines from Zimbabwe (ZK). In addition, we quantified MtnA expression in a subset of eight lines from each population in both the brain and the gut by qRT-PCR. Across the 6-kb region, only a polymorphic 49-bp indel and a linked single nucleotide polymorphism (SNP) in the MtnA 3’ UTR showed a large difference in frequency between the populations, being this deletion present in 10 of the 12 European lines, but absent in Africa (Fig 4A). This indel was previously observed to segregate in natural populations from North America [20]. A comparison with three outgroup species (D. sechellia, D. simulans, and D. yakuba) indicated that the deletion was the derived variant. The qRT-PCR data revealed that the two European lines that lacked the deletion had MtnA expression that was similar to that of the African lines, but much lower than the other European lines. This result held for both brain and gut expression. Taken together, these results suggest that the 3' UTR polymorphism contributes to MtnA expression variation in natural populations. Furthermore, the expression variation is not limited to the brain, but shows a correlated response in at least one other tissue (Fig 4B).

Fig 4. Association between an indel polymorphism in the MtnA 3' UTR and gene expression variation.

(A) An indel (and a linked SNP marked in gray) in the MtnA 3' UTR are the only polymorphisms within the 6-kb MtnA region that show a large difference in frequency between an African and a European population of D. melanogaster. A comparison with three outgroup species, D. sechellia (Sec), D. simulans (Sim) and D. yakuba (Yak), indicated that the deletion is the derived variant. (B) MtnA expression in the brain and the gut of eight European (NL) lines, shown in blue, and eight African (ZK) lines, shown in green. The two European lines lacking the deletion, NL11 and NL15, show lower MtnA expression than those with the deletion.

Functional test of the effect of the MtnA 3' UTR polymorphism on gene expression

To test if the 49-bp deletion in the MtnA 3' UTR has an effect on gene expression, we designed expression constructs in which the MtnA promoter was placed upstream of either a green fluorescent protein (GFP) or lacZ reporter gene. Two versions of each reporter gene were made, one with the ancestral MtnA 3' UTR sequence and one with the derived MtnA 3' UTR sequence, which has the 49-bp deletion (Fig 5A). The reporter genes were then introduced into the D. melanogaster genome by PhiC31 site-specific integration [22,23].

Fig 5. Reporter gene constructs and their expression.

(A) The gray boxes represent the MtnA promoter, which is identical between the African and European alleles. The white boxes represent the GFP/lacZ reporter genes. The blue hatched box represents the MtnA 3’ UTR with the deletion. The green box represents the MtnA 3’ UTR with the additional 49 bp marked in red. The same color scheme applies to the bar plots. (B) The two versions of the GFP reporter gene differ significantly in expression in heads (t-test, P = 0.0019) and bodies (t-test, P = 0.0046), as assayed by qRT-PCR. (C) The two versions of the lacZ reporter gene differed significantly in expression in heads (t-test, P = 0.0006) and guts (t-test, P = 0.0001) as measured by β-galactosidase enzymatic activity. The error bars represent the standard error of the mean.

Our analysis of MtnA expression in the brain and gut indicated that the difference in expression observed between African and European populations is not brain-specific (Fig 4B). This is further supported by the expression of the reporter gene constructs. For the GFP reporter gene, the presence of the 3’ UTR deletion led to increased expression in both the brain and body (Fig 5B), with the difference in expression being 2.3-fold and 1.75-fold, respectively. A similar result was found for the lacZ reporter gene, where the 3’ UTR deletion led to 1.7-fold and 1.4-fold higher expression in the head and gut, respectively (Fig 5C).

MtnA expression in the brain

MtnA shows high expression in most D. melanogaster organs, including the fat body, digestive system, Malpighian tubule, and brain [24]. Although it has been documented that MtnA and its paralogs are involved in heavy metal homeostasis and tolerance, it is poorly understood which other functions MtnA might have and in which cells it is expressed. To get a more detailed picture of MtnA expression in the brain, we examined the expression of the GFP reporter gene by confocal imaging of dissected brain tissue (Fig 6).

Fig 6. Expression of an MtnA-GFP reporter gene in the brain.

(A-C) GFP expression driven by the reporter gene construct with the ancestral MtnA 3’ UTR variant. (D-G) Higher magnification of the brain regions where GFP is expressed. AL: antennal lobe, MB: mushroom bodies, SOG: subesophageal ganglion, Lo: lobula, Me: medulla. In (G) the arrow indicates cells expressing GFP. Green: GFP, red: anti-disclarge, targeting general neuropil.

GFP expression driven by the MtnA promoter is evident in cells that form a mesh-like structure surrounding the brain and in between the neuropiles (Fig 6). MtnA does not appear to be expressed at a discernible level in neurons, as the cells expressing GFP do not have dendrites or axonal processes. The shape and localization of the cells expressing GFP in the brain suggest that they are glia, which provide neurons with developmental, structural and trophic support as well as with protection against toxic elements [2527]. In a genome-wide expression profiling study it was found that MtnA is expressed in the astrocyte glial cells of larvae and adults of D. melanogaster [28]. Although we cannot be certain that MtnA expression is limited to the glia in the brain, our results provide direct evidence that MtnA is expressed in cell types other than the copper cells of the midgut and Malpighian tubules, as previously reported [29].

Frequency of the MtnA 3' UTR deletion in additional populations

To better characterize the geographical distribution of the indel polymorphism in the MtnA 3' UTR, we used a PCR-based assay to screen ten additional D. melanogaster populations across a latitudinal range spanning from tropical sub-Saharan Africa to northern Europe (Table 1). We found that the deletion was at very low frequency in sub-Saharan Africa, but nearly fixed in populations from northern Europe. This suggests that, at least outside of the ancestral species range, there is a latitudinal cline in the deletion frequency. Indeed, when the sub-Saharan populations are excluded, there is a highly significant correlation between latitude and deletion frequency (linear regression; R = 0.95, P = 0.0004). This correlation still holds when the sub-Saharan populations are included (using the absolute value of latitude), but is weaker (R = 0.80, P = 0.001).

Table 1. Frequency of the MtnA 3’ UTR deletion in different populations.

To investigate if the clinal distribution of the MtnA 3’ UTR deletion is present on other continents, we analyzed pooled sequencing (pool-seq) data from North America and Australia [30,31]. In North America, there is a significant correlation between latitude and deletion frequency (R = 0.94, P = 0.005) (Table 2). A similar pattern was seen in Australia, although data from only two populations were available. The deletion is at a frequency of 42% in Queensland (latitude 16 S) and 61% is Tasmania (latitude 42 S). The difference in deletion frequency between the two populations is significant (Fisher’s exact test, P = 0.02).

Table 2. Frequency of the MtnA 3’ UTR deletion in North American populations.

Evidence for positive selection at the MtnA locus

To test for a history of positive selection at the MtnA locus, we performed a population genetic analysis of the 6-kb MtnA region in the original European (the Netherlands) and African (Zimbabwe) population samples. In addition, we sequenced this region in 12 lines of a Swedish population, in which the 49-bp 3' UTR deletion was at a frequency of 100% (Table 1). Across the entire region, the Zimbabwean population showed the highest nucleotide diversity, having 1.43- and 2.50-fold higher values of π than the Dutch and Swedish populations, respectively (Table 3). Tajima’s D was negative in all three populations, and was significantly negative in both Zimbabwe and the Netherlands (Table 3). This could reflect a history of past positive or negative selection at this locus, but could also be caused by demographic factors, such as population expansion.

A sliding window analysis was performed to determine the distribution of nucleotide diversity (θ) (Fig 7A) and population differentiation (Fst) (Fig 7B) across the MtnA region. The region flanking the 3’ UTR indel polymorphism showed very low sequence variation in Zimbabwe and Sweden, but higher variation in the Netherlands. This pattern is due to the fact that the ancestral state of the indel polymorphism is fixed in the Zimbabwean population and the derived state is fixed in the Swedish population. In the Dutch population, the MtnA 3’ UTR is polymorphic for the deletion (two of the 12 lines have the ancestral state). This leads to higher nucleotide diversity than in the Swedish population, because the ancestral, non-deletion alleles contain more SNPs than the derived, deletion alleles. On average, Sweden and Zimbabwe showed the greatest population differentiation, with Fst reaching a peak in the 3’ UTR of MtnA, whereas values of Fst were lowest for the comparison of the Dutch and Swedish populations, indicating that there is very little differentiation between them (Fig 7b).

Fig 7. Evidence for positive selection at the MtnA locus.

(A) Watterson’s θ of D. melanogaster populations from Zimbabwe (ZK), the Netherlands (NL) and Sweden (SU) calculated in sliding windows of 500 bp with a step size of 250 bp. (B) Fst values for pairwise comparisons of ZK, NL and SU calculated in sliding windows of 500 bp with a step size of 250 bp. (C) Selective sweep (SweepFinder) analysis of the Netherlands population showing the composite likelihood ratio (CLR) statistic in sliding windows of 1000 bp. (D) Selective sweep (SweepFinder) analysis of the Swedish population showing the CLR statistic in sliding windows of 1000 bp. The black line indicates the 5% significance threshold calculated using the demographic model of Duchen et al. [5] for neutral simulations. The red line indicates the 5% significance threshold calculated using the demographic model of Werzner et al. [6] for neutral simulations and the gray dashed line indicates the 5% significance threshold using the model of Thornton and Andolfatto [35]. (E) Gene models for the 6-kb region analyzed. The gray highlighted region indicates the position of the 49-bp indel polymorphism in the MtnA 3’ UTR.

If positive selection has favored the derived MtnA allele (with the 49-bp 3' UTR deletion) in northern populations, then in this region of the genome one would expect there to be less variation among chromosomes containing the deletion than among those with the ancestral form of the allele. Indeed, this is what we observe in the Netherlands, where both alleles are segregating. Across the 6-kb region, there are 41 segregating sites within the Dutch population (Table 3). Among the 10 chromosomes with the deletion, there are 18 segregating sites, while between the two chromosomes lacking the deletion there are 23 segregating sites. This indicates that chromosomes with the deletion, which are in high frequency, shared a much more recent common ancestor. To test if this pattern differs from that expected under neutral evolution, we performed the Hudson's haplotype test (HHT) [36] using three different demographic models of the D. melanogaster out-of-Africa bottleneck for neutral simulations. Under the model of Werzner et al. [6], HHT was significant (P = 0.031). Under the models of Thornton and Andolfatto [35] and Duchen et al. [5], HHT was marginally significant (P = 0.076 and P = 0.094, respectively). These results suggest that neutral evolution and demography are unlikely to explain the observed patterns of DNA sequence variation.

To further test if the MtnA locus has experienced recent positive selection in northern Europe, we used the composite likelihood ratio (CLR) test to calculate the likelihood of a selective sweep at a given position in the genome, taking into account the recombination rate, the effective population size, and the selection coefficient of the selected mutation [37,38]. Within the Dutch population, the CLR statistic shows a peak in the region just adjacent to the MtnA 3' UTR deletion (Fig 7C). This peak was significant when the demographic models of Duchen et al. [5], Werzner et al. [6], and Thornton and Adolfatto [35] were used for neutral simulations, which provides compelling evidence for a recent selective sweep at the MtnA locus in the Netherlands population. A similar result was obtained for the Swedish population (Fig 7D), where the CLR statistic was above the 5% significance threshold determined from all three of the bottleneck models, suggesting that the selective sweep was not limited to a single population, but instead affected multiple European populations.

To test the possibility that the deletion in the MtnA 3’ UTR might have risen to high frequency as a result of hitchhiking with another linked polymorphism, we examined linkage disequilibrium (LD) across a 100 kb region flanking the MtnA locus in the Netherlands population (S1 Fig). The degree of linkage disequilibrium, r2 [39], was calculated between all pairs of SNPs present in the 100 kb region, excluding singletons. The SNP corresponding to the indel polymorphism (Fig 4a), position 53 of the linkage disequilibrium matrix, is not in significant LD with any of the 94 SNPs present along the 100 kb region analyzed (S1 Fig). These results indicate that the high frequency of the MtnA 3’ UTR deletion cannot be explained by linkage with another positively selected locus.

Association of the MtnA 3' UTR deletion with increased oxidative stress tolerance

MtnA expression has been linked to increased heavy metal tolerance [19,20,40] and metallothioneins in general have been associated with protection against oxidative stress [18,41]. To test if MtnA plays a role in oxidative stress and/or heavy metal tolerance, we used RNA interference (RNAi) to knockdown MtnA expression; these flies, along with their respective controls, were exposed to either hydrogen peroxide or copper sulfate. A knockdown in MtnA expression was significantly associated with increased mortality in the presence of hydrogen peroxide (P < 0.001; Fig 8A) and copper sulphate (P = 0.026; Fig 9A and 9B), although for the latter, this decrease was only significant in females.

Fig 8. Proportional mortality after oxidative stress.

(A) RNAi-mediated MtnA knockdown (hatched lines) and control flies (solid lines). P-values are shown for within population/background comparisons. (B) Dutch (blue) and Malaysian (orange) flies with the deletion (hatched lines) and without the deletion (solid lines) in the MtnA 3’ UTR. Error bars indicate the standard error of the mean. *P < 0.05, **P < 0.01, ***P < 0.005.

Fig 9. Proportional mortality after copper sulphate exposure.

(A,B) Copper tolerance in RNAi-mediated MtnA knockdown flies (white, RNAi-MtnA/Act5C-GAL4) and control flies expressing normal levels of MtnA (solid grey, control/Act5C-GAL4). (C) Male and (D) female flies from the Dutch (NL, blue) and the Malaysian (KL, orange) population with the deletion (hatched) and without the deletion (solid). P-values are shown for within population/background comparisons. Error bars indicate the standard error of the mean. *P < 0.05, **P < 0.01, ***P < 0.005.

To further test if the deletion in the MtnA 3’ UTR could be associated with an increase in oxidative stress and/or heavy metal tolerance, a subset of D. melanogaster lines from the Dutch and Malaysian populations, either with or without the deletion, were exposed to hydrogen peroxide and copper sulfate. The 3’ UTR deletion was associated with a significant increase in survival in the presence of hydrogen peroxide in both the Dutch (P = 0.001; Fig 8B) and Malaysian (P = 0.001; Fig 8B) populations. The 3’ UTR deletion had no significant effect on survival in the presence of copper sulfate in Dutch and Malaysian females (P = 0.976 and P = 0.732 respectively; Fig 9D) or males (P = 0.578 and P = 0.904 respectively; Fig 9C). Thus, the deletion in the MtnA 3’ UTR was associated with increased oxidative stress tolerance, but not increased heavy metal tolerance.


Differential expression of MtnA between a European and an African population of D. melanogaster was first detected in a brain-specific RNA-seq analysis [12]. In the present study, we confirm this inter-population expression difference by qRT-PCR and show that it is associated with an indel polymorphism in the MtnA 3’ UTR. We also perform reporter gene experiments to demonstrate that a large proportion of the expression difference can be attributed to this indel polymorphism. The ancestral state of the 3’ UTR contains a 49-bp sequence that is deleted in a derived allele that is present in worldwide populations. The deletion is nearly absent from sub-Saharan Africa, but present in frequencies >80% in northern Europe (Table 1). The deletion is present at intermediate frequency in Egypt (60%), Cyprus (65%) and Malaysia (45%). These findings suggest that positive selection has favored the 3' UTR deletion, at least within northern European populations. This interpretation is supported by population genetic analyses that indicate a recent selective sweep at the MtnA locus in populations from the Netherlands and Sweden (Fig 7). Furthermore, a clinal relationship between deletion frequency and latitude is also seen in North America and Australia, suggesting that there is a common selection gradient affecting all populations outside of sub-Saharan Africa.

Although chromosome arm 3R is known to harbor inversion polymorphisms that vary in frequency with latitude in cosmopolitan populations [42], we can rule out linkage to a segregating inversion as a cause for the clinal pattern seen for the MtnA 3’ UTR deletion. A previous analysis of the same Dutch population used in our study found that only one of the isofemale lines harbored an inversion on 3R, In(3R)P [43]. This was line NL13, which is one of the 10 lines with the MtnA 3’ UTR deletion (Fig 4A). Thus, there is no evidence for linkage between the inversion and the deletion. Moreover, the MtnA gene lies 7 Mb outside of the nearest breakpoint of In(3R)P.

Using hydrogen peroxide tolerance assays, we found evidence that knocking down MtnA expression decreases oxidative stress tolerance (Fig 8B). The association of the deletion in the MtnA 3’ UTR with increased survival in the presence of hydrogen peroxide (Fig 8A) suggests that the deletion has been selectively favored in some environments because it confers increased tolerance to oxidative stress. While cytotoxic reactive oxygen species (ROSs) are generated by natural metabolic processes, they can also be introduced via abiotic factors in the environment, such as radiation, UV light or exposure to toxins. The significant correlation between the frequency of the 3' UTR deletion and latitude, coupled with its association with increased oxidative stress tolerance suggests that environmentally induced oxidative stress may vary clinally, with greater stress in northern European environments. Regulation of the oxidative stress response usually occurs via upregulation of antioxidant protective enzymes in response to the binding of a cis-acting antioxidant-responsive element (ARE), which contains a characteristic sequence to which stress-activated transcription factors can bind [41]. A recent example of adaptation to oxidative stress in Drosophila is the insertion of the Bari-Jheh transposable element into the intergenic region of Juvenile Hormone Epoxy Hydrolase (Jheh) genes, which adds additional AREs that upregulate two downstream Jheh genes and was associated with increased oxidative stress tolerance [44]. Interestingly, the Bari-Jheh insertion also shows evidence for a partial selective sweep in non-African D. melanogaster [45], suggesting that oxidative stress may have imposed an important selective constraint on the colonization of Europe. However, the MtnA 3’ UTR deletion cannot mediate its associated increase in oxidative stress tolerance in a similar way, since it does not add any new AREs.

Due to their high inducibility in response to heavy metals, metallothioneins have traditionally been thought to play a role as detoxifiers specifically of heavy metals. However, this view has come into question recently, and metallothioneins are now thought to be a part of the general stress response and may function as scavengers of free radicals [41]. The association of the MtnA 3’UTR deletion with increased oxidative stress tolerance (Fig 8A) is in line with this more recent view of the role of metallothioneins, while the observed increased mortality after copper exposure in females in which MtnA expression has been knocked down (Fig 9D) is in keeping with the more traditional view. However, we found no association between the presence of the deletion and copper tolerance. This may be because the RNAi knockdown results in an MtnA expression level that is much lower than that of naturally occurring alleles, and copper tolerance is only affected when MtnA expression falls below a minimal threshold. The precise mechanisms of how metallothioneins interact with other metal processing systems after their initial binding and help remove excess of heavy metals, remain unclear [41].

At present, the mechanism by which the 3' UTR deletion affects MtnA gene expression is unknown. Although the deletion appears to have an effect on the usage of the MtnA-RB transcript isoform (S1 Table), this isoform is too rare (<2% of all MtnA transcripts) to account for the observed 4-fold difference in MtnA expression. Another possibility is that the deleted 3' UTR region contains one or more binding sites for a microRNA (miRNA). miRNAs are short, non-coding RNAs that modulate the expression of genes by inhibiting transcription or inducing mRNA degradation [46]. They are known to bind to a seed region that consists of 6–8 nucleotides in the 3’ UTR of their target mRNA. Post-transcriptional gene expression regulation by miRNAs can result in the fine-tuned regulation of a specific transcript or can cause the complete silencing of a gene in a particular tissue or developmental stage [4648]. To identify miRNAs that might bind specifically to the 49-bp sequence present in the ancestral form of the MtnA 3’ UTR, we used the UTR predictor [49]. The UTR predictor takes into account the three-dimensional structure of the miRNA and the 3’ UTR, as well as the energetic stability of the miRNA-3’ UTR base-pair binding. The score given by the UTR predictor is an energetic score, with the most negative scores indicating the most probable interactions. Our analysis of the MtnA 3' UTR identified five candidate miRNAs with scores below -6 that had predicted binding sites overlapping with the 49-bp indel region (Table 4). These candidates should serve as a good starting point for future functional tests of putative miRNA-3' UTR interactions.

Table 4. Top scoring miRNAs predicted to bind within the polymorphic 49-bp sequence in the MtnA 3' UTR.

Genetic variation provides the substrate upon which natural selection acts, resulting in an increase in the frequency of alleles that are beneficial in a given environment. Because changes in gene expression, especially those caused by variation in cis-regulatory elements, are predicted to have fewer pleiotropic effects than changes occurring within coding regions, it has been proposed that they are the most frequent targets of positive selection [5052]. In contrast to structural changes in protein sequences, changes in gene expression can be specific to a particular a tissue or developmental stage. Our results indicate that the observed variation in MtnA expression is not specific to the brain, as a similar expression pattern is also seen in the gut (Fig 4). This suggests that the 3' UTR deletion has a general effect on MtnA expression, which is present at high levels in almost all organs of D. melanogaster [24]. However, tissue-specific effects of the difference in MtnA expression cannot be ruled out. As shown in Fig 6, GFP expression driven by the MtnA promoter in the brain is limited to what seems only one cell type, which according to their morphological and anatomical characteristics, could correspond to glia. It has been reported that glia cells protect neurons and other brain cells from ROS damage caused by oxidative stress [53,54] and the fact that MtnA has been found to be expressed in the astrocyte glial cells in larva and adult flies [28], suggests that MtnA expression in glia could serve as neuronal protection against environmental factors, such as exposure to xenobiotics, that trigger an oxidative stress response [29,5557]. Our functional experiments showing an association between genetic variation in MtnA and oxidative stress tolerance are consistent with MtnA expression in glia providing protection against oxidative stress, which may be especially important in the brain, as neurons are highly susceptible to ROS damage.

Materials and Methods

Fly strains

This study used isofemale lines from 12 populations of D. melanogaster, including: Zimbabwe (Lake Kariba), Zambia (Lake Kariba), Rwanda (Gikongoro), Cameroon (Oku), Egypt (Cairo), Cyprus (Nicosia), Malaysia (Kuala Lumpur), France (Lyon), Germany (Munich), the Netherlands (Leiden), Denmark (Aarhus) and Sweden (Umeå). The lines from Zimbabwe and the Netherlands were the same as those used in previous expression studies [1012]. Flies from Germany were collected from different locations in the greater Munich area. Flies from Cyprus were collected from a single location near Nicosia. Flies from Denmark were kindly provided by Volker Loeschcke (Aarhus University). Flies from Sweden and Malaysia were kindly provided by Ricardo Wilches and Wolfgang Stephan (University of Munich). The remaining fly lines were collected as part of the Drosophila Population Genomics Project [8] and were kindly provided by John Pool and Charles Langley (University of California, Davis).

Flies expressing hairpin RNA targeted against MtnA mRNA under the control of the GAL4/UAS system (RNAi-MtnA; transformant ID: 105011) and the host line used in their creation (control; transformant ID: 60100) were obtained from the Vienna Drosophila Stock Center [58] Act5C/Cyo flies expressing GAL4 under the control of an Act5C driver were kindly provided by Ilona Grunwald Kadow. For tolerance assays, Act5C/Cyo females were crossed to RNAi-MtnA and control males and the progeny (RNAi-MtnA/Act5C-GAL4 and control/Act5C-GAL4) were used in tolerance assays. Using qRT-PCR as described below, MtnA expression was confirmed to be knocked down by 90.03% in males and 87.58% in females in RNAi-MtnA/Act5C-GAL4 flies in comparison to control/Act5C-GAL4. Flies were maintained on standard cornmeal-molasses medium at a constant temperature of 22° with a 14 hour light/10 hour dark cycle.

Quantitative reverse transcription PCR (qRT-PCR)

Validation of the MtnA expression results obtained from brain RNA-seq data [12] was performed by qRT-PCR using TaqMan probes (Applied Biosystems, Foster City, California, USA). For population-level comparisons, six brains were dissected from males and females of each of the 11 lines from Zimbabwe (ZK84, ZK95, ZK131, ZK145, ZK157, ZK186, ZK191, ZK229, ZK377, ZK384, ZK398) and five brains were dissected from males and females of each of the 12 lines from the Netherlands (NL01, NL02, NL11, NL12, NL13, NL14, NL15, NL16, NL17, NL18, NL19, NL20). The dissected brains of each population and sex were pooled following the RNA-seq strategy previously described [12]. The above procedure was repeated in two biological replicates for each sex and population. To compare the MtnA expression of individual lines within populations, subsets of eight lines were chosen from Zimbabwe (ZK84, ZK95, ZK131, ZK145, ZK157, ZK186, ZK377, ZK384) and the Netherlands (NL01, NL02, NL11, NL12, NL15, NL16, NL17, NL18). Thirty whole brains and digestive tracts (from foregut to hindgut) were dissected per line. Two biological replicates of each line (each consisting of 30 brains or guts) were processed. Tissue was dissected from flies 4–6 days old in 1X PBS (phosphate buffered saline). The tissue was stored in RNAlater (Life Technologies, Carlsbad, CA, USA) at -80° until RNA extraction. Total RNA extraction and DNase I digestion was performed using the MasterPure RNA Purification Kit (Epicentre, Madison, WI, USA). One microgram of total RNA was reverse transcribed using random primers and SuperScript II reverse transcriptase (Life Technologies) following the manufacturer’s instructions. TaqMan gene expression assays (Applied Biosystems) were used for MtnA (Dm02362764_s1) and RpL32 (Dm02151827_g1). qRT-PCR was performed using a Real-Time thermal cycler CFX96 (Bio-Rad, Hercules, CA, USA). Two biological replicates, each with two technical replicates, were processed for each sample. The ΔΔCt method was used to compute the normalized expression of MtnA using the ribosomal protein gene RpL32 as the reference [59].

CNV assays

The paralogous genes AttacinA (AttA) and AttacinB (AttB) were used as positive controls for CNV assays, because they share 97% nucleotide identity [21] and can be co-amplified with the same primer set. The sequences for AttA and AttB were downloaded from FlyBase [60] and aligned using the ClustalW2 algorithm implemented in SeaView (version 4) [61]. Primers were designed for the second coding exon, where the nucleotide identity of AttA and AttB is 100%. The primer sequences were as follows: forward (5’-GGTGCCTCTTTGACCAAAAC-3’) and reverse (5’-CCAGATTGTGTCTGCCATTG-3’). The ribosomal protein gene RpL32, which is not known to show CNV, was used as a negative control. The RpL32-specific primers were: forward (5’-GACAATCTCCTTGCGCTTCT-3’) and reverse (5’-AGCTGGAGGTCCTGCTCAT-3’). The primers specific for MtnA were: forward (5’-CACTTGACCATCCCATTTCC-3’) and reverse (5’-GGTCTGCGGCATTCTAGGT-3’). CNV was assessed among 12 lines from the Netherlands and 11 lines from Zimbabwe. Individual DNA extractions were performed separately for three flies of each line and copy number was assessed individually for each fly. Genomic DNA was extracted using the MasterPure DNA Purification Kit (Epicentre). The assessment of CNV from genomic DNA was done with iQ SYBR Green Supermix (Bio-Rad) following the manufacturer’s instructions. CNV assays were performed using a Real-Time thermal cycler CFX96 (Bio-Rad). The relative copy numbers of MtnA and AttA/AttB were obtained by the ΔCt method using RpL32 as the reference gene.

Sequencing of the MtnA locus

Approximately 6 kb of the MtnA genomic region, spanning from the second intron of CG12947 to the 3’ UTR of CG8500 (genome coordinates 3R: 5,606,733–5,612,630), were sequenced in 12 Dutch, 11 Zimbabwean and 12 Swedish lines (Fig 1). The following primer pairs were used (all 5’ to 3’): GATGGTGGAATACCCTTTGC and AAAGCGGGTTTACCAGTGTG; GTTGGCCTGGCTTAATAACG and ACTGGCACTGGAGCTGTTTC; GCTCTTGCTAGCCATTCTGG and AGAACCCGGCATATAAACGA; GATATGCCCACACCCATACC and GTAGAGGCGCTGCATCTTGT; CACTTGACCATCCCATTTCC and CAAGTCCCCAAAGTGGAGAA; CTTGATTTTGCTGCTGACCA and ATCGCCACGATTATGATTGC; CAGGACAATCAAGCGGAAGT and TTATGAAGCGCAGCACCAGT; GACCCACTCGAATCCGTATC and TGCTTCTTGGTGTCCAGTTG. PCR products were purified with ExoSAP-IT (Affymetrix, Santa Clara, CA, USA) and sequenced using BigDye chemistry on a 3730 automated sequencer (Applied Biosystems). Trace files were edited using Sequencher 4.9 (Gene Codes Corporation, Ann Arbor, MI, USA) and a multiple sequence alignment was generated using the ClustalW2 algorithm in SeaView (version 4) [61]. All sequences have been submitted to GenBank/EMBL under the accession numbers KT008059–KT008093.

MtnA indel polymorphism screening and latitude correlation study

For individual flies of the isofemale lines described above, the presence or absence of the MtnA 3' UTR deletion was assessed by performing a two-step PCR (35 cycles of 98° for 5 sec. and 60° for 10 sec.) using the following primers: forward (5’-GCCGCAGACCAATTGATTA-3’) and reverse (5’-TTCTTTCCAGGATGCAAATG-3’). The frequency of the deletion was estimated on an allelic basis, as heterozygous individuals were detected in some populations. Binomial 95% confidence intervals were calculated for the frequency of the deletion using the probit method implemented in R [62]. The strength and significance of the correlation between the frequency of the deletion and latitude was determined using linear regression.

To determine the frequency of the MtnA 3’ UTR deletion on other continents, raw pool-seq reads from North America [30] and Australia [31] were downloaded from the National Center for Biotechnology Information (NCBI) short read archive (SRA). The reads were mapped to either the ancestral or derived (with 49-bp deletion) version of the MtnA 3’ UTR using NextGenMap [63]. Only reads spanning the site of the indel were considered informative. The deletion frequency was estimated as the proportion of informative reads that matched the deletion allele. The 95% confidence interval was estimated using the probit method in R [55].

Cloning and transgenesis

To test whether the indel polymorphism found in the MtnA 3’ UTR can account for the difference in expression observed between the Dutch and the Zimbabwean populations, we constructed transgenic flies using the phiC31 transgenesis system [23]. Two expression vectors containing a green fluorescent protein (GFP) reporter gene were constructed. MtnA 3’ UTR sequences from the Netherlands (line NL20) and Zimbabwe (line ZK84), corresponding to chromosome arm 3R coordinates 5,607,448–5,611,691, were PCR-amplified with forward (5’-TTTCCTCGAACTTGTTCACTTG -3’) and reverse (5’- GCCCGATGTGACTAGCTCTT -3’) primers and cloned into the pCR2.1-TOPO vector (Invitrogen). The promoter region of MtnA (corresponding to genome coordinates 3R: 5,607,983–5,612,438), which is identical in the Dutch and the Zimbabwean populations, was also PCR amplified and cloned separately into the pCR2.1-TOPO vector using forward (5’-GCCGCAGACCAATTGATTA-3’) and reverse (5’-TTCTTTCCAGGATGCAAATG-3’) primers. To generate the GFP expression construct, the MtnA promoter was excised with EcoRI and ligated into the EcoRI site at the 5’ end of GFP in the plasmid pRSET/EmGPP (Invitrogen). Using AvaI and XbaI, the fragment containing the MtnA promoter and GFP was excised from the pRSET/EmGPP plasmid and ligated into the AvaI–XbaI sites proximal to the MtnA 3’ UTR in the pCR2.1-TOPO vector. The whole construct (promoter + GFP + 3’ UTR) was then excised with XbaI and KpnI and ligated into the XbaI–KpnI sites of the pattB integration vector [23]. For the lacZ constructs, the MtnA promoter was excised from the pCR2.1-TOPO vector with EcoRI and ligated into the EcoRI site 5’ of the lacZ coding sequence in the pCMV-SPORT-βgal plasmid (Life Technologies). PCR primers with overhangs containing restriction sites for XhoI and XbaI (forward 5’- GGTCCGACTCGAGGCGAAATACGGGCAGACATG -3’ and reverse 5’- GGTGCTCTAGAGCTCCATAGAAGACACCGGGAC -3’) were used to amplify the MtnA promoter/lacZ fragment and the product was ligated into the XhoI–XbaI sites just upstream of the MtnA 3’ UTR fragment in the pCR2.1-TOPO vector. Finally, the whole construct was excised using XbaI and KpnI and ligated into the XbaI–KpnI sites of the pattB vector (Fig 5). PhiC31 site-specific transgenesis was used to generate flies that differed only in the presence or the absence of the 49–bp sequence in the 3’ UTR of the reporter gene. The M{vas-int.Dm}ZH-2A, M{3xP3-RFP.attP}ZH-51D line was used for embryo microinjections. Microinjection and screening for transformants were carried out by Fly Facility (Clermont-Ferrand Cedex, France) and Rainbow Transgenic Flies (Camarillo, CA, USA). The successfully transformed flies were crossed to a yellow, white (yw) strain for two generations to eliminate the integrase.

Reporter gene assays

GFP assays.

The expression of the reporter gene GFP was measured in heterozygous flies generated by crossing transformant males to yw females. We tested for differences in the expression of GFP in bodies and heads separately. Differences in GFP expression between lines were tested by qRT-PCR. For this, total RNA was extracted from five bodies and ten heads of each transformant line using the RNA extraction and reverse transcription protocols described above. Thirteen biological replicates (six male and seven female) were processed for each line, each with two technical replicates. qRT-PCR was performed as described above using a custom Taqman probe for GFP (Applied Biosystems; forward primer: 5’-GAGCGCACCATCTTCTTCAAG-3’, reverse primer: 5’-TGTCGCCCTCGAACTTCAC-3’, FAM-labeled primer: 5’-ACGACGGCAACTACA-3’) and a probe for RpL32 (Dm02151827_g1), which was used as an endogenous control. The data analysis was also performed as described above for MtnA gene expression. A t-test was performed to assess significance.

β-galactosidase assays.

β-galactosidase activity was measured in groups of 30 heads or eight guts of homozygous transformant flies. Proteins were extracted from the tissues and the β-galactosidase activity assay was performed as described in [64] with the following exceptions. After dissection in cold PBS, tissues were frozen with liquid nitrogen and homogenized before the addition of 150 μL of the 0.1 M Tris-HCl, 1 mM EDTA, and 7 mM 2-mercaptoethanol buffer (pH 7.5). For each assay, two technical replicates of 60 μL of the supernatant containing the soluble proteins were combined with 50 μl of the 200 mM sodium phosphate (pH 7.3), 2 mM MgCl2, 100 mM 2-mecaptoethanol assay buffer. β-galactosidase activity was measured spectrophotometrically by following the change in absorbance at 420 nm at 37° Celsius. Four to five biological replicates were performed per tissue and per sex. Significance was assessed using a t-test.

Brain confocal imaging

Brain tissue was dissected in ice-cold 1X PBS and fixed with PLP (8% paraformaldehyde in NaOH and PBS with lysine (1)-HCl) for one hour at room temperature as described in [65]. After fixation, the tissue was washed twice for 15 minutes with PBS-0.5% Triton X and then incubated for one hour in blocking solution (20% donkey serum, 0.5% Triton X in PBS) at room temperature. The primary antibody, mouse anti-disclarge (Developmental Studies Hybridoma Bank, University of Iowa, USA) was used at a 1:200 dilution and incubated overnight at 4° Celsius in blocking solution. After washing twice with PBS-0.5% Triton X, the tissue was incubated with the secondary antibody, 1:200 anti-rat-CY3 (Dianova, Hamburg, Germany). The brains were mounted in Vectashield mounting medium (Vector Laboratories, Burlingame, CA, USA) and scanned using confocal microscopy with a Leica SP5-2. The images were analyzed using the StackGroom plugin in ImageJ [66].

Population genetic analysis and tests for selection

Summary statistics, including the number of segregating sites (S), number of haplotypes and Tajima’s D [34] were calculated using DnaSP v.5.10.1 [67]. The mean pairwise nucleotide diversity (π) [33], Watterson’s [32] estimate of nucleotide diversity (θ) and Fst [68] were calculated as described in [5]. Hudson’s haplotype test (HHT) was carried out using ms [69] to perform coalescent simulations and psubs [70] to calculate the probability of observing a subset of n sequences containing p or fewer polymorphic sites. The demographic models of Thornton and Andolfatto [35], Duchen et al. [5], and Werzner et al. [6] were used to simulate the out-of Africa bottleneck.

To test for a selective sweep, a SweepFinder analysis was performed using the SweeD software [38]. The background site frequency spectrum (SFS) was calculated for the entire 3R chromosome arm using 11 whole genome sequences from the Netherlands population and one whole genome sequence from the French (Lyon) population [8]. The French sequence was included in order to have a constant sample size of 12 sequences for the calculation of the SFS. This approach did not bias the background, as the French sequence did not differ more from the Netherlands sequences than the Netherlands sequences did from each other (S2 Table, S2a Fig). Furthermore, the inclusion of a French line did not lead to a skew in the background SFS (S2b Fig). For the Swedish population, the background SFS of chromosome arm 3R was determined from 12 whole genome sequences from the Umeå population (S3 Table). In order to increase the power of the test, the invariant sites in the alignment were also included [37]. To assess the significance of the composite likelihood ratio (CLR) statistic, neutral simulations were performed using ms [69]. In the neutral simulations three demographic models were taken into account [5,6,35]. These models differ in several parameters, including: the timing of the out-of-Africa bottleneck, the current effective population sizes of the European and African populations, and the ancient demographic history of the African population. For our analyses, it is the estimated time of the out-of-Africa bottleneck that has the largest impact on the results. Duchen et al. [5] infer this bottleneck to have occurred around 19,000 years ago, Thornton and Andolfatto [35] around 16,000 years ago, and Werzner et al. [6] around 13,000 years ago. However, the 95% confidence intervals of the estimates are very wide, ranging from 7,359–43,000 years ago. Thus, the three estimates are not incompatible with each other. The recombination rate of the MtnA genomic region was obtained from the D. melanogaster recombination rate calculator [71]. A total of 10,000 simulations were performed. For each simulation, the maximum value of the CLR statistic was extracted and used to determine the 5% significance threshold. Linkage disequilibrium was calculated between all pairs of SNPs present using Lewontin’s r2 = D2/p1q1p2q2, where D is the frequency of the haplotypes and p and q represent the allele frequencies [39]. A fragment of ~100 kb flanking the MtnA locus (3R: 9,732,746..9,835,406) was analyzed, with singletons excluded. A Fisher’s exact test was used to assess significance of the r2 values.

Copper and oxidative stress tolerance assays

Copper sulfate and hydrogen peroxide tolerance assays were performed using five D. melanogaster lines containing the MtnA 3’ UTR deletion (two Dutch and three Malaysian lines) and three lines without the deletion (two Malaysian and one Dutch line), as well as an MtnA knockdown line (RNAi-MtnA/Act5C-GAL4 and its control (control/Act5C-GAL4). Assays were performed at 25°C in tolerance chambers consisting of a plastic vial (diameter = 25 mm, height = 95 mm) with compressed cotton at the bottom containing 2.5 ml copper sulfate (Sigma Aldrich) or hydrogen peroxide (Sigma Aldrich) solution supplemented with 5% sucrose and sealed with a cork. Four to six day-old flies were separated by sex and tested in groups of 20. For each assay, one concentration of copper sulfate (50 mM) or two concentrations of hydrogen peroxide (5 or 10%) were tested with 5–7 replicates per sex and concentration. A control solution containing only sucrose was also tested with 3–5 (10–15 for Act5C-GAL4 background) replicates per sex for each assay. Mortality was recorded as the number of dead flies after 48 ± 1 hours. To determine the effect of the deletion, lines with and without the deletion were compared within each population or background. For copper sulfate assay analysis, t-tests were performed to assess significance. In order to account for potential differences in mortality inherent among the lines, proportional mortality data was scaled by mortality at 0 mM using the formula mortality/(1 + mean mortality at 0mM). For hydrogen peroxide assay analysis, the data for each assay and population was fit to a generalized linear model (GLM) using concentration, line, sex, and presence of the deletion as factors and a quasibinomial distribution using the glm function in R [62]. The tolerance results for each sex (S3 Fig) and the GLM coefficients (S4S12 Tables) are provided as supporting information.

Supporting Information

S1 Fig. Linkage disequilibrium matrix.

Linkage disequilibrium was assessed from a genomic region of 100 kb with MtnA at the center. The upper and right axes show the SNPs found in the 100 kb fragment, excluding singletons, and each cell represents r2 values. The left and bottom part of the matrix shows the results of a Fisher’s exact test for each pair of SNPs. Red boxes indicate significant P-values.


S2 Fig. Nucleotide diversity (π) and site frequency spectrum (SFS) of chromosome arm 3R.

(A) Nucleotide diversity (π) for 11 lines from the Netherlands (NL), eight lines from France (FR), all the Dutch and French lines combined (FR‐NL), and the French line FR14 combined with 11 lines from the Netherlands (FR14‐NL). (B) Dark blue bars indicate the SFS for the 11 Dutch lines for which complete genome sequences are available. Light blue bars indicate the SFS of 10 of these Netherlands lines plus one French line. In order to have a constant sample size of 12 for the SweepFinder analysis, one French line (FR14) was included with the NL lines to calculate the background site frequency spectrum.


S3 Fig. Oxidative stress tolerance by sex.

Proportional mortality of D. melanogaster males (A, C) and females (B, D) after exposure to hydrogen peroxide for 48 hours in (A,B) flies with (hatched lines) and without (solid lines) the deletion in the MtnA 3’ UTR and (C,D) RNAi-mediated MtnA knockdown (hatched lines) and control (solid lines) flies (C,D). (A,B) The Dutch (NL) population is shown in blue and the Malaysian (KL) population in orange. Legends are provided to the right of each row. P-values are shown for within population/background and sex comparisons. Error bars represent standard error of the mean.


S1 Table. Isoform-specific expression of MtnA in the brain.


S2 Table. Average pairwise differences per kb between French (FR) and Dutch (NL) lines.


S3 Table. Site frequency spectrum (SFS) of the Swedish population drawn from the whole 3R chromosome arm.


S4 Table. Oxidative stress tolerance glm coefficients for Malaysian population.


S5 Table. Oxidative stress tolerance glm coefficients for Dutch population.


S6 Table. Oxidative stress tolerance glm coefficients for MtnA knockdown and control lines.


S7 Table. Male oxidative stress tolerance glm coefficients for Malaysian population.


S8 Table. Male oxidative stress tolerance glm coefficients for Dutch population.


S9 Table. Male oxidative stress tolerance glm coefficients for MtnA knockdown and control lines.


S10 Table. Female oxidative stress tolerance glm coefficients for Malaysian population.


S11 Table. Female oxidative stress tolerance glm coefficients for Dutch population.


S12 Table. Female oxidative stress tolerance glm coefficients for MtnA knockdown and control lines.



We thank Hilde Lainer, Hedwig Gebhart, Angélica Cuevas and Annabella Königer for excellent assistance in the lab. We also thank Ilona Grunwald Kadow for sharing fly lines and primary/secondary antibodies, as well as for helpful comments on the confocal images. We further thank Ricardo Wilches and Wolfgang Stephan for sharing fly lines and sequences. Finally we thank Sebastian Hoehna and Kyle McCulloch for comments on the manuscript. This work is part of the dissertation of Ana Catalán.

Author Contributions

Conceived and designed the experiments: AC AGS JP. Performed the experiments: AC AGS EA. Analyzed the data: AC AGS PD JP. Wrote the paper: AC AGS JP.


  1. 1. David J, Capy P. Genetic variation of Drosophila melanogaster natural populations. Trends Genet. 1988;4: 106–111. pmid:3149056
  2. 2. Lachaise D, Cariou M-L, David JR, Lemeunier F, Tsacas L, Ashburner M. Historical biogeography of the Drosophila melanogaster species subgroup. Evol Biol. Plenum Press; 1988;22: 159–225. Available:
  3. 3. Stephan W, Li H. The recent demographic and adaptive history of Drosophila melanogaster. Heredity (Edinb). 2007;98: 65–8.
  4. 4. Laurent SJY, Werzner A, Excoffier L, Stephan W. Approximate Bayesian analysis of Drosophila melanogaster polymorphism data reveals a recent colonization of Southeast Asia. Mol Biol Evol. 2011;28: 2041–51. pmid:21300986
  5. 5. Duchen P, Zivkovic D, Hutter S, Stephan W, Laurent S. Demographic inference reveals African and European admixture in the North American Drosophila melanogaster population. Genetics. 2013;193: 291–301. pmid:23150605
  6. 6. Werzner A, Pavlidis P, Ometto L, Stephan W, Laurent S. Selective sweep in the Flotillin-2 region of European Drosophila melanogaster. PLoS One. Public Library of Science; 2013;8: e56629.
  7. 7. Saminadin-Peter SS, Kemkemer C, Pavlidis P, Parsch J. Selective sweep of a cis-regulatory sequence in a non-African population of Drosophila melanogaster. Mol Biol Evol. 2012;29: 1167–74. pmid:22101416
  8. 8. Pool JE, Corbett-Detig RB, Sugino RP, Stevens KA, Cardeno CM, Crepeau MW, et al. Population genomics of sub-saharan Drosophila melanogaster: African diversity and non-African admixture. Malik HS, editor. PLoS Genet. Public Library of Science; 2012;8: e1003080.
  9. 9. Glaser-Schmitt A, Catalán A, Parsch J. Adaptive divergence of a transcriptional enhancer between populations of Drosophila melanogaster. Philos Trans R Soc Lond B Biol Sci. 2013;368: 20130024. pmid:24218636
  10. 10. Hutter S, Saminadin-Peter SS, Stephan W, Parsch J. Gene expression variation in African and European populations of Drosophila melanogaster. Genome Biol. 2008;9: R12. pmid:18208589
  11. 11. Müller L, Hutter S, Stamboliyska R, Saminadin-Peter SS, Stephan W, Parsch J. Population transcriptomics of Drosophila melanogaster females. BMC Genomics. 2011;12: 81. pmid:21276238
  12. 12. Catalán A, Hutter S, Parsch J. Population and sex differences in Drosophila melanogaster brain gene expression. BMC Genomics. 2012;13: 654. pmid:23170910
  13. 13. Huylmans AK, Parsch J. Population- and sex-biased gene expression in the excretion organs of Drosophila melanogaster. G3 (Bethesda). 2014;4: 2307–15.
  14. 14. Egli D, Selvaraj A, Yepiskoposyan H, Zhang B, Hafen E, Georgiev O, et al. Knockout of “metal-responsive transcription factor” MTF-1 in Drosophila by homologous recombination reveals its central role in heavy metal homeostasis. EMBO J. 2003;22: 100–8. pmid:12505988
  15. 15. Pérez-Rafael S, Kurz A, Guirola M, Capdevila M, Palacios O, Atrian S. Is MtnE, the fifth Drosophila metallothionein, functionally distinct from the other members of this polymorphic protein family? Metallomics. The Royal Society of Chemistry; 2012;4: 342–9.
  16. 16. Guirola M, Naranjo Y, Capdevila M, Atrian S. Comparative genomics analysis of metallothioneins in twelve Drosophila species. J Inorg Biochem. 2011;105: 1050–1059. Available: pmid:21726767
  17. 17. Capdevila M, Bofill R, Palacios Ò, Atrian S. State-of-the-art of metallothioneins at the beginning of the 21st century. Coord Chem Rev. 2012;256: 46–62. Available:
  18. 18. Nath R, Kumar D, Li T, Singal PK. Metallothioneins, oxidative stress and the cardiovascular system. Toxicology. 2000;155: 17–26. Available: pmid:11154793
  19. 19. Maroni G, Wise J, Young JE, Otto E. Metallothionein gene duplications and metal tolerance in natural populations of Drosophila melanogaster. Genetics. 1987;117: 739–44. Available: pmid:2828157
  20. 20. Lange BW, Langley CH, Stephan W. Molecular evolution of Drosophila metallothionein genes. Genetics. 1990;126: 921–32. Available: pmid:1981765
  21. 21. Lazzaro BP, Clark AG. Evidence for recurrent paralogous gene conversion and exceptional allelic divergence in the Attacin genes of Drosophila melanogaster. Genetics. 2001;159: 659–671. Available: pmid:11606542
  22. 22. Groth CA, Fish M, Nusse R, Calos PM. Construction of transgenic Drosophila by using the site-specific integrase from phage C31. Genetics. 2004;166: 1775–1782. pmid:15126397
  23. 23. Bischof J, Maeda RK, Hediger M, Karch F, Basler K. An optimized transgenesis system for Drosophila using germ-line-specific phiC31 integrases. Proc Natl Acad Sci U S A. 2007;104: 3312–7. pmid:17360644
  24. 24. Chintapalli VR, Wang J, Dow JAT. Using FlyAtlas to identify better Drosophila melanogaster models of human disease. Nat Genet. 2007;39: 715–20. pmid:17534367
  25. 25. Hartenstein V, Spindler S, Pereanu W, Fung S. The development of the Drosophila larval brain. Adv Exp Med Biol. 2008;628: 1–31. pmid:18683635
  26. 26. Edwards TN, Meinertzhagen IA. The functional organisation of glia in the adult brain of Drosophila and other insects. Prog Neurobiol. 2010;90: 471–97. pmid:20109517
  27. 27. Hartenstein V. Morphological diversity and development of glia in Drosophila. Glia. 2011;59: 1237–52. pmid:21438012
  28. 28. Huang Y, Ng FS, Jackson FR. Comparison of larval and adult Drosophila astrocytes reveals stage-specific gene expression profiles. G3 (Bethesda). 2015;5: 551–8.
  29. 29. Egli D, Domènech J, Selvaraj A, Balamurugan K, Hua H, Capdevila M, et al. The four members of the Drosophila metallothionein family exhibit distinct yet overlapping roles in heavy metal homeostasis and detoxification. Genes Cells. 2006;11: 647–58. pmid:16716195
  30. 30. Bergland AO, Behrman EL, O’Brien KR, Schmidt PS, Petrov DA. Genomic evidence of rapid and stable adaptive oscillations over seasonal time scales in Drosophila. PLoS Genet. Public Library of Science; 2014;10: e1004775.
  31. 31. Reinhardt JA, Kolaczkowski B, Jones CD, Begun DJ, Kern AD. Parallel geographic variation in Drosophila melanogaster. Genetics. 2014;197: 361–373. pmid:24610860
  32. 32. Watterson GA. On the Number of segregating sites in genetical models without recombination. Theor Popul Biol. 1975;276: 256–276. Available:
  33. 33. Tajima F. Evolutionary relationship of DNA sequences in finite populations. Genetics. 1983;105: 437–60. Available: pmid:6628982
  34. 34. Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. Genetics Society of America; 1989;123: 585–95. Available:
  35. 35. Thornton K, Andolfatto P. Approximate Bayesian inference reveals evidence for a recent, severe bottleneck in a Netherlands population of Drosophila melanogaster. Genetics. 2006;172: 1607–19. pmid:16299396
  36. 36. Hudson RR. The how and why of generating gene genealogies. Takahata N, Clark AG, editors. MA: Sinauer Associates, Sunderland, MA.; 1993.
  37. 37. Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, Bustamante C. Genomic scans for selective sweeps using SNP data. Genome Res. 2005;15: 1566–75. pmid:16251466
  38. 38. Pavlidis P, Živković D, Stamatakis A, Alachiotis N. SweeD: likelihood-based detection of selective sweeps in thousands of genomes. Mol Biol Evol. 2013;30: 2224–34. pmid:23777627
  39. 39. Lewontin RC. The interaction of selection and linkage. I. General considerations; Heterotic models. Genetics. 1964;49: 49–67. Available: pmid:17248194
  40. 40. Maroni G, Otto E, Lastowski-Perry D. Molecular and cytogenetic characterization of a metallothionein gene of Drosophila. Genetics. 1986;112: 493–504. Available: pmid:3007277
  41. 41. Straalen van NM, Roelofs D. An introduction to ecological genomics [Internet]. 2nd Edition. OUP Oxford; 2012. Available:
  42. 42. Kapun M, Van Schalkwyk H, McAllister B, Flatt T, Schlötterer C. Inference of chromosomal inversion dynamics from Pool-Seq data in natural and laboratory populations of Drosophila melanogaster. Mol Ecol. 2014;23: 1813–1827. pmid:24372777
  43. 43. Hutter S, Li H, Beisswanger S, De Lorenzo D, Stephan W. Distinctly different sex ratios in African and European populations of Drosophila melanogaster inferred from chromosomewide single nucleotide polymorphism data. Genetics. 2007;177: 469–480. pmid:17660560
  44. 44. Guio L, Barrón MG, González J. The transposable element Bari-Jheh mediates oxidative stress response in Drosophila. Mol Ecol. 2014;23: 2020–30. pmid:24629106
  45. 45. González J, Macpherson JM, Petrov DA. A recent adaptive transposable element insertion near highly conserved developmental loci in Drosophila melanogaster. Mol Biol Evol. 2009;26: 1949–61. pmid:19458110
  46. 46. Chen K, Rajewsky N. The evolution of gene regulation by transcription factors and microRNAs. Nat Rev Genet. 2007;8: 93–103. pmid:17230196
  47. 47. Berezikov E. Evolution of microRNA diversity and regulation in animals. Nat Rev Genet. Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved.; 2011;12: 846–60.
  48. 48. Flynt AS, Lai EC. Biological principles of microRNA-mediated regulation: shared themes amid diversity. Nat Rev Genet. 2008;9: 831–42. pmid:18852696
  49. 49. Kertesz M, Iovino N, Unnerstall U, Gaul U, Segal E. The role of site accessibility in microRNA target recognition. Nat Genet. 2007;39: 1278–84. pmid:17893677
  50. 50. Carroll SB. Endless Forms. Cell. 2000;101: 577–580. pmid:10892643
  51. 51. Wray GA. The evolutionary significance of cis-regulatory mutations. Nat Rev Genet. Nature Publishing Group; 2007;8: 206–16.
  52. 52. Carroll SB. Evo-devo and an expanding evolutionary synthesis: a genetic theory of morphological evolution. Cell. 2008;134: 25–36. pmid:18614008
  53. 53. Gandhi S, Abramov AY. Mechanism of oxidative stress in neurodegeneration. Oxid Med Cell Longev. 2012;2012: 428010. pmid:22685618
  54. 54. Shih AY, Johnson DA, Wong G, Kraft AD, Jiang L, Erb H, et al. Coordinate regulation of glutathione biosynthesis and release by Nrf2-expressing glia potently protects neurons from oxidative stress. J Neurosci. 2003;23: 3394–3406. Available: pmid:12716947
  55. 55. Gruenewald C, Botella JA, Bayersdorfer F, Navarro JA, Schneuwly S. Hyperoxia-induced neurodegeneration as a tool to identify neuroprotective genes in Drosophila melanogaster. Free Radic Biol Med. 2009;46: 1668–1676. Available: pmid:19345730
  56. 56. Logan-Garbisch T, Bortolazzo A, Luu P, Ford A, Do D, Khodabakhshi P, et al. Developmental ethanol exposure leads to dysregulation of lipid metabolism and oxidative stress in Drosophila. G3 (Bethesda). 2014;5: 49–59.
  57. 57. Hosamani R. Acute exposure of Drosophila melanogaster to paraquat causes oxidative stress and mitochondrial dysfunction. Arch Insect Biochem Physiol. 2013;83: 25–40. pmid:23564607
  58. 58. Dietzl G, Chen D, Schnorrer F, Su K-C, Barinova Y, Fellner M, et al. A genome-wide transgenic RNAi library for conditional gene inactivation in Drosophila. Nature. 2007;448: 151–156. pmid:17625558
  59. 59. Pfaffl MW. A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001;29: e45. Available: pmid:11328886
  60. 60. Dos Santos G, Schroeder AJ, Goodman JL, Strelets VB, Crosby MA, Thurmond J, et al. FlyBase: introduction of the Drosophila melanogaster release 6 reference genome assembly and large-scale migration of genome annotations. Nucleic Acids Res. 2015;43: D690–7. pmid:25398896
  61. 61. Gouy M, Guindon S, Gascuel O. SeaView version 4: A multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol Biol Evol. 2010;27: 221–4. pmid:19854763
  62. 62. R Core Team. R: A language and environment for statistical computing. [Internet]. Viena Austria; 2015. Available:
  63. 63. Sedlazeck FJ, Rescheneder P, Von Haeseler A. NextGenMap: Fast and accurate read mapping in highly polymorphic genomes. Bioinformatics. 2013;29: 2790–2791. pmid:23975764
  64. 64. Hense W, Baines JF, Parsch J. X chromosome inactivation during Drosophila spermatogenesis. Noor MAF, editor. PLoS Biol. Public Library of Science; 2007;5: e273.
  65. 65. Cayirlioglu P, Kadow IG, Zhan X, Okamura K, Suh GSB, Gunning D, et al. Hybrid neurons in a microRNA mutant are putative evolutionary intermediates in insect CO2 sensory systems. Science. 2008;319: 1256–60. pmid:18309086
  66. 66. Schneider CA, Rasband WS, Eliceiri KW. NIH Image to ImageJ: 25 years of image analysis. Nat Methods. Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved.; 2012;9: 671–675.
  67. 67. Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25: 1451–2. pmid:19346325
  68. 68. Hudson RR, Slatkin M, Maddison WP. Estimation of levels of gene flow from DNA sequence data. Genetics. 1992;132: 583–589. Available: pmid:1427045
  69. 69. Hudson RR. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics. 2002;18: 337–338. pmid:11847089
  70. 70. Hudson RR, Bailey K, Skarecky D, Kwiatowski J, Ayala FJ. Evidence for positive selection in the superoxide dismutase (Sod) region of Drosophila melanogaster. Genetics. 1994;136: 1329–1340. Available: pmid:8013910
  71. 71. Fiston-Lavier A-S, Singh ND, Lipatov M, Petrov DA. Drosophila melanogaster recombination rate calculator. Gene. 2010;463: 18–20. pmid:20452408