Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Population genetic analysis of autophagy and phagocytosis genes in Drosophila melanogaster and D. simulans

  • Joo Hyun Im ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Writing – original draft, Writing – review & editing

    ji72@cornell.edu

    Affiliations Cornell Institute of Host-Microbe Interactions and Disease, Cornell University, Ithaca, NY, United States of America, Graduate Field of Genetics, Genomics, and Development, Cornell University, Ithaca, NY, United States of America, Department of Entomology, Cornell University, Ithaca, NY, United States of America

  • Brian P. Lazzaro

    Roles Conceptualization, Funding acquisition, Supervision, Writing – review & editing

    Affiliations Cornell Institute of Host-Microbe Interactions and Disease, Cornell University, Ithaca, NY, United States of America, Graduate Field of Genetics, Genomics, and Development, Cornell University, Ithaca, NY, United States of America, Department of Entomology, Cornell University, Ithaca, NY, United States of America

Abstract

Autophagy and phagocytosis are cellular immune mechanisms for internalization and elimination of intracellular and extracellular pathogens. Some pathogens have evolved the ability to inhibit or manipulate these processes, raising the prospect of adaptive reciprocal co-evolution by the host. We performed population genetic analyses on phagocytosis and autophagy genes in Drosophila melanogaster and D. simulans to test for molecular evolutionary signatures of immune adaptation. We found that phagocytosis and autophagy genes as a whole exhibited an elevated level of haplotype homozygosity in both species. In addition, we detected signatures of recent selection, notably in the Atg14 and Ykt6 genes in D. melanogaster and a pattern of elevated sequence divergence in the genderblind (gb) gene on the D. simulans lineage. These results suggest that the evolution of the host cellular immune system as a whole may be shaped by a dynamic conflict between Drosophila and its pathogens even without pervasive evidence of strong adaptive evolution at the individual gene level.

Introduction

Phagocytosis is a primary cellular immune process in Drosophila [1]. During phagocytosis, extracellular pathogens are recognized by opsonins and phagocytic receptors, engulfed at the host membrane, and then internalized and degraded in compartments called phagosomes [2] (Fig 1). Autophagy is an alternative cellular mechanism to remove intracellular pathogens [3]. During autophagy, intracellular bacteria and viruses are encapsulated by isolation membranes called phagosphores, which then are nucleated and expanded to form autophagosomes to destroy the pathogen [4, 5] (Fig 1). Both phagosomes and autophagosomes are eventually fused with a lysosome to degrade internalized pathogens [6]. While autophagy and phagocytosis were previously thought to be distinct pathways, many autophagy proteins participate in the later stages of phagocytosis [68]. When phagocytosis fails to eliminate pathogens due to modification or damage to the phagosome by bacteria, autophagy works as a back-up process to overcome infection [9, 10].

thumbnail
Fig 1. Stages of autophagy and phagocytosis pathways.

Genes in autophagy (red) and phagocytosis (blue) pathway function to recognize, internalize, and degrade cell debris and intracellular (purple) and extracellular pathogens (green). Organelles, such as phagosomes and autophagosomes, are form in the course of the process and are eventually fused with a lysosome full of hydrolytic enzymes (yellow) to degrade internalized pathogens.

https://doi.org/10.1371/journal.pone.0205024.g001

Pathogens evolve to escape, resist, or compromise the host immunity [11]. Bacteria are known to inhibit or evade phagocytosis by preventing host opsonins and phagocytic receptors from binding to bacterial molecules [12, 13] and blocking host signaling pathways via effector proteins and toxins [1416]. Similarly, bacteria and viruses also interfere with host autophagy by disrupting the signaling [17, 18] and blocking the production of reactive oxygen species (ROS) that sustains autophagy [19]. When a pathogen factor interferes with a critical host protein, the host protein can counter-evolve the bacterial hindrance via novel mutations. The change in the host may place renewed evolutionary pressure on the bacteria, and the process can repeat ad infinitum leading to a dynamic co-evolutionary conflict, or arms race [20, 21]. Host alleles that resist or overcome pathogen interference mechanisms may be adaptive, potentially resulting in signatures of positive selection as beneficial alleles favored by natural selection rise to high frequency within a population. These alleles would exhibit a reduced level of polymorphism around the selected sites, and would accumulate a proportional excess of rare variants as the population recovers from a selective sweep [22]. Recurrent adaptive fixations over long evolutionary timescales could result in an elevated rate of amino acid divergence between species [23].

In this study, we used molecular population genetic analyses to test for adaptation in autophagy and phagocytosis genes. We analyzed published sequence data from D. melanogaster and D. simulans collected in Eastern Africa [24, 25]. We found that genes in autophagy and phagocytosis pathways as a whole showed an elevated level of homozygosity in both species. The individual genes Atg14 and Ykt6 showed evidence of non-neutral evolution in D. melanogaster while gb showed signatures of non-neutral evolution in D. simulans.

Materials and methods

Samples used in the population genetic analysis

For D. melanogaster, the sequenced genomes of 197 haploid embryo Siavonga lines from the Drosophila Genome Nexus Project 3 were used [25]. These lines represent a single ancestral population of D. melanogaster from Zambia. For D. simulans, genome sequences from 20 isofemale lines, 10 collected in Madagascar and 10 collected in Kenya, were used [24]. In addition, a reference sequence of D. yakuba was used as an outgroup [26].

Curating genes of interest and control genes from the literature

A list of genes known to be involved in phagocytosis and autophagy was established by reviewing the primary literature (S1 and S2 Tables). In addition to the known phagocytosis and autophagy genes in Drosophila, Drosophila homologs of phagocytosis or autophagy genes characterized in other organisms were included. Each gene was assigned to a functional category (e.g. Autophagy induction) (S1 and S2 Tables; Fig 1). To test whether phagocytosis and autophagy pathways show different evolutionary patterns than canonical humoral pathways, known humoral immune signaling and recognition genes were used as a comparison. To control for effects of gene structure and chromosomal position, control genes were chosen to be similar to the focal immune genes in gene length (0.5-2x length of the coding region of the focal gene) and gene location (within 60kb ± of the start site of the focal gene). Three or four control genes that matched the criteria and had no annotated immune function were chosen for each focal gene. If a focal gene had fewer than three control genes that matched the criteria, it was removed from downstream analysis. Pairing focal and control genes controlled for potential effects of gene length and recombination rate. To ensure that focal and control genes indeed are matched for local recombination rate, the predicted recombination rates for all genes were examined using the Drosophila melanogaster Recombination Rate Calculator (RRC, version 2.3, [27]). This tool provides estimates of recombination rates based on the Marey map approach [27] and direct measurement [28]. Both RRC and Comeron estimates showed that the correlation between recombination rates of focal and control genes in D. melanogaster is strong, confirming that focal and control genes truly share a similar recombination environment (R2 = 0.994 based on RRC midpoint and R2 = 0.890 based on Comeron midpoint, S1 Fig). The presence of inversions can create strong haplotype structure and influence patterns of polymorphisms [29]. To minimize the effect of inversions, no focal gene or control gene used in the study was near the boundaries of known inversions. Populations collected in Africa sometimes contain genomic segments inferred to have recent cosmopolitan (non-African) ancestry [30]. To specifically analyze African genetic variation and eliminate the effect of relatedness among individual lines, genomic regions that were thought to have come from cosmopolitan (non-African) ancestry and that showed evidence of identity-by-descent in D. melanogaster were masked using Perl scripts obtained from http://www.johnpool.net/genomes.html.

Processing sequence data prior to population genetics analyses

Using custom scripts and bedtools [31], the coding sequence of each gene was extracted based on coordinates from the General Feature Format (GFF) file of the reference sequence for D. melanogaster (FlyBase release 5.25) and from the GFF file provided by the Thornton Lab GitHub (https://github.com/ThorntonLab) for D. simulans. When multiple isoforms were available for a gene, the longest sequence was chosen for downstream analysis. Custom filters were applied to exclude sequences with sites containing more than 10% missing data (noted as Ns) and then alignment was performed using PRANK [32]. Genes with poor sequencing or alignment quality, with large regions of gap, or with no annotation in any of the species were excluded from downstream analysis. To standardize the number of lines surveyed for each gene, 149 D. melanogaster lines and 14 D. simulans lines were randomly subsampled.

Surveying the level of polymorphisms and divergence

To evaluate the patterns of polymorphisms in each species, the following population genetic statistics were calculated: Watterson’s θ (θw), Tajima’s D (TajD), normalized Fay and Wu’s H (nFWH), Ewens-Watterson test statistic (EW), and the compound statistics that combined Tajima’s D, normalized Fay and Wu’s H with Ewens-Watterson test statistic (DHEW). θw indicates the level of DNA sequence variation, and a reduction in sequence diversity can be due to a recent selective sweep [33]. Tajima’s D test statistic compares the number of pairwise differences between individuals to the total number of segregating sites and detects the level of mutations of intermediate frequency relative to mutations that segregate at low frequencies [34]. The mutations that initially arise after variation is purged in a selective sweep will necessarily be at low frequency, skewing the site frequency spectrum and causing Tajima’s D to become negative [23]. Negative Tajima’s D values can also be caused by population expansion. Fay and Wu’s H test statistic detects excess of high frequency alleles in the derived state, which is a signature of selective sweeps that is less likely under population expansion [35]. The normalized version of the H test was used to increase statistical power. The EW test statistic measures haplotype homozygosity by comparing the observed homozygosity to the expected homozygosity [36, 37]. A P-value greater than 0.95 suggests that allele frequencies are more unevenly distributed than the neutral expectation, which suggests directional selection. The DHEW statistic is a composite statistic that provides more power to detect cases of positive selection [38, 39]. By combining Tajima’s D and normalized Fay and Wu’s H with Ewens-Wattersons expected homozygosity (EW), which is generally not sensitive to recombination, the DHEW overcomes the effect of recombination rates on the site frequency spectrum and makes the inference more conservative. The DHEW test first calculates required statistics separately and then combines the respective P-values into a vector to determine whether this vector deviates from the expectation under selective neutrality [38, 39]. A significant P-value points to positive selection. To calculate aforementioned statistics, the program DH was used [3840]. Each test statistic was compared to 1) null distributions created using coalescent simulations with no recombination to obtain a P-value [39], and 2) the test statistics obtained from genomic control genes.

To evaluate the patterns of amino acid divergence, the ratio of non-synonymous to synonymous polymorphic sites in each species (Pn/Ps) and the ratio of non-synonymous to synonymous differences between the species (Dn/Ds) were calculated and the McDonald-Kreitman test (MK test) was performed [41] using a custom script. If the observed value of Pn/Ps is much different from Dn/Ds at a locus as determined by Fishers exact test, that locus is rapidly diverging between the two species at the level of amino acid sequence, which is consistent with adaptive evolution. In addition, the Direction of Selection (DoS) coefficient was calculated [42]. DoS >0 indicates the action of positive selection and DoS <0 indicates the action of purifying selection. To correct for multiple testing, the p.adjust function in R based on the Benjamini and Hochberg method was implemented [43].

Processing statistics and testing for significance

To assess whether phagocytosis and autophagy pathways as a whole show a departure from their control genes for various test statistics, the difference between the value of a given statistic in the focal gene and the median value of the statistic for the matched control genes, hereafter called a ‘comparison score’, was calculated for each focal-control pairing. We then evaluated whether the mean of all comparison scores was significantly different from 0 using both a t-test (parametric) and the Wilcox rank sum test (non-parametric) (Table 1). If the 95% confidence interval contains 0, the test provides no evidence for any statistically significant difference between the two groups as a whole. To identify individual genes that bear patterns of non-neutral sequence evolution in each species, the P-values of, D, H, EW, and the compound test statistic DHEW were used. To ensure that the pattern seen on focal genes is indeed due to a local selection, the comparison score was calculated for Tajima’s D, Fay and Wu’s H, and DoS for each focal-control pair and the scores were ranked from largest to smallest (listed as ‘rank’ column in Tables 2 and 3). A total of 68 and 65 comparison values make up the distributions for D. melanogaster and D. simulans, respectively. Randomly assigning a control gene to be "focal" and re-calculating the ranks of comparison scores for these statistics created a distribution of ranks. Then comparison scores from true coupling of focal and control genes were compared to this permutated distribution (listed as ‘rank against null’ column in Tables 2 and 3). A total of 245 and 232 values were used to build null distributions for D. melanogaster and D. simulans, respectively. The more extreme this rank is, the stronger the confidence is that the focal gene differs from the control genes for a given statistic. We set a threshold for concluding evidence that a gene had experienced a selective sweep as requiring 1) a significant P-value for DHEW, 2) significant corrected P-values for two of any of the test statistics, and 3) comparison scores for D, H, or DoS ranking in the top 10% for the ‘rank’ test and the top 5% for the ‘rank against null’ test. Individual genes that have potentially undergone adaptive divergence on each lineage were identified as having a statistically significant (P <0.05) MK test result and a Direction of Selection (DoS) coefficient >0.

thumbnail
Table 1. Evaluation of polymorphism and divergence at the pathway level.

https://doi.org/10.1371/journal.pone.0205024.t001

thumbnail
Table 2. Genes that show significant and nominal evidence of recent selection in D. melanogaster and in D. simulans.

https://doi.org/10.1371/journal.pone.0205024.t002

thumbnail
Table 3. Genes that show significant and nominal evidence of amino acid divergence on the D. melanogaster lineage and on the D. simulans lineage.

https://doi.org/10.1371/journal.pone.0205024.t003

Data

The sequences and the scripts to process them are available in https://github.com/imjoohyu.

Results

Survey of autophagy and phagocytosis genes in D. melanogaster

Autophagy and phagocytosis genes as a whole.

To examine whether autophagy and phagocytosis genes exhibit any signature of recent positive selection, we calculated the following summary statistics for each focal gene and its corresponding control genes: Watterson’s estimator (θw), Tajima’s D (TajD), normalized Fay and Wu’s H (nFWH), Ewens-Watterson’s homozygosity (EW), and the compound test statistic DHEW. For analyses of longer-term molecular evolution, we inferred the ancestral state of each substitution using D. yakuba and D. simulans as outgroups and assuming strict parsimony with no reverse or convergent mutation. We then compared polymorphism and divergence at synonymous and non-synonymous sites using the MK test [41] and calculated the Direction of Selection (DoS) on each gene. To determine whether the combined group of all autophagy and phagocytosis genes shows evidence of recent selection or recurrent adaptive evolution, we explored whether the population genetic statistics of these genes are statistically significantly different from their respective control genes by calculating the mean of comparison scores for each focal-control gene pairing for each statistic (Table 1). The only observed significant difference was in the EW statistic between autophagy and phagocytosis genes and their control genes in D. melanogaster (t-test p = 0.022, Wilcox test p = 0.055), pointing to the enrichment of low-frequency derived alleles with an increase in haplotype homozygosity in the focal genes. Randomly assigning control genes to serve as “focal” genes and repeating the analysis removed this significant difference, indicating that the pattern of reduced polymorphism is unique to the autophagy and phagocytosis genes. Removing 10% of the genes with the lowest individual P-values and repeating the analysis still preserved the pattern. Thus, this statistically significant difference between EW values of focal and control genes is a cumulative effect over the full set of autophagy and phagocytosis genes and is not driven by a few genes that are strongly divergent from the null expectation. Contrary to what was observed in autophagy and phagocytosis genes, a set of humoral signaling genes and a set of recognition genes did not differ significantly from their control genes in any of the statistics.

Individual autophagy and phagocytosis genes.

We tested for signatures of recent selection in individual genes and only Atg14 and Ykt6 in D. melanogaster had significant test statistics after false discovery correction and extreme comparison scores (Table 2). Atg14 (D = -1.926, p = 0.048, EW = 0.06, p = 0.013) encodes an endoplasmic reticulum (ER) protein that helps autophagosomes nucleate [44]. The Atg14 comparison score for D was the largest and for H was in the top 5% among 71 gene pairs surveyed (Table 2). These observations can be attributed to natural selection because effects of chromosomal position or demographic history would also be expected to impact the control genes. Ykt6, a SNARE protein involved in internalization of particles during phagocytosis and expansion of the autophagosome membrane [45, 46], showed significant P-values for D and EW (D = -2.151, p = 0.027, EW = 0.552, p = 0.001). The D comparison score for Ykt6 was the 5th largest in the D. melanogaster data set, illustrating the enrichment of low-frequency variants specifically at this locus (Table 2). In addition, CD36 scavenger receptors emp and pes [4749], autophagy gene Atg16, and phagocytosis gene, Rac2, showed significant EW and DHEW statistics but did not meet the multiple testing correction threshold (Table 1). Next, we looked for evidence of recurrent adaptive evolution in individual autophagy and phagocytosis genes. While Ird1 showed significant MK test results even after multiple testing correction (<0.05), the DoS value for each of these genes was negative and was not drastically different from its control genes according to the DoS comparison score. This result indicates that slightly deleterious mutations are segregating at these loci (S1 Table) and provides no support for recurrent adaptive amino acid substitution.

Survey of autophagy and phagocytosis genes in D. simulans

Autophagy and phagocytosis genes as a whole.

We tested whether phagocytosis and autophagy pathways as a whole show distinct patterns of sequence evolution compared to control genes in D. simulans. As was the case in D. melanogaster, the significant difference between D. simulans phagocytosis and autophagy genes and their control genes was seen in the EW statistic (t-test p = 0.003, Wilcox test p = 0.005, Table 1). Randomly re-assigning a control gene to be “focal” eliminated this significant difference, suggesting that the observed pattern is attributable to phagocytosis and autophagy genes. Again, consistent with the patterns in D. melanogaster, the statistically significant difference between EW values of focal and control genes is not due to a few divergent genes, but rather is a pattern seen across all genes. While humoral immune signaling genes in D. simulans did not differ significantly from their control genes in any of the statistics, the DoS statistic for recognition genes showed deviation from that of respective control genes according to the t-test (p = 0.005; Table 1). This trend was largely driven by the fact that only three gene pairs existed in this category and that the DoS values of four out of ten corresponding control genes are highly positive (>0.2).

Individual autophagy and phagocytosis genes.

We tested for signatures of recent selection in individual genes in D. simulans. While four genes, Rab1, Atg8b, Vamp7 and Rac1 (the first three are involved in both phagocytosis and autophagy pathways, whereas the last is implicated only in phagocytosis) showed nominally significant D, EW, and DHEW statistics, none remained significant after the multiple testing correction. When we compared polymorphism and divergence at synonymous and non-synonymous sites using the MK test to test for recurrent adaptive amino acid substitution along the D. simulans lineage, we found that only gb had significant MK test results after the multiple testing correction and a positive DoS value (MK corrected FET p = 0.046, DoS = 0.396, Table 3) The DoS comparison score for gb also ranked the highest when compared to other gene pairs (Table 3). gb encodes a glutamate transporter that regulates the extracellular glutamate levels in the nervous system [50] and controls internal ROS and to promote phagosome maturation [51]. The other glutamate transporter polyph [50, 51], as well as phagocytic receptors pes and scb [49, 52], and Rbsn-5, which facilitates phagosome maturation [2], also showed nominally significant MK results that did not meet the multiple testing correction threshold (Table 3). We identified no autophagy genes evolving with an elevated rate of amino acid substitution along the D. simulans lineage.

Discussion

Dynamic conflict between hosts and pathogens may result in co-evolutionary adaptation in host genes, resulting in signatures of positive selection. Previous work to understand the molecular evolutionary patterns of immune genes in Drosophila has enriched our understanding of how the innate immune system has evolved. However, most population genetic studies on innate immunity have so far focused on the humoral immune response genes and phagocytic receptor genes, so the evolution of most of cellular immunity remains to be understood. In this study, we examined molecular evolutionary patterns of autophagy and phagocytosis genes in D. melanogaster and D. simulans. We found that phagocytosis and autophagy pathways as a whole showed an elevated level of haplotype homozygosity in both species, suggesting that genes in these pathways demonstrate small indications of adaptation that collectively result in a statistically measurable deviation from neutrality. The EW test statistic is more powerful for detecting a very recent selective sweep compares to other tests [53]. The aggregate significance of the EW test would therefore seem to indicate that many autophagy and phagocytosis genes have been targets of recent sweeps, but that the pattern has not persisted over enough evolutionary time to leave signatures detectable by other test statistics. It is unclear what biological scenario would trigger a widespread set of recent sweeps, although ecological shift or invasion of a novel pathogen is possible. We identified several individual genes that exhibit indications of positive selection, although only a subset of them were statistically significant after controlling for multiple testing.

Positive selection on autophagy genes

We looked for signatures of natural selection in autophagy genes, which have both defensive and housekeeping roles. Besides removal of pathogens, autophagy degrades damaged host proteins and organelles to recycle nutrients during stressful conditions, such as starvation [5]. The core autophagy machinery is conserved from yeast to higher eukaryotes [4]. The role of autophagy in maintaining organismal homeostasis could act as a constraint on adaptation in response to pathogen pressure, which might explain why we did not see pervasive signatures of positive selection in autophagy genes of either D. melanogaster or D. simulans. However, we did identify signatures of recent positive selection in Atg14 and Ykt6, which remained significant after multiple testing correction in D. melanogaster. Atg14 is conserved in both Drosophila and mammals and is involved in the nucleation of phagosphore membrane [3]. Upon infecting mouse and human cell lines, the intracellular bacterium Brucella abortus forms a Brucella-containing vacuole using the host proteins Atg14 and Beclin-1, the mammalian homolog of Atg6, in order to be trafficked to the ER where the bacterium proliferates [54]. Similar interactions with pathogens may have led to Atg14 adapting to play a more specific role in immune defense and to our observed evolutionary pattern in D. melanogaster. While the test statistics of Atg16 did not meet the multiple testing correction threshold, its nominally significant test statistics indicate that it may have experienced a strong selective pressure because it is responsible for removing both intra- and extracellular pathogens [54].

Positive selection on glutamate transporters

We also identified a glutamate transporter gene, gb, to have elevated amino acid sequence divergence on the D. simulans lineage (Table 3). gb encodes a transporter that controls the extracellular glutamate levels in the nervous system [51]. When gb is mutated, glutamate level in the hemolymph is reduced, synthesis of glutathione (a major antioxidant) is disrupted, and the intracellular ROS is increased [55], leading to a failure in producing mature phagosomes and in phagocytosis of Staphylococcus aureus in Drosophila [51]. Another putative glutamate transporter gene polyph that also plays a role in regulating glutamate level showed a significant MK result prior to multiple testing correction [51]. Although it is unknown whether these proteins physically interact with each other, both genes are expressed in Drosophila blood cells and share a function [51]. Due to this shared function and evolutionary pattern, it is tempting to speculate that positive selection may be acting on these proteins together. A host protein that evolves to escape pathogen interference may also evolve away from its native function within the host. Thus, compensatory mutation in interacting proteins that restore full function could become adaptive [56].

Positive selection on recognition genes in phagocytosis

Opsonins and phagocytic receptors have been hypothesized to evolve under host-pathogen co-evolution because they directly bind to molecules from pathogens in order to promote phagocytosis [5761]. Previous research reported that emp, pes, and scb, which encode phagocytic receptors that bind to pathogens bore evidence of recent and adaptive evolution in Drosophila [58, 59, 62]. TepII, TepIV (opsonin), Sr-CI (scavenger receptor) and nimrod genes (NimC1, NimB4) also show an excess of non-synonymous fixations between D. melanogaster and D. simulans in [5759]. While emp and pes in D. melanogaster and pes and scb in D. simulans in this study showed some indication of selection, the test statistics were not significant after multiple testing correction. These distinctions between previous studies and the present one may be due to differences in methods since our method employed a composite method in addition to typical test statistics used in the field to determine significance.

No parallel adaptation between D. melanogaster and D. simulans

One objective of this study was to evaluate whether the same genes in D. melanogaster and D. simulans show similar patterns of non-neutral evolution as a test for parallel adaptation. However, we did not find evidence for parallel adaptation at the gene level. While D. melanogaster and D. simulans are closely related species, the locations in which they were collected, their ecology, and their demographic histories differ [63]. While D. melanogaster and D. simulans are known to share some viruses, such as the Galbut Virus [64], there is no information about whether they share any bacterial pathogens in the wild and it is reasonable to suppose they may generally be exposed to different pathogens because of difference in micro- or macro-ecology. If these distinct pathogens interact differently with the two host species, then parallel adaptation may be unlikely. It is important to note that detection of parallel evolution relies on the detection of statistically significant patterns in both species. Thus, they may be under-detected if the analysis has low statistical power. In our particular study, the number of lines sampled from each species differed, so the power to detect selection was not equivalent across the species. Nevertheless, our finding is consistent with previous reports suggesting that parallel adaption is not common. For example, a previous study noted that while selective sweeps affect antiviral pathways in many insect species, the affected genes varied considerably across species [65]. Another study reported that a humoral signaling gene, Relish, known to have undergone adaptive divergence in D. simulans, did not exhibit as strong of evidence of natural selection in three other sister species [66].

Soft sweeps and balancing selection

Our study mainly focused on detecting strong signatures of natural selection, as would be expected from hard selective sweeps favoring novel mutations [67], and on recurrent adaptive amino acid substitutions in a gene [41, 68]. These are the signatures expected from classic arms race model, which was our core biological hypothesis. We have less power to detect other forms of adaptation that might act on immune system genes, including soft sweeps of adaptation from standing genetic variation and balancing selection. Soft sweeps refer to a mode of adaptation where multiple distinguishable adaptive alleles are present in the population at the same time [69]. Soft sweeps may be common in D. melanogaster [70, 71] but are not easily detected by frequency-based statistics such as Tajima’s D or Fay and Wu’s H because genetic diversity is not as severely reduced when the selected site is on multiple haplotype backgrounds or is not driven to complete fixation [72]. A previous study that looked for genes evolving via soft sweeps in D. melanogaster [71] showed evidence for soft sweeps in chromosomal regions that include Vamp7 and Sr-CII, which did not show statistically significant results in our study, although the precise targets of these sweeps remain unidentified.

In the context of host-pathogen interactions, balancing selection can theoretically be generated if polymorphisms arise at co-evolving loci of both hosts and pathogens and two or more alleles are maintained at static intermediate or oscillating frequencies, or if alleles are costly in the absence of infection so cannot be driven to fixation [11, 7375]. The genomic signatures of balancing selection can be detected in organisms whose breeding structures or population sizes result in linkage disequilibrium that extends over long physical stretches of chromosomes, such as Arabidopsis [73, 76] and humans [77, 78]. However, it is much more difficult to detect balancing selection in organisms like D. melanogaster that have large population sizes and high rates of recombination [79, 80]. Therefore, we cannot rule out the possibility that some components of autophagy and phagocytosis systems in Drosophila may have evolved under undetected balancing selection.

Supporting information

S1 Fig. Correlation between recombination rates of focal genes and recombination rates of corresponding control genes (L).

Recombination rates of focal genes (x-axis) were plotted against mean recombination rates of respective control genes (y-axis) based on the study of [27]. (R) Recombination rates of focal genes (x-axis) were plotted against mean recombination rates of respective control genes (y-axis) based on the study of [28].

https://doi.org/10.1371/journal.pone.0205024.s001

(TIF)

S1 Table. List of all genes in D. melanogaster surveyed for population genetic analysis and the results of the analysis.

https://doi.org/10.1371/journal.pone.0205024.s002

(XLSX)

S2 Table. List of all genes in D. simulans surveyed for population genetic analysis and the results of the analysis.

https://doi.org/10.1371/journal.pone.0205024.s003

(XLSX)

Acknowledgments

We would like to thank Rob Unckless, Vanika Gupta, Ashley Frank, and Radhika Ravikumar for helpful discussion and comments on the manuscript. We also would like to thank Lynn Johnson and Françoise Vermeylen at Cornell Statistical Consulting Unit for the help with statistical analysis. This work was supported by NIH grants R01 AI064950 and R56 AI083932.

References

  1. 1. Lemaitre B, Hoffmann J. The host defense of Drosophila melanogaster. Annual review of immunology. 2007;25:697–743. pmid:17201680.
  2. 2. Flannagan RS, Cosio G, Grinstein S. Antimicrobial mechanisms of phagocytes and bacterial evasion strategies. Nature reviews Microbiology. 2009 May;7(5):355–66. pmid:19369951.
  3. 3. Moy RH, Cherry S. Antimicrobial autophagy: a conserved innate immune response in Drosophila. Journal of innate immunity. 2013;5(5):444–55. pmid:23689401. Pubmed Central PMCID: 3799998.
  4. 4. Zirin J, Perrimon N. Drosophila as a model system to study autophagy. Seminars in immunopathology. 2010 Dec;32(4):363–72. pmid:20798940. Pubmed Central PMCID: 3562086.
  5. 5. Huang J, Brumell JH. Bacteria-autophagy interplay: a battle for survival. Nature reviews Microbiology. 2014 Feb;12(2):101–14. pmid:24384599.
  6. 6. Cadwell K, Philips JA. Autophagy meets phagocytosis. Immunity. 2013 Sep 19;39(3):425–7. pmid:24054324.
  7. 7. Sanjuan MA, Dillon CP, Tait SW, Moshiach S, Dorsey F, Connell S, et al. Toll-like receptor signalling in macrophages links the autophagy pathway to phagocytosis. Nature. 2007 Dec 20;450(7173):1253–7. pmid:18097414.
  8. 8. Shui W, Sheu L, Liu J, Smart B, Petzold CJ, Hsieh TY, et al. Membrane proteomics of phagosomes suggests a connection to autophagy. Proceedings of the National Academy of Sciences of the United States of America. 2008 Nov 4;105(44):16952–7. pmid:18971338. Pubmed Central PMCID: 2579359.
  9. 9. Gutierrez MG, Master SS, Singh SB, Taylor GA, Colombo MI, Deretic V. Autophagy is a defense mechanism inhibiting BCG and Mycobacterium tuberculosis survival in infected macrophages. Cell. 2004 Dec 17;119(6):753–66. pmid:15607973.
  10. 10. Sanjuan MA, Green DR. Eating for good health: linking autophagy and phagocytosis in host defense. Autophagy. 2008 Jul;4(5):607–11. PubMed pmid:18552553. Pubmed Central PMCID: 2612584.
  11. 11. Woolhouse ME, Webster JP, Domingo E, Charlesworth B, Levin BR. Biological and biomedical implications of the co-evolution of pathogens and their hosts. Nature genetics. 2002 Dec;32(4):569–77. pmid:12457190.
  12. 12. Fischetti VA. Streptococcal M protein: molecular design and biological behavior. Clinical microbiology reviews. 1989 Jul;2(3):285–314. PubMed pmid:2670192. Pubmed Central PMCID: 358122.
  13. 13. China B, Sory MP, N'Guyen BT, De Bruyere M, Cornelis GR. Role of the YadA protein in prevention of opsonization of Yersinia enterocolitica by C3b molecules. Infection and immunity. 1993 Aug;61(8):3129–36. PubMed pmid:8335343. Pubmed Central PMCID: 280979.
  14. 14. Beauregard KE, Lee KD, Collier RJ, Swanson JA. pH-dependent perforation of macrophage phagosomes by listeriolysin O from Listeria monocytogenes. The Journal of experimental medicine. 1997 Oct 6;186(7):1159–63. PubMed pmid:9314564. Pubmed Central PMCID: 2199064.
  15. 15. Ramarao N, Gray-Owen SD, Backert S, Meyer TF. Helicobacter pylori inhibits phagocytosis by professional phagocytes involving type IV secretion components. Molecular microbiology. 2000 Sep;37(6):1389–404. PubMed pmid:10998171.
  16. 16. Pethe K, Swenson DL, Alonso S, Anderson J, Wang C, Russell DG. Isolation of Mycobacterium tuberculosis mutants defective in the arrest of phagosome maturation. Proceedings of the National Academy of Sciences of the United States of America. 2004 Sep 14;101(37):13642–7. pmid:15340136. Pubmed Central PMCID: 518761.
  17. 17. Choy A, Dancourt J, Mugo B, O'Connor TJ, Isberg RR, Melia TJ, et al. The Legionella effector RavZ inhibits host autophagy through irreversible Atg8 deconjugation. Science. 2012 Nov 23;338(6110):1072–6. pmid:23112293. Pubmed Central PMCID: 3682818.
  18. 18. Chaumorcel M, Lussignol M, Mouna L, Cavignac Y, Fahie K, Cotte-Laffitte J, et al. The human cytomegalovirus protein TRS1 inhibits autophagy via its interaction with Beclin 1. Journal of virology. 2012 Mar;86(5):2571–84. pmid:22205736. Pubmed Central PMCID: 3302257.
  19. 19. Shin DM, Jeon BY, Lee HM, Jin HS, Yuk JM, Song CH, et al. Mycobacterium tuberculosis eis regulates autophagy, inflammation, and cell death through redox-dependent signaling. PLoS pathogens. 2010;6(12):e1001230. pmid:21187903. Pubmed Central PMCID: 3002989.
  20. 20. Van Valen L. A new evolutionary law. Evolutionary theory. 1973;1:1–30.
  21. 21. Dawkins R, Krebs JR. Arms races between and within species. Proceedings of the Royal Society of London Series B, Biological sciences. 1979 Sep 21;205(1161):489–511. pmid:42057
  22. 22. Casillas S, Barbadilla A. Molecular Population Genetics. Genetics. 2017 Mar;205(3):1003–35. pmid:28270526. Pubmed Central PMCID: 5340319.
  23. 23. Vitti JJ, Grossman SR, Sabeti PC. Detecting natural selection in genomic data. Annual review of genetics. 2013;47:97–120. pmid:24274750.
  24. 24. Rogers RL, Cridland JM, Shao L, Hu TT, Andolfatto P, Thornton KR. Landscape of standing variation for tandem duplications in Drosophila yakuba and Drosophila simulans. Molecular biology and evolution. 2014 Jul;31(7):1750–66. pmid:24710518. Pubmed Central PMCID: 4069613.
  25. 25. Lack JB, Cardeno CM, Crepeau MW, Taylor W, Corbett-Detig RB, Stevens KA, et al. The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population. Genetics. 2015 Jan 27. pmid:25631317.
  26. 26. Drosophila 12 Genomes C, Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, et al. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007 Nov 8;450(7167):203–18. pmid:17994087.
  27. 27. Fiston-Lavier AS, Singh ND, Lipatov M, Petrov DA. Drosophila melanogaster recombination rate calculator. Gene. 2010 Sep 1;463(1–2):18–20. pmid:20452408.
  28. 28. Comeron JM, Ratnappan R, Bailin S. The many landscapes of recombination in Drosophila melanogaster. PLoS genetics. 2012;8(10):e1002905. pmid:23071443. Pubmed Central PMCID: 3469467.
  29. 29. Corbett-Detig RB, Hartl DL. Population genomics of inversion polymorphisms in Drosophila melanogaster. PLoS genetics. 2012;8(12):e1003056. pmid:23284285. Pubmed Central PMCID: 3527211.
  30. 30. Pool JE, Corbett-Detig RB, Sugino RP, Stevens KA, Cardeno CM, Crepeau MW, et al. Population Genomics of sub-saharan Drosophila melanogaster: African diversity and non-African admixture. PLoS genetics. 2012;8(12):e1003080. pmid:23284287. Pubmed Central PMCID: 3527209.
  31. 31. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010 Mar 15;26(6):841–2. pmid:20110278. Pubmed Central PMCID: 2832824.
  32. 32. Loytynoja A, Goldman N. An algorithm for progressive multiple alignment of sequences with insertions. Proceedings of the National Academy of Sciences of the United States of America. 2005 Jul 26;102(30):10557–62. pmid:16000407. Pubmed Central PMCID: 1180752.
  33. 33. Watterson GA. On the number of segregating sites in genetical models without recombination. Theoretical population biology. 1975;7(2):256–76. pmid:1145509
  34. 34. Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989 Nov;123(3):585–95. PubMed pmid:2513255. Pubmed Central PMCID: 1203831.
  35. 35. Fay JC, Wyckoff GJ, Wu CI. Testing the neutral theory of molecular evolution with genomic data from Drosophila. Nature. 2002 Feb 28;415(6875):1024–6. pmid:11875569.
  36. 36. Ewens WJ. Concepts of substitutional load in finite populations. Theoretical population biology. 1972;3(2):153–61. pmid:4667081
  37. 37. Watterson GA. Reversibility and the age of an allele. II. Two-allele models, with selection and mutation. Theoretical population biology. 1977;12(2):179–96. pmid:929456
  38. 38. Zeng K, Mano S, Shi S, Wu CI. Comparisons of site- and haplotype-frequency methods for detecting positive selection. Molecular biology and evolution. 2007 Jul;24(7):1562–74. pmid:17449894.
  39. 39. Zeng K, Shi S, Wu CI. Compound tests for the detection of hitchhiking under positive selection. Molecular biology and evolution. 2007 Aug;24(8):1898–908. pmid:17557886.
  40. 40. Zeng K, Fu YX, Shi S, Wu CI. Statistical tests for detecting positive selection by utilizing high-frequency variants. Genetics. 2006 Nov;174(3):1431–9. pmid:16951063. Pubmed Central PMCID: 1667063.
  41. 41. McDonald JH, Kreitman M. Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991 Jun 20;351(6328):652–4. pmid:1904993.
  42. 42. Stoletzki N, Eyre-Walker A. Estimation of the neutrality index. Molecular biology and evolution. 2011 Jan;28(1):63–70. pmid:20837603.
  43. 43. Benjamini YH Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B 1995;57(1):289–300.
  44. 44. Fujita N, Itoh T, Omori H, Fukuda M, Noda T, Yoshimori T. The Atg16L complex specifies the site of LC3 lipidation for membrane biogenesis in autophagy. Molecular biology of the cell. 2008 May;19(5):2092–100. pmid:18321988. Pubmed Central PMCID: 2366860.
  45. 45. Peltan A, Briggs L, Matthews G, Sweeney ST, Smith DF. Identification of Drosophila gene products required for phagocytosis of Leishmania donovani. PloS one. 2012;7(12):e51831. pmid:23272175. Pubmed Central PMCID: 3521716.
  46. 46. Nair U, Jotwani A, Geng J, Gammoh N, Richerson D, Yen WL, et al. SNARE proteins are required for macroautophagy. Cell. 2011 Jul 22;146(2):290–302. pmid:21784249. Pubmed Central PMCID: 3143362.
  47. 47. Hart K, Wilcox M. A Drosophila gene encoding an epithelial membrane protein with homology to CD36/LIMP II. Journal of molecular biology. 1993 Nov 5;234(1):249–53. pmid:7693949.
  48. 48. Agaisse H, Burrack LS, Philips JA, Rubin EJ, Perrimon N, Higgins DE. Genome-wide RNAi screen for host factors required for intracellular bacterial infection. Science. 2005 Aug 19;309(5738):1248–51. pmid:16020693.
  49. 49. Philips JA, Rubin EJ, Perrimon N. Drosophila RNAi screen reveals CD36 family member required for mycobacterial infection. Science. 2005 Aug 19;309(5738):1251–3. pmid:16020694.
  50. 50. Augustin H, Grosjean Y, Chen K, Sheng Q, Featherstone DE. Nonvesicular release of glutamate by glial xCT transporters suppresses glutamate receptor clustering in vivo. The Journal of neuroscience: the official journal of the Society for Neuroscience. 2007 Jan 3;27(1):111–23. pmid:17202478. Pubmed Central PMCID: 2193629.
  51. 51. Gonzalez EA, Garg A, Tang J, Nazario-Toole AE, Wu LP. A glutamate-dependent redox system in blood cells is integral for phagocytosis in Drosophila melanogaster. Current biology: CB. 2013 Nov 18;23(22):2319–24. pmid:24210616. Pubmed Central PMCID: 4010129.
  52. 52. Nonaka S, Nagaosa K, Mori T, Shiratsuchi A, Nakanishi Y. Integrin alphaPS3/betanu-mediated phagocytosis of apoptotic cells and bacteria in Drosophila. The Journal of biological chemistry. 2013 Apr 12;288(15):10374–80. pmid:23426364. Pubmed Central PMCID: 3624420.
  53. 53. Zhai W, Nielsen R, Slatkin M. An Investigation of the Statistical Power of Neutrality Tests Based on Comparative and Population Genetic Data. Mol Biol Evol. 2009 Feb; 26(2): 273–283. pmid:18922762. Pubmed Central PMCID: PMC2727393.
  54. 54. Starr T, Child R, Wehrly TD, Hansen B, Hwang S, Lopez-Otin C, et al. Selective subversion of autophagy complexes facilitates completion of the Brucella intracellular cycle. Cell host & microbe. 2012 Jan 19;11(1):33–45. pmid:22264511. Pubmed Central PMCID: 3266535.
  55. 55. Piyankarage SC, Augustin H, Grosjean Y, Featherstone DE, Shippy SA. Hemolymph amino acid analysis of individual Drosophila larvae. Analytical chemistry. 2008 Feb 15;80(4):1201–7. pmid:18193891. Pubmed Central PMCID: 2267754.
  56. 56. Kimura M. The role of compensatory neutral mutations in molecular evolution. J Genet. 1985;64(7).
  57. 57. Lazzaro BP. Elevated polymorphism and divergence in the class C scavenger receptors of Drosophila melanogaster and D. simulans. Genetics. 2005 Apr;169(4):2023–34. pmid:15716507. Pubmed Central PMCID: 1449580.
  58. 58. Jiggins FM, Kim KW. Contrasting evolutionary patterns in Drosophila immune receptors. Journal of molecular evolution. 2006 Dec;63(6):769–80. pmid:17103056. Pubmed Central PMCID: 1808192.
  59. 59. Sackton TB, Lazzaro BP, Schlenke TA, Evans JD, Hultmark D, Clark AG. Dynamic evolution of the innate immune system in Drosophila. Nature genetics. 2007 Dec;39(12):1461–8. pmid:17987029.
  60. 60. Juneja P, Lazzaro BP. Haplotype structure and expression divergence at the Drosophila cellular immune gene eater. Molecular biology and evolution. 2010 Oct;27(10):2284–99. pmid:20444883. Pubmed Central PMCID: 2944027.
  61. 61. Salazar-Jaramillo L, Paspati A, van de Zande L, Vermeulen CJ, Schwander T, Wertheim B. Evolution of a cellular immune response in Drosophila: a phenotypic and genomic comparative analysis. Genome biology and evolution. 2014 Feb;6(2):273–89. pmid:24443439. Pubmed Central PMCID: 3942026.
  62. 62. Obbard DJ, Welch JJ, Kim KW, Jiggins FM. Quantifying adaptive evolution in the Drosophila immune system. PLoS genetics. 2009 Oct;5(10):e1000698. pmid:19851448. Pubmed Central PMCID: 2759075.
  63. 63. Capy P, Gibert P. Drosophila melanogaster, Drosophila simulans: so similar yet so different. Genetica. 2004 Mar;120(1–3):5–16. PubMed pmid:15088643.
  64. 64. Webster CL, Waldron FM, Robertson S, Crowson D, Ferrari G, Quintana JF, et al. The Discovery, Distribution, and Evolution of Viruses Associated with Drosophila melanogaster. PLoS biology. 2015 Jul;13(7):e1002210. pmid:26172158. Pubmed Central PMCID: 4501690.
  65. 65. Palmer WH, Hadfield JD, Obbard DJ. RNA-Interference Pathways Display High Rates of Adaptive Protein Evolution in Multiple Invertebrates. Genetics. 2018 Apr;208(4):1585–99. pmid:29437826. Pubmed Central PMCID: 5887150.
  66. 66. Levine MT, Begun DJ. Comparative population genetics of the immunity gene, Relish: is adaptive evolution idiosyncratic? PloS one. 2007;2(5):e442. pmid:17505533. Pubmed Central PMCID: 1859838.
  67. 67. Smith JM, Haigh J. The hitch-hiking effect of a favourable gene. Genetical research. 1974 Feb;23(1):23–35. PubMed pmid:4407212.
  68. 68. Hughes AL, Nei M. Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection. Nature. 1988 Sep 8;335(6186):167–70. pmid:3412472.
  69. 69. Hermisson J, Pennings PS. Soft sweeps: molecular population genetics of adaptation from standing genetic variation. Genetics. 2005 Apr;169(4):2335–52. pmid:15716498. Pubmed Central PMCID: 1449620.
  70. 70. Garud NR, Messer PW, Buzbas EO, Petrov DA. Recent selective sweeps in North American Drosophila melanogaster show signatures of soft sweeps. PLoS genetics. 2015 Feb;11(2):e1005004. pmid:25706129. Pubmed Central PMCID: 4338236.
  71. 71. Garud NR, Petrov DA. Elevated Linkage Disequilibrium and Signatures of Soft Sweeps Are Common in Drosophila melanogaster. Genetics. 2016 Jun;203(2):863–80. pmid:27098909. Pubmed Central PMCID: 4896199.
  72. 72. Messer PW, Petrov DA. Population genomics of rapid adaptation by soft selective sweeps. Trends in ecology & evolution. 2013 Nov;28(11):659–69. pmid:24075201. Pubmed Central PMCID: 3834262.
  73. 73. Stahl EA, Dwyer G, Mauricio R, Kreitman M, Bergelson J. Dynamics of disease resistance polymorphism at the Rpm1 locus of Arabidopsis. Nature. 1999 Aug 12;400(6745):667–71. pmid:10458161.
  74. 74. Tian D, Traw MB, Chen JQ, Kreitman M, Bergelson J. Fitness costs of R-gene-mediated resistance in Arabidopsis thaliana. Nature. 2003 May 1;423(6935):74–7. pmid:12721627.
  75. 75. Decaestecker E, Gaba S, Raeymaekers JA, Stoks R, Van Kerckhoven L, Ebert D, et al. Host-parasite 'Red Queen' dynamics archived in pond sediment. Nature. 2007 Dec 6;450(7171):870–3. pmid:18004303.
  76. 76. Tian D, Araki H, Stahl E, Bergelson J, Kreitman M. Signature of balancing selection in Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America. 2002 Aug 20;99(17):11525–30. pmid:12172007. Pubmed Central PMCID: 123289.
  77. 77. Andres AM, Hubisz MJ, Indap A, Torgerson DG, Degenhardt JD, Boyko AR, et al. Targets of balancing selection in the human genome. Molecular biology and evolution. 2009 Dec;26(12):2755–64. pmid:19713326. Pubmed Central PMCID: 2782326.
  78. 78. Leffler EM, Gao Z, Pfeifer S, Segurel L, Auton A, Venn O, et al. Multiple instances of ancient balancing selection shared between humans and chimpanzees. Science. 2013 Mar 29;339(6127):1578–82. pmid:23413192. Pubmed Central PMCID: 3612375.
  79. 79. Langley CH, Stevens K, Cardeno C, Lee YC, Schrider DR, Pool JE, et al. Genomic variation in natural populations of Drosophila melanogaster. Genetics. 2012 Oct;192(2):533–98. pmid:22673804. Pubmed Central PMCID: 3454882.
  80. 80. Unckless RL, Lazzaro BP. The potential for adaptive maintenance of diversity in insect antimicrobial peptides. Philosophical transactions of the Royal Society of London Series B, Biological sciences. 2016 May 26;371(1695). pmid:27160594. Pubmed Central PMCID: 4874389.