The Genetic Polymorphisms and Colonization Process of Olive Fly Populations in Turkey

The olive fruit fly, Bactrocera oleae, is the most important pest of olives in olive growing regions worldwide, especially in the Mediterranean basin and North America. Despite the economic importance of the olive fly, the colonization route of this species is unclear. We used nuclear microsatellite markers and mitochondrial DNA to provide information about the population structure and invasion route of olive fly populations in Turkey, as representative of the Eastern Mediterranean region. Adult fly samples were collected from 38 sublocations covering all olive growing regions in Turkey. The simple sequence variability data revealed a significant genetic variability in olive fly populations and a certain degree of differentiation between Mediterranean and Aegean populations. Mediterranean populations harbor higher levels of microsatellite variation than Aegean populations, which points to the eastern part of the Mediterranean as the putative source of invasion. mtDNA results suggest olive flies from the western part of Turkey are closely related to Italo-Aegean flies of the Mediterranean basin and the olive fly populations have invaded the northern part of the Mediterranean basin through western Turkey. In addition, finding specific American haplotypes in high frequencies might indicate that Turkey is the possible source of American olive fly populations. In order to more precisely characterize the population structure and invasion routes of this organism, more DNA-based sequence analysis should be carried out worldwide.


Introduction
The olive fly, Bactrocera oleae (Gmelin) (Diptera: Tephritidae), is a serious insect pest of olive crops worldwide. The distribution of this insect is limited to regions where cultivated and wild olive trees are grown. Although it is widespread throughout the Mediterranean basin and Middle East, the current distribution of the species includes Central and South Africa, California, and Central America [1,2,3]. The olive fly causes significant quantitative and qualitative loss in the production of table olives and oil. Its larvae are monophagous, feeding exclusively on wild and cultivated olive fruits. The adult female lays her eggs beneath the epidermis of ripening olive fruits; the hatched larvae feed inside the fruit by destroying the pulp, allowing entry of microbial infections. Feeding damage can cause premature fruit drop and reduces fruit quality for table olive and oil production. Damage has been estimated at 15% of total olive production, nearly USD 800 million/year [4]. Turkey, an important trading center for olive products for thousands of years, is the fifth leading olive producer in the world, with 8.2 6 10 5 ha (data from FAO, 2010; http://faostat. fao.org).
A detailed understanding of the biology, genetic structure, and geographical variability of a given species is a prerequisite to designing effective quarantine, control, or eradication strategies [5]. Initial molecular studies of the olive fly were based on gel electrophoresis techniques and restricted to one or a few natural populations or their comparisons with laboratory colonies [6][7][8][9][10][11][12]. Recent advances in molecular technologies have provided new tools to monitor natural populations and their invasion pathways. Microsatellites, a nuclear co-dominant marker subject to Mendelian inheritance, display considerable polymorphisms due to variation in the number of repeat units, making them useful molecular markers for population studies [13][14][15]. Mitochondrial DNA (mtDNA) is an important molecular tool for reconstructing evolutionary events, such as identification of the region of origin of a species and the pathways of invasion and historical demography, and complements well with the information provided by microsatellite markers. The power of mtDNA analyses derives from its simple mode of inheritance (maternal and non-recombining), relatively high mutation rate, and the availability of comparative data with other species [16,17]. The nuclear microsatellite markers of the olive fly have been developed [2,18,19] and the complete mtDNA sequence has been published [16]. Polymorphic microsatellite markers and mitochondrial DNA haplotypes have also been used to study genetic polymorphisms in other Tephritidae species such as B. dorsalis [17,20], B. tyroni [21,22], B. cucurbitae [23,24], and Ceratitis capitata [25,26], in order to understand evolutionary influences on invasive processes, and to identify routes of colonization.
Using microsatellite and mtDNA markers, genetic differentiation in different geographical populations and the invasive process of the olive fly were analyzed at the macrogeographical level. After examining the expansion and colonization history of the olive fly, 3 separate genetic groups, Pakistan, Africa, and the Mediterranean plus America, were revealed; Africa is suggested as the possible origin of this species [18], while the American samples seem to originate from the eastern Mediterranean [18,27,28]. Microsatellite markers revealed 3 subpopulations in the northern part of the Mediterranean basin: western (Iberian Peninsula), central (Greece-Italy), and eastern (Cyprus) [29], which was subsequently extended to Israel and California [27]. The results also indicated a westward expansion of the species associated with a gradual decrease in variability (expected heterozygosity, H e ) from Cyprus to Portugal [29]. Westward expansion of the species, associated with a gradual decrease in variability, may have occurred concurrent with the introduction of the cultivated olive from its Levantine center of domestication to the Mediterranean basin [27,29]. The invasion of olive flies across Europe is one hypothesis that has been proposed to explain the observed gradient of genetic variability across the Mediterranean area. An alternative interpretation, based upon another study by sequencing the whole mitochondrial genome of olive flies sampled from around the world [28] suggested an older origin, associated with the fragmentation of the wild olive host in different glacial refugia on this continent. The early co-history of the olive fly with its wild host was explained in detail in the same study. Besides these two main scenarios, the olive fly population of Tunisia was found genetically different from the populations of the northern shores of the Mediterranean basin [30]; therefore, expansion might occur from the southern to the northern coast of the Mediterranean basin.
The Eastern Mediterranean region is a putative invasion point for olive fly into the Mediterranean basin [29] and America [18,27,28]. Despite its importance for understanding the overall picture of genetic diversity and bio-invasion within this species, unfortunately, no detailed research has been carried out in this region. Genetic analysis of the olive fly bio-invasion process in this region can provide us a more complete understanding of the historical distribution, pattern of olive fly movements, and additional information for control of this important pest. The aims of this study were a) to investigate the population structure and genetic variability of different geographical populations of B. oleae in Turkey and b) to provide detailed information about the expansion and colonization history of the species. Field-collected populations of olive fly from 38 different sublocations, selected to be representative of the entire distribution area and covering the eastern to western parts of Turkey (from the far eastern point of Islahiye to the far western point of Gökçeada, covering a distance .1300 km) were analyzed for the 12 most polymorphic microsatellite loci known and their mtDNA haplotypes.

Ethics Statement
No specific permits were required for the described field studies for this wide spread agriculture pest. We confirm that the location is not privately owned or protected. The field studies did not involve endangered or protected species.

Olive Fly Samples
Olive fly samples were collected from egg-infested fruits in olive orchards during the harvest season in all major olive growing regions of Turkey in 2010. The sampled provinces included Ç anakkale, Bursa, Balıkesir, Manisa İzmir, Aydın, and Mugla, in the Aegean region; Mersin, Adana, Osmaniye, Hatay, and Gaziantep in the Mediterranean region (Table 1 and Figure 1). From each province, 3 different sublocations (for Bursa and Ç anakkale 4 different sublocations) were used as sampling sites (in total 38 different sublocations from 12 provinces). Different trees were sampled to limit sibling collections. Samples from each population were kept in separate cages to prevent mixing and incubated in the laboratory at 25uC until larvae emerged and developed to adulthood. Adult samples were frozen and stored at 280uC until use.

Amplification of Microsatellite Loci
The 12 most polymorphic microsatellites primers (listed in Table 2) were tested and chosen from the 20 previously characterized microsatellite primers for this organism [2,18,29]. Primers were labeled by using 3 different fluorescent dyes, HEX, 6-FAM, and NED. From each sublocation, 10 adult individuals (total 380) were used for microsatellite analysis. Total DNA was extracted by the Lifton method [31]. Amplification of microsatellite loci was performed as described [18,29]. After PCR, 1 mL of each reaction was visualized on 2% agarose gel and the products were analyzed on an ABI PRISM 3100 Genetic Analyzer (Applied Biosystems). Electropherograms were manually checked with the Applied Biosystems Peak Scanner program (http://www. appliedbiosystems.com) and recorded.

Amplification of Mitochondrial Haplotypes
From each sublocation, 7 individual flies (total 266) were used for sequencing the most polymorphic 574 bp of the first subunit of the mitochondrial NADH dehydrogenase (ND1) gene. The experiment was performed as described [18] with modifications. In order to amplify the corresponding region, a new primer pair (Bo3EDF: 59-AGTCAATGAGCTTGAACAAGCATGTG-39 and Bo4EDR: 59-AGGTATTCCTCAACCTTTTTGTGAC-39) was used. These primers were designed using the published mitochondrial genome of B. oleae, GenBank accession no. AY210703. After PCR, the products were visualized and isolated from 1.2% agarose gel by Qiagen QIAquick PCR purification kit according to the manufacturer's instructions and sequenced directly by using BoND1F (59-TTTAGTTGCTTGGTTGTG-TATTCC39; obtained from [18]) and Bo4EDR primers. Sequencing was performed on an Applied Biosystems A3100 automated DNA sequencer. Electropherograms were checked manually.

Data Analysis
Microsatellite data analysis. Genetic polymorphism within populations was determined as the mean number of alleles per locus (n a ), effective number of alleles (n e ), and observed (H o ) and expected heterozygosity (H e ) using POPGENE version 1.31 [32]. The same program was used to calculate the genetic distance values according to Nei [33] and agreement of genotypic frequencies with Hardy-Weinberg equilibrium (HWE) in populations with chi-square (x 2 ) and likelihood ratio G 2 ; corrections were performed by applying Bonferroni correction for multiple comparisons [34]. Statistical comparisons were performed using PAST (version 2.07) [35]. Linkage disequilibrium, significance of association between genotypes at all pairs of loci, was tested with POPGENE and GENETIX software [36]. The phylogenetic relationships between populations based on genetic distances were depicted by a neighbor-joining tree, constructed using POPULA-TIONS version 1.2.32 [37] with bootstrap value = 100. The Analysis of Molecular Variance (AMOVA) test was performed for 2 different population groupings (east-west) in ARLEQUIN v. 3.5 [38]. Genetic variation was partitioned into 3 levels; among populations, among populations within individuals, and within individuals. Relationships between genetic and geographic dis-tances among populations, isolation by distance, were assessed by the Mantels test [39]. Geographic distances between localities were estimated using the web site www.googleearth.com. STRUC-TURE software 2.1 [40,41] was used to investigate the number of possible genetic clusters (or distinct groups) of B. oleae in Turkey. This program assumes a model in which there are K populations, each of which is characterized by a set of allele frequencies at each locus. To determine the most likely number of clusters (K) in our population samples, we used different values of K ranging from 1 to 12. The degrees of genetic differentiation among populations were analyzed as pairwise F ST values using FSTAT software 2.9.3 [42]. This program was also used to calculate allelic richness (AR). Gene flow (N e m, the number of effective migrants per generation) between geographic regions was calculated from F ST (averaged over the 12 loci) according to the formula: N e m = [(1/F ST )21)]/4 [43]. Finally, BOTTLENECK software [44] was used to infer demographic expansion/contraction in each population. mtDNA data analysis. DNA sequences were edited and verified as follows. First, primer sequences were removed from the raw files. The 574-bp portion of ND1 mtDNA sequences from 266 B. oleae samples were aligned with and without the previously published mitochondrial ND1 gene sequences (GenBank accession numbers AY998304 to AY998325 [18] and GU108459 to GU108478 [28]) using CLUSTALW [45]. Descriptive statistics (number of variable sites, number of haplotypes, haplotype diversity, average number of nucleotide differences between haplotypes) were calculated in Dnasp (ver. 5.0) [46]. Medianjoining networks of haplotypes, including previously identified haplotypes, were constructed using NETWORK (ver. 4.6) [47,48].

Results
To unravel the variability and colonization process of B. oleae in Turkey, genetic polymorphism of natural olive fly populations was studied by using nuclear DNA (microsatellite) and mitochondrial markers.

Microsatellite Variability
The analysis of 12 microsatellite loci in 380 flies, captured from 38 sublocations in 12 provinces, showed high levels of variability (Table S1). The number of alleles per locus varied from 5 (Boms61) to 23 (Bo-D54), with an overall mean of 13.9264.94. The mean number of effective alleles detected per locus was 4.4861.14, while the highest number was observed at Bomic15 with 7.63; the lowest value was at Bo-D37 with 3.34. Polymorphism was also determined with regard to the level of heterozygosity. Observed (H o ) and expected (H e ) heterozygosity values were 0.7860.08 and 0.7760.05, respectively, when averaged over loci.
H o values ranged from 0.69 (Boms61 and Bo-D53) to 0.91 (Bomic15); H e values ranged from 0.70 (Bo-D37) to 0.87 (Bomic15). Geographical distribution of microsatellite alleles by population for each locus was also determined. The number of detected alleles varied from 4 to 16 in all populations. The highest number of private alleles (up to 5) was present at Bo-D51 locus and no private allele was detected at Bo-D48 and Boms61 loci.
After sequential Bonferroni correction [34], all sampled populations were confirmed to be in HWE at all loci according to x 2 and G 2 criteria (at P,0.05). The average F ST over all loci and the number of effective migrants per generation, N e m, were 0.03 and 8.36 respectively. No linkage disequilibrium was detected between genotypes at all loci; all were considered independent. Table 3 shows the overall level of variability relative to the 12 loci in the 12 analyzed olive fly populations. The mean number of alleles (n a ) ranged from 7.58 (Adana) to 9.50 (Mersin) and allelic richness (AR) from 7.05 (Bursa) to 8.88 (Mersin). The amount of genetic variation seemed to be homogeneously distributed among different populations, considering mean heterozygosity estimates of H o and H e . The observed and expected heterozygosity values varied from 0.75 (Mugla) to 0.82 (Aydın) and 0.74 (Bursa, Manisa, and Adana) to 0.78 (Mersin, Osmaniye, and Gaziantep). However, mean number of effective alleles (n e ), number of private alleles (n p ), and frequency of private alleles (A p ) were found to be significantly different between two regions (P,0.05, Mann-Whitney U test). The results were: n e = 4.01, n p = 1, A p = 0.01 in Aegean populations and n e = 4.36, n p = 3.4, A p = 0.03 in the Mediterranean popula- tions. In total, 24 new private alleles were determined in low frequencies (encountered only once or twice), 17 of them in the Mediterranean region. Although the number of private alleles is dependent on sample size [49,50], the Bursa, Ç anakkale, and Manisa populations had no private alleles. The Mersin population presented the highest level of genetic diversity including mean and effective number of alleles, allelic richness, private alleles with high frequency ($0.04), and expected heterozygosity among studied populations. In comparison with the other populations, lower variability values were observed in the Bursa population.
Genetic relationships among populations were quantified by pairwise F ST (Table 4). The F ST values ranged from 20.00197 (between Mugla-Ç anakkale) to 0.05146 (between Aydın-Adana). Analysis of significance of pairwise F ST values among all samples indicated the possibility of grouping in 2 subpopulations. Mediterranean populations were generally significantly different from, although at different probability levels, populations of the Aegean region. However, the genetic differentiation within regions was low. This lack of differentiation can be explained by smooth topography, continuous plant cultivation (including plant exchanges), and extensive olive trades among provinces in these regions. Considering the genetic variability among Mediterranean and Aegean populations, the data were analyzed by separating the populations into 2 main geographical groups and the mean pairwise F ST value was 0.01379 (P,0.001). The distance between these 2 geographical regions is greater than 700 km. The level of gene flow, considering this distance, for olive fly populations in Turkey can be considered an important factor influencing the shape of the genetic structure of this organism.
The unrooted NJ tree of 12 populations based on genetic distances presented in Figure 2. Most of the branches had low bootstrap values. The observed tree is not inconsistent with, but nor does it conclusively demonstrate, the Aegean and Mediterranean populations forming two separate lineages.
To further analyze the structure of olive fly populations in Turkey, Factorial Correspondence Analysis was performed. This analysis also showed the same topology as the unrooted NJ tree with 2 main clusters: Mediterranean and Aegean ( Figure S1).
To analyze the isolation by distance among populations, the Mantel test was performed using microsatellite markers ( Figure  S2). Results indicated the presence of a correlation between genetic and geographic distances (Spearman Rank correlation coefficient r = 0.4738; Mantel P,0.002).
Tests of homogeneity among the populations were performed using AMOVA. For this purpose, the population groups were chosen according to the 2 major geographical areas clustered in the population tree: the Aegean and Mediterranean regions of Turkey. The results are summarized in Table 5. AMOVA confirmed a regional structure among groups (P,0.0019). The main contribution to genetic variance was due to variation within individuals while little genetic variation was attributable to the variation among populations/within groups.
The genetic structure of the populations was also analyzed based on microsatellites using STRUCTURE software ( Figure  S3). A low level of differentiation is depicted by this analysis. Although the number of clusters (K) varied from K = 1 to 12, no significant genetic differentiation was observed. We also tested the hypothesis of a recent bottleneck based on the TPM. The bottleneck test, with a mode shift in allele frequency classes, attributed an L-shaped distribution to all populations, consistent with normal frequency class distribution ranges (P.0.05).

mtDNA Analysis
To explore the invasion history of this species from the eastern Mediterranean to Europe, a portion of the mitochondrial ND1 gene was used to investigate genetic diversity, haplotype phylogeny, and demographic history of 12 olive fly populations from all olive cultivating areas in Turkey. Forty-four haplotypes, variant sequence forms, were observed and given haplotype designations in 266 individuals. Thirty-five of these haplotypes were unique to Turkey and 9 of them were shared with previously identified haplotypes [18,28]. The availability of such unique haplotype variants provided us a powerful tool for resolving questions relating to various aspects of the invasion process of olive fly   In 574 bp, there were 38 polymorphic sites (6.62% of total length), 17 of these sites were singletons, the mutation being present in a single haplotype sequence, and 21 of them were parsimony informative. The list of identical haplotypes from this study and previous studies are presented in Table S2. Although fewer flies were used (45 flies), 11 different haplotypes from 8 locations were identified in the Mediterranean basin (from Israel-Haifa to Portugal-Paradale) [18]. Our results showed that the western region of Turkey and the European part of the Mediterranean basin are most closely related, sharing 6 out of 10 haplotypes. The number of these common haplotypes was 1/4 from Portugal, 2/3 from France, 3/4 from Italy, and all haplotypes from Greece, Israel, and previous reports from Turkey. In America, 5 different haplotypes were identified [18] and 4 of them, except for haplotype L (found in Ensenada-Mexico and Burguret Forest-Kenya), were observed in our study. Two of these haplotypes (haplotypes N and O) were specific to the American continent and we observed both haplotypes with high frequencies with no regional specification (Table S2 and S3). Ten haplotypes were reported from Pakistan and Africa, the source of Mediterranean populations [18]. We found just one African haplotype reported previously [18] in Turkey, haplotype A (Table S2). In another study, 4 previously known haplotypes, H1, H2, H4, and H17 were identified by using 11 samples from the Mediterranean basin [28]. Three of them, H1, H2, and H4, were observed in our study; however, we did not observe haplotype H17 (found in Paradale-Portugal and Sig City-Algeria) ( Table S3).
The mitochondrial haplotypes were differentially distributed among the Turkish olive fly populations. Table S3 lists the haplotypes, frequencies, and distributions among the populations of B. oleae. Four haplotypes, H1, H2, H4, and H8, were found at higher frequencies and were the most common, widely distributed variants, comprising 75.1% of all 266 sequences in Turkey. Haplotypes H1 and H2 were shared by all studied populations and they seem to be fixed in Turkey. Haplotype H1, comprising 37.7% of all 266 sequences, was a common (dominant) haplotype in our region. Sixty percent of the eastern Mediterranean and 40% of the Aegean regions are grouped in H1. The second-most common haplotypes H2 and H8 were found at a frequency of 13.1%. In haplotype H2, 52% of Mediterranean and 48% of Aegean samples were grouped together. However, samples from the Aegean region had a higher frequency (with 83%) of haplotype H8. H4, which is the fourth most common haplotype with a frequency of 11.2%, contained only samples from western Turkey.
A rough association was observed between geographical source of individuals and genetic groups of haplotypes. Ten and twentyseven haplotypes were specific to the Mediterranean and Aegean regions, respectively. However, each of the remaining 7 haplotypes was observed in more than one region. Haplotype 23 was found on the small island Gokceada, a sublocation of Ç anakkale, and seems to be unique to this island.
The relationships between mitochondrial haplotypes identified in our dataset, together with previously published worldwide data (GenBank accession numbers AY998304-AY998325 and GU108459-GU108478), were defined by a haplotype network (Figure 3). The haplotype-based network analyses allowed us to  (Table S3), haplotypes from Portugal, Italy, and Algeria (haplotypes 17, 28, 29, and 30) formed the western European cluster. Four main haplotypes of Turkey (H1, H2, H4, and H8) and previously identified haplotypes were positioned in the network and the remaining haplotypes were found generally at lower frequencies and connecting to these haplotypes through few mutations. The sequence of the most common haplotype, H1, shared 100% identity with the previously identified haplotypeA [18]. In his study, this haplotype was found with the overall frequency of 0.30%; in our study, the frequency of this haplotype was slightly higher (37.7%). Combining our results with his study, H1 seems to be a common haplotype for the olive fly in the eastern Mediterranean and America. Later, another identical haplotype of A was found in a sample from Morocco [28]. The second most common haplotype, H2, has the same sequence as haplotypeN, which was only identified in Mexico, the USA [18], and USA -haplotypeOroville- [28]. HaplotypeO (in H13) has only been reported in the USA [18] and is connected to H2 by one mutation. This haplotype also seems to be specific to the American continent. Another common haplotype, H8, is connected to H1 by two mutations and is predominant in western samples; only 17% of this haplotype was formed by Mediterranean sequences. H4 contains only samples from western Turkey and has the same sequence as previously identified haplotypes from Bari and Vaggia (Italy) [28]; haplotype I is the most common and widely distributed allele in Europe [18]. HaplotypeJ (in H10) was found in Italy [18] and was connected to H4 by one mutation. This haplotype and other haplotypes (except for H8) diverge by one or two mutational steps and were all derived from western Turkey; it seems that H4 is specific to this and the Italo-Aegean region.

Discussion
In this study, we used both mitochondrial and nuclear DNA markers to unravel the patterns of genetic differentiation and the potential invasive route of B. oleae from the eastern Mediterranean to Europe. Turkey is an important part of the eastern Mediterranean region, but previously relatively little was known mainly because of the limited number of analyzed specimens [18,29].

Population Structure of B. oleae in Turkey
The simple sequence variability data reveal two main findings: (i) a level of genetic variability is present in the olive fly populations in Turkey; (ii) a certain degree of differentiation between Mediterranean and Aegean populations might indicate the possible expansion of this fly from east to west.  According to some authors [51,52], a correlation is expected between the level of genetic variability and the degree of environmental diversity of a species. The potential for ecological heterogeneity to increase genetic diversity, and perhaps divergence, has been suggested [53,54]. In our study, B. oleae, a strictly monophagous species, met such expectations over a wide area. We observed a high level of genetic variability among the olive fly populations by using 12 polymorphic microsatellite markers and samples collected from different regions of Turkey. This result is consistent with the literature; previous population studies [18,27,29,30] of this organism reported a high degree of genetic variability on a regional geographic scale, which seems to be characteristic of this species. One of the main causes of this high genetic variation is the length of time that has elapsed since the species became established in the eastern Mediterranean. Domestication of wild olive trees took place in the Near East around the 4th millennium BC [28,55,56], and archeological ruins indicate that the olive was processed in Anatolia for the first time around 3 th -2 nd millennium BC [57]. However, it is possible that most of the evolutionary history of the species has in fact taken place on wild olives before the domestication of cultivated olives [58]. The oldest olive seed was found in Anatolia (specifically in Tarsus-Mersin region) and dated from the Neolithic age, 8000 to 5000 BC [59]. The elevated effective size of the populations is another factor contributing to genetic variability, since olive groves cover wide expanses in our region and olive fly populations are expected to remain large over time.
Cluster analysis points to the existence of two subpopulations in Turkey: Mediterranean and Aegean. The results of the NJ tree, although with low bootstrap values (see Figure 2) and PCA ( Figure  S1), support the existence of these major subpopulation groups. The Mantel test performed with microsatellite markers revealed a positive correlation between geographic distances and genetic distances among populations, indicating isolation by distance.
Three main hypotheses have been proposed to describe the colonization and expansion history of olive fly populations towards the European parts of the Mediterranean basin [27][28][29][30] but the underlying population dispersion processes remain partially unclear. Our SSR data, coupled with the presence of a high number of low-frequency alleles, seem to suggest the westward expansion of this fly from the eastern Mediterranean to the northern Mediterranean basin [27,29]. The Mediterranean populations are the most polymorphic with higher genetic variability values and a greater number of private alleles. Briefly, as stated before [27,29], the east-to-west colonization of the olive fly is accompanied by a gradient loss of polymorphism and linked to the westward expansion of olive cultivation. Furthermore, according to archeological data, olives were transported from Anatolia to the Greek islands and Greece [60].
The observed high level of gene flow (Nem = 8.36) between populations does not seem to be enough to homogenize the olive fly populations in Turkey. SSR markers revealed genetic variability and differentiation among olive fly populations in Turkey. There are 3 possible reasons for this differentiation. First, continuous host resources, the absence of natural barriers to gene flow, and appropriate climatic conditions are favorable to the olive fly at the Syrian border of Turkey. This may form a natural route for population dispersal from the Middle East to the Mediterranean region of Turkey, i.e. from south to north or vice versa. Different authors have reported the high dispersal capacity for geographical expansion of the olive fly [3,61,62]. Second, limited human-mediated effects (such as new olive plantations, transportation, and trade) and less continuous distribution of olives from east to west might be another factor. Third, local variation in selection intensity (for agricultural purposes) might be strong enough to maintain variation between regions. The western regions of Turkey have large areas of olive cultivation and intense selection pressures have being applied to these populations through long-standing eradication programs.
In our study, the Mersin population showed the highest genetic variability in Turkey. There are 2 possible reasons for this high variability; First, Mersin is one of the first regions where olive cultivation was systematically initiated in Anatolia between 2000-1200 BC [57,60]. The second reason for this variability might be that the region is the closest point between Turkey and Cyprus, among the first Mediterranean islands where olive trees were systematically cultivated. An intensive olive trade has been carried out with this island and other Mediterranean countries such as Egypt and the Greek islands since historical times via a seaport in this city [60].

Haplotype Analysis and the Population Structure of B. oleae in Turkey
Our mtDNA results provide valuable information for understanding olive fly invasion from the eastern Mediterranean to Europe. We have 3 main conclusions; (i) olive flies from western Turkey are most closely related to Italo-Aegean flies of the Mediterranean basin; (ii) olive fly populations invaded the northern Mediterranean basin through western Turkey; (iii) and Turkey is the possible source of American olive fly populations.
Unlike the results obtained from microsatellites, mitochondrial data indicates a higher level of mean genetic diversity (number of haplotypes, haplotype diversity, and nucleotide diversity) in the Aegean than in the Mediterranean region (Table 6). One reason for this difference is a single haplotype, H1, predominates in southeastern Turkey. This discrepancy may also be the result of different evolutionary patterns of both markers; mtDNA is maternally transmitted in animals, it evolves quite slowly in comparison to microsatellites, and it is more prone to genetic drift [63].
Population structure of the Mediterranean basin has been studied previously by using mtDNA sequences [18], but failed to identify population-level genetic differentiation. A clear phylogenetic separation between eastern and central/western Mediterranean populations was reported after sequencing the whole mitochondrial genome in a limited number of flies [28]. Our network analysis, similar to the findings of a recent study [64], indicates 3 main groups in the Mediterranean basin; eastern Mediterranean-America, Italo-Aegean-western Turkey, and western Europe (or Iberian clade). Although no clear split has been observed between the eastern Mediterranean-America and Italo-Aegean populations in our study, Turkey seems to contain both subpopulations ( Figure 3). Six European-specific haplotypes (H4, H7, H8, H10, H11 and H14 in Table S2 and S3) out of 10 previously identified haplotypes [18] from Greece to France are found specifically in western Turkey, although at different frequencies (H8 is also found in the Mediterranean region). These common haplotypes are distributed to different torsos in the Network (Figure 3). It seems that the border of the Italo-Aegean population extends from the western coast of Turkey to France, from which 2 haplotypes (out of 3) were identified in Turkey. In addition, it was mentioned by [60] that Phokaians transferred cultivated Anatolian olive varieties from western Turkey to Marseille (France) in 600 BC. These kinds of olive transportations during history may explain the observation of common haplotypes between central Europe and western Turkey. However, more detailed sequence analysis should be performed on samples from central Europe to clarify this issue. The Western Europe group in the network contains H17, H28, H29, and H30 haplotypes (Figure 3) mainly identified in western Europe and northwestern Africa (Table S3). Arabian olive varieties were introduced to the Iberian Peninsula after the 8 th century AC [29]. This might explain the separation of the Western Europe from the Anatolian groups. H4, a specific haplotype for the Aegean region, contains the common European haplotype I (Table S2) and is connected to European-specific haplotype J, which is one mutational step away (Figure 3). H8 predominates in western samples and seems to be a transient torso between H1 and H4.
H1, the only shared haplotype from Africa, is the most common and widespread variant especially in southeastern populations of Turkey. H2, being a specific haplotype for American samples, is found at very high frequency in all studied regions. Another specific American haplotype is H13, which is differentiated from  Table S2 and S3. The areas of the circles are proportional to the number of samples sharing each haplotype. Empty circles represent haplotypes not observed in the sample. Haplotypes are colored by region. *Data obtained from [18] and [28]. doi:10.1371/journal.pone.0056067.g003 H2 by one mutational step and is observed in 2 different regions of Turkey. Previous studies [18,27] supported the Middle Eastern origin of American populations. Determination of the wide distribution and high frequencies of these specific haplotypes in our study indicates the possible origin of Turkey for the American populations. However, it should be kept in mind, limited number of sequences was available from this continent.
Based on Network analysis and the distribution of haplotypes, an east (Mediterranean) to west (Aegean) invasion route is inferred, as suggested by our SSR data. The fact that the Aegean region shares many haplotypes (6/10) with eastern and central Europe indicates the species may have invaded Europe through western Turkey, i.e. olive fly expansion occurred via the northern part of the Mediterranean basin. This observed westward expansion of the species supports a previously expressed hypothesis [29].
It is always possible that additional haplotypes might exist in natural olive fly populations in the Eastern Mediterranean-America and Italo-Aegean regions; however, these would be limited to the tips or fringes of the network, considering the large number of flies and wide collection area used in this study. Direct analysis of DNA sequence-based haplotypes from other parts of genomes might help us to understand the movement of this pest from the Middle East to Europe.
The olive fruit fly is the most important pest of wild and cultivated olives. Laboratory and field observations have shown that female B. oleae exhibited strong ovipositional preferences for certain varieties of cultivated olives [65,66,67,68]. The resulting larvae performed better in preferred olive varieties than in lesser or non-preferred varieties. These varietal differences in larval performance in olive have implications for the success of an invasion and range expansion process, which effects population growth and dynamics of the organism during its adaptation to a new environment. Once introduced into an area the fly's establishment time could be effected by the olive variety present [68]. More information about cultivar susceptibility to olive fruit fly will also help us to clarify olive fly distribution and colonization process in future.