The tsetse fly, Glossina morsitans morsitans, is a significant problem in Zambia and Malawi. It is the vector for the human infective parasite Trypanosoma brucei rhodesiense, which causes human African trypanosomiasis, and various Trypanosoma species, which cause African animal trypanosomiasis. Understanding the genetic diversity and population structure of G. m. morsitans is the basis of elucidating the connectivity of the tsetse fly populations, information that is essential in implementing successful tsetse fly control activities. This study conducted a population genetic study using partial mitochondrial cytochrome oxidase gene 1 (CO1) and 10 microsatellite loci to investigate the genetic diversity and population structure of G. m. morsitans captured in the major HAT foci in Zambia and Malawi. We have included 108 and 99 G. m. morsitans samples for CO1 and microsatellite analyses respectively. Our results suggest the presence of two different genetic clusters of G. m. morsitans, existing East and West of the escarpment of the Great Rift Valley. We have also revealed genetic similarity between the G. m. morsitans in Kasungu National Park and those in the Luangwa river basin in Zambia, indicating that this population should also be included in this historical tsetse belt. Although further investigation is necessary to illustrate the whole picture in East and Southern Africa, this study has extended our knowledge of the population structure of G. m. morsitans in Southern Africa.
Techniques from population genetics have been widely applied to assess problems in evolutionary biology, computational biology, wildlife conservation, animal breeding, etc. In this study, we used population genetics approaches to elucidate the genetic population structure of the tsetse fly Glossina morsitans morsitans, an important vector of the Trypanosoma parasite. Our aim was to identify the optimal range of areas in which to implement effective tsetse fly control programs. We focused on five areas of active human African trypanosomiasis foci in Zambia and Malawi and used both mitochondrial and nuclear markers for population genetics analyses. High levels of genetic differentiation were observed between G. m. morsitans in Nkhotakota Wildlife Reserve, Malawi, and those at the other locations included in this study. Nkhotakota Wildlife Reserve lies on the eastern side of the escarpment of the Great Rift Valley at an altitude that is at the biological limit of G. m. morsitans, thereby functioning as a geographical barrier restricting gene flow from the G. m. morsitans on the western side of the escarpment. The western side of the escarpment is part of a historically-known tsetse belt that resides within the Luangwa river basin. We have demonstrated that a previously undescribed population of G. m. morsitans in Kasungu National Park, Malawi, should be included in this historical tsetse belt.
Citation: Nakamura Y, Yamagishi J, Hayashida K, Osada N, Chatanga E, Mweempwa C, et al. (2019) Genetic diversity and population structure of Glossina morsitans morsitans in the active foci of human African trypanosomiasis in Zambia and Malawi. PLoS Negl Trop Dis 13(7): e0007568. https://doi.org/10.1371/journal.pntd.0007568
Editor: Brian L. Weiss, Yale School of Public Health, UNITED STATES
Received: November 7, 2018; Accepted: June 21, 2019; Published: July 25, 2019
Copyright: © 2019 Nakamura et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All mitochondrial CO1 haplotype sequences are available from the DDBJ database (accession numbers LC455935, LC455936, LC455937, LC455938, LC455939, LC455940, LC455941, LC455942, LC455943, LC455944, LC455945,LC455946, LC455947, LC455948, LC455949, LC455950, LC455951, LC458946, LC458947, LC458948, LC458949, LC458950, LC458951, LC458952, LC458953). All microsatellite genotype files are availabe from the Dryad database (doi:10.5061/dryad.122hs54).
Funding: This research was supported by Japan Agency for Medical Research and Development (AMED) under Grant Number 16jm0510001h0002. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Tsetse flies (Glossina spp.) are the vector for the Trypanosoma parasites that cause human African trypanosomiasis (HAT) and African animal trypanosomiasis (AAT). Both diseases present a significant burden in terms of public health and economy in sub-Saharan Africa [1,2]. Control of both diseases are difficult, mainly due to the presence of wild and domestic animal reservoirs, the lack of prophylactic drugs and vaccines, and the high cost and severe side effects of available drugs. Therefore, tsetse fly control, in combination with chemotherapeutic methods, is still the most theoretically desirable method for controlling both AAT and HAT [3,4]. To conduct effective tsetse fly control programs, it is necessary to identify the extent of the tsetse fly distribution and its connectivity with residing populations. Several population genetics studies have been successful in identifying population structure and the extent to which the discrete populations are connected by dispersal and migration in several tsetse-infested African countries [5–8].
Zambia and Malawi, along with Mozambique and Zimbabwe, lie within the “common fly belt” in Southern Africa . Tsetse flies found in these areas include three subspecies or species of the Morsitans group (subgenera Glossina Wiedemann) of tsetse flies: G. morsitans morsitans, G. morsitans centralis, and G. pallidipes; one species from the Fusca group (subgenera Austenina Townsend); G. brevipalpis; and one species from the Palpalis group (subgenera Nemorhina Robineau-Desvoidy); G. fuscipes . Among these species of tsetse flies, G. m. morsitans and G. pallidipes are the major vectors of AAT and HAT in these countries. The prevalence of human infective T. b. rhodesiense, as assessed by the presence of the T. b. rhodesiense-specific human serum resistance-associated gene, was highest in G. m. morsitans . G. m. morsitans inhabits savanna woodlands, and its distribution has been mapped primarily by determining the abundance of mammal hosts, and the analyses of tsetse habitat using geographical information systems and remotely-sensed satellite data [12,13]. The G. m. morsitans populations in East and Southern Africa are distributed across Tanzania, Mozambique, Zimbabwe, Zambia and Malawi  and constitute four allopatric belts . The genetic diversity and population structure of G. m. morsitans in two of the four allopatric belts has been explored using mitochondrial and microsatellite markers, and significant differences have been identified among five populations from Zambia, Mozambique and Zimbabwe [15,16]. Further research involving four populations from Tanzania has also revealed high differentiation and low rates of gene flow, suggesting that the discontinuous distribution of the populations leads to genetic drift, overwhelming gene flow . Although it has been suggested that further sampling from other allopatric belts would increase our understanding of the breeding structure of G. m. morsitans in this region , there has, to our knowledge, been no research conducted that includes G. m. morsitans populations from Malawi.
Zambia and Malawi, two countries in South-East Africa with a significant HAT burden, report fewer than100 new cases annually . The major HAT loci are the Lower Zambezi region and the Luangwa valley in eastern Zambia , and Kasungu National Park and Nhkotakota Wildlife Reserve in Malawi . The two countries share foci in the Northern region: The Chama district of Zambia and the Rumphi district of Malawi. Therefore, both countries should be included when assessing questions related to HAT. To investigate the genetic diversity and population structure of G. m. morsitans in the major HAT loci in Zambia and Malawi, which data are fundamental for vector control, we conducted a population genetics study using a 407-bp region of mitochondrial cytochrome oxidase 1 gene (CO1) with 10 microsatellite loci as genetic markers.
Materials and methods
Sample collection and DNA extraction
G. m. morsitans were collected from three locations in Zambia, and two locations in Malawi (Fig 1). The three sampling locations in Zambia include the Lower Zambezi National Park (LZNP, S15° 37.577, E29° 36.132), Shikabeta (SHKB, S14° 57.208, E29° 49.924), and the Musalangu Game Management Area (MGMA, S11° 09.807, E33° 24.224). Sampling within the national parks and GMAs was conducted with permission from the Zambian Wildlife Authority (ZAWA). The two sampling locations in Malawi include the Kasungu National Park (KNP, S13° 01.289, E33° 08.517) and the Nkhotakota Wildlife Reserve (NWR, S12° 52.212, E34° 08.283), in which sampling was conducted with permission from the Department of National Parks and Wildlife (DNPW) of Malawi. Samples were collected between May 2012 and February 2018. At each sampling location, flies were captured when driving down roads while deploying mobile tsetse traps attached to the rear end of the car. Each drive was around one kilometer, a distance which is within the average lifetime dispersal of G. morsitans . The captured flies were inspected using microscopy for morphological identification of G. m. morsitans  and sexing. Apparently pregnant females were not included in the analysis. The flies were then put into separate 2 mL sample tubes with silica beads to dry. The dried flies were transferred into new tubes with beads, and were smashed using a Beads Cell Disrupter (Micro Smash MS-100, Tomy, Japan) at 3,000 rpm for 45 s. DNA was extracted using a modified protocol with the DNA Isolation Kit for Mammalian Blood (Roche, Switzerland). Briefly, 330 μl of white cell lysis buffer was added directly into each tube, vortexed, and heated at 37°C for 30 min. Then 170 μl of protein precipitation solution was added, vortexed thoroughly, and centrifuged at 15,000 rpm for 20 min. DNA was precipitated by the addition of ethanol. Extracted DNA was stored at −30°C until further use.
Markers indicate the five sampling locations for the G. m. morsitans populations included in this study. The full names of the location codes can be found in Table 1. The layers were obtained from MapCruzin.com (https://mapcruzin.com/), and the figure was created using QGIS v3.0 (https://qgis.org/en/site/).
CO1 amplification, sequencing, and analyses
We generated new primers (Forward: 5′–CTT TAC CTG TAT TAG CCG GAG C–3′, Reverse: 5′–ACT CCT GTT AAA CCT CCT ACT G–3′) for the amplification of a 477-bp fragment of the mitochondrial CO1 gene. Reactions contained 0.25 μl (1–10 ng) of template DNA, 5 μl Ampdirect Plus (Shimadzu Corp.), 0.05 μl BioTaq HS DNA Polymerase (Bioline), 0.5 μl (10 mM) primers, and 3.7 μl of water for a total volume of 10 μl. Amplification included an initial denaturation step at 95°C for 10 min, followed by 94°C for 30 s of denaturation, 30 cycles each for 30 s at 60°C for annealing, 1 min at 72°C for extension, and a final extension step at 72°C for 7 min. The PCR products were purified using ExoSAP-IT (Applied Biosystems), following the manufacturer’s protocol. Forward and reverse strands were sequenced with ABI 3130/3500xl sequencers (Applied Biosystems). The chromatograms were visually inspected, and poor-quality data were trimmed using ApE Plasmid Editor v2.0.51 (M. Wayne Davis, Univ. Utah, USA). Both forward and reverse sequences were used to create a consensus sequence for each sample, and multiple sequences were aligned using online MAFFT v7 , resulting in 108 fully aligned 407-bp sequence fragments.
The summary statistics calculated were: the average number of nucleotide differences between individuals within locations (Nd); the average nucleotide diversity within locations (π); the number of haplotypes within locations (H); and haplotype diversity within locations (Hd). the summary statistics and the mismatch distribution of pairwise differences were analyzed using DnaSP v6.11.01 (Julio Rozas, Universitat de Barcelona, Spain) . Neutrality tests (Tajima’s D: based on the difference between expected segregating sites, and Fu’s Fs: based on the degree of excess of rare alleles), population pairwise ϕST (an analogue of FST, which estimates the deviation from random mating among demes) and the genetic diversity within and among populations evaluated by the analysis of molecular variance (AMOVA) using the haplotype frequencies distance method, were analyzed using ARLEQUIN v220.127.116.11 [24,25]. Bonferroni correction was used for multiple-comparison correction of population pairwise comparisons.
To infer and visualize the evolutionary relationships of the haplotypes, we constructed a median-joining haplotype network using POPART v1.7 .
Microsatellite amplification and marker validation
We used 12 autosomal microsatellite loci, which have been described in other studies [27–30]. Amplifications were performed using fluorescently-labeled forward primers (FAM, VIC, NED, and PET) in a reaction volume of 10 μl, containing 1 μl template DNA, 0.05 μl Multiplex PCR mix 1 and 5.0 μl Multiplex PCR mix 2 (Multiplex PCR Assay Kit, Takara), 0.2 μl (10 mM) primers, and 3.55 μl of water. Amplification included an initial denaturation step at 94°C for 1 min, followed by 94°C for 30 s of denaturation, 30 cycles each of 90 s at 57°C for annealing, 90 s at 72°C for extension, and a final extension step at 72°C for 10 min. PCR products were diluted from ×1 to ×20 according to the thickness of the band from the electrophoresis results, and four different fluorescence samples were pooled to be genotyped on an ABI 3130 sequencer. Alleles were scored using the software Peak Scanner v1.5 (Applied Biosystems) and the scored peaks were manually edited. Micro-Checker v2.2.3 was used to check for null alleles, and two loci were dropped due to the presence of null alleles , resulting in using 10 loci for further analyses.
MSA v4.05  was used to generate a genepop file, which was used in GENEPOP v4.7  to test for deviation from Hardy-Weinberg equilibrium and linkage disequilibrium (LD) using the Markov chain method with 10,000 dememorizations, 1,000 batches and 10,000 iterations per batch . Genetic diversity indices including the mean number of alleles (NA), allelic size range (AS, the range in nucleotide length among microsatellite alleles), expected heterozygosity among polymorphic loci (HE), observed heterozygosity among polymorphic loci (HO), estimation of the inbreeding coefficients (FIS, estimation of the deviation from random mating within demes), and population pairwise FST based on number of different alleles were calculated using ARLEQUIN v18.104.22.168. The genetic diversity within and among populations were calculated by AMOVA analysis as implemented in ARLEQUIN v22.214.171.124 . Bonferroni correction was used for multiple-comparison correction of the population pairwise comparison.
The genetic structure of G. m. morsitans was determined using the Bayesian clustering method used in STRUCTURE v2.3.4 . Ten replicate runs for each K = 1–10 were carried out with a burn-in length of 20,000 followed by 200,000 iterations. The most likely value of K was determined using the Evanno method  implemented in STRUCTURE HARVESTER v0.6.94 . The replicates for the most likely K were aligned using CLUMPP v1.1.2 , and the aligned cluster assignments were visualized using DISTRUCT v1.1 .
The effective population size (Ne), including both female and male samples, was estimated for each population using the LD method in NeEstimator v2.1 , and tests for population bottleneck occurrence was conducted using the two-phase mutation (TPM) model implemented in BOTTLENECK v1.2.02 , as recommended for microsatellite loci . Significance was assessed using Wilcoxon’s signed-rank test. Tests for recent bottleneck events were carried out using a mode-shift indicator of allele frequency distributions .
CO1 genetic diversity and haplotype diversity
Analysis of the 407-bp fragment of the CO1 gene of 108 individual G. m. morsitans flies from five locations resulted in the identification of 16 haplotypes, Hap_1 to Hap_16 (Fig 2, Table 1). The number of haplotypes found within each sampling location in Zambia and Malawi varied from three to nine. Haplotype diversity (Hd) in these locations were generally high, ranging from 0.582 in SHKB to 0.801 in KNP. In contrast, nucleotide diversity (π) was low, ranging from 0.002 in SHKB to 0.007 in NWR. The most common haplotype, Hap_1, was found in 33 individuals (30.6%) from all sampling locations except NWR, indicating a common ancestry of the G. m. morsitans from four locations (S1 Table). The second most common haplotype Hap_3 was found in 18 individuals (16.7%) from four sampling locations. Seven other haplotypes occurred in two or three sampling locations, and the other seven haplotypes were unique to specific localities and were dubbed “private” haplotypes. Among the private haplotypes, four were singletons, where the haplotype was found in only a single individual. Interestingly, none of the haplotypes were shared between KNP and NWR despite their relatively close geographic distance. The only haplotype that was not unique to NWR was Hap_9 which occurred in both NWR and MGMA. KNP shared many common haplotypes with the Zambian locations and seems to be a part of the major haplogroup. The other four haplotypes (Hap_13, Hap_14, Hap_15, and Hap_16) observed in NWR formed a haplogroup that was separated from the major haplogroup by an inferred missing haplotype (Fig 2).
The median-joining network was constructed using 108 CO1 sequences and was visualized using POPART v1.7. Each circle represents a haplotype, and the size of a circle is proportional to the number of sequences assigned to that haplotype. The location from which the sequence was obtained is indicated by color in the legend. The number of hatch marks indicate the number of nucleotide differences that separate the haplotypes. A black dot represents an intermediate missing haplotype.
We tested for neutrality using both Tajima’s D and Fu’s FS test. As a result, KNP showed significant p values for both Tajima’s D and Fu’s FS (p < 0.05, p < 0.02 respectively), suggesting the effects of either positive selection or population expansion in KNP (Table 2). From the mismatch distribution of pairwise differences, the observed distribution had a bimodal distribution (S1 Fig; Raggedness index: 0.1088, Mean Absolute Error: 0.6533).
The AMOVA analysis of the five locations indicated that the overall genetic variation within populations was larger (84.29%) than the variation among populations (15.71%) (Table 3), suggesting little genetic structure. However, population pairwise ϕST values showed significantly different values after Bonferroni correction (p < 0.05) between NWR and other locations, with values ranging from 0.203 to 0.307. All comparisons between LZNP and the other locations showed significant ϕST values, ranging from 0.177 to 0.273 (Table 4).
Microsatellite genetic diversity and population structure
A total of 99 G. m. morsitans samples were genotyped at 10 microsatellite loci. None of the microsatellite loci pairs showed significant results of the LD tests (S2 Table). NA was highest in LZNP (10.1) and lowest in SHKB and KNP (8.8) in the five locations (Table 1). The lowest HE was observed in SHKB (0.768), and the highest was observed in LZNP (0.835).
The percentage of variation within individuals was highest (108.63%) compared to the percentage of variation among populations (2.76%) and among individuals within populations (−11.39%) in the AMOVA analysis (Table 3). The FST estimate among locations was 0.028 (p < 0.001). All pairwise comparisons between NWR and other locations were statistically significant (p > 0.05), with FST values after Bonferroni correction ranging from 0.024 (NWR vs. LZNP) to 0.075 (NWR vs. SHKB) (Table 4), indicating small to moderate genetic distance according to Wright’s criteria . The other pairwise comparison (SHKB vs. LZNP) which showed statistical significance at p > 0.05 had a value of 0.015, indicating low genetic distance .
Including all five locations in the STRUCTURE analysis, the Evanno method resulted in the identification of two genetic clusters (S2 Fig). NWR was the only location in which the majority of the genetic cluster is shown as red, whereas the majority of the clusters in the other locations were green. This observation indicates the presence of high genetic divergence between NWR and the other four locations (Fig 3).
Structure plot for K = 2 inferred populations, based on individuals from all five locations.
Ne was estimated using the LD method. SHKB and KNP had 95% confidence intervals which included infinity. LZNP, MGMA, and NWR showed Ne of 121.1 (95%CI 65.0–606.2), 133.5 (95%CI 65.7–2763.0), and 32.1 (95% CI 24.2–45.5), respectively. NWR had relatively low Ne values compared to the other locations (Table 5). None of the locations were positive using BOTTLENECK analysis under the TPM model (Table 5). All five populations were approximated to have the expected normal L-shaped allele frequency distributions, and are therefore likely to be near mutation-drift equilibrium  (Table 5).
Our haplotype network analysis of the CO1 sequences and microsatellite STRUCTURE analyses revealed high differentiation between G. m. morsitans from NWR from other locations (Figs 2 and 3). The pairwise population comparison of mitochondrial CO1 (ϕST, lower diagonal in Table 4) also suggested high genetic distance between the flies in NWR and the flies from other locations. The pairwise population comparison of microsatellite alleles (FST, upper diagonal in Table 4) also showed significant FST values between NWR and the other locations but had relatively low values, ranging from 0.024 to 0.075. Pairwise FST values ranged between 0.027 and 0.161 in a previous study of G. m. morsitans among one population from Zambia, three populations from Zimbabwe, one population from Mozambique, and four populations from Tanzania, and pairwise genetic differences were considered to be high when the FST value exceeded 0.03 . When we used the same criteria, the overall pairwise genetic differences were high, except for the comparisons of LZNP with SHKB (FST = 0.015) and LZNP with NWR (FST = 0.024). The discrepancy between CO1 and microsatellite pairwise comparisons may be due to differences in the level of differentiation between mitochondrial and nuclear genomes. The mitochondrial genome is known to reach genetic-drift equilibrium earlier than the nuclear genome, due to the smaller effective population size of the mitochondrial genome. These results suggest that there is limited gene flow between G. m. morsitans in NWR from the same subspecies found in other locations, a finding that corresponds well with the estimation of G. m. morsitans distribution conducted in previous studies [12,13,17]. NWR is adjacent to Lake Malawi, a southern constituent of the Great Rift Valley. This area extends from a 1638 m high escarpment in the West and stretches East up to a narrow plain beside Lake Malawi, which has an altitude of around 500 m. This escarpment is a potential geographical barrier preventing gene flow between NWR and other locations studied, since the upper altitudinal limit for the survival of tsetse flies is known to be in a range of 1600 m to 2200 m [45,46], which underlying variable is temperature .
LZNP also exhibited statistically significant ϕST values compared with other locations (Table 4). However, since there are five to six years-interval between the sampling date between LZNP and the other locations, these results may be due to temporal flux over this interval. LZNP, SHKB, and MGMA are all included in the Luangwa river basin (Luangwa tsetse belt), as illustrated in the G. m. morsitans distribution maps [12–14]. However, there has been no identification of G. m. morsitans around KNP in any previously reported maps. According to our results, KNP did not show evidence of genetic differentiation from the other locations in the Luangwa tsetse belt, and should therefore be included in this historical tsetse belt.
Population size changes
High Hd and low π were inferred from the results of the CO1 analysis (Table 1). This pattern of high Hd and low π is consistent with observations of other Glossina species in Uganda and Kenya [47,48], and suggests that the populations have experienced a significant population decline resulting in the loss of genetic diversity and subsequently diverged into different haplotypes as the population size recovered. The tsetse fly population in Southern Africa is known to have experienced severe decreases due to a rinderpest epizootic in the 1890s, leading to an up to 90% decline in the size of wildlife populations . Since the major blood meal source of G. m. morsitans are wildlife, this event is possibly the cause of the historic loss of genetic diversity in southern African countries such as Zambia and Malawi. Subsequently, the G. m. morsitans population may have expanded as wildlife recovered from the rinderpest epidemic . This hypothesis, that the high Hd and low π observed in this study reflects the state of recovery from the rinderpest epidemic, remains speculative since the BOTTLENECK analysis did not detect a positive event in any of the locations included in this study (Table 5). It is likely that the sample size used in this study was insufficient to detect a bottleneck event 120 years ago, or that sufficient generations have passed between the presumed bottleneck event and the sampling generation to allow re-establishment of a mutation-drift equilibrium. This method is known to be able to detect bottleneck events up to 40–80 generations in the past [17,43], and 120 years is approximately 973 tsetse fly generations (the generations were estimated by using a lifecycle of 45 days per fly) . Relatively large Ne have been estimated for SHKB and KNP, both including infinity within the 95% confidence interval. Population expansion or selection was suggested for KNP, since this site showed statistically significant negative values in both Tajima’s D and Fu’s Fs (Table 2). Both tests will test the deviation from equilibrium expectations based on the infinite-site model without recombination. Negative values suggest recent population expansion, or decrease in genetic variation due to positive selection. The observed mismatch distribution of pairwise differences had a bimodal distribution (S1 Fig) with low fitness to the estimated allele frequency under a population expansion model (Raggedness index: 0.1088, Mean Absolute Error: 0.6533). This indicates low possibility of population expansion . However, in order to differentiate population expansion and selection, factors such as the evenness of the distribution of mutation across the whole genome and the ratio between nonsynonymous and synonymous mutations must be explored . Therefore, further investigations are required to obtain definite conclusion for population history in KNP. The Ne of LZNP, MGMA, and NWR were relatively small. NWR had the smallest Ne at 32.1 with a 95% confidence interval of 24.3–45.5 (Table 5). This Ne estimate is lower than any Ne estimate observed in tsetse flies in the same Morsitans group tsetse fly, G. pallidipes. Estimated Ne using microsatellites were 180 and 551 in different regions of Kenya . The low Ne of NWR and the restricted gene flow between NWR and the other locations indicate that NWR is potentially an isolated location with relatively small population size. Therefore, cost-effective tsetse control activities may be possible. However, the evidence presented in this study is not strong enough to definitely draw this conclusion, since the other locations surrounding Lake Malawi have not been explored in this study. In addition, the Ne estimates should be handled with care, since the census population densities are usually much greater than the estimated Ne, and the Ne estimation is affected by a number of demographic and genetic phenomena, such as sex ratio and temporal variation in population size . In this study, the estimation of Ne was conducted using both female and male flies. Although care was taken to make the sex ratio 50:50, we cannot rule out the possibility of the biasing affect leading to smaller Ne estimates.
We analyzed partial mitochondrial CO1 sequences and 10 microsatellite loci of G. m. morsitans collected from three locations of Zambia and two locations from Malawi, and identified two genetically separated clusters: NWR and others. This result was in keeping with previous descriptions of the distribution of G. morsitans morsitans in East and Southern Africa. There appears to be restricted gene flow between NWR and the other locations, and we hypothesize that the escarpment of the Great Rift Valley acts as an environmental barrier, since its high altitude is at the limit of the tsetse fly’s biological habitat range. In addition to its apparently restricted gene flow, the small effective population size indicates that NWR may be a population where tsetse control activities can be applied at a lower cost compared to non-isolated populations . Tsetse control activities include artificial baits, insecticide-treated cattle (ITC), aerial spraying, and the sterile insect technique (SIT) in combination with the insecticide-based methods. The Restricted Application Protocol (RAP) using ITC has been shown to be the most cost-effective control method in areas where tsetse flies and livestock co-exist . However, further research will be needed to identify the genetic population structure in other low-altitude sites around Lake Malawi that have not been included in this study, in order to confirm that there are no re-invasions into NWR from adjacent areas. We have detected a new location (KNP) infested by G. m. morsitans, which has not been illustrated in the previous G. m. morsitans distribution maps. KNP is probably a part of the major tsetse belt in the Luangwa river basin, and has a relatively large effective population size. Further study is needed to elucidate the extent of the tsetse belt, and assess the migration into adjacent reservoir communities.
S1 Fig. Mismatch distribution of pairwise differences.
The X-axis represents the number of pairwise differences, and the Y-axis represents their frequency.
S2 Fig. Delta K plot for detecting the most likely K for Glossina morsitans morsitans individuals from five locations.
Plots were generated using the Evanno method  implemented in STRUCTURE HARVESTER v0.6.94.
S1 Table. Haplotype distributions among each Glossina morsitans morsitans populations.
We are grateful to the Center for Ticks and Tick-borne Diseases (CTTBD) for providing laboratory space during the sampling period in Malawi.
- 1. World Health Organization. Control and surveillance of human African trypanosomiasis. World Health Organ Tech Rep Ser. 2013;(984):1–237. pmid:24552089
- 2. World Health Organization. Report of the Regional Director. 2004;(54):1–9.
- 3. Legros D, Ollivier G, Gastellu-Etchegorry M, Paquet C, Burri C, Jannin J, et al. Treatment of human African trypanosomiasis—present situation and needs for research and development. Lancet Infect Dis. 2002;2(7):437–40. pmid:12127356
- 4. Brun R, Schumacher R, Schmid C, Kunz C, Burri C. The phenomenon of treatment failures in human African trypanosomosis. Trop Med Int Heal. 2001;6(11):906–14.
- 5. Beadell JS, Hyseni C, Abila PP, Azabo R, Enyaru JCK, Ouma JO, et al. Phylogeography and Population Structure of Glossina fuscipes fuscipes in Uganda: Implications for Control of Tsetse. PLoS Negl Trop Dis. 2010;4(3):e636. pmid:20300518
- 6. Opiro R, Saarman NP, Echodu R, Opiyo EA, Dion K, Halyard A, et al. Evidence of temporal stability in allelic and mitochondrial haplotype diversity in populations of Glossina fuscipes fuscipes (Diptera: Glossinidae) in northern Uganda. Parasit Vectors. 2016;9:258. pmid:27141947
- 7. Hyseni C, Kato AB, Okedi LM, Masembe C, Ouma JO, Aksoy S, et al. The population structure of Glossina fuscipes fuscipes in the Lake Victoria basin in Uganda: implications for vector control. Parasit Vectors. 2012;5(1):222.
- 8. Solano P, Kaba D, Ravel S, Dyer NA, Sall B, Vreysen MJB, et al. Population genetics as a tool to select tsetse control strategies: suppression or eradication of Glossina palpalis gambiensis in the Niayes of Senegal. PLoS Negl Trop Dis. 2010;4(5):e692. pmid:20520795
- 9. Robinson T, Rogers D, Williams B. Univariate analysis of tsetse habitat in the common fly belt of southern Africa using climate and remotely sensed vegetation data. Med Vet Entomol. 1997;11:223–34. pmid:9330253
- 10. Wint W, Rogers D. Predicted distributions of tsetse in Africa. FAO Consult Rep. 2000.
- 11. Laohasinnarong D, Goto Y, Asada M, Nakao R, Hayashida K, Kajino K, et al. Studies of trypanosomiasis in the Luangwa valley, north-eastern Zambia. Parasit Vectors. 2015;8(1):497.
- 12. Robinson T, Rogers D, Williams B. Mapping tsetse habitat suitability in the common fly belt of southern Africa using multivariate analysis of climate and remotely sensed vegetation data. Med Vet Entomol. 1997;11(3):235–45. pmid:9330254
- 13. Rogers DJ, Robinson TP. Tsetse Distribution. In: Maudlin I, Holmes PH, Miles MA, editors. The Trypanosomiasis. CABI Publishing; 2004. p. 139–79.
- 14. Jordan AM. Tsetse-flies (Glossinidae). Lane RP, Crosskey RW, editors. Chapman & Hall; 1993. 333–388 p.
- 15. Wohlford DL, Krafsur ES, Griffiths NT, Marquez JG, Baker MD. Genetic differentiation of some Glossina morsitans morsitans populations. Med Vet Entomol. 1999;13(4):377–85. pmid:10608226
- 16. Krafsur ES, Endsley MA. Microsatellite diversities and gene flow in the tsetse fly, Glossina morsitans s.l. Med Vet Entomol. 2002;16(3):292–300. pmid:12243230
- 17. Ouma JO, Marquez JG, Krafsur ES. Patterns of genetic diversity and differentiation in the tsetse fly Glossina morsitans morsitans Westwood populations in East and southern Africa. Genetica. 2007;130(2):139–51. pmid:16897444
- 18. Simarro PP, Cecchi G, Franco JR, Paone M, Diarra A, Ruiz-Postigo JA, et al. Estimating and Mapping the Population at Risk of Sleeping Sickness. PLoS Negl Trop Dis. 2012;6(10):e1859. pmid:23145192
- 19. Chisi JE, Muula AS, Ngwira B, Kabuluzi S. A retrospective study of Human African Trypanosomiasis in three Malawian districts. Tanzan J Health Res. 2011;13(1):79–86.
- 20. Jackson CHN. The analysis of a tsetse-fly population III. Ann Eugen Cambridge. 1948;14:91–108.
- 21. Pollock JN. Training Manual for Tsetse Control Personnel. Vol.1: Tsetse biology, systematics and distribution, techniques. FAO. 1982;1:1–280.
- 22. Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2017;1–7.
- 23. Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets. Mol Biol Evol. 2017;34:3299–302. pmid:29029172
- 24. Excoffier L, Smouse PE, Quattro JM. Analysis of molecular variance inferred from metric distances among DNA haplotypes: Application to human mitochondrial DNA restriction data. Genetics. 1992;131(2):479–91. pmid:1644282
- 25. Excoffier L, Lischer HEL. Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10(3):564–7. pmid:21565059
- 26. Leigh JW, Bryant D. POPART: Full-feature software for haplotype network construction. Methods Ecol Evol. 2015;6(9):1110–6.
- 27. Baker MD, Krafsur ES. Identification and properties of microsatellite markers in tsetse flies Glossina morsitans sensu lato (Dipetera: Glossinidae). Mol Ecol Notes. 2001;1(4):234–6. pmid:16479272
- 28. Hyseni C, Beadell JS, Ocampo-Gomez J, Okedi LM, Gaunt M, Caccone A. The G. m. morsitans (Diptera: Glossinidae) genome as a source of microsatellite markers for other tsetse fly (Glossina) species. Mol Ecol Resour. 2011;11:586–9.
- 29. Ouma JO, Marquez JG, Krafsur ES. New Polymorphic Microsatellites in Glossina pallidipes (Diptera: Glossinidae) and Their Cross-Amplification in Other Tsetse Fly Taxa. Biochem Genet. 2006;44(910).
- 30. Ouma JO, Cummings MA, Jones KC, Krafsur ES. Characterization of microsatellite markers in the tsetse fly, Glossina pallidipes (Diptera: Glossinidae). Mol Ecol Notes. 2003;3(3):450–3. pmid:16718306
- 31. van Oosterhout C, Hutchinson WF, Wills DPM, Shipley P. MICRO—CHECKER: software for identifying and correcting genotyping errors in microsatellite data. Mol Ecol Notes. 2004;4:535–8.
- 32. Dieringer D, Schlotterer C. microsatellite analyser (MSA): a platform independent analysis tool for large microsatellite data sets. Mol Ecol Notes. 2003;3:167–9.
- 33. Rousset F. GENEPOP’007: A complete re-implementation of the GENEPOP software for Windows and Linux. Mol Ecol Resour. 2008;8(1):103–6. pmid:21585727
- 34. Raymond M, Rousset F. An exact test for population differentiation. Evolution. 1995. 1280–3. pmid:28568523
- 35. Pritchard JK, Stephens M, Donnelly P. Inference of Population Structure Using Multilocus Genotype Data. Genetics. 2000;155:945–59. pmid:10835412
- 36. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: A simulation study. Mol Ecol. 2005;14(8):2611–20. pmid:15969739
- 37. Earl DA, vonHoldt BM. STRUCTURE HARVESTER: A website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour. 2012;4(2):359–61.
- 38. Jakobsson M, Rosenberg NA. CLUMPP: A cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23(14):1801–6. pmid:17485429
- 39. Rosenberg NA. DISTRUCT: A program for the graphical display of population structure. Mol Ecol Notes. 2004;4(1):137–8.
- 40. Do C, Waples RS, Peel D, Macbeth GM, Tillett BJ, Ovenden JR. NeEstimator v2: Re-implementation of software for the estimation of contemporary effective population size (Ne) from genetic data. Mol Ecol Resour. 2014;14(1):209–14. pmid:23992227
- 41. Cornuet JM, Luikart G. Description and power analysis of two tests for detecting recent population bottlenecks from allele frequency data. Genetics. 1996;144(4):2001–14. pmid:8978083
- 42. Di Rienzo A, Peterson AC, Garzat JC, Valdes AM, Slatkint M, Freimer NB. Mutational processes of simple-sequence repeat loci in human populations. Genetics. 1994;91:3166–70.
- 43. Luikart G, Allendorf FW, Cornuet JM, Sherwin WB. Distortion of allele frequency distributions provides a test for recent population bottlenecks. J Hered. 1998;89(3):238–47. pmid:9656466
- 44. Wright S. Variability within and among Natural Populations. Evol Genet Popul. 1984;4.
- 45. Tikubet G, Gemetchu T. Altitudinal distribution of tsetse in the Finchaa river valley (western part of Ethiopia). Int J Trop Insect Sci. 1984;5(5):389–95.
- 46. Langridge WP. A Tsetse and Trypanosomiasis survey of Ethiopia. 1976.
- 47. Kato AB, Hyseni C, Okedi LM, Ouma JO, Aksoy S, Caccone A, et al. Mitochondrial DNA sequence divergence and diversity of Glossina fuscipes fuscipes in the Lake Victoria basin of Uganda: Implications for control. Parasit Vectors. 2015;8(1):1–13.
- 48. Ouma JO, Beadell JS, Hyseni C, Okedi LM, Krafsur ES, Aksoy S, et al. Genetic diversity and population structure of Glossina pallidipes in Uganda and western Kenya. Parasit Vectors. 2011;4(1):122.
- 49. Ford J. The role of the trypanosomiases in African ecology: a study of the tsetse fly problem. Oxford Univ Press. 1971.
- 50. Government of Malawi National Statistical Office. 2008 Population And Houseing Census: Preliminary Report. Popul (English Ed.) 2008;1–35.
- 51. Leak SGA. Biology. In: Tsetse biology and ecology: their role in epidemiology and control of trypanosomosis. CABI Publishing. 1999;17–24.
- 52. Harpending HC, Sherry ST, Rogers AR, Stoneking M. The Genetic Structure of Ancient Human Populations. Curr Anthropol. 1993;34(4):483–96.
- 53. Hahn MW, Rausher MD, Cunningham CW. Distinguishing between selection and population expansion in an experimental lineage of bacteriophage T7. Genetics. 2002;161(1): 11–20. pmid:12019219
- 54. Ouma JO, Marquez JG, Krafsur ES. Microgeographical breeding structure of the tsetse fly, Glossina pallidipes in south-western Kenya. Med Vet Entomol. 2006;20(1):138–49. pmid:16608498
- 55. Krafsur ES, Maudlin I. Tsetse fly evolution, genetics and the trypanosomiases-A review. Infect Genet Evol. 2018;64:185–206. pmid:29885477
- 56. Shaw APM, Torr SJ, Waiswa C, Cecchi G, Wint GRW, Mattioli RC, Robinson TP. Estimating the costs of tsetse control options: An example for Uganda. Prev Vet Med. 2013;110(3–4): 290–303. pmid:23453892
- 57. Vale G, Torr S. Development of bait technology to control tsetse. In: Maudlin I, Holmes PH, Miles MA, editors. The Trypanosomiasis. CABI Publishing. 2004;509–523.