Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Mitochondrial metabolic genes provide phylogeographic relationships of global collections of Aedes aegypti (Diptera: Culicidae)

  • H. S. D. Fernando,

    Roles Conceptualization, Formal analysis, Methodology, Writing – original draft

    Affiliation Department of Zoology, Center for Biotechnology, University of Sri Jayewardenepura, Nugegoda, Sri Lanka

  • Menaka Hapugoda,

    Roles Resources, Supervision, Writing – review & editing

    Affiliation Molecular Medicine Unit, Faculty of Medicine, University of Kelaniya, Kelaniya, Sri Lanka

  • Rushika Perera,

    Roles Resources, Writing – review & editing

    Affiliation Department of Microbiology, Immunology and Pathology, Colorado State University, Fort Collins, Colorado, United States of America

  • William C. Black IV,

    Roles Conceptualization, Formal analysis, Resources, Software, Writing – original draft, Writing – review & editing

    Affiliation Department of Microbiology, Immunology and Pathology, Colorado State University, Fort Collins, Colorado, United States of America

  • B. G. D. N. K. De Silva

    Roles Conceptualization, Funding acquisition, Resources, Supervision, Writing – review & editing

    Affiliation Department of Zoology, Center for Biotechnology, University of Sri Jayewardenepura, Nugegoda, Sri Lanka

Mitochondrial metabolic genes provide phylogeographic relationships of global collections of Aedes aegypti (Diptera: Culicidae)

  • H. S. D. Fernando, 
  • Menaka Hapugoda, 
  • Rushika Perera, 
  • William C. Black IV, 
  • B. G. D. N. K. De Silva


Phylogeographic relationships among global collections of the mosquito Aedes aegypti were evaluated using the mitochondrial Cytochrome C Oxidase 1 (CO1) and NADH dehydrogenase subunit 4 (ND4) genes including new sequences from Sri Lanka. Phylogeographic analysis estimated that Ae. aegypti arose as a species ~614 thousand years ago (kya) in the late Pleistocene. At 545 kya an “early” East African clade arose that continued to differentiate in East Africa, and eventually gave rise to three lineages one of which is distributed throughout all tropical and subtropical regions, a second that contains Southeast Asian/Sri Lankan mosquitoes and a third that contains mostly New World mosquitoes. West African collections were not represented in this early clade. The late clade continued to differentiate throughout Africa and gave rise to a lineage that spread globally. The most recent branches of the late clade are represented by South-East Asia and India/Pakistan collections. Analysis of migration rates suggests abundant gene flow between India/Pakistan and the rest of the world with the exception of Africa.


Viruses transmitted by mosquitoes have become one of the primary contributors to human disease, globally. These include Dengue virus (DENV), Chikungunya virus (CHIKV), Yellow fever virus and Zika virus. Dengue is currently regarded as the most important arboviral disease in the world with approximately 50% of the worlds’ population living in dengue endemic countries. The public health impact of DENV, CHIKV, Yellow fever virus and Zika virus has increased dramatically over the years with both diseases spreading to new areas. Aedes aegypti, is the main vector of these viruses and continues to cause a significant amount of human morbidity and mortality worldwide [1, 2].

Dengue is currently estimated to occur in 128 countries worldwide. Regional distribution of dengue shows that it is an aggravated and continuous problem in south and Southeast Asia and Central and South America. Major dengue epidemics have been occurring in Asian and Southeast Asian countries following World War II, which have turned in to pandemics recently [2]. In the Americas, although the control campaign by the Pan American Health Organization (PAHO) restricted the epidemics till 1970s, the suspension of the campaign has reinvested the region to hyper endemic levels of dengue cases [2]. In Sri Lanka, dengue has become a serious arboviral disease with an alarming increase in the number of reported cases. In 2017, 185,688 dengue cases were reported to health authorities with 45% of the cases being reported from the western province, the main commercial province in Sri Lanka [3]. This is 4.7-fold higher than the average number of cases for the same time period between 2010 and 2016 [3]. Although the numbers have decreased in year 2018 with 51,586 recorded dengue cases [3], drought conditions with low rainfall prevailed throughout the year. Aedes aegypti is considered as the primary vector of DENVs in Sri Lanka and was first reported in Sri Lanka during 1930s [4].

The global expansion of dengue and other arboviruses was preceded by the global spread of their main vector Ae. aegypti. Aedes aegypti arose as a species in Africa where it differentiated into two subspecies. In Africa, Ae. aegypti formosus (Aaf) has a dark cuticle, no scales on the first abdominal tergite, and lives in forested habitats where it oviposits in natural containers (e.g. tree holes). Adults probably occupied the forest canopy while they fed on non-human primates. Outside of Africa, populations generally consist of the subspecies Ae. aegypti aegypti (Aaa) with a light tan cuticle, scales on the first abdominal tergite, and strong anthropophilic habits [5, 6]. The subspecies eventually adapted to feeding on humans and breeding in artificial containers (e.g. water jars and barrels, and tires). In this way Aaa probably colonized the tropical and subtropical regions of the world through shipping trade during the 17th—20th centuries [6]. Increasing globalization and the ability to adapt to human environments have enhanced the worldwide spread and establishment of Aaa [7], an invasive mosquito that has become a very important arbovirus vector in Asia during the last few years due to the increasing number of dengue cases. Although the two subspecies were originally defined based on differences in coloration and scales on the first abdominal tergite [5, 8] genetic studies have revealed that the West African populations that have these pale scales appear to be genetically more similar to Aaf populations than Aaa from elsewhere in the tropics [911].

Many genetic studies using mitochondrial genes and nuclear genes have evaluated the global colonization history of the Ae. aegypti subspecies. The earliest studies [1214] examined the genetic structure and vector competence of Ae. aegypti using biochemical markers and indicated that global collections fell into two genetic groups. A new world group contained Aaa populations from East Africa, South America and the Caribbean and was thought to be derived from East Africa. The second group contained Aaa populations from Asia and Southeastern U.S and has a basal branch containing Aaf from both East and West Africa. This same pattern has been noted in more recent genetic studies [1520]. Studies carried out using mitochondrial and nuclear DNA markers have also confirmed the existence of two clades. A study carried out using NADH dehydrogenase subunit 4 (ND4) mitochondrial DNA sequence data from populations of Ae. aegypti from Senegal, West Africa and East Africa, identified two clades [15], West Africa basal clade and the East Africa derived clade. But these also did not correspond to Aaa and Aaf. More recent studies based on nuclear markers have used microsatellites [16], a Single Nucleotide Polymorphism (SNP) chip [17, 18] and exon enriched whole genome sequencing [19, 20]. Bennett et al., 2016 and Gloria-Soria et al., 2016 applied Approximate Bayesian Computation (ABC) to both nuclear and mitochondrial markers [7, 16]. ABC supported a demographic model of lineage diversification, historical admixture and recent population structuring. These studies suggested that because Aaf are dependent on forests, the effects of forest fragmentation and expansion generated by Pleistocene climatic change may have been a factor in early diversification of populations. Human movement across Africa also probably facilitated more recent divergences. In agreement with other studies Bennett et al., 2016 and Gloria-Soria et al., 2016 [7, 16] noted a decreasing genetic diversity cline consistent with cumulative bottlenecks as Ae. aegypti was shipped to the Americas and later to Asia. These studies all confirm the existence of a clade at the base of the tree containing Aaf, with a second clade derived from the first that contains mosquitoes from East and West Africa. Two genetic groups of Ae. aegypti in Colombia [21] had haplotypes from both basal and derived clades. This study [21] also revealed that the populations with relatives from the derived clade exhibited insecticide resistance while the population with relatives from the basal clade did not. Studying the genetic differentiation of two lineages is epidemiologically important as populations from different origins vary in vector competencies for arboviruses and insecticide resistance [9, 22, 23].

Given the importance of mitochondria as the primary energy metabolic hub in cells, mitochondrial genes have become instrumental for reconstructing evolutionary relationships within species compared to other genetic markers [24, 25]. Specifically, they are maternally inherited, single copy, non-recombining and abundant [24]. Mitochondria are the site at which the oxidative phosphorylation (OXPHOS) pathway for production of adenosine triphosphate (ATP) takes place. Four of the five complexes in OXPHOS are encoded by mitochondrial genes, NADH dehydrogenase (complex I), cytochrome bc1 complex (complex III), cytochrome c oxidase (complex IV), and ATP synthase (complex V). Thus, mitochondrial protein encoding genes play an important role in the energy metabolism of insects [26]. The mitochondrial genes encoding COI and ND4 are used as the molecular markers in this study. Cytochrome oxidase I (COI) gene possesses special characteristics which make it suitable as a molecular marker for evolutionary studies. It is the largest protein coding gene in the metazoan mitochondrial genome which enables one to amplify and sequence many more characteristics within the same functional complex. It also contains a mix of highly conserved and variable regions so closely associated together thus making COI gene particularly useful for evolutionary studies [27].

The present study is an attempt to estimate the phylogenetic relatedness of Sri Lankan Ae. aegypti relative to collections from other countries and to estimate a global phylogeographic history for the species. Due to the strategic location in the Indian Ocean, Sri Lanka may be playing a significant role in the distribution of Ae. aegypti mosquitoes and the four dengue virus serotypes (DENV1-4) in the world. We compile and analyze 546 CO1 and 67 ND4 Ae. aegypti sequences, including sequences available in GenBank and new CO1 and ND4 sequences from Sri Lanka, a country for which there is little data on the population genetics of Ae. aegypti.

Materials and methods

Adults and larvae of Ae. aegypti were collected from January 2013 –December 2015 in seven districts in Sri Lanka (Table 1; Fig 1). Permission to collect mosquito samples were obtained from respective Medical Officer of Health (MOH) offices of the study district. Samples were collected using BG-Sentinel adult mosquito traps and ovitraps. Collected larvae were reared to adults and killed by freezing. All adults were morphologically identified to species using standard taxonomic keys [28]. Identified adults were preserved by desiccation with silica gel. For the extraction of DNA, a maximum of 5 individuals per trap were used to avoid over-sampling of siblings. DNA was extracted from individual adults using a Phenol/Chloroform method [29]. Extracted DNA was stored at -80°C in 100 μL TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0). The polymerase chain reaction (PCR) for ND4 and CO1 was carried out in an Eppendorf® Thermo-cycler in 25 μL reaction volumes.

Fig 1. Map of sampling sites in Sri Lanka.

The red dot on the global map indicates the geographical position of Sri Lanka in the world.

Table 1. Sampling districts, geographic coordinates and the number of Ae. aegypti samples analyzed per each mitochondrial region.

A 349 bp region of the ND4 gene was amplified [21]. Each ND4 reaction consisted of template DNA (2 ng/μL), 1 x PCR Buffer (Promega, USA), 2.0 mM MgCl2, 200 μM dNTPs, 0.2 mM of reverse and forward primer and 1 U of Taq polymerase (Promega, USA). The reaction conditions entailed an initial denaturation at 94°C for 5 minutes, 35 cycles of 94°C for 30 seconds, 50°C for 30 seconds and 72°C for 30 seconds with a final extension at 72°C for 5 minutes.

A 460 bp region of the mitochondrial CO1 gene was amplified [30]. The CO1 reaction mixture was the same as the ND4 mixture except that it contained 1.5 mM of MgCl2 and reaction conditions differed in having an initial denaturation of 94°C for 4 minutes, followed by 40 cycles of denaturation at 95°C for 40 seconds, annealing at 45°C for 1 minute and extension at 68°C for 1 minute, with a final extension of 72°C for 10 minutes. Amplified ND4 and CO1 products were run on a 1.5% agarose gel and the positive products were sent to Macrogen Inc. Korea for sequencing.

Genetic variability in ND4 and CO1 were evaluated by examining the number of substitutions, haplotypes (h), nucleotide diversity (π) [31], average number of nucleotide differences, k [32] and Fu and Li’s F* [33]. Significantly negative F* indicate an excess of low-frequency variants indicative of population expansion, weak negative selection or positive selection. Significantly positive F* indicate an excess of intermediate-frequency alleles associated with population bottlenecks, breeding structure and/or balancing selection. Diversity at synonymous substitution sites (πs) and a replacement substitution sites (πa) were calculated as were their ratio w = (πa) /(πs) using DnaSP v.5.10 [34]. The sequences used in analysis of the ND4 were all haplotypes recorded previously [15].

To assess the phylogenetic relationship of Sri Lankan Ae. aegypti to global collections, both ND4 and CO1 sequences from GenBank and Sri Lanka were aligned using CLUSTALW ( and visually inspected to ensure correct alignment along codons. Any sites exhibiting two or more alternate nucleotides were removed because they were likely to represent nuclear mtDNA (NUMTs) which are abundant in the Ae. aegypti genome [35, 36].

Sixty-seven sequences of the Ae. aegypti ND4 gene were aligned which included 27 sequences from GenBank and 40 sequences from Sri Lanka. Six Aedes species were included as outgroups (S1 Table). RAxML v.8.000 [37] was used to estimate Maximum Likelihood (ML) trees with bootstrap analysis. The bootstrap node support was estimated with 1,000 pseudoreplicates and the resulting tree was edited using TreeGraph2 (v. 2.0.47) [38]. The support for each labelled branch in the MCC phylogeny was placed above the branch. Branch lengths and their 95% HPD were placed over all labelled nodes.

GenBank contains 546 Ae. aegypti CO1 sequences and 363 of these contain the same 460 bp region of CO1 as in the current study [3943]. Sequences from laboratory strains were excluded. All aligned CO1 sequences were compared using RAxML v.8 to identify identical haplotypes and 142 of the 363 sequences were unique (S2 Table). These sequences were partitioned into eight geographic groupings: West Africa (18 sequences), East Africa (38 sequences), India and Pakistan (17 sequences), Sri Lanka (14 sequences), Southeast Asia (13 sequences), South Pacific (6 sequences), North America (6 sequences) and South America (30 sequences). Two sequences (MG242505, HQ693081) from Ae. mascarensis were included as outgroups since this is considered to be one of the most recent ancestors of Ae. aegypti [44, 45].

Phylogeographic analysis followed the Bayesian phylogeographic model implemented in BEAST2 [46] v. 2.4.8. The alignment file was converted to an *.xml file using BEAUti following the example provided by Taylor ( Sequences were partitioned into the 8 geographic regions as described above: The Site Substitution model was HKY [47] with the substitution rate set to 1.0, the gamma category count set to 8, the estimated shape parameter set to 1, frequencies were empirical and the estimated Kappa value set to 10. A strict clock model was used. The Clock Rate was set to 3.54 x 10−8/site/year [48]. Ten percent of the chains were discarded as burn in. Priors were set to a coalescent constant population model, the gamma shape parameter was set to exponential with a mean of 1, the kappa value was assumed to follow an exponential distribution with a mean of 10, the nonzero states were assumed to follow a Poisson distribution with the lambda set to 0.693 and the offset set to 7, the “popSize” parameter was set to 1/X, the variable “relativeGeorates” were assumed to follow an exponential with a mean of 1.0 and the relative trait clock rate was assumed to follow an exponential with a mean of 1000. This dataset was analyzed with a chain length of 40 million after initial trials with shorter chains failed to produce Effective Sample Sizes (ESS) > 200 for most variables.

Tracer (vers.1.6) ( was used to calculate the posterior probability that the migration rate (“rateIndicator”) is positive between a pair of geographic regions x and y. A large probability indicates direct migration between x and y while a small posterior probability suggests barriers to gene flow. The strength of support for a particular pair of regions was also assessed by calculating its Bayes Factor (BF) as the ratio of the posterior and prior probabilities following ( If D is the number of regions and pi is the posterior probability that the migration rate is positive then the prior probability (qx,y) is (1)

There are 8 regions and so qx,y = 0.2748.


Rates with Bayes factors greater than 3 were considered to be well supported. TreeAnnotator v.2.4.3 was used to create the maximum clade credibility tree with each taxon identified.


Variability in different geographic regions

Table 2 shows genetic diversity measures at both genes. Diversity was relatively low in West Africa but was similar in all other regions including Sri Lanka. Fu and Li’s F* measures were significantly negative (excess of singletons) when considering all regions. This is probably indicative of population expansion. ND4 sequences that occurred in common among all or some of the regions were analyzed separately and appear as “Global” in Table 2. These had a significantly positive Fu and Li’s F* indicative of an excess of intermediate-frequency haplotypes associated with population bottlenecks, genetic structure and/or balancing selection. In all cases w is small and close to zero suggesting purifying selection acting in both genes.

Table 2. Genetic variability in ND4 and CO1 were evaluated by examining the number of substitutions (S), haplotypes (h), nucleotide diversity (π) [31], average number of nucleotide differences, k [32] and Fu and Li’s F* [33].

Phylogenetic analysis of ND4 markers

Fig 2 is the bootstrapped ML tree derived using the ND4 gene. It has a “basal” clade (bootstrap support = 96%) containing all ND4 Ae. aegypti sequences but including West African collections at the base, global collections (AF203348, DQ176837) and the East African sequence from Kenya (EU446278). It also has a “derived” clade (bootstrap support = 74%) that has both East and West African collections at its base, global collections (AF203356) and many East African sequences from Kenya. Thus, Sri Lankan populations consist of the same two previously designated West (basal) and East (derived) African clades. The paucity of ND4 sequences on the web and the absence of a molecular clock for ND4 obviated more intensive phylogeographic analyses. Within this clade, there is only moderate resolution of clades with support values never exceeding 75.

Fig 2. Bootstrapped Maximum Likelihood tree derived using the ND4 gene.

The “basal” clade has bootstrap support of 96% and contains the West African collections at its base. The “derived” clade (bootstrap support = 74%) has both East and West African collections at its base, global collections (AF203356) and many East African sequences from Kenya. All Sri Lankan collections appear in red and are present in both clades. Information in parentheses indicate the collection site and its unique identification number in the aligned dataset. The thickness of lines is proportional to the bootstrap support. Maximum likelihood branch lengths are proportional to the number of nucleotide substitutions per site.

Phylogenetic analysis of CO1 markers

Fig 3 is the Mrbayes tree derived using the CO1 gene. The tree has a main clade (posterior probability = 1) containing both East and West African sequences. The derived clade mainly contains East African collections. The nodes arising from this clade gave rise to Sri Lankan, South American and Southeast Asian collections.

Fig 3. Bayesian phylogenetic tree of relationships of Ae. aegypti based on CO1 gene.

The support values written on the branches correspond to posterior probabilities. The branch lines are colored by geographic distribution of the corresponding haplotypes.

Fig 4 is the coalescent tree derived using the CO1 gene. The A clade arose 614 thousand years ago (kya) in East Africa. The Node B clade corresponds to the previously designated “derived” clade and it began to diversify 545 kya in East Africa. The East African Node C gave rise to a clade that has spread globally. East African Node D gave rise to a clade that eventually spread to the New World while clade E spread to Sri Lanka and Southeast Asia. One East African lineage arising from Node E arose 179 kya and has diversified in East Africa.

Fig 4. Phylogeographic analysis of the CO1 gene followed the Bayesian phylogeographic model implemented in BEAST2 [46] v. 2.4.8.

The alignment file was converted to an *.xml file using BEAUti following the example provided by Taylor ( Sequences were partitioned into 8 geographic regions. A scale corresponding to 200,000 years is provided at the base. The identity of nodes appears in the text.

The Node F clade has a support value of 37 (previously designated “basal” [15]) did not began to diversify until 372 kya, ~173 kya later than node B. Notice that no West African haplotypes appear in the Node B clade. A split (Node H) occurred in West Africa 252 kya. One of these led to East African clade arising from Node G. The lower and upper branch of Node G contains one West African haplotype each.

Both observations indicate that introgression into West Africa began from 252 kya (Node H) to 372 kya. Node G also gave rise to lineages that eventually spread to South America. Subsequent lineages (Nodes I and J) arising from the West African Node H continued to diversify in West Africa and East Africa. Node I gave rise to a lineage that eventually spread to South America. Node J subsequently gave rise to two lineages that appear to have remained in East and West Africa. One of these contains a Mexican haplotype and the other an Indian/Pakistan isolate. Node K (134 kya) appears to mark a time when West African lineages began to spread out of Africa with one lineage ending up in India, Sri Lanka and the other shifting toward Southeast Asia (Node L) with one lineage ending up in India/Pakistan and the South Pacific. Southeast Asia Node K contains one lineage that re-colonized East Africa and from there colonized Sri Lanka and eventually ended up in South America. The other clade arising from Node M produced haplotypes that occur globally today.

The ages represented near the tips seem unreasonably old. For instance, multiple clades comprised exclusively of New World populations have common ancestors more than 50 kya, which is far outside the current hypotheses on when Aedes aegypti escaped Africa. However, the 95% HPD for almost all terminal nodes includes a lower value of zero.

Migration analysis

Analysis of the rate Indicator variables in BEAST appears in Table 3. All rate Indicators with Bayes Factors ≥ 3 are underlined and bolded. This analysis indicates a high rate of gene flow between the India/Pakistan region and all other regions except Africa and, curiously Sri Lanka. There are also high rates of gene flow between East and West Africa and between Sri Lanka and Southeast Asia.

Table 3.

A) Tracer analysis of the posterior probability, likelihood, Prior probability, tree likelihood and average tree height. B) Tracer analysis of posterior probability that the migration rate (rateIndicator) is positive between a pair of geographic regions x and y.


The present study analyzed mitochondrial genetic diversity and phylogeographic relationships of Ae. aegypti mosquitoes collected from seven districts in Sri Lanka. The study revealed a high genetic diversity for CO1 marker which might be due to high levels of gene flow observed in the study between Sri Lanka and Southeast Asia. A high level of gene flow exists between India /Pakistan region and Southeast Asia, South Pacific and North and South America. This would inevitably lead to population mixing. As Sri Lanka is an island and acts as a main harbor for commercial and human transport it may be playing a vital role in the observed gene flow.

Two mitochondrial clades have been reported previously that broadly represent East and West Africa, but this was based only on the ND4 gene [15]. Furthermore, no time frame could be estimated for these clades because a calibrated molecular clock rate is not available for the arthropod ND4 gene. However, an independent molecular clock has been estimated for CO1 [48] and at least seven large datasets for CO1 in Ae. aegypti have appeared [7, 21, 3943]. Some of the problems associated with studies of the mitochondrial genome in Ae. aegypti have been recently summarized [5].

Approximate Bayesian Computation (ABC) [49] has been applied to nucleotide sequences from four nuclear and one mitochondrial marker to assess phylogeographic relationships among Ae. aegypti collections principally in Africa [7]. Their analyses showed that a model wherein Ae. aegypti originated in Africa involving Pleistocene lineage diversification and historical admixture had the best fit in the ABC.

Marine records of African climate variability document a shift toward more arid conditions beginning 2.8 million years ago (mya), resulting from cold North Atlantic sea-surface temperatures caused by the beginning of northern hemisphere glacial cycles [5053]. The maximum of the penultimate glaciations occurred from 190–150 kya (corresponding with the Illinoian Stage) and conditions in Africa became generally drier than present with deserts extending into North Africa. Then from 150–130 kya, Africa gradually became cooler than present. This was followed by a warmer, moister period that lasted for 15,000 years. During that time deserts became covered with vegetation and there was a great expansion of rain forests. Then from 115–70 kya, there followed a period of cooling and drying that led to a cold, arid maximum (corresponding to the Wisconsin period) followed by a slight moderation of climate until 22 kya. From 115–70 kya conditions became warmer and moister but with an interruption by aridity around 11,000 years ago. A resumption of warm, moist conditions known as the Holocene ‘optimum’ occurred wherein again there was a second great rain forest expansion with vegetation again covering most of the Sahara. From then until the present Africa has become more arid. Relatively brief arid phases (e.g. 8,200 ya.) have punctuated the generally moister early and mid-Holocene conditions. Earlier studies assumed that ancestral Ae. aegypti were dependent on forests, and that forest fragmentation driven by climatic change may have been responsible for the lineage splitting in the late Pleistocene [7].

In almost all respects the findings of the current study support Bennett’s model wherein Ae. aegypti originated in Africa and that one or more Pleistocene lineage diversifications occurred. This has been followed by many historical instances of admixture. Differences in time scale between our study and Bennett et al. [7] are likely a result of applying different divergence rates. Bennett et al. [7] used 2.69 x 10−8/site/year which is the mitochondrial rate with both the 16S rRNA and CO1 rates combined [48] and 2.3 x 10−8/site/year [53]. We instead used 3.54 x 10−8/site/year [48] which is the rate for the CO1 alone. Nevertheless, our estimates are correlated with Bennett et al. [7] that lineages diverged at node A 700–3000 kya while we estimate that node A occurred 614 kya.

Our phylogeographic analysis suggests that the Pleistocene lineage diversification(s) probably happened in East Africa. The first East African lineage (Node B) (Fig 3) has remained in East Africa where it continues to diversify (Node E) (Fig 3) and has spread to Sri Lanka and Southeast Asia. Our original designation of Node B as the “derived” lineage [15] was incorrect. Instead the East African lineage evolved 173 kya before the lineage giving rise to the West Africa clade (Node F) and it should therefore be considered as basal. The West Africa clade is the “derived” lineage and it wasn’t until 372 kya that it arrived and began to diversify in West Africa. From there it spread to all global regions examined in the present study.

As with Bennett et al. [7], there are many cases where individuals appear to have been reintroduced into regions of origin. Nodes I and J (Fig 3) arise from West African lineages but contain many sequences from East Africa. Also, nodes K, L, M (Fig 3) arise from Southeast Asian lineages but gave rise to sequences eventually found in East Africa, India / Pakistan, and Sri Lanka. This reflects the admixtures generated by global spread of Ae. aegypti. The question remains as to whether these results indicate a current high level of gene flow between regions in the invasive range of the mosquito or represent shared ancestral polymorphisms.

It is interesting that both groups of Sri Lanka haplotypes cluster together and have recent origins. The two origins are, one from India/Pakistan and another from mixed clades of mosquitoes derived from East Africa. This appears to be counter intuitive based upon the proximity of Sri Lanka to India and its great distance to East Africa and the fact that most of the India collections come from South India [42]. The analysis of migration rates in Table 3 supports the idea that while there is a great deal of gene flow, among mosquitoes within the India/Pakistan Region there is very little gene flow between India/Pakistan and Africa or Sri Lanka. This analysis also supports the idea that gene flow occurs between Southeast Asian countries and Sri Lanka. Sri Lanka is situated along the key shipping route between the Malacca Straits and the Suez Canal. Approximately 36,000 ships, including 4,500 oil tankers, use the route annually [54]. Thus, Sri Lanka may be playing a major role in the trafficking of DENV, its vector Ae. aegypti and infected humans from Southeast Asia to the Middle East and Africa.


Ae. aegypti appears to have arisen as a species in East Africa in the mid-late Pleistocene probably associated with a shift towards more arid conditions beginning 2.8 million years ago (mya), resulting from cold North Atlantic sea-surface temperatures that caused the beginning of northern hemisphere glacial cycles. Two mitochondrial clades arose in the late Pleistocene, but these subsequently became admixed during periods of warming and the spread of forests. An analysis of migration rates suggests a great deal of gene flow between mosquitoes in the India/Pakistan Region and the rest of the world with the exception of Africa and Sri Lanka. There appears to be abundant gene flow between Southeast Asian countries and Sri Lanka.

Supporting information

S1 Table. Region and country of origin of Aedes aegypti NADH dehydrogenase subunit 4 (ND4) gene.

The GenBank accession number and publication reference is listed [5558].


S2 Table. Region and country of origin of Aedes aegypti Cytochrome C Oxidase 1 (CO1) gene sequences.

The GenBank accession number and publication reference is listed [5968].



We thank citizens from the study sites who assisted with sample collection.


  1. 1. da Cruz Ferreira DA, Degener CM, de Almeida Marques-Toledo C, Bendati MM, Fetzer LO, Teixeira CP, et al. Meteorological variables and mosquito monitoring are good predictors for infestation trends of Aedes aegypti, the vector of dengue, chikungunya and Zika. Parasit Vectors. 2017;10:78. pmid:28193291
  2. 2. Mayer SV, Tesh RB, Vasilakis N. The emergence of arthropod-borne viral diseases: A global prospective on dengue, chikungunya and zika fevers. Acta trop. 2017; 166: 155–63. pmid:27876643
  3. 3. Epidemiology unit, Dengue update. (accessed on 01.01.2019)
  4. 4. Barraud CP. The fauna of British India including Ceylon and Burma. Taylor And Francis, Red Lion Court, London, 1934.
  5. 5. Mattingly PF. Taxonomy of Aedes aegypti and Related Species. B World Health Organ. 1967; 36: 552.
  6. 6. Powell JR, Tabachnick WJ. History of domestication and spread of Aedes aegypti—A Review. Mem I Oswaldo Cruz. 2013; 108: 11–17.
  7. 7. Bennett KL, Shija F, Linton YMG, Misinzo M, Kaddumukasa R, Djouaka O, et al. Historical environmental change in Africa drives divergence and admixture of Aedes aegypti mosquitoes: a precursor to successful worldwide colonization. Mol Ecol. 2016; 25: 4337–4354. pmid:27439067
  8. 8. Mcclelland GA. Worldwide Survey of Variation in Scale Pattern of Abdominal Tergum of Aedes aegypti (L) (Diptera-Culicidae). T Roy Ent Soc London. 1974; 126: 239–259.
  9. 9. Sylla M. Bosio C, Urdaneta-Marquez L, Ndiaye M, Black W. C. Gene Flow, Subspecies Composition, and Dengue Virus-2 Susceptibility among Aedes aegypti Collections in Senegal. Plos Neglect Trop D. 2009;3(4).
  10. 10. Paupy C, Brengues C, Ndiath O, Toty C, Herve JP, Simard F. Morphological and genetic variability within Aedes aegypti in Niakhar, Senegal. Infect Genet Evol. 2010; 10: 473–480. pmid:20223297
  11. 11. Sylla M, Ndiaye M, Black WC. Aedes species in tree holes and fruit husks between dry and wet seasons in southeastern Senegal. J Vector Ecol. 2013; 38: 237–244. pmid:24581351
  12. 12. Wallis GP, Tabachnick WJ, Powell JR Macrogeographic genetic variation in a human commensal: Aedes aegypti, the yellow fever mosquito. Genet Res. 1983;41:241–58. pmid:6884770
  13. 13. Tabachnick WJ, Wallis GP, Aitken THG, Miller BR, Amato GD, Lorenz L, et al. Oral Infection of Aedes aegypti with Yellow-Fever Virus—Geographic-Variation and Genetic Considerations. Am J Trop Med Hyg. 1985; 34: 1219–1224. pmid:3834804
  14. 14. Tabachnick WJ, Munstermann LE, Powell JR. Genetic Distinctness of Sympatric Forms of Aedes aegypti in East-Africa. Evolution. 1979; 33: 287–295. pmid:28568173
  15. 15. Moore M, Sylla M, Goss L, Burugu MW, Sang R, Kamau LW, et al. Dual African Origins of Global Aedes aegyptis.l. Populations Revealed by Mitochondrial DNA. Plos Neglect Trop D. 2013;7(4).
  16. 16. Gloria-Soria A, Ayala DA, Bheecarry O, Calderon-Arguedas D, Chadee M, Chiappero M. et al. Global genetic diversity of Aedes aegypti. Mol Ecol. 2016; 25: 5377–5395. pmid:27671732
  17. 17. Evans BR, Gloria-Soria A, Hou L, McBride C, Bonizzoni M, Zhao H, et al. A Multipurpose, High-Throughput Single-Nucleotide Polymorphism Chip for the Dengue and Yellow Fever Mosquito, Aedes aegypti. G3 (Bethesda). 2015; 5: 711–718.
  18. 18. Kotsakiozi P, Evans BR, Gloria‐Soria A, Kamgang B, Mayanja M, Lutwama J,. et al. Population structure of a vector of human diseases: Aedes aegypti in its ancestral range, Africa. Eco Evol. 2018; 8: 7835–7848.
  19. 19. Crawford JE, Alves JM, Palmer WJ, Day JP, Sylla M, Ramasamy R, et al. Population genomics reveals that an anthropophilic population of Aedes aegypti mosquitoes in West Africa recently gave rise to American and Asian populations of this major disease vector. Bmc Biol. 2017;15.
  20. 20. Dickson LB, Campbell CL, Juneja P, Jiggins FM, Sylla M, Black WC. Exon Enriched Libraries Reveal Large Genic Differences Between Aedes aegypti from Senegal, West Africa and Populations Outside Africa. G3 (Bethesda). 2016; g3–116
  21. 21. Jaimes-Duenez J, Arboleda S, Triana-Chavez O, Gomez-Palacio A. Spatio-Temporal Distribution of Aedes aegypti (Diptera: Culicidae) Mitochondrial Lineages in Cities with Distinct Dengue Incidence Rates Suggests Complex Population Dynamics of the Dengue Vector in Colombia. Plos Neglect Trop D. 2015; 9 p.e0003553.
  22. 22. Lorenz L, Beaty BJ, Aitken THG, Wallis GP, Tabachnick WJ. The Effect of Colonization Upon Aedes aegypti—Susceptibility to Oral Infection with Yellow-Fever Virus. Am J Trop Med Hyg. 1984; 33: 690–694. pmid:6476217
  23. 23. Wallis GP, Aitken THG, Beaty BJ, Lorenz L, Amato GD, Tabachnick WJ. Selection for Susceptibility and Refractoriness of Aedes aegypti to Oral Infection with Yellow-Fever Virus. Am J Trop Med Hyg. 1985; 34: 1225–1231. pmid:3834805
  24. 24. Galtier N, Nabholz B, Glémin S, Hurst GD. Mitochondrial DNA as a marker of molecular diversity: a reappraisal. Mol ecol. 2009;18:4541–50. pmid:19821901
  25. 25. Cameron SL. Insect mitochondrial genomics: implications for evolution and phylogeny. Annu rev entomol. 2014;59:95–117. pmid:24160435
  26. 26. Yang Y, Xu S, Xu J, Guo Y, Yang G. Adaptive evolution of mitochondrial energy metabolism genes associated with increased energy demand in flying insects. PloS one. 2014;9(6):e99120. pmid:24918926
  27. 27. Lunt DH, Zhang DX, Szymura JM, Hewltt OM. The insect cytochrome oxidase I gene: evolutionary patterns and conserved primers for phylogenetic studies. Insect mol biol. 1996;5(3):153–65. pmid:8799733
  28. 28. Belkin JN. The mosquitoes of the South Pacific (Diptera, Culicidae). vol. 2, University of California Press, Berkeley and Los Angeles, 1962.
  29. 29. Ballinger-Crabtree ME, Black WC, Miller BR. Use of Genetic Polymorphisms Detected by the Random-Amplified Polymorphic DNA-Polymerase Chain-Reaction (RAPD-PCR) for Differentiation and Identification of Aedes aegypti Subspecies and Populations. Am J Trop Med Hyg 1992; 47: 893–901. pmid:1471745
  30. 30. Simon C, Frati F, Beckenbach A, Crespi B, Liu H, Flook P. Evolution weighting, and phylogenetic utility of mitochondrial gene sequences and a compilation of conserved polymerase chain reaction primers. Ann entomol Soc Am. 1994;87:651–701.
  31. 31. Nei M. Molecular Evolutionary Genetics: Columbia University Press, New York, 1987.
  32. 32. Tajima F. Evolutionary Relationship of DNA-Sequences in Finite Populations. Genetics. 1983;105:437–60. pmid:6628982
  33. 33. Fu YX, Li WH. Statistical Tests of Neutrality of Mutations. Genetics. 1993;133:693–709. pmid:8454210
  34. 34. Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–2. pmid:19346325
  35. 35. Hlaing T, Tun-Lin W, Somboon P, Socheat D, Setha T, Min S, et al. Mitochondrial pseudogenes in the nuclear genome of Aedes aegypti mosquitoes: implications for past and future population genetic studies. Bmc Genet. 2009;10.
  36. 36. Black WC, Bernhardt SA. Abundant nuclear copies of mitochondrial origin (NUMTs) in the Aedes aegypti genome. Insect Mol Biol. 2009;18(6):705–13. pmid:19849722
  37. 37. Stamatakis A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3. pmid:24451623
  38. 38. Stover BC; Muller KF. TreeGraph 2: Combining and visualizing evidence from different phylogenetic analyses. Bmc Bioinformatics. 2010; 11.
  39. 39. Ashfaq M, Hebert PDN, Mirza JH, Khan AM, Zafar Y, Mirza MS. Analyzing Mosquito (Diptera: Culicidae) Diversity in Pakistan by DNA Barcoding. Plos One. 2014;9(5).
  40. 40. Mousson L, Dauga C, Garrigues T, Schaffner F, Vazeille M, Failloux AB. Phylogeography of Aedes (Stegomyia) aegypti (L.) and Aedes (Stegomyia) albopictus (Skuse) (Diptera : Culicidae) based on mitochondrial DNA variations. Genet Res. 2005;86(1), 1–11. pmid:16181519
  41. 41. Paupy C, Le Goff G, Brengues C, Guerra M, Revollo J, Simon ZB, et al. Genetic structure and phylogeography of Aedes aegypti, the dengue and yellow-fever mosquito vector in Bolivia. Infect Genet Evol. 2012;12(6):1260–9. pmid:22522103
  42. 42. Vadivalagan C, Karthika P, Murugan K, Panneerselvam C, Paulpandi M, Madhiyazhagan P, et al. Genetic deviation in geographically close populations of the dengue vector Aedes aegypti (Diptera: Culicidae): Influence of environmental barriers in South India. Parasitol Res. 2016;115(3):1149–60. pmid:26627691
  43. 43. Gupta K, Dhawan R, Kajla M, Kumar S, Jnanasiddhy B, Singh NK, et al. Molecular identification of Aedes aegypti mosquitoes from Pilani region of Rajasthan, India. J Vector Dis. 2016;53(2):149–55.
  44. 44. Mcclelland GAH, Mamet R. Aedes aegypti (L.) and Aedes mascarensis Macgregor in Mauritius—a Case of Gene Survival Following Species Eradication. Nature. 1962;195:965–6.
  45. 45. Hartberg WK, Craig GB. Reproductive Isolation in Stegomyia Mosquitoes .I. Sexual Isolation between Aedes aegypti and A. mascarensis. Ann Entomol Soc Am. 1968;61:865–870. pmid:5758054
  46. 46. Bouckaert R, Heled J, Kuhnert D, Vaughan T, Wu CH, Xie D, et al. BEAST 2: A Software Platform for Bayesian Evolutionary Analysis. PlosComput Biol. 2014; pmid:24722319
  47. 47. Hasegawa M. Kishino H, Yano TA. Dating of the Human Ape Splitting by a Molecular Clock of Mitochondrial-DNA. J Mol Evol. 1985;22:160–74. pmid:3934395
  48. 48. Papadopoulou A, Anastasiou I, Vogler AP. Revisiting the Insect Mitochondrial Molecular Clock: The Mid-Aegean Trench Calibration. Mol Biol Evol. 2010:27:1659–72. pmid:20167609
  49. 49. Cornuet JM, Pudlo P, Veyssier J, Dehne-Garcia A, Gautier M, Leblois R, et al. DIYABC v2.0: a software to make approximate Bayesian computation inferences about population history using single nucleotide polymorphism, DNA sequence and microsatellite data. Bioinformatics. 2014;30:1187–9. pmid:24389659
  50. 50. deMenocal PB. Plio-Pleistocene African climate. Science. 1995;270(5233):53–9. . pmid:7569951
  51. 51. Trauth MH, Larrasoana JC, Mudelsee M. Trends rhythms and events in Plio-Pleistocene African climate. Quaternary Sci Rev. 2009;28:399–411.
  52. 52. Trauth MH, Maslin MA, Deino AL, Strecker MR, Bergner AGN, Duhnforth M. High- and low-latitude forcing of Plio-Pleistocene East African climate and human evolution. J Hum Evol. 2007;53:475–86. pmid:17959230
  53. 53. Brower AVZ. Rapid Morphological Radiation and Convergence among Races of the Butterfly Heliconius-Erato Inferred from Patterns of Mitochondrial-DNA Evolution. P Natl Acad Sci USA. 1994;91:6491–5, pmid:8022810
  54. 54. Kumara YG, Weerakoon KG. Review of Port City as a Place; Port City in Hambantota, Sri Lanka. Eur Aca Res. 2014;2(2):2350–70
  55. 55. Paduan KD, Ribolla PEM. Mitochondrial DNA polymorphism and heteroplasmy in populations of Aedes aegypti in Brazil. J Med Entomol. 2008;45(1):59–67. pmid:18283943
  56. 56. Ribeiro JMC, Arca B, Lombardo F, Calvo E, Phan V.M, Chandra PK. et al. An annotated catalogue of salivary gland transcripts in the adult female mosquito, Aedes aegypti. Bmc Genomics. 2007;8. Artn 6 pmid:17204158
  57. 57. Gorrochotegui-Escalante N, Munoz MD, Fernandez-Salas I, Beaty BJ, Black WC. Genetic isolation by distance among Aedes aegypti populations along the northeastern coast of Mexico. Am J Trop Med Hyg. 2000;62(2):200–9. pmid:10813474
  58. 58. Lima RS, Scarpassa VM. Evidence of two lineages of the dengue vector Aedes aegypti in the Brazilian Amazon, based on mitochondrial DNA ND4 gene sequences. Genet Mol Biol. 2009;32(2):414–22. pmid:21637700
  59. 59. Fort P, Albertini A, Van-Hua A, Berthomieu A, Roche S, Delsuc F. et al. Fossil Rhabdoviral Sequences Integrated into Arthropod Genomes: Ontogeny, Evolution, and Potential Functionality. Mol Biol Evol. 2012;29(1):381–90. pmid:21917725
  60. 60. Morlais I, Severson DW. Complete mitochondrial DNA sequence and amino acid analysis of the cytochrome C oxidase subunit I (COI) from Aedes aegypti. DNA Sequence. 2002;13(2):123–7. pmid:12180346
  61. 61. Cywinska A, Hunter FF, Hebert PDN. Identifying Canadian mosquito species through DNA barcodes. Med Vet Entomol. 2006;20(4):413–24. pmid:17199753
  62. 62. Cook S, Moureau G, Harbach RE, Mukwaya L, Goodger K, Ssenfuka F. et al. Isolation of a novel species of flavivirus and a new strain of Culex flavivirus (Flaviviridae) from a natural mosquito population in Uganda. J Gen Virol. 2009;90:2669–78. pmid:19656970
  63. 63. Ajamma YU, Villinger J, Omondi D, Salifu D, Onchuru TO, Njoroge L. et al. Composition and genetic diversity of mosquitoes (Diptera: Culicidae) on Islands and mainland shores of Kenya's lakes Victoria and Baringo. J Med Entomol. 2016.
  64. 64. Kumar NP, Rajavel AR, Natarajan R, Jambulingam P. DNA barcodes can distinguish species of Indian mosquitoes (Diptera : Culicidae). J Med Entomol. 2007;44(1):1–7. pmid:17294914
  65. 65. Ballard JWO, Puslednik L, Wolff JN, Russell RC. Variation under nature: A sesquicentennial DNA barcoding perspective. Chiang Mai J Sci. 2009:36(2):188–99.
  66. 66. Calvez E, Guillaumot L, Millet L, Marie J, Bossin H, Rama V. et al. Genetic diversity and phylogeny of Aedes aegypti, the main arbovirus vector in the Pacific. Plos Neglect Trop D. 2016;10(1): ARTN e0004374 pmid:26799213
  67. 67. Batovska J, Blacket MJ, Brown K, Lynch SE. Molecular identification of mosquitoes (Diptera: Culicidae) in southeastern Australia. Ecol Evol. 2016;6(9):3001–11. pmid:27217948
  68. 68. Freitas MTS, Gomes PP, Batista MVA, Leal-Balbino TC, Araujo AL, Balbino VQ. Novel DNA extraction assay for molecular identification of Aedes spp eggs. Genet Mol Res. 2014;13(4):8776–82. pmid:25366769