Skip to main content
Advertisement
  • Loading metrics

Dispersal patterns of Trypanosoma cruzi in Arequipa, Peru

  • Alexander S. F. Berry,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Division of Gastroenterology, Hepatology, and Nutrition, Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America, Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America

  • Renzo Salazar-Sánchez,

    Roles Investigation, Project administration, Writing – review & editing

    Affiliation Zoonotic Disease Research Lab, One Health Unit, School of Public Health and Administration, Universidad Peruana Cayetano Heredia, Arequipa, Peru

  • Ricardo Castillo-Neyra,

    Roles Project administration, Writing – review & editing

    Affiliations Zoonotic Disease Research Lab, One Health Unit, School of Public Health and Administration, Universidad Peruana Cayetano Heredia, Arequipa, Peru, Department of Biostatistics, Epidemiology and Informatics, The Perelman School of Medicine of the University of Pennsylvania, Philadelphia, Pennsylvania, United States of America

  • Katty Borrini-Mayorí,

    Roles Supervision, Writing – review & editing

    Affiliation Zoonotic Disease Research Lab, One Health Unit, School of Public Health and Administration, Universidad Peruana Cayetano Heredia, Arequipa, Peru

  • Claudia Arevalo-Nieto,

    Roles Data curation, Writing – review & editing

    Affiliation Zoonotic Disease Research Lab, One Health Unit, School of Public Health and Administration, Universidad Peruana Cayetano Heredia, Arequipa, Peru

  • Claudia Chipana-Ramos,

    Roles Methodology, Writing – review & editing

    Affiliation Zoonotic Disease Research Lab, One Health Unit, School of Public Health and Administration, Universidad Peruana Cayetano Heredia, Arequipa, Peru

  • Melina Vargas-Maquera,

    Roles Methodology, Writing – review & editing

    Affiliation Zoonotic Disease Research Lab, One Health Unit, School of Public Health and Administration, Universidad Peruana Cayetano Heredia, Arequipa, Peru

  • Jenny Ancca-Juarez,

    Roles Investigation, Writing – review & editing

    Affiliation Zoonotic Disease Research Lab, One Health Unit, School of Public Health and Administration, Universidad Peruana Cayetano Heredia, Arequipa, Peru

  • César Náquira-Velarde †,

    † Deceased.

    Roles Project administration, Writing – review & editing

    Affiliation Zoonotic Disease Research Lab, One Health Unit, School of Public Health and Administration, Universidad Peruana Cayetano Heredia, Arequipa, Peru

  • Michael Z. Levy,

    Roles Funding acquisition, Investigation, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing

    Affiliations Zoonotic Disease Research Lab, One Health Unit, School of Public Health and Administration, Universidad Peruana Cayetano Heredia, Arequipa, Peru, Department of Biostatistics, Epidemiology and Informatics, The Perelman School of Medicine of the University of Pennsylvania, Philadelphia, Pennsylvania, United States of America

  • Dustin Brisson

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing

    dbrisson@sas.upenn.edu

    Affiliation Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America

Abstract

Anthropogenic environmental alterations such as urbanization can threaten native populations as well as create novel environments that allow human pests and pathogens to thrive. As the number and size of urban environments increase globally, it is more important than ever to understand the dispersal dynamics of hosts, vectors and pathogens of zoonotic disease systems. For example, a protozoan parasite and the causative agent of Chagas disease in humans, Trypanosoma cruzi, recently colonized and spread through the city of Arequipa, Peru. We used population genomic and phylogenomic tools to analyze whole genomes of 123 T. cruzi isolates derived from vectors and non-human mammals throughout Arequipa to determine patterns of T. cruzi dispersal. The data show significant population genetic structure within city blocks—parasites in the same block tend to be very closely related—but no population structure among blocks within districts—parasites in neighboring blocks are no more closely related to one another than to parasites in distant districts. These data suggest that T. cruzi dispersal within a block occurs regularly and that occasional long-range dispersal events allow the establishment of new T. cruzi populations in distant blocks. Movement of domestic animals may be the primary mechanism of inter-block and inter-district T. cruzi dispersal.

Author summary

Urbanization creates environments that are ideal for some human pests and pathogens. As the number and size of urban environments increases globally, it is becoming vital to understand how human disease-causing pathogens, their vectors, and their non-human hosts disperse through urban landscapes. Here we study a population of Trypanosoma cruzi–the protozoan parasite and causative agent of Chagas disease in humans–that recently colonized the city of Arequipa, Peru. We use population genomic and phylogenomic tools to understand how this parasite population dispersed through the city to achieve its current distribution and abundance. We show that T. cruzi collected from the same city block tend to be very closely related, while those from neighboring blocks are often as distantly related as those from blocks in distant districts. The data suggest that vectors facilitate frequent within-block dispersal of the parasite, while domestic animal movement may facilitate the relatively infrequent inter-block and inter-district dispersal.

Introduction

Human populations are increasingly moving from rural to urban centers resulting in fundamental habitat alterations that have caused local extinctions of many native species. This influx of humans and domestic animals also creates suitable habitats that can be colonized by invasive species, including human pests and pathogens. Given the current rate of immigration and urbanization, the geographic distributions of many pathogens will depend on the manner in which their hosts and vectors interact with the novel biotic and abiotic features of growing cities [1].

Proliferation in an ecosystem is accomplished by immigration to a new location, establishment (reproduction and population growth), and dispersal to new areas within the ecosystem [1]. Investigations into the dynamic processes of immigration, establishment, and dispersal of disease-causing agents in human-altered ecosystems are particularly important for public health risk management because the distribution and abundance of these pest species is correlated with the incidence of disease in humans [2]. In this study, we examine the dispersal pattern of Trypanosoma cruzi, the causative agent of Chagas disease, around the region of Arequipa, Peru.

The Chagas disease system in Arequipa, Peru is ideal to study pathogen dispersal in a rapidly changing environment for several reasons. First, a single introduction of T. cruzi recently established in Arequipa and has since dispersed to many districts in the city [3]. A single population allows dispersal patterns through the city to be assessed without being confounded by repeated immigration from other geographic regions. Second, the expansion of urban and semi-urban environments over the past ~50 years [4,5] has resulted in a mosaic of several habitat types, including different quantities of domestic animals and different husbandry practices, each of which provides unique challenges and opportunities for T. cruzi to establish and proliferate [57]. Lastly, this system is well studied, permitting hypotheses about the proximal causes of dispersal patterns to be formed. For example, dispersal of Triatoma infestans, the only vector of T. cruzi in Arequipa, is hindered by roads between city blocks, making it unlikely that the vector facilitates T. cruzi block-to-block migration [6]. Additionally, the number of domestic animals in and around each house in Arequipa is known, which will allow us to correlate the migration patterns of T. cruzi with the presence and abundance of important reservoir hosts in future studies. Here, we investigate how a zoonotic pathogen has dispersed across this complex urban landscape by analyzing the genome sequences of 123 T. cruzi isolates collected throughout the city of Arequipa, Peru.

Results

T. cruzi cluster within city blocks, not within districts

123 DTU 1 T. cruzi parasites, representing a single, closely related population [3], were collected from ten geographically-distinct districts throughout the region of Arequipa, Peru (Fig 1). Multiple isolates were sequenced from 20 of the 37 blocks in Arequipa where T. cruzi were found. On only three of these blocks were all of the isolates sequenced cultured from different bugs collected from a single house, although the isolates were collected in different years in two of these cases. On 17 blocks, an average of 5 isolates were sequenced which were cultured from bugs collected from an average of 3 houses per block. The common ancestor of the Arequipan T. cruzi population was estimated to live as recently as 8.8kya, however there is significant uncertainty in estimates of divergence times [3]. A total of 9271 of the >28 million sites in the genome sequences were polymorphic from the 123 T. cruzi parasites. The average pairwise genetic distance between all samples was 1726 SNPs (Table 1). The average pairwise distance between samples from different houses within city blocks, however, was considerably smaller (620 SNPs). Further, the average pairwise distance between samples from different blocks within each district was much greater (1925 SNPs) than within-block genetic differences (Fig 2). There is significantly less variation among samples from different houses within blocks than expected (p<0.001) given the total diversity among samples (Fig 2; Table 2). The limited genetic diversity within each block observed in population genetic analyses is also evident in phylogenetic analyses as most city blocks contain a single T. cruzi lineage where all within-block diversity was generated by mutations after colonization of the block (Fig 3, S1 Appendix). This result is also supported by ADMIXTURE analyses of the 56 samples from the densely-sampled Mariano Melgar district, confirming that most city blocks are dominated by a single T. cruzi genetic cluster representing a single lineage (S1 Fig). Additionally, although sampling is limited, we find no evidence of temporal variation.

thumbnail
Fig 1.

Map of sample collection locations in A) Arequipa, Peru and the B) Mariano Melgar district. A) All 123 T. cruzi samples used in this analysis were collected within 100km of Arequipa, Peru from 2008–2015. Sampled T. cruzi isolates are represented by red dots. Mariano Melgar—the city center and most densely-sampled district—is indicated with a yellow star. Inset shows the location of Arequipa in southern Peru. Map of Arequipa was obtained from https://landlook.usgs.gov, sample locations were plotted using QGIS v. 3.4 [8], and inset was modified from https://commons.wikimedia.org/wiki/Atlas_of_the_world [9]. B) Blocks from which T. cruzi were collected in the Mariano Melgar district (N = 56) are numbered 1–12 with the number of samples collected per block in parentheses. No parasites were collected in unlabeled blocks. Blocks are displayed in a grid to maintain privacy with black lines representing streets separating blocks.

https://doi.org/10.1371/journal.pntd.0007910.g001

thumbnail
Fig 2. T. cruzi isolates collected from the same block are genetically similar.

Violin plots represent the number of pairwise SNP differences among pairs of samples collected in different blocks from the same district (left) and among pairs of samples collected in the same block (right). The width of the violins represents the frequency that pairwise distances were detected. Samples found in different blocks within a district are significantly more different from one another than samples collected within the same city block (p<0.001). Genetically-similar samples tend to cluster within blocks despite each district containing much of the overall genetic diversity among all samples.

https://doi.org/10.1371/journal.pntd.0007910.g002

thumbnail
Fig 3. T. cruzi collected from the same block often forms a monophyletic group.

The maximum clade credibility phylogeny includes all 123 T. cruzi samples collected from Arequipa with samples from Mariano Melgar labeled blue and those from La Joya labeled green. Block numbers are labeled for the 48 samples collected from Blocks 1–7 in Mariano Melgar. Black branches represent samples collected from the other districts in the Arequipa region: Bustamante, Cayma, Huanca, Miraflores, Sachaca, Tiabaya, Uchumayo, and Vitor. T. cruzi from each district, for example, Mariano Melgar and La Joya, span the phylogenetic diversity in the region suggesting that most of the genetic variation in Arequipa can be found within each single district. The T. cruzi in many blocks within Mariano Melgar were likely introduced from another district as their closest relatives were detected in other districts and not in other blocks within Mariano Melgar. Jagged lines in most basal branch indicate that the branch was shortened for visualization. Posterior probabilities less than 0.80 are labeled for basal nodes. A modifiable phylogeny containing all tip labels and posterior probabilities can be found in S1 Appendix.

https://doi.org/10.1371/journal.pntd.0007910.g003

thumbnail
Table 2. There is significant population genetic structure among blocks within districts, but not among districts.

https://doi.org/10.1371/journal.pntd.0007910.t002

T. cruzi collected from the same block tend to be very closely related compared to parasites found in different blocks (Fig 2). T. cruzi gene flow is significantly restricted among blocks in analyses of the whole dataset and in analyses focusing only on Mariano Melgar (p<0.001; Table 2). This is consistent with the result that T. cruzi in different blocks within Mariano Melgar belong to unique genetic clusters (S1 Fig, S2 Fig). Further, T. cruzi lineages occupying neighboring city blocks are often phylogenetically divergent such that geographic distance is not correlated with evolutionary distance (Fig 3).

There is no evidence supporting restricted gene flow among the districts of Arequipa (Table 1). The majority of the total genomic diversity is present in the two most densely-sampled districts—Mariano Melgar and La Joya (Fig 3). Further, the lineages occupying city blocks within Mariano Melgar or La Joya are commonly most closely related to lineages in other districts, suggesting dispersal among districts in the region is not sufficiently restricted to result in population genetic structure.

Genetic linkage among SNPs

Many of the genomic differences among strains are the result of independent assortment of chromosomes or cross-over events that occur during sexual reproduction [9,10]. To account for the potential lack of independence among SNPs in the analyses, a dataset containing the presence or absence of 474 unique recombination events was generated as previously described [9]. All analyses of this dataset showed that using presence/absence of recombination events resulted in nearly identical estimations of relatedness and support for conclusions as the SNP dataset, despite the loss of power. The similarities of the results among these datasets likely results from the identity-by-decent of the recombination events (S1 Table; S3 Fig), making these events evolutionarily informative markers.

Animal husbandry and migration

The number of guinea pigs imported from markets and farms was evaluated for 383 households across a transect in Mariano Melgar [11]. Interestingly a large number of guinea pigs were imported from a market into the blocks in which we detected multiple independent T. cruzi introductions. The distribution of guinea pigs is also correlated with the density of Triatoma infestans vectors, as well as the presence of T. cruzi in vectors. No correlation between the vector nor parasite population and either dogs or chickens was identified (S4 Fig).

Discussion

Investigations into the impact of environmental change have focused primarily on habitat destruction and extinction of natural fauna [1217]. Equally impactful is the creation of novel habitats that can be exploited by non-native species including pests associated with human-altered habitats [2]. Understanding how native and non-native species exploit human altered habitats, especially in human-dense urban habitats, is critical to understanding the distribution and abundance of human diseases. Here, we investigate the dispersal patterns of Trypanosoma cruzi, the cause of human Chagas disease, through the urban ecosystem of Arequipa, Peru. The genomic data support the previously described pattern of migration in which dispersal proceeds with little impediment among houses within a city block soon after colonization of the block (Figs 2 and S1) [5]. For example, the T. cruzi population on Block 5 was founded by a single lineage in 2010 and could be collected from multiple houses throughout the block by 2011 (Fig 3) [5]. Establishing a novel T. cruzi population in a city block, however, is a rare event as no T. cruzi were detected in a majority of blocks and blocks harboring a population tend to contain only a single genetic lineage suggestive of a single colonization event (Fig 3). Interestingly, geographic proximity to blocks harboring a T. cruzi population increases the establishment probability of that block only slightly (Figs 3 and S1). Further, the T. cruzi lineages in eight of the 12 blocks investigated in the Mariano Melgar district were most closely related to T. cruzi found only in a geographically distant district of the region suggesting that inter-district immigration is not less likely than immigration from proximal blocks (Fig 3). The dispersal pattern observed in Arequipa is the likely result of relatively rare immigration to blocks without an established T. cruzi population and a low probability of establishing a population after immigration. These patterns support a model of gene flow in which city blocks are relatively homogenous, high-quality patches that allow within-block gene flow, separated by inhospitable barriers (i.e. roads and streets) that greatly reduce gene flow between adjacent blocks but that can act as conduits for inter-district gene flow through the aided movement of infected hosts or vectors.

The population genetic and phylogenetic analyses suggest that limited barriers to T. cruzi gene flow exist within city blocks. After T. cruzi establishes in a new block, the population readily disperses to neighboring houses within the block, likely within infected vectors that readily move between houses through shared walls [6,18]. T. cruzi population expansion within blocks is likely the result of the abundance of resources for both the parasite and its vector. For example, many households raise guinea pigs–which are particularly important reservoirs of T. cruzi [5,1824] –in addition to housing dogs and humans, which offer an abundance of blood meals for the vector and competent hosts for the parasite [5]. These data suggest that blocks remain free of T. cruzi due to either low rates of immigration or low probabilities of establishment as populations thrive once established.

The discontiguous distribution of T. cruzi across city blocks suggests that dispersal between adjoining blocks is rare. T. cruzi was not detected in a majority of city blocks surveyed between 2008–2012 [5], despite many containing an active vector population (S5 Fig). Further, many blocks with a large T. cruzi population support only a single lineage suggesting most blocks were colonized only once (Fig 3). These data suggest either strong barriers to dispersal among city blocks or barriers that hinder the establishment of new populations. Prior investigations demonstrated that streets are a barrier to T. infestans dispersal, the primary vector of T. cruzi in the area [6,25], which is consistent with the behavior of other species in urban areas [2628]. Nevertheless, dispersal of both T. cruzi and T. infestans has occurred between neighboring blocks, especially in older areas of the city. For example, a single T. cruzi lineage colonized neighboring Blocks 1–4 and a sample collected in Block 6 appears to have migrated from Block 7 (Fig 3). By contrast, the recently established population in Block 5 [5] has not invaded neighboring blocks (Fig 3). These data suggest that migration to adjacent blocks is relatively rare but can occur given sufficient time, supporting the hypothesis of a low inter-block dispersal rate.

The distribution of T. cruzi across city blocks is comparatively more fragmented than the distribution of its vector, suggesting that the parasite faces barriers to dispersal in addition to those encountered by the vector. These barriers could result from both life history characteristics of T. cruzi–such as inefficient transmission from vectors to hosts [20,29] –and specific environmental features within blocks–such as the absence of vectors, competitive interactions with previously established T. cruzi lineages, or human-specific activity patterns–any of which could reduce the probability of establishing a population after introduction. The low transmission probability from vectors to hosts likely has a large impact on the probability of establishing a T. cruzi population because the parasite is not vertically [30] nor horizontally [23] transmitted between vectors. Therefore, T. cruzi may be unable to establish a population in a new block even if its infected vector successfully establishes a population. Indeed, many blocks contain an active vector population but do not harbor any T. cruzi (S5 Fig). While the dispersal of T. infestans is hindered by streets, there is some local between-block dispersal such that blocks within 5km of one another have an increased probability of harboring closely-related T. infestans [6,7]. Conversely, there is minimal evidence of local T. cruzi dispersal such that migration primarily occurs within blocks and between districts.

The geographic distribution of T. cruzi may also be impacted by the historical or current diversity of environmental features among city blocks, which could result in different immigration rates or establishment probabilities on each block [31]. For example, one block within the heavily sampled Mariano Melgar district (Fig 3, Block 6) contains six unique T. cruzi lineages (Fig 3), including one that immigrated from a proximal block (Block 7) and five that immigrated from different districts in the Arequipa region. By contrast, all other blocks have received no more than two independent introductions. The immigration and establishment of multiple lineages from adjacent and distant areas into one block may have resulted from unique environmental features or human activities such as the breeding or trading of guinea pigs [11].

The source of the immigrants that establish in each block do not appear to be more likely to originate from proximal blocks (four events) than from other districts (ten events) (Fig 3). That is, the closest relatives of the lineages inhabiting each block are often found across Arequipa and not necessarily the closest block with an active infestation. T. cruzi gene flow among districts is unlikely to be mediated by natural dispersal of infected vectors because T. infestans can only fly ~1km and do so only under stressful conditions such as starvation; an unlikely occurrence in human- and animal-occupied houses [32,33]. Interdistrict T. cruzi gene flow may instead be facilitated by the human-mediated transport of infected vectors or hosts. That is, human movement along the transportation infrastructure of the connected urban centers could facilitate the migration of T. infestans and reservoir species of T. cruzi among districts [34]. One potential hypothesis is human-mediated transport of guinea pigs, which are regularly brought to the study area from farms or purchased at a market (~8% of all individual guinea pigs). Given the small sample size, T. cruzi introductions cannot be statistically correlated with the number of guinea pigs imported onto infected blocks. However, the high rate of inter-district guinea pig importation is likely to increase the probability of inter-district T. cruzi dispersal given that a single infected guinea pig can infect many vectors over its years-long lifetime [5,21,24]. The hypothesis that guinea pigs can infect many vectors is further supported because the number of guinea pigs is correlated with the density of T. infestans (S4 Fig), showing that the presence of guinea pigs helps support a large vector population which increases the number of potential T. cruzi vectors.

The T. cruzi genotypes sampled in Arequipa are exceptionally uniform, as discussed previously [3] and as shown in previous reports of the broader region surrounding Arequipa [35]. Most T. cruzi sampled here were isolated from the only vector species in Arequipa, T. infestans, which reduces dataset biases for specific T. cruzi genotypes as T. cruzi from all infected mammal hosts must be transmitted to T. infestans to complete its life cycle. The T. cruzi samples were isolated from dogs and guinea pigs support this hypothesis as these samples are closely related to those isolated from vectors. Further, because T. cruzi must pass through T. infestans to complete its life cycle, the parasite must infect a vector in order to proliferate in a new environment. Previous studies in domestic settings have found that the vector population harbors a more diverse T. cruzi population than hosts, suggesting that vectors carry the strains in local hosts in addition to strains not yet transmitted from vector to host [36]. One potential caveat is that T. cruzi genotypes residing long-term in mammalian hosts that have not transmitted to bugs will remained unsampled. Indeed, the only T. cruzi isolated from a human in this dataset has a unique genotype from those sequenced from bugs, dogs, and guinea pigs [3]. However, epidemiological models suggest that infected hosts are likely to transmit T. cruzi to vectors given sufficient time and vector abundance. That is, one infected human infects 21 vectors on average while 100 vectors on average are needed to infect one human [37,38]. By sampling T. cruzi primarily from vectors we likely capture the least biased representation of the diversity in the city in order to determine how a single population of T. cruzi migrated through Arequipa.

The data presented here demonstrate that urban areas such as Arequipa represent a patchwork of disconnected habitable areas, city blocks, with limited T. cruzi gene-flow among blocks and few barriers to gene-flow within blocks. The among-block gene flow that has occurred does not appear to be correlated with geographic distance, with many migration events occurring between districts that are unlikely to be traversed by vectors or mammalian hosts in the absence of human aid. These data support the hypothesis that considerable T. cruzi gene flow is facilitated by human transport of infected vectors or hosts. Such long-distance dispersal across the urban ecosystem may be possible for any species capable of exploiting human activity as a mechanism for migration. Without human-mediated migration, a T. cruzi population may be confined to individual blocks with infrequent dispersal across the barriers between blocks. These results highlight a difficulty in controlling dispersal of parasites like T. cruzi in urban areas. Long-term, city-wide vector control campaigns and continuous vector surveillance activities may be required to prevent spread of T. cruzi to new areas.

Methods

Ethics statement

The Institutional Animal Care and Use Committee (IACUC) of Universidad Peruana Cayetano Heredia reviewed and approved the animal-handling protocol used for this study (identification number 59605). The Universidad Peruana Cayetano Heredia Ethics Committee provided approval for human subject research and for the collection of bugs from households (identification number 52186). Human subject research participants provided written consent for the collection of bugs from his or her household and for human subject research. The IACUC of Universidad Peruana Cayetano Heredia is registered in the National Institutes of Health at the United States of America with PHS Approved Animal Welfare Assurance Number A5146-01 and adheres to the Animal Welfare Act of 1990 [3,39].

Sample collection and study site

Briefly, 123 T. cruzi samples were isolated from infected Triatoma infestans bugs (N = 114), dogs (N = 2), and guinea pigs (N = 7) from houses throughout Arequipa, Peru (Fig 1A). Nearly half of the samples isolated from T. infestans (N = 56) were collected during intensive inspections of houses and surrounding areas along an established transect within the Mariano Melgar district (Fig 1B) in 2010 and 2011 as previously described [5]. An effort was made to inspect every house along this transect and collect bugs whenever possible such that T. cruzi isolates from most houses that harbored infected bugs were sequenced (S5 Fig). In 2011, in-depth interviews were conducted in each participating household that focused on animal husbandry [11]. T. cruzi DNA was extracted from each sample and sequenced to an average depth of >50X as described previously [9].

Whole genome assembly

Genomes were assembled using bowtie2 [40] and the most closely related reference genome, TcJR clone 4, obtained from TriTrypDB [41]. Only the 333 contigs longer than 10kb were used for the assembly to avoid spurious alignments for a total genome assembly that includes 28Mbp. This assembly largely excluded the extensive repeat regions found throughout the T. cruzi genome. Duplicate reads were removed using Picard MarkDuplicates [42].

SNP calling

Individual genomic Variant Call Format (gVCF) files containing SNP data for each sample were generated using GATK HaplotypeCaller [42,43] following GATK’s Best Practices procedure [44,45]. A joint genotype file containing all polymorphic sites from all samples was created using GATK GenotypeGVCF. Insertion-deletion mutations were not excluded in the joint genotype file. Polymorphic loci were hard-filtered by quality using GATK VariantFiltration, requiring Fisher strand bias (FS) <40, mapping quality >30, and quality by depth >10. Only loci for which all samples achieved a minimum depth of 20X and a Genotype Quality score greater than 40 were included. These filters maximized the number of polymorphic sites identified while ensuring that duplicate and triplicate sequences resulted in identical SNP datasets. The final consensus SNP panel included 9271 polymorphic sites.

Population genetic analyses

VCF files were converted to plink format using vcftools [46] and plink [47]. Violin plots were used to visualize pairwise SNP distances between samples collected at different spatial scales using ggplot2 [48] in R [49]. AMOVA was performed using the poppr package in R [50]. Principle component analyses (PCA) were performed using Tassel 5 [51]. Phylogenetic relationships among T. cruzi isolates were inferred using a maximum clade credibility (MCC) phylogenetic tree reconstructed using BEAST 1.8.4 [52]. The best-fitting substitution model was determined using BEAST Model Test implemented in BEAST2 [53]. Starting from a UPGMA starting tree, the model implemented a Kimura 3-parameter substitution model with equal base frequencies, an Extended Bayesian Skyline coalescent tree prior [54], and a strict molecular clock. Log files were examined for convergence using Tracer v1.6.0 [55] ensuring ESS values above 200. The MCC tree was calculated using TreeAnnotator 1.8.4 with 10% burn-in and visualized in FigTree (available at http://beast.bio.ed.ac.uk). Phylogenetic analyses were performed using the BEAGLE library to increase computation speed [56,57]. Up to 100 independent iterations of ADMIXTURE [58] were run for each number of genetic clusters (K, ranging from 2 to 10) assuming linkage disequilibrium until the log-likelihood increased by less than ε = 10−4 between iterations. The optimal number of clusters was estimated to be 4 by the cross-validation score averaged across 100 iterations (S6 Fig). The optimal alignment of the 100 iterations was calculated using CLUMPP [59].

Animal husbandry and host dispersal data

Households across a transect in the Mariano Melgar district (N = 383) were surveyed for information including the number of guinea pigs owned, the number obtained from markets and farms, and the location of the markets or farms [11]. The number of guinea pigs imported from markets or farms from outside of Mariano Melgar was aggregated at the block level. T. cruzi genomes and guinea pig data were available for 8 blocks in Mariano Melgar (Blocks 1–8). The minimum number of T. cruzi introductions to each block was estimated based on phylogenetic relationship (Fig 3) and was correlated to the number of guinea pigs imported. Blocks 1–4 were treated as one block because they share a single, recent common ancestor, indicative of a single introduction.

Supporting information

S1 Fig. ADMIXTURE analysis for K = 4 genetic clusters shows that similar genotypes tend to cluster within a block.

The colors of each pie chart represent the likelihood that a sample belongs to each of four genetic clusters. Each color (blue, yellow, orange, grey) represent a unique genetic cluster. There is significant clustering of identical genotypes within blocks 1–4, 5, and 7. Block 6 contains an exceptionally diverse population of T. cruzi. Block 8–12 have few samples per block, but the samples collected in each block are distinct from those collected elsewhere in the district. Blocks are displayed in a grid to maintain privacy.

https://doi.org/10.1371/journal.pntd.0007910.s001

(TIF)

S2 Fig. Principal component analysis shows genetic similarity of T. cruzi collected from the same city block within the Mariano Melgar district.

PCA was calculated using 56 samples. The centroids of all samples from each city block with standard error bars are shown here. Each of the seven samples collected in Block 6 are represented by a red X because they occupy disparate portions of the PCA space. One sample collected in Block 7 is unique and is thus represented by its own point.

https://doi.org/10.1371/journal.pntd.0007910.s002

(TIF)

S3 Fig. Using only presence/absence for each of 474 recombination events in place of genotype data, the T. cruzi isolates collected from the same block are genetically highly similar.

Violin plots show the number of pairwise differences in unique recombination events. Difference between all pairs of 123 samples, among pairs of samples collected in different blocks from the same district, and among pairs of samples collected in the same block, are shown. These results recapitulate the results found using genome-wide SNP data: The distribution of differences among samples between blocks among districts is larger than the distribution among all samples, the differences among samples within blocks are significantly smaller than expected given the diversity in the dataset (p<0.001). The results that genetically-similar samples cluster in blocks while the overall diversity is distributed among districts do not vary regardless of the data set analyzed (recombination events or genome-wide SNP data).

https://doi.org/10.1371/journal.pntd.0007910.s003

(TIF)

S4 Fig. The density of guinea pigs per block is correlated with the number of T. infestans vectors in Mariano Melgar.

The average number of non-human hosts and T. infestans vectors was counted for each house along a 2km transect in Mariano Meglar. The distribution of guinea pigs along the transect is similar to the distribution of the vector. The number of dogs and chickens remains relatively constant across the transect and is not correlated with vector population densities. The y-axis depicts the average count of each species per house. Approximate location along the transect of numbered blocks from Fig 1 are shown.

https://doi.org/10.1371/journal.pntd.0007910.s004

(TIF)

S5 Fig. Many blocks with active T. infestans populations remain uninfected with T. cruzi.

Blocks containing infected vectors are colored red, blocks where only uninfected vectors were found are colored orange, and blocks where no vectors were detected are colored white [60]. Blocks where an infected vector was found but the parasite genome was not sequenced here are represented by red blocks without block numbers. Blocks above the blue line were meticulously surveyed for parasites at four timepoints. Blocks below the blue line were surveyed by the Ministry of Health. T. cruzi-infected vectors were found in three additional blocks in Mariano Melgar outside of this region.

https://doi.org/10.1371/journal.pntd.0007910.s005

(TIF)

S6 Fig. Cross-validation scores for each genetic cluster (K) averaged across 100 iterations.

Standard error bars are shown for each value. K = 4 was determined to be the optimal number of genetic clusters.

https://doi.org/10.1371/journal.pntd.0007910.s006

(TIF)

S1 Table. Using only presence/absence for each of 474 recombination events in place of genotype data, AMOVA performed across Arequipan T. cruzi isolates shows significant structure within blocks.

https://doi.org/10.1371/journal.pntd.0007910.s007

(DOCX)

S2 Table. Sample collection location, source, and year.

https://doi.org/10.1371/journal.pntd.0007910.s008

(DOCX)

S1 Appendix. Maximum clade credibility phylogenetic reconstruction.

Newick file allows enhanced visualization of the phylogeny (Fig 1) using software including FigTree (available at http://beast.bio.ed.ac.uk). By default, tips are labeled with sample and branches in major clades are colored by location. Collection locations (district and block number within district) and dates for each sample can be found in S2 Table.

https://doi.org/10.1371/journal.pntd.0007910.s009

(NWK)

Acknowledgments

The authors would like to thank Philippe Lemey for his advice regarding the use of Bayesian phylogenetics. The authors would also like to acknowledge Stephanie Seifert and Jill Devine for their assistance in the laboratory. The authors gratefully acknowledge the members of the Universidad Peruana Cayetano Heredia and the University of Pennsylvania Zoonotic Disease Research Lab in Arequipa, Peru, for their contributions, especially Carlos Condori and Luis Zamudio. The authors also thank Danitza Pamo, Jose Ylla, Jose Qusipe, Paul Picardo and Gabriela Bustamante for their contribution during the isolation and maintenance of the T. cruzi strains. In addition, the authors wish to acknowledge the advisory support provided by the following institutions: Ministerio de Salud del Perú (MINSA), the Dirección General de Salud de las Personas (DGSP), the Estrategia Sanitaria Nacional de Prevención y Control de Enfermedades Metaxenicas y Otras Transmitidas por Vectores (ESNPCEMOTVS), the Dirección General de Salud Ambiental (DIGESA), the Gobierno Regional de Arequipa, the Gerencia Regional de Salud de Arequipa (GRSA), the PanAmerican Health Organization (PAHO/OPS) and the Canadian International Development Agency (CIDA).

References

  1. 1. Theoharides KA, Dukes JS. Plant invasion across space and time: factors affecting nonindigenous species success during four stages of invasion. New Phytol. 2007;176: 256–273. pmid:17822399
  2. 2. Alirol E, Getaz L, Stoll B, Chappuis F, Loutan L. Urbanisation and infectious diseases in a globalised world. Lancet Infect Dis. 2011;11: 131–141. pmid:21272793
  3. 3. Berry ASF, Salazar-Sánchez R, Castillo-Neyra R, Borrini-Mayorí K, Chipana-Ramos C, Vargas-Maquera M, et al. Immigration and establishment of Trypanosoma cruzi in Arequipa, Peru. PLoS One. 2019;14: e0221678. pmid:31454370
  4. 4. Pedersen D, Tremblay J, Errázuriz C, Gamarra J. The sequelae of political violence: assessing trauma, suffering and dislocation in the Peruvian highlands. Soc Sci Med. 2008;67: 205–217. pmid:18423959
  5. 5. Levy MZ, Barbu CM, Castillo-Neyra R, Quispe-Machaca VR, Ancca-Juarez J, Escalante-Mejia P, et al. Urbanization, land tenure security and vector-borne Chagas disease. Proc Biol Sci. 2014;281: 20141003. pmid:24990681
  6. 6. Khatchikian CE, Foley EA, Barbu CM, Hwang J, Ancca-Juárez J, Borrini-Mayori K, et al. Population structure of the Chagas disease vector Triatoma infestans in an urban environment. PLoS Negl Trop Dis. 2015;9: e0003425. pmid:25646757
  7. 7. Foley EA, Khatchikian CE, Hwang J, Ancca-Juárez J, Borrini-Mayori K, Quıspe-Machaca VR, et al. Population structure of the Chagas disease vector, Triatoma infestans, at the urban-rural interface. Mol Ecol. 2013;22: 5162–5171. pmid:24103030
  8. 8. QGIS Development Team. QGIS Geographic Information System. Open Source Geospatial Foundation Project; 2019.
  9. 9. Berry ASF, Salazar-Sánchez R, Castillo-Neyra R, Borrini-Mayorí K, Chipana-Ramos C, Vargas-Maquera M, et al. Sexual reproduction in a natural Trypanosoma cruzi population. PLoS Negl Trop Dis. 2019;13: e0007392. pmid:31107905
  10. 10. Schwabl P, Imamura H, Van den Broeck F, Costales JA, Maiguashca-Sánchez J, Miles MA, et al. Meiotic sex in Chagas disease parasite Trypanosoma cruzi. Nat Commun. 2019;10: 3972. pmid:31481692
  11. 11. Kim ES. Modeling Domestic Guinea Pigs and Trypanosoma cruzi Persistence in Arequipa, Peru. Undergraduate thesis. 2013.
  12. 12. Mackin-Rogalska R, Pinowski J, Solon J, Wojcik Z. Changes in vegetation, avifauna, and small mammals in a suburban habitat. Polish Ecological Studies. 1988;14: 293–330.
  13. 13. Kowarik I. On the Role of Alien Species in Urban Flora and Vegetation. Plant Invasions-General Aspects and Special Problems. 1995; 85–103.
  14. 14. Denys C, Schmidt H. Insect communities on experimental mugwort (Artemisia vulgaris L.) plots along an urban gradient. Oecologia. 1998;113: 269–277. pmid:28308207
  15. 15. McIntyre NE. Ecology of Urban Arthropods: A Review and a Call to Action. Ann Entomol Soc Am. 2000;93: 825–835.
  16. 16. Blair RB. Birds and butterflies along urban gradients in two ecoregions of the United States: Is urbanization creating a homogeneous fauna? Biotic Homogenization. 2001; 33–56.
  17. 17. McKinney ML. Urbanization, biodiversity, and conservation. BioScience. 2002;52: 883–890.
  18. 18. Levy MZ, Quíspe-Machaca VR, Ylla-Velasquez JL, Waller LA, Richards JM, Rath B, et al. Impregnated netting slows infestation by Triatoma infestans. Am J Trop Med Hyg. 2008;79: 528–534. pmid:18840739
  19. 19. Herrer A. Trypanosomiasis americana en el Perú: III. Importancia del cobayo como reservorio de la enfermedad de Chagas en la región sudoccidental. Revista Peruana de Medicina Experimental y Salud Publica. 1955;9: 45–55.
  20. 20. Cohen JE, Gürtler RE. Modeling household transmission of American trypanosomiasis. Science. 2001;293: 694–698. pmid:11474111
  21. 21. Levy MZ, Bowman NM, Kawai V, Waller LA, Cornejo del Carpio JG, Cordova Benzaquen E, et al. Periurban Trypanosoma cruzi-infected Triatoma infestans, Arequipa, Peru. Emerging Infect Dis. 2006;12: 1345–1352. pmid:17073082
  22. 22. Coffield DJ, Spagnuolo AM, Shillor M, Mema E, Pell B, Pruzinsky A, et al. A model for Chagas disease with oral and congenital transmission. PLoS One. 2013;8: e67267. pmid:23840647
  23. 23. Levy MZ, Tustin A, Castillo Neyra R, Mabud TS, Levy K, Barbu CM, et al. Bottlenecks in domestic animal populations can facilitate the emergence of Trypanosoma cruzi, the aetiological agent of Chagas disease. Proc R Soc B. 2015;282: 20142807. pmid:26085582
  24. 24. Castillo-Neyra R, Borrini Mayorí K, Salazar Sánchez R, Ancca Suarez J, Xie S, Náquira Velarde C, et al. Heterogeneous infectiousness in guinea pigs experimentally infected with Trypanosoma cruzi. Parasitol Int. 2016;65: 50–54. pmid:26432777
  25. 25. Barbu CM, Hong A, Manne JM, Small DS, Quintanilla Calderón JE, Sethuraman K, et al. The effects of city streets on an urban disease vector. PLoS Comput Biol. 2013;9: e1002801. pmid:23341756
  26. 26. Angold PG, Sadler JP, Hill MO, Pullin A, Rushton S, Austin K, et al. Biodiversity in urban habitat patches. Sci Total Environ. 2006;360: 196–204. pmid:16297440
  27. 27. Cheptou PO, Carrue O, Rouifed S, Cantarel A. Rapid evolution of seed dispersal in an urban environment in the weed Crepis sancta. Proc Natl Acad Sci USA. 2008;105: 3796–3799. pmid:18316722
  28. 28. Niemelä J, Kotze DJ. Carabid beetle assemblages along urban to rural gradients: A review. Landsc Urban Plan. 2009;92: 65–71.
  29. 29. Rabinovich JE, Wisnivesky-Colli C, Solarz ND, Gürtler RE. Probability of transmission of Chagas disease by Triatoma infestans (Hemiptera: Reduviidae) in an endemic area of Santiago del Estero, Argentina. Bull World Health Organ. 1990;68: 737–746. pmid:2127382
  30. 30. Kribs-Zaleta CM, Mubayi A. The role of adaptations in two-strain competition for sylvatic Trypanosoma cruzi transmission. J Biol Dyn. 2012;6: 813–835. pmid:22877420
  31. 31. Kowarik I. Novel urban ecosystems, biodiversity, and conservation. Environ Pollut. 2011;159: 1974–1983. pmid:21435761
  32. 32. Ceballos LA, Vazquez-Prokopec GM, Cecere MC, Marcet PL, Gürtler RE. Feeding rates, nutritional status and flight dispersal potential of peridomestic populations of Triatomainfestans in rural northwestern Argentina. Acta Trop. 2005;95: 149–159. pmid:15993834
  33. 33. Richer W, Kengne P, Cortez MR, Perrineau MM, Cohuet A, Fontenille D, et al. Active dispersal by wild Triatoma infestans in the Bolivian Andes. Trop Med Int Health. 2007;12: 759–764. pmid:17550473
  34. 34. Pinto Dias JC. Human chagas disease and migration in the context of globalization: some particular aspects. J Trop Med. 2013;2013: 789758. pmid:23606862
  35. 35. Breniere SF, Llanos B, Tibayrenc M, Desjeux P. Isoenzymic studies and epidemiological data of Trypanosoma cruzi from Arequipa (Peru), Pacific side. Ann Soc belge Méd trop. 1985;65: 63–66.
  36. 36. Llewellyn MS, Miles MA, Carrasco HJ, Lewis MD, Yeo M, Vargas J, et al. Genome-scale multilocus microsatellite typing of Trypanosoma cruzi discrete typing unit I reveals phylogeographic structure and specific genotypes linked to human infection. PLoS Pathog. 2009;5: e1000410. pmid:19412340
  37. 37. Fabrizio MC, Schweigmann NJ, Bartoloni NJ. Analysis of the transmission of Trypanosoma cruzi infection through hosts and vectors. Parasitology. 2016;143: 1168–1178. pmid:27039662
  38. 38. Flores-Ferrer A, Waleckx E, Rascalou G, Dumonteil E, Gourbière S. Trypanosoma cruzi transmission dynamics in a synanthropic and domesticated host community. PLoS Negl Trop Dis. 2019;13: e0007902. pmid:31834879
  39. 39. Castillo-Neyra R, Barbu CM, Salazar R, Borrini K, Naquira C, Levy MZ. Host-seeking behavior and dispersal of Triatoma infestans, a vector of Chagas disease, under semi-field conditions. PLoS Negl Trop Dis. 2015;9: e3433. pmid:25569228
  40. 40. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9: 357–359. pmid:22388286
  41. 41. Aslett M, Aurrecoechea C, Berriman M, Brestelli J, Brunk BP, Carrington M, et al. TriTrypDB: a functional genomic resource for the Trypanosomatidae. Nucleic Acids Res. 2010;38: D457–62. pmid:19843604
  42. 42. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20: 1297–1303. pmid:20644199
  43. 43. Poplin R, Ruano-Rubio V, DePristo MA, Fennell TJ, Carneiro MO, Van der Auwera GA, et al. Scaling accurate genetic variant discovery to tens of thousands of samples. BioRxiv. 2017;
  44. 44. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43: 491–498. pmid:21478889
  45. 45. Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013;11: 11.10.1–11.10.33. pmid:25431634
  46. 46. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27: 2156–2158. pmid:21653522
  47. 47. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81: 559–575. pmid:17701901
  48. 48. Wickham H. ggplot2 - Elegant Graphics for Data Analysis. New York, NY: Springer-Verlag New York; 2016. https://doi.org/10.1007/978-0-387-98141-3
  49. 49. R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing. 2017;
  50. 50. Kamvar ZN, Tabima JF, Grünwald NJ. Poppr: an R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ. 2014;2: e281. pmid:24688859
  51. 51. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23: 2633–2635. pmid:17586829
  52. 52. Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29: 1969–1973. pmid:22367748
  53. 53. Bouckaert R, Heled J, Kühnert D, Vaughan T, Wu C-H, Xie D, et al. BEAST 2: a software platform for Bayesian evolutionary analysis. PLoS Comput Biol. 2014;10: e1003537. pmid:24722319
  54. 54. Heled J, Drummond AJ. Bayesian inference of population size history from multiple loci. BMC Evol Biol. 2008;8: 289. pmid:18947398
  55. 55. Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior summarization in bayesian phylogenetics using tracer 1.7. Syst Biol. 2018;67: 901–904. pmid:29718447
  56. 56. Suchard MA, Rambaut A. Many-core algorithms for statistical phylogenetics. Bioinformatics. 2009;25: 1370–1376. pmid:19369496
  57. 57. Ayres DL, Darling A, Zwickl DJ, Beerli P, Holder MT, Lewis PO, et al. BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics. Syst Biol. 2012;61: 170–173. pmid:21963610
  58. 58. Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19: 1655–1664. pmid:19648217
  59. 59. Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23: 1801–1806. pmid:17485429
  60. 60. Rose EB, Lee K, Roy JA, Small D, Ross ME, Castillo-Neyra R, et al. Risk maps for cities: Incorporating streets into geostatistical models. Spat Spatiotemporal Epidemiol. 2018;27: 47–59. pmid:30409376