Phytophthora infestans (Mont.) de Bary, the causal agent of potato late blight, was responsible for the Irish potato famine of the 1840s. Initial disease outbreaks occurred in the US in 1843, two years prior to European outbreaks. We examined the evolutionary relationships and source of the 19th-century outbreaks using herbarium specimens of P. infestans from historic (1846–1970) and more recent isolates (1992–2014) of the pathogen. The same unique SSR multilocus genotype, named here as FAM-1, caused widespread outbreaks in both US and Europe. The FAM-1 lineage shared allelic diversity and grouped with the oldest specimens collected in Colombia and Central America. The FAM-1 lineage of P. infestans formed a genetic group that was distinct from more recent aggressive lineages found in the US. The US-1 lineage formed a second, mid-20th century group. Recent modern US lineages and the oldest Mexican lineages formed a genetic group with recent Mexican lineages, suggesting a Mexican origin of recent US lineages. A survey of mitochondrial haplotypes in a larger set of global herbarium specimens documented the more frequent occurrence of the HERB-1 (type Ia) mitochondrial haplotype in archival collections from 1866–75 and 1906–1915 and the rise of the Ib mitochondrial lineage (US-1) between 1946–1955. The FAM-1 SSR lineage survived for almost 100 years in the US, was geographically widespread, and was displaced first in the mid-20th century by the US-1 lineage and then by distinct new aggressive lineages that migrated from Mexico.
Citation: Saville AC, Martin MD, Ristaino JB (2016) Historic Late Blight Outbreaks Caused by a Widespread Dominant Lineage of Phytophthora infestans (Mont.) de Bary. PLoS ONE 11(12): e0168381. doi:10.1371/journal.pone.0168381
Editor: Mark Gijzen, Agriculture and Agri-Food Canada, CANADA
Received: May 4, 2016; Accepted: November 29, 2016; Published: December 28, 2016
Copyright: © 2016 Saville et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files. Sequences of representative haplotypes for each locus were deposited in NCBI GenBank (Intron Ras: accession numbers KU720599 – KU720609, KU720611; ras (remainder): accession numbers KU720612 – KU720620, KY124131; PiAVR2: accession numbers KU720597 – KU720598; P3: accession numbers KU720593 – KU720596).
Funding: This work was partially supported by the National Institute of Food and Agriculture, U.S. Department of Agriculture (https://nifa.usda.gov/), under award number 2011-68004-30154 to JBR and by the North Carolina Agricultural Research Service to JBR. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Movement of plant pathogens into new geographic ranges and expansion into new hosts is a major factor in the emergence of novel virulent lineages that threaten food security . The oomycete pathogen Phytophthora infestans (Mont.) de Bary exemplifies this threat and causes potato late blight. This disease is the most important biotic constraint to potato production globally . Historically, potato late blight caused massive food insecurity during Ireland’s famine of the 1840s . Long distance movement of the pathogen into new geographic areas occurs primarily via the movement of infected plant materials including potato tubers, tomato fruit, and transplants.
The current global population structure of P. infestans results from a series of migrations and displacements of clonal lineages [4–9]. While populations in Mexico, the Netherlands, Scandinavia, and other parts of Europe are sexual [6, 10–12], the genetic structure of P. infestans in the US and Canada has been simpler and largely consists of new introductions of a few multilocus genotypes that have been displaced over time [13–15]. This is despite the occurrence of both A1 and A2 mating types in some areas of the US and Canada and ephemeral sexual populations [16, 17].
Although we have improved our ability to track recent outbreaks of late blight [6, 14], an unresolved question of significance is the source of inoculum for historic 19th-century US and European outbreaks of the disease. Some data support a Mexican source of the disease [7, 18, 19] while other data support an Andean source of the disease [3, 20–22] or a US metapopulation emergence of the pathogen . In the US, the disease first appeared in 1843, two years prior to European outbreaks, around the ports of Philadelphia and New York [3, 24]. Early scientists believed the disease originated in the Andean region of South America and cite reports from New World explorers indicating that the disease was known to native people in the Andes [22, 25, 26]. The burgeoning bat guano trade for fertilizer from Peru and failure of the European potato crop due to dry rot resulted in increased shipments of potatoes from the Andean region into both US and European ports . In addition, the closely related hybrid species P. andina occurs only in the Andean region and shares an ancestral haplotype lineage with famine-era lineages of P. infestans [27–30], thus suggesting an Andean outbreak source.
A single clonal lineage named US-1 (Ib mitochondrial DNA (mtDNA), which was globally distributed in the mid-20th century, was suspected to be the cause of ‘famine-era’ outbreaks of P. infestans  until PCR amplification of mitochondrial genes from 19th-century samples documented an entirely different Ia mtDNA haplotype [8,9]. We concluded that another migration must have led to the widespread occurrence of the US-1 clonal lineage in the mid-20th century . Two independent labs recently sequenced whole genomes from historic New and Old World P. infestans collected between 1845 and 1896 from herbarium samples [23, 32, 33, 34]. Examination of these genomes indicated the presence of a unique genotype, which was named HERB-1 for both mitochondrial and nuclear genomes by Yoshida et al. after examination of one North American and ten European genomes. . Although it was suggested that the HERB-1 lineage was extinct or rare in modern populations [23, 35], Martin et al.  sequenced 44 additional mitogenomes and determined the HERB-1/Ia mitochondrial lineage to be extant in both Mexican and South American populations. Martin et al. found that the HERB-1 mitochondrial lineage was also present in the Andean hybrid species P. andina Adler & Flier, sp. nov in a study that included genomic data from both Yoshida et al. and Martin et al. and that the mitochondrial lineage diverged into several clades [23, 27, 32, 33]. Thus, HERB-1 is not exclusive to P. infestans.
We collected a larger set of historic specimens of P. infestans from North, Central, and South America (S1 Table) than previous studies and examined the population genetic structure to confirm that New and Old World outbreaks were caused by the same lineage of P. infestans and to determine how widespread the lineage was. We sequenced nuclear and mitochondrial genes as well as 12 nuclear single sequence repeat (SSR) loci to resolve the population structure and evolutionary relationships of historic New and Old World P. infestans and to characterize the SSR lineage from original outbreaks The primary objectives of our work were to: 1) Examine the evolutionary relationships of P. infestans from the first historic US late blight outbreaks; 2) Compare the genetic structure of these populations to historic European P. infestans; 3) Examine the genetic relationships of historic to modern US aggressive lineages; and 4) Explore migration scenarios that best describe the source of the first US historic outbreaks of P. infestans.
Materials and Methods
Phytophthora infestans was sampled from both historic US and European herbarium vouchers as well as modern isolates obtained from cultures derived from infected host tissue. A total of 183 samples were analyzed from historic US (1855–1958), European (1846–1970), South American (1913–1929), Central American (1941–1956), Mexican (1948–1966), and more recent global sources (1992–2014) (S1 Table). The historic collections tested (n = 66) include the oldest existing US specimen, collected in 1855, and one of the oldest existing European specimens, collected in 1846. Modern samples collected in North America (n = 34) included samples from the early 1950s-1990s (US-1, US-6, US-7) and recent US lineages (US-8, US-11, US-22, US-23, US-24)[6, 14, 15]. Sampled regions outside of North America (n = 83) include South America (Bolivia, Brazil, Peru, Ecuador, Colombia), Central America (Costa Rica, Guatemala, Nicaragua), Ireland, Canada, and Mexico. Two historic cultures (P445 and PA 222) of P. infestans collected in the 1950s and 1960s were shared by Mannon Gallegly. The P445 isolate is an A2 mating type and was collected in Mexico and is notable for its use in the US Cold War late blight research at Fort Detrick. The PA 222 is an A1 mating type and was collected in Pennsylvania.
For specific analyses, samples were divided into eight populations: historic US (USHist), historic European (EUHist), US-1 lineages (US-1; Ib mtDNA haplotype), modern US lineages (USAGG), South America (SA), Central America (CA), Mexico (MEX), and Ireland (IRE). With the exception of US-1, all groups were divided based on spatio-temporal boundaries.
DNA extraction, PCR and sequencing
DNA from modern isolates was extracted using either a hexadecyltrimethylammonium bromide (CTAB) method  or a modification of the quick sodium hydroxide extraction method described by Wang et al. . DNA from herbarium samples was extracted using a modified CTAB method and a Qiagen DNEasy Plant Mini Kit (Qiagen, Valencia, CA) . All work with historic DNA was done in BSL-2 laboratories in the Phytotron and the Genomic Sciences Laboratory at NC State with separate reagents and equipment. No modern DNA was used in these labs. A subset of the samples used in this study were subjected to next-generation sequencing analyzed at the Centre for GeoGenetics (University of Copenhagen). The subsequent genomic analyses have been published [32,33].
Two nuclear loci (ras and PiAVR2) and one mitochondrial (P3) locus were sequenced (S2 Table). For the nuclear locus ras, two regions were amplified including intron 1 (Intron Ras; 349 bp with IRF/IRR) located in the 5’ untranslated region of the gene and a 600 bp portion (with RASF/RASR) covering part of exon 3, exon 4, exon 5, part of exon 6, and introns 3 and 4 [21, 37]. For herbarium samples, 224 bp of intron 1 were sequenced with IRF/IRR and the larger 600 bp region was sequenced with two sets of primers (RAS1F/RAS1R and RAS2F/RAS2R) that amplified the polymorphic sites in two smaller regions (162 and 245 bp) (S2 Table). The AVR gene, PiAVR2, was amplified with primers AVR2F1 and AVR2R2 . For herbarium samples, a smaller region (200 bp) nested within the gene was amplified with primers AVR2F4 and qRT-PCR-R. For the mitochondrial locus P3 for modern isolates, a 1446 bp region was amplified with primers F3/R3 . The P3 region includes the genes rp114, rp15, and tRNAs. For herbarium samples, a smaller 492 bp region nested within the P3 region was amplified with primers P3H4F/ P3H6R [8, 9].
Martin et al.  documented the presence of a single SNP within the mitochondrial genome that distinguished the HERB-1 mitochondrial lineage from all other known lineages. Primers were developed (nad11F/nad11R, S2 Table) that amplified a 180 bp region around the target SNP. Amplicons produced were sequenced and utilized to detect the presence of the HERB-1 mitochondrial lineage in historic samples. The complete mitogenome sequence of HERB-1 has been submitted to the Sequence Read Archive . The mitochondrial haplotype of P. infestans in a larger set of herbarium samples was identified using methods reported elsewhere [8, 39]. The frequency of occurrence of the Ib and Ia/HERB-1 mitochondrial lineages was calculated over time.
PCR reactions were performed in 50 μL volumes for each sample. Each reaction contained 5 μL of 10X PCR buffer (Genesee, San Diego, CA), 2.5 μL dNTPs (2 mM per nucleotide), 2 μL each 10 μM forward and reverse primer, 1.8 μL MgCl2 (50 mg/mL), 0.25 μL BSA (20 mg/mL), 0.2 μL Taq (5 U/μL)(Genesee, San Diego, CA), and 5–10 ng of genomic DNA. For herbarium samples, reaction volumes were decreased to 25 μL. Thermal cycling conditions for nuclear genes were 96°C (1 min); then 35 cycles of 96° (1 min), 55° (1 min), 72° (2 min); and a final extension of 72° (10 min). For mitochondrial gene regions, thermal cycling conditions followed Griffith and Shaw . PCR reactions were run at least twice.
Gene sequences were determined for some of the target loci (S2 Table) within Illumina-sequenced P. infestans isolates by creating multiple sequence alignments of genotype calls from Martin et al. . BAM files of sequences mapped to the T30-4 reference genome (available on the Sequence Read Archive under accession SRP055472). Read alignments to the reference genome had previously been optimized with the RealignerTargetCreator and IndelRealigner tools included in the software Genome Analysis Toolkit (GATK) v1.3 . GATK was used to perform the genotype calling, requiring a minimum PHRED-scaled genotype quality score of 20.0. Genotypes not fulfilling this requirement were masked from the alignment.
P. infestans SSR loci were genotyped using a modified version of the protocol for 12-plex single sequence repeat genotyping as described previously . The Qiagen Type-It Microsatellite PCR kit (Qiagen Corporation, Valenica CA) was used for PCR reactions, and sample volumes were modified to run a 12.5 μL reaction by using 6.25 μL 2X Type-It Master Mix, 1.25 μL of a 10X multiplex primer master mix, 4 μL PCR grade water, and 1–2 μL of template DNA (5–10 ng). Thermal cycling conditions followed Danies et al. . Fragments were analyzed on an Applied Biosystems 3730xl DNA analyzer at the Genomic Sciences Laboratory at North Carolina State University. Alleles were scored manually using Peak Scanner 2 (Applied Biosystems, Foster City, CA), and fragment lengths were rounded to the nearest whole number for analysis.
SSR genotyping analyses
Analysis of SSR genotypes was conducted using the program Structure v.2.3.3 . The data were run using a 20,000 repeat burn-in and 1,000,000 MCMC repeats under an admixture model. Independent runs of the model used K values from 1 to 10 with 20 replicate runs at each value of K. The optimal K was estimated using the Evanno method in the web tool Structure Harvester . In addition, the optimal K was inferred through direct observation of groupings of the samples by their assigned Q values. All runs for the optimal K values, as well as non-optimal K values, were averaged using CLUMPP v. 1.1.2  and visualized with the program Distruct v. 1.1. . The geographic distribution of SSR genotypes based on Structure results was examined by mapping samples based on K value onto maps of Europe, Latin America, and the United States. To further visualize groupings, a discriminant analysis of principal components (DAPC) was conducted using the R library adegenet . The R library Poppr  was used to infer population statistics on a clone corrected dataset of the SSR genotypes.
Gene sequence analysis
All statistical analyses of the nucleotide sequences were performed in SNAP Workbench version 2.0 . All sequences were aligned manually and edited using BioEdit . Multiple sequence alignment was also performed in Clustal W . Polymorphisms were examined on the chromatograms and heterozygous sites were determined. Sequences were collapsed into unique haplotypes using SNAP Map  after removing insertions and deletions (indels) from each of the aligned multilocus data sets and excluding infinite-sites violations. Resultant haplotype data sets were used to examine the overall support or conflict among the variable sites in the DNA sequence alignment. A site compatibility matrix was generated from each haplotype data set using SNAP Clade . Compatibility matrices were used to examine compatibility/incompatibility among all variable sites, with any resultant incompatible sites removed from the data set. Data sets were also evaluated using Kwarg  for estimating the minimum number of recombination events and constructing ancestral recombination graphs (ARG). Conflicting data partitions or putative recombinant haplotypes were excluded from further analyses, except when testing for population subdivision using Hudson’s test statistics as recombination increases the power of these tests [54–56]. Non-recombining data sets were collapsed into unique haplotypes excluding infinite-sites violations using SNAP Map. The ras dataset contained 12 of the 14 documented SNPs in the dataset, while no recombination was detected in PiAVR2 or in P3.
Neutrality tests and population subdivision
The nuclear and mitochondrial DNA sequences were analyzed using Arlequin . For each locus in each population, the population mean mutation rate per nucleotide site θw was calculated using Watterson’s θ , based on the number of segregating sites, s, and the average pairwise nucleotide diversity, π  were estimated. Different tests of neutrality including Tajima’s D and Fu’s Fs statistic [60, 61] were performed in order to determine if the data were consistent with the expectations of the neutral model of molecular evolution.
Ancestral recombination graphs and coalescent analyses were generated using the loci ras, PiAVR2, and P3. Coalescent analyses were conducted using genetree  as implemented in SNAP Workbench. Watterson’s theta (θ) was used for estimations of the population neutral mutation rate, and was estimated for each dataset using the program simple theta . Analyses were performed as subdividing populations with 10 million runs. For the purposes of subdivision, samples were designated as South American (SA) or not South American (NSA). The trees were generated five (P3) and fifteen (ras) times, each using different random seeds to evaluate convergence. The representative tree was chosen by examining the consensus of topology and mutation structure between all trees.
Gene flow and migration
We used the ras sequence data and IM to calculate migration rates between populations for US historic, South American, and Mexican lineages and US-1, South American, and Mexican lineages . These results were corroborated using SSR genotypes with tests of potential migration patterns using Approximate Bayesian Comparison (ABC), as implemented in the program DIYABC v. 2.0.4 . Tested migration scenarios for both US historic and US-1 populations included direct divergence from South America or Mexico, admixture with South America and Mexico populations, and admixture between South America or Mexico and an unsampled population. To further explore US and EU historic population migration scenarios, an additional migration scenario set was tested using US historic, EU historic, and South American populations. Parameter range priors were initialized with values from Goss et al.  and then iteratively modified to better fit our data (S3 Table). Scenario probabilities were determined through comparison of the observed dataset to simulated datasets generated by DIYABC. A logistic regression of these differences was computed using ten proportions of the simulated dataset as the dependent variable and corresponding differences between the observed and simulated datasets as the independent variable. The highest value was taken as the scenario’s overall probability. Confidence in the four highest scenarios was evaluated using type I and type II error tests, in which the data were compared against 500 simulated data sets and the number of times the scenario in question was correctly or incorrectly applied to the data was determined.
Samples were divided into eight populations by geography and time: historic US (USHist), historic European (EUHist), US-1 lineages (US-1; Ib mtDNA haplotype), modern US lineages (USAGG), South America (SA), Central America (CA), Mexico (MEX), and Ireland (IRE). A total of 179 multilocus genotypes (MLG) were detected within 12 microsatellite loci. The greatest number of MLGs was observed within the US historic (USHist), US Aggressive (USAGG) and South America (SA) populations, while the least number of MLGs was observed among modern Irish (IRE) populations. USHist, USAGG and SA lineages had the highest MLG diversity indices. The Index of Association (Ia) was calculated for clone corrected data (S4 Table). Populations from Mexico (MEX) had the lowest Ia, indicating the least inter-locus linkage, and the hypothesis of sexual reproduction could not be rejected. The highest Ia was observed in Central America (CA) and SA populations, indicating no linkage among markers and clonal populations.
From these genotypes, we inferred population structure using Structure and used both Structure Harvester and direct observation of cluster assignment probabilities for grouping individuals. The optimal K value determined by Structure Harvester was K = 2, and via observation of probabilities was K = 4. Both the New World historic (USHist) and Old World historic (EUHist) outbreaks belonged exclusively to the same genetic cluster, representing a nuDNA SSR lineage that was named FAM-1 (Fig 1). This FAM-1 SSR lineage consisted of the oldest specimens and was well differentiated from the remaining three clusters at all values of K (Fig 2). In addition to all USHist and EUHist samples, this genetic group also contained P. infestans genotypes recovered from the two oldest South American (SA) samples from Colombia collected in 1913 and 1929 and the oldest sample from Costa Rica collected in 1942. With K = 4, the US-1 genotype (mtDNA haplotype Ib) formed a second distinct genetic group (Figs 1 and 2). The US-1 genotype was found among global populations from the US, Europe, and South America from 1931–1994. US-1 was observed in the greatest numbers in the herbarium records from the post-WWII era from 1946–1955 (Fig 3). The historic isolate PA222, collected in the US and studied during the US Cold War-era bioweapons program in the 1950s, was also assigned to the US-1 cluster. At both K = 3 and K = 4, a third group was observed that included the US-23 genotype and genotypes from South America and Ireland (Fig 2). A fourth genetic group was comprised of several historic and recent genotypes from Mexico (MEX), Central America (CA), and the rest of the US Aggressive (USAGG) genotypes (S1 Table).
Small blue circles indicate FAM-1 SSR lineage found before 1900, and larger circles indicate samples collected after 1900. Samples were assigned to genetic clusters based on the highest Q value and correspond to K = 4 groups by color as in Fig 2. The FAM-1 lineage was geographically widespread. Samples are located by state in the US and by country in the rest of the world. Maps reprinted from www.d-maps.com under a CC BY license, with permission from Daniel Dalet, original copyright 2007–2016.
Optimal value for K was inferred to be K = 2 based on analysis by Structure Harvester and K = 4 based on evaluation of probabilities . The oldest SSR lineage, FAM-1 consisted of samples from US (1855–1958) and EU Historic (1846–1970) outbreaks. Black arrows indicate specimens from the oldest South and Central American outbreaks that were also identified as FAM-1 lineages. A complete list of samples is shown in the S1 Table.
Mitochondrial haplotypes were identified from P. infestans in herbarium collections shown in the S1 Table. Mitochondrial lineages were determined by sequencing the P2 mitochondrial region around the Msp1 restriction site present in type Ib haplotypes and the rare allele found in Nad11 in the HERB-1 mitogenome [27, 39].
Discriminant analysis of principal components (DAPC) was conducted utilizing SSR data. USHist and EUHist populations formed two overlapping populations largely separate from all other groups. The FAM-1 lineage shared allelic diversity with some samples from SA populations (Fig 4). The largest inertia ellipse contained samples from SA and smaller subsets of samples from MEX, US-1 and IRE clustered with SA populations. The US-1 lineage contained samples from both MEX and SA (Fig 4).
Groups are represented by color/symbol as well as by inertia ellipses. Each point represents the SSR genotype of a single sample. Individuals are grouped by US Historic (USHist 1855–1958); European Historic (EUHist 1846–1970); US1: samples identified as mtDNA haplotype Ib or clonal lineage US-1; South American (SA): samples collected in South America; Mexico (MEX): samples collected in Mexico; and Ireland (IRE): samples collected in Ireland. The FAM-1 SSR lineage consisted of US and EU historic samples.
Nuclear and mitochondrial sequence variability
A total of 1259 nucleotides were sequenced, consisting of: 680 nucleotides of the nuclear ras gene (intron 1 and exon regions 3–6); 200 nucleotides of the PiAVR2 gene; and 379 nucleotides in the mitochondrial genome region P3 (rpl14, rpl5 and tRNAs) (S5 Table). Twelve segregating nucleotide sites were identified in the ras gene, including 5 in intron 1 and another 5 in the exons 3–6 that were phylogenetically informative (S6 Table). A total of eleven haplotypes were observed in ras gene sequences, including one additional segregating site not observed previously (S5 and S6 Tables). The greatest number of haplotypes (eight) was found in the IRE populations. Only two haplotypes was found among EUHist, US-1, and CA populations (S5 Table).
One synonymous substitution site and 4 nonsynonymous substitutions were found in exons 3–6 of the ras gene (S6 Table). Populations from IRE, USAGG, and SA had higher nucleotide diversity (π) and mean mutation rates (θW) than populations from MEX (S5 Table). All populations were determined to be neutral based on Tajima’s D and Fu’s Fs statistics. However, the presence of subdivision was detected through the use of Hudson’s statistics, suggesting the populations do not conform to Hardy-Weinberg equilibrium, which is expected at least within clonal lineages. Gene flow between the USHist and EUHist populations was evident (Ks = .0593, Kst = .005, p>0.05) (S7 Table).
One segregating nucleotide site was identified in PiAVR2, and it was phylogenetically informative and resulted in a nonsynonymous substitution site and two haplotypes were observed (S4 Table). Populations from MEX, EUHist, and CA had higher nucleotide diversity and mean mutation rates (θW) than populations from USHist, US-1, SA and IRE (S5 Table).
There were 4 segregating sites identified in the P3 region (S5 Table). Four haplotypes were found in the mitochondrial P3 sequences and all were previously observed by Gómez-Alpizar et al. . Populations from IRE, USAGG and SA had higher nucleotide diversity and mean mutation rates (θW) compared to those from MEX (S5 Table). Since there was one dominant haplotype found in USHist, EUHist, US-1 and CA populations, estimates of π and θW values were null for these loci. For the populations for which neutrality tests could be conducted, all were determined to be neutral.
Phylogeographic source of historic lineages
The largest nonrecombining block of sequence data from the ras gene was utilized for coalescent analysis, and 8 haplotypes were identified (Fig 5). One of the 8 haplotypes (irH5) was unique to historic populations, and was observed only within the USHist population. We used the coalescence process to infer the distribution of mutations on the branches of the tree . The most ancestral mutations observed within USHist and EUHist populations were assigned a South American origin (Fig 5). irH2 and irH3, present in the IRE, USAGG, and SA clusters, formed one of two major lineages identified and the most ancestral mutations from these haplotypes were of South American (SA) origin (Fig 5).
The distribution of mutations for South American (SA) and non-South American (NSA) populations are shown. The frequency of occurrence of haplotypes and the times to the most recent common ancestor are shown in coalescent units for each population examined including US Historic (USHist), European Historic (EUHist), US-1, South American (SA), Central American (CA), Mexican (MEX), US Aggressive (USAGG) and Ireland (IRE). Times are not to scale. Estimates were based on 10 million coalescent simulations and fifteen independent runs to ensure convergence. Simulations were performed assuming constant population size and utilized the largest nonrecombining block of data.
Analyses of ancestral recombination indicated that the one haplotype unique in historic populations was not generated through recombination (S1 Fig). This haplotype (H6), present in USHist, diverged prior to the most ancestral recombination event (S1 Fig). Evidence for a more recent recombination event that gave rise to haplotype H4, unique to USAGG populations, was also observed (S1 Fig). This haplotype was found in the US-11 lineage, a putative recombinant lineage that is found in tomato in the US .
Two haplotypes were observed within the PiAVR2 locus, and they were shared between all historic and modern populations except IRE, in which only H1 was observed (S2 Fig and S8 Table). Four haplotypes were observed within the P3 mitochondrial gene region (rpl14, rp15, and tRNAs), including one haplotype (H4) unique to the SA population and one (H3) unique to the MEX population (S3 Fig and S8 Table). The ancestral recombination graphs indicated no recombination among PiAVR2 or the mitochondrial loci (S3 Fig).
Migration pathways from SA
Mean population mutation rates and numbers of migrants into populations were determined for USHist, EUHist, MEX, SA, and US-1 populations by comparing migrations between paired populations using the ras locus, which was the most phylogenetically informative locus (S9 Table). A greater mean number of migrants was observed moving into EUHist than into USHist for both ras (m2 = 6.63 vs m1 = 4.59) and PiAVR2 (m2 = 6.99 vs m1 = 5.05) (S9 Table). Mean population mutation rates were also higher for EUHist than USHist populations. Thus, asymmetric migration into EUHist populations was observed.
Although migration into both USHist and MEX populations was detected, a higher mean population mutation rate and higher mean number of migrants was observed into USHist (m1 = 6.34) than MEX (m2 = 5.70) populations for ras, indicating asymmetric migration (S9 Table). Additionally, a higher mean population mutation rate and mean number of immigrants was observed into SA (m2 = 6.25) than USHist (m1 = 4.62), also indicating asymmetric migration. Only a slightly higher mean number of migrants and symmetrical migration between SA and USHist populations was observed for PiAVR2 (m2 = 6.49, m1 = 5.87).
Both MEX and SA contributed migrants into US-1. Higher mean numbers of migrants into MEX than US-1 populations (m2 = 7.02 vs m1 = 5.03) and SA than US-1 populations (m2 = 6.18 vs m1 = 4.85) was observed for ras. Similarly, a higher mean number of migrants into MEX than US-1 populations (m2 = 6.48 vs m1 = 4.45) and SA than US-1 populations (m2 = 6.54 vs m1 = 5.78) was observed for PiAVR2 (S9 Table).
Seven different migration scenarios were examined using SSR allele data and ABC analysis among the USHist, SA, and MEX populations. The scenario with the highest probability (Scenario 2, P = 0.504) was chosen as the most likely model. This scenario was a model in which the USHist lineage diverged first from a common ancestor that then diverged into MEX and SA populations (Fig 6a). Confidence in the scenario choice was evaluated by using simulated datasets to calculate error percentages between the three scenarios with the highest probabilities. Estimation of type I error revealed that 65.6% of simulated datasets using this scenario resulted in the highest posterior probability for Scenario 2 when compared to the two scenarios with the next highest probabilities (Scenarios 3, 5) (type I error, 0.344) (S4 Fig and S10 Table).
Posterior probabilities of the top migration scenarios comparing populations from (A) US Historic (USHist), South American (SA) and Mexican (MEX) populations, and (B) US-1, MEX and SA populations. Probabilities are the highest value of a logistic regression of data.
We tested the migration scenarios between US-1, SA, and MEX populations and the scenario with the highest probability (Scenario 3, P = 0.346) was a scenario in which SA populations diverged first from a common ancestor followed by divergence of US-1 and MEX populations (Fig 6b). Estimation of type I error indicated that 68.2% of the datasets simulated using Scenario 3 resulted in the highest posterior probability when compared to the scenarios with the next highest probabilities (Scenarios 4 and 1) (type I error, 0.318) (S5 Fig and S10 Table).
Source of historic US P. infestans
We examined the population structure of historic late blight using one of the largest globally sourced collection of historic specimens (n = 66) and modern samples (n = 117) examined to date. Our data revealed the widespread occurrence of a unique historic nuDNA SSR lineage of P. infestans that we have named the FAM-1 lineage. This lineage was found in historical US specimens collected from 1855 to 1939 (Fig 1) and included specimens reported previously in the FAM phylogenetic group designated by Martin et al. . Interestingly, the oldest known blight-infected herbarium samples from South America (1913, 1939 Columbia; 1942 Costa Rica) also belonged to the FAM-1 SSR lineage. Thus, the FAM-1 lineage was geographically widespread in the US but was also present in Central and South America and persisted for almost 100 years after initial introduction in many geographic areas of the US (Fig 1, blue circles). Our work with the largest set of historic US samples examined to date documents that the same nuclear genome lineage of P. infestans caused epidemics in both the US and Europe as was suggested by others .
Surveys of genotypes to monitor changes in P. infestans populations are regularly carried out in the US and Europe by USAblight and EUROblight project teams using FTA card sampling [4, 6, 14]. These samples are genotyped using the 12-plex SSR protocol, but baseline data about SSR profiles for early lineages prior to US-1 were missing until now . Documentation of a large number of SSR profiles from historic samples from our work now provides a baseline for comparison to new SSR genotypes and will enable large-scale surveys to determine if/where the historic FAM-1 lineage persists.
Yoshida et al. previously sequenced the HERB-1 nuclear and mitochondrial genomes in eleven historic samples, including ten from Europe and one from North America, and came to the same conclusion that the historic outbreaks in the US and Europe were caused by the same lineage . A second paper by Yoshida et al., which analyzed genomes both from the original work and from those sequenced by Martin et al. , brought the total to thirteen genomes and that work also showed a single lineage caused historic US and EU outbreaks . While the HERB-1 mitochondrial and nuclear genomes were initially thought to be extinct [23, 34, 35], work by Martin et al. that involved the sequencing of 44 additional mitogenomes detected the HERB-1 mitochondrial genome in samples of P. infestans from Mexico and Ecuador from the 1980s and 2000s .
The FAM-1 lineage was present in Europe during the first late blight outbreaks (S1 Table). We compared the genetic structure of USHist to EUHist lineages, and show that both New and Old World outbreaks were caused by the same lineage of P. infestans. Shared allelic diversity was observed between US and EU historic lineages for all values of K. Thus, our data suggest that the same FAM-1 lineage caused both the US late blight outbreaks in 1842 and the European outbreaks two years later. Our data also show that the FAM-1 lineage persisted for over 30 years in Europe and with wider examination may likely still be found in present-day European isolates of P. infestans.
The source of the 19th-century late blight outbreaks has been debated [18–20, 25, 30–32, 35]. Our data from multilocus genotyping demonstrate shared allelic diversity between historic lineages in South America, the US, and Europe . Indeed, our current data indicate that US and European famine-era populations share more genetic similarity with the oldest Colombian populations (1913–1929) than the oldest Mexican samples that are known to exist.
Martin et al.  used whole genome sequences from a large set of global samples and recently documented that the divergence time of famine-era European lineages occurred before present-day Mexican and South American lineages. Our ABC data corroborate those findings in that the most likely scenario includes the divergence of the FAM-1 lineage before that of more recent SA and MEX lineages. Our data suggest that the FAM-1 lineage emerged in a US metapopulation from either a South or Central American source, spread to Europe to cause famine-era outbreaks, and survived in the New World for a long period of time after that. The present-day SA lineages in our study are likely reintroductions of the pathogen from Mexican sources as others have suggested .
The oldest published reports of late blight in SA cited by Neiderhauser are in 1887 from Argentina . However, to our knowledge no 19th-century samples exist from either Mexican or South American sources to confirm the 19th-century presence of the pathogen in either region. The oldest South American sample in our study was collected from Colombia and belonged to the FAM-1 SSR lineage. Although it would be interesting to examine even older historic samples from South America and Mexico, we have searched herbarium collections and to our knowledge these specimens do not exist. It would be intriguing to sequence and compare the nuclear and mitochondrial genomes of the oldest USHist and Colombian FAM-1 lineages and compare them with those of the EUHist FAM-1 lineages to further elucidate the temporal sequence of historic introductions to the Old World.
Multilocus sequencing of nuclear genes indicated the presence of haplotypes unique to historic populations. In addition, coalescent analyses of two nuclear loci suggest that USHist and EUHist outbreaks arose from a common South American ancestor. Additional historic evidence, such as increased trade in seed potatoes from South America at the time, [3, 25] suggests a more likely scenario in which infected tubers were moved first to the US and then to Europe, providing a potential migration route and source of disease.
Our analyses indicate more migrants moved into historic European populations from US populations than vice versa, suggesting historic US populations of the pathogen contributed to the European outbreaks more than once. Historic literature also suggests that US outbreaks were a source of subsequent EU outbreaks . There are many reports in the 19th-century Gardeners’ Chronicles, a publication used by farmers and naturalists, describing potential migration routes from shipments of “seed sets” shipped from the US and Canada into Bermuda, the UK, and Europe .
Yoshida et al. [23, 34] suggested a single introduction of the ancestral HERB-1 haplotype into Europe from a metapopulation outside of Mexico. While our data support a US metapopulation emergence outside of Mexico, the presence of multiple mitochondrial haplotypes in Europe  also suggests multiple introductions of the pathogen may have occurred between 1845 and 1889. It is possible that separate introductions of late blight into Europe may have occurred on infected tubers shipped both directly from the US and from South America . The oldest European sample from 1845 shared a basal lineage with the hybrid species P. andina from a shared Andean source population . The incongruity of multiple mtDNA haplotypes combined with a single nuclear lineage was observed in Martin et al.  with the positioning/clustering of P. andina samples within the HERB-1 mitochondrial clade. This dichotomy may be the result of long periods of clonal reproduction, with forces such as mitotic recombination and genetic drift creating the variation that results in the observation of multiple subclades that could point towards multiple introductions .
Origin of the US-1 lineage
We and others [23, 31, 35] have suggested that the US-1 lineage emerged later from a metapopulation outside of Mexico. Our data show that the US-1 lineage clearly emerged on a global scale in the mid-20th century and formed a distinct cluster that shares little allelic diversity with either the US or EU historic FAM-1 lineage. Thus, the US-1 lineage is not a direct descendent of famine-era outbreaks, but is a sister lineage, as is suggested by both mitochondrial and nuclear phylogenies done with whole genome datasets [23, 33, 34, 67, 68]. It is possible that US-1 and the FAM-1 lineages may have come from different sources. Martin et al.  suggested US-1 and famine-era lineages may have originated on non-S. tuberosum hosts. It is possible that the two lineages diverged from a common ancestor on two different host species, resulting in the apparent differences in allelic diversity. US-1 could have been introduced into the US from a different source and then displaced the FAM-1 lineage. However, Martin et al.  noted that relationships inferred from phylogenetic analysis of the P. infestans mitogenome may be inconsistent with relationships observed in the nuclear genome, and a more comprehensive examination of samples would be needed to discern potential sources. Mitochondrial phylogenies support the contention that type Ib (US-1) and type Ia mitochondrial lineages are sister lineages [67, 68]. Coalescent analysis indicates that the oldest mutations that gave rise to US-1 populations arose in South America [27, 33]. We also identified several recent SA isolates of P. infestans from Peru that were clustered with the US-1 lineage. Herbarium samples from the US placed the US-1 lineage in Texas in 1931 (S1 Table). Our migration analyses indicated migration of US-1 into both Mexico and South America, but greater numbers of migrants moved into Mexico. Thus, it is possible that the US-1 emerged from a South American source. The rare occurrence of the US-1 lineage in Mexico suggests it may be a recent immigrant there [7, 33, 69, 70]. Sequencing additional US-1 samples from Mexico would help clarify this question.
The increasing frequency of the US-1 lineage in the WWII era is intriguing. The US-1 lineage was used in Cold War era research at Fort Detrick to test for virulence on potato germplasm . In fact, historic isolates used in those studies were examined in our work here, (Mannon Gallegly, pers. comm.), and one of those isolates belonged to the US-1 lineage. How US-1 became so widely dispersed in the mid-20th century is unknown , but since US-1 is sensitive to metalaxyl, it was displaced by other fungicide-resistant lineages of P. infestans when this compound was deployed in the late 1970s . The introduction of new migrants from Mexico subsequently displaced the US populations of the US-1 lineage in the early 1990s .
Recent migrations from Mexico of aggressive lineages
The oldest Mexican herbarium voucher infected with late blight examined in our study, collected by John Neiderhauser in Chihuahua in 1948, was not assigned to the FAM-1 lineage, but was more similar to modern Mexican and US aggressive lineages. Mexican lineages clustered with many recent lineages of P. infestans in the US. Admixture between Mexican populations and modern US lineages, especially US-6, US-7, and US-8 lineages, support the theory of Fry and others [6, 70] that current populations of P. infestans in the US are primarily the result of migrations out of Mexico. Ancestral recombination graphs clearly documented the recombinant nature of one aggressive lineage, US-11, a clonal lineage that has been observed in recent years on tomato in the US [6, 15].
The cause of the apparent displacement of the historic-era populations of P. infestans after the 1940s is unknown at this time (Fig 3). Clearly there was an increase in potato breeding programs and movement of potato germplasm and the pathogen worldwide during and after WWII . Potatoes were also used as a valuable food source to feed both the Allied and German troops during WWII. Since historic FAM lineages lack many of the genes responsible for virulence on modern potatoes, this could have led to its decline [23, 32]. Modern populations of P. infestans have migrated from Mexico on infected potatoes on multiple occasions, and genotype shifts have been observed [14, 70]. The most recent population shift has been the displacement of the US-22 lineage by the US-23 lineage in the US over the past 5 years [13, 14, 15]. A similar observation has been made in Europe, where 13_A2, noted for its aggressive virulence, has become the dominant lineage . These shifts presumably indicate the replacement of older lineages by newer lineages with higher fitness . These lineages may be more aggressive, able to cause disease on both tomato and potato, or be resistant to fungicides
While our evidence suggests a South American origin for the FAM-1 lineage of P. infestans, to fully understand the evolutionary history of the pathogen in the New World, more samples are needed [72, 73]. Unfortunately, historic samples available for study from Mexico and South America are limited, particularly older herbarium samples. It is problematic that these samples do not exist, but further sampling of modern late blight from the Andean region of South America, especially from wild Solanum species, would improve our ability to draw inferences about FAM-1’s origins.
Recently, we suggested that it is possible that late blight was introduced into Mexico on a non-potato host , and that additional sampling from other Solanaceous hosts could help elucidate the role of host jumps in the evolution of the pathogen [1,74]. Goss et al.  suggested a similar non-potato origin for P. infestans as a species, emphasizing the need to collect from hosts outside of potato and tomato crops. Our data document that the FAM-1 lineage occurred on several wild species early after its introduction into Europe on Anthocercis ilicifolia Hook., Solanum dulcamara L., Solanum nigrum L., and Petunia hybrida E. Vilm. and in the Americas (S1 Table) [8, 9, 29]. It is possible that the metapopulation source of P. infestans alluded to by others [23, 27] may be discerned by a more thorough examination of wild hosts. In addition, previous studies have indicated passage through a wild host may increase aggressiveness of P. infestans, and this could have ecological implications for the control of the disease . P. infestans may have migrated from a wild Solanum host to domesticated potato and vice versa more than once [18, 19, 27]. There is evidence from herbarium records that non-S. tuberosum hosts played a role in the dispersal of this pathogen in the recent past [27, 29]. Thus, further studies to understand the role of host biodiversity and the movement of wild species in migrations of Phytophthora infestans are needed.
S1 Fig. Ancestral recombination graph (ARG) of sequences from the ras locus from modern and historic herbarium collections of Phytophthora infestans.
Green and yellow dots indicate points of coalescence. Recombination events are indicated by blue circles. Numbers within circles indicate the position of the site before recombination takes place (see S4 Table). Numbers along branches indicate the number of mutations between points. P: The origin of the prefix contribution to the recombination event; S: The origin of the suffix contribution to the recombination event.
S2 Fig. Ancestral recombination graph (ARG) of sequences from the PiAVR2 locus from modern and historic herbarium collections of Phytophthora infestans.
Yellow dot indicates the point of coalescence. Numbers along branches indicate the number of mutations between points.
S3 Fig. Ancestral recombination graph (ARG) of sequences from the P3 mitochondrial region from modern and historic herbarium collections of Phytophthora infestans.
Yellow and green dots indicate the points of coalescence. Numbers along branches indicate the number of mutations between points.
S4 Fig. Posterior probabilities of migration scenarios involving populations of Phytophthora infestans from South America (SA), Mexico (MEX), and famine era US (USHist).
Probabilities are based on the highest value of a logistic regression of data.
S5 Fig. Posterior probabilities of migration scenarios involving populations of Phytophthora infestans from South America (SA), Mexico (MEX), and Ib haplotypes (US1).
Probabilities are based on the highest value of a logistic regression of data.
S1 Table. Sample number, date of collection, location, genotype or mitochondrial lineage, group and host of isolates of Phytophthora infestans from herbarium specimens and modern collections used in this study.
S2 Table. Target gene region, primer name, primer sequence, primer size, location of primer on target DNA, and reference source of PCR primers used in the study.
S3 Table. Prior distributions for Do It Yourself Approximate Baysian Computation (DIYABC) scenarios.
Summary statistics for all runs: within population statistics: mean number of alleles and mean genetic diversity; between sample statistics: mean number of alleles, mean genetic diversity, Fst, shared allele distance, (δμ) 2, and maximum likelihood coefficient of admixture.
S4 Table. Diversity statistics for clone corrected microsatellite data for all 12 loci within populations of Phytophthora infestans.
S5 Table. Population statistics, diversity indices, and neutrality tests among Phytophthora infestans populations using ras, PiAVR2, and the P3 mitochondrial region.
S6 Table. Distribution of haplotypes and base substitution events in nuclear and mitochondrial genes of Phytophthora infestans.
S7 Table. Population subdivision of Phytophthora infestans populations at ras locus according to Hudson’s test statistics Ks (upper right matrix) and Kst (lower left matrix).
S8 Table. Locus, haplotype, isolate identities and population sampled for modern and historic isolates of Phytophthora infestans.
S9 Table. Isolation model (IM) parameter estimates for Phytophthora infestans populations from US Historic (USHist), EU Historic (EUHist), Mexican (MEX), US-1 and South America (SA) based on variation in the ras or PiAVR2 nuclear loci.
S10 Table. Posterior probabilities and confidence intervals from three Phytophthora infestans populations.
Probabilities listed are the highest values as calculated over a 10-sample logistic regression.
The authors would like to thank Tom Gilbert for comments on this manuscript. The authors would like to thank NIFA intern Meghan Wyatt for technical assistance. The authors would also like to thank Dr. Mannon Gallegly for conversations on his Cold War era work and many others for providing cultures used in this work (S1 Table). Appreciation is expressed to the Royal Botanic Gardens Kew Mycological Herbarium (K), the National Botanic Garden, Glasnevin, Dublin (DBN), the USDA National Fungus Collection, Beltsville, MD (BPI), the CABI Bioscience collection, Egham (IMI), the Farlow Herbarium Harvard University, Cambridge, MA (FH), the Museum of Evolutionary Biology, Uppsala University, Uppsala (UPS), the Cornell Plant Pathology Herbarium, Ithaca, NY (CUP), and the University of Florida Herbarium, Gainesville, FL (FLAS) for providing herbarium material. DNA used in this work may be requested from the corresponding author. Appreciation is expressed to the NC State Phytotron and the Genomic Sciences Laboratory for use of a separate lab space for work with the historic samples.
- Conceptualization: JBR.
- Data curation: ACS JBR MDM.
- Formal analysis: ACS.
- Funding acquisition: JBR.
- Investigation: JBR ACS MDM.
- Methodology: JBR ACS.
- Project administration: JBR.
- Resources: JBR ACS.
- Software: ACS.
- Supervision: JBR.
- Validation: JBR ACS MDM.
- Visualization: ACS JBR.
- Writing – original draft: ACS.
- Writing – review & editing: ACS JBR MDM.
- 1. Stukenbrock EH, McDonald B. The origins of plant pathogens in agroecosystems. Annu Rev Phytopathol. 2009; 46: 75–100.
- 2. Anderson PK, Cunningham AA, Patel NG, Morales FJ, Epstein PR, Daszak P. Emerging infectious diseases of plants: pathogen pollution, climate change and agrotechnology drivers. Trends Ecol Evol. 2004; 19: 535–544. doi: 10.1016/j.tree.2004.07.021. pmid:16701319
- 3. Bourke PMA. Emergence of potato blight, 1843–46. Nature 1964; 203: 805–808.
- 4. Cooke DEL, Cano LM, Raffaele S, Bain RA, Cooke LR, Etherington GJ, et al. Genome analyses of an aggressive and invasive lineage of the Irish potato famine pathogen. PLoS Pathog. 2012; 8: 1–14.
- 5. Forbes GA, Morales JG, Restrepo S, Pérez W, Gamboa S, Ruiz R, et al. Phytophthora infestans and Phytophthora andina on solanaceous hosts in South America. In: Lamour K, editor. Phytophthora: a global perspective. Boston: CAB International; 2013. Pp. 48–58.
- 6. Fry WE, Birch PRJ, Judelson HS, Grünwald NJ, Danies G, Everts KL. et al. Five reasons to consider Phytophthora infestans a reemerging pathogen. Phytopathology 2015; 105: 966–981. doi: 10.1094/PHYTO-01-15-0005-FI. pmid:25760519
- 7. Goodwin SB, Cohen BA, Fry WE. Panglobal distribution of a single clonal lineage of the Irish potato famine fungus. Proc Natl Acad Sci USA 1994; 91: 11591–11595. pmid:7972108
- 8. May KJ, Ristaino JB. Identity of the mtDNA haplotype(s) of Phytophthora infestans in historical specimens from the Irish Potato Famine. Mycol Res. 2004; 108: 1–9.
- 9. Ristaino JB, Groves CT, Parra GN. PCR amplification of the Irish potato famine from historic specimens. Nature 2001; 411: 695–697. doi: 10.1038/35079606. pmid:11395772
- 10. Brurberg MB, Elameen A, Le VH, Nærstad R, Hermansen A, Lehtinen A, et al. Genetic analysis of Phytophthora infestans populations in the Nordic European countries reveals high genetic variability. Fungal Biol. 2011; 115: 335–342. doi: 10.1016/j.funbio.2011.01.003. pmid:21530915
- 11. Drenth A, Goodwin SB, Fry WE, Davidse LC. Genotypic diversity of Phytophthora infestans in the Netherlands revealed by DNA polymorphisms. Phytopathology 1993; 83: 1087–1092.
- 12. Yuen JE, Andersson B. What is the evidence for sexual reproduction of Phytophthora infestans in Europe? Plant Pathol. 2013; 62: 485–491.
- 13. Fry WE, Myers K, Roberts P, McGrath MT, Everts K, Secor G. et al. The 2009 late blight pandemic in the eastern United States—causes and results. Plant Dis. 2013; 97: 296–306.
- 14. Hu C-H, Perez FG, Donahoo R, McLeod A, Myers K, Ivors , et al. Recent genotypes of Phytophthora infestans in eastern United States reveal clonal populations and reappearance of mefenoxam sensitivity. Plant Dis. 2012; 96: 1323–1330.
- 15. Saville A, Graham K, Grünwald NJ, Myers K, Fry WE, Ristiano JB. Fungicide sensitivity of US genotypes of Phytophthora infestans to six oomycete-targeted compounds. Plant Dis. 2015; 99: 659–666.
- 16. Danies G, Myers K, Mideros MF, Restrepo S, Martin FN, Cooke DEL, et al. An ephemeral sexual population of Phytophthora infestans in the northeastern United States and Canada. PLoS One 2014;
- 17. Gavino PD, Smart CD, Sandrock RW, Miller JS, Hamm PB, Yun Lee T. et al. Implications of sexual reproduction for Phytophthora infestans in the United States: generation of an aggressive lineage. Plant Dis. 200; 84: 731–735.
- 18. Niederhauser JS. Phytophthora infestans: the Mexican connection. In: Lucas JA, Shattock RC, Shaw DS, Cooke LR, editors. Phytophthora. Cambridge: Cambridge University Press; 1991. pp. 25–45.
- 19. Reddick D. Whence came Phytophthora infestans? Chron Bot. 1939; 5: 410–412.
- 20. Andrivon D. The origin of Phytophthora infestans populations present in Europe in the 1840s: a critical review of historical and scientific evidence. Plant Pathol. 1996; 45: 1027–1035.
- 21. Gómez-Alpizar L, Carbone I, Ristaino JB. An Andean origin of Phytophthora infestans inferred from mitochondrial and nuclear gene genealogies. Proc Natl Acad Sci USA. 2007; 104: 3306–3311. doi: 10.1073/pnas.0611479104. pmid:17360643
- 22. Abad ZG, Abad JA. Another look at the origin of the late blight of potatoes, tomatoes, and pear melon in the Andes of South America. Plant Dis. 1997; 81: 682–688.
- 23. Yoshida K, Schuenemann VJ, Cano LM, Pais M, Mishra B, Sharma R. et al. The rise and fall of the Phytophthora lineage that triggered the Irish potato famine. eLife 2013; Available: http://dx.doi.org/10.7554/eLife.00731 (2013).
- 24. Teschemacher JE. Observations on the Potato Disease. Gardeners Chronicle 1845; 125.
- 25. DeBary A. Researches into the nature of the potato-fungus—Phytophthora infestans. J Roy Agr Soc. 1876; 12: 239–268.
- 26. Berkeley MJ. Observations, botanical and physiological on the potato murain. J Hort Soc London 1846; 1: 9–34.
- 27. Martin MD, Vieira FG, Ho SYW, Wales N, Schubert M, Seguin-Orlando A et al. Genomic characterization of a South American Phytophthora hybrid mandates reassessment of the geographic origins of Phytophthora infestans. Mol Biol Evol. 2016; 33: 478–491. doi: 10.1093/molbev/msv241. pmid:26576850
- 28. Oliva RF, Kroon LPNM, Chacón G, Flier WG, Ristaino JB, Forbes GA. Phytophthora andina sp. nov., a newly identified heterothallic pathogen of solanaceous hosts in the Andean highlands. Plant Pathol. 2010; 59: 613–625.
- 29. Gómez-Alpizar L, Hu C-H, Oliva R, Forbes G, Ristaino JB. Phylogenetic relationships of a new species, Phytophthora andina, from the highlands of Ecuador that is closely related to the Irish Potato famine pathogen Phytophthora infestans. Mycologia 2008; 100: 590–602. pmid:18833752
- 30. Goss EM, Cardenas M, Myers K, Forbes GA, Fry WE, Restrepo Grünwald NJ. The plant pathogen Phytophthora andina emerged via hybridization of an unknown Phytophthora species and the Irish potato famine pathogen, P. infestans. PLoS ONE 2011; 6:
- 31. Ristaino JB, Hu C-H. DNA sequence analysis of the late-blight pathogen gives clues to the world-wide migration. Acta Hortic. 2009; 834: 27–40.
- 32. Martin MD, Cappellini E, Samaniego JA, Zepeda ML, Campos PF, Seguin-Orlando A, et al. Reconstructing genome evolution in historic samples of the Irish potato famine pathogen. Nat Commun. 2013;
- 33. Martin MD, Ho SYW, Wales N, Ristaino JB, Gilbert MTP. Persistence of the mitochondrial lineage responsible for the Irish potato famine in extant new world Phytophthora infestans. Mol Biol Evol. 2014; 31: 1414–1420. doi: 10.1093/molbev/msu086. pmid:24577840
- 34. Yoshida K, Burbano HA, Krause J, Thines M, Weigel D, Kamoun S. Mining herbaria for plant pathogen genomes: Back to the future. PLoS Pathog 2014; 10:
- 35. Birch PRJ, Cooke DEL. The early days of late blight. eLife 2013;
- 36. Wang H, Qi M, Cutler AJ. A simple method of preparing plant samples for PCR. Nucleic Acids Res. 1993; 21: 4153–4154. pmid:8371994
- 37. Chen Y, Roxby R. Characterization of a Phytophthora infestans gene involved in vesicle transport. Genetics 1996; 181: 89–94.
- 38. Gilroy EM, Breen S, Whisson SC, Squires J, Hien I, Kaczmarek M, et al. Presence/absence, differential expression and sequence polymorphisms between PiAVR2 and PiAVR2-like in Phytophthora infestans determine virulence on R2 plants. New Phytol. 2011; 191: 763–776. doi: 10.1111/j.1469-8137.2011.03736.x. pmid:21539575
- 39. Griffith GW, Shaw DS. Polymorphisms in Phytophthora infestans: four mitochondrial haplotypes are detected after PCR amplification of DNA from pure cultures or from host lesions. Appl Envir Microbiol. 1998; 64: 4007–4014.
- 40. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A. et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20: 1297–1303. doi: 10.1101/gr.107524.110. pmid:20644199
- 41. Li Y, Cooke DEL, Jacobsen E, van der Lee T. Efficient multiplex simple sequence repeat genotyping of the oomycete plant pathogen Phytophthora infestans. J Microbiol Meth. 2013; 92: 316–322.
- 42. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics 2000; 155: 945–959. pmid:10835412
- 43. Earl DA, vonHoldt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour. 2012; 4: 359–361.
- 44. Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 2007; 23: 1801–1806. doi: 10.1093/bioinformatics/btm233. pmid:17485429
- 45. Rosenberg NA. DISTRUCT: a program for the graphical display of population structure. Mol Ecol Notes 2004; 4: 137–138.
- 46. Jombart T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics 2008; 24: 1403–1405. doi: 10.1093/bioinformatics/btn129. pmid:18397895
- 47. Kamvar Z N, Tabina JF, Grünwald NJ. Poppr: An R package for genetic analysis of populations with clonal, partially clonal, and or sexual reproduction. PeerJ 2014; e281: doi: 10.7717/peerj.281. pmid:24688859
- 48. Monacell T, Carbone I. Mobyle SNAP Workbench: a web-based analysis portal for population genetics and evolutionary genomics. Bioinformatics 2014; 30: 1488–1490. doi: 10.1093/bioinformatics/btu055. pmid:24489366
- 49. Hall TA. BioEdit: A user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999; 41: 95–98.
- 50. Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994; 22: 4673–4680. pmid:7984417
- 51. Aylor DL, Price EW, Carbone I. SNAP: Combine and Map modules for multilocus population genetic analysis. Bioinformatics 2006; 22: 1399–1401. doi: 10.1093/bioinformatics/btl136. pmid:16601003
- 52. Bowden LC, Price EW, Carbone I. SNAP Clade and Matrix, version 2. 2008; http://carbonelab.org/workbench.
- 53. Lyngsø RB, Song YS, Hein J. Minimum recombination histories by branch and bound. In: Casadio R, Myers G, editors. Algorithms in Bioinformatics Berlin: Springer; 2005. pp. 239–250.
- 54. Hudson RR, Boos DD, Kaplan NL. A statistical test for detecting geographic subdivision. Mol Biol Evol. 1992a; 9: 138–151.
- 55. Hudson RR, Slatkin M, Maddison WP. Estimation of levels of gene flow from DNA sequence data. Genetics 1992b; 132: 583–589.
- 56. Hudson RR. A new statistic for detecting genetic differentiation. Genetics 2000; 155: 2011–2014. pmid:10924493
- 57. Excoffier L, Lischer HEL. Arlequin suite ver. 3.5: A new series of programs to perform population genetic analyses under Linux and Windows. Mol Ecol Resour. 2010; 10: 564–567. doi: 10.1111/j.1755-0998.2010.02847.x. pmid:21565059
- 58. Watterson GA. On the number of segregating sites in genetic models without recombination. Theor Pop Biol. 1975; 7: 256–276.
- 59. Tajima F. Evolutionary relationship of DNA sequences in finite populations. Genetics 1983; 105: 437–460. pmid:6628982
- 60. Fu Y-X. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 1997; 147: 915–925. pmid:9335623
- 61. Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 1989; 123: 585–595. pmid:2513255
- 62. Griffiths RC, Tavaré S. Ancestral inference in population genetics. Stat Sci. 1994; 9: 307–319.
- 63. Hey J, Nielsen R. Multilocus methods for estimating population sizes, migration rates, and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis. Genetics 2004; 167: 747–760. doi: 10.1534/genetics.103.024182. pmid:15238526
- 64. Cornuet JM, Ravigné V, Estoup A. Inference on population history and model checking using DNA sequence and microsatellite data with the software DIYABC (v1.0). BMC Bioinformatics 2010; 11: 401. doi: 10.1186/1471-2105-11-401. pmid:20667077
- 65. Goss EM, Tabima JF, Cooke DEL, Restrepo S, Fry WE, Forbes GA, et al. The Irish potato famine pathogen Phytophthora infestans originated in central Mexico rather than the Andes. Proc Natl Acad Sci USA 2014; 111: 8791–8796. doi: 10.1073/pnas.1401884111. pmid:24889615
- 66. Gunter R. The potato murrain. Gardeners Chronicles 39: 657–658.
- 67. Avila-Adame C, Gómez-Alpizar L, Zizmann V, Jones KM, Buell CR, Ristaino JB. Mitochondrial genome sequences and molecular evolution of the Irish potato famine pathogen, Phytophthora infestans. Curr Genet. 2006; 49: 39–46. doi: 10.1007/s00294-005-0016-3. pmid:16328503
- 68. Lassiter ES, Russ C, Nusbuam C, Zeng Q, Saville AC, Olarte RA, et al. Mitochondrial genome sequences reveal evolutionary relationships of the Phytophthora Ic clade species. Curr Genet. 2014; 61: 567–577.
- 69. Grunwald NJ, Flier WG. The biology of Phytophthora infestans at its center of origin. Annu Rev Phytopathol. 2005; 43: 171–190. doi: 10.1146/annurev.phyto.43.040204.135906. pmid:16078881
- 70. Goodwin SB, Smart CD, Sandrock RW, Deahl KL, Punja ZK, Fry WE. Genetic change within populations of Phytophthora infestans in the United States and Canada during 1994 to 1996: Role of migration and recombination. Phytopathology 1998; 88: 939–949. doi: 10.1094/PHYTO.19184.108.40.2069. pmid:18944872
- 71. National Research Council of the National Academies. Countering Agricultural Bioterrorism. 1st ed. Washington DC: National Academy Press; 2002.
- 72. Cárdenas M, Grajales A, Sierra R, Rojas A, González-Almario A, Vargas A, et al. Genetic diversity of Phytophthora infestans in the Northern Andean region. BMC Genetics 2011; 12:
- 73. Tooley PW, Therrien CD, Ritch DL. Mating type, race composition, nuclear DNA content, and isozyme analysis of Peruvian isolates of Phytophthora infestans. Phytopathology 1989; 79: 478–481.
- 74. Flier WG, Grünwald NJ, Kroon LPNM, Sturbaum AK, van den Bosch TBM, Garay-serrano E, et al. The population structure of Phytophthora infestans from the Toluca Valley of Central Mexico suggests genetic differentiation between populations from cultivated potato and wild Solanum spp. Phytopathology 2003; 93: 382–390. doi: 10.1094/PHYTO.2003.93.4.382. pmid:18944351
- 75. Grönberg L, Andersson B, Yuen J. Can weed hosts increase aggressiveness of Phytophthora infestans on potato? Phytopathology 2012; 102: 429–433. doi: 10.1094/PHYTO-07-11-0192. pmid:22185335