Skip to main content
Advertisement
  • Loading metrics

Tools for pathogen genetic surveillance: Lessons from the ash dieback invasion of Europe

  • Jessica A. Peers,

    Roles Conceptualization, Writing – original draft, Writing – review & editing

    Affiliation The Earlham Institute, Norwich, United Kingdom

  • Richard M. Leggett,

    Roles Conceptualization, Writing – original draft, Writing – review & editing

    Affiliation The Earlham Institute, Norwich, United Kingdom

  • Matthew D. Clark,

    Roles Conceptualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Science, The Natural History Museum, London, United Kingdom

  • Mark McMullan

    Roles Conceptualization, Data curation, Formal analysis, Project administration, Supervision, Writing – original draft, Writing – review & editing

    Mark.McMullan@earlham.ac.uk

    Affiliation The Earlham Institute, Norwich, United Kingdom

Introduction

Human movement and climate change are exposing new environments to pathogens and increasing the rate of invasions [1]. Presently, most pathogens are studied independently, requiring species-specific expertise, particularly at the sampling stage. Genomic surveillance strategies can be applied to identify species presence/absence but also within-species diversity, which may not currently be capitalised upon. Here, we use the ash dieback invasion of Europe [2] to frame a discussion on the significance of different types of genomic variation to invasion biology and review the application of developing microbiome sequencing technologies to pathogen surveillance.

What is ash dieback?

European ash (Fraxinus excelsior) is prevalent throughout Europe; for example, it is the third most common tree species in the United Kingdom. Ash is a keystone species with approximately 1,000 associated species and over 100 obligately/highly associated species [3]. The ash dieback invasion of Europe (see review [4]), caused by Hymenoscyphus fraxineus, was first identified in Estonia, 1992, and has since spread west, reaching the UK in 2012 [2]. In its native range of East Asia, H. fraxineus is a saprophyte with little effect on its host, F. mandshurica [5]. In Europe, H. fraxineus causes crown dieback and death of at least 2 ash species [2]. Ash dieback is estimated to cost the UK economy £7.6 billion by 2029 [6].

Slow spread of disease symptoms [7], environmental conditions [8], varying disease resistance [9], and propagule pressure and spread [10] make estimating ash mortality rates difficult. Best estimates suggest mortality of 70% to 85% [7]. Models of plant defence against pathogens focus on resistance genes evolved to recognise different pathogen gene products and trigger immunity [11]. Reduced susceptibility in European ash appears to be mediated by many genes [9], meaning that both natural selection and breeding programmes have a lot of work to do to bring all of those genes together [12].

H. fraxineus was spread within Europe by trade of infected samplings and by wind and consequently, may be found 50 to 100 km ahead of the infection front [13]. Management of diseases at these scales is difficult, so our attention turns to introduction and reintroduction. Surveillance measures could be important to identify an invasion early or prevent subsequent invasions of genetic diversity by focussing effort on limiting exchanges between reservoirs of diversity, which is critically important for pathogen adaptation.

How do we investigate intraspecific genomic variation?

Advancing genomic technologies are particularly tractable for the identification of emergent pathogens, especially in cases where culturing requires species-specific knowledge. Sequence-based methods can also be used to understand pathogen populations: to identify sources and important genomic regions, to inform management strategies to reduce transfers between reservoirs of diversity (e.g., [14,15]). Under investigated but equally important, we outline how genomic surveillance can identify haplotypic and structural variation which also facilitate introduction of novel diversity.

Genetic diversity

Nucleotide substitutions can identify novel species, perhaps notable where invaders are related to a known pathogen or commensal species (e.g., Hymenoscyphus species [16]). However, the level of genetic variation within a species is often measured using the number of single nucleotide polymorphisms (SNPs), which are captured by predominant genome sequencing strategies (Table 1). Nucleotide diversity is an important metric (based on SNP number and frequency) that is determined by the effective population size (and mutation rate). These metrics describe a population’s ability to respond to selection relative to drift [17]. Low numbers of invaders/spores mean that many pathogen introductions are associated with reductions in genetic diversity, as is the case for the ash dieback pathogen (Fig 1; [18]). At the genome scale, the native genetic diversity of an invading pathogen represents a reservoir available for subsequent invasions [19]. At the gene scale, that reservoir of diversity may be particularly stark at effector genes [18], which putatively interact with the host and may modulate defence [20]. Finally, genetic diversity and differentiation among native populations provides valuable information on the biology of the pathogen (e.g., population size and migration rate), which is often overlooked.

thumbnail
Table 1. Methods for genomic surveillance and their advantages and disadvantages.

Targeted resequencing: Select loci (e.g., genes, SNPs) based on existing data. Resequencing: Sequence whole genomes to map to a reference. Pangenomics: Sequence whole genomes from multiple individuals of the same species and analyse using pangenome-graphs.

https://doi.org/10.1371/journal.ppat.1012182.t001

thumbnail
Fig 1. Evidence for preservation of genetic diversity by structural genomic variation in H. fraxineus.

Evidence for duplicated content is replotted against data originally analysed (modified) from McMullan and colleagues [18]. Moving outwards from the centre, tracks detail: cumulative duplication among European invasion isolates, SNP density, nucleotide diversity, gene (blue) and effector (red) density, and repeat density. The exploded section shows for 4 scaffolds, duplicated content, and SNP density for native (red) and invasive (yellow) H. fraxineus populations. The exploded top 2 scaffolds are representative of the genome-wide pattern, in which SNP density is, as expected, lower in the invasion population alongside sparse duplication, rarely overlapping among individuals. The lower 2 (exploded) scaffolds show that invasion SNP density mirrors that of the native, and duplications overlap, and are shared among individuals throughout the scaffold.

https://doi.org/10.1371/journal.ppat.1012182.g001

Haplotype diversity

SNP diversity, identified using targeted/enrichment sequencing, or resequencing in diploids, overlooks haplotype diversity and the impact of recombination. Haplotypes are physically linked combinations of SNPs that are coinherited and broken down over time by recombination. In many fungi, the somatic tissues are haploid and the power to detect the influence of recombination increases as assembly haplotypic contiguity increases, from targeted methods to pangenomics (see Table 1). At the species level, hybridisation recombines divergent genetic diversity and has been associated with host jumps [23] and increased pathogenicity, e.g., Dutch elm disease [15], and is visible as introgression [14].

The costs and benefits of sex are wide ranging (and out of scope), but the potential fitness gains of combining favourable variants and/or the removal of unfavourable ones are greater in heterogeneous and/or novel environments [24]. Evidence for recombination is key to determining the impact of further genetic invasion because recombined genotypes may be more devastating than the sum of their parts. Linkage analysis in the H. fraxineus invasion shows that recombination does operate, increasing the potential of any further introduced polymorphism [18]. Progress in our understanding of the impact of recombination in pathogen invasion may be made by studying pathogen invasions of crops from wild crop relatives (e.g., [14,25]).

Structural genomic variation

Perhaps most difficult to detect without pangenomic methods, structural genomic variation (e.g., gene/chromosome duplication, accessory chromosomes, fusions, and inversions) can introduce variation at the haploid genome level [26]. Polymorphism between duplicated regions (or genes) on a given haplotype is not lost during an invasion bottleneck in the same way that polymorphism between alleles is. Given the higher quality of the genomes required to measure this kind of polymorphism, these processes may be currently underestimated [27].

Fungi appear particularly adept at gain, loss, inversion, and fusion of regions of their genome [2830]. There is evidence in H. fraxineus for the preservation of genetic diversity across regions of the invaders’ genome (Fig 1). However, this evidence comes from short-read resequencing, which is not adequate to address these questions (Fig 1). Depauperate native range sampling and a lack of long read assemblies (or pangenome) mean the mechanism for this preservation is uncertain: Is this duplication preserved across the native range (i.e., duplication occurred once in microevolutionary history), or is there polymorphism among individuals (i.e., ongoing duplication)? These questions are particularly important to address for invaders because they establish a propensity for recurrent successful invasions despite bottlenecks.

How can we surveil for pathogens?

Targeted resequencing takes advantage of available sequence data, while whole genome resequencing increases diversity resolution without prior data except a reference genome. However, as sequencing costs are reducing, pangenomics is more attractive for the most complete information. Our understanding of pathogen invaders may be increased further if we consider sampling environments as opposed to distinct species sequencing efforts.

Pathogen capture methodologies must vary with the medium in question (e.g., air, soil, water). Here, we have considered genomic surveillance strategies employed to sample the air, in part because of the propensity of H. fraxineus to spread by air [13]. However, strategies that describe diversity of other mediums may be even more beneficial, given our increased ability to manage soil systems compared to air, for example.

Air sequencing circumvents the requirement for species-specific expertise to identify an invasion and collect samples in a timely fashion and involves capturing biological particles from the air, extracting DNA, library preparation, and sequencing. This technology is in its infancy but could precede current approaches for identifying invasions [31]. In future, air sequencing could provide rapid population diversity data with sequence reads long enough to determine structural/haplotypic variation in a pathogen agnostic way.

Air sequencing approaches have been used to measure biodiversity [32,33] and to characterise seasonal changes in bacteria and fungi in urban environments [34]. Recovering large amounts of DNA from aerosol samples is challenging, as is identifying their source given the incompleteness of public databases. However, whole genome shotgun approaches have also demonstrated usefulness in clinical [35] and agricultural environments. In a mixed crop field, wheat rust (Puccinia striiformis f. sp. tritici) can be identified to strain level, in and among pathogens specific to the other crops found there [36]. The ability to identify strain specific differences is critical for population genomic surveillance, e.g., COG-UK consortium, which tracked COVID in the UK. Furthermore, DNA can be recovered and sequenced from existing filters used in air monitoring networks [37], providing a cost-effective mechanism for data collection from before the alarm was raised.

Conclusions

When considering the efficacy of genomic surveillance, it is important to understand the invasive and native genetic structure of pathogens as well as the biology each method is tuned to detect. Until relatively recently, the focus has been on understanding genetic diversity (i.e., SNPs), but haplotype and structural genetic variation can alleviate and circumvent reductions in adaptive diversity brought about by invasion bottlenecks. Here, we highlight the benefit and feasibility of surveilling for evidence of recombination and structural diversity. Existing genomic technologies already provide this level of information, yet they require species-specific expertise to sample. The devastation of the ash dieback pathogen, H. fraxineus, and increasing invasions of species due to climate change elevate the importance of pathogen-agnostic surveillance. Improved bioinformatics tools, open-data practices, and global resources for pathogen identification could be invaluable to identify subsequent genetic invasions in existing invasion ranges, as well as invasion routes and primary invasions in novel environments.

Methods

Data, originally published in McMullan and colleagues’ article [18], was reanalysed here using CNVnator [38]. CNVnator uses short read depth data to identify evidence for duplication (α = 0.05) and was run for each European isolate. Duplication regions were plotted as a cumulation of events among all European isolates (Fig 1).

Acknowledgments

We thank Sasha Stanbridge for graphics assistance and Darren Heavens and Rowena Hill for critical comments on the manuscript.

References

  1. 1. Seebens H, Blackburn TM, Dyer EE, Genovesi P, Hulme PE, Jeschke JM, et al. No saturation in the accumulation of alien species worldwide. Nat Commun. 2017 Feb 15;8:14435. pmid:28198420
  2. 2. Drenkhan R, Sander H, Hanso M. Introduction of Mandshurian ash (Fraxinus mandshurica Rupr.) to Estonia: Is it related to the current epidemic on European ash (F. excelsior L.)? Eur J For Res. 2014 Sep 1;133(5):769–781.
  3. 3. Mitchell RJ, Beaton JK, Bellamy PE, Broome A, Chetcuti J, Eaton S, et al. Ash dieback in the UK: A review of the ecological and conservation implications and potential management options. Biol Conserv. 2014 Jul 1;175:95–109.
  4. 4. Downie JA. Ash dieback epidemic in Europe: How can molecular technologies help? PLoS Pathog. 2017 Jul;13(7):e1006381. pmid:28727852
  5. 5. Inoue T, Okane I, Ishiga Y, Degawa Y, Hosoya T, Yamaoka Y. The life cycle of Hymenoscyphus fraxineus on Manchurian ash, Fraxinus mandshurica, in Japan. Mycoscience. 2019 Mar 1;60(2):89–94.
  6. 6. Hill L, Jones G, Atkinson N, Hector A, Hemery G, Brown N. The£ 15 billion cost of ash dieback in Britain. Curr Biol. 2019;29(9):R315–R316. Available from: https://www.sciencedirect.com/science/article/pii/S0960982219303318
  7. 7. Coker TLR, Rozsypálek J, Edwards A, Harwood TP, Butfoy L, Buggs RJA. Estimating mortality rates of European ash (Fraxinus excelsior) under the ash dieback (Hymenoscyphus fraxineus) epidemic. Plants People Planet. 2019 Jan;1(1):48–58.
  8. 8. Grosdidier M, Scordia T, Ioos R, Marçais B. Landscape epidemiology of ash dieback. J Ecol. 2020 Sep;108(5):1789–1799.
  9. 9. Stocks JJ, Metheringham CL, Plumb WJ, Lee SJ, Kelly LJ, Nichols RA, et al. Genomic basis of European ash tree resistance to ash dieback fungus. Nat Ecol Evol. 2019 Dec;3(12):1686–1696. pmid:31740845
  10. 10. Hietala AM, Børja I, Solheim H, Nagy NE, Timmermann V. Propagule Pressure Build-Up by the Invasive Hymenoscyphus fraxineus Following Its Introduction to an Ash Forest Inhabited by the Native Hymenoscyphus albidus. Front Plant Sci. 2018;9. Available from: https://www.frontiersin.org/articles/10.3389/fpls.2018.01087
  11. 11. Dodds PN, Chen J, Outram MA. Pathogen Perception and Signalling in Plant Immunity. Plant Cell. 2024;36(5):1465–1481.
  12. 12. Plumb WJ, Coker TLR, Stocks JJ, Woodcock P, Quine CP, Nemesio-Gorriz M, et al. The viability of a breeding programme for ash in the British Isles in the face of ash dieback. Plants People Planet. 2020 Jan;2(1):29–40.
  13. 13. Grosdidier M, Ioos R, Husson C, Cael O, Scordia T, Marçais B. Tracking the invasion: dispersal of Hymenoscyphus fraxineus airborne inoculum at different scales. FEMS Microbiol Ecol. 2018 May 1;94(5):fiy049. pmid:29668932
  14. 14. Rahnama M, Condon B, Ascari JP, Dupuis JR, Del Ponte EM, Pedley KF, et al. Recent co-evolution of two pandemic plant diseases in a multi-hybrid swarm. Nat Ecol Evol. 2023 Dec;7(12):2055–2066. z pmid:37945944
  15. 15. Brasier CM, Kirk SA. Rapid emergence of hybrids between the two subspecies ofOphiostoma novo-ulmiwith a high level of pathogenic fitness. Plant Pathol. 2010 Feb;59(1):186–199.
  16. 16. Rafiqi M, Kosawang C, Peers JA, Jelonek L, Yvanne H, McMullan M, et al. Endophytic fungi related to the ash dieback causal agent encode signatures of pathogenicity on European ash. IMA Fungus. 2023 May 11;14(1):10. pmid:37170345
  17. 17. Charlesworth B. Effective population size and patterns of molecular evolution and variation. Nat Rev Genet. 2009 Mar;10(3):195–205.
  18. 18. McMullan M, Rafiqi M, Kaithakottil G, Clavijo BJ, Bilham L, Orton E, et al. The ash dieback invasion of Europe was founded by two genetically divergent individuals. Nat Ecol Evol. 2018 Jun;2(6):1000–1008. pmid:29686237
  19. 19. Estoup A, Ravign V, Hufbauer R, Vitalis R, Gautier M, Facon B. Is There A Genetic Paradox of Biological Invasion? Annu Rev Ecol Evol Syst. 2016;47:51–72.
  20. 20. Uhse S, Djamei A. Effectors of plant-colonizing fungi and beyond. PLoS Pathog. 2018 Jun;14(6):e1006992. pmid:29879221
  21. 21. Jouet A, Saunders DGO, McMullan M, Ward B, Furzer O, Jupe F, et al. Albugo candida race diversity, ploidy and host-associated microbes revealed using DNA sequence capture on diseased plants in the field. New Phytol. 2019 Feb;221(3):1529–1543. pmid:30288750
  22. 22. Badet T, Oggenfuss U, Abraham L, McDonald BA, Croll D. A 19-isolate reference-quality global pangenome for the fungal wheat pathogen Zymoseptoria tritici. BMC Biol. 2020 Feb 11;18(1):12. pmid:32046716
  23. 23. Stukenbrock EH, Christiansen FB, Hansen TT, Dutheil JY, Schierup MH. Fusion of two divergent fungal individuals led to the recent emergence of a unique widespread pathogen species. Proc Natl Acad Sci U S A. 2012 Jul 3;109(27):10954–10959. pmid:22711811
  24. 24. Nieuwenhuis BPS, James TY. The frequency of sex in fungi. Philos Trans R Soc Lond B Biol Sci. 2016 Oct 19;371(1706). pmid:27619703
  25. 25. McMullan M, Percival-Alwyn L, Sawford K, Kaithakottil G, Grey M, Yvanne H, et al. Analysis of wild plant pathogen populations reveals a signal of adaptation in genes evolving for survival in agriculture in the beet rust pathogen (Uromyces beticola) [Internet]. bioRxiv. bioRxiv; 2021. Available from: http://dx.doi.org/10.1101/2021.08.12.456076
  26. 26. Mérot C, Oomen RA, Tigano A, Wellenreuther M. A Roadmap for Understanding the Evolutionary Significance of Structural Genomic Variation. Trends Ecol Evol. 2020 Jul;35(7):561–572. pmid:32521241
  27. 27. Hill R, Grey M, Fedi MO, Smith D, Ward SJ, Canning G, et al. Evolutionary genomics reveals variation in structure and genetic content implicated in virulence and lifestyle in the genus Gaeumannomyces [Internet]. bioRxiv. 2024 [cited 2024 Mar 6]. p. 2024.02.15.580261. Available from: https://www.biorxiv.org/content/10.1101/2024.02.15.580261v1
  28. 28. Goodwin SB, Ben M’Barek S, Dhillon B, Wittenberg AHJ, Crane CF, Hane JK, et al. Finished genome of the fungal wheat pathogen Mycosphaerella graminicola reveals dispensome structure, chromosome plasticity, and stealth pathogenesis. PLoS Genet. 2011;7(6):e1002070. pmid:21695235
  29. 29. Tang Y-C, Amon A. Gene copy-number alterations: a cost-benefit analysis. Cell. 2013 Jan 31;152(3):394–405.
  30. 30. Stalder L, Oggenfuss U, Mohd-Assaad N, Croll D. The population genetics of adaptation through copy number variation in a fungal plant pathogen. Mol Ecol. 2023 May;32(10):2443–2460. 5 pmid:35313056
  31. 31. Weisberg AJ, Grünwald NJ, Savory EA, Putnam ML, Chang JH. Genomic approaches to plant-pathogen epidemiology and diagnostics. Annu Rev Phytopathol. 2021 Aug 25;59(1):311–332. pmid:34030448
  32. 32. Lynggaard C, Bertelsen MF, Jensen CV, Johnson MS, Frøslev TG, Olsen MT, et al. Airborne environmental DNA for terrestrial vertebrate community monitoring. Curr Biol. 2022 Feb 7;32(3):701–707.e5. pmid:34995490
  33. 33. Clare EL, Economou CK, Bennett FJ, Dyer CE, Adams K, McRobie B, et al. Measuring biodiversity from DNA in the air. Curr Biol. 2022 Feb 7;32(3):693–700.e5. pmid:34995488
  34. 34. Núñez A, García AM, Moreno DA, Guantes R. Seasonal changes dominate long-term variability of the urban air microbiome across space and time. Environ Int. 2021 May;150:106423. pmid:33578068
  35. 35. King P, Pham LK, Waltz S, Sphar D, Yamamoto RT, Conrad D, et al. Correction: Longitudinal Metagenomic Analysis of Hospital Air Identifies Clinically Relevant Microbes. PLoS ONE. 2016 Dec 28;11(12):e0169376. pmid:28030605
  36. 36. Giolai M, Verweij W, Pearson N, Nicholson P, Leggett RM, Clark MD. Air-seq: Measuring air metagenomic diversity in an agricultural ecosystem [Internet]. bioRxiv. 2022 [cited 2023 Jan 10]. p. 2022.12.13.520298. Available from: https://www.biorxiv.org/content/10.1101/2022.12.13.520298
  37. 37. Littlefair JE, Allerton JJ, Brown AS, Butterfield DM, Robins C, Economou CK, et al. Air-quality networks collect environmental DNA with the potential to measure biodiversity at continental scales. Curr Biol. 2023 Jun 5;33(11):R426–R428. pmid:37279659
  38. 38. Abyzov A, Urban AE, Snyder M, Gerstein M. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 2011 Jun;21(6):974–984. pmid:21324876