Adaptive evolution in virulence effectors of the rice blast fungus Pyricularia oryzae

Marie Le Naour—Vernet; Florian Charriat; Jérôme Gracy; Sandrine Cros-Arteil; Sébastien Ravel; Florian Veillet; Isabelle Meusnier; André Padilla; Thomas Kroj; Stella Cesari; Pierre Gladieux

doi:10.1371/journal.ppat.1011294

Abstract

Plant pathogens secrete proteins called effectors that target host cellular processes to promote disease. Recently, structural genomics has identified several families of fungal effectors that share a similar three-dimensional structure despite remarkably variable amino-acid sequences and surface properties. To explore the selective forces that underlie the sequence variability of structurally-analogous effectors, we focused on MAX effectors, a structural family of effectors that are major determinants of virulence in the rice blast fungus Pyricularia oryzae. Using structure-informed gene annotation, we identified 58 to 78 MAX effector genes per genome in a set of 120 isolates representing seven host-associated lineages. The expression of MAX effector genes was primarily restricted to the early biotrophic phase of infection and strongly influenced by the host plant. Pangenome analyses of MAX effectors demonstrated extensive presence/absence polymorphism and identified gene loss events possibly involved in host range adaptation. However, gene knock-in experiments did not reveal a strong effect on virulence phenotypes suggesting that other evolutionary mechanisms are the main drivers of MAX effector losses. MAX effectors displayed high levels of standing variation and high rates of non-synonymous substitutions, pointing to widespread positive selection shaping the molecular diversity of MAX effectors. The combination of these analyses with structural data revealed that positive selection acts mostly on residues located in particular structural elements and at specific positions. By providing a comprehensive catalog of amino acid polymorphism, and by identifying the structural determinants of the sequence diversity, our work will inform future studies aimed at elucidating the function and mode of action of MAX effectors.

Author summary

Fungal plant pathogens use small secreted proteins, called effectors, to manipulate to their own advantage their host’s physiology and immunity. The evolution of these effectors, whether spontaneously or in response to human actions, can lead to epidemics or the emergence of new diseases. It is therefore crucial to understand the mechanisms underlying this evolution. In this article, we report on the evolution of effectors in one of the prime experimental model systems of plant pathology, Pyricularia oryzae, the fungus causing blast diseases in rice, wheat, and other cereals or grasses. We further characterize in this fungus a particular class of effectors, called MAX effectors, using structural models based on experimental protein structures of effectors. We show that this class of effector is produced by the pathogen during the early stages of infection, when plant cells are still alive. By comparing the gene content of isolates infecting different plant species, we show that the MAX effector arsenal is highly variable from one isolate to another. Finally, using the inferential framework of population genetics, we demonstrate that MAX effectors exhibit very high genetic variability and that this results from the action of natural selection.

Citation: Le Naour—Vernet M, Charriat F, Gracy J, Cros-Arteil S, Ravel S, Veillet F, et al. (2023) Adaptive evolution in virulence effectors of the rice blast fungus Pyricularia oryzae. PLoS Pathog 19(9): e1011294. https://doi.org/10.1371/journal.ppat.1011294

Editor: Michael F. Seidl, Utrecht University Faculty of Science: Universiteit Utrecht Faculteit Betawetenschappen, NETHERLANDS

Received: March 15, 2023; Accepted: August 9, 2023; Published: September 11, 2023

Copyright: © 2023 Le Naour—Vernet et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data available from the European nucleotide archive (BioProject PRJEB47684) and from Zenodo (doi: 10.5281/zenodo.7689273; doi: 10.5281/zenodo.8052494).

Funding: This work was funded by HORIZON EUROPE European Research Council grant ERC‐2019‐STG‐852482‐ii‐MAX to SC and Agence Nationale de la Recherche grant ANR-18-CE20-0016 to PG. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Plant pathogens secrete effector proteins to manipulate the physiology and metabolism of their host and to suppress its immunity. Consequently, effectors are expected to engage in coevolutionary interactions with plant defense molecules. The proximate mechanisms of effector-driven adaptation are relatively well-characterized: plant pathogens adapt to new hosts through changes in effector repertoires and effector sequences [1, 2]. However, the ultimate (eco-evolutionary) mechanisms underlying effector diversification have remained elusive. The concept of coevolution posits that adaptation in one partner drives counter-adaptations in the coevolving partner [3–5]. Under the co-evolutionary arms race model, variation for disease resistance and pathogen virulence is transient, resulting in a turnover of sequence variation through repeated episodes of strong directional selection [6]. In agricultural systems, because pathogens tend to be ahead of their hosts in the arms race owing to their larger populations and shorter generation times, the co-evolutionary arms race tends to result in so-called boom and bust cycles [7]. Under the alternative, ‘trench warfare’ hypothesis, advances and retreats of resistance or virulence genes frequencies maintain variation as dynamic polymorphisms [8, 9]. The maintenance of genetic polymorphisms is called ‘balancing selection’, a process by which different alleles or haplotypes are favored in different places (via population subdivision) and/or different times (via frequency-dependent negative selection). While there is a growing body of data demonstrating the nature and prevalence of the selective pressures that shape the diversity of immune systems in plants [6, 10–12], we still lack a clear picture of the co-evolutionary mechanisms underlying the molecular evolution of virulence factors in their interacting antagonists [13].

Effectors from plant pathogenic fungi are typically cysteine-rich secreted proteins smaller than 200 amino acids with an infection-specific expression pattern. Effectors are numerous in fungal genomes (several hundred to more than a thousand per genome), and rarely show homologies with known proteins or domains. They are also highly variable in sequence and do not form large families of sequence homologs. Based on similarity analyses, fungal effectors can form small groups of paralogs (typically with less than five members), but they are most often singletons. This apparent lack of larger effector families has hindered attempts to probe into the evolutionary factors underlying their diversification. In addition, the high diversity of fungal effectors has hampered functional analyses due to the lack of good criteria for prioritizing them and our inability to predict their physiological role. Consequently, the virulence function and evolutionary history of most fungal effectors remain unknown.

Recently, the resolution of the three-dimensional (3D) structure of fungal effectors combined with Hidden Markov Model (HMM) pattern searches and structure modeling revealed that fungal effector repertoires are, despite their hyper-variability, dominated by a limited number of families gathering highly sequence-diverse proteins with shared structures and, presumably, common ancestry [14–17]. One such structurally-conserved but sequence-diverse fungal effector family is the MAX (Magnaporthe Avrs and ToxB-like) effector family. MAX effectors are specific to ascomycete fungi and show massive expansion in Pyricularia oryzae (synonym: Magnaporthe oryzae) [17], the fungus causing rice blast disease, one of the most damaging diseases of rice [18, 19]. MAX effectors are characterized by a conserved structure composed of six β-strands organized into two antiparallel β-sheets that are stabilized in most cases by one or two disulfide bridges. The amino acid sequence of MAX effectors is very diverse and they generally have less than 15% identity, which suggests that they are a family of analogous, rather than homologous, effector proteins. In other words, they share a similar three-dimensional structure, but no conclusions can be drawn with respect to shared ancestry. MAX effectors are massively expressed during the biotrophic phase of infection, suggesting an important role in disease development and fungal virulence [17]. Remarkably, about 50% of the known avirulence (AVR) effectors of P. oryzae belong to the MAX family, indicating that these effectors are closely monitored by the host plant immune system [17].

Pyricularia oryzae is a multi-host, poly-specialist pathogen that infects more than 50 monocotyledonous plants, including major cereal crops such as rice, maize, wheat, or barley [20–23]. Pyricularia oryzae has repeatedly emerged on new hosts [21, 24], in new geographical areas [25, 26], and phylogenomic analyses have revealed that it can be subdivided into several genetic lineages, each preferentially associated with a specific or restricted set of host plant genera [27]. In P. oryzae, effectors can play a major role in host-shifts or host-range expansions [28–30]. For example, loss of function of the PWT3 effector in Lolium-infecting strains contributed to gain of virulence on wheat [29]. Similarly, loss of the MAX effector AVR1-CO39 is thought to have contributed to the emergence of rice blast from Setaria-infecting isolates [20, 31]. This indicates that MAX effectors may be important determinants of host specificity in P. oryzae.

In this study, we characterized the genetic diversity of MAX effectors in P. oryzae and within its different host-specific lineages. We explored the evolutionary drivers of the diversification of MAX effectors and tested whether MAX effectors represent important determinants of P. oryzae host specificity. To this aim, we assembled and annotated 120 high-quality P. oryzae genomes from isolates representing seven main host-specific lineages. We mined these genomes for putative effectors and used hidden Markov models based on fold-informed protein alignments to annotate putative MAX effectors. We identified 58 to 78 putative MAX effector genes per individual genome distributed in 80 different groups of MAX homologs. We showed that the expression of MAX effector genes is largely restricted to the early biotrophic phase of infection and strongly influenced by the host plant. Our evolutionary analyses showed that MAX effectors harbor more standing genetic variation than other secreted proteins and non-effector genes, and high rates of non-synonymous substitutions, pointing to positive selection as a potent evolutionary force shaping their sequence diversity. Pangenome analyses of MAX effectors demonstrated extensive presence/absence polymorphism and identified several candidate gene loss events possibly involved in host range adaptation. Our work demonstrates that MAX effectors represent a highly dynamic compartment of the genome of P. oryzae, likely reflecting intense co-evolutionary interactions with host molecules.

Results

Genome assembly and prediction of MAX effector genes

We used a collection of genome assemblies that included 120 haploid isolates of Pyricularia oryzae fungi from 14 different host genera: Oryza (n = 52), Triticum (n = 21), Lolium (n = 12), Setaria (n = 8), Eleusine (n = 8), Echinochloa (n = 4), Zea (n = 4), Bromus (n = 2), Brachiaria (n = 2), Festuca (n = 2), Stenotaphrum (n = 2), Eragrostis (n = 1), Hordeum (n = 1), and Avena (n = 1) (S1 Table). Assembly size ranged from 37Mb to 43.2Mb, with an average size of 40.2 Mb (standard deviation [s.d]: 1.9Mb). L50 ranged from five to 411 contigs (mean: 97.1; s.d.: 83.2) and N50 from 28Kb to 4.0Mb (mean: 238.6Kb; s.d.: 43.8Kb; S1 Table). Gene prediction based on protein sequences from reference 70–15 and RNAseq data identified 11,520 to 12,055 genes per isolate (mean: 11,763.2; s.d.: 103.7). The completeness of assemblies, as estimated using BUSCO [32], ranged between 93.4 and 97.0% (mean: 96.4%; s.d.: 0.6%; S1 Table).

MAX effectors were identified among predicted secreted proteins using a combination of similarity searches [33, 34] and structure-guided alignments [35] as summarized in Fig 1. To assess variation in the MAX effector content of P. oryzae, we constructed groups of homologous genes (i.e., “orthogroups” or OG) using the clustering algorithm implemented in Orthofinder [36]. A given orthogroup was classified as secreted proteins or MAX effectors if 10% of sequences in the group were identified as such by functional annotation. Sequences were grouped in 14,767 orthogroups, of which 80 were classified as encoding MAX effectors and 3,283 as encoding other types of secreted proteins (Fig 1). The number of MAX orthogroups per isolate ranged from 58 to 73 (average: 65.8; s.d.: 2.8), representing between 58 to 78 MAX genes per isolate (average: 68.4; s.d.: 3.6). The 80 orthogroups of MAX effectors were further split into 94 groups of orthologs, by identifying paralogs using gene genealogies inferred with RAxML v8 [37] (S2 Table). Comparison of these 94 groups of orthologs with MAX effectors predicted by previous studies [15–17, 38] revealed that 19 were not predicted by any other study, while 75 were predicted by at least one other study (including nine predicted by all studies). Twenty-five MAX effectors predicted by other analyses, were not identified by our prediction pipeline and were, therefore, not considered in our study (S2 Table).

Download:

Fig 1. Schematic representation of the main steps of the bioinformatic pipeline used to predict genes in 120 genomes of Pyricularia oryzae, to identify genes encoding MAX effectors, and to measure their genetic variability.

References: Gladieux et al. 2018 [27]; Pordel et al. 2021 [21]. Genome assembly and annotation: using RNAseq and reference proteins, we predicted genes with Augustus 3.4.0 [56] and Braker 1 [70] in 54 genomes that were assembled in this study with ABySS 1.9.0 [69], as well as in 66 assemblies that were published earlier; repeated sequences were masked using RepeatMasker 4.1.0 (http://www.repeatmasker.org/). Secretome prediction: we identified signal peptides in the predicted genes by running SignalP 4.1 [74], targetP 1.1 [75], and phobius 1.01 [62], removing genes encoding proteins with a predicted transmembrane domain (identified using TMHMM [76]), as well as endoplasmic reticulum proteins (identified with PS-Scan from expasy.org). Prediction of MAX effectors: MAX effectors were identified using PSI-Blast [33] to search for homologs of known MAX effectors (AVR1-CO39, AVR-Pia, AvrPiz-t, AVR-PikD, and ToxB) in the predicted secretome, followed by structure-guided alignment of PSI-Blast hits using TM-align [35], and three iterative rounds of HMMER [34] searches based on hmmbuild models built on TM-align alignments of significant hits. Only proteins with two expected conserved cysteines less than 33–48 amino acids apart were retained [17]. Evolutionary analyses: the 11 orthogroups that included paralogous copies of MAX effectors were split into sets of orthologous sequences using genealogies inferred using RAxML v8 [37], yielding a total of 94 single-copy MAX orthologs for evolutionary analyses; protein structure models were inferred using homology modeling in Modeller [79], and structural features were identified with Stride [91].

https://doi.org/10.1371/journal.ppat.1011294.g001

The number of MAX effectors per isolate was primarily determined by the host of origin. We did not observe any significant relationship between the number of MAX effectors and assembly properties (assembly size and N50), unlike the number of secreted proteins and total number of genes (S1 Fig). Analysis of variance revealed that the host of origin had a significant impact on the number of MAX effectors (F_13,95 = 10.33; p = 3e^-8), while the origin of genomic data did not have an effect (i.e., the study in which genomic data were initially described; F_11,95 = 1.07; p = 0.39; S1 Fig). These analyses indicate that the observed variation in the size of the MAX effector repertoire is primarily biological in origin, not technical.

MAX effectors are massively deployed during rice infection

To determine whether these putative MAX effectors are deployed by P. oryzae during plant infection, we analyzed the expression patterns of the 67 MAX genes predicted in the genome of the reference isolate Guy11 by qRT-PCR (Fig 2A). Using RNA samples from Guy11 mycelium grown on artificial media, we found that 94% of the MAX genes (63 genes) were not, or very weakly expressed during axenic culture, and only four (i.e., MAX24, MAX29, MAX59, and MAX66) showed weak, medium or strong constitutive expression (Fig 2A). MAX genes were, therefore, predominantly repressed in the mycelium of P. oryzae.

Download:

Fig 2. The expression of MAX genes is biotrophy-specific and is influenced by the host plant.

(A) Transcript levels of MAX genes and the biotrophy marker gene Bas3 were determined by qRT-PCR in the mycelium (Myc) of the P. oryzae isolate Guy11 grown for 5 days in liquid culture, and in infected leaves of the rice cultivars Maratelli and Kitaake at 2, 3, 4, and 7 days post inoculation (dpi) with Guy11. Relative expression levels were calculated using the constitutively expressed MoEF1α (Elongation Factor 1α) gene as a reference. The heatmap shows the median relative expression value for each gene (in log2 scale), calculated from six independent biological samples for the Myc condition, and three independent inoculation experiments (each with five independent leaf samples per time point) for each rice cultivar. Effectors were ranked from top to bottom by increasing relative expression values in Maratelli. Relative expression values were assigned to six categories: not expressed (<0.008), very weakly (0.008–0.04), weakly (0.04–0.2), moderately (0.2–1), strongly (1–5) and very strongly expressed (>5). (B) Scatter plot comparing the relative expression levels of MAX genes in Guy11-infected Maratelli and Kitaake cultivars. Each point shows the maximum median relative expression value (in log10 scale) calculated in the infection kinetics described in (A). Difference in effector relative expression levels between the two conditions was assessed by Mann-Whitney U tests and dots were colored according to significance results: grey (p>0.05), yellow (p<0.05), orange (p<0.001), red (p<0.0001), black (effectors not expressed in both conditions).

https://doi.org/10.1371/journal.ppat.1011294.g002

We analyzed the expression of the MAX genes 2, 3, 4, and 7 days following spray inoculation of Guy11 on the rice cultivar Maratelli, which is highly susceptible to P. oryzae. We found that 67% of the MAX genes (45 genes) were expressed (Fig 2A). Among them, three were also expressed in the mycelium (i.e., MAX31, MAX59, and MAX94). MAX31 was over-expressed under infection conditions, whereas the other two showed similar expression levels in vitro and during infection. 64% of the MAX genes (42 genes) showed an infection-specific expression profile with relative expression levels ranging from very low (0.008–0.04) to very high (>5). Like the Bas3 gene, encoding a P. oryzae effector specifically induced during the biotrophic phase of infection, all MAX genes showed maximal expression between the second and fourth day post-inoculation (Figs 2A and S2).

To test whether the genotype of the host plant could influence the expression of MAX genes, we analyzed their expression patterns upon infection of the rice cultivar Kitaake, which has a higher basal resistance to P. oryzae than Maratelli. We computed the median relative expression across three independent experiments with five biological replicates each (Fig 2A and 2B). During Kitaake infection, 78% of the MAX genes (52 genes) were upregulated compared to the in vitro condition, while 64% (43 genes) were induced upon infection of Maratelli (Fig 2A). Some MAX genes not expressed in Maratelli were induced in Kitaake (e.g., MAX24, MAX30, MAX32, MAX43, MAX71, and MAX73) (Figs 2B and S3). Others were significantly upregulated in Kitaake compared to Maratelli (i.e., MAX22, MAX44, MAX55, MAX57, MAX69, and MAX91). However, a few genes, such as MAX15, MAX37 and MAX62, among the most strongly expressed effectors in Maratelli, showed weaker levels of expression in Kitaake. These results show that Guy11 deploys a wider diversity of MAX effectors during the infection of Kitaake compared to that of Maratelli, and that MAX effectors are subject to host-dependent expression polymorphism.

Taken together, our data indicate that during the biotrophic phase of rice infection, P. oryzae actively expresses a significant portion of its MAX effector repertoire in a host-dependent manner, which suggests that these effectors have an important function in fungal virulence.

The MAX effector repertoire is highly variable

To investigate the genetic diversity of MAX effectors in P. oryzae, we analyzed their nucleotide diversity per base pair (π), their ratio of non-synonymous to synonymous nucleotide diversity (π_N/π_S; page 226 in ref. [39]), and their presence-absence polymorphism. Compared to other secreted proteins or other genes, MAX effector orthogroups had higher π, and π_N/π_S values, and lower presence frequency (S4 Fig). Orthogroups including known avirulence genes like AVR1-CO39, AvrPiz-t and AVR-Pik featured among the most diverse orthogroups of MAX effectors (S1 Data).

We categorized genes in the pangenome according to their presence frequencies [36], with core genes present in all isolates, softcore genes present in >99% isolates, shell genes present in 1–99% isolates and cloud genes present in <1% isolates. The majority of MAX effector genes were classified as shell (64/80 [80%] orthogroups), while the majority of other secreted proteins or other genes were classified as core or softcore (1650/3283 [50.2%] and 6714/11404 [58.9%] orthogroups, respectively) (Fig 3A). Only a minority of genes were present in multiple copies (MAX: 15/80 [18.8%]; other effectors: 746/3283 [22.7%]; other genes: 1439/11404 [12.6%]; Fig 3A). Assessment of the openness of the pan-genome by iteratively subsampling isolates revealed a closed pangenome with a limited number of pan and core genes for MAX effectors, other secreted proteins and the remainder of the gene space (Fig 3B). Nucleotide diversity differed significantly between categories of the pangenome for non-MAX effectors (Kruskal-Wallis test: H = 181.17, d.f. = 2, p<0.001) and other genes (Kruskal-Wallis test: H = 225.25, d.f. = 2, p<0.001), but not for MAX effectors (Kruskal-Wallis test: H = 2.50, d.f. = 2, p>0.05). For non-MAX effectors and other genes, nucleotide diversity π was significantly higher in the shell genes than in softcore genes and core genes (Post-hoc Mann-Whitney U-tests, p<0.001; Fig 3C). The frequency of MAX orthogroups was positively and significantly correlated with the frequency of neighboring orthogroups at the species-wide level and at the level of Oryza and Setaria lineages, which indicates that non-core MAX effectors tend to be located in regions with presence-absence variation (S5 Fig).

Download:

Fig 3. The pan-genome of P. oryzae.

(A) Composition of the pangenome of MAX effectors, other secreted proteins, and other genes. (B) Rarefaction analysis of the size of pan- and core-genomes. For k ranging from two to the sample size minus one, pan- and core-genome sizes were computed for 1000 random combinations of k isolates. Subsample size is represented as a fraction of the sample size (n = 120), and pan- and core-genome sizes as a fraction of maximum gene counts (reported at the center of donut plots in panel A). “core” genes are present in all isolates of a pseudo-sample of size k; “pan” qualifies genes that are not “core”. (C) Nucleotide diversity per base pair (π) in core, softcore, and shell genes. A number of data points were cropped from the nucleotide diversity plot for visually optimal presentation but included in statistical tests. In box plots, the black circle is the mean, the black line is the median. Cloud genes were not included in the nucleotide diversity plot because it was not computable due to the small sample size or lack of sequence after filtering for missing data. Shared superscripts indicate non-significant differences (Post-hoc Mann-Whitney U-tests).

https://doi.org/10.1371/journal.ppat.1011294.g003

Together, these analyses show that the MAX effector repertoire is highly plastic compared to other gene categories, both in terms of the presence/absence of orthogroups and the sequence variability within orthogroups.

MAX effector variability is structured by host plant

To investigate signatures of positive selection in the genome of P. oryzae, and identify candidate loci involved in host specificity, we first identified the divergent lineages represented in our dataset. We inferred population structure from 6780 SNPs at four-fold degenerate synonymous sites in single-copy core orthologs to minimize the potential impact of natural selection on our findings. We used complementary approaches that make no assumption about random mating or linkage equilibrium. Both clustering analyses with the sNMF software [40] (Fig 4A) and neighbor-net phylogenetic networks [41] (Fig 4B) revealed consistent patterns that split genetic variation primarily by host of origin. Although the lowest cross-entropy value was observed at K = 11 in the sNMF analysis, we chose to represent K = 8 because the cross-entropy was only slightly higher and K = 8 did not split the Triticum- and Eleusine-associated clusters (S6 Fig). Lineage-level analyses were limited to the lineages with the largest sample size, associated with rice (Oryza), foxtail millet (Setaria), wheat (Triticum), ryegrass (Lolium), and goosegrass (Eleusine).

Download:

Fig 4. Population subdivision in 120 isolates of Pyricularia oryzae.

Population subdivision was inferred from (A-B) 6780 polymorphisms at four-fold degenerate synonymous sites identified in coding sequences of single-copy core orthologs (one polymorphism without missing data randomly chosen per ortholog), (C) 130 SNPs without missing data identified in coding sequences of single-copy core MAX effectors, and (D) a table of presence/absence data (coded as 0 and 1) for all 80 MAX effector orthogroups. (A) Genetic ancestry proportions for individual isolates in K = 8 ancestral populations, as estimated using the sNMF clustering algorithm [40]; each isolate is depicted as a horizontal bar divided into K segments representing the proportion of ancestry in K = 8 inferred ancestral populations; host genus of origin is indicated on the right side. (B-D) Neighbor-net phylogenetic networks estimated with SplitsTree [41], with isolate names colored according to their host of origin.

https://doi.org/10.1371/journal.ppat.1011294.g004

Population subdivision inferred from MAX effectors using either 130 SNPs without missing data in single-copy core MAX effectors (Fig 4C) or presence/absence variation of all 80 MAX effector orthogroups (Fig 4D) revealed essentially the same groups as the analysis of the single-copy core orthologs. This indicates that genome-wide nucleotide variation, variation in MAX effector content, and nucleotide variation at MAX effectors reflected similar genealogical processes. The Oryza- and Setaria-infecting lineages displayed exceptionally high presence/absence variation of MAX effectors (average Hamming distance between pairs of isolates: 0.123 and 0.095; Fig 4D), but only limited sequence variation at single-copy core MAX effectors (average Hamming distance between pairs of isolates: 0.017 and 0.012; Mann-Whitney U-tests, p<0.05; Fig 4C).

Loss of MAX effectors in specific lineages does not appear to be associated with host specificity

The comparison of the MAX effector content in the genomes of 120 P. oryzae isolates revealed extensive presence/absence polymorphism between host-specific groups (S3 Table). To address the underlying evolutionary mechanisms, we tested experimentally the hypothesis that MAX effector losses are massively related to escape from receptor-mediated non-host resistance. Indeed, the loss of MAX effectors in specific lineages of P. oryzae could primarily serve to escape from non-host resistance during infections of novel plant species carrying immune receptors specifically recognizing these effectors. To test this hypothesis, we focused on the Oryza- and Setaria-infecting lineages, as previous investigations suggested that the Oryza-infecting lineage emerged by a host shift from Setaria and we found both groups to be closely related (Fig 4) [21, 42]. Our strategy was to introduce into the Oryza-isolate Guy11 MAX effectors absent from the Oryza lineage but present in the Setaria lineage, and to assess the ability of these transgenic isolates to infect rice.

We identified three MAX orthogroups that were largely or completely absent from the Oryza lineage, but present in the majority of isolates of the other lineages (S3 Table). Orthogroup MAX79 (OG0011591-1) was absent in all 52 Oryza-infecting isolates, while MAX83 (OG0011907), and MAX89 (OG0012141) were absent in 50 and 46 of them, respectively (S3 Table). Constructs carrying the genomic sequence of MAX79, MAX83 or MAX89 derived from the Setaria isolate US0071 and under the control of the strong infection specific promoter of the effector AVR-Pia were generated and stably introduced into Guy11. For each construct, three independent transgenic lines were selected. Transgene insertion was verified by PCR and the expression of transgenes was measured by qRT-PCR (S7 Fig). To test whether the selected MAX effectors trigger immunity in rice, the transgenic isolates were spray-inoculated onto a panel of 22 cultivars representative of the worldwide diversity of rice (S4 Table).

As controls, we used the MAX effectors AVR-Pia, which is rare outside the Oryza and Setaria lineages, and AVR1-CO39, which is absent or pseudogenized in the Oryza lineage, but present in all other host-specific lineages including Setaria. Both effectors are detected in rice by the paired NLR immune receptors RGA4 and RGA5 from the Pi-a/Pi-CO39 locus and thereby contribute, respectively, to host or non-host resistance in this plant species [43, 44].

As expected, isolates expressing AVR1-CO39 or AVR-Pia triggered resistance in the rice variety Aichi Asahi that carries Pi-a, but caused disease on Nipponbare (pi-a^-) and other varieties lacking this R locus (Fig 5 and S4 Table). Unlike the positive controls, the effectors MAX79, MAX83 and MAX89 were not recognized and did not induce resistance in any of the tested rice cultivars (Fig 5 and S4 Table). The disease symptoms caused by the transgenic isolates carrying these effectors were similar to those observed for wild-type Guy11 or Guy11 isolates carrying an RFP (red fluorescent protein) construct. This suggests that these effectors do not significantly increase the virulence of Guy11.

Download:

Fig 5. AVR1-CO39 contributes to non-host specificity in rice but not MAX79, MAX83 or MAX89.

Wild type and transgenic isolates of P. oryzae Guy11 expressing the RFP (red fluorescent protein), AVR-Pia, AVR1-CO39, MAX79, MAX83 or MAX89 gene were spray-inoculated at 40 000 spores/ml on three-week-old rice plants of the cultivars Aichi Asahi (A) and Nipponbare (B). For each condition, representative disease phenotypes on rice leaves at seven days post-inoculation are shown (top panels, R: resistance, S: susceptibility). Disease phenotypes were also scored (from 1 [complete resistance] to 6 [high susceptibility]) on leaves from three to five individual rice plants and data are shown as dot plots (bottom panels). The size of each circle is proportional to the number of replicates (n) matching the corresponding score for each condition. Small red dots correspond to individual measurements. The experiment was performed twice for Aichi Asahi and four times for Nipponbare for all isolates except for WT, AVR-Pia, and AVR1-CO39 control isolates. For these isolates, experiments were performed once on Aichi Asahi and twice on Nipponbare because disease phenotypes are well characterized in the literature.

https://doi.org/10.1371/journal.ppat.1011294.g005

These experiments show that despite their loss in the Oryza-infecting lineage of P. oryzae, and unlike AVR1-CO39, the effectors MAX79, MAX83, and MAX89 do not seem to induce non-host resistance in rice. Consequently, other mechanisms than escape from host immunity contributed to the loss of these MAX effectors during the putative host shift of P. oryzae from Setaria to Oryza.

MAX effectors display signatures of balancing selection

To investigate the impact of balancing selection on MAX effector evolution, we focused on single-copy core, softcore, and shell orthogroups to avoid the possible effect of gene paralogy. We then computed π (nucleotide diversity per bp), F_ST (the amount of differentiation among lineages [45]), π_N (non-synonymous nucleotide diversity), π_S (synonymous nucleotide diversity), and π_N/π_S (the ratio of non-synonymous to synonymous nucleotide diversity). Large values of π and π_N/π_S, in particular, are possible indicators of a gene being under balancing selection.

Nucleotide diversity (π) differed significantly between groups of genes (Kruskal-Wallis test, H = 509.9, d.f. = 2, p <0.001; Fig 6A and S5 Table). π was significantly higher for the set of MAX effectors (average π: 0.0104, standard deviation: 0.0137), than for other secreted proteins (average π: 0.0079, standard deviation: 0.020), and other genes (average π: 0.0049, standard deviation: 0.014; Mann-Whitney U-tests, p<0.05), showing that MAX effectors, and to a smaller extent other secreted proteins, are more variable than a typical gene. At the lineage level, however, nucleotide diversity at MAX effectors tended to not significantly differ from other putative effectors, or other genes (S5 Table).

Download:

Fig 6. Summary statistics of polymorphism and divergence at MAX effectors, other secreted proteins (i.e., secretome), and other genes of P. oryzae.

(A) Species-wide estimates of π (nucleotide diversity per bp), F_ST (the amount of differentiation among lineages), π_N (non-synonymous nucleotide diversity per bp), π_S (synonymous nucleotide diversity per bp), π_N/π_S (the ratio of non-synonymous to synonymous nucleotide diversity), d_N/d_S (the ratio of non-synonymous to synonymous rates of substitutions). (B) Lineage-specific estimates of π_N/π_S. (C) Amino acid changes segregating in P. oryzae at MAX effectors with an avirulence function and MoToxB (first row) and MAX effectors with π_N/π_S>2 (next rows); amino acid changes are shown in dark blue and known binding interfaces in light blue; all effectors are represented with the same orientation as AvrPiz-t in panel (B). Note that the π_N/π_S ratio is >2 for the avirulence genes AvrPiz-t and AVR-Pik. Proteins are displayed twice in panels C and E, with one copy rotated 180 degrees around a vertical axis. The interface involved in binding with host proteins is known for AVR1-CO39, AVR-Pia, and AVR-Pik only ([50–53]). (D) and (E) Species-wide estimates of π_N and d_N/d_S computed at MAX effectors with signatures of balancing selection (π_N/π_S>1; panel D) and signatures of directional selection (d_N/d_S>1; panel E) for different classes of structural features highlighted on the three-dimensional structure of AvrPiz-t above panel D: (i) secondary structure elements, with three subclasses: “coil”, “extended conformation”, and “turn”; (ii) solvent accessibility percentage of the Van der Waals surface of the amino acid side chain, with three sub-classes: 0–30% (buried), 30–60% (intermediate), and 60–100% (exposed); (iii) structural domains, with six subclasses that grouped the coil, extended conformation and turn residues that define the six beta strands characteristic of MAX effectors. In the heatmaps, each line represents a MAX effector. For a given MAX effector and a given structural feature, the darkest color indicates the class of the structural feature for which the summary statistic is the highest. Only single-copy core, softcore, and shell groups of orthologous genes were included in the calculations. Shared superscripts indicate non-significant differences (post-hoc Mann-Whitney U-tests, p>0.05). A number of data points were cropped from plots in (A) and (B) for visually optimal presentation but included in statistical tests. In box plots, the black circle is the mean, the black line is the median.

https://doi.org/10.1371/journal.ppat.1011294.g006

In addition to having greater nucleotide variation than other genes at the species level, MAX effectors also displayed a higher ratio of non-synonymous to synonymous nucleotide diversity (Fig 6B and S5 Table). The π_N/π_S ratio differed significantly between groups of genes (Kruskal-Wallis tests H = 101.4, d.f. = 2, p<0.001), and the excess of non-synonymous diversity was significantly, and markedly, higher for MAX effectors (average π_N/π_S: 1.826, standard deviation: 3.847) than for other effectors (average π_N/π_S: 0.461, standard deviation: 1.600), and other genes (average π_N/π_S: 0.448, standard deviation: 1.463; Mann-Whitney U-tests, p<0.05). The higher π_N/π_S of MAX effectors was mostly driven by differences in π_N (Fig 6A and S5 Table). Twenty MAX effectors displayed values in the top 5% percentile of non-effector genes, far exceeding the four genes expected by chance (p<0.05). More specifically, 26 MAX effectors displayed π_N/π_S values greater than 1, which is the value expected under neutrality. This included three well-known avirulence genes: AVR1-CO39 (π_N/π_S = 2.564), AVR-Pik (π_N/π_S = 15.574), and AvrPiz-t (π_N/π_S = 1.431). The average π_N/π_S ratio was also higher at MAX effectors than other secreted proteins and other genes in all lineages, with significant differences in four lineages (Mann-Whitney U-tests, p<0.05), and the average π_N/π_S was greater than one in the Oryza-infecting lineage (Fig 6B and S5 Table). Seven to eleven MAX effectors had π_N/πs>1 at the lineage level, representing 8% (Setaria-infecting lineage) to 41% (Lolium-infecting lineage) of MAX effectors with a defined π_N/πs ratio (S2 Data).

π_N/π_S>1 is a strong indication of multiallelic balancing selection (i.e., multiple alleles at multiple sites are balanced), as single sites under very strong balancing selection cannot contribute enough non-synonymous variability to push the π_N/π_S ratio above one [46]. To assess whether the adaptation of lineages to their respective hosts may contribute to the species-wide excess of non-synonymous diversity detected at MAX effectors, we estimated population differentiation. The differentiation statistic F_ST differed significantly between groups of genes (Kruskal-Wallis tests H = 8.731, d.f. = 2, p = 0.013), and differentiation was significantly higher for MAX effectors than for other secreted proteins and other genes (Fig 6A and S5 Table). F_ST was also significantly, albeit relatively weakly, correlated with π_N/π_S at MAX effectors (Spearman’s ρ: 0.304, p = 0.007; S8 Fig). These observations indicate that between-lineages differences in allele frequencies are greater for MAX effectors than for other secreted proteins or other genes, which may result from divergent selection exerted by hosts.

MAX effectors display signatures of recurrent directional selection

To detect adaptive molecular evolution, we collected orthologous sequences from outgroup Pyricularia sp. LS [47, 48] and estimated the d_N/d_S ratio (the ratio of non-synonymous to synonymous substitution rates) using a maximum likelihood method [49]. Note that d_N/d_S was not computed at the intra-specific level, but computed along the branches connecting the outgroup and isolates from the ingroup. Outgroup sequences could be retrieved for 10,174 out of 14,664 single-copy orthogroups, including 66 out of 94 single-copy orthologs of MAX effectors. The d_N/d_S ratio differed significantly between groups of genes (Kruskal-Wallis test H = 45.812, d.f. = 2, p<0.001; Fig 6A and S5 Table), and was higher for MAX effectors (average d_N/d_S: 0.977, s.d.: 1.316) than for other secreted proteins (average d_N/d_S: 0.711, s.d.: 1.722), and other genes (average d_N/d_S: 0.584, s.d.: 1.584; Mann-Whitney U-tests, p<0.05). The same pattern of higher d_N/d_S for MAX effectors was observed at the lineage level (S5 Table). Twenty-four of the 66 MAX effectors with outgroup sequence (i.e., 36.4%) showed d_N/d_S>1 (S2 Data), which is a strong indication of directional selection. d_N/d_S>1 is only expected for genes that have experienced repeated bouts of directional selection which led to repeated fixations of amino-acid substitutions [46]. Eleven MAX effectors displayed signatures of both multiallelic balancing selection (π_N/π_S>1) and multiallelic directional selection (d_N/d_S>1).

The divergence data, therefore, indicate that a scenario of molecular co-evolution involving repeated selective sweeps may apply to a substantial fraction (at least one-third) of MAX effectors.

Structural determinants of polymorphism and divergence at MAX effectors

Different parts of proteins can be under different selective forces. To investigate if this is the case in MAX effectors, we examined the relationship between three different measures of diversity and three structural features. The measures of diversity were (1) the probability of an amino acid being polymorphic, (2) the non-synonymous nucleotide diversity π_N, and (3) the d_N/d_S ratio. The analyzed structural features were: (1) secondary structure annotations with the three subclasses “extended conformation”, “coil”, and “turn”; (2) solvent accessibility percentage of the Van der Waals surface of the amino acid side chain, with the three sub-classes: 0–30% (buried), 30–60% (intermediate), and 60–100% (exposed); (3) structural domains, with six subclasses that grouped the coil, extended conformation and turn residues that define the six beta strands characteristic of MAX effectors. Structural features were determined using MAX effector structures predicted by homology modeling and computing with STRIDE.

For the relationship between the probability of residues being polymorphic and the structural features, we used a generalized linear mixed model with a set of predictor variables. The fixed effects were the structural features and the model was fitted using maximum likelihood estimation, with MAX effector modeled as a random effect. Only a single structural feature, the solvent accessibility had a statistically significant effect on the probability of amino acid polymorphism (S1 Text). Predicted probabilities of amino acid change were higher for accessibility class 60–100% (95% prediction interval: 0.1294–0.1295), than for accessibility classes 30–60% (95% prediction interval: 0.1029–0.1049) and 0–30% (95% prediction interval: 0.0834–0.0835). A major factor explaining the variability of the response variable turned out to be the identity of MAX effectors. Indeed, the random effect (σ: 1.17) had a larger standard deviation than the largest fixed-effect factor (coefficient for accessibility class 60–100%: 0.64) (S1 Text).

To visualize the localization of polymorphisms, we projected the distribution of amino acid changes on the surface of protein structure models of two types of MAX effectors: those with an avirulence function and MAX effectors with the strongest signatures of multiallelic balancing selection (π_N/π_S>2; Figs 6C and S9). For the three effectors whose binding interfaces have been experimentally characterized (AVR1-CO39, AVR-Pia, AVR-Pik [50–53]), a substantial fraction of amino acid changes co-localized with residues interacting with immune receptors and, presumably, also with their host target proteins. Polymorphic residues are, therefore, potentially good predictors for binding interfaces in MAX effectors and the specific surface regions, where polymorphisms cluster in several MAX effectors (such as MoToxB, MAX58, MAX87, MAX69, or MAX50) could correspond to interfaces that bind immune receptors and/or host target proteins (Fig 6C).

To determine which parts of the MAX structure is most responsible for the high level of standing variation in these effectors, we calculated for the three different structural features and their subclasses the non-synonymous nucleotide diversity π_N (S10 Fig)_. We restricted this analysis to the 25 MAX effectors exhibiting balancing selection (π_N/π_S >1) and we used π_N and not π_N/π_S because the latter tended to be undefined due to relatively short sequence lengths. Non-synonymous nucleotide diversity π_N differed between subclasses (Kruskal-Wallis test H = 8.504, d.f. = 2, p = 0.014; Fig 6D and S6 Table) and was higher at coils and turns, than at extended conformations (coils: π_N = 0.0191; turns: π_N = 0.0167; extended conformations: π_N = 0.0086; posthoc Mann-Whitney U-tests, p<0.05). Ten and nine MAX effectors displayed their highest values of π_N in coils and turns, respectively. π_N did not significantly differ between relative solvent accessibility subclasses (Kruskal-Wallis test H = 2.308, d.f. = 2, p = 0.315), but differences were marginally significant between structural domains (Kruskal-Wallis test H = 11.035, d.f. = 5, p = 0.051). The third, fourth, and fifth beta strands displayed the highest levels of non-synonymous diversity (π_N = 0.0341, π_N = 0.0341, and π_N = 0.0341, respectively), and 18 out of 25 MAX effectors displayed their highest values of π_N at one of these three beta strands (S6 Table).

To identify the parts of MAX effector structures that experience multiallelic directional selection, we analyzed the 23 proteins with d_N/d_S >1. This showed that differences in d_N/d_S were most pronounced between subclasses of structural features (Kruskal-Wallis test H = 5.499, d.f. = 2, p = 0.064), with higher average d_N/d_S values for coils and turns (d_N/d_S = 2.490 and d_N/d_S = 1.184, respectively) than extended conformations (d_N/d_S = 0.573) (Fig 6E and S6 Table). The average d_N/d_S was also close to one for the 30–60% subclass of relative solvent accessibility (d_N/d_S = 0.994), and 12 MAX effectors with signatures of directional selection had their highest d_N/d_S values for this subclass, although differences were not significant.

Overall, these analyses show that multiallelic balancing and directional selection acted preferentially on coils and turns, but that the impact of two forms of selection on structural domains and solvent accessibility subclasses differs.

Discussion

MAX effectors as model systems to investigate effector evolution

Effectors involved in coevolutionary interactions with host-derived molecules are expected to undergo non-neutral evolution. Yet, the role of natural selection in shaping polymorphism and divergence at effectors has remained largely elusive [2]. Despite the prediction of large and molecularly diversified repertoires of effector genes in many fungal genomes, attempts to probe into the evolutionary drivers of effector diversification in plant pathogenic fungi have been hindered by the fact that, until recently, no large effector families had been identified. In this study, we overcome the methodological and conceptual barrier imposed by effector hyper-diversity by building on our previous discovery [17] of an important, structurally-similar, but sequence-diverse family of fungal effectors called MAX. We used a combination of structural modeling, evolutionary analyses, and molecular plant pathology experiments to provide a comprehensive overview of polymorphism, divergence, gene expression, and presence/absence at MAX effectors. When analyzed species-wide or at the level of sub-specific lineages, ratios of non-synonymous to synonymous nucleotide diversity, as well as ratios of non-synonymous to synonymous substitutions, were consistently higher at MAX effectors than at other loci. At the species level, the two ratios were also significantly higher than expected under the standard neutral model for a large fraction of MAX effectors. The signatures of adaptive evolution detected at MAX effectors, combined with their extensive presence/absence variation, are consistent with their central role in coevolutionary interactions with host-derived ligands that impose strong selection on virulence effectors.

Adaptive evolution of MAX effectors

Rates of evolution determined from orthologous comparisons with outgroup sequences revealed that, for a large fraction of MAX effectors, non-synonymous changes have accumulated faster than synonymous changes. The fast rate of amino-acid change at MAX effectors is consistent with a classic arms race scenario, which entails a series of selective sweeps as new virulent haplotypes—e.g., capable of avoiding recognition by plant immune receptors that previously prevented pathogen multiplication—spread to high frequency [54, 55]. Furthermore, it is important to note that although large values of the d_N/d_S ratio provide strong evidence for directional selection, small values do not necessarily indicate the lack thereof, as d_N/d_S ratios represent the integration of genetic drift, constraint, and adaptive evolution [50][56]. Much of the adaptive changes at MAX effectors probably took place before the radiation of P. oryzae on its various hosts. However, the observation that d_N/d_S values determined from orthologous comparisons with outgroup are higher at the species level than at the sub-specific lineage level indicates that part of the signal of directional selection derives from inter-lineage amino acid differences associated with host-specialization. Our structural modeling indicates that it is preferentially “turns” and “coils”, but also residues with intermediate solvent accessibility, which often evolve at an unusually fast rate, and therefore that these are probably the residues of MAX proteins preferentially involved in coevolutionary interactions with host-derived molecules.

MAX effectors are characterized by a remarkable excess of non-synonymous polymorphism, compared to synonymous polymorphism, at the species level, but also—albeit to a lesser extent—at the sub-specific lineage level. This raises the question of how polymorphisms are maintained in the face of adaptive evolution, given that selective sweeps under a classic arms race scenario are expected to erase variation [6, 9]. Directional selection restricted to host-specific lineages—i.e., local adaptation—may contribute to the signature of multiallelic balancing selection observed at the species level. The observation of a positive correlation between π_N/π_S and the differentiation statistic F_ST, together with the fact that most MAX effectors are monomorphic at the lineage level, are consistent with a role of divergent selection exerted by hosts in the maintenance of species-wide diversity at MAX effectors. However, the finding that MAX effectors with a defined π_N/π_S at the sub-specific lineage level (i.e., MAX effectors with π_S≠0) present a higher ratio than the other genes also indicates that the adaptive evolution process is not simply one of successive selective sweeps. This is consistent with balancing selection acting at the lineage level, through which polymorphisms in MAX virulence effectors are maintained due to spatiotemporal variation in selection pressures posed by the hosts–a process known as the trench-warfare model [54]. MAX effectors can experience varying selection pressures due to differences in both arsenals of immune receptors and repertoires of virulence targets across host populations. This means that the variability of effectors can result both from their evolution to avoid detection, and from their evolution to maintain their virulence activity (e.g., by targeting one or more potentially polymorphic host proteins to suppress avirulence or basal immunity, or to manipulate other host cellular processes). Our structural modeling suggests in particular that the “coils” and “turns” are the preferred substrate of these coevolutionary interactions leading to the maintenance of elevated polymorphism at MAX virulence effectors. Mirroring the existence of hypervariable MAX effectors, we also detect a substantial proportion of effectors that show no variability, either at the species level or at the lineage level. However, the lack of variability does not necessarily mean they have no impact on virulence. It is possible that their role in virulence is associated with evolutionary constraints that restrict their variability to a limited region of sequence space. Core, monomorphic MAX effectors could be prime targets for genetically-engineered NLRs [57].

Expression kinetics of MAX effectors

Expression profiling showed that the MAX effector repertoire was induced specifically and massively during infection. Depending on the host genotype, between 64 and 78% of the MAX effectors were expressed and expression was particularly strong during the early stages of infection. These findings are consistent with previous studies that analyzed genome-wide gene expression during rice infection or specifically addressed MAX effector expression, and they reinforce the hypothesis that MAX effectors are crucial for fungal virulence and specifically involved in the biotrophic phase of infection [17, 38].

How this coordinated deployment of the MAX effectors is regulated remains largely unknown. Genome organization does not seem to be a major factor, since MAX effectors do not colocalize and more generally, there is no clustering of effectors in the P. oryzae genome, only a slight enrichment in subtelomeric regions [38, 58]. This differs from other pathogenic fungi, such as Leptosphaeria maculans, for which early-expressed effectors are clustered in AT-rich isochores, and co-regulated by epigenetic mechanisms [59]. Analysis of promoter regions of MAX effectors did not identify common DNA motifs that may be targeted by transcription factors, and no such transcriptional regulators that would directly regulate large fractions of the effector complement of P. oryzae have been identified yet. The few known transcriptional networks controlled by regulators of P. oryzae pathogenicity generally comprise different classes of fungal virulence genes, such as secondary metabolism genes or carbohydrate-active enzymes; they are not restricted to effectors. Recently, it was shown that Rgs1, a regulator of G-protein signaling necessary for appressorium development, represses the expression of 60 temporally co-regulated effectors in axenic culture and during the pre-penetration stage of plant infection [60]. Of these, six belong to the MAX family and their expression is affected in cer7 and ∆rgs1 mutants: MGG_1004T0 [15, 38], MGG_15443T0 [16, 38], MGG_08817T0 [15], MGG_17266T0 [15–17, 38] and MAX15 (MGG_05424T0) and MAX67 (MGG_16175T0), both identified in our study. This represents only 5% of the MAX effectors predicted to date (S2 Table) and suggests that multiple complementary mechanisms contribute to the precise coordination of MAX effector expression during rice invasion.

Expression profiling also revealed that the plant host genotype strongly influenced the expression of the MAX effector repertoire, suggesting that plasticity in effector expression may contribute to the adaptation of P. oryzae to its hosts. MAX effectors were stronger expressed in the more resistant Kitaake rice variety than in highly susceptible Maratelli rice. This is reminiscent of other pathogenic fungi, such as Fusarium graminearum and L. maculans, for which a relationship between host resistance levels and effector expression was established [61, 62]. An expression analysis of MAX effectors in isolates infecting a wider range of host plants with varying resistance levels could be conducted to further investigate the connection between plant resistance and MAX effectors’ expression.

Presence/Absence polymorphism of MAX effectors

Pangenome analyses demonstrated extensive variability in the MAX effector repertoire. In cases where MAX effectors are specifically absent from some lineages, but present in most or all others, it is tempting to hypothesize that they experienced immune-escape loss-of-function mutations that directly contributed to host range expansion or host shifts. A possible example of such a mechanism is the non-MAX effector PWT3 of P. oryzae that is specifically absent from the Triticum-infecting lineage [29]. PWT3 triggers resistance in wheat cultivars possessing the RWT3 resistance gene [63], and its loss was shown to coincide with the widespread deployment of RWT3 wheat. Similarly, the loss of the effector AVR1-CO39 (MAX86), which is specifically absent from the Oryza-infecting lineage and that is detected by the rice NLR immune receptor complex RGA4/RGA5, has been suggested to have contributed to the initial colonization of rice by the Setaria-infecting lineage [20, 31, 64]. Two other orthologous P. oryzae effectors, PWL1 and PWL2, exclude Eleusine and rice-associated isolates from infecting Eragrostis curvula, and can, therefore, also be considered as host-specificity determinants [28, 65]. Interestingly, Alphafold predicts PWL2 to adopt a MAX effector fold [66]. In our study, however, gene knock-in experiments with MAX79, MAX83, and MAX89—specifically absent from the Oryza-infecting lineage—did not reveal a strong effect on virulence towards a large panel of rice varieties. Hence, unlike AVR1-CO39, these effectors are not key determinants of host-specificity. This suggests that overcoming non-host resistance is not the only and maybe not the main evolutionary scenario behind the specific loss of MAX effectors in the Oryza-infecting lineage. A possible alternative mechanism that can explain massive MAX effector loss during host shifts is a lack of functionality in the novel host. Some MAX effectors from a given lineage may have no function in the novel host, simply because their molecular targets are absent or too divergent in the novel host. Cellular targets of fungal effectors remain unknown for the most part, but knowledge of the molecular interactors of MAX effectors may help shed light on the drivers of their presence/absence polymorphism.

Concluding remarks

The discovery of large, structurally-similar, effector families in pathogenic fungi and the increasing availability of high-quality whole genome assemblies and high-confidence annotation tools, pave the way for in-depth investigations of the evolution of fungal effectors by interdisciplinary approaches combining state-of-the-art population genomics, protein structure analysis, and functional approaches. Our study on MAX effectors in the model fungus and infamous cereal killer P. oryzae demonstrates the power of such an approach. Our investigations reveal the fundamental role of directional and balancing selection in shaping the diversity of MAX effector genes and pinpoint specific positions in the proteins that are targeted by these evolutionary forces. This type of knowledge is still very limited on plant pathogens, and there are very few studies compared to the plethoric literature on the evolution of virulence factors in human pathogens. Moreover, by revealing the concerted and plastic deployment of the MAX effector repertoire, our study highlights the current lack of knowledge on the regulation of these processes. A major challenge will now be to identify the regulators, target proteins and mode of action of MAX effectors, in order to achieve a detailed understanding of the relationships between the structure, function and evolution of these proteins.

Methods

Genome assemblies, gene prediction, and pan-genome analyses

Among the 120 genome assemblies included in our study, 66 were already assembled and publicly available, and 54 were newly assembled (S1 Table). For the 54 newly assembled genomes, reads were publicly available for 50 isolates, and four additional isolates were sequenced (available under BioProject PRJEB47684). For the four sequenced isolates, DNA was extracted using the same protocol as in ref. [67]. TruSeq nano kits were used to prepare DNA libraries with insert size of ~500bp for 150 nucleotide paired-end indexed sequencing with Illumina HiSeq 3000. For the 54 newly generated assemblies, cutadapt [68] was used for trimming and removing low-quality reads, reads were assembled with ABySS 1.9.0 [69] using eight different K-mer sizes, and we chose the assembly produced with the K-mer size that yielded the largest N50. For all 120 genome assemblies, genes were predicted by Braker 1 [70] using RNAseq data from ref. [21] and protein sequences of isolate 70–15 (Ensembl Fungi release 43). To complement predictions from Braker, we also predicted genes using Augustus 3.4.0 [56] with RNAseq data from ref. [21], protein sequences of isolate 70–15 (Ensembl Fungi release 43), and Magnaporthe grisea as the training set. Gene predictions from Braker and Augustus were merged by removing the genes predicted by Augustus that overlapped with genes predicted by Braker. Repeated elements were masked with RepeatMasker 4.1.0 (http://www.repeatmasker.org/). The quality of genome assembly and gene prediction was checked using BUSCO 4.0.4 [32]. The homology relationships among predicted genes were identified using OrthoFinder v2.4.0 [36]. The size of pan- and core-genomes was estimated using rarefaction, by resampling combinations of one to 119 genomes, taking a maximum of 100 resamples by pseudo-sample size. Sequences for each orthogroup were aligned at the codon level (i.e., keeping sequences in coding reading frame) with TranslatorX 1.1 [71], using MAFFT v7 [72] as the aligner and default parameters for Gblocks 0.91b [73]. The effect of assembly properties, host of origin, and study of origin on the number of predicted genes computed from the orthology table was analyzed in python 3.7 using the function pearsonr in scipy.stats 1.10.1, and functions formula.api.ols and stats.anova.anova_lm in statsmodels 0.15.0.

Identification of effectors sensu lato, and MAX effectors

We predicted the secretome by running SignalP 4.1 [74], targetP 1.1 [75], and phobius 1.01 [62] to identify signal peptides in the translated coding sequences of 12000 orthogroups. Only proteins predicted to be secreted by at least two methods were retained. Transmembrane domains were identified using TMHMM [76] and proteins with a transmembrane domain outside the 30 first amino acids were excluded from the predicted secretome. Endoplasmic reticulum proteins were identified with PS-Scan (https://ftp.expasy.org/databases/prosite/ps_scan/), and excluded.

To identify MAX effectors, we used the same approach as in the original study that described MAX effectors [17]. We first used PSI-Blast 2.6.0 [33] to search for homologs of known MAX effectors (AVR1-CO39, AVR-Pia, AvrPiz-t, AVR-PikD, and ToxB) in the predicted secretome. Significant PSI-Blast hits (e-value < e-4) were aligned using a structural alignment procedure implemented in TM-align [35]. Three rounds of HMMER [34] searches were then carried out, each round consisting of alignment using TM-align version 20140601, model building using hmmbuild, and HMM search using hmmsearch (e-value < e-3). Only proteins with two expected conserved cysteines less than 33–48 amino acids apart were retained in the first two rounds of HMMER searches, as described in ref. [17].

Subsequent evolutionary analyses were conducted on three sets of orthogroups: MAX effectors, putative effectors, and other genes. The “MAX” group corresponded to 80 orthogroups for which at least 10% of sequences were identified as MAX effectors. The “secreted proteins” groups corresponded to 3283 orthogroups that were not included in the MAX group, and for which at least 10% of sequences were predicted to be secreted proteins. The last group included the remaining 11404 orthogroups.

For missing MAX effector sequences, we conducted an additional similarity search to correct for gene prediction errors. For a given MAX orthogroup and a given isolate, if a MAX effector was missing, we used Blast-n to search for significant hits using the longest sequence of the orthogroup as the query sequence, and the isolate’s genome assembly as the subject sequence (S3 Table). We also corrected annotation errors, such as the presence of very short (typically <50bp) or very long (typically >500bp) introns, missing terminal exons associated with premature stops, or frameshifts caused by indels. All these annotation errors were checked, and corrected manually if needed, using the RNAseq data used in gene prediction in the Integrative Genome Viewer [77, 78]. We also found that some orthogroups included chimeric genes resulting from the erroneous merging of two genes that were adjacent in assemblies. This was the case for orthogroups OG0000093 and OG0010985, and we used RNA-seq data in the Integrative Genome Viewer to split the merged genic sequences and keep only the sequence corresponding to a MAX effector.

For evolutionary analyses conducted on single-copy orthologs, the 11 orthogroups that included paralogous copies of MAX effectors were split into sets of orthologous sequences using genealogies inferred using RAxML v8 [37], yielding a total of 94 single-copy MAX orthologs, of which 90 orthologs passed our filters on length and sample size to be included in evolutionary analyses (see below). For each split orthogroup, sets of orthologous sequences were assigned a number that was added to the orthogroup’s identifier as a suffix (for instance paralogous sequences of orthogroup OG0000244 were split into orthogroups OG0000244_1 and OG0000244_2). Sequences were re-aligned using TranslatorX (see above) after splitting orthogroups.

All genome assemblies, gene models, aligned coding sequences for all orthogroups, and single-copy orthologs, are available in Zenodo, doi: 10.5281/zenodo.7689273 and doi: 10.5281/zenodo.8052494.

Analysis of population subdivision

Population structure was inferred from SNPs identified in Gblocks-cleaned alignments of coding sequences at 7317 single-copy core orthologs (described in section Genome assemblies, gene prediction, and pan-genome analyses). We kept only one randomly chosen four-fold degenerate synonymous site per single-copy core ortholog. We used the sNMF method from the LEA package in R [40] to infer individual ancestry coefficients in K ancestral populations. We used Splitstree version 5.3 [41] to visualize relationships between genotypes in a phylogenetic network, with reticulations to represent the conflicting phylogenetic signals caused by homoplasy.

Homology modeling of MAX effectors

To check that orthogroups predicted to be MAX effectors had the typical 3D structure of MAX effectors with two beta sheets of three beta strands each, eight experimental structures with MAX-like folds were selected as 3D templates for homology modeling (PDB identifiers of the templates: 6R5J, 2MM0, 2MM2, 2MYW, 2LW6, 5A6W, 5Z1V, 5ZNG). For each of the 94 MAX orthologous groups, one representative protein was selected and homology models of this 1D query relative to each 3D template were built using Modeller [79] with many alternative query-template threading alignments. The structural models generated using the alternative alignments were evaluated using a combination of six structural scores (DFIRE [80], GOAP [81], and QMEAN’s E_1D, E_2D, E_3D scores [82]). A detailed description of the homology modeling procedure is provided in S2 Text. The best structural models for the 94 representative sequences of each group of MAX orthologs are available at https://pat.cbs.cnrs.fr/magmax/model/. The correspondence between MAX orthogroups identifiers used in homology modeling and MAX orthogroups identifiers resulting from gene prediction is given in S2 Table. Protein models were visualized with pymol 2.5 [83].

Evolutionary analyses

Lineage-level analyses were conducted on a dataset from which divergent or introgressed isolates were removed (G17 from Eragrostis, Bm88324 & Bd8401 from Setaria, 87–120; BF0072 and BN0019 from Oryza; IR0088 from Echinochloa), to limit the impact of population subdivision within lineages. The Stenotaphrum-infecting lineage was not included in lineage-level analyses due to the small sample size.

Nucleotide diversity [84], synonymous and non-synonymous nucleotide diversity, and population differentiation [45] were estimated using Egglib v3 [85] using classes ComputeStats and CodingDiversity. Sites with more than 30% missing data were excluded. Orthogroups with less than 10 sequences (nseff<10, nseff being the average number of used samples among sites that passed the missing data filter) or shorter than 30bp (lseff<30, lseff being the number of sites used for analysis after filtering out sites with too many missing data) were excluded from computations. For analyses of polymorphism at secondary structure annotations, the cutoff on lseff was set at 10bp.

For the computation of d_N/d_S and quantification of adaptive evolution, we used isolate NI919 of Pyricularia sp. LS [47, 48] as the outgroup (assembly GCA_004337975.1, European Nucleotide Archive). Genes were predicted in the outgroup assembly using Exonerate v2.2 coding2genome [86]. For each gene, the query sequence was a P. oryzae sequence randomly selected among sequences with the fewest missing data. In parsing Exonerate output, we selected the sequence with the highest score, with a length greater than half the length of the query sequence.

The d_N/d_S ratio was estimated using a maximum likelihood approach (runmode = -2, CodonFreq = 2 in codeml [87]), in pairwise comparisons of protein coding sequences (i.e., without using a phylogeny). For each d_N/d_S we randomly selected 12 ingroup sequences and computed the average d_N/d_S across the 12 ingroup/outgroup pairs.

Kruskal-Wallis tests were performed using the scipy.stats.kruskal library in python 3.7. Posthoc Mann-Whitney U-tests were performed using the scikit_posthocs library in python 3.7, with p-values adjusted using the Bonferroni-Holm method.

Amino acid change data was modeled using a binomial generalized linear mixed model with function glmer with package lme4 version 1.1–32 in R version 4.1.2. Interactions between predictor variables were not significant and thus not included in the model presented in the Results section.

Constructs for the transformation of fungal isolates

PCR products used for cloning were generated using the Phusion High-Fidelity DNA Polymerase (Thermo Fisher) and the primers listed in S7 Table. Details of the constructs are given in S8 Table. Briefly, the pSC642 plasmid (derived from the pCB1004 vector), containing a cassette for the expression of a gene of interest under the control of the AVR-Pia promoter (pAVR-Pia) and the Neurospora crassa β-tubulin terminator (t-tub), was amplified by PCR with primers oML001 and oTK609 for the insertion of MAX genes listed in S9 Table. The MAX genes Mo_US0071_000070 (MAX79), Mo_US0071_046730 (MAX89) and Mo_US0071_115900 (MAX83), amplified by PCR from genomic DNA of the P. oryzae isolate US0071, were inserted into this vector using the Gibson Assembly Cloning Kit (New England BioLabs). The final constructs were linearized using the KpnI restriction enzyme (Promega) before P. oryzae transformation.

Plant and fungal growth conditions

Rice plants (Oryza sativa) were grown in a glasshouse in a substrate of 31% coconut peat, 30% Baltic blond peat, 15% Baltic black peat, 10% perlite, 9% volcanic sand, and 5% clay, supplemented with 3.5 g.L^-1 of fertilizer (Basacote High K 6M, NPK 13-5-18). Plants were grown under a 12h-light photoperiod with a day-time temperature of 27°C, night-time temperature of 21°C, and 70% humidity. For spore production, the wild-type and transgenic isolates of P. oryzae Guy11 were grown for 14 days at 25°C under a 12h-light photoperiod on rice flour agar medium (20 g.L⁻¹ rice seed flour, 2.5 g.L⁻¹ yeast extract, 1.5% agar, 500.000U penicillin g), supplemented with 240 μg.ml⁻¹ hygromycin for transgenic isolates. For mycelium production, plugs of mycelium of P. oryzae Guy11 were grown in liquid medium (10 g.L⁻¹ glucose, 3 g.L⁻¹ KNO₃, 2 g.L⁻¹ KH₂PO₄, 2,5 g.L⁻¹ yeast extract, 500 000U penicillin g) for 5 days at 25°C in the dark under agitation.

Fungal transformation

Protoplasts from the isolate Guy11 of P. oryzae were transformed by heat shock with 10μg of KpnI-linearized plasmids for the expression of MAX effectors or RFP as described previously [88]. After two rounds of antibiotic selection and isolation of monospores, transformed isolates were genotyped by Phire Plant Direct PCR (Thermo Scientific) using primers described in S7 Table. The Guy11 transgenic isolates expressing AVR-Pia and AVR1-CO39 were previously generated [50, 89].

Fungal growth and infection assays

For the analysis of interaction phenotypes, leaves of three-week-old rice plants were spray-inoculated with conidial suspensions (40 000 conidia.ml^-1 in water with 0.5% gelatin). Plants were incubated for 16 hours in the dark at 25°C and 95% relative humidity, and then grown for six days in regular growth conditions. Seven days after inoculation, the youngest leaf that was fully expanded at the time of inoculation was collected and scanned (Scanner Epson Perfection V370) for further symptoms analyses. Phenotypes were qualitatively classified according to lesion types: no lesion or small brown spots (resistance), small lesions with a pronounced brown border and a small gray center (partial resistance), and larger lesions with a large gray center or dried leaves (susceptibility). For the analysis of gene expression, plants were spray-inoculated with conidial suspensions at 50 000 conidia.ml^-1 (in water with 0.5% gelatin), and leaves were collected three days after inoculation.

RNA extraction and qRT-PCR analysis

Total RNA extraction from rice leaves or Guy11 mycelium and reverse transcription were performed as described by ref. [90]. Briefly, frozen leaves and mycelium were mechanically ground. RNA was extracted using TRI-reagent (Sigma-Aldrich) and chloroform separation. Denaturated RNA (5μg) was retrotranscribed and used for quantitative PCR using GoTaq qPCR Master Mix according to the manufacturer’s instructions (Promega) at a dilution of 1/10 for mycelium and 1/7 for rice leaves. The primers used are described in S7 Table. Amplification was performed as described by ref. [90] using a LightCycler480 instrument (Roche), and data were extracted using the instrument software. To calculate MAX gene expressions, the 2^-ΔΔCT method and primers measured efficiency were used. Gene expression levels are expressed relative to the expression of constitutive reference gene MoEF1α.

Statistical analyses of phenotypic data

For expression comparison between Kitaake and Maratelli infection, all analyses were performed using R (www.r-project.org). The entire kinetic experiment was repeated three times with five biological replicates for each time point. For each variety, gene, and experimental replicate, values corresponding to the day post-inoculation with the highest median expression were extracted for statistical analyses. Expression data were not normally distributed so for each gene, differences between varieties were evaluated using non-parametric Mann-Whitney U-tests.

Supporting information

S1 Table. Genomic assemblies with metadata.

https://doi.org/10.1371/journal.ppat.1011294.s001

(XLSX)

S2 Table. Nomenclature of MAX effectors predicted in this study and in previous reports.

https://doi.org/10.1371/journal.ppat.1011294.s002

(XLSX)

S3 Table. Presence/absence of MAX effector orthologs.

https://doi.org/10.1371/journal.ppat.1011294.s003

(XLSX)

S4 Table. The expression of MAX79, MAX83 and MAX89 in Guy11 does not trigger recognition in a panel of rice varieties.

https://doi.org/10.1371/journal.ppat.1011294.s004

(XLSX)

S5 Table. Gene average of summary statistics of polymorphism, differentiation and divergence.

https://doi.org/10.1371/journal.ppat.1011294.s005

(DOCX)

S6 Table. π_N and d_N/d_S in different classes of secondary structure annotations for MAX effectors with π_N/π_S>1 and d_N/d_S>1, respectively.

https://doi.org/10.1371/journal.ppat.1011294.s006

(DOCX)

S7 Table. Primers for cloning and expression analyses.

https://doi.org/10.1371/journal.ppat.1011294.s007

(XLSX)

S8 Table. Vector constructs.

https://doi.org/10.1371/journal.ppat.1011294.s008

(XLSX)

S9 Table. Sequences of the MAX effectors in the isolate US0071 that were used for the complementation of Guy11.

https://doi.org/10.1371/journal.ppat.1011294.s009

(XLSX)

S1 Fig. Effect of assembly properties on the number of genes.

https://doi.org/10.1371/journal.ppat.1011294.s010

(DOCX)

S2 Fig. Expression patterns of MAX effectors during rice infection.

https://doi.org/10.1371/journal.ppat.1011294.s011

(DOCX)

S3 Fig. Differential expression levels of MAX effectors upon infection of two different rice cultivars.

https://doi.org/10.1371/journal.ppat.1011294.s012

(DOCX)

S4 Fig. Nucleotide diversity (π), ratio of non-synonymous to synonymous nucleotide diversity (π_N/π_S), orthogroup frequency for MAX effectors, other secreted proteins, and other genes.

https://doi.org/10.1371/journal.ppat.1011294.s013

(DOCX)

S5 Fig. Frequency of MAX effector orthogroups as a function of the frequency of the adjacent orthogroups in the genome.

https://doi.org/10.1371/journal.ppat.1011294.s014

(DOCX)

S6 Fig. Analyses of population subdivision with sNMF.

https://doi.org/10.1371/journal.ppat.1011294.s015

(DOCX)

S7 Fig. MAX79, MAX83 and MAX89 are expressed in the transgenic Guy11 isolates upon rice inoculation.

https://doi.org/10.1371/journal.ppat.1011294.s016

(DOCX)

S8 Fig. F_ST versus π_N/π_S at MAX effectors.

https://doi.org/10.1371/journal.ppat.1011294.s017

(DOCX)

S9 Fig. Amino acid changes segregating in P. oryzae at MAX effectors with an avirulence function and MoToxB (first row), and MAX effectors with π_N/π_S>2 (next rows); amino acid changes are shown in dark blue and known binding interfaces in light blue.

Note that the π_N/π_S ratio is >2 for the avirulence genes AvrPiz-t and AVR-Pik. Proteins are displayed twice, with one copy rotated 180 degrees around a vertical axis. The interface involved in binding with host proteins is known for AVR1-CO39, AVR-Pia, and AVR-Pik only ([50–53]).

https://doi.org/10.1371/journal.ppat.1011294.s018

(DOCX)

S10 Fig. Secondary structure annotations of MAX effectors aligned with TM-ALIGN.

https://doi.org/10.1371/journal.ppat.1011294.s019

(TXT)

S1 Data. Summary statistics per orthogroup.

https://doi.org/10.1371/journal.ppat.1011294.s020

(XLSX)

S2 Data. Summary statistics per MAX effector ortholog, species wide, and per lineage.

https://doi.org/10.1371/journal.ppat.1011294.s021

(XLSX)

S3 Data. Structural properties and polymorphism of amino acids in MAX effectors.

https://doi.org/10.1371/journal.ppat.1011294.s022

(TXT)

S1 Text. Fitting a generalized linear model to amino acid polymorphism data.

https://doi.org/10.1371/journal.ppat.1011294.s023

(PDF)

S2 Text. Homology modeling procedure.

https://doi.org/10.1371/journal.ppat.1011294.s024

(DOCX)

References

1. Schulze-Lefert P, Panstruga R. A molecular evolutionary concept connecting nonhost resistance, pathogen host range, and pathogen speciation. Trends in Plant Science. 2011;16(3):117–25. pmid:21317020
- View Article
- PubMed/NCBI
- Google Scholar
2. Sánchez-Vallet A, Fouché S, Fudal I, Hartmann FE, Soyer JL, Tellier A, et al. The genome biology of effector gene evolution in filamentous plant pathogens. Annual review of phytopathology. 2018;56:21–40. pmid:29768136
- View Article
- PubMed/NCBI
- Google Scholar
3. Haldane JBS. Disease and evolution. Ricerca Scient 1949;19:68–76.
- View Article
- Google Scholar
4. Flor HH. Inheritance of pathogenicity in Melampsora lini. Phytopathology. 1942;32:653–69.
- View Article
- Google Scholar
5. Barrett JA. Frequency-dependent selection in plant-fungal interactions. Philosophical Transactions of the Royal Society of London B, Biological Sciences. 1988;319(1196):473–83.
- View Article
- Google Scholar
6. Bergelson J, Kreitman M, Stahl EA, Tian D. Evolutionary dynamics of plant R-genes. Science. 2001;292(5525):2281–5. pmid:11423651
- View Article
- PubMed/NCBI
- Google Scholar
7. Brown JKM. Chance and selection in the evolution of barley mildew. Trends in Microbiology. 1994;2(12):470–5. 29. pmid:7889322
- View Article
- PubMed/NCBI
- Google Scholar
8. Brown JKM. Durable resistance of crops to disease: a Darwinian perspective. Annual review of phytopathology. 2015;53:513–39. pmid:26077539
- View Article
- PubMed/NCBI
- Google Scholar
9. Stahl EA, Dwyer G, Mauricio R, Kreitman M, Bergelson J. Dynamics of disease resistance polymorphism at the Rpm1 locus of Arabidopsis. Nature. 1999;400(6745):667–71. pmid:10458161
- View Article
- PubMed/NCBI
- Google Scholar
10. Gladieux P, van Oosterhout C, Fairhead S, Jouet A, Ortiz D, Ravel S, et al. Extensive immune receptor repertoire diversity in disease-resistant rice landraces. bioRxiv. 2022:2022–12.
- View Article
- Google Scholar
11. Bakker EG, Toomajian C, Kreitman M, Bergelson J. A genome-wide survey of R gene polymorphisms in Arabidopsis. The Plant cell. 2006;18(8):1803–18. pmid:16798885
- View Article
- PubMed/NCBI
- Google Scholar
12. Bakker EG, Traw MB, Toomajian C, Kreitman M, Bergelson J. Low levels of polymorphism in genes that control the activation of defense response in Arabidopsis thaliana. Genetics. 2008;178(4):2031–43. pmid:18245336
- View Article
- PubMed/NCBI
- Google Scholar
13. Ebert D, Fields PD. Host–parasite co-evolution and its genomic signature. Nature Reviews Genetics. 2020;21(12):754–68. pmid:32860017
- View Article
- PubMed/NCBI
- Google Scholar
14. Ebbole DJ, Chen M, Zhong Z, Farmer N, Zheng W, Han Y, et al. Evolution and Regulation of a Large Effector Family of Pyricularia oryzae. Molecular Plant-Microbe Interactions. 2021;34(3):255–69. pmid:33211639
- View Article
- PubMed/NCBI
- Google Scholar
15. Seong K, Krasileva KV. Computational structural genomics unravels common folds and novel families in the secretome of fungal phytopathogen Magnaporthe oryzae. Molecular Plant-Microbe Interactions. 2021;34(11):1267–80. pmid:34415195
- View Article
- PubMed/NCBI
- Google Scholar
16. Seong K, Krasileva KV. Prediction of effector protein structures from fungal phytopathogens enables evolutionary analyses. Nature Microbiology. 2023;8(1):174–87. pmid:36604508
- View Article
- PubMed/NCBI
- Google Scholar
17. de Guillen K, Ortiz-Vallejo D, Gracy J, Fournier E, Kroj T, Padilla A. Structure analysis uncovers a highly diverse but structurally conserved effector family in phytopathogenic fungi. PLoS Pathog. 2015;11(10):e1005228. pmid:26506000
- View Article
- PubMed/NCBI
- Google Scholar
18. Savary S, Willocquet L, Pethybridge SJ, Esker P, McRoberts N, Nelson A. The global burden of pathogens and pests on major food crops. Nature ecology & evolution. 2019;3(3):430. pmid:30718852
- View Article
- PubMed/NCBI
- Google Scholar
19. Fernandez J, Orth K. Rise of a cereal killer: the biology of Magnaporthe oryzae biotrophic growth. Trends in microbiology. 2018;26(7):582–97. pmid:29395728
- View Article
- PubMed/NCBI
- Google Scholar
20. Couch BC, Fudal I, Lebrun M-H, Tharreau D, Valent B, van Kim P, et al. Origins of Host-Specific Populations of the Blast Pathogen Magnaporthe oryzae in Crop Domestication With Subsequent Expansion of Pandemic Clones on Rice and Weeds of Rice. Genetics. 2005;170(2):613–30. 61.
- View Article
- Google Scholar
21. Pordel A, Ravel S, Charriat F, Gladieux P, Cros-Arteil S, Milazzo J, et al. Tracing the origin and evolutionary history of Pyricularia oryzae infecting maize and barnyard grass. Phytopathology. 2021;111(1):128–36. pmid:33100147
- View Article
- PubMed/NCBI
- Google Scholar
22. Kato H, Yamamoto M, Yamaguchi-Ozaki T, Kadouchi H, Iwamoto Y, Nakayashiki H, et al. Pathogenicity, mating ability and DNA restriction fragment length polymorphisms of Pyricularia populations isolated from Gramineae, Bambusideae and Zingiberaceae plants. Journal of General Plant Pathology. 2000;66:30–47.
- View Article
- Google Scholar
23. Urashima AS, Igarashi S, Kato H. Host range, mating type, and fertility of Pyricularia grisea from wheat in Brazil. Plant Disease. 1993;77(12):1211–6.
- View Article
- Google Scholar
24. Igarashi S. Pyricularia em trigo. 1. Ocorrencia de Pyricularia sp noestado do Parana. Fitopatol Bras. 1986;11:351–2.
- View Article
- Google Scholar
25. Milazzo J, Pordel A, Ravel S, Tharreau D. First scientific report of Pyricularia oryzae causing gray leaf spot disease on perennial ryegrass (Lolium perenne) in France. Plant Disease. 2019;103(5):1024–.
- View Article
- Google Scholar
26. Islam MT, Croll D, Gladieux P, Soanes DM, Persoons A, Bhattacharjee P, et al. Emergence of wheat blast in Bangladesh was caused by a South American lineage of Magnaporthe oryzae. BMC biology. 2016;14(1):84. pmid:27716181
- View Article
- PubMed/NCBI
- Google Scholar
27. Gladieux P, Condon B, Ravel S, Soanes D, Maciel JLN, Nhani A, et al. Gene Flow between Divergent Cereal- and Grass-Specific Lineages of the Rice Blast Fungus Magnaporthe oryzae. mBio. 2018;9(1). pmid:29487238
- View Article
- PubMed/NCBI
- Google Scholar
28. Sweigard JA, Carroll AM, Kang S, Farrall L, Chumley FG, Valent B. Identification, Cloning, and Characterization of PWL2, a Gene for Host Species Specificity in the Rice Blast Fungus. The Plant cell. 1995;7(8):1221–33. 260.
- View Article
- Google Scholar
29. Inoue Y, Vy TTP, Yoshida K, Asano H, Mitsuoka C, Asuke S, et al. Evolution of the wheat blast fungus through functional losses in a host specificity determinant. Science. 2017;357(6346):80–3. pmid:28684523
- View Article
- PubMed/NCBI
- Google Scholar
30. Asuke S, Tanaka M, Hyon G-S, Inoue Y, Vy TTP, Niwamoto D, et al. Evolution of an Eleusine-specific subgroup of Pyricularia oryzae through a gain of an avirulence gene. Molecular Plant-Microbe Interactions. 2020;33(2):153–65. pmid:31804154
- View Article
- PubMed/NCBI
- Google Scholar
31. Zheng Y, Zheng W, Lin F, Zhang Y, Yi Y, Wang B, et al. AVR1-CO39 is a predominant locus governing the broad avirulence of Magnaporthe oryzae 2539 on cultivated rice (Oryza sativa L.). Molecular plant-microbe interactions. 2011;24(1):13–7. pmid:20879839
- View Article
- PubMed/NCBI
- Google Scholar
32. Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210–2. pmid:26059717
- View Article
- PubMed/NCBI
- Google Scholar
33. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic acids research. 1997;25(17):3389–402. pmid:9254694
- View Article
- PubMed/NCBI
- Google Scholar
34. Finn RD, Clements J, Eddy SR. HMMER web server: interactive sequence similarity searching. Nucleic acids research. 2011;39(suppl_2):29–37. pmid:21593126
- View Article
- PubMed/NCBI
- Google Scholar
35. Zhang Y, Skolnick J. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic acids research. 2005;33(7):2302–9. pmid:15849316
- View Article
- PubMed/NCBI
- Google Scholar
36. Emms DM, Kelly S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome biology. 2015;16(1):157. pmid:26243257
- View Article
- PubMed/NCBI
- Google Scholar
37. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3. pmid:24451623
- View Article
- PubMed/NCBI
- Google Scholar
38. Yan X, Tang B, Ryder LS, MacLean D, Were VM, Eseola AB, et al. The transcriptional landscape of plant infection by the rice blast fungus Magnaporthe oryzae reveals distinct families of temporally co-regulated and structurally conserved effectors. The Plant cell. 2023;35(5):1360–85. pmid:36808541
- View Article
- PubMed/NCBI
- Google Scholar
39. Fay JC, Wu C-I. Sequence divergence, functional constraint, and selection in protein evolution. Annual review of genomics and human genetics. 2003;4(1):213–35. pmid:14527302
- View Article
- PubMed/NCBI
- Google Scholar
40. Frichot E, Mathieu F, Trouillon T, Bouchard G, François O. Fast and efficient estimation of individual ancestry coefficients. Genetics. 2014;196(4):973–83. pmid:24496008
- View Article
- PubMed/NCBI
- Google Scholar
41. Huson DH, Bryant D. Application of Phylogenetic Networks in Evolutionary Studies. Molecular Biology and Evolution. 2006;23(2):254–67. pmid:16221896
- View Article
- PubMed/NCBI
- Google Scholar
42. Tosa Y, Osue J, Eto Y, Oh H-S, Nakayashiki H, Mayama S, et al. Evolution of an avirulence gene, AVR1-CO39, concomitant with the evolution and differentiation of Magnaporthe oryzae. Molecular Plant-Microbe Interactions. 2005;18(11):1148–60. pmid:16353550
- View Article
- PubMed/NCBI
- Google Scholar
43. Cesari S, Thilliez G, Ribot C, Chalvon V, Michel C, Jauneau A, et al. The rice resistance protein pair RGA4/RGA5 recognizes the Magnaporthe oryzae effectors AVR-Pia and AVR1-CO39 by direct binding. The Plant cell. 2013;25(4):1463–81. Epub 2013/04/04. pmid:23548743; PubMed Central PMCID: PMC3663280.
- View Article
- PubMed/NCBI
- Google Scholar
44. Okuyama Y, Kanzaki H, Abe A, Yoshida K, Tamiru M, Saitoh H, et al. A multifaceted genomics approach allows the isolation of the rice Pia-blast resistance gene consisting of two adjacent NBS-LRR protein genes. The Plant Journal. 2011;66(3):467–79. pmid:21251109
- View Article
- PubMed/NCBI
- Google Scholar
45. Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38:1358–70. 283. pmid:28563791
- View Article
- PubMed/NCBI
- Google Scholar
46. Hahn MW. Molecular population genetics: Oxford University Press; 2018.
47. Hirata K, Kusaba M, Chuma I, Osue J, Nakayashiki H, Mayama S, et al. Speciation in Pyricularia inferred from multilocus phylogenetic analysis. Mycological Research. 2007;111(7):799–808. pmid:17656080
- View Article
- PubMed/NCBI
- Google Scholar
48. Gómez Luciano LB, Tsai IJ, Chuma I, Tosa Y, Chen Y-H, Li J-Y, et al. Blast fungal genomes show frequent chromosomal changes, gene gains and losses, and effector gene turnover. Molecular Biology and Evolution. 2019;36(6):1148–61. pmid:30835262
- View Article
- PubMed/NCBI
- Google Scholar
49. Yang Z. Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Molecular biology and evolution. 1998;15(5):568–73. pmid:9580986
- View Article
- PubMed/NCBI
- Google Scholar
50. Ortiz D, De Guillen K, Cesari S, Chalvon V, Gracy J, Padilla A, et al. Recognition of the Magnaporthe oryzae effector AVR-Pia by the decoy domain of the rice NLR immune receptor RGA5. The Plant cell. 2017;29(1):156–68. pmid:28087830
- View Article
- PubMed/NCBI
- Google Scholar
51. Maqbool A, Saitoh H, Franceschetti M, Stevenson CEM, Uemura A, Kanzaki H, et al. Structural basis of pathogen recognition by an integrated HMA domain in a plant NLR immune receptor. Elife. 2015;4.
- View Article
- Google Scholar
52. Guo L, Cesari S, de Guillen K, Chalvon V, Mammri L, Ma M, et al. Specific recognition of two MAX effectors by integrated HMA domains in plant immune receptors involves distinct binding surfaces. Proceedings of the National Academy of Sciences. 2018;115(45):11637–42. pmid:30355769
- View Article
- PubMed/NCBI
- Google Scholar
53. Bentham AR, Petit-Houdenot Y, Win J, Chuma I, Terauchi R, Banfield MJ, et al. A single amino acid polymorphism in a conserved effector of the multihost blast fungus pathogen expands host-target binding spectrum. PLoS Pathogens. 2021;17(11):e1009957. pmid:34758051
- View Article
- PubMed/NCBI
- Google Scholar
54. Clay K, Kover PX. The Red Queen hypothesis and plant/pathogen interactions. Annual review of Phytopathology. 1996;34(1):29–50. pmid:15012533
- View Article
- PubMed/NCBI
- Google Scholar
55. Van Valen L. A new evolutionary law. 1973.
- View Article
- Google Scholar
56. Yang Z, Bielawski JP. Statistical methods for detecting molecular adaptation. Trends in ecology & evolution. 2000;15(12):496–503. pmid:11114436
- View Article
- PubMed/NCBI
- Google Scholar
57. Kourelis J, Marchal C, Posbeyikian A, Harant A, Kamoun S. NLR immune receptor–nanobody fusions confer plant disease resistance. Science. 2023;379(6635):934–9. pmid:36862785
- View Article
- PubMed/NCBI
- Google Scholar
58. Chiapello H, Mallet L, Guerin C, Aguileta G, Amselem J, Kroj T, et al. Deciphering Genome Content and Evolutionary Relationships of Isolates from the Fungus Magnaporthe oryzae Attacking Different Host Plants. Genome Biol Evol. 2015;7(10):2896–912. Epub 2015/10/11. pmid:26454013; PubMed Central PMCID: PMC4684704.
- View Article
- PubMed/NCBI
- Google Scholar
59. Soyer JL, El Ghalid M, Glaser N, Ollivier B, Linglin J, Grandaubert J, et al. Epigenetic control of effector gene expression in the plant pathogenic fungus Leptosphaeria maculans. PLoS genetics. 2014;10(3):e1004227. pmid:24603691
- View Article
- PubMed/NCBI
- Google Scholar
60. Tang B, Yan X, Ryder LS, Cruz-Mireles N, Soanes DM, Molinari C, et al. Rgs1 is a regulator of effector gene expression during plant infection by the rice blast fungus Magnaporthe oryzae. bioRxiv. 2022:2022–09.
- View Article
- Google Scholar
61. Fall LA, Salazar MM, Drnevich J, Holmes JR, Tseng M-C, Kolb FL, et al. Field pathogenomics of Fusarium head blight reveals pathogen transcriptome differences due to host resistance. Mycologia. 2019;111(4):563–73. pmid:31112486
- View Article
- PubMed/NCBI
- Google Scholar
62. Sonah H, Zhang X, Deshmukh RK, Borhan MH, Fernando WGD, Belanger RR. Comparative transcriptomic analysis of virulence factors in Leptosphaeria maculans during compatible and incompatible interactions with canola. Frontiers in plant science. 2016;7:1784. pmid:27990146
- View Article
- PubMed/NCBI
- Google Scholar
63. Arora S, Steed A, Goddard R, Gaurav K, O’Hara T, Schoen A, et al. A wheat kinase and immune receptor form host-specificity barriers against the blast fungus. Nature Plants. 2023:1–8.
- View Article
- Google Scholar
64. Farman ML, Eto Y, Nakao T, Tosa Y, Nakayashiki H, Mayama S, et al. Analysis of the structure of the AVR1-Co39 avirulence locus in virulent rice-infecting isolates of Magnaporthe grisea. Molecular Plant-Microbe Interactions. 2002;15:6–16. 90.
- View Article
- Google Scholar
65. Kang S, Sweigard Ja Fau—Valent B, Valent B. The PWL host specificity gene family in the blast fungus Magnaporthe grisea. Molecular Plant Microbe Interactions. 1995;(0894–0282 (Print)). pmid:8664503
- View Article
- PubMed/NCBI
- Google Scholar
66. Brabham HJ, Gómez De La Cruz D, Were V, Shimizu M, Saitoh H, Hernández-Pinzón I, et al. Barley MLA3 recognizes the host-specificity determinant PWL2 from rice blast (M. oryzae). bioRxiv. 2022:2022–10.
- View Article
- Google Scholar
67. Thierry M, Charriat F, Milazzo J, Adreit H, Ravel S, Cros-Arteil S, et al. Maintenance of divergent lineages of the Rice Blast Fungus Pyricularia oryzae through niche separation, loss of sex and post-mating genetic incompatibilities. PLoS pathogens. 2022;18(7):e1010687. pmid:35877779
- View Article
- PubMed/NCBI
- Google Scholar
68. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet journal. 2011;17(1):10–2.
- View Article
- Google Scholar
69. Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJM, Birol I. ABySS: a parallel assembler for short read sequence data. Genome research. 2009;19(6):1117–23. pmid:19251739
- View Article
- PubMed/NCBI
- Google Scholar
70. Hoff KJ, Lange S, Lomsadze A, Borodovsky M, Stanke M. BRAKER1: unsupervised RNA-Seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics. 2015;32(5):767–9. pmid:26559507
- View Article
- PubMed/NCBI
- Google Scholar
71. Abascal F, Zardoya R, Telford MJ. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic acids research. 2010;38(suppl_2):W7–W13. pmid:20435676
- View Article
- PubMed/NCBI
- Google Scholar
72. Katoh K, Toh H. Recent developments in the MAFFT multiple sequence alignment program. Briefings in Bioinformatics. 2008;9(4):286–98. ISI:000256756400004. pmid:18372315
- View Article
- PubMed/NCBI
- Google Scholar
73. Castresana J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Molecular biology and evolution. 2000;17(4):540–52. pmid:10742046
- View Article
- PubMed/NCBI
- Google Scholar
74. Petersen TN, Brunak S, Von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nature methods. 2011;8(10):785–6. pmid:21959131
- View Article
- PubMed/NCBI
- Google Scholar
75. Emanuelsson O, Nielsen H, Brunak S, Von Heijne G. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. Journal of molecular biology. 2000;300(4):1005–16. pmid:10891285
- View Article
- PubMed/NCBI
- Google Scholar
76. Krogh A, Larsson B, Von Heijne G, Sonnhammer ELL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. Journal of molecular biology. 2001;305(3):567–80. pmid:11152613
- View Article
- PubMed/NCBI
- Google Scholar
77. Robinson JT, Thorvaldsdóttir H, Turner D, Mesirov JP. igv. js: an embeddable JavaScript implementation of the Integrative Genomics Viewer (IGV). Bioinformatics. 2023;39(1):btac830. pmid:36562559
- View Article
- PubMed/NCBI
- Google Scholar
78. Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nature biotechnology. 2011;29(1):24–6. pmid:21221095
- View Article
- PubMed/NCBI
- Google Scholar
79. Webb B, Sali A. Protein Structure Modeling with MODELLER. Methods Mol Biol. 2021;2199(1940–6029 (Electronic)):239–55. pmid:33125654
- View Article
- PubMed/NCBI
- Google Scholar
80. Zhou H, Zhou Y. Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction. Protein science. 2002;11(11):2714–26. pmid:12381853
- View Article
- PubMed/NCBI
- Google Scholar
81. Zhou H, Skolnick J. GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. Biophysical journal. 2011;101(8):2043–52. pmid:22004759
- View Article
- PubMed/NCBI
- Google Scholar
82. Benkert P, Tosatto SCE, Schomburg D. QMEAN: A comprehensive scoring function for model quality assessment. Proteins: Structure, Function, and Bioinformatics. 2008;71(1):261–77.
- View Article
- Google Scholar
83. Schrödinger L, DeLano W. PyMOL. 2020.
- View Article
- Google Scholar
84. Tajima F. Evolutionary relationship of DNA sequences in finite populations. Genetics. 1983;105(2):437–60. pmid:6628982
- View Article
- PubMed/NCBI
- Google Scholar
85. Siol M, Coudoux T, Ravel S, De Mita S. EggLib 3: A python package for population genetics and genomics. Molecular Ecology Resources. 2022;22(8):3176–87. pmid:35753060
- View Article
- PubMed/NCBI
- Google Scholar
86. Slater GSC, Birney E. Automated generation of heuristics for biological sequence comparison. BMC bioinformatics. 2005;6(1):1–11. pmid:15713233
- View Article
- PubMed/NCBI
- Google Scholar
87. Yang Z. PAML: a program package for phylogenetic analysis by maximum likelihood. Computer applications in the biosciences. 1997;13(5):555–6. pmid:9367129
- View Article
- PubMed/NCBI
- Google Scholar
88. Villalba F, Collemare J, Landraud P, Lambou K, Brozek V, Cirer B, et al. Improved gene targeting in Magnaporthe grisea by inactivation of MgKU80 required for non-homologous end joining. Fungal Genetics and Biology. 2008;45(1):68–75. pmid:17716934
- View Article
- PubMed/NCBI
- Google Scholar
89. Ribot C, Césari S, Abidi I, Chalvon V, Bournaud C, Vallet J, et al. The M agnaporthe oryzae effector AVR 1–CO 39 is translocated into rice cells independently of a fungal-derived machinery. The Plant Journal. 2013;74(1):1–12. pmid:23279638
- View Article
- PubMed/NCBI
- Google Scholar
90. Pélissier R, Buendia L, Brousse A, Temple C, Ballini E, Fort F, et al. Plant neighbour-modulated susceptibility to pathogens in intraspecific mixtures. Journal of experimental botany. 2021;72(18):6570–80. pmid:34125197
- View Article
- PubMed/NCBI
- Google Scholar
91. Frishman D, Argos P. Knowledge-based protein secondary structure assignment. Proteins: Structure, Function, and Bioinformatics. 1995;23(4):566–79. pmid:8749853
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Schulze-Lefert P, Panstruga R. A molecular evolutionary concept connecting nonhost resistance, pathogen host range, and pathogen speciation. Trends in Plant Science. 2011;16(3):117–25. pmid:21317020
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Sánchez-Vallet A, Fouché S, Fudal I, Hartmann FE, Soyer JL, Tellier A, et al. The genome biology of effector gene evolution in filamentous plant pathogens. Annual review of phytopathology. 2018;56:21–40. pmid:29768136
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Haldane JBS. Disease and evolution. Ricerca Scient 1949;19:68–76.
View Article
Google Scholar

[10] View Article

[11] Google Scholar

[ref4] 4. Flor HH. Inheritance of pathogenicity in Melampsora lini. Phytopathology. 1942;32:653–69.
View Article
Google Scholar

[13] View Article

[14] Google Scholar

[ref5] 5. Barrett JA. Frequency-dependent selection in plant-fungal interactions. Philosophical Transactions of the Royal Society of London B, Biological Sciences. 1988;319(1196):473–83.
View Article
Google Scholar

[16] View Article

[17] Google Scholar

[ref6] 6. Bergelson J, Kreitman M, Stahl EA, Tian D. Evolutionary dynamics of plant R-genes. Science. 2001;292(5525):2281–5. pmid:11423651
View Article
PubMed/NCBI
Google Scholar

[19] View Article

[20] PubMed/NCBI

[21] Google Scholar

[ref7] 7. Brown JKM. Chance and selection in the evolution of barley mildew. Trends in Microbiology. 1994;2(12):470–5. 29. pmid:7889322
View Article
PubMed/NCBI
Google Scholar

[23] View Article

[24] PubMed/NCBI

[25] Google Scholar

[ref8] 8. Brown JKM. Durable resistance of crops to disease: a Darwinian perspective. Annual review of phytopathology. 2015;53:513–39. pmid:26077539
View Article
PubMed/NCBI
Google Scholar

[27] View Article

[28] PubMed/NCBI

[29] Google Scholar

[ref9] 9. Stahl EA, Dwyer G, Mauricio R, Kreitman M, Bergelson J. Dynamics of disease resistance polymorphism at the Rpm1 locus of Arabidopsis. Nature. 1999;400(6745):667–71. pmid:10458161
View Article
PubMed/NCBI
Google Scholar

[31] View Article

[32] PubMed/NCBI

[33] Google Scholar

[ref10] 10. Gladieux P, van Oosterhout C, Fairhead S, Jouet A, Ortiz D, Ravel S, et al. Extensive immune receptor repertoire diversity in disease-resistant rice landraces. bioRxiv. 2022:2022–12.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref11] 11. Bakker EG, Toomajian C, Kreitman M, Bergelson J. A genome-wide survey of R gene polymorphisms in Arabidopsis. The Plant cell. 2006;18(8):1803–18. pmid:16798885
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref12] 12. Bakker EG, Traw MB, Toomajian C, Kreitman M, Bergelson J. Low levels of polymorphism in genes that control the activation of defense response in Arabidopsis thaliana. Genetics. 2008;178(4):2031–43. pmid:18245336
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref13] 13. Ebert D, Fields PD. Host–parasite co-evolution and its genomic signature. Nature Reviews Genetics. 2020;21(12):754–68. pmid:32860017
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref14] 14. Ebbole DJ, Chen M, Zhong Z, Farmer N, Zheng W, Han Y, et al. Evolution and Regulation of a Large Effector Family of Pyricularia oryzae. Molecular Plant-Microbe Interactions. 2021;34(3):255–69. pmid:33211639
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref15] 15. Seong K, Krasileva KV. Computational structural genomics unravels common folds and novel families in the secretome of fungal phytopathogen Magnaporthe oryzae. Molecular Plant-Microbe Interactions. 2021;34(11):1267–80. pmid:34415195
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref16] 16. Seong K, Krasileva KV. Prediction of effector protein structures from fungal phytopathogens enables evolutionary analyses. Nature Microbiology. 2023;8(1):174–87. pmid:36604508
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref17] 17. de Guillen K, Ortiz-Vallejo D, Gracy J, Fournier E, Kroj T, Padilla A. Structure analysis uncovers a highly diverse but structurally conserved effector family in phytopathogenic fungi. PLoS Pathog. 2015;11(10):e1005228. pmid:26506000
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref18] 18. Savary S, Willocquet L, Pethybridge SJ, Esker P, McRoberts N, Nelson A. The global burden of pathogens and pests on major food crops. Nature ecology & evolution. 2019;3(3):430. pmid:30718852
View Article
PubMed/NCBI
Google Scholar

[66] View Article

[67] PubMed/NCBI

[68] Google Scholar

[ref19] 19. Fernandez J, Orth K. Rise of a cereal killer: the biology of Magnaporthe oryzae biotrophic growth. Trends in microbiology. 2018;26(7):582–97. pmid:29395728
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref20] 20. Couch BC, Fudal I, Lebrun M-H, Tharreau D, Valent B, van Kim P, et al. Origins of Host-Specific Populations of the Blast Pathogen Magnaporthe oryzae in Crop Domestication With Subsequent Expansion of Pandemic Clones on Rice and Weeds of Rice. Genetics. 2005;170(2):613–30. 61.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref21] 21. Pordel A, Ravel S, Charriat F, Gladieux P, Cros-Arteil S, Milazzo J, et al. Tracing the origin and evolutionary history of Pyricularia oryzae infecting maize and barnyard grass. Phytopathology. 2021;111(1):128–36. pmid:33100147
View Article
PubMed/NCBI
Google Scholar

[77] View Article

[78] PubMed/NCBI

[79] Google Scholar

[ref22] 22. Kato H, Yamamoto M, Yamaguchi-Ozaki T, Kadouchi H, Iwamoto Y, Nakayashiki H, et al. Pathogenicity, mating ability and DNA restriction fragment length polymorphisms of Pyricularia populations isolated from Gramineae, Bambusideae and Zingiberaceae plants. Journal of General Plant Pathology. 2000;66:30–47.
View Article
Google Scholar

[81] View Article

[82] Google Scholar

[ref23] 23. Urashima AS, Igarashi S, Kato H. Host range, mating type, and fertility of Pyricularia grisea from wheat in Brazil. Plant Disease. 1993;77(12):1211–6.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref24] 24. Igarashi S. Pyricularia em trigo. 1. Ocorrencia de Pyricularia sp noestado do Parana. Fitopatol Bras. 1986;11:351–2.
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref25] 25. Milazzo J, Pordel A, Ravel S, Tharreau D. First scientific report of Pyricularia oryzae causing gray leaf spot disease on perennial ryegrass (Lolium perenne) in France. Plant Disease. 2019;103(5):1024–.
View Article
Google Scholar

[90] View Article

[91] Google Scholar

[ref26] 26. Islam MT, Croll D, Gladieux P, Soanes DM, Persoons A, Bhattacharjee P, et al. Emergence of wheat blast in Bangladesh was caused by a South American lineage of Magnaporthe oryzae. BMC biology. 2016;14(1):84. pmid:27716181
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref27] 27. Gladieux P, Condon B, Ravel S, Soanes D, Maciel JLN, Nhani A, et al. Gene Flow between Divergent Cereal- and Grass-Specific Lineages of the Rice Blast Fungus Magnaporthe oryzae. mBio. 2018;9(1). pmid:29487238
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref28] 28. Sweigard JA, Carroll AM, Kang S, Farrall L, Chumley FG, Valent B. Identification, Cloning, and Characterization of PWL2, a Gene for Host Species Specificity in the Rice Blast Fungus. The Plant cell. 1995;7(8):1221–33. 260.
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref29] 29. Inoue Y, Vy TTP, Yoshida K, Asano H, Mitsuoka C, Asuke S, et al. Evolution of the wheat blast fungus through functional losses in a host specificity determinant. Science. 2017;357(6346):80–3. pmid:28684523
View Article
PubMed/NCBI
Google Scholar

[104] View Article

[105] PubMed/NCBI

[106] Google Scholar

[ref30] 30. Asuke S, Tanaka M, Hyon G-S, Inoue Y, Vy TTP, Niwamoto D, et al. Evolution of an Eleusine-specific subgroup of Pyricularia oryzae through a gain of an avirulence gene. Molecular Plant-Microbe Interactions. 2020;33(2):153–65. pmid:31804154
View Article
PubMed/NCBI
Google Scholar

[108] View Article

[109] PubMed/NCBI

[110] Google Scholar

[ref31] 31. Zheng Y, Zheng W, Lin F, Zhang Y, Yi Y, Wang B, et al. AVR1-CO39 is a predominant locus governing the broad avirulence of Magnaporthe oryzae 2539 on cultivated rice (Oryza sativa L.). Molecular plant-microbe interactions. 2011;24(1):13–7. pmid:20879839
View Article
PubMed/NCBI
Google Scholar

[112] View Article

[113] PubMed/NCBI

[114] Google Scholar

[ref32] 32. Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210–2. pmid:26059717
View Article
PubMed/NCBI
Google Scholar

[116] View Article

[117] PubMed/NCBI

[118] Google Scholar

[ref33] 33. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic acids research. 1997;25(17):3389–402. pmid:9254694
View Article
PubMed/NCBI
Google Scholar

[120] View Article

[121] PubMed/NCBI

[122] Google Scholar

[ref34] 34. Finn RD, Clements J, Eddy SR. HMMER web server: interactive sequence similarity searching. Nucleic acids research. 2011;39(suppl_2):29–37. pmid:21593126
View Article
PubMed/NCBI
Google Scholar

[124] View Article

[125] PubMed/NCBI

[126] Google Scholar

[ref35] 35. Zhang Y, Skolnick J. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic acids research. 2005;33(7):2302–9. pmid:15849316
View Article
PubMed/NCBI
Google Scholar

[128] View Article

[129] PubMed/NCBI

[130] Google Scholar

[ref36] 36. Emms DM, Kelly S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome biology. 2015;16(1):157. pmid:26243257
View Article
PubMed/NCBI
Google Scholar

[132] View Article

[133] PubMed/NCBI

[134] Google Scholar

[ref37] 37. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3. pmid:24451623
View Article
PubMed/NCBI
Google Scholar

[136] View Article

[137] PubMed/NCBI

[138] Google Scholar

[ref38] 38. Yan X, Tang B, Ryder LS, MacLean D, Were VM, Eseola AB, et al. The transcriptional landscape of plant infection by the rice blast fungus Magnaporthe oryzae reveals distinct families of temporally co-regulated and structurally conserved effectors. The Plant cell. 2023;35(5):1360–85. pmid:36808541
View Article
PubMed/NCBI
Google Scholar

[140] View Article

[141] PubMed/NCBI

[142] Google Scholar

[ref39] 39. Fay JC, Wu C-I. Sequence divergence, functional constraint, and selection in protein evolution. Annual review of genomics and human genetics. 2003;4(1):213–35. pmid:14527302
View Article
PubMed/NCBI
Google Scholar

[144] View Article

[145] PubMed/NCBI

[146] Google Scholar

[ref40] 40. Frichot E, Mathieu F, Trouillon T, Bouchard G, François O. Fast and efficient estimation of individual ancestry coefficients. Genetics. 2014;196(4):973–83. pmid:24496008
View Article
PubMed/NCBI
Google Scholar

[148] View Article

[149] PubMed/NCBI

[150] Google Scholar

[ref41] 41. Huson DH, Bryant D. Application of Phylogenetic Networks in Evolutionary Studies. Molecular Biology and Evolution. 2006;23(2):254–67. pmid:16221896
View Article
PubMed/NCBI
Google Scholar

[152] View Article

[153] PubMed/NCBI

[154] Google Scholar

[ref42] 42. Tosa Y, Osue J, Eto Y, Oh H-S, Nakayashiki H, Mayama S, et al. Evolution of an avirulence gene, AVR1-CO39, concomitant with the evolution and differentiation of Magnaporthe oryzae. Molecular Plant-Microbe Interactions. 2005;18(11):1148–60. pmid:16353550
View Article
PubMed/NCBI
Google Scholar

[156] View Article

[157] PubMed/NCBI

[158] Google Scholar

[ref43] 43. Cesari S, Thilliez G, Ribot C, Chalvon V, Michel C, Jauneau A, et al. The rice resistance protein pair RGA4/RGA5 recognizes the Magnaporthe oryzae effectors AVR-Pia and AVR1-CO39 by direct binding. The Plant cell. 2013;25(4):1463–81. Epub 2013/04/04. pmid:23548743; PubMed Central PMCID: PMC3663280.
View Article
PubMed/NCBI
Google Scholar

[160] View Article

[161] PubMed/NCBI

[162] Google Scholar

[ref44] 44. Okuyama Y, Kanzaki H, Abe A, Yoshida K, Tamiru M, Saitoh H, et al. A multifaceted genomics approach allows the isolation of the rice Pia-blast resistance gene consisting of two adjacent NBS-LRR protein genes. The Plant Journal. 2011;66(3):467–79. pmid:21251109
View Article
PubMed/NCBI
Google Scholar

[164] View Article

[165] PubMed/NCBI

[166] Google Scholar

[ref45] 45. Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38:1358–70. 283. pmid:28563791
View Article
PubMed/NCBI
Google Scholar

[168] View Article

[169] PubMed/NCBI

[170] Google Scholar

[ref46] 46. Hahn MW. Molecular population genetics: Oxford University Press; 2018.

[ref47] 47. Hirata K, Kusaba M, Chuma I, Osue J, Nakayashiki H, Mayama S, et al. Speciation in Pyricularia inferred from multilocus phylogenetic analysis. Mycological Research. 2007;111(7):799–808. pmid:17656080
View Article
PubMed/NCBI
Google Scholar

[173] View Article

[174] PubMed/NCBI

[175] Google Scholar

[ref48] 48. Gómez Luciano LB, Tsai IJ, Chuma I, Tosa Y, Chen Y-H, Li J-Y, et al. Blast fungal genomes show frequent chromosomal changes, gene gains and losses, and effector gene turnover. Molecular Biology and Evolution. 2019;36(6):1148–61. pmid:30835262
View Article
PubMed/NCBI
Google Scholar

[177] View Article

[178] PubMed/NCBI

[179] Google Scholar

[ref49] 49. Yang Z. Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Molecular biology and evolution. 1998;15(5):568–73. pmid:9580986
View Article
PubMed/NCBI
Google Scholar

[181] View Article

[182] PubMed/NCBI

[183] Google Scholar

[ref50] 50. Ortiz D, De Guillen K, Cesari S, Chalvon V, Gracy J, Padilla A, et al. Recognition of the Magnaporthe oryzae effector AVR-Pia by the decoy domain of the rice NLR immune receptor RGA5. The Plant cell. 2017;29(1):156–68. pmid:28087830
View Article
PubMed/NCBI
Google Scholar

[185] View Article

[186] PubMed/NCBI

[187] Google Scholar

[ref51] 51. Maqbool A, Saitoh H, Franceschetti M, Stevenson CEM, Uemura A, Kanzaki H, et al. Structural basis of pathogen recognition by an integrated HMA domain in a plant NLR immune receptor. Elife. 2015;4.
View Article
Google Scholar

[189] View Article

[190] Google Scholar

[ref52] 52. Guo L, Cesari S, de Guillen K, Chalvon V, Mammri L, Ma M, et al. Specific recognition of two MAX effectors by integrated HMA domains in plant immune receptors involves distinct binding surfaces. Proceedings of the National Academy of Sciences. 2018;115(45):11637–42. pmid:30355769
View Article
PubMed/NCBI
Google Scholar

[192] View Article

[193] PubMed/NCBI

[194] Google Scholar

[ref53] 53. Bentham AR, Petit-Houdenot Y, Win J, Chuma I, Terauchi R, Banfield MJ, et al. A single amino acid polymorphism in a conserved effector of the multihost blast fungus pathogen expands host-target binding spectrum. PLoS Pathogens. 2021;17(11):e1009957. pmid:34758051
View Article
PubMed/NCBI
Google Scholar

[196] View Article

[197] PubMed/NCBI

[198] Google Scholar

[ref54] 54. Clay K, Kover PX. The Red Queen hypothesis and plant/pathogen interactions. Annual review of Phytopathology. 1996;34(1):29–50. pmid:15012533
View Article
PubMed/NCBI
Google Scholar

[200] View Article

[201] PubMed/NCBI

[202] Google Scholar

[ref55] 55. Van Valen L. A new evolutionary law. 1973.
View Article
Google Scholar

[204] View Article

[205] Google Scholar

[ref56] 56. Yang Z, Bielawski JP. Statistical methods for detecting molecular adaptation. Trends in ecology & evolution. 2000;15(12):496–503. pmid:11114436
View Article
PubMed/NCBI
Google Scholar

[207] View Article

[208] PubMed/NCBI

[209] Google Scholar

[ref57] 57. Kourelis J, Marchal C, Posbeyikian A, Harant A, Kamoun S. NLR immune receptor–nanobody fusions confer plant disease resistance. Science. 2023;379(6635):934–9. pmid:36862785
View Article
PubMed/NCBI
Google Scholar

[211] View Article

[212] PubMed/NCBI

[213] Google Scholar

[ref58] 58. Chiapello H, Mallet L, Guerin C, Aguileta G, Amselem J, Kroj T, et al. Deciphering Genome Content and Evolutionary Relationships of Isolates from the Fungus Magnaporthe oryzae Attacking Different Host Plants. Genome Biol Evol. 2015;7(10):2896–912. Epub 2015/10/11. pmid:26454013; PubMed Central PMCID: PMC4684704.
View Article
PubMed/NCBI
Google Scholar

[215] View Article

[216] PubMed/NCBI

[217] Google Scholar

[ref59] 59. Soyer JL, El Ghalid M, Glaser N, Ollivier B, Linglin J, Grandaubert J, et al. Epigenetic control of effector gene expression in the plant pathogenic fungus Leptosphaeria maculans. PLoS genetics. 2014;10(3):e1004227. pmid:24603691
View Article
PubMed/NCBI
Google Scholar

[219] View Article

[220] PubMed/NCBI

[221] Google Scholar

[ref60] 60. Tang B, Yan X, Ryder LS, Cruz-Mireles N, Soanes DM, Molinari C, et al. Rgs1 is a regulator of effector gene expression during plant infection by the rice blast fungus Magnaporthe oryzae. bioRxiv. 2022:2022–09.
View Article
Google Scholar

[223] View Article

[224] Google Scholar

[ref61] 61. Fall LA, Salazar MM, Drnevich J, Holmes JR, Tseng M-C, Kolb FL, et al. Field pathogenomics of Fusarium head blight reveals pathogen transcriptome differences due to host resistance. Mycologia. 2019;111(4):563–73. pmid:31112486
View Article
PubMed/NCBI
Google Scholar

[226] View Article

[227] PubMed/NCBI

[228] Google Scholar

[ref62] 62. Sonah H, Zhang X, Deshmukh RK, Borhan MH, Fernando WGD, Belanger RR. Comparative transcriptomic analysis of virulence factors in Leptosphaeria maculans during compatible and incompatible interactions with canola. Frontiers in plant science. 2016;7:1784. pmid:27990146
View Article
PubMed/NCBI
Google Scholar

[230] View Article

[231] PubMed/NCBI

[232] Google Scholar

[ref63] 63. Arora S, Steed A, Goddard R, Gaurav K, O’Hara T, Schoen A, et al. A wheat kinase and immune receptor form host-specificity barriers against the blast fungus. Nature Plants. 2023:1–8.
View Article
Google Scholar

[234] View Article

[235] Google Scholar

[ref64] 64. Farman ML, Eto Y, Nakao T, Tosa Y, Nakayashiki H, Mayama S, et al. Analysis of the structure of the AVR1-Co39 avirulence locus in virulent rice-infecting isolates of Magnaporthe grisea. Molecular Plant-Microbe Interactions. 2002;15:6–16. 90.
View Article
Google Scholar

[237] View Article

[238] Google Scholar

[ref65] 65. Kang S, Sweigard Ja Fau—Valent B, Valent B. The PWL host specificity gene family in the blast fungus Magnaporthe grisea. Molecular Plant Microbe Interactions. 1995;(0894–0282 (Print)). pmid:8664503
View Article
PubMed/NCBI
Google Scholar

[240] View Article

[241] PubMed/NCBI

[242] Google Scholar

[ref66] 66. Brabham HJ, Gómez De La Cruz D, Were V, Shimizu M, Saitoh H, Hernández-Pinzón I, et al. Barley MLA3 recognizes the host-specificity determinant PWL2 from rice blast (M. oryzae). bioRxiv. 2022:2022–10.
View Article
Google Scholar

[244] View Article

[245] Google Scholar

[ref67] 67. Thierry M, Charriat F, Milazzo J, Adreit H, Ravel S, Cros-Arteil S, et al. Maintenance of divergent lineages of the Rice Blast Fungus Pyricularia oryzae through niche separation, loss of sex and post-mating genetic incompatibilities. PLoS pathogens. 2022;18(7):e1010687. pmid:35877779
View Article
PubMed/NCBI
Google Scholar

[247] View Article

[248] PubMed/NCBI

[249] Google Scholar

[ref68] 68. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet journal. 2011;17(1):10–2.
View Article
Google Scholar

[251] View Article

[252] Google Scholar

[ref69] 69. Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJM, Birol I. ABySS: a parallel assembler for short read sequence data. Genome research. 2009;19(6):1117–23. pmid:19251739
View Article
PubMed/NCBI
Google Scholar

[254] View Article

[255] PubMed/NCBI

[256] Google Scholar

[ref70] 70. Hoff KJ, Lange S, Lomsadze A, Borodovsky M, Stanke M. BRAKER1: unsupervised RNA-Seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics. 2015;32(5):767–9. pmid:26559507
View Article
PubMed/NCBI
Google Scholar

[258] View Article

[259] PubMed/NCBI

[260] Google Scholar

[ref71] 71. Abascal F, Zardoya R, Telford MJ. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic acids research. 2010;38(suppl_2):W7–W13. pmid:20435676
View Article
PubMed/NCBI
Google Scholar

[262] View Article

[263] PubMed/NCBI

[264] Google Scholar

[ref72] 72. Katoh K, Toh H. Recent developments in the MAFFT multiple sequence alignment program. Briefings in Bioinformatics. 2008;9(4):286–98. ISI:000256756400004. pmid:18372315
View Article
PubMed/NCBI
Google Scholar

[266] View Article

[267] PubMed/NCBI

[268] Google Scholar

[ref73] 73. Castresana J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Molecular biology and evolution. 2000;17(4):540–52. pmid:10742046
View Article
PubMed/NCBI
Google Scholar

[270] View Article

[271] PubMed/NCBI

[272] Google Scholar

[ref74] 74. Petersen TN, Brunak S, Von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nature methods. 2011;8(10):785–6. pmid:21959131
View Article
PubMed/NCBI
Google Scholar

[274] View Article

[275] PubMed/NCBI

[276] Google Scholar

[ref75] 75. Emanuelsson O, Nielsen H, Brunak S, Von Heijne G. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. Journal of molecular biology. 2000;300(4):1005–16. pmid:10891285
View Article
PubMed/NCBI
Google Scholar

[278] View Article

[279] PubMed/NCBI

[280] Google Scholar

[ref76] 76. Krogh A, Larsson B, Von Heijne G, Sonnhammer ELL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. Journal of molecular biology. 2001;305(3):567–80. pmid:11152613
View Article
PubMed/NCBI
Google Scholar

[282] View Article

[283] PubMed/NCBI

[284] Google Scholar

[ref77] 77. Robinson JT, Thorvaldsdóttir H, Turner D, Mesirov JP. igv. js: an embeddable JavaScript implementation of the Integrative Genomics Viewer (IGV). Bioinformatics. 2023;39(1):btac830. pmid:36562559
View Article
PubMed/NCBI
Google Scholar

[286] View Article

[287] PubMed/NCBI

[288] Google Scholar

[ref78] 78. Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nature biotechnology. 2011;29(1):24–6. pmid:21221095
View Article
PubMed/NCBI
Google Scholar

[290] View Article

[291] PubMed/NCBI

[292] Google Scholar

[ref79] 79. Webb B, Sali A. Protein Structure Modeling with MODELLER. Methods Mol Biol. 2021;2199(1940–6029 (Electronic)):239–55. pmid:33125654
View Article
PubMed/NCBI
Google Scholar

[294] View Article

[295] PubMed/NCBI

[296] Google Scholar

[ref80] 80. Zhou H, Zhou Y. Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction. Protein science. 2002;11(11):2714–26. pmid:12381853
View Article
PubMed/NCBI
Google Scholar

[298] View Article

[299] PubMed/NCBI

[300] Google Scholar

[ref81] 81. Zhou H, Skolnick J. GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. Biophysical journal. 2011;101(8):2043–52. pmid:22004759
View Article
PubMed/NCBI
Google Scholar

[302] View Article

[303] PubMed/NCBI

[304] Google Scholar

[ref82] 82. Benkert P, Tosatto SCE, Schomburg D. QMEAN: A comprehensive scoring function for model quality assessment. Proteins: Structure, Function, and Bioinformatics. 2008;71(1):261–77.
View Article
Google Scholar

[306] View Article

[307] Google Scholar

[ref83] 83. Schrödinger L, DeLano W. PyMOL. 2020.
View Article
Google Scholar

[309] View Article

[310] Google Scholar

[ref84] 84. Tajima F. Evolutionary relationship of DNA sequences in finite populations. Genetics. 1983;105(2):437–60. pmid:6628982
View Article
PubMed/NCBI
Google Scholar

[312] View Article

[313] PubMed/NCBI

[314] Google Scholar

[ref85] 85. Siol M, Coudoux T, Ravel S, De Mita S. EggLib 3: A python package for population genetics and genomics. Molecular Ecology Resources. 2022;22(8):3176–87. pmid:35753060
View Article
PubMed/NCBI
Google Scholar

[316] View Article

[317] PubMed/NCBI

[318] Google Scholar

[ref86] 86. Slater GSC, Birney E. Automated generation of heuristics for biological sequence comparison. BMC bioinformatics. 2005;6(1):1–11. pmid:15713233
View Article
PubMed/NCBI
Google Scholar

[320] View Article

[321] PubMed/NCBI

[322] Google Scholar

[ref87] 87. Yang Z. PAML: a program package for phylogenetic analysis by maximum likelihood. Computer applications in the biosciences. 1997;13(5):555–6. pmid:9367129
View Article
PubMed/NCBI
Google Scholar

[324] View Article

[325] PubMed/NCBI

[326] Google Scholar

[ref88] 88. Villalba F, Collemare J, Landraud P, Lambou K, Brozek V, Cirer B, et al. Improved gene targeting in Magnaporthe grisea by inactivation of MgKU80 required for non-homologous end joining. Fungal Genetics and Biology. 2008;45(1):68–75. pmid:17716934
View Article
PubMed/NCBI
Google Scholar

[328] View Article

[329] PubMed/NCBI

[330] Google Scholar

[ref89] 89. Ribot C, Césari S, Abidi I, Chalvon V, Bournaud C, Vallet J, et al. The M agnaporthe oryzae effector AVR 1–CO 39 is translocated into rice cells independently of a fungal-derived machinery. The Plant Journal. 2013;74(1):1–12. pmid:23279638
View Article
PubMed/NCBI
Google Scholar

[332] View Article

[333] PubMed/NCBI

[334] Google Scholar

[ref90] 90. Pélissier R, Buendia L, Brousse A, Temple C, Ballini E, Fort F, et al. Plant neighbour-modulated susceptibility to pathogens in intraspecific mixtures. Journal of experimental botany. 2021;72(18):6570–80. pmid:34125197
View Article
PubMed/NCBI
Google Scholar

[336] View Article

[337] PubMed/NCBI

[338] Google Scholar

[ref91] 91. Frishman D, Argos P. Knowledge-based protein secondary structure assignment. Proteins: Structure, Function, and Bioinformatics. 1995;23(4):566–79. pmid:8749853
View Article
PubMed/NCBI
Google Scholar

[340] View Article

[341] PubMed/NCBI

[342] Google Scholar

Figures

Abstract

Author summary

Introduction

Results

Genome assembly and prediction of MAX effector genes

MAX effectors are massively deployed during rice infection

The MAX effector repertoire is highly variable

MAX effector variability is structured by host plant

Loss of MAX effectors in specific lineages does not appear to be associated with host specificity

MAX effectors display signatures of balancing selection

MAX effectors display signatures of recurrent directional selection

Structural determinants of polymorphism and divergence at MAX effectors

Discussion

MAX effectors as model systems to investigate effector evolution

Adaptive evolution of MAX effectors

Expression kinetics of MAX effectors

Presence/Absence polymorphism of MAX effectors

Concluding remarks

Methods

Genome assemblies, gene prediction, and pan-genome analyses

Identification of effectors sensu lato, and MAX effectors

Analysis of population subdivision

Homology modeling of MAX effectors

Evolutionary analyses

Constructs for the transformation of fungal isolates

Plant and fungal growth conditions

Fungal transformation

Fungal growth and infection assays

RNA extraction and qRT-PCR analysis

Statistical analyses of phenotypic data

Supporting information

S1 Table. Genomic assemblies with metadata.

S2 Table. Nomenclature of MAX effectors predicted in this study and in previous reports.

S3 Table. Presence/absence of MAX effector orthologs.

S4 Table. The expression of MAX79, MAX83 and MAX89 in Guy11 does not trigger recognition in a panel of rice varieties.

S5 Table. Gene average of summary statistics of polymorphism, differentiation and divergence.

S6 Table. πN and dN/dS in different classes of secondary structure annotations for MAX effectors with πN/πS>1 and dN/dS>1, respectively.

S7 Table. Primers for cloning and expression analyses.

S8 Table. Vector constructs.

S9 Table. Sequences of the MAX effectors in the isolate US0071 that were used for the complementation of Guy11.

S1 Fig. Effect of assembly properties on the number of genes.

S2 Fig. Expression patterns of MAX effectors during rice infection.

S3 Fig. Differential expression levels of MAX effectors upon infection of two different rice cultivars.

S4 Fig. Nucleotide diversity (π), ratio of non-synonymous to synonymous nucleotide diversity (πN/πS), orthogroup frequency for MAX effectors, other secreted proteins, and other genes.

S5 Fig. Frequency of MAX effector orthogroups as a function of the frequency of the adjacent orthogroups in the genome.

S6 Fig. Analyses of population subdivision with sNMF.

S7 Fig. MAX79, MAX83 and MAX89 are expressed in the transgenic Guy11 isolates upon rice inoculation.

S8 Fig. FST versus πN/πS at MAX effectors.

S9 Fig. Amino acid changes segregating in P. oryzae at MAX effectors with an avirulence function and MoToxB (first row), and MAX effectors with πN/πS>2 (next rows); amino acid changes are shown in dark blue and known binding interfaces in light blue.

S10 Fig. Secondary structure annotations of MAX effectors aligned with TM-ALIGN.

S1 Data. Summary statistics per orthogroup.

S2 Data. Summary statistics per MAX effector ortholog, species wide, and per lineage.

S3 Data. Structural properties and polymorphism of amino acids in MAX effectors.

S1 Text. Fitting a generalized linear model to amino acid polymorphism data.

S2 Text. Homology modeling procedure.

References

S6 Table. π_N and d_N/d_S in different classes of secondary structure annotations for MAX effectors with π_N/π_S>1 and d_N/d_S>1, respectively.

S4 Fig. Nucleotide diversity (π), ratio of non-synonymous to synonymous nucleotide diversity (π_N/π_S), orthogroup frequency for MAX effectors, other secreted proteins, and other genes.

S8 Fig. F_ST versus π_N/π_S at MAX effectors.

S9 Fig. Amino acid changes segregating in P. oryzae at MAX effectors with an avirulence function and MoToxB (first row), and MAX effectors with π_N/π_S>2 (next rows); amino acid changes are shown in dark blue and known binding interfaces in light blue.