Figures
Abstract
A first line of defense against pathogen attack for both plants and animals involves the detection of microbe-associated molecular patterns (MAMPs), followed by the induction of a complex immune response. Plants, like animals, encode several receptors that recognize different MAMPs. While these receptors are thought to function largely redundantly, the physiological responses to different MAMPs can differ in detail. Responses to MAMP exposure evolve quantitatively in natural populations of Arabidopsis thaliana, perhaps in response to environment specific differences in microbial threat. Here, we sought to determine the extent to which the detection of two canonical MAMPs were evolving redundantly or distinctly within natural populations. Our results reveal negligible correlation in plant growth responses between the bacterial MAMPs EF-Tu and flagellin. Further investigation of the genetic bases of differences in seedling growth inhibition and validation of 11 candidate genes reveal substantial differences in the genetic loci that underlie variation in response to these two MAMPs. Our results indicate that natural variation in MAMP recognition is largely MAMP-specific, indicating an ability to differentially tailor responses to EF-Tu and flagellin in A. thaliana populations.
Author Summary
Specialized receptors encoded by plants detect different components of bacterial machinery, and initiate an immune response. These recognition events are thought to induce largely redundant defense signaling, the magnitude of which varies quantitatively among populations, perhaps in response to environment specific differences in microbial threat. Here, we sought to determine whether plants evolve distinct or shared responses to two canonical MAMPs within natural populations. We comprehensively tested the extent of functional redundancy in the response of 186 genotypes of Arabidopsis thaliana to variants of each of two classes of bacterial signals, flagellin and EF-Tu. Although plants respond similarly to recognition of different variants of the same MAMP, we found the response to one MAMP class to be largely uncorrelated with the response to the other class. We further investigated the genetic bases underlying growth changes to determine whether similar genes contribute to variation in the response to EF-Tu and flagellin bacterial signals. We find limited genetic similarity, revealing novel MAMP-specific signaling components. The differentiation of these responses reveals MAMP-specific fine tuning of the immune response.
Citation: Vetter M, Karasov TL, Bergelson J (2016) Differentiation between MAMP Triggered Defenses in Arabidopsis thaliana. PLoS Genet 12(6): e1006068. https://doi.org/10.1371/journal.pgen.1006068
Editor: Kirsten Bomblies, John Innes Centre, UNITED KINGDOM
Received: December 14, 2015; Accepted: April 28, 2016; Published: June 23, 2016
Copyright: © 2016 Vetter et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data are available through the data folder at https://bitbucket.org/mvetter/geneticbasissgi/
Funding: This work was supported by the National Science foundation with grants MCB 0603515 to JB and DDIGG 1311515 to TLK (http://www.nsf.gov/), and by the National Institutes of Health grant GM083068 to JB (http://www.nih.gov/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Pathogens pose a constant threat to their hosts. While lacking the adaptive immune system present in mammals, plants have evolved a two-tiered immune system of considerable specificity. The first tier of defense involves the recognition of microbe associated molecular patterns (MAMPs) that are common to many microbes. Plants recognize MAMPs, such as the elongation factor Tu (EF-Tu) and flagellin, or their epitopes elf18 and flg22, by specialized receptors that allow the plant to discriminate self versus non-self and induce signaling cascades that result in defense responses [1, 2].
The second tier of the plant immune system involves the recognition of specific microbial strains (in contrast to semi-universal microbial patterns) via the activity of resistance proteins. The specificity of the immune system in distinguishing specific microbes was previously thought to lie primarily in the activity of resistance proteins. Recent studies have challenged this paradigm in demonstrating both qualitative and quantitative variation in MAMP perception [3].
Plant hosts have the capacity to recognize multiple MAMPs, which helps ensure that at least one of the many MAMPs is recognized. For example, the order Brassicales evolved the capacity to recognize the MAMP elf18 in addition to ancient flg22, that is detected by all land plants [4–6]. The perception systems for elf18 and flg22 share common molecular components, such as the co-receptor BAK1 [7, 8], and elicit similar changes in gene expression [2, 9] suggesting that MAMP-triggered signal transduction and the associated signaling cascade converge quickly, regardless of the MAMP trigger.
If MAMP-perception systems were completely redundant one would expect identical physiological responses to distinct MAMP triggers. However, some physiological responses differ in detail. For example, elf18 and flg22 induce different Ca2+ signaling and macroscopic growth responses [10]. In particular, flg22 perception induces an equally strong growth reduction in roots and shoots, whereas elf18 perception acts more strongly in leaves than in roots [10, 11]. Additional evidence hints at a more complex interplay of molecular components after induction with different MAMPs [12–15].
The differentiated responses to separate MAMP classes suggest that the benefit of recognizing multiple classes may extend beyond redundancy. For example, it is possible that plants tailor defense response intensities to different MAMPs, an adaptation that could be especially beneficial when MAMP composition reflects differences in microbial community composition. Our previous work revealed extensive natural variation in the detection of a variant of flg22 and signatures of selection at the genetic loci underlying this variation [3]. This quantitative variation in MAMP perception indicated that these traits are evolving within natural populations. Whether these traits–the perception and response to divergent MAMPs–act and evolve redundantly is an open question.
To determine whether detection of flagellin and EF-Tu evolve redundantly in natural populations, we tested a large panel of A. thaliana genotypes for their response to different variants of these two MAMPs. As a read-out for MAMP perception, we utilized the macroscopic phenotype of seedling growth inhibition (SGI), a phenotypic response shown to correlate closely with MAMP perception in several Brassicaceae species [3]. In assaying quantitative variation in the responses to different MAMP variants and different MAMP classes, and determining the genetic loci underlying this variation via genome-wide association mapping, we shed light on the evolution of this first tier of defense.
Results and Discussion
Natural variation in MAMP-induced seedling growth inhibition
In order to characterize natural variation in MAMP-induced seedling growth inhibition (SGI) in A. thaliana, we challenged 186 natural genotypes with several peptide variants for each of two classes of MAMPs, elf18 and flg22 (Fig 1). An important feature of the study system is that growth inhibition is triggered by the peptides alone, without any confounding effects of bacterial proliferation and disease.
(a) Three elf18-variants and four flg22 variants were used to trigger SGI in 186 genotypes of A. thaliana. The x-axis indicates the MAMP variant and bacteria of origin: Pseudomonas syringae pv. tomato DC3000 (DC), P. syringae (PsHR-, PsHR+), P. viridiflava (Pv), and P. aeruginosa (Pa). Plotted are mean SGI values of 186 genotypes. The mean SGI for each A. thaliana genotype is estimated by calculating the relative reduction of fresh mass in percent. At least three replicates were measured per genotype and treatment to calculate mean SGI. The horizontal bar of the boxplot represents the median, the edges of the box present the 25th and 75th percentile. The whiskers are drawn at the data point that is closest to 1.5 x interquartile range. All outliers are shown. Small letters above the boxes indicate statistically different groups (ANOVA/Tukey’s post hoc test). (b) The peptide sequences of both elf18 and flg22 differ between closely related Pseudomonads (P. syringae and P. viridiflava) and within P. syringae.
We observed extensive variation in MAMP-induced SGI: Some genotypes exhibited no response while others exhibited a mean reduction in growth of up to 86% (e.g., Dra3-1 in response to elf18Pv). Different MAMP peptides induced significantly different SGI (ANOVA F5,5828 = 854, p = < 2e − 16, see S1 Table for full ANOVA table), consistent with previous findings [16–18].
In particular, the highly diverged peptide flg22Pv (Fig 1b) induced little to no SGI in most genotypes (Fig 1a). Although this flg22 variant of P. viridiflava induced no response in most A. thaliana genotypes, its elf18 variant induced the strongest SGI of all elf18 variants. In this case, the independent response to P. viridiflava enables the continued detection of pathogenic invader even though recognition of one MAMP fails.
Two of the 186 A. thaliana genotypes did not exhibit elf18-induced SGI and 10 did not exhibit flg22-induced SGI (Fig 2b). Genotypes lacking a response to one MAMP class were not impaired in their response to the other class, suggesting that loss-of-function of the elf18 and flg22 perception systems undergo differentiated evolution. In order to investigate the cause of low flg22-induced SGI, we tested FLS2 protein levels of nine genotypes that showed little or no SGI in response to any flg22 peptide variant. We observed compromised protein production in all tested genotypes (Fig 2d). It remains unclear if lower protein levels are caused by mutations within the protein coding sequence of the receptor or trans-acting factors. Analysis of mRNA expression levels in a subset of the flg22-insensitive genotypes revealed lower levels of mRNA expression of FLS2 (S1 Fig) in these genotypes. We furthermore obtained FLS2 nucleotide sequences for 9 of the 10 flg22-insensitive genotypes. Six out of nine show putative deletions in the catalytic site of the serine-threonine kinase domain of FLS2 (S1 Text). These results suggest that the absence of SGI in response to flg22 treatment is largely explained by differences in the coding sequence and the abundance of the receptor itself. The comparatively high prevalence of flg22-insensitive genotypes raise the possibility that the elf18 perception system has surpassed the ancient flg22 pathway [4] in importance, at least in A. thaliana.
(a) Genotypes that showed no SGI to one MAMP class are not impaired in their growth response to the other MAMP class. We classified genotypes as flg22-insensitive when they exhibited less than 15% SGI over all peptides in one MAMP class. The boxplots present SGI induced by the two MAMP classes (i.e., elf18DC, elf18Ps and elf18Ps grouped as“elf18” and flg22Pa, flg22PsHR+ and flg22PsHR− grouped as “flg22”). (b) Genotypes with low flg22-induced SGI exhibit reduced FLS2 protein abundance or a truncated FLS2 protein (PHW2). Two separate immunoblots and their corresponding coomassie colloidal blue stained membranes for total protein loading control. Immunoblots were conducted with an anti-FLS2 antibody that is directed against the kinase region of the flg22 receptor FLS2. We sequenced this region to ensure that the epitope recognized by the anti-FLS2 antibody is identical to the Col-0 control and thus allows direct comparison of protein levels. Note that the genotype San-2 also exhibited less than mean 15% SGI in response to the flg22 MAMP variants. (c) Pearson correlation coefficients and confidence intervals for genotype means of SGI. S2 Table contains the corresponding numerical values. (d) Effect of different MAMPs on plant architecture. Plants (N > 40) were grown in presence or absence of 100nM of MAMP and both fresh mass and shoot / root ratio determined. ANOVA, followed by a post hoc Tukey test, revealed significantly different means (p ≤ 0.05; indicated by different letters above the box plots).
Finally, it is interesting to note that small differences in amino acid sequence of the inducing peptides had the potential to profoundly impact SGI. For example, three amino acid differences between elf18Ps and elf18Pv resulted in an 18% increase in mean SGI (Welch’s t-test t = −7.0, p = 1.24e−11). On the other hand, flg22PsHR− and flg22PsHR+, which differ by a single amino acid, did not cause statistically significant differences in SGI (Welch’s t-test t = −0.89, p = 0.37, Fig 1a).
Correlation of SGI in response to different MAMP variants
We asked if genotypes exhibit similar SGI in response to variants of flg22 and elf18. If these MAMPs act redundantly, one would expect a strong correlation among the induced physiological responses. We observed strong correlations among variants within each MAMP class: the average Pearson correlation coefficient within a MAMP class is 0.61 whereas the mean correlation of SGI between MAMP classes is much weaker (mean R = 0.24, see Fig 2a and S2 Table for significance and confidence intervals of individual comparisons).
Veluchamy et. al [19] observed a similar disparity between within-class and between-class correlations in a small set of heirloom tomato tested for reactive oxygen production upon treatment with three MAMPs, flg22Pa, flgII-28 and csp22. Overall, the weak correlation between classes of MAMP peptides suggests some autonomy in the underlying genetic bases.
Genetic basis of MAMP-induced SGI
We performed genome-wide association mapping to a) reveal the genetic basis underlying natural variation in SGI and b) identify loci that are uniquely important in shaping either flg22- or elf18-induced SGI (Fig 3). We first calculated marker-based heritability estimates (h2 = 0.29 on average for all perceived peptides, S3 Table) to test if SGI is amenable to GWA mapping [20]. The MAMP flg22Pv induced little to no SGI in most A. thaliana genotypes, which is reflected in a broad-sense heritability estimate of 0 (S3 Table).
The genome-wide distribution of the -log10 p -values of the SNP—phenotype associations are plotted as a function of the genomic position along the five chromosomes. The GWA study was conducted using EMMAX, which controls for population structure. The x-axis displays the position along the chromosome while the y-axis displays the p-value from a linear mixed model regression associating a given SNP with SGI. SNPs with minor allele frequency (MAF) < 0.05 were excluded. For clarity, only SNPs characterized by -log10(p-value) ≤ 1 are shown. The horizontal dashed line depicts the threshold for the 0.1% tail of the p-value distribution, which varies between peptide variants (flg22Pv: 0.00134 to elf18DC: 0.00088). SNPs that displayed above-threshold p-values were considered for further analysis. SNPs corresponding to rare alleles (0.05 > MAF > 0.1) are plotted in magenta. The right-most panel shows quantile plots of the expected versus observed p-values for each MAMP variant. Note the different scale of the y-axis of the uppermost panel. Genes that could be validated as underlying the natural variation in SGI are indicated in black. The a priori candidate genes FLS2 and BAK1 are not significantly associated with SGI but are indicated in gray for reference. A table with all genes underlying the different peak regions is available in the data folder of the repository bitbucket.org/mvetter/geneticbasissgi.
We tested the significance of associations between the seven phenotypes and approximately 210,000 genome-wide SNP markers [21] in linear mixed models (EMMAX) that takes population structure into account [22]. We identified a number of loci that exhibit significant association with natural variation in SGI (Fig 3). In general, SGI appears to be governed by a complex genetic architecture involving loci with small effect. Only the GWA mapping of elf18DC-induced SGI identified a locus of large putative effect; this locus, which was found on chromosome 5, explains 28% of the observed variation.
The MAMP receptors EFR and FLS2 were considered strong a priori candidate genes for variation in SGI. The top SNPs of the single prominent peak for elf18DC-induced SGI, indeed, co-localize with the EFR locus. Surprisingly, EFR is not strongly associated with variation in SGI induced by the other two elf18 variants, elf18Ps and elf18Pv. To investigate this further, we conducted phylogenetic analysis and revealed two major EFR haplotype groups (Fig 4) that strongly differentiate detection of elf18DC from elf18Ps and elf18Pv (Figs 3 and S2). Thus, EFR controls natural variation in SGI for only a subset of elf18 variants.
Phylogenetic analysis of 3,096 nucleotides of the EFR coding region of 109 A. thaliana genotypes that were also represented in the GWA mapping. The sequences were reconstructed from SNP data of the 1001 genome project (http://1001genomes.org/). All nodes are supported by 100 out of 100 bootstrap replicates. Vertical grey bars indicate two haplotype groups that exhibit 28 nucleotide differences. Genotype 8240 (i.e., Kulturen-1) is a strong outlier with 54 nucleotide differences compared to haplotype group 2.
Variation in flg22-induced SGI was attributed to many loci with small effect. The most prominent peak (p ≤ 4 * 10−8) in the flg22Pa-induced SGI co-localizes with the a priori candidate gene NADPH/respiratory burst oxidase protein D (RbohD), that is known to fine tune reactive oxygen production and hypersensitive response around pathogen infection sites. None of the SNPs within the 30 kb window comprising the FLS2 gene are significantly associated with flg22-induced SGI (see close-up of genetic region in S3 Fig.) Similarly, no significantly associated SNP mapped near the FLS2 co-receptor BAK1 (Fig 3). The functional importance of BAK1 for flg22-induced immunity is well-established [7]; however, the unusually limited natural genetic variation at the BAK1 locus (π = 0.0008 in 80 genotypes published in [23] versus 0.005 for a genome wide estimate by [24]) does not appear to contribute to natural variation in the SGI.
We identified a number of a priori candidate genes that are known to have an effect on SGI (Table 1). These candidates were found to be enriched in the 0.1% tail of elf18DC and elf18Ps but not the other MAMPs (Table 2). Thus, our results illustrate that genes with strong phenotypic effects in knock-out experiments do not necessarily harbor genetic variation causing natural variation in this plant phenotype [25].
Candidate genes were identified within 15 kb around a SNP falling in the 0.1% tail of EMMAX p-values. The distance of the candidate locus and the highest associate SNP are indicated in bp (column distance).
We assembled a list of 77 genes that have published evidence to alter SGI in response to either elf18 or flg22. The measure ‘frequency’ indicates how many a priori candidate genes were found among the number of GWA candidate genes (nb. genes) for all peaks of each trait. The empirical p-value is generated by shifting the number of peaks along the genetic positions and calculating the frequency of found a priori genes in the number of genes under these shifted peaks. The random shift along the genetic position maintains patterns of linkage disequilibrium and was repeated 100 times.
MAMP variants induce distinct physiological responses
We examined the genetic overlap in peaks identified during GWAS analysis of the responses to each MAMP. While MAMPs within each class share a higher number of peaks than expected by chance (S4 Table and S4 Fig), we did not observe enrichment in the overlap of GWAS peaks for SGI induced by the MAMP classes elf18 and flg22. This finding is consistent with our observation of high correlations of SGI within, but not between, MAMP classes, and suggests a common genetic architecture within each of two distinct MAMP classes.
Our results raise the possibility that flg22 and elf18 induce molecular responses that are more differentiated than is suggested by their similar patterns of gene expression [2, 9]. The observation that elf18 and flg22 induce different macroscopic changes further supports the differentiation of these responses. Specifically, flg22 acts both on leaf and root tissue, while elf18 is most effective on leaves [10, 11]. We confirmed this observation in a quantitative experiment demonstrating that application of elf18 significantly alters the shoot/root ratio compared to untreated plants, whereas flg22 treatment does not (ANOVA F2,175 = 77.0, Tukey post hoc elf18Ps vs control p ≤ 2.2e−16, flg22Pa vs control p = 0.36, Fig 2c).
MAMP-induced SGI is fine tuned by both shared and distinct loci
To further test our hypothesis that different loci underlie natural variation in elf18 and flg22-induced SGI, we selected candidate genes from our GWA for experimental validation using both MAMP classes. For each MAMP, we selected the 0.1% tail of p-values (i.e., the 203 SNPs most strongly associated with MAMP-induces SGI). To account for linkage disequilibrium, we defined peak regions that comprise all SNPs within 15 kb to either side of the SNP with the lowest p-value. This resulted on average in 113 peak regions per MAMP. On average, six genes underlie a peak region, resulting in 4481 genes across all peak regions and six MAMPs (flg22Pv excluded, see S5 Table for details). For experimental confirmation of candidates, we focused on genes that were either mapped in multiple MAMPs, genes with differential expression upon MAMP treatment [2, 9] or with a known role in defense mechanism or growth. We retrieved one T-DNA insertion line for each of 88 genes from the ABRC seed stock center and successfully confirmed homozygosity of T-DNA insertion for 57 mutant lines that were tested for SGI. We identified 11 candidate genes that significantly alter SGI (Table 3), some of which are a priori candidates such as EFR and RbohD, while others are genes of unknown function (e.g., AT4G21865 and AT5G57345). We consider as confirmed only candidates that pass q ≤ 0.05 after correction for false discovery rate. This approach reduces the reporting of false positives but might exclude some true positives (e.g, RIN4, involved in modulating flg22-induced SGI [26].) Our confirmation of 11 genes reveals a four fold enrichment in significant loci in comparison to the null. Our confirmation rate of 20% (after correction for multiple testing) is therefore substantially higher than expected by chance.
The first column indicates the candidate gene name. The columns elf18 and flg22 denote whether differential SGI was observed in response to either MAMP in mutant plants (an X indicates that a member of the peptide class induced differential SGI significance at FDR < 0.05). The column “Eff” denotes if SGI was stronger (+) or weaker (-) in the mutant in comparison to the wild-type Col-0. The magnitude of response differs between several of the validated candidates (see S6 Table). Information from the Arabidopsis Information Resource (www.arabidopsis.org) is given for genes that have not been previously associated with SGI.
Of the 11 confirmed candidates, four impact both elf18- and flg22-induced SGI and seven control either elf18- or flg22-induced SGI (Table 3). This finding suggests differences in the molecular pathways involved in the recognition of and response to different MAMP classes. The discrepancy between the extensive overlap previously identified in the pathways for elf18 and flg22 recognition, and the reduced overlap in genes identified here, may be explicable by either of two hypotheses.
One possibility is that loci underlying natural variation in SGI are primarily involved in the initial steps of MAMP perception prior to signal convergence. Functional differences in the receptor alone might be caused by sequence variation in its protein coding region [3], its transcriptional regulation [15], its post-translational glycolysation [27, 28] or modification of obligatory co-receptors [31]. Another possibility is that yet unidentified pathways, unique to perception of one MAMP, lead to observed phenotypic differences. We found some support for this hypothesis by identifying a number of experimentally confirmed loci that alter SGI uniquely upon treatment with elf18 variants but not flg22 variants.
Overall, our results reveal a lack of correlation between a plant’s response to elf18 and flg22. What are the implications of this uncorrelated response? Natural populations experience fluctuations in both abiotic and biotic conditions, and plants in different environments are colonized by disparate microbial communities and pathogen species. In light of this variability, the decoupling of elf18- and flg22-triggered physiological responses is potentially advantageous. Genotypes that respond more strongly to one MAMP do not in general respond equally strongly to another MAMP i.e., there are no genotypes hypersensitive to all MAMPs. This uncoupling of the responses allows for responses to different MAMPs to evolve independently in populations. There is also the recent demonstration of epistatic effects of recognizing multiple MAMPs, at least in mammalian systems [32]. The ways in which hosts regulate their microbiomes is a complex issue that is only beginning to be understood, but distinctly tailored MAMP initiated defense is likely one contributing factor.
Materials and Methods
All data and custom scripts are available at https://bitbucket.org/mvetter/geneticbasissgi/
Identification of natural MAMP sequences
Plants perceive specific epitopes of EF-Tu and flagellin that are known as elf18 and flg22, respectively. Flg22 sequences of five natural P. viridiflava strains [33] were identified using primers 5’-GCCATCGCGACGATAACTA-3’ and 5’-GGCGTTTTCGTTGATGTTCT-3’. Flg22 sequences for P. viridiflava strains LP23.1a and RMX3.1b as well as both elf18 and flg22 sequences for 20 P. syringae strains isolated from A. thaliana and the surrounding plant Drava verna [34] were derived from genome sequences available in the Bergelson laboratory (S2–S3 Texts). The Pseudomonas strains from which these MAMPs were derived were previously isolated from natural populations of A. thaliana. Infection analyses with the P. syringae strains determined that a subset of strains did not induce HR on A. thaliana [34, 35]. We tested MAMP variants derived from strains that did not induce HR (HR-) and strains that successfully induce HR (HR+).
The identified elf18 and flg22 variants were synthesized by EZBiolab, Carmel, IN.
Plant culture and estimation of SGI
Seedlings growth inhibition (SGI) was estimated for 186 genotypes of A. thaliana that were part of the panel in Atwell et al. [21]. Plants grown in the absence (control) or presence of 100nM MAMP (treatment) were grown in pairs and cultivated in sterile conditions as described in Vetter et al. [3]. Seedling growth inhibition was calculated as relative reduction of fresh mass in percent by [(CFM—TFM) / CFM] * 100, where CFM stands for control fresh mass and TFM for treatment fresh mass. Phenotypic values of SGI were calculated by averaging at least three pairs of control and MAMP treatments per genotype; these pairs were obtained in five independent biological trials for elf18 and seven independent biological trials for flg22 (resulting in a maximum number of 15 or 21 biologically independent values of SGI per genotype).
All data analysis was conducted in R [36]. Particular packages are indicated where appropriate.
SDS-PAGE and immunoblotting
Plant material was grown and processed as described in Vetter et al. [3]. In short, nitrogen frozen plant material was homogenized, weighed and dissolved in equal amounts of extraction buffer (25mM MES pH 6, 3mM MgCl2, 10mM NaCl and Sigma protease inhibitor, 2% SDS). Proteins were separated in a Novex NuPage 3–8% Tris-acetate gel and blotted onto a PVDF membrane (Immobilon, Millipore). The α-FLS2 antibody was incubated in 1:5,000 dilution, overnight at 4°. A horseradish peroxidase, coupled to secondary anti-rabbit IgG in 1:2,000 dilution was used to detect FLS2 protein bound to the membrane. This antibody was previously demonstrated to be specific to FLS2 [37].
mRNA and nucleotide sequence analysis of FLS2
Expression data for the FLS2 locus were obtained from [38] and the correlation between SGI and mRNA expression level were determined for those genotypes for which we had both SGI and RNA-sequencing information (49 accessions). The FLS2nucleotide sequence data for the genotypes that failed to respond to flg22 was obtained from http://tools.1001genomes.org/pseudogenomes/.
Correlation of SGI in response to diverse MAMPs
We investigated SGI induced by three elf18 and four flg22 peptide variants in 186 genotypes of A. thaliana. Twelve genotypes exhibited a mean SGI < 15% in response to either elf18 or flg22 variants. We considered these genotypes natural SGI mutants and excluded them prior to correlation analysis. We calculated Pearson correlation coefficients and confidence intervals for the remaining 174 genotypes. Note that no genotype failed to recognize both MAMPs.
Estimates of heritability
Marker-based heritability and confidence intervals were calculated using the marker_h2 function of the package ‘heritability’ in R [39]. This model incorporates a genetic relatedness matrix and generates REML-estimates of the additive genetic variance (), residual variance () and their standard errors, which allows calculation of heritability according to the equation, . The relatedness matrix was generated using plink1.07 [40, 41] and SNP data by Kim et al. [42].
Genome-wide association mapping
Genome-wide association mapping was conducted on average SGI for each genotype for which high density genotype data was available [42]. Analysis and restructuring of genotype data was conducted in plink1.07 [40, 41]. The mixed linear model EMMAX takes population structure into account by incorporating a K matrix of genetic relatedness [22]. EMMAX positively biases association signals for alleles with low frequency in the population. We therefore disregarded SNPs with a minor allele frequency (MAF) smaller than 5%. Our mapping panel contained 214,051 discriminative SNPs, of which 203,498 passed the MAF filter.
Investigation of the genetic architecture
In order to assess genetic similarities within and between MAMP classes, we identified shared peak regions. To account for linkage disequilibrium [42] and the fact that the highest associated SNP might not be the causal one [43], we defined peak regions encompassin 15 kb on either side of the highest associated SNP. A literature search identified 77 a priori candidate genes with published evidence of their participation in the SGI response to either elf18 or flg22. We tested for enrichment of these a priori candidates among our identified peak regions by calculating their frequency among the genes underlying the mapped peak regions of each trait. In order to assign an empirical p-value, we kept the number of peaks and their genetic distance intact but slid them across the genetic position of each chromosome. We then counted the number of genes underlying these shifted peaks and calculated the frequency of a priori candidates within the shifted peak regions. We repeated this procedure 100 times and determined the number of times we achieved an equal or greater frequency of a priori candidate genes. All custom scripts are available at https://bitbucket.org/mvetter/geneticbasissgi/.
Selection and confirmation of candidate genes
We considered SNPs that fell in the 0.1% tail of ranked GWA p-values; these correspond to the 203 most strongly associated SNPs. To account for linkage disequilibrium, we defined peak regions 15 kb to either side of the highest associated SNP and identified genes co-localizing with these peak regions based on TAIR 9 annotation. From this list of 4481 genes, we selected candidate genes for experimental confirmation if they fulfilled at least one of the following criteria: (1) detected by multiple MAMP variants, (2) differentially expressed upon MAMP perception (using data of [2, 9]), or (3) a priori candidate for growth or defense related processes. We selected 88 loci for which T-DNA insertion mutants were ordered from ABRC seed stock center (list available in the data folder of the repository bitbucket.org/mvetter/geneticbasissgi.) We analyzed a single T-DNA insertion line for each candidate gene (but see [44]) and could confirm homozygosity of T-DNA insertion for 57 mutant lines using primers according to http://signal.salk.edu/tdnaprimers.2.html. Silke Robatzek, The Sainsbury Laboratory, Norwich, kindly provided fls2-24 and efr-0 mutants for control purposes. We grew each SALK mutant genotype in a minimum of 12 replicates with an equal number of wild-type (WT) Col-0 or (WT) Ler plants (according to respective mutant background). Significant differences in SGI between mutants and WT were determined using a non-parametric Wilcoxon rank-sum test in combination with multiple testing correction using the fdrtool package [45]. To identify MAMP-specific SGI, we first tested the MAMP class and variant that led in the GWA to the candidate gene selection. If a significant result was observed, we tested at least one more variant within the same class as well as a variant in the other class.
Supporting Information
S1 Fig. The relationship between receptor expression level and SGI.
Expression data on the x-axis for FLS2 (a) and EFR (b) are taken from [38]. Plants were grown at 16°C. Receptor mRNA expression level is plotted against SGI induced by the respective MAMP. Expression data is presented in the units of reads per kilobase per million of mapped reads (RPKM). The red dotted line denotes the cutoff for distinguishing MAMP-sensitive from MAMP-insensitive genotypes (15% SGI.) Genotypes that did not respond to flg22 treatment exhibit low expression of FLS2.
https://doi.org/10.1371/journal.pgen.1006068.s001
(PDF)
S2 Fig. The EFR haplotype group strongly influences elf18DC-induced SGI.
The three panels shows SGI induced by elf18DC, elf18Ps and elf18Pv in 186 genotypes of A. thaliana. Above each panel we indicate results of a t-test testing the effect of EFR haplotype group on SGI. We labeled the two outliers (i.e., elf18-insensitive genotypes) Pro-0 and Alc-0 in the plot. The red dot highlights mean SGI of the genotype Col-0. While elf18DC-induced SGI is strongly determined by the EFR haplotype group, elf18Ps and elf18Pv are influenced to a lesser extant. This leads to a strong genotype-phenotype association (i.e., peak) in our GWA for elf18DC but not elf18Ps or elf18Pv at the EFR locus.
https://doi.org/10.1371/journal.pgen.1006068.s002
(PDF)
S3 Fig. Genomic region 18500000 to 19600000 of chromosome 5 is densely populated with highly associated SNPs.
The genomic region 18500000 to 19600000 of chromosome 5 has several peaks that are associated with flg22-induced SGI. These peaks do not co-localize with known a priori candidate genes such as flagellin receptor FLS2. None of the 69 SNPs within 15 kb to either side of FLS2 is significantly associated with flg22-induced SGI. The genes associated with individual peaks are stored in the data folder of the repository bitbucket.org/mvetter/geneticbasissgi.
https://doi.org/10.1371/journal.pgen.1006068.s003
(PDF)
S4 Fig. The number of shared peaks within a MAMP class is higher than expected by chance.
Each histogram shows the distribution of the number of shared peaks for 100 GWA runs that were generated with randomized phenotypic values. The red vertical line represents the number of peaks that were found in the actual mapping. Given is also the empirical p-value that indicates the chance of identifying the observed number of shared peaks by chance. P-values ≤ 0.002 are considered significant after Bonferroni correction for multiple testing.
https://doi.org/10.1371/journal.pgen.1006068.s004
(PDF)
S1 Table. Analysis of variance determining the effect of MAMP class (MAMPclass), host genotype (genotype) and MAMP peptide (MAMP, nested within MAMP class) on seedling growth inhibition (SGI).
https://doi.org/10.1371/journal.pgen.1006068.s005
(PDF)
S2 Table. Correlation of SGI is high within MAMP classes (elf18 or flg22) but not among MAMP classes.
The table indicates Pearson’s correlation coefficients for genotype means of seedling growth inhibition. Genotypes that did not exhibit seedling growth inhibitionin response to elf18 or flg22 were excluded prior analysis. Significant correlations are indicated in bold (Bonferroni corrected p < 0.006).
https://doi.org/10.1371/journal.pgen.1006068.s006
(PDF)
S3 Table. Heritability estimates and confidence intervals.
Marker-assisted heritability was estimated for seedling growth inhibition (SGI), fresh mass in control conditions (CFM) and fresh mass after MAMP treatment (TFM). SGI is calculated by [(CFM—TFM) / CFM] * 100.
https://doi.org/10.1371/journal.pgen.1006068.s007
(PDF)
S4 Table. Number of shared genomic regions.
Many genomic regions were mapped by more than one MAMP variant. SGI induced by peptides of the same MAMP class (elf18 or flg22 variants) share a larger number of genomic regions than between MAMP classes. As a result of linkage disequilibrium and SNP density, a highly associated SNP can be located several kb away from the causal gene. We therefore considered genomic regions of 30 kb instead of directly comparing shared genes.
https://doi.org/10.1371/journal.pgen.1006068.s008
(PDF)
S5 Table. Summary statistics of GWA mapping.
We conducted GWA mapping on SGI, induced by seven diverse MAMPs. We considered the 0.1% tail of strongest associated p-values for further analysis, which corresponds to 203 SNPs out of 203,498 for which genotype data were available. Column “MAMP” indicates the MAMP used to induce SGI, “p-value” indicates the p-value cut-off (i.e., the p-value of the 203rd strongest associated SNP), “nb. peaks” indicates to how many peaks these 203 SNPs cluster. The column “nb. of genes” indicates how many genes were contained in these peak regions. Peak regions are defined as genomic areas 15 kb to either side of the highest associated SNP.
https://doi.org/10.1371/journal.pgen.1006068.s009
(PDF)
S6 Table. Experimental validation of GWA candidate loci.
We tested plants carrying a non-functional allele (mutant) versus plants carrying the WT allele (Col-0 except fls2-24 that has a Ler genetic background). Column N indicates the number of tested mutants / WT plants, and effect (Eff) indicates an increase (+) or decrease (-) in SGI. Column ‘p-value’ indicates statistically discernible difference of seedling growth inhibition in mutant from WT using the non-parametric Wilcoxon rank-sum test. In order to correct for multiple testing, p-values derived by the Wilcoxon rank-sum test were re-evaluated by false discovery rate (FDR). P-values or FDR-corrected values smaller than 0.05 are printed in bold. The FLS2 locus was not associated with MAMP-induced seedling growth inhibition but the fls2-24 mutant was included for control purposes.
https://doi.org/10.1371/journal.pgen.1006068.s010
(PDF)
S1 Text. flg22-insensitivity is associated with nucleotide deletions within the FLS2 Serine-Threonine kinase catalytic domain.
Shown are the 100 bp surrounding the start site of the Serine/Threonine kinase catalytic domain of FLS2 in twenty two genotypes that exhibit SGI in response to flg22 (SGI+) and nine genotypes that do not (SGI-). Pseudogenome data was extracted from http://tools.1001genomes.org/pseudogenomes/. Brown shading details the beginning of the kinase domain. Eight of the nine SGI- genotypes contain missing data in the first 50 bp of this domain (N’s) and six contain stretches of N’s greater than 2 bp, suggestive of deletions in these genotypes.
https://doi.org/10.1371/journal.pgen.1006068.s011
(XLS)
S2 Text. Sequences of Flagellin in P. syringae and P. viridiflava.
Fasta nucleotide sequences of region surrounding flagellin gene in 20 P. syringae strains and two P. viridiflava strains. Strains are labeled with name first then species designation.
https://doi.org/10.1371/journal.pgen.1006068.s012
(TXT)
S3 Text. Sequences of EF-Tu in P. syringae and P. viridiflava.
Fasta nucleotide sequences of region surrounding EF-Tu gene in 20 P. syringae strains and two P. viridiflava strains. Strains are labeled with name first then species designation.
https://doi.org/10.1371/journal.pgen.1006068.s013
(TXT)
Acknowledgments
We thank Benjamin Brachi for discussion on GWA and for kindly providing scripts (as indicated on scripts available on https://bitbucket.org/mvetter/geneticbasissgi/). We thank Aparna Nutakki for technical help and Silke Robatzek for providing the αFLS2 antibody and knock-out mutants efr and fls2-24.
Author Contributions
Conceived and designed the experiments: MV JB. Performed the experiments: MV. Analyzed the data: MV TLK. Contributed reagents/materials/analysis tools: JB MV. Wrote the paper: MV JB TLK.
References
- 1. Zipfel C, Robatzek S, Navarro L, Oakeley EJ, Jones JDG, Felix G, et al. Bacterial disease resistance in Arabidopsis through flagellin perception. Nature. 2004;428(6984):764–767. pmid:15085136
- 2. Zipfel C, Kunze G, Chinchilla D, Caniard A, Jones J, Boller T, et al. Perception of the bacterial PAMP EF-Tu by the receptor EFR restricts Agrobacterium-mediated transformation. Cell. 2006;125(4):749–760. pmid:16713565
- 3. Vetter MM, He F, Kronholm I, Häweker H, Reymond M, Bergelson J, et al. Flagellin perception varies quantitatively in Arabidopsis thaliana and its relatives. Molecular Biology and Evolution. 2012;. pmid:22319159
- 4. Boller T, Felix G. A Renaissance of Elicitors: Perception of Microbe-Associated Molecular Patterns and Danger Signals by Pattern-Recognition Receptors. Annual Review of Plant Biology. 2009;60(1):379–406. pmid:19400727
- 5. Yue JX, Meyers BC, Chen JQ, Tian D, Yang S. Tracing the origin and evolutionary history of plant nucleotide-binding site–leucine-rich repeat (NBS-LRR) genes. New Phytologist. 2012;193(4):1049–1063. pmid:22212278
- 6. Lacombe S, Rougon-Cardoso A, Sherwood E, Peeters N, Dahlbeck D, van Esse HP, et al. Interfamily transfer of a plant pattern-recognition receptor confers broad-spectrum bacterial resistance. Nature Biotechnology. 2010;28(4):365–369. 00172. pmid:20231819
- 7. Chinchilla D, Zipfel C, Robatzek S, Kemmerling B, Nürnberger T, Jones J, et al. A flagellin-induced complex of the receptor FLS2 and BAK1 initiates plant defence. Nature. 2007;448:497–500. pmid:17625569
- 8. Roux M, Schwessinger B, Albrecht C, Chinchilla D, Jones A, Holton N, et al. The Arabidopsis Leucine-Rich Repeat Receptor-Like Kinases BAK1/SERK3 and BKK1/SERK4 Are Required for Innate Immunity to Hemibiotrophic and Biotrophic Pathogens. The Plant Cell. 2011;23(6):2440–2455. 00194. pmid:21693696
- 9. Wan J, Zhang XC, Neece D, Ramonell KM, Clough S, Kim Sy, et al. A LysM Receptor-Like Kinase Plays a Critical Role in Chitin Signaling and Fungal Resistance in Arabidopsis. The Plant Cell Online. 2008;20(2):471–481.
- 10. Ranf S, Eschen-Lippold L, Pecher P, Lee J, Scheel D. Interplay between calcium signalling and early signalling elements during defence responses to microbe- or damage-associated molecular patterns. The Plant Journal. 2011;68(1). pmid:21668535
- 11. Zipfel C, Robatzek S. Pathogen-Associated Molecular Pattern-Triggered Immunity: Veni, Vidi…? Plant Physiology. 2010;154(2):551–554. pmid:20921183
- 12. Zipfel C. Combined roles of ethylene and endogenous peptides in regulating plant immunity and growth. Proceedings of the National Academy of Sciences. 2013;110(15):5748–5749.
- 13. Tintor N, Ross A, Kanehara K, Yamada K, Fan L, Kemmerling B, et al. Layered pattern receptor signaling via ethylene and endogenous elicitor peptides during Arabidopsis immunity to bacterial infection. Proceedings of the National Academy of Sciences. 2013;110(15):6211–6216.
- 14. Mersmann S, Bourdais G, Rietz S, Robatzek S. Ethylene signaling regulates accumulation of the FLS2 receptor and is required for the oxidative burst contributing to plant immunity. Plant physiology. 2010;154(1):391–400. pmid:20592040
- 15. Boutrot F, Segonzac C, Chang K, Qiao H, Ecker J, Zipfel C, et al. Direct transcriptional control of the Arabidopsis immune receptor FLS2 by the ethylene-dependent transcription factors EIN3 and EIL1. Proceedings of the National Academy of Sciences of the United States of America. 2010;107(32):14502–14507. pmid:20663954
- 16. Cai R, Lewis J, Yan S, Liu H, Clarke C, Campanile F, et al. The plant pathogen Pseudomonas syringae pv. tomato is genetically monomorphic and under strong selection to evade tomato immunity. PLoS pathogens. 2011;7(8).
- 17. Bauer Z, Gomez-Gomez L, Boller T, Felix G. Sensitivity of different ecotypes and mutants of Arabidopsis thaliana toward the bacterial elicitor flagellin correlates with the presence of receptor-binding sites. J Biol Chem. 2001;276(49):45669–45676. pmid:11564731
- 18. Sun WX, Dunning FM, Pfund C, Weingarten R, Bent AF. Within-species flagellin polymorphism in Xanthomonas campestris pv. campestris and its impact on elicitation of Arabidopsis FLAGELLIN SENSING2-dependent defenses. Plant Cell. 2006;18(3):764–779. pmid:16461584
- 19. Veluchamy S, Hind SR, Dunham DM, Martin GB, Panthee DR. Natural Variation for Responsiveness to flg22, flgII-28, and csp22 and Pseudomonas syringae pv. tomato in Heirloom Tomatoes. PLoS ONE. 2014 09;9(9):e106119. pmid:25180693
- 20. Segura V, Vilhjalmsson BJ, Platt A, Korte A, Seren U, Long Q, et al. An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations. Nat Genet. 2012 07;44(7):825–830. pmid:22706313
- 21. Atwell S, Huang YS, Vilhjalmsson BJ, Willems G, Horton M, Li Y, et al. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature. 2010;465. pmid:20336072
- 22. Kang HM, Sul JH, Service SK, Zaitlen NA, Kong Sy, Freimer NB, et al. Variance component model to account for sample structure in genome-wide association studies. Nat Genet. 2010 Apr;42(4):348–354. pmid:20208533
- 23. Cao J, Schneeberger K, Ossowski S, Gunther T, Bender S, Fitz J, et al. Whole-genome sequencing of multiple Arabidopsis thaliana populations. Nat Genet. 2011 10;43(10):956–963. pmid:21874002
- 24. Nordborg M, Hu TT, Ishino Y, Jhaveri J, Toomajian C, Zheng H, et al. The pattern of polymorphism in Arabidopsis thaliana. PLoS Biology. 2005;3(7):e196. pmid:15907155
- 25. Koornneef M, Alonso-Blanco C, Vreugdenhil D. Naturally occurring genetic variation in Arabidopsis thaliana. Annu Rev Plant Biol. 2004;55:141–172. pmid:15377217
- 26. Kim MG, da Cunha L, McFall AJ, Belkhadir Y, DebRoy S, Dangl JL, et al. Two Pseudomonas syringae Type III Effectors Inhibit RIN4-Regulated Basal Defense in Arabidopsis. Cell. 2005;121(5):749–759. pmid:15935761
- 27. Häweker H, Rips S, Koiwa H, Salomon S, Saijo Y, Chinchilla D, et al. Pattern recognition receptors require N-glycosylation to mediate plant immunity. The Journal of biological chemistry. 2010;285(7):4629–4636. pmid:20007973
- 28. Li J, Zhao-Hui C, Batoux M, Nekrasov V, Roux M, Chinchilla D, et al. Specific ER quality control components required for biogenesis of the plant innate immune receptor EFR. Proceedings of the National Academy of Sciences. 2009;106(37):15973–15978.
- 29. von Numers N, Survila M, Aalto M, Batoux M, Heino P, Palva ET, et al. Requirement of a Homolog of Glucosidase II β-subunit for EFR-mediated Defense Signaling in Arabidopsis thaliana. Molecular Plant. 2010;3(4):740–750. pmid:20457640
- 30. Kadota Y, Sklenar J, Derbyshire P, Stransfeld L, Asai S, Ntoukakis V, et al. Direct Regulation of the NADPH Oxidase RBOHD by the PRR-Associated Kinase BIK1 during Plant Immunity. Molecular Cell. 2014;54(1):43–55. pmid:24630626
- 31. Schwessinger B, Roux M, Kadota Y, Ntoukakis V, Sklenar J, Jones A, et al. Phosphorylation-dependent differential regulation of plant growth, cell death, and innate immunity by the regulatory receptor-like kinase BAK1. PLoS genetics. 2011;7(4). pmid:21593986
- 32. Blander JM, Sander LE. Beyond pattern recognition: five immune checkpoints for scaling the microbial threat. Nature Reviews Immunology. 2012;12(3):215–225. pmid:22362354
- 33. Araki H, Tian D, Goss E, Jakob K, Halldorsdottir S, Kreitman M, et al. Presence/absence polymorphism for alternative pathogenicity islands in Pseudomonas viridiflava, a pathogen of Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America. 2006;103(15):5887–5892. pmid:16581904
- 34. Barrett LG, Bell T, Dwyer G, Bergelson J. Cheating, trade-offs and the evolution of aggressiveness in a natural pathogen population. Ecology Letters. 2011;14(11):1149–1157. pmid:21951910
- 35. Kniskern JM, Barrett LG, Bergelson J. Maladaptation in wild populations of the generalist plant pathogen Pseudomonas syringae. Evolution. 2011;65(3):818–830. pmid:21044058
- 36.
R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria; 2014. Available from: http://www.R-project.org/.
- 37. Göhre V, Spallek T, Häweker H, Mersmann S, Mentzel T, Boller T, et al. Plant Pattern-Recognition Receptor FLS2 Is Directed for Degradation by the Bacterial Ubiquitin Ligase AvrPtoB. Current Biology. 2008;18:1824–1832. pmid:19062288
- 38. Dubin MJ, Zhang P, Meng D, Remigereau MS, Osborne EJ, Casale FP, et al. DNA methylation in Arabidopsis has a genetic basis and shows evidence of local adaptation. Elife. 2015;4:e05255. pmid:25939354
- 39. Kruijer W, Boer MP, Malosetti M, Flood PJ, Engel B, Kooke R, et al. Marker-Based Estimation of Heritability in Immortal Populations. Genetics. 2015;199(2):379–398. pmid:25527288
- 40.
Purcell S. PLINK 1.07; 2009.
- 41. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira M, Bender D, et al. PLINK: a toolset for whole-genome association and population-based linkage analysis. American Journal of Human Genetics. 2007;81. pmid:17701901
- 42. Kim S, Plagnol V, Hu TT, Toomajian C, Clark RM, Ossowski S, et al. Recombination and linkage disequilibrium in Arabidopsis thaliana. Nat Genet. 2007 09;39(9):1151–1155. pmid:17676040
- 43. Korte A, Farlow A. The advantages and limitations of trait analysis with GWAS: a review. Plant Methods. 2013;9(1):29. 00055. pmid:23876160
- 44. Tax FE, Vernon DM. T-DNA-associated duplication/translocations in Arabidopsis. Implications for mutant analysis and functional genomics. Plant physiology. 2001;126(4):1527–1538. pmid:11500551
- 45. Strimmer K. fdrtool: a versatile R package for estimating local and tail area-based false discovery rates. Bioinformatics. 2008;24(12):1461–1462. pmid:18441000