Figures
Abstract
Durum wheat is an important cereal grown in Ethiopia, a country which is also its center for genetic diversity. Yellow (stripe) rust caused by Puccinia striiformis fsp tritici is one of the most devastating diseases threatening Ethiopian wheat production. To identify sources of genetic resistance and combat this pathogen, we conducted a genome wide association study of yellow rust resistance on 300 durum wheat accessions comprising 261 landraces and 39 cultivars. The accessions were evaluated for their field resistance using a modified Cobb scale at Meraro, Kulumsa and Chefe Donsa in the 2015 and 2016 main growing seasons. Analysis of the 35K Axiom Array genotyping data of the panel resulted in a total of 8,797 polymorphic SNPs of which 7,093 were used in subsequent analyses. Population structure analysis suggested two groups in which the cultivars clearly stood out separately from the landraces. Eleven SNPs significantly associated with yellow rust resistance were identified on four chromosomes (1A, 1B, 2B, and 5A) which defined at least five genomic loci. Six of the SNPs were consistently identified on chromosome 1B singly at each and combined overall environments which explained 62.6–64.0% of the phenotypic variation (R2). Resistant allele frequency ranged from 14.0–71.0%; Zooming in to the identified resistance loci revealed the presence of disease resistance related genes involved in the plant defense system such as the ABC transporter gene family, disease resistance protein RPM1 (NBS-LRR class), Receptor kinases and Protein kinases. This study has provided SNPs for tracking the loci associated with yellow rust resistance and a diversity panel which can be used for association study of other agriculturally important traits in durum wheat.
Citation: Alemu SK, Huluka AB, Tesfaye K, Haileselassie T, Uauy C (2021) Genome-wide association mapping identifies yellow rust resistance loci in Ethiopian durum wheat germplasm. PLoS ONE 16(5): e0243675. https://doi.org/10.1371/journal.pone.0243675
Editor: Aimin Zhang, Institute of Genetics and Developmental Biology Chinese Academy of Sciences, CHINA
Received: November 24, 2020; Accepted: April 16, 2021; Published: May 17, 2021
Copyright: © 2021 Alemu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: CU received funding from the UK Biotechnology and Biological Sciences Research Council (BBSRC's) Sustainable Crop Production Research for International Development (SCPRID) initiative (Grant Ref: BB/J012017/1, https://bbsrc.ukri.org/news/topic/scprid/). SKA received financial support for the PhD study to conduct 2 seasons field research in Ethiopia from Ethiopian Institute of Agricultural Research (EIAR: http://www.eiar.gov.et/index.php/en) and Addis Ababa University (AAU: http://www.aau.edu.et/) and a 1 year of lab work staying at the JIC, Norwich, UK through the Grant. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Durum wheat (Triticum turgudum subsp. durum) is a tetraploid wheat (2n = 4x = 28) with AABB genome designation and basic chromosome number x = 7. Tetraploid wheat is thought to be the result of a genome hybridization between the diploid A genome wheat species Triticum urartu (2n = 2x = 14) and an S genome species related to Aegilops speltoides [1].
Durum wheat is an important food crop with global production estimated to be 36 million t per year [2]. Ethiopia is considered as a center of secondary diversity for durum wheat and as such has a wide array of untapped tetraploid germplasm [3–9]. Although statistics on current production are combined (in most cases) into a single “wheat” category, occasional studies report that Ethiopia is the largest producer of durum wheat in sub-Saharan Africa with approximately 0.6 million hectares (Evan School Policy Analysis and Research [10]. Particularly, landraces (i.e., a locally adapted line that has not been modified through a breeding programme) are known to cover about 70% of the total durum wheat area [11]. The crop is grown widely on heavy black soils of the highlands with an altitude range of 1800 to 2800 m asl [12, 13]. Due to this continuous production by farmers in the highlands, Ethiopian durum wheat has evolved in its phenotypic [14] and molecular diversity [15]. Selection pressure from the natural and artificial sources, inevitably has contributed to the development of many adaptive traits including disease resistance [16–18] which provide an opportunity for genetic improvement programs.
Durum wheat is mostly grown as a food crop by small holder farmers, although it has potential as an industrial crop in the food industries. Generally, pasta, spaghetti, biscuits, pancakes, macaroni, pastries, and unleavened breads are among the known forms of uses of durum wheat [19]; and global durum wheat use is trending upward [20]. Traditionally, ‘kinche’ (crushed kernels, cooked with milk or water and mixed with spiced butter) is among the most commonly used Ethiopian recipe [21]. Apart from that, in the mixed farming systems of the highlands, the straw of durum wheat is also of high relevance for animal feed due to its digestibility and palatability characters [22].
Stripe/yellow rust of wheat is an obligate biotrophic fungal disease caused by Puccinia striiformis f. sp tritici (Pst) [23], and one of the most devastating diseases in wheat growing regions of Ethiopia [24]. In 2010, an epidemic of the disease occurred in the country and reached all the wheat growing regions at unprecedented rates. It infected large production areas including 29 zones in the Amhara, Oromia, and Southern Nations, Nationalities, and Peoples (SNNP) regional states [25, 26]; http://www.addisfortune.com.
Stripe rust can be managed using various methods such as cultural practices, fungicide application, and resistance cultivar development and deployment [27–30], with the latter being an environmentally friendly strategy. With regard to exploiting genetic resistance, worldwide research reports on wheat rusts so far indicate that more than 80 Yr genes (Yr1-Yr83) have been permanently named which are known to confer both adult plant resistance (APR) and/or all stage (seedling resistance) [31–34]. Most of these genes originally derived from hexaploid wheat (Triticum aestivum and their wild relatives); some of them from diploid wild relatives and very few of them (such as Yr15 and Yr36) from tetraploid wild relatives (Triticum dicoccoides) wihle just Yr24 from durum wheat (T. turgidum). A partial List of catalogued Yr genes derived from various sources is summarized in S1 Table. Resistance genes having diagnostic markers can be transferred to well adapted but susceptible cultivars through Marker Assisted Selection (MAS) coupled with gene pyramiding strategy. For instance, Yr genes such as Yr26, Yr51, Yr57, Sr genes like Sr22, Sr26 and Sr50 have been transferred to well adapted cultivars through Marker Assisted backcrossing method which is one of MAS approaches [35–37]. Due to its dynamic nature, new virulent Pst races often emerge and break varietal resistance by defeating the genes they carry and drive them out of production. This situation prompts the necessity of continuous search and identification of new sources of resistance to sustain wheat production.
In Ethiopia, research effort made by the wheat improvement programs has resulted in the development of cultivars with relatively good levels of wheat rust resistance, good yield, and desirable agronomic characters [38]. Germplasm screening to identify resistant breeding lines and development of elite durum wheat cultivars [39] has been part of the strategies to exploit the host’s natural defence in breeding programs. Screening available germplasms for the presence of those known Yr genes is one of the strategies to maximize the resistance potential of that germplasm. Specifically, when the genes are mapped to a known genomic locus and diagnostic molecular markers are available [40], the detection of the known resistant genes is facilitated. However, several of the identified genes have limited spectra of effectiveness as they have been previously overcome by specific Pst races. Besides, germplasm pools vary in their genetic background and local adaptation in which case deciphering the resistance genes they carry is difficult specifically in the absence of perfect markers. One strategy to address this is searching for resistance genes in locally adapted germplasm using techniques such as Genome Wide Association Studies (GWAS). GWAS studies ultimately result in the identification of genomic loci/SNPs which are significantly associated with the target traits [41] and can be used in marker assisted introgression programs to improve the target crop. As a follow-up investigation, the associated SNPs can be validated for their diagnostic values in an independent germplasm.
Ethiopian durum landraces have been suggested as a good source of resistance to wheat rusts [42, 43]. This has been supported by association studies on relatively small sized Ethiopian durum panels in limited study environments [17]. Extensive research on potential new and effective Pst resistance associated loci or genes is necessary to cope with the occurrence of new virulent Pst races. The availability of high-density SNP wheat chips e.g., 35k Breeders chips, iSelect 9 and 90 K SNP assays [44–46] provides an opportunity to facilitate such genome level studies. Therefore, the objectives of this study were: (1) to access the extent of field resistance to stripe rust among Ethiopian durum wheat panel and (2) to identify SNP markers/genomic loci significantly associated with Stripe rust resistance.
Materials and methods
Phenotyping
Plant materials.
Durum wheat accessions (n = 513) were obtained from various sources in Ethiopia. The accessions comprise landraces from gene bank collections (Ethiopian Biodiversity Institute (EBI) and Ethio-Organic Seed Action (EOSA)) and few additional landraces and cultivars (released varieties) from Debre Zeit Agricultural Research Center (DZARC). To start with a relatively pure seed stock, all the accessions were grown and those having a mixed seed stock were identified by physical observation on morphological features such as spike architecture (e.g., density, color, awns) and seed color. In few instances, accessions of mixed genotypes were split and considered as a separate accession making the number a bit increased. Each accession was then subjected to a single spike row planting followed by two generations of self-pollination. A final single spike to row planting was carried out at Holeta Agricultural Research Center for seed multiplication. The final working population was then constituted and cut down to 300 based on morphological similarity among the accessions and used for phenotypic evaluations and genotyping. This final panel included 261 landraces and 39 cultivars. The source and related description of the accessions is provided in S2 Table.
Field resistance evaluation.
The accessions were grown in an alpha lattice design with two replications at Chefe Donsa (CHD), Kulumsa (KUL) and Meraro (MER) testing sites in the main growing season (June–November) of 2015 and 2016. Chefe Donsa is located 35 kms east of Debre Zeit at 08°57’15” N and 39°06’04” E and has an altitude of 2450 m. Kulumsa is located 167 kms from Addis Ababa at 8°01’11.7"N 39°09’38.2"E and has an attitude of 2200 m; whereas Meraro is located about 236 kms from Addis Ababa at 7°24’25.8"N 39°14’56.3"E and has an elevation about 3,030 m. Each accession in each replication was sown in two rows of 0.5 m length, with 0.2 m spacing between rows. Each block was enclosed between spreader rows of known susceptible durum wheat (LD-357, Arendato and Local Red) and bread wheat (Morocco known to have Sr25 and Lr19) varieties mixed in the ratio of 1:1:1:1 and sown 20 cm from the experimental plots. The spreader rows act as an inoculum source and help in achieving uniform disease establishment throughout the experimental field. The disease severity (percentage of leaf tissue infected with the rust) was evaluated using a modified Cobb’s scale [47] with values ranging from 0 to 100%. The field response of the genotypes to the rust infection was scored according to Stubbs et al. [48] as R (resistant), MR (Moderately Resistant), Moderate (M), MS (Moderately, Susceptible) and S (Susceptible) each having a numerical constant value of 0.2, 0.4, 0.6, 0.8 and 1.0, respectively. The scoring or field resistance was usually commenced when disease severity is greater or equal to 50% and a susceptible reaction (S) was observed on the spreader rows which is presented as a combined value 50S. All agronomic practices were applied following the recommended practice for wheat at each location. Two data sets (one for each year) per each of the three locations was generated giving a total of six environments.
Phenotype data analysis.
Severity (SEV) score was multiplied with the field Response (RES) values to produce the Coefficient of Infection (CI) which represents the combined reaction of the genotypes for the pathogen. The panel was classified in to Resistant, Intermediate and Susceptible groups based on the average values of SEV and RES [17, 49] for each environment. As CI is the product of SEV and RES, the corresponding values were used to do the same reaction classification in terms of CI as well. To comply with the normality assumption, SEV and CI data were transformed with common logarithmic function (SEVtr = LOG10(SEV+1) and CItr = LOG10(CI+1)) while Response was subjected to arcsine transformation (REStr = arcsine(√RES)). The level of infection on some of the known susceptible genotypes was assessed for its consistency of susceptibility and used as a basis to judge the suitability of the disease reaction data across environments for downstream analysis. Accordingly, level of infection data of CHD-2015 and KUL-2015 were found to be so low to be reliable and excluded from all downsteam analyses. Shapiro Wilk test (Shapiro and Wilk 1965 [50]) was applied on the original and the transformed data to assess the normality of the data. Analysis of Variance (ANOVA) was done with the transformed data for each and combined over environment using a linear mixed model. In the model, genotype, Incomplete block within Replication, Location, year, and genotype by other variance source interactions were considered as random effects. Multiple random effect test (Variance component test) was carried out using Lym4 and LimerTest packages in R. The Best Linear Unbiased Estimates (BLUEs) of each environment and combined data (BLUE-all) was generated using Restricted Maximum Likelihood (REML) method fitting the genotype as fixed effect while the other variance sources and interactions as random effect. These BLUE values were used to perform the association analysis. Correlation analysis was performed for SEV, RES and CI to assess the extent of covariation of the resistance reaction within and across environments.
Genotyping
Seeds were sown on a 50 mm diameter sterile petri dish equipped with 42 mm filter discs to maintain a moist condition for germination. After watering them with distilled water, petri dishes were incubated at 4°C under dark conditions for 24 hours to break dormancy. The dishes were then kept at room temperature for 3–4 days until fully germinated. Germinated seeds were planted in a 96 well tray filled with peat and sand soil mix optimized for raising cereal seedlings. The trays were placed in cereal growth chamber set at 19°C day and 16°C night temperature with a relative humidity of 70% and a photoperiod of 16/8 hours light/dark cycle.
Fully opened leaves were harvested from 10 to 14 days old seedlings in a 1.2 ml deep-well plate on a dry ice and freeze-dried for 24–30 hours at -40°C under a pressure of 20 atm. The tissue was then ground into fine powder followed by a wet grinding with Geno/Grinder 2010 at 1750 rpm for 2 minutes. Genomic DNA was extracted with SDS buffer following the wheat and barley DNA extraction protocol in 96-well Plates [51] with some modification. DNA was cleaned according to Affymetrix User Guide, Axiom® 2.0 Assay for 384 Samples (Genomic DNA Preparation and Requirements) and quantified with Nanodrop (8-sample spectrophotometer ND-8000). A total of 100 μL of DNA sample normalized to 75–100 ng/μL was submitted to University of Bristol Genomic Facility for genotyping using Breeders’ 35K Axiom Array. At the service center, sample array processing and genotyping was carried out following the Axiom® 2.0 Assay for 384 Samples user and Workflow guide (http://media.affymetrix.com /support/downloads/manuals/axiom_2_assay_auto_workflow_user_guide.pdf). We received the genotype data as ARR, JPEG and CEL files, where the later was used for SNP and all other downstream analyses.
Analysis
Genotype/SNP data analysis.
Genotype/SNP analysis was performed with Axiom Analysis Suit (AxAS) v2.0 using the CEL intensity files following sample QC and the Best Practice Workflow. Poor quality samples were identified with Dish Quality Control (DQC) values where samples having a value of ≤ 0.80 were excluded from the next step of the analysis. Using a subset of probe sets, samples which passed the DQC value step were subjected to genotype calling to generate the QC call rate and those samples which had a value ≤ 0.91% were excluded from all subsequent genotyping and SNP data analysis. Once the good quality (QC passed) samples were identified, the downstream genotype data analysis was performed following the Best Practice Workflow with SNP QC default settings for SNP genotype calling. The resulting SNP classes were assessed for complying with expected thresholds mainly with the number of minor alleles ≥ 2. Accordingly, the ‘polyhighresolution’ and ‘NoMinorHomos’ classes, as they fulfill the criteria, were closely examined, and considered for the next step analysis. The other SNP classes (MonoHighResolution, OTV, Other and CallRateBelowThreshold) were not considered because they are noisy in many aspects and did not comply with the minor allele number threshold mentioned above.
The SNP summary table and the genotype call data were extracted using the export tab of the Axiom analysis window. The accessions’ genotype data was further subjected to individual heterozygosity analysis and accessions with a value of ≥ 3% heterozygosity were excluded from the panel. The physical positions of the SNPs were extracted from the position file of Breeders’ 35K Axiom Array anchored on the wheat reference genome sequence RefSeq v1.0 [52]. Minor Allele Frequency (MAF) was calculated as a percentage of each SNP allele relative to the total in the association panel and individuals having a MAF value > 5% (cut-off). Furthermore, Locus Heterozygosity (the probability that an individual is heterozygous for the locus in the population and polymorphism Information content (PIC: the discriminating power of the marker in a population) were calculated as described in Liu [53] and Guo [54]. With all the threshold values of heterozygosity and MAF, the final working number of the panel was determined 293 while that of the working SNP data was cut down from 8797 to 7093 and these were used in all downstream analyses.
Population structure & kinship.
Population structure among the association panel was analyzed using STRUCTURE v 2.3.4 [55]. Allele frequency model was employed at Length of burnin period = 10,000, number of MCMC Reps after burnin = 100,000, K runs from 1–10 with 5 replications for each K. The result of the analysis was zipped into a folder and uploaded on to STRUCTURE HARVESTER [56], an online analysis tool used to generate an estimate of the optimum number of subpopulations (i.e., the value of K) through Evanno method [57]. Kinship matrix was generated through genetic similarity matching using all possible pairwise combination among the panel using the R package Genomic Association and Prediction Integrated Tool (GAPIT) besides performing the PCA [58].
Linkage disequilibrium.
Genome-wide Linkage Disequilibrium was assessed using TASSEL v.5 to estimate the squared allele frequency correlation (r2) for all pairwise comparison of distances between SNPs. The genotype data was imported into TASSEL in HAPMAP format and full matrix LD analysis was performed with the default settings. The resulting LD output was subjected to binning using customized R script with bin_size of 1,000 bp and maximum_bin of 829,100,000 bp to generate the average r2 and the inter-SNP distance values. The LD values were plotted against the corresponding inter-SNP physical distance using a customized R script to estimate the LD decay rate as described in Hill and Weir [59]. The threshold value of the LD (r2) was set at 0.2 a commonly applied value in many related studies [60–63]. To visualize the LD decay pattern, nonlinear model was fitted to the LD plot relating the squared allele frequency with the physical distance [64].
Association analysis and locus exploration.
The working set 7093 SNP genotype data across the 293 genotypes was organized in the HapMap format. The phenotype data in terms of the Best Linier Estimate (BLUE) for SEV, RES, and CI was arranged across the 293 genotypes for all the single environment data sets. GAPIT [58] was used for the association analysis in R v 4.0.3 and RStudio v. 1.4.1103 to identify the genomic loci underlying the Pst resistance. The markers and the phenotype data corresponding to each genotype was input and fitted in the compressed mixed linear model (MLM) which accounts for uneven relatedness and controls it effectively through lowering the type I errors [65]. VanRaden’s method [66] was applied to calculate Kinship. To account for the genetic structure, Principal Component Analysis (PCA) was calculated and iteratively added to the model [67]. The best fit of the model was visually assessed by observing the QQ plots. Correction for false positive association was then performed by adding the default K + PCA covariates to the fixed effect part of the model in the GAPIT code. The analysis was carried out first at each environment using a single dataset, then combined over all environments. The probability of adjusted false discovery rate (p < 0.05) was used as a critical value to declare the significant marker trait association [68]. The “-log10(p)” values were plotted against each chromosome and the “expected -log10(p)” to generate Manhattan and QQ-plots respectively with codes imbedded in the GAPIT script. The SNP allele in the most and consistently susceptible line EDW-262 was used as the susceptible allele and the alternative was considered as the resistant one. The resistant allele frequency among the panel was determined based on the number of individuals which harbor the resistant alleles at each locus.
Single locus-based variation of phenotypic values of resistant and susceptible alleles was also tested using Two-Sample T-test assuming equal variances. Further exploration was done on genomic regions on chromosomes encompassing all identified loci significantly associated with the resistance trait. Genes and their functional descriptions reported with in the identified resistance associated regions and close to the nearest significantly associated SNP were examined using list of genes/gene model extracted from the wheat genome annotation file [52].
Results
Phenotypic reaction to Pst infection and distribution
Variably distributed reaction groups were demonstrated among the panel for SEV, RES and CI across the six environments (Fig 1A–1C). For SEV, 100% of the accessions were classified as resistant (0≤SEV≤10) at CHD_15 and 99.7% at KUL_15 while 49, 95, 84, 58% were classified at MER_15, CHD_16, KUL_16 and MER_16 respectively (Fig 1A). For RES, 96.6% of the accessions fell under resistant class at CHD_15 and 97.63% at KUL_15, while 33, 91, 79, 41% were observed under the same resistant class at MER_15, CHD_16, KUL_16 and MER_16 in that order (Fig 1B). Almost similar frequencies of reaction groups were observed for CI where 99% of them were under resistant class both at CHD_15 and KUL_15 while 38, 89, 78, 49% appeared in the same reaction group respectively at MER_15, CHD_16, KUL_16 and MER_16 (Fig 1C). The reaction data of each genotypes of the panel to Pst has been presented in S2 Table. For all the SEV, RES and CI values of CHD-15 & KUL-15, almost 100% of the panel shown overall resistant reaction unlike their reaction at CHD-2016, KUL-2016, MER-2015 and MER-2016. Besides, investigation of the reaction of the five selected susceptible genotypes (Table 1) across the six environments showed that an intermediate or resistant reaction for CHD-15 and KUL-15. This is too low infection level to be considered in further analysis as it affects results and leads to erroneous conclusion. So based on this assessment, the data of those two environments were excluded from all downstream analyses. The mean values of SEV, RES and CI combined over the remaining four environments was 10.6%, 0.3 and 8.1 in the same order. Considering stability of reactions across all the four environments, 37.2, 25.9, and 31.1% of the genotypes gave consistently resistant reaction for SEV, RES and CI across the testing location in that order (S3 Table). Because CI is a combined representation of SEV and RES for the resistance reaction, 31.1% (91 genotypes) of the total 293 panel were considered as stably resistant across the four environments. The list of these resistant genotypes along with the SEV, RES and CI values can be accessed from S2 Table.
A) severity, B) Field Response and C) Coefficient of Infection. The reaction data of severity and field response was used to classify the panel in to Resistant, Intermediate and Susceptible groups as described in Liu et al. [17] and taking the corresponding values for Coefficient of Infection as well.
Distribution of the untransformed and transformed data is presented in Fig 2A–2F. Despite discarding CHD-15 and KUL-15 data from the analyses, the frequency distribution of the original untransformed data (BLUE-all-Original) values was still skewed for SEV (W = 0.67) and CI (W = 0.58), while RES appeared to be less skewed (W = 0.91) (Fig 2A–2C). Application of logarithmic transformation to SEV and CI and the arcsine transformation to RES changed the distribution to better adjusted normality with W values of 0.93 for SEV, 0.93 for RES and 0.88 for CI (Fig 2D and 2F).
A, B and C represents distributions of best linear unbiased estimates (BLUEs) of original for SEV, RES and CI data while D, E and F represents transformed data in their respective order. “W” is Shapiro—Wilks Statistic indicating the correlation between observed values and normal scores both for the original and transformed values.
Variances and correlations of reactions
We first examined the variance components for the multiple factors included in the statistical model. ANOVA of each environment and combined across environments is summarized in Table 2. Genotypic variance component was significant (P < 0.001) for all the individual environments and combined across environments (Table 2). Except for MER-15 & MER-16, Blocks nested within replications were non-significant for CHD, KUL and combined across environments. Similarly, except for year variance components of locations and the two-way genotype by environment interactions (GxY & GXL) were significant at P < 0.001 for all SEV, RES & CI. Likewise, the three-way interaction (GxLxY) resulted in a highly significant variation at P < 0.001 for SEV, RES and CI. Apparently, there was strong interaction between the response of each genotype based on the location and year, which is consistent with the result presented in Fig 1A–1C).
Next, we calculated the correlation coefficients to see the relationships between the three phenotypic variables within and across environments (Table 3). Within MER, correlation coefficient (r) between data of 2015 & 2016 was 0.70 for SEV; 0.72 for RES and 0.68 for CI. whereas the data from CHD and KUL was only for 2016, and no correlation was calculated for within location level. On the other hand, overall correlation among all four environments was 0.67 ± 0.05 for SEV vs SEV, 0.66 ± 0.05 for RES vs RES and 0.67± 0.05 for CI vs CI (Table 3). Correlations among the three reaction data types was also assessed at different combinations (SEV vs RES, SEV vs CI & RES vs CI). High correlation was observed for SEV vs RES (r = 0.94 ± 0.01), SEV vs CI (r = 0.97 ± 0.01) and RES vs CI (r = 0.93 ± 0.01) within same environment while very low and nearly similar for within same location- different year and among locations (Table 3). This is expected because the level of disease pressure that result in a high severity is highly likely to result in higher reaction of the genotype exposed and hence positively impact the coefficient of correlation. Besides, using the T-distribution test for most of the comparisons shows, statistically significant correlations at P < 5% except very few marked as “ns” (Table 3). This can be partly explained by the higher degrees of freedom in the analysis (i.e., DF = 293–2 = 291) and due to the large number of samples.
Genotypes/SNPs called and chromosomal distribution
Genotype data assessment on the 300 accessions resulted in 296 which passed Dish Quality Control (DQC) ≥ 0.8. An example of Cluster plots of the panel resulted from Axiom Analysis Suit V.2 with Best Practice Workflow is presented in Fig 3. Heterozygosity and SNP analysis defined a final panel of 293 accessions with 10,622 polymorphic SNP markers after excluding Individuals with Heterozygosity values of ≥ 3%. We extracted the physical positions (from RefSeqv1.0) of the informative SNPs which led to 3853 (36.3%) markers being assigned to the A-genome, 4944 (46.5%) to the B-genome, 481 (4.5%) to the D-genome and 1344 (12.7) to “Chromosome U” (unknown physical position and no significant match in the genome). Excluding the D-genome SNPs (as they are not expected in a tetraploid/durum wheat) and the unknown set resulted in a total of 8797 physically positioned SNPs.
QC Threshold passed accessions are 296 and 8 samples are control; AX-95238778 & AX-95685405 are Axiom SNP probes with which the accessions are genotyped.
Varying numbers of SNPs, minor allele frequencies (MAF), Locus heterozygosity and polymorphic information content (PIC) value of the SNPs were obtained at chromosome, sub-genome, and whole genome levels (Table 4). The highest number of SNPs (852) were observed on chromosome 1B while the lowest (387) was found on chromosome 4A. Mean MAF was the highest (0.2171) for chromosome 1B while it was the lowest (0.1631) for chromosome 2A. Mean locus heterozygosity and PIC values were highest (0.2879 and 0.2326) for chromosome 1B SNPs and lowest (0.2214 and 0.1852) for chromosome 3B respectively. On the average, we identified 628 ± 42 SNPs per chromosome.
We further explored the SNPS by looking at the sub-genome distribution. We found lower number of A genome (43.8%) than B genome (56.2%) SNPs (Fig 4). SNPs from both genomes had similar MAF (0.1857, 0.1871), locus heterozygosity (0.2623,0.2634) and PIC (0.2158, 0. 2162). Group 1 chromosomes had the highest SNP density/coverage, diverse and informative SNPs. For more reliable result in downstream analyses, these SNPs were further refined to maintain only those with a > 5% MAF cut-off resulting in a final set of 7093 SNPs (Table 4). The distribution of this final SNP set was on average 442.1 ± 27.0 for the A-genome and 571.1 ± 53.0 for the B-genome.
They are a subset of the 8797 SNPs (Table 4) and are selected based on their MAF values of > 5% for all subsequent analysis.
Population groups and relatedness
We performed a population structure analysis which grouped the panel into two clusters and kinship analysis which revealed further sub groupings (Fig 5A–5F). Landraces clustered together into one group while all the cultivars clustered in a second group (Fig 5A). Structure harvester analysis suggested that delta K attained its highest value at K = 2 which is the most likely number of populations in the study panel (Fig 5B). In a global view, kinship analysis resolved the panel into two distinct groups as well, where the cultivars still stood out separately from the rest of the panel (Fig 5C). A closer look at the grouping pattern however revealed the presence of four groups where the cultivars (CL) clustered separately while the landraces separated into three sub-groups (LR-I, LR-II & LR-III). Principal Component Analysis (PCA) also resolved the panel into four clusters with the cultivars noticeably still isolated from the landraces (Fig 5D). Despite PCA revealed four clusters, only the first two components explained 66.67% (Fig 5E) of the variation which is in agreement with presence of two main groups as depicted by structure and kinship analyses.
A) population structure plot as reveled by STRUCTURE analysis. B) Delta K plot from STRUCTRUE HARVESTER analysis. C) 293x293 Kinship matrix plot using genetic similarity matching where outside the matrix is a clustering tree of the panel in to 2 main groups (in global view) while a bit of detailed view gave four clusters: CL = Cultivars and three Land Races sub- groups (LR-I, LR-II and LR-III). At the top left corner of the plot is the distribution of estimated kinship values. D) 3D PCA plot of the first three principal components where improved varieties still stood out in a distinct group at the very right end in the plot while the rest grouped in a similar way to the Kinship plot. E) 2D PCA plot elaborating sub-groups along with the variation explained by the components.
Linkage disequilibrium
We computed the squared allele frequency correlation (r2) for all pairwise comparisons of distances between SNPs using TASSEL. At a genome-wide level, 627,886 inter-SNP distances were found through binning LD analysis output at a bin of 1,000 bp. Of this, 176,571 (28.12%) were in significant LD with an average LD estimate of r^2 = 0.11 where the highest value (0.28) was achieved within the first 10 Mbp of physical distance (Table 5). The mean LD above the critical value (r^2 = 0.2), was also the highest (0.36) within the same 10 Mbp of physical distance as reported for the total (Table 5).
The fitted LOESS curve intersected with the critical LD value at physical distance of 69.1 Mbp where all the values of LD bellow this point were considered to be due to physical linkage among the inter-SNP pairs (Fig 6). The LD started to decay below this critical value to an average r2 of 0.16 for an increase of 10 Mbp (in the interval 10–20) suggesting that the overall LD accounted for the association is exhibited by relatively a shorter genomic distance.
Average pair-wise inter-SNP LD (r2) values plotted against physical distance in base pairs based on the wheat reference genome RefSeq v.1.0. The red line indicates the threshold LD.
Marker-trait associations for resistance to yellow rust
The association analysis of CI resulted in a total of 11 SNPs, across four chromosomes (1A, 1B, 2B and 5A) significantly associated with yellow rust resistance at FDR-adjusted P≤0.05 (Table 6). Two SNPs (AX-94482796, AX-94856684) at CHD and another three SNPs (AX-94438404, AX-94460229, AX-94648330) at KUL were identified as location specific resistance associated SNPs. Six of the 11 identified SNPs (AX-95171339, AX-94436448, AX-95238778, AX-95096041, AX-94730403 & AX-94427201) however, were consistently identified (Table 6 & Fig 7A–7E) at each location and combined analysis over all four environments (BLUE-all). No location specific SNP was identified at MER. Almost similar sets of SNPs were identified from GWAS analysis for SEV and RES as well. As both SEV and RES are highly correlated to CI (Table 3), we here focused on reporting the GWAS results in relation to CI. A comprehensive result of the GWAS analysis for SEV, RES and CI of each environment and combined data is presented in S4 Table. The phenotypic variation R2 explained by all six SNPs on chromosome 1B ranged from 51.9–53.9% for CHD-16, 52.1–58.5% for KUL-16, 51.1–52.1% for MER-15, 54.8–56.9% for MER-16 and 62.6–64.0% for BLUE-all (Table 6). MAF ranged from 14.2% to 31.7% with the highest for AX-95171339and the lowest for AX-94856684. The lowest (2.13E-08) significant FDR_Adjusted_P-value was exhibited by AX-94730403 on chromosome 1B. The resistance allele frequency (RAF) ranged from 14.0% to 71.0% among the panel with marker AX-94856684 (on chromosome 2B) attaining the lowest value while AX-95238778 and AX-95096041(on chromosome 1B) having the highest value.
Results represent association analysis of CI data from 293 durum wheat accessions at CHD-16 (A), KUL-16 (B) MER-15 (C), MER-16 (D) and combined over the four environments (E). The panel were genotyped with Breeders’ 35K Axiom Array for wheat.
Variation of phenotypic values of resistant and susceptible alleles was also assessed with Two-Sample T-test (assuming equal variances). This resulted in a highly significant difference between the resistant and susceptible allele carrying individuals (Table 7).
Other genes in the vicinity of SNPs associated with the Pst resistance
Further investigation on the genomic region encompassing the significantly associated SNPs in all the four chromosomes revealed 44 genies of various descriptive functions (Table 8). Thirty-six of the genes were identified In approximately 10.5Mbp (325818004–336274839) genomic region encompassing the identified resistance associated SNPs on chromosome 1B. Four of these genes (MATH domain containing protein (TraesCS1B01G180400), Alpha-galactosidase (TraesCS1B01G181700), Chloroplast inner envelope protein putative, expressed (TraesCS1B01G182300) and Plant basic secretory family protein (TraesCS1B01G182700)) were redundantly found. SNP AX-95171339, which is found close to the outer most of this region is very closely associated to Pentatricopeptide repeat-containing protein (TraesCS1B01G179700) while SNP AX-94436448 is flanked by DNA-directed RNA polymerase subunit (TraesCS1B01G179900) and Peptide chain release factor 2 (TraesCS1B01G180000). The ABC transporter gene family (TraesCS1B01G181200) which is known to confer durable resistance to multiple fungal pathogens in wheat [69] is also very closely located with AX-95096041. Similarly, the disease resistance protein RPM1 (TraesCS1B01G182900) belonging to known disease resistance protein families (NBS-LRR class) was also found proximal to SNP AX-94427201 besides the Receptor kinase. The Other 8 genes were identified as flanks in the vicinity of the remaining 4 SNPs on chromosome 1A, 5A and 2B with the Protein kinase flanking the SNP AX-94856684 approximately 2.2 Kb away on chromosome 2B (Table 8).
Discussion
Phenotypic variability in resistance to Pst
The average field response to Pst of accessions ranged from very low to high among the environments. This is very clear when examining particularly the result of CHD-15 and KUL-15 environments for which the mean SEV, RES and CI values of susceptible genotypes are 1.1, 0.3, 1.1 and 2.6, 0.3, 2.5 respectively (Table 1; Fig 1A–1C). Notably, field response at Meraro was higher because the location is known to be one of the hot spot sites for Pst infestation, and hence usually used as a stripe rust test site. However, considering the reaction of the susceptible genotypes, the average SEV (88.5%), RES (1.0), CI (87.5) in 2015 was still lower than it was in 2016 where SEV = 98.0%, RES = 1.0 and CI = 98.0. This might be attributed to a shortage of moisture in 2015. It is known that establishment of Pst infection in the field is highly dependent on available moisture and cool night temperatures which ultimately affects disease development. Ninety-one (31.1%) of the panel consistently appeared as resistant across all the four environments (S3 Table) which is explained by the broad-spectrum resistance of the accessions for the Pst race composition at the environments. Overall, the reaction data was unevenly distributed for the SEV, RES & CI at CHD-15 and KUL-15 environments which were excluded from all downstream analyses to avoid misleading generalization while it was relatively okay for the CHD-16, KUL-16, MER-15 & MER-16. Variance component due to blocking nested within replication was not significant indicating variation due to nesting or incomplete blocking was negligible. On the other hand, genotype by environment interactions have significantly varied at combined level which obviously is due to variation in disease pressure among the environments.
Disease reaction data (SEV, RES & CI) have shown relatively considerable correlations within location between years for Meraro although not that high to be significant. This is in agreement with the lower disease pressure in 2o15 as compared to 2016. Overall correlation of SEV, RES and CI among the four environments is also low which is another confirmation for the varying level of disease prevalence among the environments with a little bit of similarity for Merao test location (Table 3; Fig 1A–1C). On the other hand, correlations for Inter-disease reaction data combinations (SEV vs RES; SEV vs CI; RES vs CI) among environments were very high, ranging from 0.91–0.96 for SEV vs RES; 0.96–0.98 for SEV vs CI and 0.91–0.95 for RES vs CI. This provided the basis for performing the GWAS analysis on any of the three disease reaction data although CI is preferred as it is a combined representation of SEV and RES.
Population structure, relatedness and LD
The population structure analysis plot clustered the panel into two distinct groups where members of group I are mainly (36/39) improved durum cultivars while that of group II contains all the landraces besides three cultivars. This is in agreement with the structure harvester output that suggests K = 2 is the most likely grouping value of the panel. The presence of the two groups in the population structure and further subgrouping in the Kinship has not shown any significant correlation with the pattern of the phenotypic values (resistance reaction) among the panel. Consequently, the data led to identification of a true and acceptable marker-trait association for the resistance as opposed to the discovery of false positive association.
Analysis of the resistance loci identified on chromosomes
It looks that the significant genotypic variance among the panel was reflected in the identification of significant marker-trait associations at various level. Similar sets of SNP association were identified both at single and combined environment data analyses with few exceptions suggesting existence of main wide spectrum vs minor environment specific effectiveness of the identified resistance loci. Obviously, the consistence occurrence of the six significantly associated SNPs at all levels of analyses probably has to do with a wide spectrum effectiveness of the resistance gene/genes underlying the loci as well. On the other hand, the five significant SNPs identified as location specific (2 at CHD and 3 at KUL) associations could mean the presence of site specific Pst race accompanied by race specific genes relative to the other sites.
Our data suggests that chromosome 1B is an important contributor of loci significantly associated with the resistance as seven of the eleven identified SNPs are located on it. Particularly in all single environment and BLUE_all GWAS, all six associated SNPs were from this chromosome. Several genes associated with yellow rust resistance in wheat have been reported on chromosome 1B from multiple GWAS studies. This highlights its usefulness and why efforts to further define these loci are warranted. So far, 83 Pst resistance genes have been designated [31] and many QTLs identified in wheat have been reported on chromosome 1B [70]. Yr10 [71], Yr9 [72], YrAlp [31], Yr15 [73], YrH52 [74], Yr64, Yr65 and Yr24/Yr26 [75], YrExp1 [31], Yr29/Lr46 [76] are some of the known YR genes identified on chromosome 1B and derived from wild relatives and cultivars. Interestingly, the recently cloned Yr15 gene is in position 547Mb in CS on chromosome 1B and the top markers identified in the current study are in position 325–336 Mb suggesting that the identified loci are different from Yr15. One additional SNP was identified on chromosome 1A while the other two came from chromosome 5A and another one from chromosome 2B. Several yellow rust QTLs and Yr genes are mapped on these chromosomes including Yr5 on chromosome 2B [77] which could be amongst the few that are effective against Ethiopian Pst races [44, 78].
The genes that are found in the proximity of the significant loci (ABC Transporter gene family, RPM1, Receptor kinases and Protein kinases) shades light on the functional association of the identified regions in plant defences against pathogens. The ABC transporter gene family is among the known ones involved in the plant defense system conferring durable resistance to multiple fungal pathogens in wheat [69]. They usually act as a transporter for different molecules across biological membranes and are involved in a diverse range of biological processes [79]. RPM1 is an NBS-LRR protein from Arabidopsis thaliana that confers resistance to Pseudomonas syringae expressing either avrRpm1 or avrB [80]. It is also characterized as a peripheral membrane protein that likely resides on the cytoplasmic face of the plasma membrane [80]. Receptor kinases also called Receptor-like kinases (RLks) are among the pattern recognitions receptors that play an important role in plant immunity apart from growth and development [81]. During plant-pathogen interaction, they function as a part of a multiprotein complexes at the cell surface which detect microb-and host-derived molecular patterns as the first layer of inducible defence [82]. Likewise, Protein kinases plays a pivotal role in the activation of plant defence mechanisms through signalling during pathogen recognition at the start of the host-pathogen interaction [83]. In general, the fact that these defence related genes are located in the vicinity of the identified loci suggests future attentions towards functional studies.
To the best of our information, this is the first GWAS study in Ethiopia on Ethiopian durum wheat across the three test locations for identification of marker trait association (MTAs) for Pst resistance. However, a similar study conducted on Ethiopian durum wheat in the USA led to the identification of 12 loci associated with resistance to Pst on seven chromosomes of which chromosome 1B is one of them besides chromosomes 1A, 2BS, 3BL, 4AL, 4B and 5AL [17]. On the other hand, Zegeye [84] carried out a GWAS on synthetic hexaploid wheat at Meraro and Arsi Robe and reported a total of 38 SNPs on 18 genomic regions associated with adult plant resistance. Some of these reported genomic regions are also identified on chromosome 1B besides 1A, 2B and 5A which is in agreement with the current study. So, similarities of this genomic regions in response to Pst resistance across various similar studies signifies the potential usefulness of the genomic regions for wheat resistance improvement.
Conclusion
This study identified 11 SNPs which significantly associated with resistance to Pst at adult plant stage and defined at least five genomic Loci (1 on 1A, 2 on 1B, 1 on 2B and 1 on 5A) across the study panel. Six of the SNPs (AX-95171339, AX-94436448, AX-95238778, AX-95096041, AX-94730403 & AX-94427201) defined the first regions on chromosome 1B. These SNPs, as they were consistently identified at each environment and combined data the region can be considered as the major genomic locus which might contain combinations of genes conferring resistance to Pst across all tested environments. On the other hand, SNP AX-94648330 on 1B, a haplotype of two closely linked SNPs (AX-94438404 & AX-94460229) on 5A, AX-94482796 on 1A and AX-94856684 on 2B identified only at KUL or CHD defined four additions loci. The fact that these SNPs are identified either only at CHD or KUL but not at MER may indicate the presence of resistance gene/genes effective to location specific Pst races. This calls for a separate consideration in future breeding strategies for durable Pst resistance enhancement in wheat which should take into account race-specificity. Disease resistance related and other genes such ABC transporter, RPM1 and the Receptor kinases and Protein kinases have been found in the vicinity of these resistance associated loci. These genes can be targeted in any functional study aiming to reveal the actual genes underlying the associated loci. The study also identified effective sources of resistance to Ethiopian Pst races in Ethiopian durum wheat landraces that can be used, alongside the markers identified here, to transfer the loci into adapted cultivars to provide resistance against Pst. However, the diagnostic value of the identified SNPs needs to be further investigated and validated in an independent germplasm including phenotypic response test to the pathogen. In general, the identified SNPs/resistance loci, coupled with the identification of multi-environment stable genotypes for resistance, will enhances the fight towards mitigation of Pst as it presents a double layer challenge both to the wide spectrum and site specific virulent Pst races.
Supporting information
S1 Table. List of catalogued Yr genes derived from various sources.
https://doi.org/10.1371/journal.pone.0243675.s001
(CSV)
S2 Table. Ethiopian durum wheat landraces and cultivars used for the GWAS.
https://doi.org/10.1371/journal.pone.0243675.s002
(CSV)
S3 Table. Genotypes consistently resistant across the four environments.
https://doi.org/10.1371/journal.pone.0243675.s003
(CSV)
S4 Table. Association analysis result for all single and combined environment data.
https://doi.org/10.1371/journal.pone.0243675.s004
(CSV)
Acknowledgments
The authors are grateful to Kulumsa (KARC) and Debre Zeit Agricultural Research Centre (DZARC) for kindly provision of experimental plot & cultivars; Ethiopian Biodiversity Institute (EBI) and EOSA (Ethio-Organic Seed Action) for their kind provision of the landrace accessions. Our gratitude also goes to wheat breeders, pathologists and technical assistances at Kulumsa and Debre Zeit research centers for their assistance in field work. We are so grateful to all members of the Uauy group; Brande Wulf, Sanu Arora, Clare Lewis and Luzie Wingen at the JIC for their technical support and encouraging guidance during laboratory work.
References
- 1. Kilian B, Özkan H, Deusch O, Effgen S, Brandolini A, Kohl J, et al. Independent wheat B and G genome origins in outcrossing Aegilops progenitor haplotypes. Mol Biol Evol. 2007; 24(1): 217–227. pmid:17053048
- 2.
Chris G, World Durum Outlook. Available online: http://www.internationalpasta.org/ resources/ IPO%20BOARD%202013/2%20Chris%20Gillen.pdf.
- 3. Vavilov NI. The origin, variation, immunity and breeding of cultivated plants. Soil Sci. 1951; 72:482.
- 4.
Vavilov NI, Dorofeev VF. Origin and Geography of Cultivated Plants. Cambridge: Cambridge University Press. 1992.
- 5. Harlain JR. Ethiopia: a centre of diversity. Econ Bot. 1969; 23: 309–314.
- 6. Harlain JR. Agricultural origins: centres and non-centres. Science. 1971; 174: 468–473. pmid:17745730
- 7. Belay G, Tesemma T, BECKER HC, MERKER A. Variation and interrelationships of agronomic traits in Ethiopian tetraploid wheat landraces. Euphytica. 1993; 71: 181–188.
- 8. Tesemma T, Becker HC, Belay G, Mitiku D, Bechere E, Tsegaye S. Performance of Ethiopian tetraploid wheat landraces at their collection sites. Euphytica. 1993; 71: 181–188.
- 9. Tesemma T, Belay G, Worede M. Morphological diversity in tetraploid wheat landrace populations from the central highlands of Ethiopia. Hereditas. 1991; 114: 171–176.
- 10.
Evan School Policy Analysis and Research (EPAR). Wheat Value Chain: Ethiopia. Available online:https://evans.uw.edu/sites/default/files/EPAR_UW_204_Wheat_Ethiopia_07272012.pdf (accessed on24 May 2020).
- 11. Hammer K, Teklu Y. Plant genetic resources: selected issues from genetic erosion to. J. Agric. Rural Dev. Trop. Subtrop. 2008; 109, 15–50.
- 12.
Tessema T, Belay G. Aspects of Ethiopian tetraploid wheats with emphasis on durum wheat genetics and breeding. P. 47–71. In: Gebre-Mariam H, Tanner DG, Hulluka M. (eds.) Wheat Research in Ethiopia: A Historical Perispective. 1991; Addis Ababa, Ethiopia, IAR/CIMMYT.
- 13. Bechere E, Belay G, Mitiku D, Merker A. Phenotypic diversity of tetraploid wheat landraces from northern and north-central regions of Ethiopia. Hereditas. 1996; 124: 165–172.
- 14. Mengistu DK, Kidane YG, Fadda C, Pe ME. Genetic diversity in Ethiopian durum wheat (Triticum turgidum var durum) inferred from phenotypic variations. Plant Genet. Resour. Charact. 2018; 16: 39–49.
- 15. Mengistu DK, Kidane YG, Catellani M, Frascaroli E, Fadda C, Pe ME, et al. High-density molecular characterization and association mapping in Ethiopian durum wheat landraces reveals high diversity and potential for wheat breeding. Plant Biotechnol. J. 2016; 14: 1800–1812. pmid:26853077
- 16. Kidane YG, Hailemariam BN, Mengistu DK, Fadda C, Pe ME, Dell’Acqua M. Genome-wide association study of Septoria tritici blotch resistance in Ethiopian durum Wheat Landraces. Front. Plant Sci. 2017a; 8:1586.
- 17. Liu W, Maccaferri M, Rynearson S, Letta T, Zegeye H, Tuberosa R, et al. Novel Sources of Stripe Rust Resistance Identified by Genome-Wide Association Mapping in Ethiopian Durum Wheat (Triticum turgidum ssp. durum). Front. Plant Sci. 2017; 8:774. pmid:28553306
- 18. Zhang W, Chen S, Abate Z, Nirmala J, Rouse MN, Dubcovsky J. Identification and characterization of Sr13, a tetraploid wheat gene that confers resistance to the Ug99 stem rust race group. Proc. Natl Acad. Sci. 2017; USA, 114: E9483–E9492. pmid:29078294
- 19. Workneh A, Geremew B, Fekedu L. Physicochemical and rheological properties of six Ethiopian durum wheat (Triticum turgidum) varieties grown at Debre Zeit suitable for pasta making. J. Food Sci. Tech. 2008; 45(3): 237–241.
- 20.
Global durum wheat use trending upward: https://www.world-grain.com/articles/8777-global-durum-wheat-use-trending-upward (accessed on May 31, 2020).
- 21. Tidiane SA, Chiari T, Legesse W, Seid-Ahmed K, Ortiz R, van Ginkel M, et al. Durum Wheat (Triticum durum Desf.): Origin, Cultivation and Potential Expansion in Sub-Saharan Africa. Agronomy. 2019; 9: 263.
- 22. Tolera A, Tsegaye B, Berg T. Effects of variety, cropping year, location and fertilizer application on nutritive value of durum wheat straw. J. Anim. Physiol. Anim. Nutr. 2007; 92: 121–130.
- 23. Liu M, Hambleton S. Taxonomic study of stripe rust, Puccinia striiformis sensu lato, based on molecular and morphological evidence. Fungal Biol. 2010; 114: 881–99. pmid:20943198
- 24. Badebo A, Bekele E, Bekele B, Hunde B, Degefu M, Tekalign A, et al. Review of two decades of research on diseases of small cereal crops in Ethiopia. Proceedings of the 14th Annual conference of the Plant protection society of Ethiopia (PPSE). 2008.
- 25.
Solh M, Nazari K, Tadesse W, Wellings CR. The growing threat of stripe rust worldwide Borlaug Global Rust Initiative (BGRI) conference. 2012; Beijing, China.
- 26. Dembel W. Epidemics of Pucciniastriiformis F. Sp. Tritici in Arsi and West Arsi Zones of Ethiopia in 2010 and Identification of Effective Resistance Genes. Journal of Natural Sciences Research. 2014; 4(7): 2224–3186.
- 27.
Roelfs AP, Singh RP, Saar EE Rust Diseases of Wheat: Concepts and Methods of Disease Management; CIMMYT: Mexico, D.F., Mexico,1992; pp. 1–81.
- 28. Chen XM. Epidemiology and control of stripe rust [Puccinia striiformis f. sp. tritici] on wheat. Can. J. Plant Pathol. 2005; 27:314–337.
- 29. Chen XM. High-temperature adult-plant resistance, key for sustainable control of stripe rust. Am. J. Plant Sci. 2013; 4:608–627.
- 30. Atilaw A, Bishaw Z, Eticha F, Gelalcha S, Tadesse Z, Abdalla O., et al. Controlling wheat rusts and ensuring food security through deployment of resistant varieties in Ethiopia. Proceedings of the 2nd International Wheat Stripe Rust Symposium. 2014.
- 31. McIntosh RA, Dubcovsky J, Rogers WJ, Morris C, Appels R, Xia XC. Catalogue of gene symbols for wheat. 2020; http://www.shigen.nig.ac.jp/wheat/komugi/genes/symbolClassList.jsp. (Accessed on June 2020).
- 32.
Wang MN, Chen XM. Stripe rust resistance. In: Chen XM, Kang ZS(eds) Stripe rust. Springer, Dordrecht. 2017; 353–558.
- 33.
McIntosh RA, Wellings CR, Park RF. Wheat rusts: an atlas of resistance genes. Dordrecht: Kluwer Academic Publishers. 1995.
- 34. Aktar-Uz-Zaman M, Tuhina-Khatun Mst, Hanafi MM, Sahebi M. Genetic analysis of rust resistance genes in global wheat cultivars: an overview. 2017. Biotechnology & Biotechnological Equipment. 31(3): 431–445.
- 35. Zheng W, Li S, Liu Z, Zhou Q, Feng Y, Chai S. Molecular marker assisted gene stacking for disease resistance and quality genes in the dwarf mutant of an elite common wheat cultivar Xiaoyan22. BMC Genet. 2020; 21: 45. pmid:32326905
- 36. Yaniv E, Raats D, Ronin Y, Korol AB, Grama A, Bariana H, et al. Evaluation of marker-assisted selection for the stripe rust resistance gene Yr15, introgressed from wild emmer wheat. Mol Breed. 2015; 35(1):43. pmid:27818611
- 37. Randhawa MS, Bains NS, Sohu VS, Chhuneja P, Trethowan RM, Bariana HS, et al. Marker Assisted Transfer of Stripe Rust and Stem Rust Resistance Genes into Four Wheat Cultivars. Agronomy. 2019; 9(9):497.
- 38.
MOA, Plant Variety Release, Protection and Seed Quality Control Directorate Addis Ababa, Ethiopa. 2013. Crop Variety Register. No. 16, 19–23.
- 39. Tesemma T, Bechere E. Developing elite durum wheat landrace selections (composites) for Ethiopian peasant farm use: Raising productivity while keeping diversity alive. Euphytica. 1998; 102 (3): 323–328.
- 40. Rani R, Singh R, Yadav NR. Evaluating stripe rust resistance in Indian wheat genotypes and breeding lines using molecular markers. Comptes Rendus Biologies. 2019; 342(5): 154–174. pmid:31239197
- 41. Huang X, Han B. Natural variations and genome-wide association studies in crop plants. Annu Rev Plant Biol. 2014; 65(1): 531–551. pmid:24274033
- 42.
Porceddu E, Perrino P, Olita G. “Preliminary information on an Ethiopian wheat germplasm collection mission,” in Genetics and Breeding of Durum Wheat, ed. Mugnozza G. T. S. (Bari: University of Bari Press). 1975; 181–200.
- 43. Alemu SK, Badebo A, Tesfaye K, Uauy C. Identification of Stripe Rust Resistance in Ethiopian Durum Wheat by Phenotypic Screening and Kompetitive Allele Specific PCR (KASP) SNP Markers. Plant Pathol Microbiol. 2019; 10:483.
- 44. Winfield MO, Allen AM, Burridge AJ, Barker GL, Benbow HR, Wilkinson PA, et al. High-density SNP genotyping array for hexaploid wheat and its secondary and tertiary gene pool. Plant Biotechnol J. 2016;14(5):1195–206. pmid:26466852
- 45. Cavanagh CR, Chao S, Wang S, Huang BE, Stephen S, Kiani S, et al. Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars. Proc. Natl. Acad. Sci. U.S.A. 2013; 110: 8057–8062. pmid:23630259
- 46. Wang S, Wong D, Forrest K, Allen A, Chao S, Huang BE, et al. Characterization of polyploid wheat genomic diversity using a high-density 90,000 single nucleotide polymorphism array. Plant Biotech. J. 2014; 12: 787–796. pmid:24646323
- 47. Peterson RF, Campbell AG, Hannah AE. A diagrammatic scale for estimating rust intensity on leaves and stems of cereals. Can. J. Res., Sec. C. 1948; 26:496–500.
- 48.
Stubbs RW, Prescot JM, Saari EE, Dubi HJ. Cereal Disease Methodology Manual. Centro International de Mejoramiento de Maizey Trigo (CIMMYT), Mexico. 1986; Pages: 46
- 49.
Line RF, Qayoum A. Virulence, Aggressiveness, Evolution and Distribution of Races of Puccinia striiformis (the Cause of Stripe Rust of Wheat) in North America. 1992; 1968–87. Washington, DC: United State Department of Agriculture, Agricultural Research Service.
- 50. Shapiro SS, Wilk MB. "An analysis of variance test for normality (complete samples)". Biometrika.1965; 52 (3–4):591–611.
- 51. Pallotta MA, Warner P, Fox RL, Kuchel H, Jefferies SJ, Langridge P. Marker assisted wheat breeding in the southern region of Australia. Proceedings of the Tenth International Wheat Genetics Symposium 2003.
- 52. International Wheat Genome Sequencing Consortium (IWGSC), Science 361, eaar7191. 2018. pmid:30115783
- 53.
Liu BH. Statistical genomics: linkage, mapping and QTL analysis. CRC Press. 1998; Boca Raton.
- 54. Guo X, Elston RC. Linkage informative content of polymorphic genetic markers. Hum Hered 1999; 49:112–118. pmid:10077733
- 55. Pritchard JK, Stephens M, Donnelly P. Inference of Population Structure Using Multilocus Genotype Data. Genetics, 2000;15: 945–959. pmid:10835412
- 56. Earl DA, von Holdt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 2012; 4: 359–361.
- 57. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 2005; 14: 2611–2620. pmid:15969739
- 58. Alexander EL, Feng T, Qishan W, Jason P, Meng L, Peter JB, et al. GAPIT: genome association and prediction integrated tool. Bioinformatics. 2012; 28 (18): 2397–2399. pmid:22796960
- 59. Hill WG, Weir BS. Variances and covariances of squared linkage disequilibria in finite populations. Theor Popul Biol. 1988; 33:54–78. pmid:3376052
- 60. Ardlie KG, Kruglyak L, Seielstad M. Patterns of linkage disequilibrium in the human genome. Nat. Rev. Genet. 2002; 3: 299–309. pmid:11967554
- 61. Shifman S, Kuypers J, Kokoris M, Yakir B, Darvasi A. Linkage disequilibrium patterns of the human genome across populations. Hum. Mol. Genet. 2003;12: 771–776. pmid:12651872
- 62. Khatkar MS, Nicholas FW, Collins AR, Zenger KR, Al Cavanagh J, Wes B. et al. Extent of genome-wide linkage disequilibrium in Australian Holstein-Friesian cattle based on a high-density SNP panel. BMC Genomics. 2008; 9: 187. pmid:18435834
- 63. Lawrence R, Day-Williams AG, Mott R, Broxholme J, Cardon LR, Eleftheria Z. GLIDERS—A web-based search engine for genome-wide linkage disequilibrium between HapMap SNPs. BMC Bioinformatics. 2009;10: 367. pmid:19878600
- 64. Remington DL, Thornsberry JM, Matsuoka Y, Wilson LM, Whitt SR, Doebley J. Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc Natl Acad Sci USA. 2001; 98:11479–11484. pmid:11562485
- 65. Zhang ZW, Ersoz E, Lai CQ, Todhunter RJ, Tiwari HK, Michael AG, et al. Mixed linear model approach adapted for genome-wide association studies. Nat. Genet. 2010; 42: 355–360. pmid:20208535
- 66. VanRaden PM. Efficient methods to compute genomic predictions. J. Dairy Sci. 2008; 91: 4414–4423. pmid:18946147
- 67. Price A, Patterson N, Plenge R, Weinblatt M, Shadick N, David Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nature Genetics. 2006; 38(8): 904–909. pmid:16862161
- 68. Storey JD. A direct approach to false discovery rates. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 2002; 64: 479–498.
- 69. Krattinger SG, Lagudah ES, Spielmeyer W, Singh RP, Huerta-Espino J, McFadden H, et al. A putative ABC transporter confers durable resistance to multiple fungal pathogens in wheat. Science. 2009; 323: 1360–1363. pmid:19229000
- 70. Maccaferri M, Zhang J, Bulli P, Abate Z, Chao S, Cantu D, et al. A genome-wide association study of resistance to stripe rust (Puccinia striiformis f. sp. tritici) in a worldwide collection of hexaploid spring wheat (Triticum aestivum L.). G3. 2015b; 5: 449–465.
- 71. Ma J, Zhou R, Dong Y, Wang L, Wang X, Jia J, et al. Molecular mapping and detection of the yellow rust resistance gene Yr26 in wheat transferred from Triticum turgidum L. using microsatellite markers. Euphytica. 2001; 120: 219–226.
- 72. Lukaszewski AJ. Manipulation of the 1RS.1BL translocation in wheat by induced homoeologous recombination. Crop Sci. 2000; 40: 216–225.
- 73. Klymiuk V, Yaniv E, Huang L, Raats D Fatiukha A, Chen S, et al. Cloning of the wheat Yr15 resistance gene sheds light on the plant tandem kinase-pseudokinase family. Nat Commun. 2018; 9: 3735 pmid:30282993
- 74. Klymiuk V, Fatiukha A, Raats D, Bocharova V, Huang L, Feng L, et al. Three previously characterized resistances to yellow rust are encoded by a single locus Wtk1, Journal of Experimental Botany. 2020; 71(9): 2561–2572. pmid:31942623
- 75. Cheng P, Xu LS, Wang MN, See DR, Chen XM. Molecular mapping of genes Yr64 and Yr65 for stripe rust resistance in hexaploid derivatives of durum wheat accessions PI 331260 and PI 480016. Theor. Appl. Genet. 2014; 127:2267–2277. pmid:25142874
- 76. Lan C, Rosewarne GM, Singh RP, Herrera‐Foessel SA, Huerta‐Espino J, Basnet BR, et al. QTL characterization of resistance to leaf rust and stripe rust in the spring wheat line Francolin#1. Mol. Breed. 2014; 34: 789–803.
- 77. Marchal C, Zhang J, Zhang P, Fenwick P, Steuernael B, Adamski NM, et al. BED-domain-containing immune receptors confer diverse resistance spectra to yellow rust. Nature Plants. 2018; 4, 662–668. pmid:30150615
- 78. Naruoka Y, Ando K, Bulli P, Muleta KT, Rynearson S, Pumphrey MO. Identification and validation of SNP markers linked to the stripe rust resistance gene Yr5 in wheat. Crop Sci. 2016; 56:3055–3065.
- 79. Lopez-Ortiz C, Dutta SK, Natarajan P, Peña-Garcia Y, Abburi V, Saminathan T, et al. (2019) Genome-wide identification and gene expression pattern of ABC transporter gene family in Capsicum spp. PLoS ONE 14(4): e0215901. pmid:31039176
- 80. Boyes DC, Nam J, Dangl JL. The Arabidopsis thaliana RPM1 disease resistance gene product is a peripheral plasma membrane protein that is degraded coincident with the hypersensitive response. Proc Natl Acad Sci U S A. 1998 Dec 22;95(26):15849–54. pmid:9861059
- 81. Tang D, Wang G, Zhou J. Receptor Kinases in Plant-Pathogen Interactions: More Than Pattern Recognition The Plant Cell.2017; 29(4): 618–637. pmid:28302675
- 82. Jones JD, Dangl JL. The plant immune system. Nature. 2006. 444: 323–329. pmid:17108957
- 83. Romeis T. Protein kinases in the plant defence response. Current Opinion in Plant Biology. 2001; 4(5): 407–414 pmid:11597498
- 84. Zegeye H, Rasheed A, Makdis F, Badebo A, Ogbonnaya FC. Genome-Wide Association Mapping for Seedling and Adult Plant Resistance to Stripe Rust in Synthetic Hexaploid Wheat. PLoS ONE. 2014; 9(8): e105593. pmid:25153126