Figures
Abstract
Barley is the fifth most important food crop in Ethiopia. The genetic relationship and population structure studies of barley are limited to gene bank collections. Therefore, this study fills a gap by investigating the selection, consumption, economic value, genetic diversity, and population structure of farm-collected barley from the Gumer district of the Gurage Zone, which has received little attention. The information on the use of barley in the study area was collected using semi-structured interviews and questionnaires. 124 households of 11 kebeles, the smallest community unit, were interviewed. Barley landraces collected were compared with those collected from Japan, the United States (USA), and other Ethiopian locations. Illumina iSelect (50K genotyping platform) was used to identify single nucleotide polymorphisms (SNP) (20,367). Thirty landraces were found in Gumer. Burdaenadenber had the highest on-farm Shannon index estimate (2.0), followed by Aselecha (1.97) and Enjefo (1.95). Aselecha and Fetazer had the highest (44%) and the lowest (29%) richness values, respectively. High and low Simpson index values were found in Aselecha (84%) and Wulbaragenateretero (79%), respectively. The neighbor-joining tree revealed that Gumer landraces formed a separate subcluster with a common ancestral node; a sister subcluster contained barley landraces from Japan. According to the population structure analysis, barley landraces from Gumer differed from Japan and the United States. The principal component analysis revealed that US barley was the most distant group from Gumer barley. The markers’ allele frequencies ranged from 0.10 to 0.50, with an average value of 0.28. The mean values of Nei’s gene diversity (0.38) and the polymorphic information content (0.30) indicated the presence of high genetic diversity in the samples. The clustering of accessions was not based on geographic origin. Significant genetic diversity calls for additional research and analysis of local barley diversity because the selection and use of barley in Ethiopia would have been affected by the preference of ethnic groups.
Citation: Degu HD, Tehelku TF, Kalousova M, Sato K (2023) Genetic diversity and population structure of barley landraces from Southern Ethiopia’s Gumer district: Utilization for breeding and conservation. PLoS ONE 18(1): e0279737. https://doi.org/10.1371/journal.pone.0279737
Editor: Dragan Perovic, Julius Kuhn-Institut, GERMANY
Received: August 17, 2021; Accepted: December 14, 2022; Published: January 5, 2023
Copyright: © 2023 Degu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data is contained within the paper and its Supporting information files.
Funding: This project is supported by the grants from the Ohara-foundation, an International Joint Research Program implemented at the Institute of Plant Science and Resources at Okayama University, Japan.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Barley (Hordeum vulgare ssp. Vulgare) was domesticated from its wild ancestor H. vulgare ssp. Spontanum [1]. It is the fifth most commonly cultivated crop in Ethiopia [2]. Archeological records indicate that barely appeared in the eighth millennium BC [3]. Currently, it is grown in the highlands of Ethiopia, mainly to produce human food (the grain), animal feed (the straw), and malt for brewing.
The introduction of modern cultivars started in the 20th century in Ethiopia [4]. However, the Ethiopian barley cultivation system still uses several relatively unique landraces [5–9]. Most Ethiopian landraces cluster separately from Asian, European, and North African landraces in a phylogenetic analysis of their genomes [10]. This separation indicates that selection for certain traits affected allele frequency between Ethiopian and non-Ethiopian landraces, leading to linkage disequilibrium. The diversity within these populations is thus indicative of different and genetically diverse landraces, with Ethiopian landraces locally adapted to the traditional farming system and landscape conditions [11].
Despite many collections of barley landraces in the Ethiopian gene bank, the samples lack adequate information. These landraces are sources of germplasm diversity, which would allow future selection and diversification. Geographic information can unravel the domestication process and aid in developing cultivars [10, 12, 13]. Furthermore, it would shed light on the evolutionary relationships of barley around the world [14]. Likewise, barley collection from specific landscapes in Ethiopia could reveal landraces with similar phenotypic characteristics with different genetic backgrounds. These samples could form the basis of an alternative strategy to introduce traits into modern cultivars [15–17].
Understanding and characterizing the cultivated local landraces by researching previously collected samples can help reveal how farmers select specific traits to adapt crops to their local environment [10]. This can provide information on the selection patterns used in different environments. And can be used to improve the adaptability of the barley genome.
Recent studies suggest that barley landraces from multiple locations differentiated into genetically defined populations [11]. Although some studies have identified barley landraces with unique genome structures, it is not being researched extensively. For example, European barley is grouped into three subpopulations based on the sequence of the photoperiod response gene (Ppd-H1) [18–20]. Six different haplotypes were found in Asian barley populations, identified by the β-amylase gene (Bmy1) at positions 115, 165, 233, 347, and 430, which allows adaptation and selection based on the preference for climate, and food [21–23]. Furthermore, the population structure of barley in different geographic locations indicates multiple selection pressures based on uses and growth environments [1, 6, 14–16].
Single nucleotide polymorphism (SNP) of Illumina iSelect genotyping platforms used to examine barley landraces from the Gumer district, Ethiopia. The objectives of this study were 1) to determine the on-farm diversity of barley in the Gumer district. 2) to determine the genetic diversity and population structure of barley landraces from the Gumer district using SNP markers on the nuclear genome. 3) to compare the pattern of genetic diversity from the United States, Japan, and other Ethiopian barley accessions.
Materials and methods
Research area
The Gumer highlands in the Gurage mountain range (Fig 1) rise out of the Ethiopian Rift Valley to elevations above 4,000 masl and form a chain roughly 100 km long (6° 49’ N 39° 47’ E). Major plants cultivated in the region include faba beans, peas, Ensete spp., potatoes, kale, garlic, and barley; with the latter being the major cash crop for farmers in this area. The annual rainfall pattern is bimodal, and mean annual temperatures range from 10°C to 21°C [19]. The study area map was developed in ArcGIS software using geographic coordinates data from a GARMIN 60 GPS.
The study area is ethnically and linguistically homogeneous, representing the Gurage people and language. The ethnic group has several clans, which are all active in managing society. There is an accessible asphalt road, but most farmers in the study area have little or no road access. Small-scale subsistence agriculture dominates the landscape. The smallest unit of the community is called a ‘kebele’ and comprises 100–500 farmers.
Eleven kebeles representing the highlands of Gumer were selected for interviews and collection of their barley landraces. The kebeles were selected based on their potential barley cultivation of landraces from the Gumer highlands. Eleven to 12 farmers were randomly selected from each kebeles for a total of 124 farmers (Table 1).
Consideration of ethics and consent to participation
There is no rule for obtaining ethical approval for plant-related research projects in Ethiopia. To collect barley seed samples and interview farmers, a letter of support was obtained from the Hawassa University, Research and Technology Transfer Vice President Office, which was addressed to the Gumer district in the Gurage Zone, and a copy was provided to the administration of each of the eleven kebeles. The vice president of Hawassa University’s Research and Technology Transfer Office granted permission to interview and collect barley seeds from the farmer (S1 Text). Consent was obtained from the research participants who participated in the interview. The objective of the study was communicated to the participants. Participants were also assured that the information they provided would be kept confidential and would only be used for this study. Participants were asked about their gender, marital status, the size of their land, the number of children, education level, access to the market, access to improved seeds, the number of landraces on their farms, and names of landraces on their farms. Participants were also informed that they could withdraw from the study at any time during the data collection process if they felt the need to do so.
Barley landrace collection and genotyping
A cross-sectional research design focused on selected kebeles within a district was identified after following a rapid preliminary informal survey and discussions with the zonal administration. Data were collected from farmers who cultivate barley landraces. Key information and personal observations were obtained through questionnaires, household interviews, and focus group discussions.
A questionnaire was used to collect information from a wide range of sources (respondents) on indigenous knowledge and practices involved in barley landraces and utilization in the study areas. The questionnaire was written in English and translated into the local language “Guragigna". The head of the family was chosen based on the preliminary survey and documents obtained from district agricultural offices.
Furthermore, all required age groups and sexes, including the heads of household of elder women, were purposefully included to ensure adequate coverage of the requirements. The mentioned landraces, with their names, use-value, and seeds, were collected during interviews. Each of the 124 barley sample communities was sampled during the main barley growing season in 2020. The total land planted with each crop was recorded for each farm. The SNP diversity was compared between barley from Gumer and collections from Japan, the US, and other locations in Ethiopia (S2 Table) that were chosen from a global barley diversity panel [24].
Individual barley samples (Table 1 and S1 Table) from each selected farm were characterized according to barley phenotypic characteristics. Seeds were collected from only one barley spike to ensure uniform barley genotype only one barley spike. Five individual seeds were allowed to germinate in Petri dishes for 4 days. The emerging primary leaf was collected for DNA extraction using a KCl extraction solution [25] at the Plant Cell Laboratory at Hawassa University, Ethiopia. The extracted DNA was preserved with 5 μl 95% ethanol and shipped to the Institute of Plant Science and Resources at Okayama University, Japan, for genotyping.
SNP genotyping was performed with the Illumina iSelect 50K barley SNP array [26]. Before sample genotyping, DNA quality was evaluated using a Nanodrop 2000 (Thermo Fisher Scientific, Waltham, MA, US) with the criterion for the 260/280 and 260/230 ratios to be more than 1.8. A Qubit (Thermo Fisher Scientific, Waltham, MA, USA) was used to quantify DNA, and 100 ng DNA per sample was used for scanning by the Illumina HiScan (Illumina, San Diego, CA, US). SNP alleles were identified using the Genome Studio Genotyping Module v2.0.2 (Illumina, San Diego, CA, US). SNPs with a minor allele frequency (MAF) of less than 5% and markers with more than 2 alleles were removed from further analysis. After quality checks, markers were assigned unique genomic locations, and their respective distances (in bp) were taken from the closest marker in the Morex reference genome. The reported end coordinates for each marker were recorded [27]. A total of 44,044 SNPs were detected. All monomorphic using default parameters, these markers were aligned to the Barley RefSeq v1.0 genome assembly [26] using default parameters. These were reduced to 20,368 biallelic loci after removing SNPs with more than 10% missing (S2 Table).
Statistical analysis
To understand the diversity among the 11 kebeles, the Shannon-Wiener index, Pielou’s index, Pielou’s evenness, Simpson’s index, and abundance analysis were calculated using the vegan package (version 2.5–7) [28] with the following formulas:
Where H is the Shannon index, H’ is the Shannon-Wiener index, J’ is the Pielou index, pi is the proportion of barley landraces, S is the number of barley landraces and λ is Simpson’s index.
Genetic diversity was assessed for the Gumer collection and compared on the unbiased expected heterozygosity (He) from Ethiopian, Japanese, and USA populations chosen from the world barley panel. Genetic differentiation between populations was estimated with F statistics using the snpReady package (version 0.9.0) [29]. The snpReady is used for the preparation and quality control of datasets, the estimation of relationship matrices, and the estimation of basic population genetics parameters such as allele frequencies, minor allele frequency (MAF), observed heterozygosity (Ho), inbreeding coefficient (He), the effective population size, additive variance due to allele frequencies, variance in dominance due to allele frequencies, Nei’s genetic diversity, polymorphic information content (PIC), and Hardy Weinberg equilibrium (HW equilibrium) statistic. Linkage disequilibrium was calculated using the popper package (version 2.9.0) [30].
The spatial genetic structure of landraces was studied with principal component analysis (PCA) performed with the adegenet package (version 1.3–1) [31] using R statistical software (version 4.1.0) [32]. The population structure was also investigated using Structure 2.3.3 software [33, 34] with the admixture model and 25,000 burn-in iterations, followed by 50,000 Markov chain Monte Carlo (MCMC) iterations. The number of clusters (K) to describe the data was determined from the H value calculated by CLUMPP [35] and the ΔK value [34]. The results of 10 independent runs of Structure for each K value merged using CLUMPP, and the graphical output was obtained using the ggtree (version 2.1.3) [36] and the ggplot2 packages (version 3.3.3) [37].
Results and discussion
Farmers’ descriptions and identification of barley landraces
The common names assigned to genetic resources based on indigenous knowledge can be indicators of genetic distinctiveness (Table 1). Names are usually assigned based on distinctive characteristic features, specific or special end-use qualities, or other locally important attributes. As a result, evaluating and documenting such data are crucial parts of supporting conservation and future uses. We identified 30 different barley landraces in Ethiopia’s Gumer district (Table 1 and S1 Table). The landraces identified by the farmers have a local name that describes their characteristic values [38–42].
Seed color, spike length, number of rows, stress tolerance, yield, end-use characteristics, and other significant agronomic features distinguish the landraces. Some landraces are still being cultivated, with varying coverage in various locations. In contrast, Keleme is severely marginalized and on the point of extinction, while Shemeya has already vanished, as indicated by its presence only in one family. Similar findings have emerged from northern and Central Ethiopia, suggesting the rapid extinction of the major historical landrace.
The inter-landrace diversity or landrace richness (R), calculated using the Menhinick index, is summarized in Table 2. All indices indicated an exceptional landrace richness in all kebeles studied (Table 2). The farmers in the region grew their landraces on their farms (Tables 2 and 3), although the amount of on-farm diversity differed across kebeles. Farmers in the Aselecha and Wusho kebeles grew more than three landraces of barley; the Aselecha kebele had the highest average number of barley on-farm (3.58), followed by the Wusho kebele (3.27). All diversity indices measured were not significantly different across the sampled kebeles (P> 0.05, Table 2). The implication is that all kebeles adopt comparable barley landraces. The relative abundances of barley samples showed that communities tended to evolve towards a similar structure of barley landraces, where 2 to 3 types of barley landraces are planted in their farming fields every year.
We also found that the diversity estimate (Pilou evenness (J)) was high, except for the Aselecha kebele, which scored 0.36 (Table 2). The cultivation of different landraces and, therefore, the high abundance of the majority of the landraces identified in the kebeles led to the highest unevenness in the districts (the lowest admixture structure). Our research suggests that this study area has the potential for barley breeding and conservation.
Farmers in the study districts intentionally preserve landraces to meet their needs (Table 3). These requirements include, but were not limited to, early or late planting (i.e., maturity), yield potential with the type of environment intended to be grown, soil conditions (i.e., waterlogged, fertility, or frost effects), and intended dishes and beverages (includes quantitative and qualitative aspects such as product volume, taste, visual appeal, color, and storage ability). Thirty landraces with varying maturity, yield potential, stress tolerance, end-use characteristics, and other agronomic traits were grown. Table 3 summarizes the user preference of some of the landraces cultivated by farmers, which provides insight into the diversity of landrace types and their management as described by farmers. The landraces’ names and descriptions reflect key quantitative, qualitative, and end-use characteristics, planting time, or origins. Farmers listed the following landraces as the most common: Chifeyeawedo, Shemeya, Nech-senef, Awedo, Sheh abdo, Chelko-tikur, Jimua tikur, Nech-temezhe, Keleme, Tikur-senef, Jimua nech, and Wesabo. The landraces are six and two-rowed. Hulled and hulless; black, and white.
Gender plays a role in selecting which landrace to plant. Women favor Keleme for its sweet taste, easy dehulling, and medicinal value. But men do not prefer it due to its susceptibility to drought, disease, and low market value (Table 3). All farmers select Awedo for its white color, suitability for roof thatching, and market value (Table 3).
Farmers cultivate a diverse range of landraces, each with its maturity, yield potential, stress tolerance, end-use quality, and other agronomic characteristics [43]. This discovery is similar to how other Ethiopian farmers preserved many barley landraces for future use [44]. Rice landraces in the Himalayas are preserved for different purposes [45]. Similarly, farmers in Peru grow various types of potatoes to resist blight [46]. This custom is also similar to the culture of maize in Mexico [47], where colored maize is highly preferred. Similar practices exist in the case of tetraploid wheat in central Ethiopia for its aroma and taste preference [48]. A comparable trend exists in Tibet, where landraces are preserved for cultural value [49].
Fifteen land races planted in the study areas are identified, named, and described by Gumer farmers. Since SNP markers differentiated the landraces, vernacular names and farmer descriptions of landraces could be linked to formal scientific classifications. The landraces have been identified as a distinct type using the scientific taxonomic classification system using morphological traits [50].
The selection and decision-making process was based on sets of criteria. This value has an impact on the successful management of the landraces for their cultural and environmental needs. Complexity is expressed in their description of landraces for adaptation, stress tolerance, and yield (Table 3). Selection criteria often reflect changes in farming conditions and responses to socioeconomic and cultural factors that influence farmers’ priorities [51]. Factors that influence barley production and diversity are end-use, biotic, and abiotic tolerance of the landraces.
Genetic diversity of barley
In the present Gumer barley panel, we identified 20,367 replicable SNP calls of the 44,040 SNP probes available on the array (S2 Table). Of these, 40716, 32216, 40292, and 40631 alleles were found among barley collections from Ethiopia, Gumer, Japan, and the USA, respectively. The numbers of polymorphic SNP markers examined were 91, 872, 4,897, 4,134, 3,573, 5,190, 3,141, and 3,819 on chromosomes 0H, 1H, 2H, 3H, 4H, 5H, 6H, 7H, respectively. The S3 Table shows the observed number of heterozygotes (Ho), the expected number of heterozygotes under panmixia (He), and the coefficient of inbreeding (Fis) for each of the 20,367 SNPs. The ratio of the expected minimum, maximum, and observed expected number of heterozygotes was 0.08, 0.51, and 0.30, respectively. The expected and observed heterozygosity among the SNPs was 0.30 and 0.03, respectively. Positive multilocus estimates of FIS (0.9) have been shown. The value for the Bartlett test of homogeneity of variances between observed and expected heterozygosity showed Bartlett’s K-squared = 2157.4, df = 1, p-value < 2.2e-16 indicating the presence of a significant difference between expected and observed heterozygosity.
We compared the Gumer barley SNP markers to Japanese, USA, and other Ethiopian landraces. The PIC value range was 0.17 to 0.38, with an average value of 0.30. Nei’s gene diversity score ranged from 0.15 to 0.50, with a mean of 0.38. The MAF ranges from 0.10 to 0.50 with an average value of 0.28, indicating the presence of a great diversity among the barley landraces (Table 4).
The PIC values were 0.25, 0.30, 0.30, and 0.30 for Gumer, Ethiopia, Japan, and the US, respectively. The average gene diversity (GD) was 0.31, 0.37, 0.37, and 0.39 for Gumer, Ethiopia, Japan, and the United States, respectively (Table 4, S3 Table). Similarly, there was a difference in the average MAF of the four geographic locations, which was 0.21 (GUM), 0.27(ETH), 0.28 (Japan), and 0.29 (USA). These results showed that there was high genetic variation among the 182 barley landraces. GD (0.39) was the highest for barley from the USA compared to Gumer (Table 4). In contrast, landraces from the experimental area had the lowest PIC and GD scores. The average Ho in the 30 landraces of Gumer farmers (0.15) was significantly higher than in the collections of Japan (0.03), the United States (0.02), and Ethiopia (0.03). Finally, the inbreeding coefficient (FIS, 0.51) in Gumer barley was significantly lower than in Ethiopia (0.92), Japan (0.91), and the USA (0.94) populations.
The genetically effective size of populations was determined using SNP data (Table 4). The Japan collection showed the lowest Ne value (15.9), while the USA collection showed the highest Ne value (36.5). The Gumer collection showed the greatest ratio of Ne/No. of genotypes (29.5/30 = 0.98) between collections. The result indicates that inbreeding and genetic drift among the Gumer landraces were higher than the samples from other populations. The result explains the farming practices of Gumer, in which local landraces are grown with few improved varieties. This high self-reflection rate causes considerable distortion from Hardy-Weinberg equilibrium (HW), which shows only 249 loci in HW equilibrium (S3 Table). We also observed that 30, 54, 29, and 69 individuals from Gumer, Ethiopia, Japan, and the United States had P = 0.017, 0.01, 0.01, and 0.01 for the average association index (r d) = 0.256, 0.05, 0.07, and 0.02, respectively (Table 4). There is strong evidence of linkage disequilibrium between loci based on the association index, IA (P = 0.02), which is consistent with a population of self-pollinating and inbreeding. The other populations (Ethiopia, Japan, and the USA) clustered into two and more than two with a little too high admixture. The hypothesis of no marker linkage was rejected (P < 0.001), indicating a population of self-pollinating and inbreed in Gumer.
Population structures of landraces
We determined the most likely population structure for the landraces using two methods. The first technique was a genetic structure-based clustering strategy, which was inferred using the second-order rate of change of the likelihood K (Fig 2B and S4 Table). We found a distinct peak at K = 2, and the delta K value was 1,200, indicating that the populations in the study were best classified into two groups. The second method was based on discriminant principal components (DAPC) analysis). This method should show an elbow at a specific K value, where the Bayesian information criterion (BIC) would be the lowest [52]. This pattern was observed in our data. BIC values decreased up to K = 6 and increased from K = 7 to K = 14 (Fig 2A). This situation suggested that fewer than six clusters could be ideal. As a result of the BIC inferred clustering result, we grouped the landraces into six clusters (P1–P6). The six populations contained 51, 29, 28, 68, 33, and 3 landraces, respectively (Fig 2).
The colors indicate the geographic origins of the barley samples. A. Bayesian information criterion (BIC) for a range of K values (lower BIC indicates higher model support). B. Estimation number of populations using the delta K derived from LnP (D) from 1 to 6 using SNP data. C. Assignment plots where each vertical bar represents an individual. The y-axis is posterior membership probability, and the x-axis is individuals within their population.
We found that Bayesian model-based stratification and PCA clustering techniques did not group barley collections based on their geographic origin. For example, a variety of Gumer (GM12) clustered along the USA. A higher number of polymorphic SNPs were found in the United States and Japan landraces compared to those in Gumer.
A PCA analysis revealed that location on a continental scale is not the most predictor of genetic structure. The geographic origin of the landraces did not confirm the clustering in the first components (PC I), which together explain 26.7% of the variance. PC1 distinguishes Gumer and some Ethiopian barley from the US barley, while PC2 distinguishes the USA and Japan barley from Gumer and some Ethiopian barley (Fig 3).
The color of the dot indicates the geographic origin.
DAPC analysis (Figs 2 and 4) reveals differences between groups as clearly as possible. The lowest BIC value (1,418) was found to correspond to K = 6 (Fig 2A). Hundred PCA axes and all discriminant functions were kept for DAPC analysis. There were six clusters between 182 barley landraces compared in this study on the scatterplot of two main components of DAPC (Figs 2 and 4).
The first two linear discriminants are represented by the axes (LD). The scatterplot of the final DAPC model with sampling locality as a prior, where points are individual genotypes, color-coded by their original sampling locality, and surrounded by a 95% confidence ellipse. The DA and PCA eigenvalues represent the amount of genetic variation captured by the analysis, and the first two discriminant factors are plotted as the x- and y-axis. Clusters 1, 2, 3, and 4 represent Gumer, Ethiopia, Japan, and USA collections.
The clusters were composed of Gumer, Ethiopia, Japan, and USA samples. Populations were divided according to their collection origin; thus, continuous genetic flow between populations is limited or absent (in the case of Gumer). This was confirmed by the unique structure in the Gumer population, shown by the Structure, DAPC, and PCA analysis. Simultaneously, the use of PCA, Structure, and DAPC in clustering the barley landraces from different geographic regions yielded comparable results, indicating a strong genetic uniqueness of Gumer barley compared to USA and Japan barley (Figs 2–4). The study’s use of these methodologies showed a unique structure of Gumer barley attributable to the farming community’s history of barley selection in the region.
Comparison of variation among landrace groups
The results of AMOVA revealed that the variation accounted for 71%, 50%, and 58% of total genetic variability for Ethiopia, Japan, and the United States, respectively (Table 5). By contrast, AMOVA among the four populations was 75%. Genetic divergence was 19% (Gumer), 41% (Ethiopia), 36% (Japan), and 18% (overall selected population).
Based on SNP markers, the overall pairwise FST value was 0.41 (P 0.001). However, the value of FST between Gumer and Ethiopia, Gumer and Japan, Gumer and USA, Ethiopia and USA, and Japan and USA was 0.40, 0.68, 0.74, 0.11, 0.2, and 0.16, respectively (Table 4).
AMOVA showed a clear structure between the collection sites (Gumer vs. Ethiopia) and the geographic origins (Japan, USA) in this study. These findings indicate the need for a specific collection site and use of the value of barley to unravel the different human practices in domesticating barley for different use values. When comparing Gumer vs. Ethiopia, FST was 0.40, and Nm was 0.38. Nm between Japan and USA was high, and genetic differentiation based on FST values was significant between the landraces among the different populations, which confirmed the dynamic genetic variation among the collected landraces [53, 54]. The findings also confirm the presence of significant diversity in Ethiopia and Gumer barley landraces. The results could be best explained by historically effective population sizes [55], while genetic differentiation is influenced by recent changes caused by the use of barley for the local population [56].
The clustering analysis revealed that the SNP data did not classify landraces based on their geographical background, and landraces were jumbled together. For example, cluster I comprised 29 Gumer landraces and 18 landraces from Ethiopia, a landrace from the USA, and Japan. Cluster II comprised 17 landraces from Japan and 10 landraces from the USA, and 4 landraces from Ethiopia. Cluster III included 6 Japanese, 4 American, 1 Gumer, and 18 Ethiopian landraces. Cluster IV contained landraces from the United States, with only six Ethiopian and two Japanese landraces. Cluster V was made up of landraces from the United States, with three from Japan and four from Ethiopia. Cluster VI is composed of 1, and 2 Japanese and Ethiopian landraces (Fig 5).
Tree based on a binary matrix calculated from a dataset of 20,367 SNPs analyzed in 182 barley landraces from four different geographic regions. Clustering was supported by bootstrap analysis with 1000 replicates. Next to strain numbers, different colors represent the places from which the barley landraces were collected. Different circular curves indicate clusters of the landraces.
We discovered that the Ethiopian landraces were closer to the Japanese landraces and far from the Gumer landraces, rather than the two groups being closer to each other, using Fst values between populations (Table 6). The Japan and USA landraces were closer to each other than the Gumer landraces (Table 6), which was also supported by the hierarchical clustering of genetic distance, which revealed that the Gumer landraces form a distinct patch (Fig 5).
Some Ethiopian landraces clustered with Japanese and US landraces, which implies that they share a genetic background (Fig 5). This grouping result is remarkably similar to the previous one. Similar to the findings of a large-scale genotyping study of barley germplasm [57]. Ethiopia does not share borders with Japan, and the USA, thus plant and animal species will not frequently cross those borders, so the clustering of the Ethiopian landrace along with the Japan and USA landrace can only be explained by the presence of germplasm exchange through breeding [57]. This grouping of landraces is also in the STRUCTURE analysis, where the Gumer population has a unique genetic structure. These findings should be validated with a large set of germplasm.
Genetic diversity of landraces in Gumer
Genetic diversity is essential to create a robust food security system capable of responding to repeated biotic and abiotic stresses [58]. In plant breeding, genetic diversity analysis is critical in identifying alleles to develop crops with high yield, resistance to biotic and abiotic stressors, and satisfactory production [53].
Our interviews with the farmers showed that most landraces are used for food. These findings provide insight into the observed patterns in the Gumer barley genetic structure. We found little difference in abundance among the kebeles’ collections, likely due to the exchange of landraces among the members in different Kebele. The community uses barley as a food source, animal feed, and a source of income. As a result, the farmers select landraces with higher yields due to their increased adaptation to local farming conditions. These practices, and the importance of barley to the Gumer communities, shed light on the low diversity and genetic structure of barley in Gumer.
The 50K barley SNP array is designed to capture the most reliable gene-associated SNP markers available in the barley genome (cv. Morex), but it is not sufficient to discover novel loci from Gumer landraces. The fact that we found fewer replicable SNPs on the array than the original Morex allele panel suggests the presence of new alleles. Subsequent research would benefit from hybridization and sequencing techniques to offer a more comprehensive description of the Ethiopian barley genome.
Our findings demonstrated that the barley landraces gathered from Gumer had lower diversity indices than those gathered from other regions in Ethiopia. This suggests that the occurrence of variability between the Ethiopian and Gumer landraces may have been caused by the usage of farmer landraces that are stored and employed for every season of farming. This study’s genetic diversity and structure could be attributable to genetic drift that occurred during the rejuvenation cycles. Several conditions must be met for this to be a persuasive explanation. For instance, the initial samples used to create the early landraces must have been large enough to capture a significant amount of the genetic diversity prevalent in today’s landrace populations in Gumer. In addition, the genetic diversity of in situ landrace populations must not have changed systematically over time. Alternatively, the genetic diversity in Gumer landraces could arise from factors such as adaptation to wider agroecology, natural crossings caused by cultivating different genotypes in a field, and the use of diversified agricultural methods.
We identified two barley subpopulations using a genetic stratification approach based on a Bayesian clustering model. A discriminant analysis using the BIC suggested that fewer than six clusters would be preferable. Both methods showed successful clustering of Gumer barely separate from USA, Japan, and other Ethiopia landraces.
Another group used barley from gene banks and found [59, 60] geographic origin as a primary region or structure. However, our results show different facts where the structure of barley is not based on geographic location. For example, Ethiopian landraces have a structure similar to USA and Japan. And some Ethiopian landraces have the same structure as the Gumer landraces. The presence of multiple groups of Ethiopian landraces could be due to frequent germplasm exchanges between different countries or to the fact that humans manipulated barley genes throughout history, resulting in genetic mixing through breeding.
However, our study found that there was limited admixture between the current barley collection from Gumer and barley from Japan, the USA, and other regions of Ethiopia. Instead, Gumer barley formed a separate structure and cluster, at least from Japan, and USA landraces, demonstrating a unique genetic makeup; therefore, the landraces now represent a unique genome structure of barley.
Conclusion
We collected 30 barley landraces from Gumer in this work and compared them to landraces from Ethiopia, Japan, and the USA, using a high-density 50K barley SNP array. The research assesses the genetic diversity and population structure within landraces. Our results revealed a moderate polymorphism in the investigated Gumer barely. The molecular analysis in the tested landraces from Ethiopia, Japan, and the USA showed a high level of SNP variability. Based on the results of AMOVA, genetic diversity observed within Gumer is less than that found among samples from Ethiopia, Japan, and the USA. The result suggests that local landraces have less genetic differentiation. In addition, based on obtained results, the SNP array can distinguish the Gumer barley from Ethiopia, Japan, and the USA. However, many SNP lines were not recognized from the Gumer genome, indicating that a reference genome for Ethiopian barley germplasm is an alternative method to unravel the domestication process of Ethiopian barley. This study highlights the value of farmer-based agroecology in efforts to quantify diversity in barley. As a result, DNA genotyping of barley collection using a large number of landraces and SNP markers tightly linked to known genes associated with plant adaptation to different environments can help breeding programs based on the deployment of specifically selected DNA markers spread across the entire genome. The creation of sets for these chosen SNP markers of adaptation-related genes and their use in breeding programs could speed up the production of new competitive barley cultivars.
Supporting information
S1 Table. Passport data of the Gumer barley landraces.
https://doi.org/10.1371/journal.pone.0279737.s001
(XLSX)
S2 Table. Highly informative selected single-nucleotide polymorphism markers for genetic diversity analysis, and data for calculating barley diversity among the different Kebeles of Gumer.
https://doi.org/10.1371/journal.pone.0279737.s002
(XLSX)
S3 Table. Genetic diversity in the 182 landraces of barley.
Summary statistics of genetic variation existing in barley based on 20368 SNP among 182 landraces. Ho: heterozygosity within a population; Hs: genetic diversity within a population; Ht: overall gene diversity; Http: corrected Ht; Dst: gene diversity among samples; Dstp: corrected Dst; Fst: fixation index; Fstp: corrected Fst; Fis: inbreeding coefficient per overall loci; Dest: a measure of population differentiation. Significant deviation from Hardy–Weinberg equilibrium at *P < 0.05, **P < 0.01, and ***P < 0.001, respectively.
https://doi.org/10.1371/journal.pone.0279737.s003
(XLSX)
S4 Table. Determining the structure barley landraces using a structure harvester.
The results are based on allele frequencies for 20368 SNP markers in 182 barley landraces.
https://doi.org/10.1371/journal.pone.0279737.s004
(DOCX)
S1 Text. Support letter confirming the authorized institute that permits barley collection and farmer interviews.
https://doi.org/10.1371/journal.pone.0279737.s005
(PDF)
Acknowledgments
The barley seeds were provided by the National Bio-Resource Project, Japan. We thank Mrs. Nami Yamaji for her help with the technical procedure of SNP generation.
References
- 1. Sato K. History and future perspectives of barley genomics. DNA Res. 2020;27. pmid:32979265
- 2. Abebe TD, Bauer AM, Léon J. Morphological diversity of Ethiopian barley (Hordeum vulgare L.) about geographic regions and altitudes. Hereditas. 2010;147: 154–164. pmid:20887601
- 3. Nixon-Darcus L, D’Andrea AC. Necessary for Life: Studies of Ancient and Modern Grinding Stones in Highland Ethiopia. African Archaeol Rev. 2017;34: 193–223.
- 4.
Tigerstedt PMA, Bjørnstad AS, Demissie A, Demissie A, Kilian A, Kleinhofs A, et al. Communicated by The distinctness and diversity of Ethiopian barleys. Springer-Verlag; 1997.
- 5. Tadesse D, Asres T. On-farm diversity of barley landraces in North Western Ethiopia. Int J Biodivers Conserv. 2019;11.
- 6. Seid E, Eleni S, Faris H. Evaluation of genetic diversity in barley (Hordeum vulgare L.) from Wollo high land areas using agro-morphological traits and hordein. Afr J Biotechnol. 2015;14.
- 7. Girma M. Genetic erosion of barley in North Shewa Zone of Oromiya Region. Ethiop Int J Biodivers Conserv. 2014;6.
- 8. Eticha F, Woldeyesus S, Heinrich G. On-farm Diversity and Characterization of Barley (Hordeum vulgare L.) Landraces in the Highlands of West Shewa, Ethiopia. Ethnobotany Res Appl. 2010;8.
- 9. Assamere A, Mamuye A, Amare K, Fiseha M. Diversity, distribution, agronomic and post-harvest management of local barley (Hordeum vulgare L.) variety in South Wollo, Ethiopia. PLoS One. 2021;16: e0250899. pmid:33956836
- 10. Milner SG, Jost M, Taketa S, Mazón ER, Himmelbach A, Oppermann M, et al. Genebank genomics highlights the diversity of a global barley collection. Nat Genet. 2019;51: 319–326. pmid:30420647
- 11. Samberg LH, Fishman L, Allendorf FW. Population genetic structure in a social landscape: barley in a traditional Ethiopian agricultural system. Evol Appl. 2013;6: 1133–1145. pmid:24478796
- 12. Purugganan MD, Fuller DQ. The nature of selection during plant domestication. Nature. 2009;457: 843–848. pmid:19212403
- 13. Mohammadi SA, Abdollahi Sisi N, Sadeghzadeh B. The influence of breeding history, origin, and growth type on the population structure of baisey as revealed by SSR markers. Sci Rep. 2020;10: 1–13. pmid:33154389
- 14. Poets AM, Fang Z, Clegg MT, Morrell PL. Barley landraces are characterized by geographically heterogeneous genomic origins. Genome Biol. 2015;16: 173. pmid:26293830
- 15. Casañas F, Simó J, Casals J, Prohens J. Toward an evolved concept of landrace. Front Plant Sci. 2017;8. pmid:28228769
- 16. Bailey-Serres J, Parker JE, Ainsworth EA, Oldroyd GED, Schroeder JI. Genetic strategies for improving crop yields. Nature. 2019. pp. 109–118. pmid:31695205
- 17. Glenn KC, Alsop B, Bell E, Goley M, Jenkinson J, Liu B, et al. Bringing new plant varieties to market: Plant breeding and selection practices advance beneficial characteristics while minimizing unintended changes. Crop Science. Crop Science Society of America; 2017. pp. 2906–2921.
- 18. Aslan S, Forsberg NEG, Hagenblad J, Leino MW. Molecular genotyping of historical barley landraces reveals novel candidate regions for local adaption. Crop Sci. 2015;55: 2766–2776.
- 19. Ibrahim A, Harrison M, Meinke H, Fan Y, Johnson P, Zhou M. A regulator of early flowering in barley (Hordeum vulgare L.). PLoS One. 2018;13. pmid:30016338
- 20. Abdullaev RA, Alpatieva N V, Zveinek IA, Batasheva BA, Anisimova IN, Radchenko EE, et al. Diversity of Dagestan Barleys for the Duration of the Period between Shooting and Earing Stages and Alleles in the Ppd-H1 and Ppd-H2 Loci. Springer. 2017;43: 3–7.
- 21. Malysheva-Otto L V., Röder MS. Haplotype diversity in the endosperm specific β-amylase gene Bmy1 of cultivated barley (Hordeum vulgare L.). Mol Breed. 2006;18: 143–156.
- 22. Contreras-Moreira B, Serrano-Notivoli R, Mohammed NE, Cantalapiedra CP, Beguería S, Casas AM, et al. Genetic association with high-resolution climate data reveals selection footprints in the genomes of barley landraces across the Iberian Peninsula. Mol Ecol. 2019;28: 1994–2012. pmid:30614595
- 23. Kumar P, Banjarey P, Malik R, Tikle AN, Verma RPSS, Vijayaraje R, et al. Population structure and diversity assessment of barley (Hordeum vulgare L.) introduction from ICARDA. 2020;99.
- 24. Darrier B, Russell J, Milner SG, Hedley PE, Shaw PD, Macaulay M, et al. A Comparison of Mainstream Genotyping Platforms for the Evaluation and Use of Barley Genetic Resources. Front Plant Sci. 2019;0: 544. pmid:31105733
- 25. Tuvesson S, Von Post R, Von Post L, Dayteg C, Nilsson M, Forster BP. A high-throughput DNA extraction method for barley seed. Euphytica. 2003;130: 255–260.
- 26. Bayer MM, Rapazote-Flores P, Ganal M, Hedley PE, Macaulay M, Plieske J, et al. Development and evaluation of a barley 50k iSelect SNP array. Front Plant Sci. 2017;8: 1792. pmid:29089957
- 27. Mascher M, Gundlach H, Himmelbach A, Beier S, Twardziok SO, Wicker T, et al. A chromosome conformation capture ordered sequence of the barley genome. Nature. 2017;544: 427–433. pmid:28447635
- 28.
Wagner JO and FGB and MF and RK and PL and DM and PRM and RBO and GLS and PS and MHHS and ES and H. vegan: Community Ecology Package. R package; 2020. version 2.7–5
- 29. Granato ISC, Galli G, de Oliveira Couto EG, e Souza MB, Mendonça LF, Fritsche-Neto R. snpReady: a tool to assist breeders in genomic analysis. Mol Breed. 2018;38.
- 30. Kamvar ZN, Tabima JF, unwald NJ. Poppr: An R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ. 2014;2014: 1–14. pmid:24688859
- 31. Jombart T, Ahmed I, Jombart T. and Ahmed I. new tools for the analysis of genome-wide SNP data. Bioinformatics. 2011;27: 3070–3071. pmid:21926124
- 32.
R Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2021. https://www.r-project.org/
- 33. Porras-Hurtado L, Ruiz Y, Santos C, Phillips C, Carracedo Á, Lareu M V. An overview of STRUCTURE: Applications, parameter settings, and supporting software. Front Genet. 2013;4. pmid:23755071
- 34. Evanno G, Regnaut S, ecology JG-M, 2005 undefined. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14: 2611–2620. pmid:15969739
- 35. Jakobsson M, Rosenberg NA. CLUMP: a cluster matching and permutation program for dealing with label switching and multimodality in the analysis of population structure. Bioinformatics. 2007;23: 1801–1806. pmid:17485429
- 36. Yu G, Smith DK, Zhu H, Guan Y, Lam TTY. ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol. 2017;8: 28–36.
- 37. Wickham H. ggplot2. Wiley Interdiscip Rev Comput Stat. 2011;3: 180–185.
- 38. Tadesse D, Derso B. The status and constraints of food barley production in the North Gondar highlands, North Western Ethiopia. Agric Food Secure. 2019;8.
- 39. Daniel T, Tazebachew A. On-farm diversity of barley landraces in North Western Ethiopia. Int J Biodivers Conserv. 2019;11: 1–7.
- 40. Gadissa F, Abebe M, Worku B. Assessment on the Current State of On-Farm Diversity and Genetic Erosion in Barley (Hordeum vulgare L.) Landraces from Bale Highlands, Southeast. Ethiopia. Biomed Res Int. 2021; 23: 6677363. pmid:33681372
- 41. Dessie AB, Abate TM, Mekie TM, Liyew YM. Crop diversification analysis on red pepper dominated smallholder farming system: evidence from northwest Ethiopia. Ecol Process. 2019;8.
- 42. Tadesse D, Mekuria W, Asres T, Derso B. On-farm Characterization of Barley (Hordeum vulgare L.) Landraces in the Highlands of North Gondar, Ethiopia. Int J Sci Res Manag. 2018;6.
- 43. Sansaloni C., Franco J., Santos B. et al. Diversity analysis of 80,000 wheat accessions reveals consequences and opportunities of selection footprints. Nat Commun. 2020;11: 4572. pmid:32917907
- 44. Hailemichael S, Spade P. Ethnobotany, diverse food uses, claimed health benefits and implications on conservation of barley landraces in North Eastern Ethiopia highlands. J Ethnobiol Ethnomed. 2011;7.
- 45. Chettri D., Anandan A., Nagireddy R., Sabarinathan S., & Sathyanarayana N. Genetic diversity in rice (Oryza sativa L.) landraces of Sikkim-Himalaya and early insight into their use in genome-wide association analysis. Plant Genetic Resource.2021,19(4), 347–356.
- 46. Zevallos E, Inga J, Alvarez F, Marmolejo K, Paitan R, Viza I, et al. First signs of late blight resistance in traditional native potatoes of Pasco—Peru, a preliminary assay. Agric Food Secure. 2021;10:33.
- 47. Garduño G, and C P-J of D, 2020 undefined. The on-farm diversity of maize cultivars and landraces in the Lacandon of Chiapas, Mexico. academicjournals.org. 12: 113–121.
- 48. Negisho K, Shibru S, Pillen K, Ordon F, Wehner G. Genetic diversity of Ethiopian durum wheat landraces. PLoS One. 2021;16. pmid:33596260
- 49. Zeng X, Guo Y, Xu Q, Mascher M, Guo G, Li S, et al. Origin and evolution of qingke barley in Tibet. Nat Commun. 2018;9: 1–11. pmid:30575759
- 50.
De Haan S. Potato Diversity and Height: Multiple Dimensions of Farmer-Driven-in situ Conservation in the Andes. 2009; PhD thesis, Wageningen University, Wageningen.
- 51. Liu B, Ma G, Bussmann R, Bai K, Li J, et al. Determining factors for the of hulless barley agroecosystem in the himalaya region—A from Northwest Yunnan, China. E. Global Ecol. Conserv.2019;18: e00600,
- 52. Bauldry S. Structural Equation Modeling. Int Encycl Soc Behav Sci Second Ed. 2015; 615–620.
- 53. Swarup S, Cargill E, Crosby K, … L F-C, 2021 undefined. is indispensable for plant breeding to improve crops. Wiley Online Libr. 2021;61: 839–852.
- 54. Kumar P, Banjarey P, Malik R, Tikle AN, Verma RPS. Population structure and diversity assessment of barley (Hordeum vulgare L.) introduction from ICARDA. J Genet. 2020;99. pmid:32893841
- 55. Corbin LJ, Liu AYH, Bishop SC, Woolliams JA. Estimation of historical effective population size using linkage disequilibria with marker data. J Anim Breed Genet. 2012;129: 257–270. pmid:22775258
- 56. Tehseen M, Istipliler D, Kehel Z, Genes CS-, 2021 undefined. Genetic diversity and population structure analysis of Triticum aestivum L. Landrace panel from Afghanistan. mdpi.com. 2021, 25;12(3):340. pmid:33668962
- 57. Muñoz-Amatriaín M, Cuesta-Marcos A, Endelman JB, Comadran J, Bonman JM, Bockelman HE, et al. The USDA barley core collection: Genetic diversity, population structure, and potential for genome-wide association studies. PLoS One. 2014;9: 94688. pmid:24732668
- 58. Fricano A, Battaglia R, Mica E, Tondelli A, Crosatti C, Guerra D, et al. Genetic Diversity for Barley Adaptation to Stressful Environments. Genomic Des Abiotic Stress Resist Cereal Crop. 2021; 153–191.
- 59. Milner SG, Jost M, Taketa S, Mazón ER, Himmelbach A, Oppermann M, et al. Genebank genomics highlights the diversity of a global barley collection. Nat Genet 2019;52(2): 319–326. pmid:30420647
- 60. Jayakodi M, Padmarasu S, Haberer G, Bonthala VS, Gundlach H, Monat C, et al. The barley pan-genome reveals the hidden legacy of mutation breeding. Nature. 2020;588: 284–289. pmid:33239781