Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genetic structure and isolation by altitude in rice landraces of Yunnan, China revealed by nucleotide and microsatellite marker polymorphisms

  • Di Cui ,

    Contributed equally to this work with: Di Cui, Cuifeng Tang, Jinmei Li

    Affiliations Institute of Crop Science, Chinese Academy of Agricultural Sciences, Key Laboratory of Crop Germplasm Resources and Utilization, The National Key Facility for Crop Gene Resources and Genetic Improvement, Ministry of Agriculture, Beijing, China, Department of Plant Science, Research Institute for Agriculture and Life Science, Seoul National University, Seoul, Republic of Korea

  • Cuifeng Tang ,

    Contributed equally to this work with: Di Cui, Cuifeng Tang, Jinmei Li

    Affiliation Institute of Biotech and Germplasm Resources, Yunnan Academy of Agricultural Sciences, Kunming, Yunnan, China

  • Jinmei Li ,

    Contributed equally to this work with: Di Cui, Cuifeng Tang, Jinmei Li

    Affiliation Institute of Crop Science, Chinese Academy of Agricultural Sciences, Key Laboratory of Crop Germplasm Resources and Utilization, The National Key Facility for Crop Gene Resources and Genetic Improvement, Ministry of Agriculture, Beijing, China

  • Xinxiang A,

    Affiliation Institute of Biotech and Germplasm Resources, Yunnan Academy of Agricultural Sciences, Kunming, Yunnan, China

  • Tengqiong Yu,

    Affiliation Institute of Biotech and Germplasm Resources, Yunnan Academy of Agricultural Sciences, Kunming, Yunnan, China

  • Xiaoding Ma,

    Affiliation Institute of Crop Science, Chinese Academy of Agricultural Sciences, Key Laboratory of Crop Germplasm Resources and Utilization, The National Key Facility for Crop Gene Resources and Genetic Improvement, Ministry of Agriculture, Beijing, China

  • Enlai Zhang,

    Affiliation Institute of Biotech and Germplasm Resources, Yunnan Academy of Agricultural Sciences, Kunming, Yunnan, China

  • Yanjie Wang,

    Affiliation Institute of Crop Science, Chinese Academy of Agricultural Sciences, Key Laboratory of Crop Germplasm Resources and Utilization, The National Key Facility for Crop Gene Resources and Genetic Improvement, Ministry of Agriculture, Beijing, China

  • Guilan Cao,

    Affiliation Institute of Crop Science, Chinese Academy of Agricultural Sciences, Key Laboratory of Crop Germplasm Resources and Utilization, The National Key Facility for Crop Gene Resources and Genetic Improvement, Ministry of Agriculture, Beijing, China

  • Furong Xu,

    Affiliation Institute of Biotech and Germplasm Resources, Yunnan Academy of Agricultural Sciences, Kunming, Yunnan, China

  • Luyuan Dai,

    Affiliation Institute of Biotech and Germplasm Resources, Yunnan Academy of Agricultural Sciences, Kunming, Yunnan, China

  • Longzhi Han ,

    heejkoh@snu.ac.kr (HJK); hanlongzhi@caas.cn (LZH)

    Affiliation Institute of Crop Science, Chinese Academy of Agricultural Sciences, Key Laboratory of Crop Germplasm Resources and Utilization, The National Key Facility for Crop Gene Resources and Genetic Improvement, Ministry of Agriculture, Beijing, China

  • Hee-Jong Koh

    heejkoh@snu.ac.kr (HJK); hanlongzhi@caas.cn (LZH)

    Affiliations Department of Plant Science, Research Institute for Agriculture and Life Science, Seoul National University, Seoul, Republic of Korea, Plant Genomics and Breeding Institute, Seoul National University, Seoul, Republic of Korea

Abstract

Rice landraces, a genetic reservoir for varietal improvement, are developed by farmers through artificial selection during the long-term domestication process. To efficiently conserve, manage, and use such germplasm resources, an understanding of the genetic structure and differentiation of local rice landraces is required. In this study, we analyzed 188 accessions of rice landraces collected from localities across an altitudinal gradient from 425 to 2, 274 m above sea level in Yunnan Province, China using ten target genes and 48 SSR markers. We detected clear differentiation of the rice landraces into indica and japonica groups and further separation of the accessions in each group into two subgroups according to altitude, including a lower altitude subgroup and higher altitude subgroup. The AMOVA results showed significant genetic differentiation among altitude zones at SSRs and most genes, except Os1977 and STS22. We further determined that differentiation among landrace populations followed a model of isolation by altitude, in which gene flow was higher among populations at similar altitude levels than across different altitude levels. Our findings demonstrated that both adaptation to altitude and altitude-dependent gene flow played key roles in the genetic differentiation of rice landraces in Yunnan, China.

Introduction

Crop domestication is a complex process mediated by a series of phenotypic changes to improve cultivation, harvesting, and consumption. Rice (Oryza sativa L.) is one of the earliest domesticated crop species, and the genetic diversity of cultivated rice has been reduced by up to 80% from that of the wild ancestor during the domestication and artificial selection processes [1]. The most extreme loss of diversity is found in modern high-yielding rice varieties, and this has serious consequences for disease susceptibility and adaptation to changing environments [2]. By contrast, rice landraces, which originated and evolved in the field over millennia via selective breeding by farmers, have retained genetic variation [3]. This variation has important implications in rice breeding by providing new genes/alleles for crop improvement. However, with the development of modern agriculture, a large number of local landraces have been replaced with modern varieties introduced over the past 40 years [4]. In China, rice landraces are no longer planted in most provinces, with the exception of some ethnic minority regions, such as areas of Yunnan and Guizhou Provinces.

Selection by various ethnic groups inhabiting areas of different altitudes and climatic conditions and with different cultivation methods, cultures, and traditions has contributed to rice crop diversity in Yunnan. Accordingly, Yunnan is one of the largest centers of genetic diversity for rice worldwide [57]. Rice landraces in Yunnan are widely distributed in a region from 21°8′ 32′′ N to 29°11′ 18′′ N and 97°31′ 39′′ E to 106°11′ 47′′ E, and are planted at various altitudes and under diverse climatic conditions [7]. Landraces grow from altitudes of approximately 76 m in Hekou county, Honghe prefecture in the southeastern part of Yunnan to 2,700 m in Weixi county, Diqing prefecture [7]. The wide distribution of rice landraces provides an excellent opportunity for studies of genetic structure and differentiation patterns of rice landraces along the altitudinal range as well as the role of altitude in shaping population genetic structure. Several studies have examined genetic differentiation and the distribution of rice landraces along altitudinal gradients in Yunnan based on phenotype traits or insertion/deletion (indel) molecular markers [810]. According to a previous study, both indica and japonica rice varieties are cultivated in Yunnan [6], and the distribution of rice landraces can be artificially categorized into three rice cultivation regions as follows: (1) the indica belt at altitudes of below 1,400 m, (2) the mixed indica and japonica belt at altitudes of between 1,400 and 1,600 m, and (3) the japonica belt at altitudes of above 1,600 m [8]. However, little is known about how altitudinal variation affects population genetic structure. Furthermore, it is not clear whether rice landraces in Yunnan exhibit isolation by altitude. These questions need to be explored using DNA data.

Among various molecular marker types, simple sequence repeat (SSR) markers are commonly used to estimate genetic diversity, population structure, and differentiation in numerous plant species [11]. Furthermore, with the development of DNA sequencing technology, multi-locus DNA sequences have been successfully used to estimate genetic diversity and for phylogenetic analyses [1215]. DNA sequence differences directly reflect genetic differences; accordingly, sequence comparison is an ideal method for revealing genetic diversity and differentiation.

We collected diverse rice landraces from Yunnan, China along a wide range of altitudes and analyzed gene sequences and SSR markers. Our objectives were to examine the genetic structure of rice landraces and to assess the role of isolation by altitude in genetic differentiation.

Materials and methods

Ethics statement

Field work and the collection of leaves were approved by Institute of Crop Science, CAAS (Chinese Academy of Agricultural Sciences). In this study, the land accessed is not privately owned or protected, and any protected or endangered species were not sampled.

Sampling and choice of loci

In total, 188 rice landraces were collected from a wide altitudinal range (425–2,274 m above sea level) in Yunnan Province, China (S1 Table). These rice landraces were artificially divided into eight groups according to altitude at intervals of 200 m, except for the lowest (≤ 800 m) and highest (≧2,000) zones (S2 Table). The geographic localities of the rice landraces sampled in this study are shown in S1 Fig. In addition, two varieties, 93–11 and Nipponbare, were used as references for indica and japonica, respectively.

A set of 48 SSRs evenly distributed throughout the rice genome (S3 Table) and ten unlinked nuclear gene loci were used in this study. Five of the genes, CatA, GBSSII, Os1977, STS22, and STS90, which have been used to estimate nucleotide diversity in rice populations in previous studies, were used [16,17]. Five additional genes, Ehd1, S5, Pid3, GS3, and GS5, which are associated with agronomic traits in rice, were also used [1822]. Schematic diagrams of all ten genes are shown in S2 Fig. Detailed information about the genomic location and putative functions of the genes as well as the primer sequences for amplification can be found in S4 Table.

DNA extraction, SSR genotyping, and gene sequencing

Total genomic DNA was extracted from fresh seedling leaves using a modified CTAB procedure [23]. A total of 48 SSRs were amplified by polymerase chain reaction (PCR) with fluorescently labeled primers in a 10-μL reaction volume containing 20 ng of genomic DNA, 10× PCR reaction buffer, 10 mM Mixture dNTP, 2 μM primers, and 0.5 units of Taq polymerase. The PCR profile was as follows: pre-denaturation at 94°C for 5 min, 36 cycles of denaturation at 94°C for 30 s, annealing at 55–60°C (dependent on primers) for 30 s, and extension at 72°C for 40 s, and a final extension at 72°C for 10 min. PCR products were size separated on a 3730XL DNA Sequencer equipped with GENESCAN software (ABI, Waltham, MA, USA). Fragment size was recorded using Gene Marker V1.6 (SoftGene, State College, PA, USA) and manually re-checked.

For the detection of genes, PCR was performed in a 25-μL volume consisting of 0.2 μM of each primer, 200 μM of each dNTP, 10 mM Tris—HCl (pH 8.3), 50 mM KCl, 1.5 mM MgCl2, 0.5 U of HiFi DNA polymerase (Transgen, Beijing, China), and 10–30 ng of genomic DNA. The PCR profile was as follows: pre-denaturation at 94°C for 5 min, 36 cycles of denaturation at 94°C for 30 s, annealing at 55–60°C (dependent on primers) for 30 s, and extension at 72°C for 1.5 min, and a final extension at 72°C for 10 min. The PCR products were electrophoresed on 1.2% agarose gels, and DNA fragments were cut from the gel and purified using a Tiangen Gel Extraction Kit (Tiangen, Beijing, China). Sequencing reactions were performed using an ABI 3730 Automated Sequencer. Initially, all samples were directly sequenced. However, if haplotypes could not be readily inferred owing to heterozygosity, the PCR product was ligated into an EASY vector (Transgen) and at least four clones were sequenced. For heterozygous individuals, one allele sequence was randomly selected. Because Taq errors occurred, when polymorphisms were only found in one accession, this accession was re-sequenced with the cloning step to verify the polymorphisms.

Population genetic structure

To identify population structure, a Bayesian clustering analysis was conducted using STRUCTURE 2.2 [24,25] based on the 48 SSRs and SNP data, respectively. Fifteen independent runs were performed for each k value (from 1 to 12), using a burn-in length of 100,000, a run length of 100,000, and admixture and correlated allele frequency models. The k value was determined based on LnP(D) in the STRUCTURE output and the ad-hoc statistic Δk [26,27]. A principal component analysis (PCA) was performed using NTSYSpc version 2.11 [28] based on SSR data.

Analysis of DNA sequence and SSR data

DNA sequences were aligned using ClustalX 1.83 [29] and edited using BioEdit 7.0.9.0 [30]. Indels were not included in the analysis. For each locus, the number of segregating sites (S), the number of haplotypes (h), haplotype diversity (Hd), and two nucleotide diversity parameters, mean pairwise differences (θπ) [31] and Watterson’s estimator based on the number of segregating sites (θw) [32], were determined using DnaSP version 5.0 [33]. The minimum number of recombination events (Rm) was estimated using the four-gamete test [34]. The statistical analysis of SSR data, including allele number, genotype number, number of private alleles (i.e., alleles that only appear in certain populations), heterozygosity, gene diversity, and polymorphism information content (PIC), was implemented in PowerMarker version 3.25 [35].

Haplotype network analysis

Haplotype networks were constructed based on mutational steps using Network 4.5 [36]. These networks represent the genetic distances among DNA sequences or alleles and are represented by circles of different sizes and colors and lines linking the circles. Because many haplotypes were obtained for the nuclear loci, only major haplotypes, i.e., those observed in more than three individuals, were selected for network construction.

Genetic differentiation and isolation by altitude

The overall distribution of nucleotide diversity was investigated using an analysis of molecular variance (AMOVA) implemented in Arlequin 3.01 [37]. Sequence variation was hierarchically partitioned between the two subpopulations, among altitude zones within subpopulations, and within altitude zones. The significance of all estimated fixation indices was tested using 10,000 permutations, as described by Excoffier et al. [37]. Pairwise FST, generally expressed as the proportion of genetic diversity explained by allele frequency differences among populations [38], was used to measure differentiation within and between subpopulations, as implemented in Arlequin 3.01 [37]. Isolation by altitude was evaluated by assessing the correlation matrix between pairwise altitude difference and genetic differentiation between altitude zones was assessed using Mantel’s tests implemented in Arlequin 3.01 [37]. A total of 10,000 random permutations were performed.

Results

Population structure and genetic relationship analyses

To infer the population structure of 188 rice landraces, we performed model-based simulations using 48 SSRs and 89 SNPs from ten genes, respectively. We observed an increase in the mean posterior probability LnP(D) as the number of groups k increased, but we detected a sharp peak of Δk at k = 2 (S3A Fig) for rice landraces using 48 SSRs. These results were also supported by structural analysis based on SNP data (S3B Fig). These results suggest that rice landraces could be grouped into two subpopulations, referred to as P1 and P2 (Fig 1A). Based on the reference cultivars, the 110 P1 accessions were indica or indica-like, and the 78 P2 accessions were japonica or japonica-like. This result demonstrated that rice landraces from Yunnan were clearly differentiated into indica and japonica groups. Fig 1B shows the distribution of rice accessions from the two subpopulations in each altitude zone. We found only indica at altitudes below 800 m, but we observed both indica and japonica in all other altitude zones. We detected a significant negative correlation between the proportion of indica rice landraces in various altitude zones and altitude (r = -0.814, P < 0.05), but a significant positive correlation between the proportion of japonica rice landraces in various altitude zones and altitude (r = 0.814, P < 0.05) (S4 Fig). In other words, the proportion of rice landraces classified as indica in each altitude zone in Yunnan decreased from low to high altitudes. By contrast, the proportion of landraces classified as japonica in each altitude zone increased from low to high altitudes. Based on a PCA (Fig 2), all rice landraces were distinctly divided into two main groups, indica and japonica, supporting the population structure revealed by STRUCTURE. In each group, we detected further separation into two subgroups according to altitude, including a lower altitude subgroup and higher altitude subgroup, with a few exceptions in middle altitude zones. The indica group was more highly dispersed than the japonica group, indicating higher diversity based on SSRs.

thumbnail
Fig 1. Model-based ancestries and their distribution in altitude zones.

(A) Model-based ancestry of each accession in P1 and P2; (B) distribution of model-based populations in each altitude zone.

https://doi.org/10.1371/journal.pone.0175731.g001

thumbnail
Fig 2. Principal component analysis of rice landraces from eight altitude zones.

https://doi.org/10.1371/journal.pone.0175731.g002

Nucleotide diversity in model-based populations

We sequenced ten unlinked loci from 188 Yunnan rice landraces covering a wide altitudinal gradient. The aligned sequences for each locus ranged from 420 to 627 bp, with a total length of 4,994 bp, including 1120 bp of coding sequence (S4 Table). We excluded 17 indel polymorphisms, ranging from 1 to 5 bp, from the data analyses. Standard sequence polymorphism statistics at each locus are summarized in Table 1. We observed genetic diversity, as estimated by θw, ranging from 0.0014 (STS22) to 0.0082 (CatA) for rice accessions from the indica subpopulation and from 0.0010 (S5) to 0.0074 (CatA) for accessions from the japonica subpopulation. We did not observe a significant difference between the average nucleotide diversity at silent sites between indica accessions (θπ = 0.0034, θw = 0.0030) and the japonica subpopulation (θπ = 0.0032, θw = 0.0031; P > 0.05 for both θπ and θw). In general, based on SSR markers, we detected more genetic variation (gene diversity and PIC), but less heterozygosity, in indica rice than in japonica rice (S5 Table).

Phylogenetic and geographic analyses of haplotypes

Fig 3 shows haplotype networks constructed based on the major haplotypes for each gene. For CatA, GBSSII, Os1977, STS22, STS90, S5, Pid3, Ehd1, GS3, and GS5, we observed 6 (5 and 3 for indica and japonica accessions), 2 (2 and 2), 3 (1 and 3), 5 (5 and 5), 6 (6 and 6), 4 (3 and 2), 5 (3 and 5), 7 (7 and 4), 5 (5 and 5), and 5 (2 and 4) haplotypes, respectively. In total, we found ten indica-specific haplotypes in CatA, S5, GS5 and Ehd1, and eight japonica-specific haplotypes in CatA, Os1977, Pid3, S5 and GS5. In addition, we detected distinct differences in haplotype frequency between indica and japonica rice landraces. We detected five haplotypes (H_3 of CatA; H_2 of GS3; H_1 of GS5; H_1 of Os1977; H_3 of STS90) that were common in indica rice, with an average frequency of 52.51% (8.16–99.05%), but rare in japonica rice, with an average frequency of 2.82% (1.43–4.05%). We observed six haplotypes (H_6 of Ehd1; H_3 of Pid3; H_2 of Os1977; H_5 of STS22; H_3 of S5; H_5 of STS90) that were rare in indica rice, with an average frequency of 1.55% (0.95–2.78%), but were more frequent in japonica rice, with an average frequency of 49.22% (8.00–94.81%). These results further indicated that there was obvious genetic differentiation between indica and japonica rice landraces at some loci.

thumbnail
Fig 3. Haplotype networks for each gene.

The circle size is proportional to the quantity of samples with a given haplotype, and the numbers next to the circles represent the haplotype number. Lines between haplotypes represent mutational steps between alleles. When more than one nucleotide difference existed between linked haplotypes, the number is indicated next to the lines. Colors for rice landraces collected from different subpopulations are as follows: yellow, P1 (indica) subpopulation; blue, P2 (japonica) subpopulation. The haplotype network for GBSSII, which had only two haplotypes, is not shown.

https://doi.org/10.1371/journal.pone.0175731.g003

We examined the geographic distribution of Ehd1 (Early heading date 1) haplotypes along the altitudinal gradient because this gene had the most polymorphisms. As shown in Fig 4, haplotype H_1 was a major haplotype and was widely distributed across all altitude zones, indicating an ability to adapt to a wide range of conditions in different altitude zones. Haplotype H_6 was a japonica-like haplotype; its frequency was 78.08% in japonica rice, but only 0.97% in indica rice. Interestingly, at altitudes above 1,600 m, we detected an increase in the frequency of haplotype H_6 as the altitude increased. This result suggested that the japonica-like haplotype is more adaptable to high-altitude zones than to low-altitude zones.

thumbnail
Fig 4. The distribution of haplotypes of rice landraces in eight altitude zones.

Phylogenetic relationships among the haplotypes based on a neighbor-joining analysis are indicated below the map. Pie charts show the proportions of haplotypes within each altitude zone. Haplotypes are indicated by different colors.

https://doi.org/10.1371/journal.pone.0175731.g004

Genetic differentiation between and within subpopulations

Hierarchical AMOVA results are presented in Table 2. We found that a significant proportion of the total variance was explained by variance between subpopulations for all gene loci, ranging from 11.33% (STS22) to 85.81% (Os1977). We observed significant genetic differentiation among altitude zones within subpopulations at most loci, except for Os1977 and STS22. The proportion of the total variance that was explained by variance among altitude zones within subpopulations ranged from 1.63% (CatA) to 9.30% (S5). We detected significant genetic differentiation within altitude zones for all ten loci, representing 14.00% (Os1977) to 89.76% (STS22) of the total variance. For all genes, the average genetic variation between subpopulations (46.92%) and within altitude zones (48.98%) was greater than that found among altitude zones within subpopulations (4.10%). These results were further supported by an AMOVA based on SSRs indicating that 61.85% of the total variation was due to variation within altitude zones, 32.53% between subpopulations, and only 5.62% was found among altitude zones within subpopulations.

thumbnail
Table 2. Hierarchical analysis of molecular variance for eight altitude zones in two subpopulations (AMOVA).

https://doi.org/10.1371/journal.pone.0175731.t002

Based on FST, we observed less genetic differentiation among altitude zones within subpopulations (-0.0282–0.1633 for indica rice and -0.0491–0.1178 for japonica rice; Table 3) than between subpopulations (0.0395–0.8559) for the genes. For SSRs, we also observed less genetic differentiation among altitude zones within subpopulations (0.0222–0.2415 for indica rice and 0.0374–0.2419 for japonica rice; Table 3) than between subpopulations (0.3231–0.5335). These results were fairly consistent with the AMOVA results, indicating that there was less genetic differentiation among altitude zones within subpopulations than between subpopulations. The average FST between subpopulations for gene loci (0.4922) was higher than that for SSR loci (0.3893), indicating that genetic differentiation between the indica and japonica subpopulations might be more obvious in gene regions.

thumbnail
Table 3. Pairwise divergence (FST) between altitude zones within subpopulations and between subpopulations.

https://doi.org/10.1371/journal.pone.0175731.t003

Isolation by altitude

The structure analysis suggested that genetic variation is altitude-dependent and specifically that there may be an isolation by altitude. We examined the isolation by altitude using a Mantel’s test implemented in Arlequin 3.01 [37] and observed a significant correlation between genetic differentiation and altitude difference (r = 0.806, P < 0.01) in the entire population based on SSR data (Fig 5A). Notably, we observed the lowest average genetic differentiation (FST) between rice in the middle altitude zone and those in other altitude zones, but we detected higher genetic diversity in middle altitude rice (S6 and S7 Tables). Similarly, we detected significant positive correlations between genetic differentiation and altitude difference both in the indica subpopulation (r = 0.868, P < 0.01, Fig 5B) and japonica subpopulation (r = 0.824, P < 0.01, Fig 5C). Furthermore, we found a significant correlation between genetic differentiation and altitude difference for most gene loci in the entire population, except for STS90 and Pid3, with correlation coefficients ranging from 0.510 to 0.819 (Table 4).

thumbnail
Fig 5. Patterns of isolation by altitude in rice landraces based on SSR markers.

(A) Isolation by altitude in the entire population; (B) isolation by altitude in the indica subpopulation; (C) isolation by altitude in the japonica subpopulation. The correlation coefficients are 0.806, 0.868 and 0.824 for the entire population, indica subpopulation and japonica subpopulation, respectively.

https://doi.org/10.1371/journal.pone.0175731.g005

thumbnail
Table 4. Mantel’s tests for the correlation between genetic differentiation and altitude difference.

https://doi.org/10.1371/journal.pone.0175731.t004

Discussion

Genetic structure and differentiation of rice landraces

Using a model-based structure analysis and a PCA, we found that the rice landraces from Yunnan were clearly differentiated into indica and japonica subpopulations. Within each group, the accessions could be further separated into two subgroups according to altitude, i.e., a lower altitude subgroup and higher altitude subgroup, with the exception of a few accessions from middle altitude zones. These results suggested that altitude is an important determinant of population structure. Genetic differentiation between indica and japonica has been detected based on analyses of morphological traits [39], isozymes [40,41], and DNA markers [42], consistent with our results at the DNA level [7]. Based on the observed distributions across altitudes, indica landraces adapt to a wider range of conditions than japonica landraces, although both types were cultivated across a wide altitude range above 800 m in Yunnan. We verified the relationship between altitude and the distribution of subspecies in a correlation analysis (S4 Fig); indica rice landraces were more frequent at lower altitudes, while japonica rice landraces were more frequent at higher altitudes. The observed patterns of distribution were largely attributable to their adaptations to different ecological conditions, especially temperature; the air temperature in the high-altitude zone was significantly lower than that in the low-altitude zone in Yunnan (S5 Fig).

Genetic diversity and haplotype distribution of rice landraces

Yunnan is a center of diversity for rice in China, and several studies have examined diversity in Yunnan rice landraces. However, to date, such studies focused exclusively on morphological traits or SSR markers [6, 7, 43, 44, 45, 46]. In this study, we used both nuclear gene loci and SSRs to analyze the extent and structure of genetic variation in rice landraces across a wide range of altitudes in Yunnan.

We observed higher levels of nucleotide diversity in both indica and japonica subpopulations than those reported in a previous survey [12] of 30 Asian cultivated rice accessions based on ten genes (θπ = 0.0029, θw = 0.0021 for indica; θπ = 0.0013, θw = 0.0011 for japonica), indicating a high level of nucleotide diversity in rice landraces from Yunnan. Based on SSRs, we detected more genetic variation in indica rice than in japonica rice, which was in agreement with the results of a survey employing 24 SSR markers to examine genetic diversity of 113 rice varieties from Yunnan [47]. By contrast, Zhang et al. found higher genetic diversity in japonica rice landraces than in indica rice landraces of Yunnan based on 20 SSRs [6]. The inconsistent results can probably be explained by differences in sample size, number of molecular markers, marker types, and so on.

In a haplotype analysis, we further found obviously genetic differentiation between indica and japonica rice landraces in some gene regions. Remarkably, for Ehd1 (Fig 4), which promotes short-day flowering and controls FT-like gene expression [18], the frequency of the japonica-like haplotype H_6 increased as the altitude increased above 1,600 m. Indeed, we found that the average days to heading for rice landraces containing haplotype H_6 was less than for landraces containing other haplotypes (S8 Table). These results indicated that early flowering is a critical trait for adaptation to high-altitude zones with short growing seasons and low air temperatures and the japonica-like haplotype is more adapted to high-altitude zones [48].

Isolation by altitude in rice landraces

Rice landraces in Yunnan are planted across a wide range of altitudes [7,8]. A series of environmental factors vary along the altitudinal gradient, and climatic differences could cause genetic divergence among populations [49,50]. High-altitude environmental conditions (e.g., short growing seasons and low temperatures) severely constrain the survival and reproduction of plants and populations; thus, divergence might be due to local adaptation [49,51]. In the present study, we observed a strong association between genetic differentiation and altitude difference at different population levels (Fig 5), indicating an altitude-dependent isolation pattern. Thus, the isolation by altitude was a main factor influencing genetic differentiation among populations in different altitude zones. This raises the question of how isolation by altitude occurred in rice landraces of Yunnan.

The isolation by altitude pattern was consistent with higher rates of gene flow among rice landraces at similar altitudes than along an altitudinal gradient [52]. For inbred cultivated rice, where little to no pollen flow occurs, gene flow must occur by seed movement, and specifically by seed exchange among farmers [2]. Altitude can result in reproductive isolation due to phenological shifts (e.g., flowering period), reducing seed exchange between populations at different altitudes; this may be an important mechanism for creating genetic divergence among populations along the altitudinal gradient [53].

Generally, seeds are exchanged more frequently among farmers within an altitude zone, resulting in high genetic diversity within these zones. Differentiation occurs between altitude zones, reflecting more limited seed exchange. Interestingly, we found that the rice landraces located in the middle altitude zone had the lowest average genetic differentiation (FST) with landraces of other altitude zones, but showed higher genetic diversity. These results suggested that seed exchange among farmers in neighboring altitude zones reduces rice genetic structure. In addition, the social structure of communities at different altitudes, such as differences in language or customs, can also influence seed exchange preferences. For example, Daizu people dwell in the flatlands or semi-mountainous areas under 1,200 m, while Lahuzu people prefer medium or highly mountainous regions [54]. These groups have different cultures and food preferences; the Daizu people enjoy glutinous rice, but Lahuzu people prefer red rice varieties. Hence, seed exchange might be restricted between populations at different altitudes under these circumstances, i.e., language and custom barriers. Overall, our results highlight the influence of isolation by altitude on the pattern of the gene flow and genetic differentiation of rice landraces from mountainous regions.

Supporting information

S1 Fig. Geographic localities of rice landraces sampled in this study (When a figure is similar but not identical to the original image, and is therefore for illustrative purposes only).

The localities of rice landraces are indicated by solid circles. Detailed information of the materials is provided in S1 Table.

https://doi.org/10.1371/journal.pone.0175731.s001

(PDF)

S2 Fig. Schematic diagrams of ten nuclear loci and locations of the sequenced regions.

Exons are shown as open boxes and exon numbers are labeled with capital roman numbers. Thin lines between open boxes indicate introns. Locations of primers for each fragment are shown above the diagrams.

https://doi.org/10.1371/journal.pone.0175731.s002

(PDF)

S3 Fig. The ΔK statistic for each given k using 48 SSRs (A) and 89 SNPs from ten genes (B).

https://doi.org/10.1371/journal.pone.0175731.s003

(PDF)

S4 Fig. Correlation between the proportion of indica rice and altitude (A) and between the proportion of japonica rice and altitude (B).

https://doi.org/10.1371/journal.pone.0175731.s004

(PDF)

S5 Fig. The average of daily maximum (red), average (green) and minimum (blue) air temperatures (°C) at the low (Hekou, 425 m) (A) and high-altitude sites (Xianggelila, 2,274 m) (B) during the period from April to October of 2012–2016.

https://doi.org/10.1371/journal.pone.0175731.s005

(PDF)

S1 Table. List of samples included in the study, including their origin and subpopulation.

https://doi.org/10.1371/journal.pone.0175731.s006

(PDF)

S2 Table. Descriptions of agronomic traits of rice landraces from each altitude zone in Yunnan.

https://doi.org/10.1371/journal.pone.0175731.s007

(PDF)

S3 Table. Summary of SSR markers and primer sequences.

https://doi.org/10.1371/journal.pone.0175731.s008

(PDF)

S4 Table. Summary of sequenced genes and primer sequences used in this study.

https://doi.org/10.1371/journal.pone.0175731.s009

(PDF)

S5 Table. Genetic diversity of model-based populations based on SSRs.

https://doi.org/10.1371/journal.pone.0175731.s010

(PDF)

S6 Table. Genetic diversity of populations in each altitude zone based on SSRs.

https://doi.org/10.1371/journal.pone.0175731.s011

(PDF)

S7 Table. Genetic distances among populations in different altitude zones based on SSRs.

https://doi.org/10.1371/journal.pone.0175731.s012

(PDF)

S8 Table. Days to heading for rice landraces with different haplotypes of Ehd1.

https://doi.org/10.1371/journal.pone.0175731.s013

(PDF)

S1 Dataset. Final assembled DNA sequences of CatA.

https://doi.org/10.1371/journal.pone.0175731.s014

(TXT)

S2 Dataset. Final assembled DNA sequences of GBSSII.

https://doi.org/10.1371/journal.pone.0175731.s015

(TXT)

S3 Dataset. Final assembled DNA sequences of Os1977.

https://doi.org/10.1371/journal.pone.0175731.s016

(TXT)

S4 Dataset. Final assembled DNA sequences of STS22.

https://doi.org/10.1371/journal.pone.0175731.s017

(TXT)

S5 Dataset. Final assembled DNA sequences of STS90.

https://doi.org/10.1371/journal.pone.0175731.s018

(TXT)

S6 Dataset. Final assembled DNA sequences of S5.

https://doi.org/10.1371/journal.pone.0175731.s019

(TXT)

S7 Dataset. Final assembled DNA sequences of Pid3.

https://doi.org/10.1371/journal.pone.0175731.s020

(TXT)

S8 Dataset. Final assembled DNA sequences of Ehd1.

https://doi.org/10.1371/journal.pone.0175731.s021

(TXT)

S9 Dataset. Final assembled DNA sequences of GS3.

https://doi.org/10.1371/journal.pone.0175731.s022

(TXT)

S10 Dataset. Final assembled DNA sequences of GS5.

https://doi.org/10.1371/journal.pone.0175731.s023

(TXT)

S11 Dataset. Detial information of 48 SSR primer alleles.

https://doi.org/10.1371/journal.pone.0175731.s024

(XLSX)

Acknowledgments

We thank the Chinese National Germplasm Bank for providing the landrace rice seeds.

Author Contributions

  1. Conceptualization: HJK LZH DC.
  2. Data curation: DC JML CFT.
  3. Formal analysis: DC CFT.
  4. Funding acquisition: HJK LZH.
  5. Investigation: DC JML XDM.
  6. Methodology: DC CFT.
  7. Project administration: HJK LZH.
  8. Resources: CFT XXA TQY ELZ YJW GLC FRX.
  9. Software: DC CFT.
  10. Supervision: LYD HJK LZH.
  11. Validation: HJK LZH.
  12. Visualization: DC.
  13. Writing – original draft: DC CFT.
  14. Writing – review & editing: HJK LZH.

References

  1. 1. Londo JP, Chiang YC, Hung KH, Chiang TY, Schaal BA. Phylogeography of Asian wild rice, Oryza rufipogon, reveals multiple independent domestications of cultivated rice, Oryza sativa. Proceedings of the National Academy of Sciences. 2006; 103: 9578–9583.
  2. 2. Pusadee T, Jamjod S, Chiang YC, Rerkasem B, Schaal BA. Genetic structure and isolation by distance in a landrace of Thai rice. Proceedings of the National Academy of Sciences. 2009; 106: 13880–13885.
  3. 3. Deb D. Seeds of tradition, seeds of future, folk rice varieties of Eastern India. New Delhi: Research Foundation for Science Technology & Ecology. 2005.
  4. 4. Heal G, Walker B, Levin S, Arrow K, Dasgupta P, Daily G, et al. Genetic diversity and interdependent crop choices in agriculture. Resource and Energy Economics. 2004; 26: 175–184.
  5. 5. Zeng YW, Wang JJ, Yang ZY, Shen SQ, Wu LH, Chen XY, et al. The diversity and sustainable devolopment of crop genetic resources in the Lancang River Valley. Genetic Resourse and Crop Evolution. 2001; 48: 297–306.
  6. 6. Zhang HL, Sun JL, Wang MX, Liao DQ, Zeng YW, Shen SQ, et al. Genetic structure and phylogeography of rice landraces in Yunnan, China, revealed by SSR. Genome. 2006; 50: 72–83.
  7. 7. Zeng YW, Zhang HL, Li ZC, Shen SQ, Sun JL, Wang MX, et al. Evaluation of genetic diversity of rice landraces (Oryza sativa L.) in Yunnan, China. Breeding Science. 2007; 57: 91–99.
  8. 8. Zeng YW, Li ZC, Yang ZY, Shen SQ, Zhang HL, Wang XK. Geographical distribution and cline classification of indica/japonica subspecies of Yunnan local rice resources. Acta Agronomica Sinica. 2001; 1: 16–20.
  9. 9. Xiong ZY, Zhang SJ, Wang YY, Ford-Lloyd BV, Tu M, Jin X, et al. Differentiation and distribution of indica and japonica rice varieties along the altitude gradients in Yunnan Province of China as revealed by InDel molecular markers. Genet Resour Crop Evol. 2010; 57: 891–902.
  10. 10. Xiong ZY, Zhang SJ, Ford-Lloyd BV, Jin X, Wu Y, Yan HX, et al. Latitudinal Distribution and Diff erentiation of Rice Germplasm: Its Implications in Breeding. Crop Sci. 2011; 51: 1050–1058.
  11. 11. Smith JSC, Chin ECL, Shu H, Smith S, Wall SJ, Senior ML, et al. An evaluation of the utility of SSR loci as molecular markers in maize (Zea mays L.): comparisons with data from RFLPS and pedigree. Theor Appl Genet. 1997; 95: 163–173.
  12. 12. Zhu QH, Zheng XM, Luo JC, Gaut BS, Ge S. Multilocus analysis of nucleotide variation of Oryza sativa and its wild relatives: severe bottleneck during domestication of rice. Molecular Biology and Evolution. 2007; 24: 875–888. pmid:17218640
  13. 13. Zheng XM, Ge S. Ecological divergence in the presence of gene flow in two closely related Oryza species (Oryza rufipogon and O. nivara). Molecular Ecology. 2010; 19: 2439–2454. pmid:20653085
  14. 14. Wei X, Qiao WH, Chen YT, Wang RS, Cao LR, Zhang WX, et al. Domestication and geographic origin of Oryza sativa in China: insights from multilocus analysis of nucleotide variation of O. sativa and O. rufipogon. Molecular Ecology. 2012; 21: 5073–5087. pmid:22989375
  15. 15. Cui D, Li JM, Tang CF, A XX, Yu TQ, Ma XD, et al. Diachronic analysis of genetic diversity in rice landraces under on-farm conservation in Yunnan, China. Theor Appl Genet. 2016; 129: 155–168. pmid:26498440
  16. 16. Caicedo AL, Williamson SH, Hernandez RD, Boyko A, Fledel-Alon A, York TL, et al. Genome—wide patterns of nucleotide polymorphism in domesticated rice. PLoS Genetics. 2007; 3: 1745–1756. pmid:17907810
  17. 17. Li ZM, Zheng XM, Ge S. Genetic diversity and domestication history of African rice (Oryza glaberrima) as inferred from multiple gene sequences. Theoretical and Applied Genetics. 2011; 123: 21–31. pmid:21400109
  18. 18. Doi K, Izawa T, Fuse T, Yamanouchi U, Kubo T, Shimatani Z, et al. Ehd1, a B—type response regulator in rice, confers short—day promotion of flowering and controls FT—like gene expression independently of Hd1. Genes and Development. 2004; 18: 926–936. pmid:15078816
  19. 19. Chen JJ, Ding JH, Ouyang YD, Du HY, Yang JY, Cheng K, et al. A triallelic system of S5 is a major regulator of the reproductive barrier and compatibility of indica—japonica hybrids in rice. Proceedings of the National Academy of Sciences. 2008; 105: 11436–11441.
  20. 20. Shang JJ, Tao Y, Chen XW, Zou Y, Lei CL, Wang J, et al. Identification of a New Rice Blast Resistance Gene, Pid3, by Genomewide Comparison of Paired Nucleotide-Binding Site—Leucine-Rich Repeat Genes and Their Pseudogene Alleles Between the Two Sequenced Rice Genomes. Genetics. 2009; 182: 1303–1311. pmid:19506306
  21. 21. Mao HL, Sun SY, Yao JL, Wang CR, Yu SB, Xu CG, et al. Linking differential domain functions of the GS3 protein to natural variation of grain size in rice. Proceedings of the National Academy of Sciences. 2010; 107: 19579–19584.
  22. 22. Li YB, Fan CC, Xing YZ, Jiang YH, Luo LJ, Sun L, et al. Natural variation in GS5 plays an important role in regulating grain size and yield in rice. Nature Genetics. 2011; 43: 1266–1269. pmid:22019783
  23. 23. Doyle JJ, Dickson EE. Preservation of plant samples for DNA restriction endonuclease analysis. Taxon. 1987; 36: 715–722.
  24. 24. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000; 155: 945–959. pmid:10835412
  25. 25. Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics. 2003; 164: 1567–1587. pmid:12930761
  26. 26. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular Ecology. 2005; 14: 2611–2620. pmid:15969739
  27. 27. Earl DA, vonHoldt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conservation Genetics Resources. 2012; 4: 359–361.
  28. 28. Rohlf F. FNTSYS-PC numerical taxonomy and multivariate analysis system ver 2.11L. Applied Biostatistics, Newyork. 2000.
  29. 29. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. The CLUSTALX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research. 1997; 25: 4876–4882. pmid:9396791
  30. 30. Hall TA. Bioedit: a user—friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series. 1999; 41: 95–98.
  31. 31. Nei M. Molecular Evolutionary Genetics. Columbia University Press, New York, New York. 1987.
  32. 32. Watterson GA. On the number of segregating sites in genetical models without recombination. Theoretical population Biology. 1975; 7: 256–276. pmid:1145509
  33. 33. Rozas J. DNA sequence polymorphism analysis using DnaSP. Methods in Molecular Biology. 2009; 537: 337–350. pmid:19378153
  34. 34. Hudson RR, Kaplan NL. Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics. 1985; 111: 147–164. pmid:4029609
  35. 35. Liu KJ, Muse SV. PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics. 2005; 21:2128–2129. pmid:15705655
  36. 36. Bandelt HJ, Forster P, Rohl A. Median—joining networks for inferring intraspecific phylogenies. Molecular Biology and Evolution. 1999; 16: 37–48. pmid:10331250
  37. 37. Excoffier L, Smouse PE, Quattro JM. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics. 1992; 131: 479–491. pmid:1644282
  38. 38. Holsinger KE, Weir BS. Genetics in geographically structured populations: defining, estimating and interpreting F(ST). Nature Reviews Genetics. 2009; 10: 639–650. pmid:19687804
  39. 39. Oka HI. Origin of cultivated rice. Jnp. Sci. Soc. Press. Tokyo. 1988.
  40. 40. Second G. Origin of the genetic diversity of cultivated rice (Oryza spp.): study of polymorphism scored at 40 isoezyme loci. Japanese Journal of Genetics. 1982; 57: 25–57.
  41. 41. Glaszmann JC. Isozymes and classification of Asian rice varieties. Theoretical and Applied Genetics. 1987; 74: 21–30. pmid:24241451
  42. 42. Garris AJ, Tai TH, Coburn J, Kresovich S, McCouch S. Genetic structure and diversity in Oryza sativa L. Genetics. 2005; 169: 1631–1638. pmid:15654106
  43. 43. Xu FR, Dong C, Yang WY, Tang CF, A XX, Zhang EL, et al. Comparison of genetic diversity of rice landraces planted in two periods in Hani’s terraced fields in Yuanyang County, Yunnan province, China using microsatellite markers. Chinese Journal of Rice Science. 2011; 25: 38l–386.
  44. 44. Yan HM, Dong C, Zhang EL, Tang CF, A XX, Yang WY, et al. Analysis of genetic variation in rice paddy landraces across 30 years as revealed by microsatellite DNA markers. Hereditas. 2012; 34: 87–94. pmid:22306877
  45. 45. Xu FR, Zhang EL, Dong C, Dai LY, Zhang HS. Comparison of phenotypic traits of rice landraces, grown in two different periods in Hani’s terraced fields in Yuanyang County, Yunnan. Biodiversity Science. 2010; 18: 365–372
  46. 46. Sun JC, Cao GL, Ma Jing, Chen YF, Han LZ. Comparative genetic structure within single—origin pairs of rice (Oryza sativa L.) landraces from in situ and ex situ conservation programs in Yunnan of China using microsatellite markers. Genetic Resourse and Crop Evolution. 2012; 59: 1611–1623.
  47. 47. Xu J X, Wang Y Y, Yao C, Liu CX, Tang M. Indica-japonica differentiation and genetic variation of upland rice varieties from Yunnan Province, China. Chin Sci Bull (Chin Ver). 2012; 57: 2705–2714.
  48. 48. Chang TT, Oka HI. Genetic variousness in the climatic adaption of rice cultivars. In: Proceedings of the symposium on climate and rice. International Rice Research Institute, Los Banˇos, Philippines; 1976. pp 87–111.
  49. 49. Ohsawa T, Ide Y. Global patterns of genetic variation in plant species along vertical and horizontal gradients on mountains. Global Ecol. Biogeogr. 2008; 17: 152–163.
  50. 50. Shi M, Michalski SG, Chen X, Durka W. Isolation by elevation: genetic structure at neutral and putatively non-neutral loci in a dominant tree of subtropical forests. Castanopsis eyrei. PLoS ONE. 2011; 6: e21302. pmid:21701584
  51. 51. Frei ER, Ghazoul J, Matter P, Heggli M, Pluess AR. Plant population differentiation and climate change: responses of grassland species along an elevational gradient. Global Change Biol. 2014; 20: 441–455.
  52. 52. Byars SG, Parsons Y, Hoffmann AA. Effect of altitude on the genetic structure of an Alpine grass, Poa hiemata. Annals of Botany. 2009; 103: 885–899. pmid:19208670
  53. 53. Hirao AS, Kudo G. Landscape genetics of alpine-snowbed plants: comparisons along geographic and snowmelt gradients. Heredity. 2004; 93: 290–298. pmid:15241452
  54. 54. Gao LZ. The conservation of Chinese rice biodiversity: genetic erosion, ethnobotany and prospects. Genetic Resourse and Crop Evolution. 2003; 50: 17–32.