Correction
30 May 2018: Zhang X, Sun C, Zhang Z, Dai Z, Chen Y, et al. (2018) Correction: Genetic dissection of main and epistatic effects of QTL based on augmented triple test cross design. PLOS ONE 13(5): e0198562. https://doi.org/10.1371/journal.pone.0198562 View correction
Figures
Abstract
The use of heterosis has considerably increased the productivity of many crops; however, the biological mechanism underpinning the technique remains elusive. The North Carolina design III (NCIII) and the triple test cross (TTC) are powerful and popular genetic mating design that can be used to decipher the genetic basis of heterosis. However, when using the NCIII design with the present quantitative trait locus (QTL) mapping method, if epistasis exists, the estimated additive or dominant effects are confounded with epistatic effects. Here, we propose a two-step approach to dissect all genetic effects of QTL and digenic interactions on a whole genome without sacrificing statistical power based on an augmented TTC (aTTC) design. Because the aTTC design has more transformation combinations than do the NCIII and TTC designs, it greatly enriches the QTL mapping for studying heterosis. When the basic population comprises recombinant inbred lines (RIL), we can use the same materials in the NCIII design for aTTC-design QTL mapping with transformation combination Z1, Z2, and Z4 to obtain genetic effect of QTL and digenic interactions. Compared with RIL-based TTC design, RIL-based aTTC design saves time, money, and labor for basic population crossed with F1. Several Monte Carlo simulation studies were carried out to confirm the proposed approach; the present genetic parameters could be identified with high statistical power, precision, and calculation speed, even at small sample size or low heritability. Additionally, two elite rice hybrid datasets for nine agronomic traits were estimated for real data analysis. We dissected the genetic effects and calculated the dominance degree of each QTL and digenic interaction. Real mapping results suggested that the dominance degree in Z2 that mainly characterize heterosis showed overdominance and dominance for QTL and digenic interactions. Dominance and overdominance were the major genetic foundations of heterosis in rice.
Citation: Zhang X, Sun C, Zhang Z, Dai Z, Chen Y, Yuan X, et al. (2017) Genetic dissection of main and epistatic effects of QTL based on augmented triple test cross design. PLoS ONE 12(12): e0189054. https://doi.org/10.1371/journal.pone.0189054
Editor: Jinfa Zhang, New Mexico State University, UNITED STATES
Received: June 9, 2017; Accepted: November 17, 2017; Published: December 14, 2017
Copyright: © 2017 Zhang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by China Postdoctoral Science Foundation (2012M511722), China Postdoctoral Science Foundation funded project (2014T70769), Natural Science Foundation of Hunan Province (2016JJ3070), Scientific Research Fund of Hunan Provincial Education Department (2015118), and State key laboratory of crop germplasm innovation and resource utilization Breeding Base (16KFXM03). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: We submitted a patent to China's State Intellectual Property Office in Chinese (not in English) this June. The patent application is under the accept status now, but not issued (authorized) yet. We only have the following information about this patent: Name: A QTL mapping method of RIL-based argument Triple Test Cross design. Number: 201710346769.2. Employment: Hunan Agricultural University Consultancy: Changsha ZhenQi patent agency limited company. Tel: 86-0731-84580566. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Introduction
Heterosis, or hybrid vigor, describes the superior performance of heterozygous hybrid plants over their homozygous parental inbred lines [1–3]. The development of heterotic crops, especially those for hybrid rice and maize, is one of the most important applications of genetics in agriculture [4–5], but the molecular basis underlying heterosis remains elusive.
Indeed, much of our knowledge regarding heterosis derives from classical genetic studies on maize, during which the fundamental hypotheses for heterosis were defined, with the main competing hypotheses including dominance, overdominance, and epistasis [1,5–6]. The dominance hypothesis explains heterosis by the complementing action of superior dominant alleles from both parental inbred lines at multiple loci over the corresponding unfavorable alleles leading to the improved vigor of hybrid plants [1,5,7–8]. The overdominance hypothesis attributes heterosis to allelic interactions at one or multiple loci in hybrids that result in superior traits compared to the homozygous parental inbred lines [1,9–10]. In addition, the epistasis hypothesis considers epistatic interactions between non-allelic genes at two or more loci as the main factor for the superior phenotypic expression of a trait in hybrids [1,10–12].
To decipher the genetic basis of heterosis, NCIII [13] and TTC [14] are powerful genetic mating designs widely used in maize [12,15–19], rice [12,8,20–26], and Arabidopsis thaliana [27–30]. Rice is the staple food for a large segment of the world’s population. The success of hybrid rice breeding [31], together with its relatively small genome size [32], saturated molecular linkage maps [33], and rapid advances in genome sequencing [34–35], have provided a novel opportunity for dissecting the genetic basis of heterosis.
In Xiao et al.’s study [8], based on the NCIII design, 194 F7 RIL were backcrossed to their parental lines to develop the mapping population, and 37 QTL were detected for 12 quantitative traits by single-point analysis [one-way analysis of variance (ANOVA)] and an interval mapping method. In one of the two BC1F7 populations, 82% of the detected heterozygotes were superior to the respective homozygotes; therefore, Xiao et al. concluded that dominance complementation was the major genetic basis of heterosis in rice. On the other hand, Li et al. [21] and Luo et al. [22] investigated five interrelated mapping populations by an interval mapping method in which 254 F10 RIL were selected as the base population; two BC1F1 populations were derived from the NCIII design and two test cross populations were obtained by test crossing the RIL with two testers (Zhong 413 and IR64). The results suggested that epistasis and overdominance, rather than dominance, were the major genetic bases of heterosis in rice. Yu et al. [20] also pointed out that epistasis played a major role as the genetic basis of heterosis. Hua et al. [23] investigated the genetic components conditioning the heterosis of yield and yield component traits in an elite rice hybrid using an immortalized F2 population with modified composite interval mapping (CIM) and two-way ANOVA methods and found that heterotic effects at the single-locus level and a dominance × dominance interaction at the two-locus level could adequately explain the genetic basis of heterosis. In our previous study [24] based on the NCIII design, two recombinant inbred populations were backcrossed to their respective parents to develop mapping populations (L1 and L2) in which main-effect QTL were detected by the CIM method and epistatic QTL were detected by the mixed linear approach in the RIL population and summation (L1 + L2) and subtraction (L1 − L2) data of two backcross populations. The research demonstrated that heterosis was attributable to the orchestrated outcome of partial-to-complete dominance, overdominance, and epistasis. In addition, based on an ultra-high-density single nucleotide polymorphism bin map constructed with population sequencing, the immortalized F2 population in Hua et al. [23] was reanalyzed by Zhou et al. [26] with an h test in one-locus effects detection and two-way ANOVA in two-locus interactions for the whole genome. The results suggested that relative contributions of the genetic components varied with traits; single-locus dominance had relatively small contributions in all of the traits and the cumulative effects of these different components may adequately explain the genetic basis of heterosis. This conclusion was consistent with our previous study [24].
In summary, most of the mapping populations above derive from the NCIII design, and QTL mapping methods usually employ ANOVA, interval mapping, or the CIM method in one-locus effects detection and two-locus interactions. However, the estimated additive and dominant effects are confounded with epistatic effect if epistasis is present. Kao and Zeng [36] pointed out that a two-way ANOVA-exploiting genetic marker and trait phenotype data from an F2-segregating population was, in principle, inappropriate for testing for pairwise epistasis, even though this approach has been widely used in analyses of such data sets [20–21,23]. In addition, only one variable was involved in the model at one time, which was not able to capture all types of genetic effects, especially epistatic effects, simultaneously on the whole genome. In 2008, Garcia et al. [25] developed a multiple-interval mapping model for the NCIII design that provided a platform to simultaneously estimate the number, genomic positions, augmented additive and dominance effects, and digenic interactions (aa + dd and ad + da) of QTL. This method was used to reanalyze the datasets by Stuber et al. [12], who found that additive × additive effect (aa) epistatic effects of QTL could be the main cause for the heterosis in rice. After this, He et al. [37] proposed a method for mapping epistatic QTL associated with heterosis using the RIL-based NCIII design by a series of simulation studies; however, main or epistatic effects were mixed measured as augment effects.
In 1988, Liu [38] proposed an aTTC design based on the TTC design. In TTC design, base populations are backcrossed to P1, P2 and F1 to get L1i, L2i, and L3i, i = 1…n, whereas in aTTC design, base populations are simultaneously self-mated to get L4i. The aTTC design provided several ways to detect epistasis by detecting a variance component. However, there was no report based on aTTC design for QTL mapping on a Mendelian factor level.
In this paper, under aTTC design, based on four data sets (L1i, L2i, L3i, and L4i), we developed six data set transformations [38]: Z1i = L1i + L2i, Z2i = L1i − L2i, Z3i = L1i + L2i − 2L3i, Z4i = L1i + L2i − L4i, Z5i = L1i + L2i + L3i, and Z6i = 2L3i − L4i. By employing Z1, Z2, and Z3, He and Zhang [39] provided a complete solution for dissecting main and epistatic effects in the F2-based TTC design through a simulation study. Our study utilized different data set combinations (Z1, Z2, and Z4), (Z1, Z2, and Z5), and (Z1, Z2, and Z6), respectively, to provide a two-step approach for estimating, in an unambiguous and unbiased manner, all the main and, especially, epistatic effects of QTL; this method also fits for many types of base populations such as RIL, F2, and Double Haploid (DH). Here, we will take the first combination (Z1, Z2, and Z4) of the RIL-based aTTC design as an instance for QTL mapping to dissect genetic effects. The other combinations (Z1, Z2, and Z5) and (Z1, Z2 and Z6) listed above can also be used to estimate genetic effects. A series of Monte Carlo simulation studies were carried out to confirm the proposed approach. We further applied the proposed method to real data analysis.
Materials and methods
Genetic design
In aTTC design, F2 populations or their offspring (BC, DH, or RIL) derived from the hybridization of two pure lines (P1 and P2) and were selected as the base population. On one hand, n individuals in the base population were crossed to three testers (P1, P2, and F1) to get L1i, L2i, and L3i, respectively (i = 1, 2… n); on the other hand, the n individuals in the base population were self-mated to get L4i. Therefore, 4n aTTC lines (L1i, L2i, L3i, and L4i) can be obtained and used for the detection of epistasis.
All 4n families, each with m replications, were planted. Molecular marker information was observed from all of the n base population lines and the testers P1, P2 and F1, whereas quantitative traits were measured for all 4nm aTTC progeny. The phenotypic observations were denoted by ytij, where t = 1, 2, 3, and 4 for L1i, L2i, L3i, and L4i; respectively; j = 1, 2… m. The family means were denoted by .
The genetic expectations of six data set transformations, Z1i, Z2i, Z3i, Z4i, Z5i, and Z6i, were obtained from L1i, L2i, L3i, and L4i. Z1i = L1i + L2i, Z2i = L1i − L2i, Z3i = L1i + L2i − 2L3i, Z4i = L1i + L2i − L4i, Z5i = L1i + L2i + L3i, and Z6i = 2L3i − L4i. Two main metrics were adopted for the 4n aTTC lines: the F∞ and F2 metrics [36,40]; their genetic expectations are listed in S1 Supporting Information.
Genetic models for mapping heterotic QTL in RIL-based aTTC design
The derivation of the expected genetic values of Z1i, Z2i, Z3i, Z4i, Z5i, and Z6i under both the F∞ and F2 metric models is presented in S3 Supporting Information under the assumption that the quantitative trait was determined by two QTL with digenic epistasis and arbitrary linkage. The genetic effect symbols adopted in this research were described by Kao and Zeng [36]. He et al. [37] simulated and estimated main and epistatic QTL in the RIL-based NCIII design under both the F∞ and F2 metrics models and found that QTL mapping results under the F∞ metric were superior to the F2 metric; therefore, this paper simulated QTL under the F∞ metric models.
QTL mapping models in the RIL-based aTTC design under the F∞ metric model.
The phenotypic values of Z1i and Z2i in the RIL-based aTTC design are the same as the RIL-based NCIII design. Details can be found in the publication by He et al. [37]. According to the genetic expectations of Z1i under the F∞ metric model (Table A5 in S1 Supporting Information), the phenotypic value of Z1i can be described as
(1)
where μ is the mean genotypic value of the four homozygotes in the RIL population; ak and dk are additive and dominance effects of the kth QTL (k = 1, 2);
,
,
, and
are additive × additive, additive × dominance, dominance × additive, and dominance × dominance interactions between two QTL, respectively;
,
,
,
,
, and
are dummy variables and are determined by the genotype of the ith RIL line (Table A5 in S1 Supporting Information); and e1i is the residual error with an
distribution. According to Table A5 in S1 Supporting Information,
and
, model (1) can be reduced to
(2)
where
,
,
,
, and
. If the quantitative trait was controlled by q QTL, model (2) should be extended to
(3)
where the model mean
;
is the augmented additive effect of QTL k;
is the augmented epistatic effect between QTL k and l. Coefficients
and
are determined by genotypes of the kth and lth QTL for the ith RIL line, as shown in Table 1.
Similarly, the phenotypic value of Z2i can be described as
(4)
where
,
,
,
,
, and
are determined by the genotype of the ith RIL line (Table A6 in S1 Supporting Information), e2i is the residual error with an
distribution. According to Table A6 in S1 Supporting Information,
and
, model (4) can be reduced to
(5)
where
,
,
,
, and
. If the quantitative trait controlled by q QTL, model (5) should be extended to
(6)
where the model mean
;
is the augmented dominance effect of QTL k,
is the augmented epistatic effect between QTL k and l. Coefficients
and
are determined by genotypes of the kth and lth QTL for the ith RIL line, as shown in Table 1.
Similarly, the phenotypic value of Z3i can be described as
(7)
where
, r is the recombination fraction between two QTL; dummy variables
,
, and
are determined by the genotype of the ith RIL line (Table A7 in S1 Supporting Information). e3i is the residual error with an
distribution. Genetic effects
can be estimated directly.
In the same way, the phenotypic value of Z4i can be described as
(8)
where
,
,
, and
are determined by the genotype of the ith RIL line (Table A8 in S1 Supporting Information), e4i is the residual error with an
distribution. According to Table A8 in S1 Supporting Information, there are
and
. Therefore, model (8) can be reduced to
(9)
where
,
,
, and
,
.
If the quantitative trait was controlled by q QTL, model (9) can be extended to
(10)
where the model mean
,
,
is the augmented epistatic effect between QTL k and l. Coefficients
and
are determined by genotypes of the kth and lth QTL for the ith RIL line, as shown in Table 1.
In the same way, the phenotypic value of Z5i can be described as
(11)
where
,
,
,
,
, and
are determined by the genotype of the ith RIL line (Table A9 in S1 Supporting Information), e5i is the residual error with an
distribution. According to Table A9 in S1 Supporting Information, model (11) can be reduced to
(12)
where the model mean
. Genetic effects
can be estimated directly.
In the same way, the phenotypic value of Z6i can be described as
(13)
where
,
,
, and
are determined by the genotype of the ith RIL line (Table A10 in S1 Supporting Information), e6i is the residual error with an
distribution. According to Table A10 in S1 Supporting Information, model (13) can be reduced to
(14)
where the model mean μz6 = μ + d1 + d2. Genetic effects
can be calculated directly.
Model parameter components for Z1i, Z2i, Z3i, Z4i, Z5i and Z6i in the RIL-based aTTC design under both the F∞ were listed in Table 2.
Genetic models for mapping heterotic QTL in F2-based aTTC design
Genetic models for mapping heterotic QTL in the F2-based aTTC design under both F∞ and F2 metric can be found in S4 Supporting Information.
Parameter estimation
For a continuously distributed trait, the observed phenotypic value yi of individual i can be described by the linear regression model
(15)
where q is the number of markers, μ is the overall mean, xki denotes the genotype of marker k for individual i and is defined as −1 or 1 for the two genotypes in the mapping population, and xkli represents the epistatic genotype between the kth and lth QTL of individual i, and is obtained as the element-wise product of xkj and xlj. In addition, ak and ikl are the corresponding augmented main and epistatic effects, respectively.
is the total number of genetic effects and xji and βj are the corresponding genotypes and coefficients, including the main and epistatic effects. ei is the residual error assumed to follow an N(0,σ2) distribution.
Model (15) can be written as
(16)
where vectors βG and
represent the augmented main and epistatic effects of all markers, respectively. XG and
are corresponding design matrices of different effects and e is the residual error that follows an N(0,σ2) distribution. Defining
and
, model (16) can be written in a more compact form
(17)
Due to the physical linkage or epistatic interactions among multiple QTL, it is rational when taking a large number of loci into consideration simultaneously. However, the total number of genetic effects p is very large because we set each marker as a QTL initially. Typically, we have p >> n. To handle such an oversaturated model, we employed a fast empirical Bayesian LASSO (EBLASSO) algorithm. Simulation studies demonstrate that the EBLASSO method can sharply reduce the computational burden by shrinking small effects into zero, and can detect more true QTL effects without increasing the false-positive rate. Details of the EBLASSO algorithm can be seen by reference to the work of Cai et al. [41]. At last, all remaining markers with are picked up, where
is the standard deviation estimation for normal prior
. The epistatic model is then established that only includes effects that pass the first round of selection. We also perform a usual likelihood ratio test on the model to obtain significant QTL and epistatic interactions. With transformation combinations (Z1, Z2, and Z4), following the above steps, in the first round, the argument effects ak*,
, dk*,
,
, and
can be obtained; in the second round, main and epistatic effects of each QTL and interaction can be calculated according to equation transformations
,
,
,
,
, and
.
Take any two significant QTL in a model as an example to explain how we obtain genetic effects. After performing QTL mapping, genetic parameters and
in Z4i,
in Z1i, and
in Z2i can be obtained. For
and
, so
,
, while
and
; we then get
,
. Meanwhile, a1*, a2*, d1*, and d2* are obtained after performing QTL mapping in Z1i and Z2i because
,
,
, and
, with the estimation value of d1*, d2*, and above
,
,
, and
values, main effects can be calculated by
,
,
, and
. All main and epistatic effects were dissected by the integration of augmented effects in Z1, Z2, and Z4.
Similarity, with transformation combination (Z1, Z2, and Z5) and (Z1, Z2, and Z6), we can also get the genetic effect of each QTL or interaction by QTL mapping under the aTTC design, respectively. More details are listed in S1 Supporting Information.
Simulation study
We took all the possible types of epistatic interaction patterns into consideration. The simulated genome, covered by 100 evenly spaced markers with a marker interval of 5 cM, was 495 cM in total length and comprised four chromosomes. For data sets Z1, Z2, or Z4, six QTL positions were preset, of which three positions (QTL1, QTL2, and QTL4) had main effects. Pairwise interactions were set between positions with main effects (QTL1 and QTL2), with and without main effects (QTL3 and QTL4), and without main effects (QTL5 and QTL6), respectively. The assumed QTL positions, parameters, and augmented effects (including main and epistatic effects) are listed in Table 3 and Tables A-B in S11 Supporting Information. The sample size (n) was set at three levels: 800, 400, and 200. The broad heritability (h) was also set at three levels: 0.8, 0.5, and 0.2, separately representing high, middle, and low heritabilities. The replication number of offspring (m) was set at two levels: 5 and 10. In three transformations, Z1, Z2, and Z4, each treatment was replicated 100 times. Simulation study also be conducted for Z5 and Z6. The results of Z5 and Z6 were listed in Tables C and D in S11 Supporting Information.
QTL mapping in simulation study.
All independent variable for Z1, Z2, Z4 in simulation study were simultaneously included in one genetic model, which was much larger than the sample size. Data sets were implemented in R (version 3.0) with the EBLASSO package obtained from Cai et al. [41]. Hyper-parameters a and b were obtained by three-fold cross-validation (by default) in each individual model; after 100 replications, hyper-parameters with minimum predicted errors were fixed to estimate parameters. The time was approximately 5 minutes in each transformation in a stand-alone personal computer (Intel Pentium CPU 2.9 GHz; memory 4 GB); therefore, the EBLASSO algorithm presented high efficiency and saved time.
Real data analysis
Considering the unbiased estimate of coefficients and the excellent detection power in the simulation study, we further applied the proposed approach to a real mapping population and presented a comparison to previous mapping results.
Populations.
Two elite rice hybrids, one inter-subspecific between 9024 (indica) and LH422 (japonica) and one intra-subspecific between Zhenshan97 (indica) and Minghui63 (indica), were analyzed, and details were documented in our previous study [24]. For convenience, we designated the two hybrids as IJ and II hybrids, respectively. The RIL were derived from the cross of a random sample of F2 individuals to their parental lines (194 F7 lines for the IJ hybrid and 222 F12 lines for the II hybrid, respectively).
Genetic linkage maps.
For the II hybrid, the linkage map comprised 221 marker loci and covered 1796 cM in total [42]. For the IJ hybrid, Xiao et al. [8] constructed a linkage map of the recombinant population in which a subset of 141 polymorphic restriction fragment length polymorphism markers was used.
Phenotypic traits.
Nine quantitative traits, including heading date (HD, in days), plant height (PH, in centimeters), tillers per plant, panicle length (PL, in centimeters), filled grains per panicle (FGPP), percentage of seed set, grain density (GD, in grain numbers per centimeter of panicle length), 1000-grain weight (KGW, in grams), and grain yield (YD, in tons/hectare) were investigated in RIL, Z1, Z2, and Z4 respectively. All the materials described above were laid out in a field in a randomized complete block design with two replications (plots) for phenotypic evaluation.
QTL mapping in real data analysis.
Data sets Z1, Z2 and Z4 were implemented in R (version 3.0) with the EBLASSO package obtained from Cai et al. [41] for QTL mapping.
Results
Simulation study results
Augmented effects in simulation study.
As shown in Table 3 and Tables A-B in S11 Supporting Information, the augmented additive () and epistatic effects (
) in Z1, augmented dominance (
) and epistatic effects (
) in Z2, augmented epistatic effects (
,
) in Z4 were rightly and unbiased estimated with a high statistical power in preset positions. The ratio of the number of samples, in which the LOD statistic was greater than 2.5, to the total number of replicates represented the empirical power of this simulated QTL or interaction.
In Z1 transformation (Table A in S11 Supporting Information, S1 Fig), when the sample size was 800 or 400, almost all augmented additive and epistatic effects were detected, except for the detection power of digenic interactions in 400 samples, 0.2 heritability, and 5 replications. This indicated that smaller heritability or less individual replication had little influence on the detection of QTL in a relatively large sample size. When the sample size was reduced to 200, all the preset QTL were detected successfully with a heritability of 0.8 and 0.5; however, detection power decreased sharply to the level of 0.2, which is more true for the preset digenic interactions. When individual replication was 5, the detection power of the augment additive effect of QTL3 was 0.73, whereas it was 0.985 when the individual replication was 10, and the detection power of augmented epistatic effects dropped to 0.59 for interaction QTL5 × QTL6. Similar results could be found in Z2 (Table B in S11 Supporting Information, S2 Fig); all the preset QTL were precisely detected and the QTL effects were estimated in an unbiased manner, even on the level of the smallest sample size (200) or the lowest heritability (0.2). In addition, all the augmented epistatic effects estimated in an unbiased manner in Z4 (Table 3, Fig 1). The poor detection power occurred only on a low heritability level (0.2) with sample sizes 400 or 200.
Epis-1 refer the augmented epistatic effect (iaa—idd) interaction, epis-2 refer the augmented epistatic effect (iad—ida) interaction.
Main and epistatic effects in simulation study.
Table 4 shown the main and epistatic effects of QTL1 and QTL2 in the in RIL-based aTTC design using the two-step approach under the F∞ metric model. Other pairs of interactions are listed in Tables 5 and 6 for the interaction between QTL3 and QTL4 and QTL5 and QTL6, respectively. We can see that the main-effect and epistatic effects of QTL were very close to set value when sample size is big (800) and heritability is high (0.8). all the preset QTL were precisely detected and the QTL effects were estimated in an unbiased manner, except on the level of the smallest sample size (200) and the lowest heritability (0.2) with 5 replications in Z1, Z2 and Z4.
Real data analysis results
QTL mapping in II and IJ hybrid.
In the II hybrid, all were simultaneously included in the genetic model, about 115 times as large as the sample size, while in IJ hybrids,
, which was about 45 times bigger than the sample size. QTL effect-explained 1% phenotypic variation was set as a threshold for declaring the presence of QTL. QTL mapping results for the II and IJ hybrids are listed in Tables A and B in S12 Supporting Information, respectively.
Augmented effects in II and IJ hybrid.
As shown in Table A in S12 Supporting Information, 14 QTLs and 36 digenic interactions were detected in the II hybrid, and the explained variation of a single QTL or interaction varied from 1.13% to 7.67%. In the RIL mapping population, 8 QTL (16%) were revealed, of which 2 QTLs (25%) were detected in trait GD with relative small phenotypic variation. In trait YD, a digenic interaction of marker C1016 and C483 explained the maximum (7.49%) phenotypic variation. In Z1, 11 QTL (22%) were detected, and one or more QTL were revealed in each trait. The explained variation in Z1 varied from 1.21% to 7.67%. The interaction between marker R3166 and RZ667 was also detected in traits FGPP and GD, and explained 7.67% and 7.33% variation, respectively. In Z2, 10 QTLs were identified. There was no QTL detected in trait PL. The explained variation in Z2 varied from 1.13% to 5.91%. In Z4, 11 interactions were found, and at least one QTL or interaction was revealed in each trait. The maximum explained variation was in trait HD (6.05%). Ten interactions were also dissected, and the explained variation of a single interaction varied from 1.69% to 7.09%.
In the IJ hybrid, as shown in Table B in S12 Supporting Information, a total of 46 QTL and 75 interactions was detected. Of the detected QTL, the majority was detected in RIL (39.37%) and Z4 (41.73%). The explained variation of a single QTL in the RIL ranged from 1.09% to 27.48%. In Z4, the detected QTL-associated marker RG333 affected HD in chromosome 8 explained 27.48% of phenotypic variation. It was also found simultaneously in Z1 and Z2 data sets. Eighteen QTLs in Z1 and 9 QTLs in Z2 were identified. In Z1, QTL-associated marker RG333 influenced HD, accounting for 36.58% of variation, which also explained 8.47% of variation for PH. In Z4, the majority of interactions were detected in trait FGPP in which 17 (58.6%) and 10 (41.7%) marker pairs were found in and
.
Dissection of main and epistatic effects.
Integrated in the QTL mapping result of Tables A and B in S12 Supporting Information, main and epistatic effects were dissected by the proposed approach distributed previous part 3.4. The results of the II and IJ hybrid are presented in Tables 7 and 8, respectively. For main effect QTL, we dissected the additive and dominance effects, whereas for interactions, additive × additive (aa), additive × dominance (ad), dominance × additive (da), dominance × dominance (dd), effects were dissected. The dominance degree of each QTL or interaction was separately calculated by |d1/a1| and |d1d2/a1a2|, respectively. Where d1, a1, d1d2, a1a2 denote the dissected dominance effect, additive effect, dominance × dominance epistatic effect, additive × additive epistatic effect, respectively. According to Stuber et al. [12], main effect QTL can be classified as additive (|d1/a1|<0.2), partial dominance (0.2≤|d1/a1|<0.8), dominance (0.8≤|d1/a1|<1.2), and overdominance (|d1/a1|≥1.2). Epistatic QTL can be classified as additive (|d1d2/a1a2|<0.2), partial dominance (0.2≤|d1d2/a1a2|<0.8), dominance (0.8≤|d1d2/a1a2|<1.2), and overdominance (|d1d2/a1a2|≥1.2).
HD: In the II hybrid, the only main effect QTL was classified as additive; in the other five epistatic QTL, most were classified as dominance, except one that showed overdominance. In the IJ hybrid, four main effect QTL were classified as additive, and only one main effect QTL was classified as partial dominance; four epistatic QTL were classified as dominance, and one epistatic QTL was classified as overdominance.
PH: In the II hybrid, two main effect QTL were classified as additive and overdominance, respectively; two epistatic QTL were classified as dominance. In the IJ hybrid, two main effect QTL were classified as additive and overdominance, respectively; in seven epistatic QTL, two were classified as overdominance and the other five were dominance.
Tillers per plant: In the II hybrid, the only main effect QTL was classified as overdominance; two of three epistatic QTL were classified as dominance and the remaining one was classified as overdominance. In the IJ hybrid, no main effect QTL was found; in seven epistatic QTL, two were classified as overdominance and the remaining five were dominance, which was similar to trait PH.
PL: In the II hybrid, the only main effect QTL was classified as additive; two of three epistatic QTL were classified as dominance and the remaining one showed overdominance. In the IJ hybrid, no main effect QTL was found; in four epistatic QTL, one was identified as overdominance and the rest were dominance.
FGPP: In the II hybrid, no main effect QTL was found; in four epistatic QTL, one was classified as overdominance and the rest were dominance. In the IJ hybrid, the only main effect QTL was classified as additive; a total of 28 epistatic QTL was dissected, all of which showed dominance.
Percentage of seed set: In the II hybrid, the only main effect QTL was classified as additive and the three epistatic QTL were classified as dominance. In the IJ hybrid, only three epistatic QTL were dissected and all of them were classified as dominance.
GD: In the II hybrid, no main effect QTL was found; among four epistatic QTL, one was classified as overdominance and the remaining three were dominance. In the IJ hybrid, the only main effect QTL was classified as overdominance; among four epistatic QTL, three were classified as dominance and the remaining one was classified as overdominance.
KGW: In the II hybrid, two main effect QTL were classified as overdominance; in four epistatic QTL, one showed overdominance and the remaining three showed dominance. In the IJ hybrid, two main effect QTL were classified as additive and overdominance, respectively; two epistatic QTL showed dominance.
YD: In the II hybrid, two main effect QTL were classified as additive and overdominance, respectively; all four epistatic QTL were classified as dominance. In the IJ hybrid, the only main effect QTL showed overdominance, and the three epistatic QTL were classified as dominance.
From Tables 7 and 8, we can see that little common loci were found. This phenomenon partially results from the mapping markers in IJ and II hybrid are different. But on the same chromosome, we found some nearby loci affected same trait in both II and IJ hybrids.
Table 9 summarizes the main and epistatic QTL revealed in the II and IJ hybrids. For main effect QTL, 10 QTL were identified in the II hybrid; five were classified as additive, and the rest were classified as overdominance. In the IJ hybrid, 12 QTL were found, more additive (58.33%) loci were identified than overdominance in number (33.34%). For epistatic QTL, dominance or overdominance are found in two hybrid combinations, and dominance played a leading role in epistatic QTL. Dominance accumulation and overdominance were the major genetic basis of heterosis.
Discussion
Models comparison
Based on the aTTC design, this paper developed a QTL mapping method that fit for many base populations (RIL, F2, and DH); by employing multiple data set transformations (Z1i, Z2i, Z3i, Z4i, Z5i, and Z6i), many types of main and epistatic effects can be dissected. This paper took one combination (Z1, Z2, and Z4) of the aTTC design as an instance and proposed a two-step approach to dissect additive, dominance, and epistatic effects of QTL in the RIL-based aTTC design. A series of Monte Carlo simulation studies were carried out to confirm the proposed approach. Compared to previous studies on our methodologies, the proposed approach offered great advantages over previous methods.
aTTC design has many more transformations than do the TTC or NCIII designs, and with a series of transformation combinations (Z1, Z2, and Z4), (Z1, Z2, and Z5), or (Z1, Z2, and Z6), we can dissect main and epistatic effects of individual QTL or interactions by QTL mapping. It provides a new method for quantitative genetics research and especially for allowing scientists and breeders to understand the genetic basis for plant heterosis. In our study, we took the transformation combination Z1, Z2, and Z4 of RIL-based aTTC design as an instance to dissect genetic effects. There were some advantages when taking RIL as the base population. The genetic expectation mean of RIL was equivalent to L4i; therefore, there was no need to self-mate the base population. We simply used the RIL population data set substitute L4i, which saved labor and time. When using RIL-based TTC design for QTL mapping, we need generate four populations RIL, L1, L2 and L3. However, when using RIL-based aTTC design, breeders only need generate three populations RIL, L1 and L2. With combination (Z1 = L1+L2, Z2 = L1−L2 and Z4 = L1+L2−L4), we can dissect additive, dominance, and epistatic effects of QTL with high statistical powers and accuracies. In addition, many real mapping populations that derived from RIL-based NCIII design can be re-analyzed by the proposed method to develop main and epistatic effects to clearly decipher a genetic basis for heterosis.
In the present study, we used three different interaction patterns in one genetic model, which was much more complicated than that proposed by He et al. [37,39]. As shown in Tables A and B in S11 Supporting Information, with high detection power, all the augmented main effects in QTL (QTL1-6) and epistatic effects in digenic interactions (QTL1 and QTL2, QTL3 and QTL4, and QTL5 and QTL6) were estimated in an unbiased manner in Z1 and Z2. In Z4, two augmented epistatic effects and
were further estimated precisely (shown in Table 3).
Actually, for the detection of small and linked QTL, low powers were observed. EBLASSO can handle the model that includes many effects [37, 39, 43]. In this sudy, we use a large number of effects, including main and epistatic QTL effects, simultaneously. EBLASSO shrinks weak effect into zero, which has little influence on large effect QTL. Simulation studies demonstrated that the fast EBLASSO greatly improved calculated speed and detected more true QTL effects without increasing the false-positive rate.
Comparison of QTL mapping results in II and IJ hybrid with previous mapping results
The QTL mapping results of this paper were compared with those of our previous study [24] in which the CIM was employed to mapping main effect QTL [44], and the mixed linear approach [45] was used to estimate epistatic QTL. QTL detected by both studies are listed in Tables 10 and 11 for II and IJ hybrids, respectively. As shown in Table 10, a total of nine main effects QTL and four epistatic QTL was found simultaneously in two studies; only one QTL revealed in trait PH showed opposite dominance degree. In an RIL mapping population, three main effects QTL were detected; two main effects QTL were detected in Z1, and both of them were identified as additive. Eight QTL were found in Z2, and half of them were main effects; dominance degree revealed by two methods was similar, except for marker R1925 in trait PH. For the IJ hybrid, shown in Table 11, no epistatic QTL was simultaneously detected. The number of main effects QTL detected by both studies was 17, 3, and 3 for RIL, Z1, and Z2, respectively. Except for marker CDO533 in trait PH, the detected main effects QTL showed the same dominance degree. If not taking threshold into consideration, the ratio of same main effect QTL detected by the fast EBLASSO algorithm to previous mapping results were 23.08%, 22.22%, and 50% for RIL, Z1, and Z2, respectively in the II hybrid, whereas in the IJ hybrid, they were 54.84%, 17.65%, and 21.430%.
Among the identified QTL, some of them were pleiotropic. In the IJ hybrid, marker RG333 on chromosome 8 was simultaneously revealed in traits HD, PH, KGW, and YD; marker CDO1081 on chromosome 3 was simultaneously identified for traits HD, FGPP, and YD. These markers, especially for marker RG333 and marker CDO1081, were also found pleiotropic in the work of Xiao et al. [8] and Li et al. [24]. These regions deserve further attention, especially in marker-assisted breeding.
Genetic basis of heterosis with real data analysis
With our proposed approach, we dissected genetic effects of QTL and interactions for the II and IJ hybrids, respectively, and calculated the dominance degree of each QTL or digenic interaction (Tables 7 and 8). We summarized the classified dominance degree of real mapping populations (Table 9) and found that dominance degree in the Z2 data set that mainly characterized the heterosis showed overdominance and dominance for QTL and digenic interactions, and the ratio of dominance is greater than overdominance. Therefore, we conclude that dominance accumulation and overdominance are the major genetic basis of heterosis. This finding is consistent with Huang et al. [4], who pointed out that the accumulation of numerous rare superior alleles with positive dominance was an important contributor to heterotic phenomena after genomic analysis of hybrid rice varieties.
To explicitly elucidate the influence of single-locus (additive and dominance) and two-loci (aa, ad, da, and dd epistatic effect) genetic effects conditioning the heterosis of agronomic traits, models or genetic mating design (e.g., RIL-based TTC design) [30, 38], which can be used to study how interactions among multiple genes can lead to the phenotypic manifestations of heterosis, are probably the most relevant. Recent findings from genomic, proteomic, metabolic, epigenetic, and network studies in hybrids and polyploids also highlight some testable models for heterosis [46].
Supporting information
S1 Supporting Information. Expected genetic values of Z1i, Z2i, Z3i, Z4i, Z5i and Z6i under both the F∞ and F2 metric models in RIL-based aTTC design.
https://doi.org/10.1371/journal.pone.0189054.s001
(DOCX)
S2 Supporting Information. Expected genetic values of Z1i, Z2i, Z3i, Z4i, Z5i and Z6i under both the F∞ and F2 metric models in F2-based aTTC design.
https://doi.org/10.1371/journal.pone.0189054.s002
(DOCX)
S3 Supporting Information. Statistical genetic models for mapping heterotic QTL in the RIL-based aTTC design under the F2 metric model.
https://doi.org/10.1371/journal.pone.0189054.s003
(DOC)
S4 Supporting Information. Statistical genetic models for mapping heterotic QTL in the F2-based aTTC design under the F∞ metric model.
https://doi.org/10.1371/journal.pone.0189054.s004
(DOC)
S5 Supporting Information. Simulation data generate script.
https://doi.org/10.1371/journal.pone.0189054.s005
(ZIP)
S6 Supporting Information. Simulation data of Z1.
https://doi.org/10.1371/journal.pone.0189054.s006
(ZIP)
S7 Supporting Information. Simulation data of Z2.
https://doi.org/10.1371/journal.pone.0189054.s007
(ZIP)
S8 Supporting Information. Simulation data of Z4.
https://doi.org/10.1371/journal.pone.0189054.s008
(ZIP)
S9 Supporting Information. Simulation data of Z5.
https://doi.org/10.1371/journal.pone.0189054.s009
(ZIP)
S10 Supporting Information. Simulation data of Z6.
https://doi.org/10.1371/journal.pone.0189054.s010
(ZIP)
S11 Supporting Information. Simulation results of Z1,Z2,Z5,Z6.
https://doi.org/10.1371/journal.pone.0189054.s011
(DOC)
S12 Supporting Information. Real data augument effect results.
https://doi.org/10.1371/journal.pone.0189054.s012
(DOC)
S1 Fig. The mean statistic power of augmented main and epistatic effect interactions in Z1.
https://doi.org/10.1371/journal.pone.0189054.s013
(TIF)
S2 Fig. The mean statistic power of augmented main and epistatic effect interactions in Z2.
https://doi.org/10.1371/journal.pone.0189054.s014
(TIF)
S3 Fig. The mean statistic power of augmented main and epistatic effect interactions in Z5.
https://doi.org/10.1371/journal.pone.0189054.s015
(TIF)
S4 Fig. The mean statistic power of augmented main and epistatic effect interactions in Z6.
https://doi.org/10.1371/journal.pone.0189054.s016
(TIF)
Acknowledgments
The authors thank the anonymous reviewers for their comments on an earlier version of the manuscript.
References
- 1. Shull GH. The composition of field of maize. J Hered. 1908; 1: 296–301.
- 2. Holá D, Benešová M, Fischer L, Haisel D, Hnilič ka F, Hniičková H, et al. The disadvantages of being a hybrid during drought: A combined analysis of plant morphology, physiology and leaf proteome in maize. PLoS ONE. 2017; 12(4): e0176121. pmid:28419152
- 3. Hochholdinger F, Hoecker N. Towards the molecular basis of heterosis. Trends in Plant Science. 2007; 12(9): 427–432. pmid:17720610
- 4. Huang XH, Yang SH, Gong JY, Zhao Y, Feng Q, Gong H, et al. Genomic analysis of hybrid rice varieties reveals numerous superior alleles that contribute to heterosis. Natuer Communication. 2015; 6: 6258.
- 5. Bruce AB. The mendelian theory of heredity and the augmentation of vigor. Science. 1910; 32(827): 627–628.
- 6. Lippman ZB, Zamir D. Heterosis: revisiting the magic. Trends in Genetics. 2007; 23(2): 60–66. pmid:17188398
- 7. Jones DF. Dominance of linked factors as a means of accounting for heterosis. Genetics. 1917; 3(4): 310–312.
- 8. Xiao JH, Li JM, Yuan LP, Tanksley SD. Dominance is the major genetic basis of heterosis in rice as revealed by QTL analysis using molecular markers. Genetics. 1995; 140(2): 745–754. pmid:7498751
- 9. Hull FH. Recurrent selection for specific combining ability in corn. Agron J.1945; 37: 134–145.
- 10. Crow JF. Alternative hypotheses of hybrid vigor. Genetics. 1948; 33(5): 477–487. pmid:17247292
- 11. Liang QZ, Shang LG, Wang YM, Hua JP. Partial Dominance, overdominance and epistasis as the genetic basis of heterosis in upland cotton (Gossypium hirsutum L.). PLoS ONE. 2015; 10(11):e0143548. pmid:26618635
- 12. Stuber CW, Lincoln SE, Wolff DW, Helentjaris T, Lander ES. Identification of genetic factors contributing to heterosis in a hybrid from two elite maize inbred lines using molecular markers. Genetics. 1992; 132(3): 823–839. pmid:1468633
- 13. Comstock RE, Robinson HF, Gowen JW. Estimation of average dominance of genes. Heterosis. 1952; 494–516.
- 14. Kearsey MJ, Jinks JL. A general method of detecting additive, dominance and epistatic variation for metrical traits. Heredity. 1968; 23(3): 403–409. pmid:5250122
- 15. Cockerham CC, Zeng ZB. Design III with marker loci. Genetics. 1996; 143(3): 1437–1456. pmid:8807314
- 16. Frascaroli E, Canè MA, Landi P, Pea G, Gianfranceschi L, Villa M, et al. Classical genetic and quantitative trait loci analyses of heterosis in a maize hybrid between two elite inbred lines. Genetics. 2007; 176(1): 625–644. pmid:17339211
- 17. Melchinger AE, Utz HF, Piepho HP, Zeng ZB, Schön CC. The role of epistasis in the manifestation of heterosis: A systems-oriented approach. Genetics. 2007; 177(3): 1815–1825. pmid:18039883
- 18. Schön CC, Dhillon BS, Utz HF, Melchinger AE. High congruency of QTL positions for heterosis of grain yield in three crosses of maize. Theor Appl Genet. 2009; 120(2): 321–332. pmid:19911156
- 19. Larièpe A, Mangin B, Jasson S, Combes V, Dumas F, Jamin P, et al. The genetic basis of heterosis: multiparental quantitative trait loci mapping reveals contrasted levels of apparent overdominance among traits of agronomical interest in maize (Zea mays L.). Genetics. 2011; 190(2): 795–811. pmid:22135356
- 20. Yu SB, Li JX, Xu CG, Tan YF, Gao YJ, Li XH, et al. Importance of epistasis as the genetic basis of heterosis in an elite rice hybrid. Proceedings of the National Academy of Sciences. 1997; 94(17): 9226–9231.
- 21. Li ZK, Luo LJ, Mei HW, Wang DL, Shu QY, Tabien R, et al. Overdominant epistatic loci are the primary genetic basis of inbreeding depression and heterosis in rice. I. biomass and grain yield. Genetics. 2001; 158(4): 1737–1753. pmid:11514459
- 22. Luo LJ, Lia ZK, Mei HW, Shu QY, Tabien R, Zhong DB, et al. Overdominant epistatic loci are the primary genetic basis of inbreeding depression and heterosis in rice. II. grain yield components. Genetics. 2001; 158(4): 1755–1771. pmid:11514460
- 23. Hua JP, Xing YZ, Wu WR, Xu CG, Sun XL, Yu SB, et al. Single-locus heterotic effects and dominance by dominance interactions can adequately explain the genetic basis of heterosis in an elite rice hybrid. Proceeding of the National Academy of Sciences. 2003; 100(5): 2574–2579.
- 24. Li LZ, Lu KY, Chen ZM, Mu TM, Hu ZL, Li XQ. Dominance, overdominance and epistasis condition the heterosis in two heterotic rice hybrids. Genetics. 2008; 180(3): 1725–1742. pmid:18791236
- 25. Garcia AAF, Wang SC, Melchinger AE, Zeng ZB. Quantitative trait loci mapping and the genetic basis of heterosis in maize and rice. Genetics. 2008; 180(3): 1707–1724. pmid:18791260
- 26. Zhou G, Chen Y, Yao W, Zhang CJ, Xie WB, Hua JP, et al. Genetic composition of yield heterosis in an elite rice hybrid. Proceeding of the National Academy of Sciences. 2012; 109(39): 15847–15852.
- 27. Kearsey MJ, Pooni HS, Syed NH. Genetics of quantitative traits in Arabidopsis thaliana. Heredity. 2003; 91(5): 456–464. pmid:14576738
- 28. Syed NH, Chen ZJ. Molecular marker genotypes, heterozygosity and genetic interactions explain heterosis in Arabidopsis thaliana. Heredity. 2005; 94(3): 295–304. pmid:15316557
- 29. Kusterer B, Piepho HP, Utz HF, Schön CC, Muminovic J, Meyer RC, et al. Heterosis for biomass-related traits in Arabidopsis investigated by quantitative trait loci analysis of the triple testcross design with recombinant inbred lines. Genetics. 2007; 177(3): 1839–1850. pmid:18039885
- 30. Melchinger AE, Utz HF, Schön CC. Genetic expectations of quantitative trait loci main and interaction effects obtained with the triple testcross design and their relevance for the analysis of heterosis. Genetics. 2008; 178(4): 2265–2274. pmid:18430948
- 31.
Yuan LP. Hybrid rice breeding in China. Advances in Hybrid Rice Technology. Philippines: International Rice Research Institute. 1998; pp 27–33.
- 32. Arumuganathan K, Earle ED. Nuclear DNA content of some important plant species. Plant Molecular Biology Reporter. 1991; 9(3): 208–218.
- 33. Harushima Y, Yano M, Shomura A, Sato M, Shimano T, Kuboki Y, et al. A high-density rice genetic linkage map with 2275 markers using a single F2 population. Genetics. 1998; 148(1): 479–494. pmid:9475757
- 34. Goff SA, Ricke D, Lan TH, Presting G, Wang R, Dunn M, et al. A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Scienc. 2002; 296(5565): 92–100.
- 35. Yu J, Hu SN, Wang J, Wong GKS, Li SG, Liu B, et al. A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science. 2002; 296(5565): 79–92. pmid:11935017
- 36. Kao CH, Zeng ZB. Modeling epistasis of quantitative trait loci using Cockerham’s model. Genetics. 2002; 160(3): 1243–1261. pmid:11901137
- 37. He XH, Hu ZL, Zhang YM. Genome-wide mapping of QTL associated with heterosis in the RIL-based NCIII design. Chinese Science Bulletin. 2012; 57(21): 2655–2665.
- 38. Liu DF. The augmented triple test cross design and its analysis. Journal of Huazhong Agricultural University. 1988; 7(4): 311–319.
- 39. He XH, Zhang YM. A complete solution for dissecting pure main and epistatic effects of QTL in triple testcross design. PloS One. 2011; 6(9): e24575. pmid:21949729
- 40. Zeng ZB, Wang T, Zou W. Modeling quantitative trait loci and interpretation of models. Genetics. 2005; 169(3): 1711–1725. pmid:15654105
- 41. Cai XD, Huang AH, Xu SZ. Fast empirical Bayesian LASSO for multiple quantitative trait locus mapping. BMC Bioinformatics. 2011; 12: 211. pmid:21615941
- 42. Xing YZ, Tan YF, Hua JP, Sun XL, Xu CG, Zhang Q. Characterization of the main effects, epistatic effects and their environmental interactions of QTLs on the genetic basis of yield traits in rice. Theor Appl Genet. 2002; 105: 248–257. pmid:12582526
- 43. Wen J, Zhao XW, Wu GR, Xiang D, Liu Q, Bu SH, et al. Genetic dissection of heterosis using epistatic association mapping in a partial NCII mating design. Scientific Reports. 2015; 5: 18376. pmid:26679476
- 44. Zeng ZB. Precision mapping of quantitative trait loci. Genetics. 1994; 136(4): 1457–1468. pmid:8013918
- 45. Wang DL, Zhu J, Li ZK, Paterson AH. Mapping QTLs with epistatic effects and QTL×environment interactions by mixed linear model approaches. Theor Appl Genet. 1999; 99: 1255–1264.
- 46. Chen ZJ. Genomic and epigenetic insights into the molecular bases of heterosis. Nature Reviews Genetics. 2013; 14: 471–482. pmid:23752794