Quantile regression in genomic selection for oligogenic traits in autogamous plants: A simulation study

  • Gabriela França Oliveira,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Resources, Software, Writing – original draft, Writing – review & editing

    gabriela.franca@ufv.br

    Current address: Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

    Affiliation Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

  • Ana Carolina Campana Nascimento,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Resources, Software, Writing – original draft, Writing – review & editing

    Current address: Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

    Affiliation Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

  • Moysés Nascimento,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Resources, Software, Writing – original draft, Writing – review & editing

    Current address: Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

    Affiliation Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

  • Isabela de Castro Sant'Anna,

    Roles Formal analysis, Methodology, Writing – review & editing

    Affiliation Center of Rubber Tree and Agroforestry Systems, Agronomy Institute (IAC), Votuporanga, São Paulo, Brazil

  • Juan Vicente Romero,

    Roles Investigation, Methodology, Software, Writing – review & editing

    Affiliation AGROSAVIA, The Colombian Agricultural Research Corporation, Mosquera, Colombia

  • Camila Ferreira Azevedo,

    Roles Methodology, Supervision, Writing – review & editing

    Current address: Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

    Affiliation Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

  • Leonardo Lopes Bhering,

    Roles Methodology, Supervision, Writing – review & editing

    Affiliation Department of General Biology, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil

  • Eveline Teixeira Caixeta Moura

    Roles Supervision, Writing – review & editing

    Affiliation Empresa Brasileira de Pesquisa Agropecuária—Embrapa Café, Brasília, DF, Brazil

Abstract

This study assessed the efficiency of Genomic selection (GS) or genome‐wide selection (GWS), based on Regularized Quantile Regression (RQR), in the selection of genotypes to breed autogamous plant populations with oligogenic traits. To this end, simulated data of an F2 population were used, with traits with different heritability levels (0.10, 0.20 and 0.40), controlled by four genes. The generations were advanced (up to F6) at two selection intensities (10% and 20%). The genomic genetic value was computed by RQR for different quantiles (0.10, 0.50 and 0.90), and by the traditional GWS methods, specifically RR-BLUP and BLASSO. A second objective was to find the statistical methodology that allows the fastest fixation of favorable alleles. In general, the results of the RQR model were better than or equal to those of traditional GWS methodologies, achieving the fixation of favorable alleles in most of the evaluated scenarios. At a heritability level of 0.40 and a selection intensity of 10%, RQR (0.50) was the only methodology that fixed the alleles quickly, i.e., in the fourth generation. Thus, it was concluded that the application of RQR in plant breeding, to simulated autogamous plant populations with oligogenic traits, could reduce time and consequently costs, due to the reduction of selfing generations to fix alleles in the evaluated scenarios.

Introduction

In mid-2019, the world population reached 7.7 billion inhabitants and a further rise to 9.7 billion by 2050 is estimated [1]. Thus, more food must be produced to feed this population, although agricultural areas are increasingly limited and concerns about the negative environmental impacts of food production are growing [2, 3].

Since the Green Revolution in the 1960s, which caused a boost in the production potential of several crops, it is generally expected that plant breeding efforts will be able to secure the required yield gains [4]. The productivity of coffee trees, for example, has increased considerably, and one of the main reasons is the use of improved cultivars. In Brazil, coffee cultivars that were released and are still in use, e.g., “Mundo Novo”, are 240% more productive than introduced varieties [5]. Plant breeding programs, aside from focusing on higher yields, require the improvement of several other traits [6], e.g., the development of plants with a more appropriate architecture for higher plant density and mechanical management, better resistance and tolerance to biotic and abiotic stresses, adaptation to and stability in different cultivation environments, and a higher fruit and grain quality [7–11].

To meet the growing producer, consumer and market demand, a complex, continuous and dynamic breeding process is required [4], resulting in costly long-term projects for the development of superior cultivars. The developmental period of an improved cultivar of a perennial species can be over 25 years [12] and for annual species approximately 12 years. Thus, the search for procedures capable of providing superior genotypes in less time and, consequently, at a lower cost, has been intensified [4, 13, 14].

With a view to reducing the time demand and increasing selection accuracy, Meuwissen et al. [15] proposed genome-wide selection (GWS). This kind of selection uses direct DNA information based on molecular markers to predict the genomic estimated breeding value (GEBV) of an individual, which is a measure used to select the best individuals, according to their merit within the population. The main advantage of GWS, compared to phenotypic selection, is that the GEBV of individuals whose phenotypes were not yet collected can be estimated, thus resulting in a reduced generation interval and an increase in genetic gain [16–18].

The possibilities of applying GWS in autogamous plant breeding have been described in the literature. According to Heffner et al. [19], the prediction accuracy of GWS was superior to phenotypic selection in wheat. In simulated scenarios to improve oligogenic traits in Coffea arabica, with different population densities and sizes, Romero [20] tried to determine the generation in which a favorable allele is fixed. As a result, the author observed that in small populations (as commonly used in breeding programs, e.g., for coffee), favorable alleles were fixed in the sixth generation (F6), while in large populations, fixation occurred in the fifth generation (F5). GWS was also successfully applied in other crops, such as rice [21], oats [22] and barley [23].

An alternative and still little explored methodology for GWS studies is Quantile Regression (QR) [24]. Unlike traditional methods based on conditional means, QR allows regression models to be fitted across the whole distribution of the dependent variable, requires no assumptions about the error distribution and is robust to outliers. Parameter estimation is based on the weighted absolute errors method [25]. Regularized Quantile Regression (RQR), proposed by Li and Zhu [26], deals with the dimensionality problems that are common in the marker matrices of GWS studies. The use of RQR in a GWS study was proposed by Nascimento et al. [27], in order to estimate GEBV for different quantiles of the phenotype of interest [28, 29]. In their study, Nascimento et al. [27] used RQR to estimate GEBV from simulated data in scenarios with different skewness levels in the phenotype distribution. The results of RQR were compared to those of the BLASSO (Bayesian Least Absolute Shrinkage and Selection Operator) method, and the authors observed a lower mean square error for the former. The results indicated the viability of this alternative for GWS analysis, even in scenarios without skewness of the phenotype distribution. Santos et al. [30] also used RQR and BLASSO to estimate the genetic merit of pigs for asymmetric carcass traits and observed that RQR was as accurate as or more accurate than BLASSO for all evaluated traits.

In spite of the interesting and promising results, RQR has not yet been evaluated throughout an entire breeding process, considering the reproductive system of a plant species. Thus, this study evaluated whether the use of RQR in GWS, for simulated data of autogamous plants with oligogenic traits, at different selection and heritability levels, allows the fixation of favorable alleles in earlier generations than the commonly used GWS methodologies. The results of the predictive capacity, mean and genotypic variance obtained by RQR were compared with traditional methods of genomic selection, specifically with RR-BLUP and BLASSO.

Material and methods

Population simulation

For this study, a 1040 cM genome was simulated, using software GENES [31], with markers spaced 1 cM apart, with eight linkage groups, resulting in a total of 1048 markers [32]. Oligogenic traits controlled by four loci were simulated, located in four different linkage groups, with uniform effect and absence of dominance and epistasis.
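The marker total can be checked with a short calculation. The sketch below assumes that the eight linkage groups have equal length and that each group carries a marker at both ends; the text gives only the totals, so this layout is an assumption.

```python
# Assumption: the 1040 cM genome is split evenly over the eight linkage groups.
GENOME_CM = 1040
N_GROUPS = 8
SPACING_CM = 1

group_length_cm = GENOME_CM // N_GROUPS                 # 130 cM per group
markers_per_group = group_length_cm // SPACING_CM + 1   # markers at 0, 1, ..., 130 cM
total_markers = N_GROUPS * markers_per_group

print(markers_per_group, total_markers)
```

Under these assumptions each group holds 131 markers, which gives the 1048 markers reported.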

The F1 population was established by crossing contrasting parents, thus generating gametes for the formation of the F2 population, consisting of 625 individuals. Once the base genome was generated, genotypic values and three sets of phenotypic values were simulated, at heritability levels ($h^2$) of 0.10, 0.20 and 0.40. To determine the genotypic values ($vg_i$), the following equation was used:

$$vg_i = \mu + a_i + d_i \qquad (1)$$

where $vg_i$ is the genotypic value of individual i; $\mu$ the genotypic population mean (here $\mu = 1.0$); $a_i$ the additive effect of individual i, with $a_i = \sum_{k=1}^{4} \rho_k \alpha_k$, where $\rho_k = 2.5$ is the effect of the favorable allele, identical for every locus k; $\alpha_k$ the contribution of locus k (1, 0 or −1 for genotypic classes AA, Aa and aa, respectively); and $d_i$ the dominance effect, assumed to be null in this study ($d_i = 0$).
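As a worked illustration of Eq 1, the sketch below evaluates the genotypic value for a few genotypes using the constants given in the text (μ = 1.0, ρk = 2.5, four loci, no dominance). The function name `genotypic_value` is hypothetical and is not taken from the authors' code.

```python
MU = 1.0    # genotypic population mean, from the text
RHO = 2.5   # effect of the favorable allele at each locus, from the text

def genotypic_value(alpha, mu=MU, rho=RHO, dominance=0.0):
    """Eq 1: mu + additive effect + dominance, with additive = sum of rho * alpha_k."""
    additive = sum(rho * a for a in alpha)
    return mu + additive + dominance

# Loci are coded 1 (AA), 0 (Aa) and -1 (aa), as in the text.
print(genotypic_value([1, 1, 1, 1]))      # all four loci AA
print(genotypic_value([0, 0, 0, 0]))      # all four loci Aa
print(genotypic_value([-1, -1, -1, -1]))  # all four loci aa
```

An individual that is homozygous for the favorable allele at all four loci has a genotypic value of 1.0 + 4 × 2.5 = 11, the value used later as the fixation target.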

The phenotypic values ($vf_i$) were determined by the following equation:

$$vf_i = vg_i + \varepsilon_i \qquad (2)$$

where $vf_i$ is the phenotypic value of individual i; $vg_i$ the genotypic value of individual i; and $\varepsilon_i$ the environmental effect generated according to a normal distribution, where mean and variance are compatible with the specific trait heritability ($\varepsilon_i \sim N(0, \sigma_e^2)$), with $\sigma_e^2 = \sigma_g^2 (1 - h^2)/h^2$, where $\sigma_g^2$ is the genetic variance [33]. The phenotypic and genotypic simulated data sets are freely accessible at https://zenodo.org/record/4292736#.X8BDrmhKjIU.
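The relation between heritability and environmental variance can be sketched as follows. It assumes a zero-mean normal environmental effect whose variance follows from h² = σg² / (σg² + σe²). The random F2 genotypes, the seed and the function names are illustrative assumptions, not the published data set.

```python
import random
import statistics

def environmental_variance(genetic_variance, h2):
    """Solve h2 = var_g / (var_g + var_e) for var_e."""
    return genetic_variance * (1.0 - h2) / h2

def simulate_phenotypes(genotypic_values, h2, rng):
    """Add a zero-mean normal environmental effect to each genotypic value."""
    var_g = statistics.pvariance(genotypic_values)
    sd_e = environmental_variance(var_g, h2) ** 0.5
    return [vg + rng.gauss(0.0, sd_e) for vg in genotypic_values]

rng = random.Random(1)  # fixed seed only so the illustration is repeatable
# Hypothetical F2: four unlinked loci, each segregating 1/4 AA : 1/2 Aa : 1/4 aa.
genotypes = [[rng.choice([1, 0, 0, -1]) for _ in range(4)] for _ in range(625)]
vg = [1.0 + sum(2.5 * a for a in g) for g in genotypes]
vf = simulate_phenotypes(vg, 0.40, rng)

realized_h2 = statistics.pvariance(vg) / statistics.pvariance(vf)
print(round(realized_h2, 2))
```

The realized heritability fluctuates around the target because both variances are estimated from a finite sample.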

The advanced generations F3, F4, F5 and F6 were obtained from F2 as base generation by selfing. The individuals with the highest GEBV obtained by an adjusted GWS model in F2 were selected. The number of selected individuals depends on the selection intensity. The selection/simulation process of the progenies was repeated until the sixth generation. The generations F3 to F6, with 200 individuals [34], were generated from the genotype of the selected individuals, simulating a selfing process. The F3 to F6 populations were simulated using software R [35].
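The selection and selfing cycle described above can be sketched in a simplified form. The sketch tracks only the four trait loci, treats them as unlinked, uses the sum of the locus codes as a stand-in for GEBV, and draws a selected parent at random for each offspring. All of these are assumptions; the text does not specify how offspring were allocated among the selected parents.

```python
import random

def self_offspring(parent, rng):
    """One selfed offspring; loci coded 1 (AA), 0 (Aa), -1 (aa), assumed unlinked."""
    child = []
    for locus in parent:
        if locus == 0:
            # Aa selfed segregates 1/4 AA : 1/2 Aa : 1/4 aa
            child.append(rng.choice([1, 0, 0, -1]))
        else:
            child.append(locus)  # homozygous loci breed true
    return child

def next_generation(population, scores, intensity, n_offspring, rng):
    """Keep the top `intensity` fraction by score, then self the selected parents."""
    n_selected = max(1, round(intensity * len(population)))
    ranked = sorted(range(len(population)), key=lambda i: scores[i], reverse=True)
    parents = [population[i] for i in ranked[:n_selected]]
    return [self_offspring(rng.choice(parents), rng) for _ in range(n_offspring)]

rng = random.Random(7)  # fixed seed only so the illustration is repeatable
f2 = [[rng.choice([1, 0, 0, -1]) for _ in range(4)] for _ in range(625)]
f2_scores = [sum(ind) for ind in f2]   # stand-in for GEBV
f3 = next_generation(f2, f2_scores, 0.10, 200, rng)
f3_scores = [sum(ind) for ind in f3]
print(sum(f2_scores) / len(f2), sum(f3_scores) / len(f3))
```

Repeating `next_generation` on each new population mimics the advance from F3 to F6.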

Genomic prediction

Based on the simulated F2 population, it was stipulated that 80% of the individuals would belong to the estimation population and 20% to the validation population. The genomic genetic values of the individuals were estimated by RQR [26] at different quantiles (0.10, 0.50 and 0.90), and by RR-BLUP [15] and BLASSO [36]. Two selection intensities (10% and 20%) and three heritability levels ($h^2$ = 0.10, 0.20 and 0.40) were considered, and each evaluated scenario was simulated 30 times. For all evaluated methods, the general GWS model was considered [15]:

$$y_i = \mu + \sum_{j} x_{ij} g_j + e_i \qquad (3)$$

where $y_i$ is the ith observation of phenotype y (i = 1,2,…,625); $\mu$ the overall mean; $g_j$ the effect of the jth marker (j = 1,2,…,1040); $x_{ij}$ are the elements of the incidence matrix of marker j in individual i, with parameterization 1, 0 and −1; and $e_i$ is the ith observation of the random error.
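The 80/20 partition of the F2 population can be sketched as below. The text does not state how individuals were assigned to each set, so the random shuffle, the seed and the function name `split_population` are assumptions.

```python
import random

def split_population(n_individuals, train_fraction, rng):
    """Randomly partition individual indices into estimation and validation sets."""
    indices = list(range(n_individuals))
    rng.shuffle(indices)
    n_train = round(train_fraction * n_individuals)
    return sorted(indices[:n_train]), sorted(indices[n_train:])

rng = random.Random(42)  # fixed seed only so the illustration is repeatable
estimation, validation = split_population(625, 0.80, rng)
print(len(estimation), len(validation))
```

With 625 individuals this yields 500 for estimation and 125 for validation.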

The parameters of model (3) were estimated by three methodologies: RQR, at three quantiles (τ = 0.10, 0.50 and 0.90), BLASSO and RR-BLUP.

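In RQR, the marker effects are computed by solving the following optimization problem [26]:

$$\min_{\mu,\, g} \left[ \sum_{i=1}^{n} \rho_\tau\!\left( y_i - \mu - \sum_{j=1}^{p} x_{ij} g_j \right) + \lambda \sum_{j=1}^{p} |g_j| \right] \qquad (4)$$

where $\sum_{j=1}^{p} |g_j|$ is the sum of the absolute values of the regression coefficients, $\lambda$ the penalty parameter, n = 625, p = 1040 and $\rho_\tau(\cdot)$ is the “check function” of Koenker and Bassett [24], defined by:

$$\rho_\tau(u) = \begin{cases} \tau u, & u \ge 0 \\ (\tau - 1)\, u, & u < 0 \end{cases}$$

In this study, τ = 0.1, τ = 0.5 and τ = 0.9.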
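To make the objective in Eq 4 concrete, the sketch below evaluates the check function and the penalized loss for a given intercept and set of marker effects. It only evaluates the objective and does not solve the minimization, which the study carried out with `rq` from the quantreg package. The function names are hypothetical.

```python
def check_loss(u, tau):
    """Check function: tau * u for u >= 0, (tau - 1) * u for u < 0."""
    return tau * u if u >= 0 else (tau - 1.0) * u

def rqr_objective(y, X, mu, g, tau, lam):
    """Check loss of the residuals plus the L1 penalty on the marker effects."""
    fit = 0.0
    for yi, xi in zip(y, X):
        residual = yi - mu - sum(xij * gj for xij, gj in zip(xi, g))
        fit += check_loss(residual, tau)
    return fit + lam * sum(abs(gj) for gj in g)

# At tau = 0.5 positive and negative residuals are weighted equally;
# at tau = 0.9 under-prediction (positive residual) costs nine times more.
print(check_loss(2.0, 0.5), check_loss(-2.0, 0.5))
print(check_loss(2.0, 0.9), check_loss(-2.0, 0.9))
```

The asymmetric weighting is what moves the fitted surface toward the lower or upper part of the phenotype distribution when τ departs from 0.5.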

Note that, in QR, the coefficients are estimated by minimizing the weighted sum of the absolute deviations between the observed and estimated values [25]. Linear programming algorithms are used for this purpose [37, 38]. One of the methods used to estimate these coefficients is the Simplex Method, described in detail by Koenker [38].

After estimating the regression coefficients (marker effects) for the three quantiles considered (τ = 0.1, τ = 0.5 and τ = 0.9), the GEBV of the ith individual, based on quantile τ ($GEBV_i(\tau)$), was calculated by the following equation:

$$GEBV_i(\tau) = \sum_{j} x_{ij}\, \hat{g}_j(\tau) \qquad (5)$$

where $\hat{g}_j(\tau)$ is the estimated effect of the jth SNP marker based on quantile τ (τ = 0.1, τ = 0.5 and τ = 0.9). The GEBVs were also determined by BLASSO and RR-BLUP, according to the equation:

$$GEBV_i = \sum_{j} x_{ij}\, \hat{g}_j \qquad (6)$$

where $\hat{g}_j$ is the effect of the jth SNP marker, estimated by the two said methods.
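Once marker effects are available, computing GEBVs and ranking individuals is a matter of a weighted sum and a sort. The sketch below uses a toy example with three markers and made-up effects; the function names and all numbers are hypothetical.

```python
def gebv(marker_row, effects):
    """GEBV of one individual: sum over markers of genotype code times estimated effect."""
    return sum(x * g for x, g in zip(marker_row, effects))

def select_top(values, intensity):
    """Indices of the best individuals at the given selection intensity."""
    n_keep = max(1, round(intensity * len(values)))
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    return order[:n_keep]

effects = [0.5, 2.0, 0.25]   # hypothetical estimated marker effects
markers = [[1, 0, -1], [1, 1, 1], [-1, 0, 0], [0, 1, -1], [0, 0, 0]]
values = [gebv(row, effects) for row in markers]

print(values)
print(select_top(values, 0.20), select_top(values, 0.40))
```

The same ranking step applies whether the effects come from RQR at a given τ, from BLASSO or from RR-BLUP.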

According to the recommendation of Santos et al. [30], the penalty parameter λ of RQR was defined as half the penalty parameter resulting from the BLASSO method.

To compare the analyzed methodologies, the predictive capacity of the methods was calculated, which is the correlation coefficient between the observed phenotypic values (y) and the estimated genomic values (GEBV) in each generation. The genotypic means and variances in each generation were also determined.
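Predictive capacity as defined here is a Pearson correlation, which can be sketched in a few lines. The function name is hypothetical, and returning NaN when one of the series has zero variance is a design choice of this sketch, not something stated in the text.

```python
import math

def predictive_capacity(observed, predicted):
    """Pearson correlation between observed phenotypes and predicted genomic values."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_o == 0 or var_p == 0:
        return float("nan")  # correlation is undefined without variation
    return cov / math.sqrt(var_o * var_p)

print(predictive_capacity([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0]))
print(predictive_capacity([1.0, 2.0, 3.0, 4.0], [8.0, 6.0, 4.0, 2.0]))
```

The zero-variance guard matters in late generations, where the predicted values of a population approaching fixation may show almost no variation.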

Based on Eq 1, with $\mu = 1.0$ and $\rho_k = 2.5$ at each of the four loci, favorable alleles are considered fixed in a given generation when the genotypic mean of the population reaches 11 (1.0 + 4 × 2.5) and the genotypic variance reaches zero.
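The fixation criterion can be expressed as a simple check on the genotypic values of a generation. The tolerance argument in the sketch below is an assumption added for illustration; the text states the criterion only as mean 11 and variance zero.

```python
import statistics

def is_fixed(genotypic_values, target_mean=11.0, tol=1e-9):
    """True when the genotypic mean equals the target and the variance is zero, within tol."""
    mean = statistics.fmean(genotypic_values)
    variance = statistics.pvariance(genotypic_values)
    return abs(mean - target_mean) <= tol and variance <= tol

fixed_population = [11.0] * 200
segregating_population = [11.0] * 199 + [8.5]
print(is_fixed(fixed_population), is_fixed(segregating_population))
```

A single individual still carrying one heterozygous locus (genotypic value 8.5) is enough to fail the strict check.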

Computational aspects

The calculations and estimates were performed in R [35]. The function used to estimate the regression parameters at the three quantiles was rq of the quantreg package [39]. The regression coefficients were estimated by RR-BLUP using the mixed.solve function of package rrBLUP [40]. The Bayesian models were fitted with the BGLR function in package BGLR [41], with 100,000 iterations of the MCMC (Markov chain Monte Carlo) algorithm, of which the first 10,000 were discarded as burn-in, and a thinning interval of 5. Convergence analysis was performed based on the criteria proposed by Raftery and Lewis [42] and Heidelberger and Welch [43].
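The MCMC settings imply the number of posterior samples retained for inference. The sketch assumes that thinning is applied after the burn-in has been discarded.

```python
N_ITER = 100_000   # total MCMC iterations, from the text
BURN_IN = 10_000   # discarded iterations, from the text
THIN = 5           # thinning interval, from the text

# Assumption: every THIN-th draw after burn-in is kept.
retained = (N_ITER - BURN_IN) // THIN
print(retained)
```

Under this assumption, 18,000 draws per chain enter the posterior summaries.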

Results and discussion

The mean estimates of the predictive capacity (PC) in scenarios with heritability of 0.10 varied between 0.20 (RQR (0.10)) and 0.45 (BLASSO and RR-BLUP) in generation F2 (Fig 1); between 0.40 (RQR (0.10)) and 0.50 (BLASSO and RR-BLUP) at heritability 0.20 (Fig 2); and between 0.65 (RQR (0.10) and RQR (0.90)) and 0.70 (RR-BLUP) in the scenarios with heritability of 0.40 (Fig 3). In general, PC rises as heritability increases (Figs 1–3). This result was already expected, since traits with higher heritability are less affected by environmental variation, facilitating the breeding process [44].

Fig 1. Average predictive capacity (y-axis) of the models evaluated over five generations (x-axis).

Considering a heritability of 0.10 and two selection intensities. (A) SP = 10%; (B) SP = 20%.

https://doi.org/10.1371/journal.pone.0243666.g001

Fig 2. Average predictive capacity (y-axis) of the models evaluated over five generations (x-axis).

Considering a heritability of 0.20 and two selection intensities. (A) SP = 10%; (B) SP = 20%.

https://doi.org/10.1371/journal.pone.0243666.g002

Fig 3. Average predictive capacity (y-axis) of the models evaluated over five generations (x-axis).

Considering a heritability of 0.40 and two selection intensities. (A) SP = 10%; (B) SP = 20%.

https://doi.org/10.1371/journal.pone.0243666.g003

Over the generations, the PC estimates decreased, to values close to zero in F6 in several scenarios (Figs 1–3). This result can be explained by the fact that the model to predict GEBV in the F3–F6 generations was adjusted in F2. Specifically, since selection occurs over generations, the allele frequency of the initial generation changes, which leads to a reduction in the marker-QTL linkage disequilibrium (LD). Over the generations, Sant’Anna et al. [33] observed a drop in LD, which is reflected in the predictive capacity of the model for autogamous populations. In allogamous species, on the other hand, LD is dissipated by advancing a single generation, resulting in a low efficiency of GS procedures based on models adjusted in previous populations.

There was an increase in means over the generations and a decrease in the genotypic variances to values close to zero from the third generation onwards, for all evaluated methods (Table 1 and S1–S3 Figs). These results are in line with the theory of quantitative population genetics, which states that in response to directional selection, the allele frequency of traits with few major-effect loci changes rapidly, inducing a phenotypic response [45]. In this way, the population mean increases since the allele value is positive in the simulation process.

Table 1. Means and mean genotypic variances (n = 30) of five generations in response to selection by five predictive methods, based on three heritability levels (0.10, 0.20 and 0.40) and two selection intensities (10 and 20%).

https://doi.org/10.1371/journal.pone.0243666.t001

The results of fixation or non-fixation of favorable alleles were different in the evaluated scenarios. In scenarios with trait heritability of 0.10, the RQR (τ = 0.1) with a selection intensity of 10% failed to fix the favorable alleles until the sixth generation, as it did not reach the genotypic mean 11 (8.54 ± 2.19) (Table 1). When selection was based on the models RQR (τ = 0.1) or BLASSO, with a selection intensity of 20%, the favorable alleles were not fixed until the sixth generation, as the genotypic variance did not reach zero by these methods (4.50 ± 3.57 and 2.24 ± 1.99, respectively) (Table 1). For the other methods, even in low-heritability scenarios, the favorable alleles could be fixed until the sixth generation (Table 1). According to Goddard [46], the speed at which a population increases or decreases the level of an allele depends on its initial frequency. Thus, the greatest difficulty in fixing alleles in a low-heritability scenario may be due to the greater environmental effect that affects the estimation of GEBV, making it even more difficult to select individuals with the desired alleles than in the other scenarios, where the disturbing environmental effect is lower.

For the scenarios with a trait heritability of 0.20 and regardless of the selection intensity, the tested methods allowed the fixation of favorable alleles until the sixth generation, except for RQR (τ = 0.90), at a selection intensity of 20% (Table 1). Selection based on the BLASSO models, at an intensity of 10%, and on RQR (τ = 0.50), at intensities of 10 and 20%, reached fixation in F5 (Table 1).

Moreover, at a heritability level of 0.40, RQR (τ = 0.50) at a selection intensity of 10% allowed the establishment of favorable alleles as early as in the fourth generation, with a genotypic mean of 10.86 ± 0.21 and genotypic variance of 0.50 ± 0.69, while the other methods allowed allele fixation in the fifth or sixth generation (Table 1). With these results, there was a reduction of one (h2 = 0.20) or two generations (h2 = 0.40) in the fixation process of favorable alleles. The reduction of generations in a plant breeding program is decisive in view of the savings in terms of time, efforts and costs. In coffee for example, one selection generation lasts on average six years [47], i.e., by this technique, the breeding process can be considerably reduced, thus reducing the time required to develop genetically superior genotypes and, consequently, save costs.

Although the BLASSO and RR-BLUP methods had the highest predictive capacity in F2 in all evaluated scenarios, their results for favorable allele fixation were equal to or worse than those obtained with RQR (0.50).

Generally speaking, the breeding process by RQR can be equal to or faster than by the standard GS methodologies. Although to date little explored in breeding, the RQR method has been shown to be very promising for genomic selection and association studies, in both plant and animal breeding [27–30, 48]. In this study, RQR (τ = 0.50) fixed the favorable alleles in the fourth generation (F4) in the scenario with a heritability of 0.40 and selection intensity of 10%. The efficiency of RQR, in contrast with the traditional methods, based on conditional means, can be explained by the possibility of fitting models at different levels (quantiles) of the phenotype distribution, and consequently making a more thorough study of the phenomenon of interest possible [24]. Specifically, for highly skewed phenotypic distributions, the results of quantile models that allow a quantile fitting far from the mean are interesting. In an evaluation of quantiles 0.25 and 0.75 for right- and left-skewed distributions, respectively, Nascimento et al. [27] and Barroso et al. [28] observed that these models have a higher predictive capacity and lower mean square errors than the traditional GS methodologies, respectively.

In this study, since the data were generated assuming a normal (symmetrical) distribution, better results were expected from mean- or median-based methodologies. However, the best results were based on medians, which may be related to the rarity of occurrence, both in simulated and in practical processes, of a perfectly symmetrical distribution. Thus, a median-based methodology such as RQR (τ = 0.50) can better describe the functional relationship between the dependent and explanatory variables and is robust to outliers, in cases of symmetry deviations in the phenotype distribution [25, 38].

Conclusions

The use of Regularized Quantile Regression models proved effective in genomic selection studies, for allowing an accelerated development of superior genotypes in relation to traditional GS methodologies. Among the simulated conditions, the configuration of Regularized Quantile Regression (τ = 0.50), at a heritability of 0.40 and selection intensity of 10% was the most efficient, since favorable alleles could be fixed more quickly, as early as in the fourth generation.

Supporting information

S1 Fig. Means (blue lines) and mean genotypic variances (red lines) of the models evaluated over five generations.

Considering heritability 0.10 and two selection intensities (SP). (A) SP = 10%; (B) SP = 20%.

https://doi.org/10.1371/journal.pone.0243666.s001

(TIF)

S2 Fig. Means (blue lines) and mean genotypic variances (red lines) of the models evaluated over five generations.

Considering heritability 0.20 and two selection intensities (SP). (A) SP = 10%; (B) SP = 20%.

https://doi.org/10.1371/journal.pone.0243666.s002

(TIF)

S3 Fig. Means (blue lines) and mean genotypic variances (red lines) of the models evaluated over five generations.

Considering heritability 0.40 and two selection intensities (SP). (A) SP = 10%; (B) SP = 20%.

https://doi.org/10.1371/journal.pone.0243666.s003

(TIF)

References

  1. Organização das Nações Unidas [Internet homepage]. População mundial deve chegar a 9.7 bilhões de pessoas em 2050, diz relatório da ONU [accessed 25 Jan 2020]. Available from: https://nacoesunidas.org/populacao-mundial-deve-chegar-a-97-bilhoes-de-pessoas-em-2050-diz-relatorio-da-onu/.
  2. Godfray HCJ, Beddington JR, Crute IR, Haddad L, Lawrence D, Muir JF, et al. Food Security: The Challenge of Feeding 9 Billion People. Science. 2010; 327(5967): 812–818. pmid:20110467
  3. Hunter MC, Smith RG, Schipanski ME, Atwood LW, Mortensen DA. Agriculture in 2050: Recalibrating targets for sustainable intensification. Bioscience. 2017; 67(4): 386–391.
  4. Cobb JN, Biswas PS, Platten JD. Back to the future: revisiting MAS as a tool for modern plant breeding. Theor Appl Genet. 2019; 132(3): 647–667. pmid:30560465
  5. Guerreiro Filho O, Ramalho MAP, Andrade VT. Alcides Carvalho and the selection of Catuaí cultivar: Interpreting the past and drawing lessons for the future. Crop Breed Appl Biotechnol. 2018; 18(4): 460–466.
  6. Akdemir D, Beavis W, Fritsche-Neto R, Singh AK, Isidro-Sánchez J. Multi-objective optimized genomic breeding strategies for sustainable food improvement. Heredity (Edinb). 2019; 122(5): 672–683. pmid:30262841
  7. Byrne PF, Volk GM, Gardner C, Gore MA, Simon PW, Smith S. Sustaining the future of plant breeding: The critical role of the USDA-ARS national plant germplasm system. Crop Sci. 2018; 58(2): 451–468.
  8. Barbosa I de P, da Costa WG, Nascimento M, Cruz CD, de Oliveira ACB. Recommendation of Coffea arabica genotypes by factor analysis. Euphytica. 2019; 215(10): 1–10.
  9. Lado J, Moltini AI, Esteban V, Rodríguez G, Arcia P, Rodríguez M, et al. Integration of sensory analysis into plant breeding: review. Agrociencia Uruguay. 2019; 23(01): 1–15.
  10. Marie L, Abdallah C, Campa C, Courtel P, Bordeaux M, Navarini L, et al. G × E interactions on yield and quality in Coffea arabica: new F1 hybrids outperform American cultivars. Euphytica. 2020; 216(5): 1–17.
  11. Setotaw TA, Caixeta ET, Zambolim EM, Sousa TV, Pereira AA, Baião AC, et al. Genome Introgression of Híbrido de Timor and Its Potential to Develop High Cup Quality C. arabica Cultivars. J Agric Sci. 2020; 12(4): 64–76.
  12. Valencia A, Morales AY, Moncada P, Alfonso H, Herrera JC. Introgression of the SH3 gene resistant to rust (Hemileia vastatrix) in improved lines of CASTILLO ® variety (Coffea arabica L.). J Plant Breed Crop Sci. 2017; 9: 130–138.
  13. Alkimim ER, Caixeta ET, Sousa TV, Pereira AA, de Oliveira ACB, Zambolim L, et al. Marker-assisted selection provides Arabica coffee with genes from other Coffea species targeting on multiple resistance to rust and coffee berry disease. Mol Breed. 2017; 37(1): 6.
  14. Sousa TV, Caixeta ET, Alkimim ER, Oliveira ACB, Pereira AA, Sakiyama NS, et al. Early selection enabled by the implementation of genomic selection in Coffea arabica breeding. Front Plant Sci. 2019; 9: 1–12. pmid:30671077
  15. Meuwissen THE, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001; 157(4): 1819–1829. pmid:11290733
  16. Resende MDV de M De, Lopes PPS, da Silva RL, Pires IE, Silva RL Da, Pires IE. Seleção genômica ampla (GWS) e maximização da eficiência do melhoramento genético. Pesqui Florest Bras. 2008; (56): 63–77.
  17. Crossa J, Pérez P, de los Campos G, Mahuku G, Dreisigacker S, Magorokosho C. Genomic selection and prediction in plant breeding. J Crop Improv. 2011; 25(3): 239–261.
  18. Crossa J, Pérez-Rodríguez P, Cuevas J, Montesinos-López O, Jarquín D, de los Campos G, et al. Genomic selection in plant breeding: methods, models, and perspectives. Trends Plant Sci. 2017; 22(11): 961–975. pmid:28965742
  19. Heffner EL, Jannink J-L, Sorrells ME. Genomic Selection Accuracy using Multifamily Prediction Models in a Wheat Breeding Program. The Plant Genome Journal. 2011; 4(1): 65–75.
  20. Romero JV. Seleção genômica usando genotipagem de baixa saturação no melhoramento genético do cafeeiro. Viçosa. Tese [Doutorado em Genética e Melhoramento]. Universidade Federal de Viçosa; 2017.
  21. Spindel J, Begum H, Akdemir D, Virk P, Collard B, Redoña E, et al. Genomic selection and association mapping in rice (Oryza sativa): effect of trait genetic architecture, training population composition, marker number and statistical model on accuracy of rice genomic selection in elite, tropical rice breeding lines. PLoS Genet. 2015; 11(2): 1–25.
  22. Asoro FG, Newell MA, Beavis WD, Scott MP, Jannink J-L. Accuracy and Training Population Design for Genomic Selection on Quantitative Traits in Elite North American Oats. Plant Genome. 2011; 4(2): 132–144.
  23. Shengqiang Z, Dekkers JCM, Fernando RL, Jannink JL. Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: A barley case study. Genetics. 2009; 182(1): 355–364. pmid:19299342
  24. Koenker R, Bassett G. Regression Quantiles. Econometrica. 1978; 46(1): 33–50.
  25. Hao L, Naiman DQ. Quantile regression. New Delhi: Sage publications; 2007.
  26. Li Y, Zhu J. L1-norm quantile regression. J Comput Graph Stat. 2008; 17(1): 163–185.
  27. Nascimento M, Silva FF e., de Resende MDV, Cruz CD, Nascimento ACC, Viana JMS, et al. Regularized quantile regression applied to genome-enabled prediction of quantitative traits. Genet Mol Res. 2017; 16(1): 1–12. pmid:28340274
  28. Barroso LMA, Nascimento M, Nascimento ACC, Silva FF, Serão NVL, Cruz CD, et al. Regularized quantile regression for SNP marker estimation of pig growth curves. J Anim Sci Biotechnol. 2017; 8(1): 1–9. pmid:28702191
  29. Nascimento M, Nascimento ACC, Silva FF e., Barili LD, Do Vale NM, Carneiro JE, et al. Quantile regression for genome-wide association study of flowering time-related traits in common bean. PLoS One. 2018; 13(1): 1–14. pmid:29300788
  30. dos Santos PM, Nascimento ACC, Nascimento M, Fonseca e Silva F, Azevedo CF, Mota RR, et al. Use of regularized quantile regression to predict the genetic merit of pigs for asymmetric carcass traits. Pesqui Agropecu Bras. 2018; 53(9): 1011–1017.
  31. Cruz CD. GENES - Software para análise de dados em estatística experimental e em genética quantitativa. Acta Sci Agron. 2013; 35(3): 271–276.
  32. da Costa e Silva L, Cruz CD, Moreira MA, de Barros EG. Simulation of population size and genome saturation level for genetic mapping of recombinant inbred lines (RILs). Genet Mol Biol. 2007; 30(4): 1101–1108.
  33. Sant’Anna I de C, Diniz Cabral Ferreira RA, Nascimento M, Silva GN, Carneiro VQ, Cruz CD, et al. Multigenerational prediction of genetic values using genome-enabled prediction. PLoS One. 2019; 14(1): 1–14.
  34. Ferreira A, da Silva MF, da Costa e Silva L, Cruz CD. Estimating the effects of population size and type on the accuracy of genetic maps. Genet Mol Biol. 2006; 29(1): 187–192.
  35. R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2020; URL http://www.R-project.org/.
  36. De Los Campos G, Naya H, Gianola D, Crossa J, Legarra A, Manfredi E, et al. Predicting quantitative traits with regression models for dense molecular markers and pedigree. Genetics. 2009; 182(1): 375–385. pmid:19293140
  37. Silva EN da, Sabino da Silva Porto Júnior. Sistema financeiro e crescimento econômico: uma aplicação de regressão quantílica. Econ Apl. 2006; 10(3): 425–442.
  38. Koenker R. Quantile regression. New York: Cambridge University Press; 2005.
  39. Koenker R. quantreg: Quantile regression. R package version 4.91. http://CRAN.R-project.org/package=quantreg, 2015.
  40. Endelman JB. Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP. The Plant Genome. 2011; 4(3): 250–255.
  41. de los Campos G, Pérez Rodriguez P. BGLR: Bayesian Generalized Linear Regression. URL: http://cran.r-project.org/web/packages/BGLR/index.html, 2015.
  42. Raftery AE, Lewis SM. Comment: One long run with diagnostics: Implementation strategies for Markov chain Monte Carlo. Stat Sci. 1992; 7(4): 493–497.
  43. Heidelberger P, Welch PD. Simulation Run Length Control in the Presence of an Initial Transient. Oper Res. 1983.
  44. Legarra A, Robert-Granié C, Manfredi E, Elsen JM. Performance of genomic selection in mice. Genetics. 2008; 180(1): 611–618. pmid:18757934
  45. Falconer DS, Mackay TFC. Introduction to quantitative genetics. Essex, England: Longman; 1996.
  46. Goddard ME HB. Genomic selection. Genetics in the Third Millennium. 2015; 12(4): 3794–3805.
  47. da Conceição AS, Fazuoli LC, Braghini MT. Avaliação e seleção de progênies F3 de cafeeiros de porte baixo com o gene SH3 de resistência a Hemileia vastatrix berk. et br. Bragantia. 2005; 64(4): 547–559.
  48. Nascimento M, Nascimento ACC, Dekkers JCM, Serão NVL. Using quantile regression methodology to evaluate changes in the shape of growth curves in pigs selected for increased feed efficiency based on residual feed intake. Animal. 2019; 13(5): 1009–1019. pmid:30306885