Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Adaptability and stability evaluation of maize hybrids using Bayesian segmented regression models

  • Tâmara Rebecca A. Oliveira ,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Resources, Writing – original draft, Writing – review & editing

    tamara_rebecca@hotmail.com

    Affiliation Núcleo de Graduação de Agronomia, Universidade Federal de Sergipe, Campus Sertão, Nossa Senhora da Glória, Sergipe, Brazil

  • Hélio Wilson L. Carvalho,

    Roles Conceptualization, Supervision

    Affiliation Embrapa Tabuleiros Costeiros, Aracaju, Sergipe, Brazil

  • Moysés Nascimento,

    Roles Methodology, Writing – review & editing

    Affiliation Laboratório de Inteligência Computacional e Aprendizado Estatistico, Departamento de Estatística, Universidade Federal de Viçosa, Viçosa, Brazil

  • Emiliano Fernandes N. Costa,

    Roles Resources, Supervision

    Affiliation Embrapa Tabuleiros Costeiros, Aracaju, Sergipe, Brazil

  • Gustavo Hugo F. Oliveira,

    Roles Conceptualization, Methodology, Writing – review & editing

    Affiliation Núcleo de Graduação de Agronomia, Universidade Federal de Sergipe, Campus Sertão, Nossa Senhora da Glória, Sergipe, Brazil

  • Geraldo A. Gravina,

    Roles Supervision, Writing – review & editing

    Affiliation Laboratório de Engenharia Agrícola, Centro de Ciências e Tecnologias Agropecuárias, Universidade Estadual do Norte Fluminense, Rio de Janeiro, Brazil

  • Antonio T. Amaral Junior,

    Roles Formal analysis, Investigation, Methodology

    Affiliation Laboratório de Engenharia Agrícola, Centro de Ciências e Tecnologias Agropecuárias, Universidade Estadual do Norte Fluminense, Rio de Janeiro, Brazil

  • José Luiz S. Carvalho Filho

    Roles Conceptualization, Investigation, Supervision

    Affiliation Departamento de Agronomia, Universidade Federal Rural de Pernambuco, Recife, Brazil

Abstract

The occurrence of genotype by environment interaction (G x E), which is defined as the differential response of genotypes to environmental variation, is frequently reported in maize cultures, making it challenging to recommend cultivars. Methods allowing to study the potential nonlinear pattern of genotype responses to environmental variation allied to prior beliefs on unknown parameters are interesting to evaluate the phenotypic adaptability and stability of genotypes. In this context, the present study aimed to assess the adaptability and stability of maize hybrids, by using the Bayesian segmented regression model, and evaluate the efficacy of using informative and minimally informative prior distributions for the selection of cultivars. Randomized complete-block design experiments were carried out to study the yield (kg/ha) of 25 maize hybrids, in 22 different environments, in Northeastern Brazil. The Bayesian segmented regression model fitted using informative prior distributions presented lower credibility intervals and Deviance Criterium of Information values, compared to those obtained by fitting using minimally informative distributions. Therefore, the model using informative prior distributions was considered for the adaptability and stability evaluation of maize genotypes. Once most northeastern farmers in Brazil have limited capital, the genotype P4285HX should be considered for planting, due to its high yield performance and adaptability to unfavorable environments.

Introduction

Maize (Zea mays L.) cultures are appreciated worldwide. Thus it has tremendous relevance due to its several uses and applications in areas ranging from animal feed to technological industries.

Furthermore, because maize is grown under different environmental conditions, it interacts with various environments, resulting in varied genotype performances [1]. Such interactions hinder the genotype sealing works given that the best-suited genotype for a specific environment may not be best suited for another environment where such interactions take place. Thus, recommendations for the broad adaptability and stability of cultivars become costly [1].

The literature presents several methodologies to study phenotypic adaptability and stability. Examples of such methods are those based on simple linear regression [2], piecewise linear regression [3, 4], mixed models (REML/BLUP) [5], non-parametric methods [6, 7], multivariate analysis, for example, multiple and expanded centroid methods [8, 9] and Bayesian inference for simple linear regression [10].

Unlike the deterministic and frequentist methods, the Bayesian framework allows for the incorporation of additional information relating to the parameters through prior distributions, which are characterized by the probability distribution. According to [11], all information is useful and must be used in the Bayesian analysis. Additionally, owing to the large quantity of information available from previous studies, incorporating this information during modeling is reasonable [12].

Despite being interesting, the Bayesian approach to adaptability and stability studies is based on a simple regression model [10]. According to [13], the simple linear regression models are unable to fit a potential nonlinear pattern to genotype responses to environmental variations. Aiming to solve this deficiency, under a statistical “frequentist” framework, [3, 4] proposed the segmented regression model allowing the identification of the “ideal” genotype, which presents high yield performance, high stability and low sensitivity to adverse conditions. Nascimento et al. [12] proposed the Bayesian segmented regression model approach to analyze phenotypic adaptability and stability. This approach differs from the “frequentist” framework, allowing the addition of prior beliefs to unknown parameters, bringing new insights for plant breeders. Additionally, this method allows for the exploitation of potential nonlinear patterns of genotype responses to environmental variations, aiming to identify genotypes that present high yield performance, and high stability under adverse conditions. This genotype is denoted as “ideal,” according to [4].

In light of the above, the present study aimed to assess the adaptability and stability of maize hybrids by using the Bayesian segmented regression model and evaluate the efficacy of using informative and minimally informative prior distributions in the selection of cultivars.

Materials and methods

During the agricultural years 2012 and 2013, 25 maize hybrids from public and private companies from the states of Maranhão (Balsas, Brejo, Colinas, and São Raimundo das Mangabeiras counties), Piauí (Nova Santa Rosa, Teresina, and Uruçuí counties), and Sergipe (Nossa Senhora das Dores, Frei Paulo, and Umbaúba counties) were assessed. The assessments comprised 11 environments, where the Nossa Senhora das Dores County had two different fertilization and each one was assumed as a different environment (Table 1).

thumbnail
Table 1. List of hybrid maize cultivars and their respective origins, types, cycles, colors, grain textures and companies.

https://doi.org/10.1371/journal.pone.0236571.t001

During the trials, samples considered to have high fertilization ranges were treated with a total of 180.00 kg ha-1 of N, 149.80 kg ha-1 of P2O5 and 85.60 kg ha-1 of K2O, whereas samples considered to have low fertilization ranges were treated with 45.00 kg ha-1 of N, 37.80 kg ha-1 of P2O5 and 21.60 kg ha-1 of K2O, in the form of 535 and 135 kg ha-1 of 8-28-16 Zn at the time of sowing, respectively.

The experimental design was based on randomized blocks, with two repetitions, wherein each plot comprised four 5.0 m-long rows, with spacings of 0.70 m x 0.20 m, between rows, and between holes within the rows, respectively.

Fertilization was performed according to the results of the soil analysis from each experimental area. Irrigation was not carried out, and weed and pest control was performed according to the crop’s requirement in each region.

The maize yield data were subjected to variance analysis for each environment. A joint analysis was carried out by adopting the model: yijk = μ + r/ek(j) + ej + gi + geij + εijk, where yijk is the phenotypic mean, μ is the overall mean, r/ek(j) is the effect of the kth repetition in the jth environment, gi is the fixed effect of the ith genotype, ej is the effect of the jth environment normally and independently distributed (NID) , geij is the effect of the interaction of the ith genotype in the jth environment NID , and εijk is the experimental error

Model and Bayesian inference

The bi-segmented regression model is given by (1) where yij is the response of genotype i in environment j, βi0 is the mean response of genotype i, βi1 is the slope under the first regime (the linear regression coefficient related to the unfavorable environments), and βi2 represents the change in slope from the first to the second regime (βi1 + βi2 is the slope after the change-point, that is, the linear response to the favorable environments). Further, Ij is the coded environmental index, T(Ij) = 0 if Ij ≤ 0, or if Ij > 0, and is mean of the coded environmental index considering only environments with positive indexes and eij is the error term, NID (0, σ2).

The Bayesian approach for the bi-segmented model is described in Nascimento et al. [12]. In summary, assuming , each observation yij has a distribution , and the likelihood function for each genotype is given by (1.1)

The prior distributions for the parameters (βi0, βi1, βi2, ) are given by (1.2) (1.3) (1.4) and (1.5) where , ,, , , , and αi, βi are the known parameters. This last prior distribution is the Gamma distribution with mean and variance equal to e , respectively. Additionally, ie the precision is equal to .

The joint posterior distribution is proportional to the product of the likelihood function (Eq 1.1) and the prior distributions (Eqs 1.21.5).

(2)

To make inferences regarding the parameters in Eq 2, the Markov chain Monte Carlo (MCMC) was used to obtain the posterior marginal distributions for each parameter.

The marginal distribution samples of the stability parameter, , were obtained indirectly. This parameter is a function of . Therefore, using the values from each interaction, we obtain according to the following expression: , where MSR is the residual mean square obtained from the variance analysis and r is the number of repetitions of the experiment. The hypotheses of interest were tested by calculating the 95% credibility intervals for the parameters.

Priors distributions

Two models were fitted to assess the model’s goodness of fit. Model 1 (M1—minimally informative prior distributions) was characterized by minimally informative prior distributions, which were represented by distributions with large variances: ,, , and τie|αi, βi ~ Gamma(αi = 0.001, βi = 0.001).

Model 2 (M2—informative prior distributions), similar to the method employed in [12], was characterized by the estimates obtained from the frequentist analysis of the bi-segmented model, used as information to define the hyperparameters.

Assessing the model’s goodness of fit

Models M1 (minimally informative priors) and M2 (informative priors) were compared by means of the Deviance Information Criterion (DIC) [14]: . Here, is a point estimate of the deviance obtained by replacing the parameters with their posterior mean estimates in the likelihood function and 2pD is given by the effective number of parameters in the models. Models with lower DIC are preferred.

Bayesian analysis

We adopted MCMC chains considering 100,000 iterations of the Gibbs sampler algorithm. We set the burn-in to 10,000 iterations and thinned every five iterations. In each chain, we analyzed the posterior mean, standard deviation, 95% credibility intervals, and convergence criterion statistics [15, 16]. The methodology was implemented in software R [17], and the joint distribution samples were obtained using the rbugs function of the rbugs package [18], which was accomplished by fusing R and OpenBugs (a software application for the Bayesian analysis of complex statistical models using MCMC methods). The MCMC chain convergence was accessed by Geweke and Raftery-Lewis diagnostics using the package [19] provided in the R software [17].

Results and discussion

The analysis of variance of the maize yield (kg/ha) demonstrated that the genotypes, environments and the genotype x environment interaction (G×E) presented a significant effect (P < 0.05) (Table 2). The significance of G x E interaction indicates contrasts between environments and differential genotypic responses to environmental effects. The occurrence of G × E interaction, which can be defined as the differential response of genotypes to environmental variation, is frequently reported in maize cultures, making it challenging to recommend cultivars [2024].

thumbnail
Table 2. Mean squares from analysis of variance (ANOVA) for yield of 25 hybrid maize genotypes assessed in 22 environments.

https://doi.org/10.1371/journal.pone.0236571.t002

The posteriori means and their respective credibility intervals (CI) provided estimates for the adaptability and stability parameters. Considering the results provided by the model M1, which is characterized by the minimally informative prior distributions, most genotypes (14 genotypes) presented the linear regression coefficient related to the unfavorable environments equal to 1 (βi1 = 1), except 30A16HX, 2B707HX, 2B587HX, 30A37HX, 2B604HX, 20A55HR, 20A78HX and DKB370, which presented values higher than 1 (βi1 > 1), and the genotypes P4285HX and BRS2020, which presented values lower than 1 (βi1 < 1) (Table 3). Among those genotypes that presented the linear regression coefficient related to the unfavorable environments equal to 1, only two (30A68HX and AS1555YG) exhibited the linear response to the favorable environments higher than 1 (βi1 + βi2 > 1) (Table 3). No genotype presented stability parameter () equal to zero. On the other hand, the genotype AS1555YG presented coefficient of determination higher than 80% (Table 3). However, only the genotype 30A68HX presented higher mean productivity ().

thumbnail
Table 3. Estimates of the a posteriori means () and of the credible intervals1 (95%) of the adaptability2 () and stability parameters3 (, R2), by taking into consideration informative and non-informative priors for maize hybrids.

https://doi.org/10.1371/journal.pone.0236571.t003

Considering the results provided by the model M1, which is characterized by the minimally informative prior distributions, most genotypes (14 genotypes) presented the linear regression coefficient related to the unfavorable environments equal to 1 (βi1 = 1), except 30A16HX, 2B707HX, 2B587HX, 30A37HX, 2B604HX, 20A55HR, 20A78HX and DKB370, which presented values higher than 1 (βi1 > 1) and the genotypes P4285HX and BRS2020, which presented values lower than 1 (βi1 < 1) (Table 3). Among those genotypes that presented the linear regression coefficient related to the unfavorable environments equal to 1, only two (30A68HX and AS1555YG) presented the linear response to the favorable environments higher than 1 (βi1 + βi2 > 1) (Table 3). No genotype presented stability parameter () equal to zero. On the other hand, the genotype AS1555YG presented coefficient of determination higher than 80% (Table 3). However, only the genotype 30A68HX presented higher mean productivity ().

According to the results obtained by Model 2 (M2), which is characterized by the informative prior distributions, out of the 25 genotypes, 11 (30A68HX, 2B710HX, 30F53HR, 2B433HX, DKB370, AG8041YG, 30K73H, DKB330, AS1596YG and BM820) presented the linear regression coefficient related to the unfavorable environments equal to 1 (βi1 = 1). The nine genotypes observed in the previous analysis presented values higher than 1 (βi1 > 1) and five (P4285HX, AS1555YG, BRS2022, 30A91HX and BRS2020) presented values lower than 1 (βi1 < 1) (Table 3). Among those genotypes that presented the linear regression coefficient related to the unfavorable environments equal to 1, only the genotype 30A68HX showed the linear response to the favorable environments higher than 1 (βi1 + βi2 > 1) and high mean productivity () (Table 3). This genotype is suitable for growers who employ high-level technology, since it responds well to improved environments [13]. In line with the M1 results, the genotype 30A68HX presented stability parameter () higher than zero and coefficient of determination lower than 80% (Table 3).

The analysis considering M2 was able to better discriminate the genotypes, since, out of 25 genotypes, 14 and 11 presented the linear regression coefficient related to the unfavorable environments equal to 1 for M1 and M2 fitted models, respectively.

A comparative analysis of the limits of credibility intervals obtained by the two fitted models (M1 and M2) reveals that the use of informative prior distributions (M2) reduced the limits of credibility intervals, when compared to minimally informative prior distributions (M1). Similar results were observed by [11], who used the Bayesian segmented regression model for adaptability and stability evaluation of cotton genotypes. Nascimento et al. [10], Couto et al. [20] and Teodoro et al. [25] used the Eberhart and Russel’s Bayesian method to evaluate the phenotypic stability and adaptability of alfalfa and popcorn cultivars and obtained similar results. Additionally, the difference in DIC values between models using minimally informative and informative priors ranged between 1.59 and 2.01. Once smaller DIC values indicate better data fitting, these results demonstrate that M2 should be considered for the adaptability and stability evaluation of maize genotypes (Table 4).

thumbnail
Table 4. Deviance Information Criterion (DIC) values obtained through the difference of DIC values between models using minimally informative (i) and informative priors (j) for maize hybrids.

https://doi.org/10.1371/journal.pone.0236571.t004

Overall, the Bayesian framework of the segmented regression model allowed the incorporation of additional information related to the parameters, through prior distributions, which reduced the ranges of the credibility intervals, increased the precision of parameter estimates, and, consequently, provided reliable genotype selection. In practice, this information can be obtained from previous studies, including [10, 20]. Due to the lack of prior information related to the evaluated maize hybrids in the literature, in this work, the estimates obtained from the frequentist analysis of the segmented model were used to define the hyperparameters.

In practice, most northeastern farmers in Brazil have limited capital, which prevents them from investing in production technology. Therefore, genotypes adapted to unfavorable environments should be considered for low technology planting [26]. The recommendation of cultivars not adapted to regional conditions leads to low yield and other serious problems, such as the indiscriminate use of pesticides and excessive cultural treatment [27]. Considering the results provided by the model M2 (informative prior distributions), only the genotype P4285HX presented the linear regression coefficient related to the unfavorable environments lower than 1 (βi1 < 1) and high mean productivity () (Table 3).

Conclusions

Incorporating additional information about the parameters through prior distributions decreases the credibility interval ranges. The difference in DIC values between models using minimally informative (M1) and informative priors (M2) was positive, which indicates a better data fitting, considering M2. Therefore, it should be an alternative for the adaptability and stability evaluation of maize genotypes.

The genotype P4285HX presents high yield performance and adaptability to unfavorable environments and should be considered for low technology planting, which is practiced by northeastern Brazilians farmers.

References

  1. 1. Oliveira IJ, Atroch AL, Dias MC, Guimarães LJM, Guimarães PEO. Seleção de cultivares de milho quanto à produtividade, estabilidade e adaptabilidade no Amazonas. Pesq Agropec Bras. 2017; 52: 453–461.
  2. 2. Eberhart ST, Russell W. Stability parameters for comparing varieties. Crop Sci. 1966; 6: 36–40.
  3. 3. Verma MM, Chahal G, Murty B. Limitations of conventional regression analysis a proposed modification. Theo Appl Genet. 1978; 53: 89–91.
  4. 4. Cruz CD, Torres RD, Vencovsky R. An alternative approach to the stability analysis proposed by Silva and Barreto. Rev Bras Gen. 1989; 12: 567–580.
  5. 5. Resende MDV. SELEGEN-REML/BLUP. Viçosa:UFV; 2007.
  6. 6. Lin CS, Binns MR. A superiority measure of cultivar performance for cultivar× location data. Can J Plant Sci. 1988; 68: 193–198.
  7. 7. Nascimento M, Ferreira A, Ferrão RG, Campana ACM, Bhering LL, Cruz CC, et al. Adaptability and stability based on nonparametric regression in coffee genotypes Pesq Agropec Bras. 2010; 45: 41–48.
  8. 8. Nascimento M, Cruz CD, Campana ACM, Tomaz RS, Salgado CC, Ferreira RDP. Alteração no método centroide de avaliação da adaptabilidade genotípica. Pesq Agropec Bras. 2009; 44: 263–269.
  9. 9. Nascimento M, Ferreira A, Campana ACM, Salgado CC, Cruz CD. Multiple centroid methodology to analyze genotype adaptability. Ceres. 2015; 62: 30–36.
  10. 10. Nascimento M, Silva FF, Sáfadi T, Nascimento ACMC, Paula Ferreira R, et al. Abordagem bayesiana para avaliação da adaptabilidade e estabilidade de genótipos de alfafa. Pesq Agropec Bras. 2011; 46: 26–32.
  11. 11. Gamerman D, Migon HS. Inferência estatística: uma abordagem integrada. Rio de Janeiro: Universidade Federal do Rio de Janeiro; 1933.
  12. 12. Nascimento M, Nascimento AC, Silva F, Azevedo C, Oliveir TRA, Amaral Junior AT, et al. Bayesian Segmented Regression Model for Adaptability and Stability Evaluation of Cotton Genotypes. Euphytica. 2020; 216: 1–10.
  13. 13. Ferreira DF, Demétrio CGB, Manly BFJ, Vencovsky R, Machado A. Statistical models in agriculture: biometrical methods for evaluating phenotypic stability in plant breeding. Cerne. 2006; 12: 373–388.
  14. 14. Spiegelhalter DJ, Best NG, Carlin BP, Linde A. Bayesian measures of model complexity and fit. J. R. Statist Soc B. 2002; 64: 583–639.
  15. 15. Gewek EJ. Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments. Research Department Staff Report. 1991; 1: 30. Available from: https://ideas.repec.org/p/fip/fedmsr/148.html
  16. 16. Raffery A, Lewis S. One long run with diagnostics: Implementation strategies for Markov chain Monte Carlo. Statist Sci. 1992; 7: 493–497.
  17. 17. R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna: The R Foundation for Statistical Computing; 2017.
  18. 18. Yan J, Prates M. rbugs: Fusing R and OpenBugs and Beyond. rdrr.io. 2013. https://rdrr.io/cran/rbugs/
  19. 19. Smith BJ. Boa: an R package for MCMC output conversence assessment and posterior inference. J Stat Soft. 2007; 21: 1–37.
  20. 20. Couto MF, Nascimento M, Amaral AT, Viana AP, Vivas M. Eberhart and Russel’s bayesian method in the selection of popcorn cultivars. Crop Sci. 2014; 55: 571–577.
  21. 21. Buso WHD, Arnhold E. Evaluation of corn hybrids under contrasting water availability conditions. Rev Caatinga. 2016; 29: 927–934.
  22. 22. Amaral CB, Môro GV. Estabilidade e adaptabilidade do florescimento em variedades de milho. Ciênc Tecnol. 2017; 2017: 1–9.
  23. 23. Oliveira TRA, Carvalho HWL, Costa EFN, Carvalho Filho JLS. Correlation among adaptability and stability assessment models in maize cultivars. Aust J Crop Sci. 2017; 11: 516–521.
  24. 24. Villafuerte MBC, Dávila IBL, Linzán JPLV, Rivera EASS, Cobeña FDAN. Estabilidad ambiental en híbridos de maíz usando el modelo ammi en el litoral ecuatoriano. Rev Espamc. 2017; 8: 23–32.
  25. 25. Teodoro PE, Nascimento M, Torres FE, Barroso LMA, Sagrilo E. Perspectiva bayesiana na seleção de genótipos de feijão–caupi em ensaios de valor de cultivo e uso. Pesq Agropec Bras. 2015; 50: 878–885.d
  26. 26. Carvalho HWL, Santos MX, Leal MLS, Carvalho PCL. Melhoramento genético da cultivar de milho br 5033-asa branca no nordeste brasileiro. Pesq Agropec Bras. 2000; 35: 1417–1425.
  27. 27. Carvalho HWL, Cardoso MJ, Pacheco CAP, Costa EFN, Rocha LMP, Oliveira IR, et al. Recomendação de Cultivares de Milho no Nordeste Brasileiro: Safra 2015. Embrapa. 2017; 203: 29. Available from: https://ainfo.cnptia.embrapa.br/digital/bitstream/item/171880/1/COT-203-diagramado-aprovadoAUTOR.pdf