Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Use of the reversible jump Markov chain Monte Carlo algorithm to select multiplicative terms in the AMMI-Bayesian model

  • Carlos Pereira da Silva ,

    Contributed equally to this work with: Carlos Pereira da Silva, Luciano Antonio de Oliveira

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Supervision, Writing – original draft, Writing – review & editing

    alessandrasilva@ufgd.edu.br

    Affiliation Department of Statistics (DES), Institute of Exact Sciences, Federal University of Lavras, Lavras, Minas Gerais, Brazil

  • Cristian Tiago Erazo Mendes ,

    Roles Data curation, Methodology, Visualization, Writing – original draft, Writing – review & editing

    ‡ CTEM, AQS, RGVP and MB also contributed equally to this work.

    Affiliation Department of Statistics (DES), Institute of Exact Sciences, Federal University of Lavras, Lavras, Minas Gerais, Brazil

  • Alessandra Querino da Silva ,

    Roles Conceptualization, Funding acquisition, Methodology, Writing – original draft, Writing – review & editing

    ‡ CTEM, AQS, RGVP and MB also contributed equally to this work.

    Affiliation Faculty of Exact Sciences and Technology (FACET), Federal University of Grande Dourados, Dourados, Mato Grosso do Sul, Brazil

  • Luciano Antonio de Oliveira ,

    Contributed equally to this work with: Carlos Pereira da Silva, Luciano Antonio de Oliveira

    Roles Conceptualization, Investigation, Methodology, Supervision, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Faculty of Exact Sciences and Technology (FACET), Federal University of Grande Dourados, Dourados, Mato Grosso do Sul, Brazil

  • Renzo Garcia Von Pinho ,

    Roles Conceptualization, Funding acquisition, Investigation, Writing – original draft, Writing – review & editing

    ‡ CTEM, AQS, RGVP and MB also contributed equally to this work.

    Affiliation Department of Agriculture (DAG), School of Agricultural Sciences, Federal University of Lavras, Lavras, Minas Gerais, Brazil

  • Marcio Balestre

    Roles Conceptualization, Formal analysis, Methodology, Software, Supervision, Validation, Writing – original draft, Writing – review & editing

    ‡ CTEM, AQS, RGVP and MB also contributed equally to this work.

    Affiliation Faculty of Exact Sciences and Technology (FACET), Federal University of Grande Dourados, Dourados, Mato Grosso do Sul, Brazil

Abstract

The model selection stage has become a central theme in applying the additive main effects and multiplicative interaction (AMMI) model to determine the optimal number of bilinear components to be retained to describe the genotype-by-environment interaction (GEI). In the Bayesian context, this problem has been addressed by using information criteria and the Bayes factor. However, these procedures are computationally intensive, making their application unfeasible when the model’s parametric space is large. A Bayesian analysis of the AMMI model was conducted using the Reversible Jump algorithm (RJMCMC) to determine the number of multiplicative terms needed to explain the GEI pattern. Three a priori distributions were assigned for the singular value scale parameter under different justifications, namely: i) the insufficient reason principle (uniform); ii) the invariance principle (Jeffreys’ prior) and iii) the maximum entropy principle. Simulated and real data were used to exemplify the method. An evaluation of the predictive ability of models for simulated data was conducted and indicated that the AMMI analysis, in general, was robust, and models adjusted by the Reversible Jump method were superior to those in which sampling was performed only by the Gibbs sampler. In addition, the RJMCMC showed greater feasibility since the selection and estimation of parameters are carried out concurrently in the same sampling algorithm, being more attractive in terms of computational time. The use of the maximum entropy principle makes the analysis more flexible, avoiding the use of procedures for correcting prior degrees of freedom and obtaining improper posterior marginal distributions.

Introduction

In plant breeding programs, the selection and recommendation of superior genotypes are of considerable importance. The observed performance (phenotype) is determined by the genetic composition of the plant (genotype), by the conditions under which it develops (environment), and the interaction between these two factors, i.e. genotype×environment interaction (GEI) [1, 2]. GEI reduces the predictability of the performance of varieties in environments, making it difficult for the breeder to make a broad recommendation of genotypes. Moreover, efficient modeling of GEI has become a central aspect in the statistical analysis of data from multi-environmental trials (MET) [3]. Therefore, interest in the development of statistical methods aimed at identifying stable genotypes and satisfactory combinations between genotypes and environments is increasing. These methods aim to explore the positive effect of GEI for specific recommendations.

Linear-bilinear models are useful for evaluating genotype responses in different environments. Specifically, the additive main effects and multiplicative interaction (AMMI) model deserves special mention due to its wide applicability in the study of GEI by plant researchers and breeders [4, 5]. However, limitations in the standard analysis of the AMMI model (which considers parameters with fixed effects), such as difficulty in dealing with unbalanced data and heterogeneity of variance, have motivated the pursuit of more flexible methods.

An option to overcome such limitations is the use of the Bayesian paradigm, which allows the incorporation of more complex structures in the GEI analysis [68]. The first Bayesian proposals of the AMMI model were presented by Viele and Srinivasan [9] and Liu [10]. These authors presented an MCMC (Markov chain Monte Carlo) algorithm to correctly sample the model parameters, especially for those that describe GEI, whose support for the posterior joint distribution is not trivial. The flexibility of the Bayesian-AMMI in incorporating inference to the multiplicative terms of the biplot, random effects for genotypes, and historical information to experiments was illustrated by Crossa et al. [11], Perez-Elizalde et al. [12], and Oliveira et al. [13]. In turn, Romão et al. [14] and Silva et al. [15], empirically demonstrated the robustness of the Bayesian-AMMI model under unbalance and heterogeneity of variances between environments.

An aspect that is central in research involving AMMI analysis is the adjustment of the model with respect to the number of bilinear terms to be retained for explaining GEI. This issue has been widely explored in the frequentist context, and involves the use of both F-approximate tests and non-parametric procedures based on intensive computation [1623]. In contrast, the selection of multiplicative terms in Bayesian-AMMI was done based on information criteria, such as Akaike’s Information Criterion (AIC), Bayesian Information Criterion (BIC), and Bayes Factor (BF) [10, 12].

Silva et al. [24] proposed a Bayesian version of the frequentist Shrinkage method published by Cornelius and Crossa [25], assigning a specific prior to the variance components of the singular values. According to Cornelius et al. [26], and Cornelius and Crossa [25], the use of estimators with a shrinkage effect eliminates the model selection stage, but does not guarantee more parsimonious models with respect to the number of retained components for explaining the GEI. Oliveira et al. [27], in turn, implemented the shrinkage estimators for the singular values of the GGE model using the principle of maximum entropy to ascription priors (the authors called this model the Bayesian-GGE entropy model or just BGGEE). In the Bayesian analysis of the AMMI (or GGE) model the use of a prior that promotes shrinkage in in singular values predictors can offer more parsimonious model adjustment. For this, some model selection methods must be used to determine the optimal number of axes to be retained. As in multiplicative models the first principal components describe most of the variability, it is expected that few singular values are significant for describing the GEI pattern. On the other hand, the estimates of singular values related to the components with little importance to explain the GEI (those related to the largest dimensions) effect are shrunk to zero. This gives us a more plausible justification for using the AMMI-2 biplot (with the first two principal components) representation since the first axes are predicted in a more pronounced manner.

A difficulty faced in the use of the Bayes factor or model selection methods that use information criteria is the great demand for time and computational resources. This is due to the fact that all possible models, depending on the dimension, must be adjusted for comparison. From a Bayesian point of view, the uncertainty regarding the best model, which fits a given set of data, can simply be incorporated into the inference problem. Consequently, the model itself is considered as another unknown parameter to be estimated and, with that, each model has its probability distribution. In this context, approaches involving the construction of transdimensional Markov chains, which allow the alteration of the model’s dimension during the sampling process, have gained popularity [28].

Among the class of transdimensional methods, there is the Reversible Jump in Markov chain Monte Carlo (RJMCMC), initially proposed by Green [29]. This procedure can be seen as a generalization of the Metropolis-Hastings algorithm when the model’s parametric space does not have a fixed dimensionality [30]. This method allows for the flexibility to “jump” between models with parametric spaces of different dimensions, which ensure that the chain is irreducible. Consequently, from the output of a single Markov chain sampler, a complete probabilistic description of the posterior distributions of each model is obtained [28, 31, 32].

Therefore, the main objective of this study was to adjust the AMMI model under the Bayesian perspective, assuming that the number of principal axes to be retained in the model is a random variable. This assumption was implemented using the RJMCMC methodology for the parameter sampling process. Thus, the joint inference about the number of bilinear terms responsible for the GEI pattern is directly incorporated by the inclusion (or exclusion) of multiplicative terms in the model during the sampling process. Furthermore, specific objectives of the study were as follows: i) to evaluate the sensitivity of the AMMI-Bayesian model to the prior specification for the variance component associated with singular values; ii) use information criteria for model selection, and iii) an evaluation the predictive capacity of the model in a scenario of random loss of genotypes in environment.

To exemplify the use of the method in the evaluation of cultivars we will use simulated and real data. The real data comprise 50 maize genotypes evaluated in 10 locations in southern Brazil. We will address two points that are important in the final stages of plant breeding programs. The first will be to verify the existence of genotypes that have little contribution to the GEI (stable), but that are interesting in terms of yield (broad recommendation). The second will be to identify positive combinations between genotypes and environments, aiming to take advantage of the positive effect of the GEI (recommendation of genotypes to specific environments).

Material and methods

Simulated data

To exemplify the proposed method, a data set with 20 genotypes (G1-G20) and nine environments (E1-E9) was simulated, using a randomized block design, with three replicates. The main effects of genotypes (g) were sampled from a Gaussian distribution with mean equal to zero and variance equal to 12, N(0, 12), and the main effects of the environment sampled from a Gaussian distribution with mean equal to zero and variance equal to one N(0,1).

The effects of the GEI for each genotype by environment combination (geij) for i = 1,…,20 and j = 1,…,9, where i indicate the genotype and j indicate the environment, were sampled from Gaussian distribution, getting three subgroups (patterns) as highlighted below:

  1. 1st subgroup: Genotypes G1, G2, G3, G4, and G5 with positive effects in E1, E2, E3, E4 environments and negative effects in E5, E6, E7, E8, and E9 environments. The effects of geij (i = 1,…,5) were sampled from a Gaussian distribution N(0, 16), and in the environments j = (1,…4) was considered as the module of the values and in the other environments j = (5,…9), the module of the values, was multiplied by (-1).
  2. 2nd subgroup: Genotypes G6, G7, G8, G9, and G10 with negative effects in E1, E2, E3, E4 environments and positive effects in E5, E6, E7, E8, and E9 environments. Similar to what was considered in the first subgroup, the effects of geij (i = 6,…,10) were sampled from a Gaussian distribution N(0, 16). But for j = (1,…4) the values are negative, whereas for j = (5,…9) they are positive.
  3. 3rd subgroup: Genotypes containing G11, G12,… ., G20, whose GEI effects geij (i = 11,…20), in the environments j = (1,…9), are sampled from a standard Gaussian distribution N(0,1).

The geij effects thus simulated make up a GEg×e matrix which is finally corrected for row and column mean so that it corresponds merely to the matrix genotype-by-environment interaction, where g represents the number of rows and e represents the number of columns.

Genotypes belonging to the 1st and 2nd groups were assumed to be unstable genotypes (contribute to interaction). By contrast, the genotypes belonging to the 3rd group were assumed as being stable (they do not have expressive contributions to interaction).

Thus, each value yijk was obtained by adding the simulated effects plus the error which, in turn, was simulated from a Gaussian distribution N(0,2.5). The simulation was performed using R software [33].

Real data

The real data comprised 50 simple maize hybrids evaluated for grain yield character (t/ha-1) during the 2013 and 2014 crop years at 10 locations in the Southern Region of Brazil. The experiment was conducted in an incomplete block design with a variable number of repetitions.

Statistical model.

The AMMI model, in vector notation, can be described as: (1) where y(n×1) is the vector of observations with n = ger, in which g corresponds to the number of genotypes, e to the number of environments, and r to the number of repetitions; β(er×1) and g(g×1) are the vectors of block effects within environments and genotype effects, respectively; and correspond to, respectively, the singular value and the singular vectors (genotypic and environmental) associated with the k-th principal component, with k = 1, …,t, in which t is the rank of the GEI matrix, where the singular vectors satisfy the orthonormality constraints and the singular values satisfy the order constraint λk ≥ λk+1 ≥ 0. The matrices and, Z(n×g) are designs associated with effects β, γk and g, respectively. As can be seen, and are different, where is used to distribute the effects of the GE(g×e) interaction and the number of columns is, therefore, equal to the number of environments. The matrix, in turn, distributes the effects of blocks within environments, as in Silva et al. [24] and Edwards and Jannink [7] to maintain the identifiability of all parameters and not use marginal conditions where effects sum to zero as presented in Crossa et al. [11]. The vector ϵ(n×1) is composed of random errors with .

If we denote by GE the interaction matrix, then it will have dimensions g × e (g row = genotype and e columns = environment). In the classic AMMI model, the mean of the cell is given by , where now αik and γjk are coordinates of singular vectors of the k-th principal component. As our model is for parcels, the multiplications involving Z and X2 in the sum of the model (1) allow distributing and replicating the effects of the GEI in the vector y.

In model (1), the term k specifies the number of axes that are included in the model, which would be from k = 1, where only one axis is being incorporated into the model, up to k = t that comprises the complete model. However, in the approach presented here, the model space is t + 1, such that one can assume an index k = 0, where the corresponding model would be y = X1β + Zg + ϵ, that is, the model disregarding the bilinear part.

The likelihood function is given by: (2) where and .

Prior distributions for the model parameters are as follows: which implies

For the singular value scale parameter, three hypotheses were established about the prior distribution:

  1. Hypothesis a: It is assumed that , which corresponds to assuming a non-informative prior for λk. This is the hypothesis presented by Oliveira et al. [13], being referred to by Bayesian AMMI (BAMMI).
  2. Hypothesis b: It is assumed that is a scaled inverted chi-square prior with scale parameter equal to zero, and degrees of freedom equal with the restriction of 0 < Δ < 1//2, resulting in a posterior with a shrinking effect for singular values. This is the Bayesian AMMI Shrinkage (BAMMIS) version proposed by Silva et al. [24]:
  3. Hypothesis c: It is assumed that is an inverse gamma prior with degrees of freedom equal to one (a = 1) and scale parameter equal to zero (b = 0). This choice is based on the concept of Maximum Entropy (ME), and was implemented in the Bayesian GGE model by Oliveira et al. [27] (more details in S1 Appendix) as follows:

As specified, priors are non-informative for , and genotypic and environmental singular vectors. However, for g, a hierarchical prior at two levels was assigned considering uncertainty about the hyperparameter , which results in considering a common population for genotypes. This is similar to what occurs in mixed models when genotype effects are considered random.

The prior distributions, assigned to the singular vectors, are uniform spherical in the corrected subspace due to the orthonormality constraint (the justification for such an assignment is elucidated in Viele and Srinivasan [9]). In this study, there is a special interest in the priors attributed to the singular values hyperparameters. The prior distribution of λk is a truncated normal for positive values, it is denoted by N+, with zero mean and with variance , which was assumed in the three hypotheses about the prior that was utilized.

Furthermore, the number of principal axes (t) present in the model is a random variable whose a priori uncertainty was established through a truncated Poisson distribution with mean μ. For the μ hyperparameter, a Gamma distribution with scale parameter and degree of freedom equal to one was assigned, as assumed in Balestre and Souza [34].

Posterior complete conditional distributions for model parameters.

Combining the likelihood function with the established prior distributions, using Bayes’ theorem, we obtain the joint posterior distribution.: (3) in which: .

The posterior complete conditional distributions are obtained from the joint distribution (3). They are presented, along with the algebraic details in the S2 Appendix.

RJMCMC algorithm decision rule

The objective of using the RJMCMC algorithm was to conduct model selection during the MCMC process. This method allows one to switch between models with different dimensions such that the properties inherent in the MCMC process are not violated. As in this RJMCMC approach, the models are nested, and the decision rule is similar to the proposals of Waagepetersen and Sorensen [35], Bodin and Sambridge [36], and Balestre and Souza [34], where the Jacobian determinant of the transformation in the decision rule is equal to one.

Therefore, the decision rule to test the addition of a new main axis to the model is given by: (4) where t corresponds to the current state model and t + 1 to the candidate model with the addition of an axis; p(yt) and p(yt + 1) are the probability functions of data from model t and (t + 1), respectively; p(t│μ) and p(t + 1│μ) are the priors referring to the respective models; ξ(t,t + 1) and ξ(t + 1,t) correspond to the proposed Hastings correction necessary to guarantee the reversibility condition of the model during the MCMC process, being and ξ(t + 1,t) = p0 wherein pa, pd and p0 correspond to the prior probabilities of adding, deleting, or maintaining, respectively, the number of main axes in the model (pa = pd = p0 = 1/3).

Simplifying Eq (4), the decision rule for adding a dimension to the model is:

If a(t, t + 1) is greater than a random variable u ~ U(0,1), then the candidate axis is included in the model; otherwise, the model is retained with the current number of main axes present in the model t.

The next step in the algorithm is to carryout the test between the model with dimension t and the model with dimension t − 1. That is, the model with t principal components and the model with reduction of a principal component t − 1, where the decision rule is given by: (5)

Simplifying Eq (5), the decision rule to accept or not accept the exclusion of a dimension in the model is: where p(y|t − 1) is the conditional distribution of data for the dimension-reduced model (t − 1) and with ξ(t, t − 1) and ξ(t − 1, t) corresponding to the necessary Hastings corrections to guarantee the reversibility conditions in the sampling, with ξ(t, t − 1) = p0 and ξ(t − 1, t) = pd/t.

Sampling process

The sampling of the parameters of the models was performed iteratively, combining the Gibbs sampler and the RJMCMC method. The Gibbs algorithm was arranged as follows:

Gibbs algorithm.

A1—Initial values are assigned to model parameters:

A2—The l-th iteration can be obtained as follows:

  1. Generates from the conditional posterior distribution: where: .
  2. Generates from the conditional posterior distribution: where:
  3. Generates from the conditional posterior distribution: where ϕk = diag(Zαk)X2γk, and λ0 = +∞.
    Step “D” refers to the component of variance related to the singular value for which three hypotheses are specified. Here, only hypothesis “c” will be used to illustrate the sampling process. For the other hypotheses, the process is analogous.
  4. Generates from the conditional posterior distribution:
  5. Generates from the conditional posterior distribution: where: Δ1k = diag(X2γk)Z
  6. Generates from the conditional posterior distribution: where: Δ2k = diag(Zαk)X2.
  7. Generates from the conditional posterior distribution:
  8. Generates from the conditional posterior distribution:
  9. Generates from the conditional posterior distribution:

After carrying out the sampling process parameters of the model (t) in the l-th iteration, the additional parameters referring to the concurrent model (t + 1) are sampled.

The size of the model is given by the number of multiplicative terms retained to explain GEI (0, 1, 2,…, rank(GEI)). For each proposed model, this dimension is directly related to steps C, D, E, and F of the Gibbs sampling process. For the model without the presence of the multiplicative part t = 0, steps C, D, E, and F are not included in the sampling.

RJMCMC algorithm.

With the sampling process for the parameters of the candidate models already established, the next step is to test the models, using the decision rule, selecting the most probable. For this, the following steps of the algorithm are performed:

  1. Inclusion movement
    1. Propose the addition of a bilinear term in the model (t + 1);
    2. Calculate the probability of accepting a(t, t + 1), the inclusion of the bilinear term;
    3. Sampling a random number from a uniform distribution u ~ U(0,1);
    4. If u < min(1, a(t, t + 1)), the proposed state is accepted; otherwise, the model is retained with the initial dimension.
  2. Exclusion movement:
    1. Propose the removal of a bilinear term in the model (t − 1);
    2. Calculate the probability of accepting a(t, t − 1) the exclusion of the bilinear term;
    3. Sampling a random number from a uniform distribution u ~ (0,1);
    4. If u < min(1, a(t, t − 1)), the proposed state is accepted; otherwise, the model remains with the initial dimension.

Repeat the steps specified in i) and ii) during the iterative process.

The model selected to represent the dataset will be the one with the highest frequency of visits, based on the decision rule.

The predictive capacity of the model

To assess the predictive capacity of the models with and without the use of the RJMCMC algorithm, an unbalance process was performed on the data such that within each environment, random losses of genotypes were established (total loss of the genotype) at the level of loss of 10% of the data set. Thus, the original sample was randomly divided into 10 sub-samples (10-fold), keeping all the genotype and environment information in the sub-samples. In each fold, the data were divided into training and validation data.

The predictive capacity of the models was evaluated using the PRESS (Predicted Residual Sum of Squares) criteria and the correlation (Cor) between the predicted () and observed (yijr) phenotypic values. This verification was given in scenarios involving the different hypotheses of the variance component related to the singular value.

Cor and PRESS are calculated, respectively, by: and where is the predicted mean values, the mean of the observed values and n denotes the number of data removed for validation.

Diagnosis of Markov chains and a posteriori inference

The total number of iterations to be performed with the Gibbs sampler in each analysis, was determined by a pilot sample according to Raftery and Lewis [37], indicating the number of initial observations to be discarded (burning), and the jumps necessary to collect the observations (tinning) that will compose the sample for inference. These procedures were performed to avoid selecting values from non-stationary strings and/or autocorrelated strings.

The convergence of the produced Markov chains, after the correction procedures described above, was monitored using the method of Raftery and Lewis [37] and by the criterion of Heidelberger and Welch [38]. Trace and density plots were used for visual analysis of the stationarity of Markov chains.

The singular vectors, in turn, are estimated from a correction to preserve the orthonormality constraints between the vectors [10]. This is done by orthonormalizing the matrices "U" and "V", whose columns are composed by the average coordinates of the genotypic and environmental singular vectors, respectively.

Linear parameters, singular values, and variance components of the selected model were estimated by posterior means of the simulated distributions. In addition, uncertainty regarding parameter estimation was quantified through the highest posterior density credible region (HPD).

Posterior bivariate credibility regions for genotypic and environmental scores were incorporated into the biplot-AMMI using the method described by Hu and Yang [39] to study stability and adaptability.

For models adjusted by the RJMCMC method, posterior means, HPD regions with 95% credibility, and bivariate credibility regions for genotypic and environmental scores in the biplot were obtained from two perspectives:

  1. Based on the most likely model, which is the one that considers only the observations contained in the conditional model (the model with the highest frequency of “visits”), which is referred to as conditional response herewith.
  2. Based on the marginal model that considers all MCMC observations of the possible models that contain the estimated parameters present in the most likely model; that is, it considers the estimates of nested models in which the number of bilinear terms is equal to or greater than that of the most likely model. This is referred to in the text as marginal response.

The entire inference process, as well as the convergence tests, were conducted using the boa package [40] and other tools from the R statistical software.

Results

The results obtained by applying the criteria of Heidelberger and Welch [38], and Raftery and Lewis [41] indicated good convergence properties for MCMC chains (Gibbs sampling) for all parameters with a dependence factor (I) always lower than five (I<5). Furthermore, all parameters passed the stationarity test, indicating that convergence was achieved.

Model selection using information criteria

Based on Fig 1, the model with the best fit, according to the AICM criterion, is independent of the assumed prior for the variance components of the singular value, being the one with two retained bilinear terms (AMMI-2). For BAMMIE, the model that best fits the data was also the same (AMMI-2), regardless of the selection criteria used. For BAMMI and BAMMIS, the best fit models were, respectively, AMMI-3 for the BIC; for the AIC, it was AMMI-4 and AMMI-3, for BAMMI and BAMMIS, respectively.

thumbnail
Fig 1. Ockham’s plot for the BAMMI, BAMMIS, and BAMMIE models for the AIC, AICM, and BIC information criteria, as well as for the log-likelihood (log-like) and residual variance (σe2).

https://doi.org/10.1371/journal.pone.0279537.g001

Fig 1 also presents the description of the behavior of the models concerning the log-likelihood (Log-Link) and residual variance (σe2). It is noteworthy that this information, by itself, does not correspond to tests to determine the number of bilinear components that should be retained in the model, but is useful for describing the behavior of the data and can still be used as auxiliary criteria in the step of selection.

Table 1 presents posterior means of the singular values for the models as a function of the number of retained bilinear terms, as well as the respective sum of squares of the accumulated GEI to each dimension. An accentuated shrinkage effect was observed in the estimates of the singular values of BAMMIE, concerning the other models, from the adjustment of the third multiplicative term. This accentuated shrinkage was also seen in BAMMIS concerning BAMMI, as from the model with a three-axis. S1 Table presents “true” values for the λk, together with the respective GEI sums of squares, conditioned to each dimension of the model, and were obtained by Singular Value Decomposition (SVD) of the simulated interaction matrix, in which the sum of squares accumulated was 259.6064.

thumbnail
Table 1. Posterior means of singular values referent of each model (AMMI) and their respective accumulated sum of squares of the interaction in each dimension (Gibbs algorithm).

https://doi.org/10.1371/journal.pone.0279537.t001

A point to be noted is that no member of the BAMMIE family overestimated the total variance of the GEI (simulated), and its estimates were always smaller concerning BAMMIS and BAMMI. Combining information from Fig 1 and Table 1, it is observed that models selected by the AICM criterion did not present an accumulated sum of squares greater than the true total sum of squares. This fact was also verified for BAMMIE, regardless of the criteria. For the models selected in BAMMIS and BAMMI using AIC and BIC, the sum of squared estimates retrieved was greater than the total simulated sum of squares.

Another finding was that the BAMMIE model selected by all information criteria (AMMI-2) presented a less discrepant accumulated sum concerning the first two terms of the SVD solution (S1 Table) compared to the others. However, considering complete models, BAMMIE was the one that most underestimated the true total sum of squares, BAMMI was the one that most overestimated it, and BAMMIS offered the closest value.

Summaries with a posteriori estimates of model parameters, including credibility regions for genotype effects and biplot representations, with incorporated bivariate credibility regions for genotypic and environmental scores that describe the GEI, referring to the adjustment using only Gibbs sampling, are presented in S1S6 Figs and S2 Table.

Adjustment of the AMMI model using the Reversible Jump method

Fig 2 illustrates the frequencies at which each model was visited in the RJMCMC sampling. These frequencies were used as criteria to select the winning model (most likely). According to this criterion, models AMMI3, AMMI2, and AMMI3 respectively would be selected for BAMMI, BAMMIE, and BAMMIS. Note that there are significant differences between the frequency of the winning model concerning the others for each AMMI version.

thumbnail
Fig 2. Frequency referring to the dimension (t) of the model accepted during the RJMCMC (simulated data).

https://doi.org/10.1371/journal.pone.0279537.g002

Table 2 indicates that the models presented very close estimates for the mean of the first two singular values. Furthermore, BAMMI and BAMMIS obtained similar mean values concerning λ3. Also, in relation to λ3, there is a more expressive difference for BAMMIE, although the HPD intervals overlap with 95% probability.

thumbnail
Table 2. Estimated singular values for the AMMI model referring to the three hypotheses made under the interaction, using the RJMCMC algorithm.

https://doi.org/10.1371/journal.pone.0279537.t002

The sum of squares for the selected models BAMMI, BAMMIE, and BAMMIS were 272.7710, 217.3316, and 264.2804, respectively. Thus, the most probable models for BAMMI and BAMMIS overestimated the GEI total sum of squares, while BAMMIE had a relatively smaller value and was closer to the true sum of squares than that obtained by the AICM criterion (S1 Table). The accentuated shrinkage effect for the posterior mean of λ3 in the BAMMIE model, in relation to the others, as well as in relation to the real value (S1 Table), is remarkable, which indicates that the estimates for the other parameters should be shrunk to zero. This sharp shrinkage effect was not observed in the other selected models.

Posterior means and HPD credibility regions for residual and genotypic variances for the conditional and marginal responses of the models are presented in Table 3. No large discrepancies were observed between the conditional and marginal estimates or between the three models. Even when compared with the estimates of the models selected by the information criteria, it was found that there are no major discrepancies about genotypic and residual variances (S2 Table).

thumbnail
Table 3. Posterior means the standard deviation (SD) and the 95% HPD intervals (LL: lower limit, LU: upper limit) for components of variance genotypic (σg2) and residual (σe2) for models with conditional and response responses.

https://doi.org/10.1371/journal.pone.0279537.t003

In Fig 3, posterior means and HPD intervals at 95% of credibility of the genotypic effects for BAMMI, BAMMIS, and BAMMIE are presented considering conditional (blue color) and marginal (red color) responses. In both situations, the pattern in the classification of main genotypic effects was practically the same between the models. Overlap between ranges indicates similar effects.

thumbnail
Fig 3. Posterior means and the 95% HPD intervals for the effect of genotypes, conditional response (blue color) and marginal responses (red color) referring to: a) BAMMI-3; b) BAMMIS-3; c) BAMMIE-2.

https://doi.org/10.1371/journal.pone.0279537.g003

Regarding yield, the same subgroup of genotypes (more productive) {G1, G12, G4, G10, G13, G11} was selected for the three models. The subgroup {G19, G2, G16, G9, G18, G5, G14} had a significantly lower mean than the overall one (negative effect), not being interesting in terms of productivity. Lastly, there are those genotypes whose HPD regions include the origin and, therefore, the effects do not differ significantly from zero. Differences between the main effect estimates between the models are practically imperceptible graphically. These results are presented in S3S5 Tables.

Biplot credibility regions for models adjusted by Reversible Jump

In Fig 4, biplots with posterior credibility regions for the genotypic scores of the three models are presented, with two representations for each model: marginal response and conditional response (as explained earlier in the Methods). From the visualization of the biplots, it is clear (for each model) that the two representations (marginal or conditional) did not result in differences in interpretation. However, there is clearly a change in the stability rating when comparing different models. As the responses within each model did not differ, conclusions about existing GEI standards will be made based on the marginal response.

thumbnail
Fig 4. Representations of the bivariate credibility regions at 95% probability, based on the marginal and conditional response of the BAMMI, BAMMIS, and BAMMIE, models.

https://doi.org/10.1371/journal.pone.0279537.g004

Fig 5 presents the biplot with the credibility regions for genotypic and environmental scores (at 95% probability) referent the marginal BAMMI model. Regions that do encompass the origin (0,0) are not plotted for ease of interpretation biplot. The subgroup of genotypes formed by {G2, G4, G15, G17, G18, G19, G20}, which were not represented, were interpreted as stable, as they have no important contribution to the GEI. The same interpretation applies to the environments and, in this sense, E5, which was not represented, was considered stable. The subgroups of genotypes and environments, whose associated bivariate regions do not encompass the origin, effectively contribute to the GEI effect and are often referred to as unstable.

thumbnail
Fig 5. Bivariate credibility regions at 95% probability for the genotypic and environmental scores of the BAMMI model whose regions do not contain the origin (0.0).

https://doi.org/10.1371/journal.pone.0279537.g005

Furthermore, it is possible to identify similar subgroups of genotypes and environments concerning the interaction effect (homogeneous subgroups). Although with some overlaps that make it difficult to observe separable groups, one could identify five homogeneous subgroups of genotypes {G1, G3, G5, G6, G16}, {G8, G11, G12}, {G9, G10}, {G7, G13, G14} and {G2, G4, G15, G17, G18, G19, G20}, the latter being formed by stable genotypes. For environments, four homogeneous subgroups could be suggested, namely: {E6, E7, E8}, {E1, E2, E3, E4}, {E9} and {E5}. The adaptability of genotypes to specific environments can also be suggested from the biplot representation observing regions in the same quadrant and/or that have expressive overlaps.

The biplots for the BAMMIS and BAMMIE models are shown in Figs 6 and 7, respectively. The only difference between BAMMI and BAMMIS is the G3 genotype, which was classified as stable in the BAMMIS analysis. In the BAMMIE biplot, the credibility regions of G3 and G16 also encompass the origin, signaling that these genotypes would not be important for the effect of GEI, which are the main differences in interpretations of BAMMIE concerning BAMMI and BAMMIS.

thumbnail
Fig 6. Bivariate credibility regions at 95% probability for the genotypic and environmental scores of the BAMMIS model whose regions do not contain the origin (0.0).

https://doi.org/10.1371/journal.pone.0279537.g006

thumbnail
Fig 7. Bivariate credibility regions at 95% probability for the genotypic and environmental scores of the BAMMIE model whose regions do not contain the origin (0.0).

https://doi.org/10.1371/journal.pone.0279537.g007

Evaluation of the predictive capacity of the models

The evaluation of the predictive capacity of the models utilizing the Cor (the higher the value, the better) and PRESS (smaller values indicate a greater predictive capacity) statistics, was performed using the Gibbs and RJMCMC (marginal responses of the models) algorithms.

Fig 8 illustrates the behaviors of the three AMMI versions adjusted by the Gibbs sampler, based on mean Cor and PRESS. The results suggest a high predictive capacity for the models under the used imbalance (always above 81%), with emphasis on the BAMMI-1 that has a correlation above 84%. For the PRESS criterion, the models that presented the best results were BAMMI-1, BAMMIS-1, and BAMMIE-2, similar to the results obtained based on the correlation; note that the BAMMI-1 model achieved slightly higher results in both evaluation criteria.

thumbnail
Fig 8. Ockham’s plot for the mean evaluation of the predictive capacity of the models based on the criteria of correlation between the observed and estimated values and the PRESS (Gibbs algorithm).

https://doi.org/10.1371/journal.pone.0279537.g008

The results of Cor and PRESS for the models fitted by the RJMCMC method are presented in Fig 9. For Cor, a mean behavior above 89% was observed, and individually (for the folds) the results were always above 84% for all models. For the PRESS criterion, the models that presented the best results were BAMMI-4, BAMMIS-4 and BAMMIE-8, and for Cor, the best result was presented by AMMI-8, regardless of the model. A similar pattern was observed between BAMMI and BAMMIS, both of which were better (in prediction) than BAMMIE, which also showed good performance.

thumbnail
Fig 9. Ockham’s plot for the mean evaluation of the predictive capacity of the models based on the criteria of correlation between the observed and estimated values and the PRESS (RJMCMC algorithm).

https://doi.org/10.1371/journal.pone.0279537.g009

Real data

Analyses of actual data were performed using only the RJMCMC. The posterior estimates were obtained considering the marginal and conditional responses of the models, in a similar manner to what was done with the simulated data. In Fig 10, the histograms of the frequency of visits of the models in relation to the number of bilinear terms are presented.

thumbnail
Fig 10. Frequency referring to the dimension (t) of the model accepted during the RJMCMC process (real data).

https://doi.org/10.1371/journal.pone.0279537.g010

Based on the frequencies of visits, AMMI-3 would be selected for the considered three versions of the AMMI model. Similar to in the study of simulated data, differences were observed in the frequency of the most likely model concerning the others.

Table 4 presents posterior means of singular values (for the conditional and marginal responses) and credibility regions estimated based on the marginal response. There were no accented differences between point estimates (conditional and marginal). Among the models, the mean estimates were close for the first three singular values, a fact that was also observed for the HPD interval at 95% of credibility.

thumbnail
Table 4. Point estimate and interval estimate (HPD at 95% probability) for singular values for the three Bayesian AMMI versions, using the RJMCMC algorithm.

https://doi.org/10.1371/journal.pone.0279537.t004

Posterior means along with HPD regions for the residual and genotypic variance components, for the marginal and conditional responses, are presented in Table 5. The estimates, considering conditional and marginal responses, are practically the same within each model. Between the models, there were also no expressive differences in the estimates of the corresponding parameters.

thumbnail
Table 5. Point estimate and interval estimate (HPD at 95% probability) for the variance genotypic () for the models, considering conditional and marginal responses, for the three Bayesian AMMI versions.

https://doi.org/10.1371/journal.pone.0279537.t005

In Fig 11, posterior means and HPD intervals at 95% of credibility are presented, for the effect of genotypes, conditional response (blue color), and marginal response (red color). In both situations, the pattern in the classification of the effects was practically the same within each model. The inferences were also practically the same in both models with the same subgroup of genotypes (more productive) being selected {G9, G10, G38, G8, G6, G2, G1, G3, G5, G7}.

thumbnail
Fig 11. Posterior means and the 95% HPD intervals for the effect of genotypes, conditional response (blue color) and marginal responses (red color) referring to: a) BAMMI; b) BAMMIS; c) BAMMIE.

https://doi.org/10.1371/journal.pone.0279537.g011

Figs 1214, are presented as biplots with 95% bivariate credibility regions for genotypic and environmental scores’ (to the AMMI3 model) marginal response. Here too, only the regions that do not contain the origin (0,0) were represented to simplify the interpretations; the genotypes and environments associated with these regions have an important contribution to the GEI effect. By visually inspecting positions and overlaps between credibility regions, the same homogeneous genotype subgroups, {G1}, {G7, 10}, {G11} and {G8, G39}, can be suggested for all models. The remaining genotypes (not represented) form another subgroup, and they are interpreted as stable. The same interpretation can be made in relation to the environments, so that five subgroups can be identified, namely: {E8, E6}, {E7, E10}, {E1, E2, E3, E4, E5} and {E9}. The region associated with the E9 environment encompasses the origin; thus, it is considered stable, and the other environments significantly contribute to the GEI effect.

thumbnail
Fig 12. Bivariate credibility regions with 95% credibility for the genotypic and environmental scores of the BAMMI model, built based on the marginal response (AMMI3).

https://doi.org/10.1371/journal.pone.0279537.g012

thumbnail
Fig 13. Bivariate credibility regions with 95% credibility for the genotypic and environmental scores of the BAMMIE model, built based on the marginal response (AMMI3).

https://doi.org/10.1371/journal.pone.0279537.g013

thumbnail
Fig 14. Bivariate credibility regions with 95% credibility for the genotypic and environmental scores of the BAMMIS model, built based on the marginal response (AMMI3).

https://doi.org/10.1371/journal.pone.0279537.g014

The adaptability of genotypes (or subgroup of genotypes) to environments (or subgroups of environments) can also be suggested from the biplot visualization. This analysis can be performed by observing positions and overlaps between regions of credibility for genotypic and environmental scores in the quadrants referring to the PC1 and PC2 axes. It is noteworthy that these inferences would be almost identical in both biplots.

Points considered for exemplifying the method

In the introduction, we pose two fundamental questions concerning the work of plant breeders. We previously described some differences between the three versions of the AMMI model fitted with MCMC and RJMCMC. We will now highlight differences (or similarities) between the two adjustment methods (MCMC × RJMCMC) for simulated and real data in the analysis of adaptability and phenotypic stability.

Stability analysis for simulated data

Estimates for the singular values of selected models (from information criteria) and the most likely models using RJMCMC were not very outliers. However, as already reported, it was observed that in the assessment of the predictive capacity, the models adjusted by the RJMCMC method performed better than those adjusted by the MCMC method.

The selection and recommendation of genotypes are one of the main objectives of plant breeding programs. The identification of stable genotypes makes it possible to minimize the interaction effect if they have interesting yields. We can compare the main effects of genotypes between models fitted with MCMC and RJMCMC by looking at Fig 3 and the figures that are in the S1S3 Figs. From the graphical analysis, we found that the inferences do not change even between the different AMMI versions. The best genotypes in terms of the main effect are the same {G1, G12, G4, G10, G13, G11}.

The second step in this evaluation would be to observe which of these genotypes would be stable by comparing the versions of the models with MCMC and RJMCMC. Considering MCMC models (S4S6 Figs), G4 is stable in all the model AMMI versions, while G13 is stable also for BAMMIS and G12 for BAMMIE. Therefore, among the best genotypes (considering main effects), these would have a broad recommendation, respectively, for BAMMI, BAMMIS, and BAMMIE.

When analyzing the biplots of the models for the RJMCMC analyses, it is noticed that G4 is also stable in all of them. The only difference is that G13 is also considered stable for BAMMI-E, which differs from the analysis using MCMC. But, in general, the inferences between these two approaches converge.

Adaptability to simulated data

Another way to mitigate the effect of the GEI is to take advantage of its positive effect. For this, it is necessary to evaluate the biplot and interpret positions and overlaps of genotypic and environmental credibility regions.

In the biplots of models fitted with MCMC (S4S6 Figs), the visual analysis suggests separable subgroups concerning the interaction effect. For the three versions of the model, the GEI standard, in general, remains in the biplot representation. It is observed that for models fitted with RJMCMC the patterns are similar (Figs 57). The similarity is also observed when comparing the biplots of equivalent models (MCMC × RJMCMC). This shows us that the analysis of adaptability and stability when considering the different methods, in general, offers the same inferences. Concerning prediction, a slight advantage was seen for the RJMCMC method. In addition, the computational analysis time for the adjustment proved to be much more feasible.

Stability analysis for real data

We exemplified the RJMCMC method with real data and observed, concerning the main effect, that the subgroup {G9, G10, G38, G8, G6, G2, G1, G3, G5, G7} stood out, and even though this inference is independent of the model (BAMMI, BAMIS or BAMMIE). In addition, we saw that G7 was the one that presented the highest value for the main effect since its HPD region does not overlap with any other (Fig 11).

In the biplot analysis (Figs 12, 13, and 14), we also observed that the genotypes G1, G7, 10, G11, G8, and G39 have significant contributions to the GEI the others and were considered stable since their bivariate credibility regions include the origin (0,0). This inference was common to both versions (BAMMI, BAMMIS, and BAMMIE). Thus, among the best genotypes for the main effect, the subgroup {G2, G3, G5, G6, G9, G38}, formed by stable genotypes, has a broad recommendation for all environments tested.

For comparison purposes, fitted the models by the MCMC method and used information criteria for model selection. Fig 15 presents the results of model selections obtained by AIC, BIC, and AICM. As additional criteria, the behaviors of log-likelihood and residual variance are presented. In this sense, models with lower residual variance are preferable, while higher log-likelihood indicates the best model.

thumbnail
Fig 15. Ockham’s plot for the BAMMI, BAMMIS, and BAMMIE models for the AIC, AICM, and BIC information criteria for real data.

The behaviors of log-likelihood (log-like) and residual variance (σe2) are also presented.

https://doi.org/10.1371/journal.pone.0279537.g015

The AICM criterion indicated that the BAMMIS and BAMMIE models with three bilinear terms (BAMMIS-3 and BAMMIE-3) would be the best, while for the BAMMI four principal components (BAMMI-4) would be necessary to obtain the optimal model. Using BIC and AIC criteria, we would select different models for each AMMI version. For the AIC criterion, we would have BAMMIE-3, BAMMIS-5, and BAMMI-6; while the BIC criterion pointed out the models: BAMMIE-3, BAMMIS-4, and BAMMI-5.

In Figs 1618 present posterior means and HPD regions, at 95% credibility, for genotype main effects of the models BAMMIE-3, BAMMIS-3, and BAMMI-4, respectively. To make these inferences, we considered the models selected by the AICM criterion, remembering that for the BAMMIE, the model selected by the criteria used was BAMMIE-3.

thumbnail
Fig 16. Posterior means and the 95% HPD intervals for the effect of genotypes, considering the BAMMIE-3 model, for real data using the MCMC method.

https://doi.org/10.1371/journal.pone.0279537.g016

thumbnail
Fig 17. Posterior means and the 95% HPD intervals for the effect of genotypes, considering the BAMMIS-3 model, for real data using the MCMC method.

https://doi.org/10.1371/journal.pone.0279537.g017

thumbnail
Fig 18. Posterior means and the 95% HPD intervals for the effect of genotypes, considering the BAMMI-4 model, for real data using the MCMC method.

https://doi.org/10.1371/journal.pone.0279537.g018

As you can see, the inferences regarding the main effect (ranking and credibility regions) do not change from one version to another. When we compare these results with those obtained with the models adjusted with RJMCMC (Fig 11), the inferences are the same, with the subgroup {G9, G10, G38, G8, G6, G2, G1, G3, G5, G7} highlighting and G7 being the best genotype. This behavior was also observed for models adjusted using AIC or BIC (S4 Appendix).

The biplots for the models selected from the AICM method are shown in Figs 1921. We observe that the positions and overlaps of the credibility regions in the biplots of the three models are practically identical. Furthermore, there are no significant discrepancies when comparing these biplots with those of Figs 1214, built from the fit using the RJMCMC method. So, the inferences are also the same.

thumbnail
Fig 19. Bivariate credibility regions at 95% probability for the genotypic and environmental scores of the BAMMIE-3 model for real data, adjusted using the MCMC method.

Only regions whose regions do not contain the origin (0.0) were plotted.

https://doi.org/10.1371/journal.pone.0279537.g019

thumbnail
Fig 20. Bivariate credibility regions at 95% probability for the genotypic and environmental scores of the BAMMIS-3 model for real data, adjusted using the MCMC method.

Only regions whose regions do not contain the origin (0.0) were plotted.

https://doi.org/10.1371/journal.pone.0279537.g020

thumbnail
Fig 21. Bivariate credibility regions at 95% probability for the genotypic and environmental scores of the BAMMI-4 model for real data, adjusted using the MCMC method.

Only regions whose regions do not contain the origin (0.0) were plotted.

https://doi.org/10.1371/journal.pone.0279537.g021

Adaptability in real data

For models fitted with the RJMCMC method, we highlighted the following separable homogeneous subgroups of unstable genotypes {G1}, {G7, 10}, {G11} and {G8, G39}. Genotypes in each subgroup are similar concerning the effect of GEI. We point out that the same subgroups can be identified regardless of the model (BAMMI, BAMMIS, and BAMMIE). Therefore, considering the RJMCMC biplot analysis (Figs 1214), it is possible to make the following recommendations: G1 genotype for the E8 environment, the subgroup of genotypes {G8, G39} for the subgroup of environments {E1, E2, E3, E4, E5} and subgroup {G7, G10} to environments E7 and E10.

This same result can be inferred from the biplot analysis for the model adjusted by the MCMC method. As already highlighted, the advantages of using the RJMCMC are related to the probabilistic nature of this proposal, such as its feasibility, since the use of information criteria in the selection of models is exhaustive and may become unfeasible. Concerning the analysis of stability and adaptability, we did not find significant differences when considering the different AMMI versions, depending on the a priori distribution adopted for the singular values. Our objective, in this sense, was to show that the RJMCMC can be applied to any Bayesian version of the AMMI model present in the literature.

Discussion

In analyzes using linear-bilinear models, it is important to determine how many bilinear terms should be retained to describe the pattern of interaction. In the frequentist method, this is usually done through parametric tests based on the F distribution or using cross-validation schemes [5, 17, 19]. The question of the dimensionality of the model naturally arises from the Bayesian perspective, although this subject is still not very widespread in the Bayesian analysis of linear-bilinear models and, in particular, in the Bayesian-AMMI. Most approaches have not focused on the selection of models, but the assessment of stability and adaptability, using credibility regions incorporated in the biplot [11, 13, 42, 43].

The most recent works on model selection in the Bayesian analysis of AMMI and GGE were carried out by Silva et al. [24] and Oliveira et al. [27], respectively. The analyzes presented by these authors can be considered Bayesian versions of the frequentist proposal of Cornelius and Crossa [25]. One of the reasons for using shrinkage estimators is the possibility of dispensing with the use of any criteria to determine the number of bilinear parameters to be retained in the analysis. The use of shrinkage estimators provides better adjustments in terms of prediction. But the method does not necessarily lead to a more parsimonious choice in terms of the model dimension. In this sense, we argue in favor of the need to think about procedures for selecting models, even when specific priors are used for the components of singular values, as is the case of the BAMMIS and BAMMIE models that were discussed in this study. Liu [10] offers a rich discussion on the selection of models for the Bayesian-AMMI, using information criteria BIC and AIC, and Bayes factor to determine the best model.

Despite their popularity, information criteria such as AIC and BIC require an exhaustive evaluation of all possible models. This fact can make applications unfeasible when the model space is large and, therefore, only in limited circumstances are indicated. Furthermore, the Bayes factor, as Liu [10] argues, is sensitive to the choice of priors, and choosing an ideal prior is a difficult task (although this criterion has not been used in this study). It is also noted that different criteria, in general, tend to choose different models (Fig 1). In this sense, it seems more reasonable to opt for the AICM criterion, which, for the conditional inference, proved to be more consistent, for selecting models with the same dimension. In turn, the use of the RJMCMC algorithm, allows both the adjustment and determination of the model dimension simultaneously. It is one of the most popular methods involving transdimensional Markov chains and has been applied in several contexts [29, 34, 4446].

In assessing the predictive ability, we consistently observed good properties for the three versions of the AMMI model. As noted, the results of the models fitted by the two methods do not coincide. The results obtained with the RJMCMC were always superior to the Gibbs approach both by the PRESS criterion and by the Correlation. This was also observed for the conditional response of each model (Figs 8 and 9 and S6 Table).

Another point to be highlighted was the use of different priors related to the parameter . In the BAMMI model, as in Crossa et al. [11] and Oliveira et al. [13], a prior can be considered as an approximation of the principle of insufficient reason. The BAMMIS proposed by Silva et al. [24] was obtained by assigning the Jeffreys prior [47]. This prior is based on the principle of invariance. Other suppositions were investigated by Liu [10], where he showed that different suppositions could lead to different results, as verified in this study as well. Despite the wide use of non-informative priors, they may not represent the current state of available information about the problem. In most cases, they are used only as reference distributions, that is, as a default option in the absence of any available information. Thus, among non-informative prior calls, there are some that are more useful and, therefore, frequently used. But depending on the situation, it would be incorrect to say that these are less informative than others. In this sense, the use of a prior based on the principle of maximum entropy [27] provided us with a theoretical justification for performing inference with less informative priors. Using maximum entropy priors can also be useful in situations of incomplete or doubtful information [4851].

With simulated data, we observed that, in general, the choice of priors did not affect the pattern of the marginal or conditional response of the model in the biplot. The prior based on the principle of maximum entropy (BAMMIE) provided a more parsimonious choice, but, with few exceptions, it did not change the patterns of the marginal bivariate credibility regions obtained with other priors. This choice is convenient in the example, as the breeder tends to prefer biplot representations to evaluate genotype responses in different environments. The biplot analysis facilitates the work of selection and recommendation of superior cultivars. Although no expressive discrepancies were found, opting for the marginal biplot could be interesting as it produces less Monte Carlo error. However, the conditional biplot determines a greater degree of freedom for the error.

From S8 and S9 Tables, it is possible to infer that the models, in general, performed well in correctly identifying stable and unstable genotypes (based on the simulated scenario). This behavior was verified for the fit using only the Gibbs sampler as well as the RJMCMC method. For real data, the most likely model was the one with three retained multiplicative terms (AMMI3) for all AMMI versions (BAMMI, BAMMIS, and BAMMIE). This became more evident for BAMMIE, as indicated by noting the marked difference between the frequency of “visits” (Fig 10). Furthermore, the biplot patterns for the three models are practically indistinguishable. Therefore, in theory, for a graphical evaluation of patterns, BAMMIE could be preferable. The justification is based on the singular value decomposition property that the first axes capture more information. BAMMIE better estimates the first two singular values, reducing the estimates of those related to higher dimensions to zero. Furthermore, the most popular biplot analysis is the AMMI-2 biplot analysis, which is systematically used even when the selected model dimension is greater than two. In addition, the use of maximum entropy priors makes the analysis more flexible, justifying the shrinkage effect without the need of imposing restrictions on the prior to avoid obtaining inappropriate marginal posterior distributions, as presented in Silva et al. [24]. This aspect was also emphasized by Oliveira et al. [27] who reported that the use of the principle of maximum entropy brings greater stability to the algorithm and simplifies the estimation of parameters.

The issues raised here present a good reason to choose the RJMCMC, although the dataset used in our approach is small. In this sense, the computational time of analysis with RJMCMC is much more attractive. As can be seen from S7 Table, the selection of models by information criteria involves procedures whose total computational time (in minutes) is at least four times longer compared to the RJMCMC method. Another justification for using RJMCMC, as mentioned in this text, would be that this method leads to a natural extension of the Markov chain theory.

Other RJMCMC approaches could be used. We chose the one that seemed the simplest to us and didn’t bother comparing different methods trying to find the best one. Our objective was to implement RJMCMC directly in the model fit and to show that the method is a more viable alternative than the use of information criteria. However, the comparison of different RJMCMC methods, aiming to find the best one, is very interesting and can lead to greater efficiency in the adjustment. Another option would be to use non-MCMC methods, which could solve problems such as poor mixing and the difficulty of diagnosing convergence. In some situations, there is evidence that non-MCMC methods can be more efficient than MCMC algorithms [52, 53]. Fearnhead [54] emphasizes this idea, reviewing three alternatives to MCMC methods: Importance Sampling, Forward-Backward algorithm, and Sequential Monte Carlo (SMC), discussing the efficiency and limitations of these procedures. The use of non-MCMC procedures could be an option to make the Bayesian analysis of multiplicative models more flexible. For this, further studies are needed and will be the subject of future research.

Supporting information

S1 Appendix. The complete posterior conditional distributions for the models.

https://doi.org/10.1371/journal.pone.0279537.s001

(PDF)

S4 Appendix. Posterior summaries for AMMI models with the MCMC method selected by AIC and BIC.

https://doi.org/10.1371/journal.pone.0279537.s004

(PDF)

S1 Table. Singular values were obtained through the SVD of the GEI simulated.

https://doi.org/10.1371/journal.pone.0279537.s005

(PDF)

S2 Table. Estimates for the variance components of the genotypic effect and residual variance component, referring to the AMMI model (Gibbs algorithm).

https://doi.org/10.1371/journal.pone.0279537.s006

(PDF)

S3 Table. Posterior means and HPD intervals (at 95%) for the genotypic effects of the model BAMMI (conditional and marginal), RJMCMC algorithm.

https://doi.org/10.1371/journal.pone.0279537.s007

(PDF)

S4 Table. Posterior means and HPD intervals (at 95%) for the genotypic effects of the model BAMMIS (conditional and marginal), RJMCMC algorithm.

https://doi.org/10.1371/journal.pone.0279537.s008

(PDF)

S5 Table. Posterior means and HPD intervals (at 95%) for the genotypic effects of the model BAMMIE (conditional and marginal), RJMCMC algorithm.

https://doi.org/10.1371/journal.pone.0279537.s009

(PDF)

S6 Table. Correlation and PRESS for the models (RJMCMC).

https://doi.org/10.1371/journal.pone.0279537.s010

(PDF)

S7 Table. Total computational time, of analysis of the models using Gibbs and RJMCMC algorithm.

https://doi.org/10.1371/journal.pone.0279537.s011

(PDF)

S8 Table. Classification of the pattern of genotypic scores by biplot AMMI2 representation based on the simulated scenario (Gibbs algorithm).

https://doi.org/10.1371/journal.pone.0279537.s012

(PDF)

S9 Table. Classification of the pattern of genotypic scores by biplot AMMI2 representation based on the simulated scenario (RJMCMC algorithm).

https://doi.org/10.1371/journal.pone.0279537.s013

(PDF)

S1 Fig. Genotypic effect of the BAMMI model (Gibbs algorithm).

https://doi.org/10.1371/journal.pone.0279537.s014

(PDF)

S2 Fig. Genotypic effect of the BAMMIS model (Gibbs algorithm).

https://doi.org/10.1371/journal.pone.0279537.s015

(PDF)

S3 Fig. Genotypic effect of the BAMMIE model (Gibbs algorithm).

https://doi.org/10.1371/journal.pone.0279537.s016

(PDF)

S4 Fig. Bivariate credibility regions for the genotypic and environmental scores of the BAMMI model (Gibbs algorithm).

https://doi.org/10.1371/journal.pone.0279537.s017

(PDF)

S5 Fig. Bivariate credibility regions for the genotypic and environmental scores of the BAMMIE model (Gibbs algorithm).

https://doi.org/10.1371/journal.pone.0279537.s018

(PDF)

S6 Fig. Bivariate credibility regions for the genotypic and environmental scores of the BAMMIS model (Gibbs lgorithm).

https://doi.org/10.1371/journal.pone.0279537.s019

(PDF)

References

  1. 1. de Carvalho CGP, Arias CAA, de Toledo JFF, de Almeida LA, de Kiihl RA S, de Oliveira MF. Interação genótipo x ambiente no desempenho produtivo da soja no Paraná. Pesqui Agropecuária Bras. 2002;37(7):989–1000.
  2. 2. Malosetti M, Ribaut JM, van Eeuwijk FA. The statistical analysis of multi-environment data: Modeling genotype-by-environment interaction and its genetic basis. Front Physiol. 2013;4 MAR.
  3. 3. Dia M, Wehner TC, Arellano C. Analysis of genotype × environment interaction (G×E) using SAS programming. Agron J. 2016;108(5).
  4. 4. Cornelius PL, Seyedsadr MS. Estimation of general linear-bilinear models for two-way tables. J Stat Comput Simul. 1997;58(4):287–322.
  5. 5. Gauch HG. Model Selection and Validation for Yield Trials with Interaction. Biometrics. 1988;44(3):705.
  6. 6. Cotes JM, Crossa J, Sanches A, Cornelius PL. A Bayesian Approach for Assessing the Stability of Genotypes. Crop Sci. 2006;46(6):2654–65.
  7. 7. Edwards JW, Jannink J-L. Bayesian Modeling of Heterogeneous Error and Genotype × Environment Interaction Variances. Crop Sci. 2006;
  8. 8. Orellana M, Edwards J, Carriquiry A. Heterogeneous variances in Multi-Environment yield trials for corn Hybrids. Crop Sci. 2014;
  9. 9. Viele K, Srinivasan C. Parsimonious estimation of multiplicative interaction in analysis of variance using Kullback–Leibler Information. J Stat Plan Inference. 2000;84(1–2):201–19.
  10. 10. Liu G. Bayesian computations for general linear-bilinear models. University of Kentucky; 2001.
  11. 11. Crossa J, Perez-Elizalde S, Jarquin D, Cotes JM, Viele K, Liu G, et al. Bayesian Estimation of the Additive Main Effects and Multiplicative Interaction Model. Crop Sci. 2011;51(4):1458–69.
  12. 12. Perez-Elizalde S, Jarquin D, Crossa J. A General Bayesian Estimation Method of Linear–Bilinear Models Applied to Plant Breeding Trials With Genotype × Environment Interaction. J Agric Biol Environ Stat. 2012;17(1):15–37.
  13. 13. Antonio de Oliveira L, Pereira da Silva C, Nuvunga JJ, Da Silva AQ, Balestre M. Credible Intervals for Scores in the AMMI with Random Effects for Genotype. Crop Sci. 2015;55(2):465–76.
  14. 14. Romão RF, Nuvunga JJ, Silva CP, Oliveira LA, Mendes CTE, Balestre M. Research Article Predictive ability of AMMI and factorial analytical models in the study of unbalanced multi-environment data. Genet Mol Res. 2019;18(3).
  15. 15. da Silva CP, de Oliveira LA, Nuvunga JJ, Pamplona AKA, Balestre M. Heterogeneity of Variances in the Bayesian AMMI Model for Multienvironment Trial Studies. Crop Sci. 2019;59(6):2455–72.
  16. 16. Cornelius PL. Statistical tests and retention of terms in the additive main effects and multiplicative interaction model for cultivar trials. Crop Sci. 1993;33(6).
  17. 17. Piepho HP. Robustness of statistical tests for multiplicative terms in the additive main effects and multiplicative interaction model for cultivar trials. Theor Appl Genet. 1995;90(3–4):438–43. pmid:24173935
  18. 18. Malik WA, Forkman J, Piepho HP. Testing multiplicative terms in AMMI and GGE models for multienvironment trials with replicates. Theor Appl Genet. 2019;132(7). pmid:30982926
  19. 19. Dias CTDS, Krzanowski WJ. Model selection and cross validation in additive main effect and multiplicative interaction models. Crop Sci. 2003;43(3).
  20. 20. Dos Santos Dias CT, Krzanowski WJ. Choosing components in the additive main effect and multiplicative interaction (AMMI) models. Sci Agric. 2006;63(2).
  21. 21. Hadasch S, Forkman J, Piepho HP. Cross-validation in AMMI and GGE models: A comparison of methods. Crop Sci. 2017;57(1).
  22. 22. Forkman J, Piepho HP. Parametric bootstrap methods for testing multiplicative terms in GGE and AMMI models. Biometrics. 2014;70(3). pmid:24588726
  23. 23. Malik WA, Hadasch S, Forkman J, Piepho HP. Nonparametric resampling methods for testing multiplicative terms in ammi and gge models for multienvironment trials. Crop Sci. 2018;58(2).
  24. 24. da Silva CP, de Oliveira LA, Nuvunga JJ, Pamplona AKA, Balestre M. A Bayesian Shrinkage Approach for AMMI Models. Hsiao CK, editor. PLoS One. 2015;10(7):e0131414. pmid:26158452
  25. 25. Cornelius PL, Crossa J. Prediction Assessment of Shrinkage Estimators of Multiplicative Models for Multi‐Environment Cultivar Trials. Crop Sci. 1999;39(4):998–1009.
  26. 26. Cornelius PL, Crossa J, Seyedsadr MS. Statistical tests and estimators of multiplicative models for genotype-by-environment interaction. Genotype by Environment Interaction. 1996.
  27. 27. de Oliveira LA, da Silva CP, da Silva AQ, Mendes CTE, Nuvunga JJ, Muniz JA, et al. Shrinkage in the Bayesian analysis of the GGE model: A case study with simulation. Teodoro PE, editor. PLoS One. 2021;16(8):e0256882. pmid:34460844
  28. 28. Sisson SA. Transdimensional Markov Chains. J Am Stat Assoc. 2005;100(471).
  29. 29. Green PJ. Reversible jump Markov chain monte carlo computation and Bayesian model determination. Biometrika. 1995;
  30. 30. Dellaportas P, Forster JJ, Ntzoufras I. On Bayesian model and variable selection using MCMC. Stat Comput. 2002;12(1).
  31. 31. Green PJ, Mira A. Delayed rejection in reversible jump Metropolis-Hastings. Biometrika. 2001;88(4).
  32. 32. Tierney L. A note on Metropolis-Hastings kernels for general state spaces. Ann Appl Probab. 1998;8(1).
  33. 33. Team RC. R Core Team R. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org. 2016.
  34. 34. Balestre M, de Souza CL. Bayesian reversible-jump for epistasis analysis in genomic studies. Vol. 17, BMC Genomics. 2016. pmid:27938339
  35. 35. Waagepetersen R, Sorensen D. A tutorial on reversible jump MCMC with a view toward applications in QTL-mapping. Int Stat Rev. 2001;69(1).
  36. 36. Bodin T, Sambridge M. Seismic tomography with the reversible jump algorithm. In: Geophysical Journal International. 2009.
  37. 37. Raftery a. E, Lewis SM. The number of iterations, convergence diagnostics and generic Metropolis algorithms. Pract Markov Chain Monte Carlo. 1995;
  38. 38. Heidelberger P, Welch PD. Simulation Run Length Control in the Presence of an Initial Transient. Oper Res. 1983;31(6):1109–44.
  39. 39. Hu Z, Yang R-C. Improved Statistical Inference for Graphical Description and Interpretation of Genotype × Environment Interaction. Crop Sci. 2013;53(6).
  40. 40. Smith BJ. boa: An R package for MCMC output convergence assessment and posterior inference. J Stat Softw. 2007;21(11).
  41. 41. Raftery AE, Lewis S. How Many Iterations in the Gibbs Sampler? Bayesian Statistics 4 (Bernardo JM, Berger JO, Dawid AP and Smith AFM, eds). Oxford: Oxford University Press; 1992. p. 763–73.
  42. 42. de Oliveira LA, da Silva CP, Nuvunga JJ, da Silva AQ, Balestre M. Bayesian GGE biplot models applied to maize multi-environments trials. Genet Mol Res. 2016;15(2). pmid:27323201
  43. 43. Bernardo Júnior LAY, da Silva CP, de Oliveira LA, Nuvunga JJ, Pires LPM, Von Pinho RG, et al. Ammi Bayesian models to study stability and adaptability in Maize. Agron J. 2018;110(5).
  44. 44. Mondal A, Efendiev Y, Mallick B, Datta-Gupta A. Bayesian uncertainty quantification for flows in heterogeneous porous media using reversible jump Markov chain Monte Carlo methods. Adv Water Resour. 2010;33(3).
  45. 45. Xu S. Principles of statistical genomics. Principles of Statistical Genomics. 2013.
  46. 46. Zhang Z, Chan KL, Kwok JT, Yeung DY. Bayesian inference on principal component analysis using reversible jump markov chain Monte Carlo. In: Proceedings of the National Conference on Artificial Intelligence. 2004.
  47. 47. Jeffreys H. The theory of probability. 3rd ed. Oxford: OUP Oxford; 1961.
  48. 48. Jaynes ET. Information Theory and Statistical Mechanics. Phys Rev. 1957;106(4):620–30.
  49. 49. Templeman AB, Xingsi L. A maximum entropy approach to constrained non-linear programming. Eng Optim. 1987;
  50. 50. Beane NR, Rentch JS, Schuler TM. Using maximum entropy modeling to identify and prioritize red spruce forest habitat in West Virginia. Res Pap NRS-23 Newt Square, PA US Dep Agric For Serv North Res Station 16 p. 2013;23:1–16.
  51. 51. Debbah M, Müller RR. MIMO channel modeling and the principle of maximum entropy. IEEE Trans Inf Theory. 2005;51(5).
  52. 52. Fearnhead P. Exact and efficient Bayesian inference for multiple changepoint problems. Stat Comput. 2006;16(2).
  53. 53. Chopin N. Inference and model choice for sequentially ordered hidden Markov models. J R Stat Soc Ser B Stat Methodol. 2007;69(2).
  54. 54. Fearnhead P. Computational methods for complex stochastic systems: A review of some alternatives to MCMC. Stat Comput. 2008;18(2).