Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Accounting for spatial trends in multi-environment diallel analysis in maize breeding

  • Igor Ferreira Coelho,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Departamento de Biologia Geral, Universidade Federal de Viçosa (UFV), Viçosa, Minas Gerais, Brazil

  • Marco Antônio Peixoto,

    Roles Conceptualization, Formal analysis, Supervision, Writing – review & editing

    Affiliation Departamento de Biologia Geral, Universidade Federal de Viçosa (UFV), Viçosa, Minas Gerais, Brazil

  • Tiago de Souza Marçal,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Supervision, Writing – review & editing

    Affiliation Departamento de Biologia, Universidade Federal de Lavras (UFLA), Lavras, Minas Gerais, Brazil

  • Arthur Bernardeli,

    Roles Data curation, Formal analysis, Methodology, Writing – original draft, Writing – review & editing

    Affiliation Departamento de Agronomia, Universidade Federal de Viçosa (UFV), Viçosa, Minas Gerais, Brazil

  • Rodrigo Silva Alves,

    Roles Methodology, Supervision, Writing – review & editing

    Affiliations Departamento de Biologia Geral, Universidade Federal de Viçosa (UFV), Viçosa, Minas Gerais, Brazil, Instituto Nacional de Ciência e Tecnologia do Café (INCT Café), Universidade Federal de Lavras (UFLA), Lavras, Minas Gerais, Brazil

  • Rodrigo Oliveira de Lima,

    Roles Investigation, Supervision, Writing – original draft, Writing – review & editing

    Affiliation Departamento de Agronomia, Universidade Federal de Viçosa (UFV), Viçosa, Minas Gerais, Brazil

  • Edésio Fialho dos Reis,

    Roles Conceptualization, Data curation, Project administration, Resources, Supervision, Writing – review & editing

    Affiliation Departamento de Agronomia, Universidade Federal de Jataí (UFJ), Jataí, Goiás, Brazil

  • Leonardo Lopes Bhering

    Roles Conceptualization, Investigation, Resources, Software, Supervision, Writing – review & editing

    leonardo.bhering@ufv.br

    Affiliation Departamento de Biologia Geral, Universidade Federal de Viçosa (UFV), Viçosa, Minas Gerais, Brazil

Abstract

Spatial trends represent an obstacle to genetic evaluation in maize breeding. Spatial analyses can correct spatial trends, which allow for an increase in selective accuracy. The objective of this study was to compare the spatial (SPA) and non-spatial (NSPA) models in diallel multi-environment trial analyses in maize breeding. The trials consisted of 78 inter-populational maize hybrids, tested in four environments (E1, E2, E3, and E4), with three replications, under a randomized complete block design. The SPA models accounted for autocorrelation among rows and columns by the inclusion of first-order autoregressive matrices (AR1 ⊗ AR1). Then, the rows and columns factors were included in the fixed and random parts of the model. Based on the Bayesian information criteria, the SPA models were used to analyze trials E3 and E4, while the NSPA model was used for analyzing trials E1 and E2. In the joint analysis, the compound symmetry structure for the genotypic effects presented the best fit. The likelihood ratio test showed that some effects changed regarding significance when the SPA and NSPA models were used. In addition, the heritability, selective accuracy, and selection gain were higher when the SPA models were used. This indicates the power of the SPA model in dealing with spatial trends. The SPA model exhibits higher reliability values and is recommended to be incorporated in the standard procedure of genetic evaluation in maize breeding. The analyses bring the parents 2, 10 and 12, as potential parents in this microregion.

Introduction

Maize (Zea mays L.) is the most cultivated crop worldwide [1]. Quantitative traits, such as grain yield and plant height, are controlled by several genes and are highly influenced by the environment [2]. In this scenario, the genotype-by-environment (G × E) interaction, also known as phenotypic plasticity [3, 4], plays an essential role in phenotypic expression and can lead to difficulties in genetic selection [4].

Diallel mating designs are used for progeny tests [5] and are widely adopted in plant breeding [69]. These mating designs allow the evaluation of general and specific combining abilities, which are additive genetic effects based on general combining ability (GCA), and dominance genetic effect based on specific combining ability (SCA) [10, 11]. The GCA is given by the mean of the performance of a particular individual in combination with many others, and the SCA is the genetic effect of a specific cross [11, 12].

There are few examples of diallel multi-environment trials (MET) in maize breeding [1315]. The mixed model methodology, or the residual maximum likelihood (REML)/best linear unbiased prediction (BLUP) [16, 17] procedure, has been widely adopted for analyzing MET in plant breeding [1821]. However, diallel analyses are still underused [22], even being an effective procedure for genetic evaluation [23].

In diallel analyses, the REML/BLUP procedure allows the estimation of additive and dominance genetic variances, as well as the narrow- and broad-sense heritabilities [2426]. In addition, it allows the prediction of additive and dominance genetic effects and the gains with selection, derived directly from parents and hybrid selection [22]. In MET analyses, the REML/BLUP procedure allows modeling different residuals [2730] and genetic [18, 3136] covariance structures and may be applied to unbalanced data [19, 37].

In diallel MET analyses, there are many sources of variation [38]. As main environmental sources, Burgueño et al. [39] cited soil structure, moisture, light interception, pathogen infections, and even crop management. To deal with spatial trends, one of the most adopted model structures is the separable first-order autoregressive (AR1⊗AR1), which considers a partitioned residual structure (e), composed of the correlated error (ξ) and, the independent random error (η) [40, 41], where e = ξ+η. For MET analyses, Smith et al. [31, 42] proposed the two-stage analysis, being stage 1 the modeling of AR1⊗AR1 spatial structure of residuals, to determine the best-fitted model. The output of stage 1 is imputed in stage 2, where the MET analysis is performed. This analysis has recently been applied in wheat [43] and soybean [44] breeding.

The two-stage analysis can present some issues, such as convergence failure and high computational demand, owing to the large amount of data [45]. Selective accuracy, heritability, and gain with selection can be used to compare different statistical models [46]. Selective accuracy indicates the precision of the predicted genotypic values [46], guides the strategies of breeding programs, and assists in decision-making [47].

There are several studies regarding diallel designs in maize breeding [14, 4850], but only a few of them performed spatial analyses, mainly diallel MET analyses [51]. In addition, the use of spatial analyses has been increasing in recent years [43, 45, 5257]. Therefore, the objective of this study was to compare the spatial (SPA) and non-spatial (NSPA) models for diallel MET analyses in maize breeding.

Material and methods

Genetic material

A diallel mating design (13 × 13), with no reciprocal crossing, was implemented (Fig 1 and S1 Table in S1 File). The genetic material consisted of 78 interpopulational hybrids evaluated in four environments (E1 to E4). Six commercial hybrids (F1), which were widely adopted in the region, were used as checks (S2 Table in S1 File).

thumbnail
Fig 1. Interpopulational hybrids development process.

The process occurred as following: (I) the most recommended commercial genetic materials (F1) were selected as parents 1 to 13; (II) these parents (F1) were cultivated in isolated fields; (III) the isolated fields were harvested and generated 12 F2 hybrids and one rustic genetic material; (IV) the F2 genetic materials were crossed artificially (controlled pollination) among themselves, in a diallel mating design; (V) as a result it was generated 78 interpopulational hybrids; and, (VI) the 78 interpopulational hybrids were cultivated in 4 environments (E1 to E4).

https://doi.org/10.1371/journal.pone.0258473.g001

Experimental data

The trials were carried out during the 2018 winter season at four locations in the southwestern Goiás State, Brazil: the environments, E1 and E2, in the municipality of Jataí, and the environments, E3 and E4, in the municipalities of Caiapônia and Mineiros, respectively. The planting and harvesting dates, geographic coordinates, altitudes, average temperatures, and precipitation data for each environment are presented in S3 Table and S1 Fig in S1 File [58].

According to Alvares et al. [59], the weather in the southwestern region Goiás State is wet and temperate, with dry winters and hot summers (Cwa). The average annual temperature is around 21.5°C and the average rainfall is between 1.400 and 2.000 mm year-1. Agricultural practices for maize crops in Brazil [60] were adopted in this study and irrigation was not applied in the field (S3 Table in S1 File). The experiments were conducted in a randomized complete block design with three replications and 44 plants per plot. Plots were 4m long, with four rows spaced 0.45 m apart and a plot size of 7.2 m2.

To evaluate the grain yield (GY) trait, the ears at physiological maturity were harvested and shelled. Then, the grain weight and grain moisture percentage were recorded, and the GY trait was calculated at 13% moisture. All phenotypic measurements were taken from the middle rows, leaving the two border rows.

Statistical analyses

The estimation of variance components and the prediction of genotypic values for the GY trait were made using the REML/BLUP procedure [16, 17], according to Gilmour et al. [61].

The individual analyses, considering the randomized complete block design with one observation per plot, was given by the following equation: [1] where y is the vector of phenotypic data, τ is the vector of replications and checks (assumed to be fixed) added to the general mean, ug is the vector of the GCA effect (assumed as random), us is the vector of the SCA effect (assumed as random), and e is the vector of residuals (random). Uppercase letters refer to the incidence matrices for these effects.

In this model, and e N(0,R); where is the GCA variance (related to additive genetic variance), is the SCA variance (related to dominance genetic variance), and R refers to the residual variance matrix. Models accounting for different residual variance structures considering the correlations among observations (rows and columns) were tested.

The joint analysis, considering the randomized complete block design with one observation per plot and four environments, was performed using the following equation: [2] where y is the vector of phenotypic data, τ is the vector of block-locations-checks combinations (assumed to be fixed), which comprises the effects of environment, replication within the environment, and checks added to the overall mean; ug is the vector of additive genetic effects (assumed as random); us is the vector of dominant genetic effects (assumed as random); uge is the vector of the interaction between additive genetic effects and environments (random), use is the vector of the interaction between dominance genetic effects and environments (random), and e is the vector of residuals (random). Uppercase letters refer to the incidence matrices for these effects.

In this model: , and eN(0,R); where is the GCA variance (related to additive genetic variance), is the SCA variance (related to dominance genetic variance), is the GCA by environment interaction variance, is the SCA by environment interaction variance, and R refers to the residual variance matrix. Models accounting for different residual variance structures considering the correlations among observations (rows and columns) were tested.

SPA and NSPA analyses were performed for comparison purposes.

Modelling non-genetic effects.

In the individual analyses, the residual (co)variance structure could account for heterogeneity within the trial. The best fitted models were tested by the inclusion or the absence of the spatial information (given by rows and columns) in the fixed and random parts of the model. The joint analysis accounted for heterogeneity across environments in residual and genetic (co)variance matrices in two different ways: (i) by assuming spatially independent observations and (ii) by allowing spatial autocorrelations among observations, indexed by rows and columns.

The presence of autocorrelation among rows and columns was considered by the inclusion of first-order autoregressive matrices (AR1) for rows (Σr(ρr)) and columns (Σc(ρc)). There are three possible models beyond the NSPA model (), where is the residual variance, Ir is the identity matrix of rows (r × r), and Ic is the identity matrix of columns (c × c). The baseline model did not consider any correlations among the rows and columns. The following two residual structures account for the correlation among rows (M2) and columns (M3), respectively, by the inclusion of the AR1 structure. M2 and M3 contemplate the residual structure as and , respectively. The last model accounts for correlations among rows and columns (M4), being , where is the SPA variance among columns and among rows, ⊗ is the Kronecker product, and, AR1x, or, Σx, represent first-order autoregressive correlation matrix, for x (row or column).

After selecting the best residual structure for each trial, the SPA information (row and column specifications) was included in the fixed and random parts of the model. First, the selected residual structure (MRx) of each trial was adopted as the first model, named Mx.1. Second, we tested the presence of fixed linear effects for rows and columns (Mx.2 and Mx.3). Third, we tested the presence of random linear effects for rows and columns (Mx.4 and Mx.5). Finally, the measurement error was added to the random part of the model (Mx.6).

The fixed part of the model (rows and columns) deals with rows and columns as covariates, and was developed as a regression of the phenotypic responses as a function of the row and column positions. In the random part of the model, the rows and columns effects were included to quantify the variability among rows and columns. Finally, we also tested the inclusion of an independent error regarding the measurement error, which is an independent residual variance, .

Fig 2 presents the flowchart of all steps to encounter the best SPA models for each trial by modeling the fixed and random effects and to select the best genetic structure when analyzing all trials jointly in a MET analysis.

thumbnail
Fig 2. Flowchart of the spatial analysis in each trial and jointly.

All steps of the spatial analysis by considering the spatial information (rows and columns) in the fixed and random parts of the model, in each trial. Following, it is considered all individual models to encounter the best joint model, and the most adequate genetic covariance structure in the multi-environment analysis.

https://doi.org/10.1371/journal.pone.0258473.g002

Modelling genetic effects.

Five different genetic variance structures were tested across different environments: diagonal (DIAG), compound symmetry (CS), heterogeneous correlation (CORH), and factor analytic of first (FA1) and second (FA2) orders.

Model selection, significance test, and gain with selection.

To select the best-fit model, the conditional Bayesian information criterion (BICc) [62] was used. The comparison among models was based on the parameter vector (), which stands for the covariance matrix to calculate the corrected LogL (LogLc), as follows: [3] where LogLc is the corrected LogL by ; p is the number of random parameters, q is the number of fixed parameters, and n is the model’s degrees of freedom.

The variances of the random effects were assessed using the likelihood ratio test (LRT) as follows [63]: [4] where LogLR is the logarithm of the maximum residual likelihood function of the reduced model. For the LRT test, chi-square statistics with 1 degree of freedom and 5% probability of Type I error were considered.

The additive genetic variance (), dominance genetic variance (), and broad-sense (H2) heritability (given by the additive genetic variance among hybrids) were estimated for each trial using the following equations [6466]: [5] [6] [7] where and are the estimates of the general and specific combining ability variances, respectively.

In addition to the genetic parameters by each trial, the additive genetic variance (), dominance genetic variance (), additive by environment interaction variance (), dominance by environment interaction variance (), and broad-sense (H2) heritability, were estimated considering the four environments in a joint analysis using the following equations [6466]: [5] [6] [8]

In addition, the genotypic correlation across environments (rg) and the overall mean (μg) for the GY trait was also calculated as follows: [9]

Selective accuracies ( and ) were estimated for individual and joint () analyses using the following expression [67]: [10] [11] [12] where PEV is the prediction error variance, according to the respective effect obtained by the diagonal elements of the inverse of the coefficient matrix of the mixed model equations.

The correlations among the NSPA and SPA models were evaluated using the Pearson correlation coefficient. Cohen’s Kappa coefficient (K) [68] was applied to estimate the agreement between the NSPA and SPA models. The agreement among the selected interpopulational hybrids was calculated as follows: [13] where A is the number of matching selected interpopulational hybrids between the NSPA and SPA model rankings, C is the number of selected interpopulational hybrids due to chance (C = bD, where b is the selection intensity = 0.13), and D is the number of selected interpopulational hybrids (10).

Contrasting SPA and NSPA analyses.

After selecting the individual and joint SPA models, further analyses were conducted using the NSPA model to expose the relative importance of the SPA model. Therefore, all analyses performed under the spatial approach were also re-made under a non-spatial approach, considering the independence among rows and columns.

Software.

All statistical analyses were carried out using ASReml v.4.1 [61], ASReml-R [69], and R software [70].

Results

Modelling non-genetic effects

In trials E1 and E2, the NSPA model (MR1), which did not consider any correlation among the rows, and among the columns, was the best fitted model (Table 1). In trial E3, the best fitted model (MR4) considered the existence of correlation among rows and columns in the residual effects, while in trial E4, the best fitted model (MR3) considered the correlation among columns in the residual effects (Table 1).

thumbnail
Table 1. One-dimensional and two-dimensional autoregressive spatial models fitted for the grain yield trait.

https://doi.org/10.1371/journal.pone.0258473.t001

Modelling fixed and random effects

The E1, E2, and E3 trials did not consider any other effect beyond the selected residual in the previous step; the E4 trial was the only trial that considered the inclusion of a linear effect of rows (M3.2), which improved the model’s goodness-of-fit (Table 2).

thumbnail
Table 2. Spatial models fitted for the grain yield trait.

https://doi.org/10.1371/journal.pone.0258473.t002

Modelling the genetic effects in multi-environment trials

After selecting the best-fitted model for each trial, a joint analysis was performed, considering the best-fitted models, indicated by BICc in the previous steps. In the joint analysis, the genetic (additive and dominance effects) covariance matrix was also modeled. The compound symmetry model (CS) was assigned as the best-fit model (Table 3).

thumbnail
Table 3. Models fitted for the multi-environment trial for the grain yield trait.

https://doi.org/10.1371/journal.pone.0258473.t003

Estimates of variance components, heritability, and coefficient of determination of the dominance effects

The LRT results for both the SPA and NSPA analyses are presented in Fig 3. We observed differences in the significance of the random effects, where the SPA and NSPA models were considered, beyond the additive by environment interaction and dominance genetic effects, the additive genetic effects (GCA) were significant. When analyzing each trial (Fig 3), the same pattern was observed for trials E3 and E4, where, after SPA, genetic effects were considered significant and were not considered by NSPA.

thumbnail
Fig 3. Likelihood ratio test for spatial and non-spatial analyses.

Likelihood ratio test (LRT), using the Chi-square test with 1 degree of freedom (p< 0.05) (represented by the dashed line), for the genetic (additive and dominance) effects and their interaction with environment (in joint analysis) for (a) non-spatial and (b) spatial analyses.

https://doi.org/10.1371/journal.pone.0258473.g003

The SPA and NSPA models used for the joint analyses fit the compound symmetry structure (Table 4). The NSPA model fitted a homogeneous residual structure, while SPA considered all required spatial information for each trial in the residual structure, as well as fixed and random effects.

thumbnail
Table 4. Variance components, genetic parameters and phenotypic mean for the four trials (individual and joint analyses), by non-spatial (NSPA) and spatial (SPA) analyses.

https://doi.org/10.1371/journal.pone.0258473.t004

Differences between SPA and NSPA analyses were observed for the estimates of additive and dominance genetic variances and residual variance in each trial (Table 4). In the joint analyses, where the additive by environment interaction () and dominance by environment interaction () variances were estimated, the NSPA and SPA also conducted different estimates (Table 4). Trials E1, E3, and E4 presented higher dominance genetic variances than additive genetic variances (Table 4). Conversely, trial E2 presented a higher additive genetic variance (Table 4). The joint analyses conducted to similar genetic variance estimates among NSPA and SPA, except for dominance by environment interaction variances that were 50% lower in NSPA (Table 4). The residual structure, which was homogeneous in NSPA, became heterogeneous in SPA, and it was possible to obtain residual variances per environment. Regarding the residual variance, E3 and E4 trials presented reduced values in SPA, achieving a reduction of 20% in the E4 trial when adopting the indicated SPA models over the NSPA model (Table 4).

In individual analyses, heritability estimates varied across trials. Considering the broad-sense heritabilities, trials E1 and E2 presented estimates superior to trials E3 and E4. In trial E3, SPA showed an increase of approximately 1.2% over the NSPA estimate. Broad-sense heritability presented estimates that were superior to 15% in all trials. In addition, trials E1 and E2 had the same estimates in the SPA and NSPA analyses, since they did not require any spatial information. In trials E3 and E4, the broad-sense heritability increased by 1.1% and 3%, respectively, due to the adoption of SPA instead of NSPA models.

The selective accuracies (, and, ) varied from NSPA to SPA analyses in each trial. E1 and E2 presented the same values in the NSPA and SPA analyses, which were superior to 0.80 the selective accuracy of genotypic effects and selective accuracy for genetic additive effects. The selective accuracies for dominance genetic effect were 0.60 and 0.52 in the trials E1 and E2, respectively. The trials that required spatial information (E3 and E4) had increased accuracy values from the NSPA to SPA analyses. The highest difference was observed in the trial E3, where the selective accuracy for genetic additive effects increased from 0.62 to 0.73 when considered the spatial information. The same behaviors were observed in the accuracy estimates in trials E3 and E4.

The joint analyses showed improvement of the genetic parameter estimates (heritability, accuracy, and genotypic correlation) from NSPA to SPA analyses. First, the NSPA analyses presented broad-sense heritability estimates of approximately 5%. For SPA analysis, the broad-sense heritability estimates were split per environment, and their values varied according to each environment, from 4% (E1 and E3) to over 7% (E4). Compared to the individual analysis, the broad-sense heritability decreased from 13% in trial E1 to 4.8% in joint analysis, but in trial E4, it increased from 6.6% to 7.5%. The genotypic correlation across environments increased from 0.71, in NSPA analysis, to 0.75, in SPA analysis.

Contrasting additive and dominance genetic effects rankings

We contrasted the additive and dominance genetic effects by NSPA and SPA analyses, in each trial, to check the differences when the spatial information was used. Considering the three best parents selected, the same selected parents were observed in trials E1 and E2 (P1, P10, and P12) (S4 Table in S1 File). In trial E3, only one parent (P6) was identified among the top three highest additive genetic values in both NSPA and SPA. In trial E4, two parents were the same, P6 and P12, in the NSPA and SPA analyses. Based on the predicted additive and dominance genetic effects, it was also possible to predict the genotypic effects of the interpopulational hybrids in each trial (S5 Table in S1 File) and the gains with selection (Fig 4) by NSPA and SPA analyses.

thumbnail
Fig 4.

Selection gains for 10 (A) and 5 (B) selected hydrids. Direct gain, considering spatial and non-spatial analyses, and indirect gain, given by joint analysis, in both approaches, in all four environments. The values between parentheses indicated the predicted mean of each environment (E1 = trial 1, E2 = trial 2, E3 = trial 3, E4 = trial 4).

https://doi.org/10.1371/journal.pone.0258473.g004

The predicted dominance genetic effects of the parents’ combinations (S6 Table in S1 File), represented by the interpopulational hybrids (Hi,j, where i and j the parents crossed). The top ten genetic materials were compared and detached. The E3 and E4 trials presented different crossings into the top 10 dominance genetic values. The Pearson correlation coefficients were equal to 1.0 in the E1 and E2 trials, while in the E3 and E4 trials, they were higher than 0.8 (S6 Table in S1 File). The Cohen’s Kappa coefficients presented the same results in the E1 and E2 trials (1.0), while in E3 and E4 trials they were around 0.60 (S6 Table in S1 File).

Discussion

The use of statistical methods in breeding programs enables the breeder to access all information available from simpler, as phenotypic data, to complex inputs, such as genomic relationships and spatial correlations [7173]. The accuracy tends to increase by modeling more sources of information, leading to more efficient use of resources [74].

According to Cooper et al. [75], there are some variations among plots that the breeder cannot predict a priori, even when the soil fertility is well controlled and the experimental design is appropriate. These spatial trends contribute to a large number of non-target signals [76]. Indifference or proper accounting for exogenous sources of variation decreases the accuracy [73, 77] and selection mistakes [41, 78].

The occurrence of diseases and insects within and across blocks, missing plots, irrigation misdistribution, and micro soil fertility spots can affect the residual independence among plots [79]. In this manner, trials E3 and E4, which required first-order autoregressive (AR1) modeling for rows and/or columns, also presented the lowest values for broad-sense heritability, selective accuracy, and phenotypic means for the GY trait.

So and Edwards [29] found, in a simulated balanced dataset study, similar results compared compound symmetry and unstructured models. This statement is relevant because of the genotypic correlation estimate across environments (0.75), by SPA analysis, where the compound symmetry structure, even being a more conservative structure [36], is capable of dealing with the level of G×E interaction effect without loss of predictive capability.

When comparing the NSPA and SPA models, the results demonstrated the importance of spatial analysis. Relevant studies on potato [80] and soybean [44] also presented satisfactory results when spatial trends were considered. Higher selective accuracies were encountered in the SPA models, reinforcing the importance and potential of the SPA analyses in crop breeding. Another simulation study of soybean breeding considering the accounting of spatial information in the model also presented gains in selective accuracy when autoregressive structures were used for spatial information [76].

The significance of the random effects tested by the LRT differed when spatial information was considered in both individual and joint analyses. The selective accuracies and genetic gains demonstrate the capacity of data correction, when necessary, by spatial analyses. Similar results were reported by Resende [11], who affirmed that the presence of spatial dependence is required when there is any kind of heterogeneity in the experiment, or even inside the block. Furthermore, by the adoption of SPA analyses, when applicable, it was possible to estimate higher values of genetic variances and parameters, which proves the importance of considering the best-fitted model. For instance, in trial E3, after modeling the residuals and including spatial information in the model, it was possible to obtain a higher portion of the additive and dominance genetic effects, turning them into significant effects. They were non-significant in the NSPA analyses and became significant in the SPA analyses. In addition, the genetic variances, heritabilities, and selective accuracies were maximized using SPA analyses. The genotype ranking also varied between the NSPA and SPA analyses. This information can be used in breeding programs for genetic selection purposes.

Selective accuracy demonstrates the correlation between the predicted and true genetic values. As mentioned previously, there was an increase from NSPA to SPA analyses. These results reinforce the superiority of SPA analyses, which maximize the selective accuracy when compared to NSPA analyses. Residual modeling considering the dependence among rows and/or columns decreased the residual variance within the trials. The consideration of two errors, dependent and independent, allows modeling of the dependence part and mitigates the residue in explained portions. This error investigation maximized the selective accuracies in the environments, which indicated the spatial models as the best-fitted model because of the possibility of modeling more environmental effects that were not considered in the NSPA analyses.

One of the main goals of plant breeding programs is to select the best genotypes to increase genetic gains [81]. In addition, to maximize the frequency of favorable alleles for a specific trait, plant breeders use the best-ranked genotypes in the selection process to develop hybrids [81]. Following this idea, the selection gains based on the predicted genotypic effects by the BLUP method confirm the corrective capacity of SPA analyses over the NSPA analyses. The predicted genetic values can be used to predict the breeding value of individuals [82]. The joint analysis allows the calculation of the additive genetic effect and its interaction with the environment when considering all four environments. This fact increases the accuracy of the selection gain, as it is based on the additive genetic effect free of the GxE interaction effect. Herein, we confirmed the corrective capacity of spatial analyses when considering the selection gains, as well the genotype ranking, which led to the selection of the right genotypes in the SPA analyses ranking.

Furthermore, the differences between SPA and NSPA analyses rankings, in trials E3 and E4 (which spatial information was used) reinforce the importance of this analysis. It was observed that the spatial information was useful for selecting different parents and interpopulational hybrids using SPA over NSPA analyses. In addition, using the Pearson correlation coefficient demonstrates the existence of different genetic materials when contrasting NSPA and SPA analyses, and the Cohen’s Kappa coefficient, which focuses on the selected genetic materials, confirms the difference among the selected.

By observing the differences between the additive and dominance genetic effects, some interesting conclusions can be drawn for use in maize breeding programs. For instance, in trial E1, the H10,12, from the best parents (by additive effects) also presented a high dominance genetic effect; however, in trial E2, this effect was negative, demonstrating the presence of dominance by environment interaction effect, and its importance to calculate it. P6, which showed one of the lowest additive genetic effects, when combined with P12 (one of the highest additive genetic effects) presented one of the highest dominance genetic effects. These characteristics were observed in all the trials. In trial E3, a significant difference was noted between variance estimates and genetic parameter values from NSPA and SPA analyses, which demonstrates the importance of the SPA analyses in this study. Focusing on P3 and P12 as an example, the P3 was into three better parents in NSPA analyses, and decreased significantly in SPA analyses, while P12 arose from a median rank (in NSPA) to the third better in SPA analyses. Following, when looking into the dominance genetic effects just of the same two parents cross (H3,12), it was in the top 10 in the NSPA analyses, and dropped to the 16th position in SPA analyses. As demonstrated, the spatial trend corrections affected the additive and dominance genetic effects and must be considered by the breeder in breeding program strategies. The selection will be made based on these genotypic, additive genetic, and dominance genetic effects values, and their accuracy will directly affect the subsequent cycles of the breeding process.

Identifying the best genetic material is the main goal of any breeding program. The main goal of this study was to identify the best potential parents and their combinations to form heterotic groups. The interpretation individually and jointly brings the parents 2, 10 and 12, as potential parents in this microregion composted by the four environments. Overall, they presented a high additive genetic effect (meaning they have a high proportion of favorable alleles to GY trait) and a high dominance genetic effect (seem to be contrasting and complementary), which would be very interesting in a hybrid composition, as the heterosis effect is expected in maize.

In this way, this study highlighted the importance of considering the dependence among rows and columns when required for specific trials and applied all additional information into the joint analysis, combined with the best genetic covariance structure. The adoption of SPA analyses led to better results, as observed in BICc, selective accuracy, heritability, and selection gains. This study confirmed the importance of SPA diallel MET analyses in maize breeding, which is a pioneer in the literature.

Conclusion

The adoption of MET permits a better understanding of the genetic effect, as it enables the breeder to account for the GxE interaction effect. This makes it possible to isolate the genetic effect or further explore the GxE interaction effect, depending on the goal. In addition, the diallel matching design enriches the study as it accesses the effects within the genotypic effect, both additive and dominance genetic effects, by crossing all parents among themselves. We found that the SPA analyses maximize the selective accuracy when compared to NSPA analyses, confirming the superiority of the SPA models. In addition, the LRT confirmed that the mismodeling process can lead to the misinterpretation of model effects, where some significant effects can be considered non-significant due to inappropriate analyses.

Based on the fitted information, it was possible to identify the genotypes with the highest potential for future heterotic groups that can be used as potential commercial hybrids. The parents 2, 10 and 12, were indicated as potential parents in this microregion. Owing to their high additive genetic effects, bringing favorable genes to the future parental lines, their positive combination, given by the dominance genetic effect, will be expressed as a strong heterosis in future hybrids.

Supporting information

S1 File. Tables and figure to aid the understanding of the manuscript’s sections (materials and methods, and results).

https://doi.org/10.1371/journal.pone.0258473.s001

(DOCX)

References

  1. 1. USDA. Economic Research Service. 2019.
  2. 2. Allard RW. Principles of plant breeding. São Paulo—SP: Edgard Blücher; 1971.
  3. 3. Resende MDV. Genética quantitativa e de populações. 1st ed. Suprema, Visconde do Rio Branco. Visconde do Rio Branco: Suprema; 2015.
  4. 4. Marchal A, Schlichting CD, Gobin R, Balandier P, Millier F, Muñoz F, et al. Genotype by environment interactions in forest tree breeding: review of methodology and perspectives on research and application. Plant Genome. 2019;13: 281–288.
  5. 5. Dong L, Xie Y, Sun X. Full-diallel-based analysis of genetic parameters for growth traits in Japanese larch (Larix kaempferi). New For. 2020;51: 261–271.
  6. 6. Melani MD, Carena MJ. Alternative Maize Heterotic Patterns for the Northern Corn Belt. Crop Sci. 2005;45: 2186–2194.
  7. 7. Rodrigues MC, Chaves LJ, Pacheco CAP. Heterosis in crosses among white grain maize populations with high quality protein. Pesqui Agropecu Bras. 2006;41: 59–66. pmid:16697280
  8. 8. Welcker C, Thé C, Andréau B, De Leon C, Parentoni SN, Bernal J, et al. Heterosis and Combining Ability for Maize Adaptation to Tropical Acid Soils—Implications for Future Breeding Strategies. Crop Sci Madison. 2005;45: 2405–2413.
  9. 9. Coelho IF, Alves RS, Rocha JR do AS de C, Peixoto MA, Teodoro LPR, Teodoro PE, et al. Multi-trait multi-environment diallel analyses for maize breeding. Euphytica. 2020;216: 144.
  10. 10. Zhang X, Lv L, Lv C, Guo B, Xu R. Combining Ability of Different Agronomic Traits and Yield Components in Hybrid Barley. Zhou M, editor. PLoS One. 2015;10: 9. pmid:26061000
  11. 11. Resende MDV de. Genética biométrica e estatística no melhoramento de plantas perenes. Colombo: Embrapa Florestas; 2002.
  12. 12. Sprague GF, Tatum LA. General vs. Specific Combining Ability in Single Crosses of Corn 1. Agron J. 1942;34: 923–932.
  13. 13. Wegary D, Vivek BS, Labuschagne MT. Combining ability of certain agronomic traits in quality protein maize under stress and nonstress environments in Eastern and Southern Africa. Crop Sci. 2014;54: 1004–1014.
  14. 14. Nardino M, Souza VQ De, Baretta D, Konflanz VA, Follmann DN, Carvalho IR, et al. Partial diallel analysis among maize lines for characteristics related to the tassel and the productivity. African J Agric Res. 2016;11: 15–20.
  15. 15. Faria MV, Mendes MC, Rossi ES, Junior OP, Rizzardi DA, Gralak E, et al. Análise dialélica da produtividade e do progresso da severidade de doenças foliares em híbridos de milho em duas densidades populacionais. Semin Ciência Agrárias. 2015;36: 123–134.
  16. 16. Henderson CR. Best linear unbiased estimation and prediction under a selection model. Biometrics. 1975;31: 423–447. Available: https://www.jstor.org/stable/pdf/2529430.pdf pmid:1174616
  17. 17. Patterson HD, Thompson R. Recovery of inter-block information when block sizes are unequal. Biometrika. 197158: 545–554.
  18. 18. Burgueño J, Crossa J, Cotes JM, Vicente FS, Das B. Prediction Assessment of Linear Mixed Models for Multienvironment Trials. Crop Sci. 2011;51: 944–954.
  19. 19. Smith AB, Cullis BR, Thompson R. The analysis of crop cultivar breeding and evaluation trials: An overview of current mixed model approaches. J Agric Sci. 2005;143: 449–462.
  20. 20. Piepho H, Möhring J. Selection in Cultivar Trials—Is It Ignorable? Crop Sci. 2006;46: 192–201.
  21. 21. Peixoto MA, Coelho IF, Evangelista JSPC, Alves RS, do Amaral Santos de Carvalho Rocha JR, Farias FJC, et al. Reaction norms‐based approach applied to optimizing recommendations of cotton genotypes. Agron J. 2020; 1–28.
  22. 22. Buzinaro R, Hugo G, Oliveira F De, Oliveira GHF de, Amaral CB do, Souza Junior CL, et al. Diallel mixed-model analyses to select superior maize parental lines for Azospirillum brasilense and nitrogen-use efficiency. Crop Breed Appl Biotechnol. 2018;18: 382–389. http://dx.doi.org/10.1590
  23. 23. Vivas M, Silveira SF, Viana AP, Amaral AT Jr., Cardoso DL, Pereira MG. Efficiency of circulant diallels via mixed models in the selection of papaya genotypes resistant to foliar fungal diseases. Genet Mol Res. 2014;13: 4797–4804. pmid:25062415
  24. 24. Alves RS, de Carvalho Rocha JR do AS, Teodoro PE, de Resende MDV, Henriques EP, Silva LA, et al. Multiple-trait BLUP: a suitable strategy for genetic selection of Eucalyptus. Tree Genet Genomes. 2018;14: 77.
  25. 25. Carvalho IR, de Pelegrin AJ, Szareski VJ, Ferrari M, da Rosa TC, Martins TS, et al. Diallel and prediction (REML/BLUP) for yield components in intervarietal maize hybrids. Genet Mol Res. 2017;16: 1–12. pmid:28873210
  26. 26. de Souza NO, Alves RS, Teodoro PE, Silva LA, Tardin FD, Tardin AB, et al. Single-and multiple-trait blup in genetic selection of parents and hybrids of grain sorghum. Rev la Fac Ciencias Agrar. 2019;51: 1–12.
  27. 27. Peixouto LS, Nunes JAR, Furtado DF. Factor analysis applied to the G + GE matrix via REML / BLUP for. Crop Breed Appl Biotechnol. 2016;16: 1–6.
  28. 28. Ogut F, Maltecca C, Whetten R, McKeand S, Isik F. Genetic Analysis of Diallel Progeny Test Data Using Factor Analytic Linear Mixed Models. For Sci. 2014;60: 119–127.
  29. 29. So Y-S, Edwards J. Predictive Ability Assessment of Linear Mixed Models in Multienvironment Trials in Corn. Crop Sci. 2011;51: 542–552.
  30. 30. Rocha JR do AS de C, Nunes KV, Carneiro ALN, Marçal TDS, Salvador FV, Carneiro PCS, et al. Selection of superior inbred progenies toward the common bean ideotype. Agron J. 2019;111: 1181–1189.
  31. 31. Smith A, Cullis B, Thompson R. Analyzing Variety by Environment Data Using Multiplicative Mixed Models and Adjustments for Spatial Field Trend. Biometrics. 2001;57: 1138–1147. pmid:11764254
  32. 32. Burgueño J, de los Campos G, Weigel K, Crossa J. Genomic Prediction of Breeding Values when Modeling Genotype × Environment Interaction using Pedigree and Dense Molecular Markers. Crop Sci. 2012;52: 707–719.
  33. 33. Piepho H. Analyzing Genotype-Environment Data by Mixed Models with Multiplicative. Int Biometric Soc. 1997;53: 761–766.
  34. 34. Piepho HP. Ridge Regression and Extensions for Genomewide Selection in Maize. Crop Sci. 2009;49: 1165–1176.
  35. 35. Crossa J, Yang R-C, Cornelius PL. Studying crossover genotype × environment interaction using linear-bilinear models and mixed models. J Agric Biol Environ Stat. 2004;9: 362–380.
  36. 36. Crossa J, Burgueño J, Cornelius PL, McLaren G, Trethowan R, Krishnamachari A. Modeling Genotype × Environment Interaction Using Additive Genetic Covariances of Relatives for Predicting Breeding Values of Wheat Genotypes. Crop Sci. 2006;46: 1722–1733.
  37. 37. Fritsche-Neto R, Gonçalves MC, Vencovsky R, Souza Junior CL. Prediction of genotypic values of maize hybrids in unbalanced experiments. Crop Breed Appl Biotechnol. 2010;10: 32–39.
  38. 38. Selle ML, Steinsland I, Hickey JM, Gorjanc G. Flexible modelling of spatial variation in agricultural field trials with the R package INLA. Theor Appl Genet. 2019;132: 3277–3293. pmid:31535162
  39. 39. Burgueño J, Glaz B, Yeater KM. Chapter 12: Spatial Analysis of Field Experiments. Applied statistics in agricultural, biological, and environmental sciences. 2018. pp. 319–344.
  40. 40. Cullis BR, Gleeson AC. Spatial Analysis of Field Experiments—An Extension Spatial Analysis to Two Dimensions. Biometrics. 1991;47: 1449–1460.
  41. 41. Gilmour AR, Cullis BR, Verbyla AP. Accounting for Natural and Extraneous Variation in the Analysis of Field Experiments. J Agric Biol Environ Stat. 1997;2: 269–293.
  42. 42. Smith A, Cullis B, Gilmour A. The analysis of crop variety evaluation data in Australia. Aust New Zeal J Stat. 2001;43: 129–145.
  43. 43. Gogel B, Smith A, Cullis B. Comparison of a one- and two-stage mixed model analysis of Australia’s National Variety Trial Southern Region wheat data. Euphytica. 2018;214: 44.
  44. 44. Bernardeli A, Rocha JRAS de C, Borem A, Lorenzoni R, Aguiar R, Silva JNB, et al. Modeling spatial trends and enhancing genetic selection: An approach to soybean seed composition breeding. Crop Sci. 2020; csc2.20364.
  45. 45. Velazco JG, Rodríguez-Álvarez MX, Boer MP, Jordan DR, Eilers PHC, Malosetti M, et al. Modelling spatial trends in sorghum breeding field trials using a two-dimensional P-spline mixed model. Theor Appl Genet. 2017;130: 1375–1392. pmid:28374049
  46. 46. Resende MDV de, Duarte JB. Precisão e controle de qualidade em experimentos de avaliação de cultivares. Pesqui Agropecuária Trop. 2007;37: 182–194.
  47. 47. Lima RO de, Borém A. Melhoramento de Milho. Viçosa—MG: Editora UFV; 2018.
  48. 48. Oliboni R, Faria MV, Neumann M, Resende JTV, Battistelli GM, Tegoni RG, et al. Análise dialélica na avaliação do potencial de híbridos de milho para a geração de populações- base para obtenção de linhagens. Semin Agrar. 2013;34: 7–18.
  49. 49. Meirelles WF, Parentoni SN, Guimarães LJM, de Oliveira Guimarães PE, Pacheco CAP, de Oliveira AC, et al. Análise dialélica de linhagens de milho quanto à responsividade ao fósforo e à sua eficiência de uso. Pesqui Agropecu Bras. 2016;51: 224–232.
  50. 50. Karim ANMS, Ahmed S, Akhi AH, Talukder MZA, Mujahidi TA. Combining Ability and Heterosis study in maize (Zea mays L.) Hybrids at different environments in Bangladesh. Bangladesh J Agril Res. 2018;7122: 125–134.
  51. 51. Long JK, Bänziger M, Smith ME. Diallel analysis of grain iron and zinc density in southern African-adapted maize inbreds. Crop Sci. 2004;44: 2019–2026.
  52. 52. Sussy M, Ola H, Maria FAB, Niklas BO, Cecilia OM, Willis OK, et al. Micro-spatial analysis of maize yield gap variability and production factors on smallholder farms. Agric. 2019;9: 1–23.
  53. 53. Qiao CG, Basford KE, DeLacy IH, Cooper M. Evaluation of experimental designs and spatial analyses in wheat breeding trials. Theor Appl Genet. 2000;100: 9–16.
  54. 54. Bian L, Zheng R, Su S, Lin H, Xiao H, Wu HX, et al. Spatial analysis increases efficiency of progeny testing of Chinese fir. J For Res. 2017;28: 445–452.
  55. 55. Fukatsu E, Hiraoka Y, Kuramoto N, Yamada H, Takahashi M. Effectiveness of spatial analysis in Cryptomeria japonica D. Don (sugi) forward selection revealed by validation using progeny and clonal tests. Ann For Sci. 2018;75.
  56. 56. Chen Z, Helmersson A, Westin J, Karlsson B, Wu HX. Efficiency of using spatial analysis for Norway spruce progeny tests in Sweden. Ann For Sci. 2018;75: 13.
  57. 57. Liu SM, Constable GA, Cullis BR, Stiller WN, Reid PE. Benefit of spatial analysis for furrow irrigated cotton breeding trials. Euphytica. 2015;201: 253–264.
  58. 58. DigitalGlobe. Google Earth. http://www.earth.google.com [August 01, 2021]; 2021.
  59. 59. Alvares CA, Stape JL, Sentelhas PC, de Moraes Gonçalves JL, Sparovek G. Köppen’s climate classification map for Brazil. Meteorol Zeitschrift. 2013;22: 711–728.
  60. 60. Cruz JC, Karam D, Monteiro MAR, Magalhães PC. A cultura do milho. Sete Lagoas, MG: Embrapa Milho e Sorgo; 2008.
  61. 61. Gilmour AR, Gogel BJ, Cullis BR, Welham Sj, Thompson R. ASReml user guide release 4.1 structural specification. Hemel hempstead VSN Int ltd. 2015.
  62. 62. Verbyla AP. A note on model selection using information criteria for general linear models estimated using REML. Aust New Zeal J Stat. 2019;61: 39–50.
  63. 63. Wilks SS. The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses. Ann Math Stat. 1938;9: 60–62. Available: https://www.jstor.org/stable/pdf/2957648.pdf
  64. 64. Griffing B. Concept of General and Specific Combining Ability in relation to diallel crossing systems. Aust J Biol Sci. 1956;9: 463–493. Available: https://www.publish.csiro.au/bi/pdf/BI9560463
  65. 65. Pádua JMV, Das Graças Dias KO, Pastina MM, de Souza JC, Queiroz VAV, da Costa RV, et al. A multi-environment trials diallel analysis provides insights on the inheritance of fumonisin contamination resistance in tropical maize. Euphytica. 2016;211: 277–285.
  66. 66. Hallauer AR, Carena JM, Miranda Filho JB de. Quantitative genetics in maize breeding. New York: Springer, 2010. 500 p. New York: Springer; 2010.
  67. 67. Resende MDV. Software Selegen-REML/BLUP: a useful tool for plant breeding. Crop Breed Appl Biotechnol. 2016;16: 330–339.
  68. 68. Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas. 1960;20: 37–46.
  69. 69. Butler DG, Cullis BR, Gilmour AR, Gogel BJ. ASReml-R reference manual (version 3). State Queensland, Dep Prim Ind Fish Brisbane, Qld. 2018.
  70. 70. R Development Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2020.
  71. 71. Lado B, Matus I, Rodríguez A, Inostroza L, Poland J, Belzile F, et al. Increased Genomic Prediction Accuracy in Wheat Breeding Through Spatial Adjustment of Field Trial Data. G3 Genes|Genomes|Genetics. 2013;3: 2105–2114. pmid:24082033
  72. 72. Rue H, Martino S, Chopin N. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. J R Stat Soc Ser B (Statistical Methodol. 2009;71: 319–392.
  73. 73. Oakey H, Verbyla A, Pitchford W, Cullis B, Kuchel H. Joint modeling of additive and non-additive genetic line effects in single field trials. Theor Appl Genet. 2006;113: 809–819. pmid:16896718
  74. 74. Bernal-Vasquez AM, Möhring J, Schmidt M, Schönleben M, Schön CC, Piepho HP. The importance of phenotypic data analysis for genomic prediction—a case study comparing different spatial models in rye. BMC Genomics. 2014;15: 1–17. pmid:24382143
  75. 75. Cooper M, Messina CD, Podlich D, Totir LR, Baumgarten A, Hausmann NJ, et al. Predicting the future of plant breeding: Complementing empirical evaluation with genetic prediction. Crop Pasture Sci. 2014;65: 311–336.
  76. 76. Borges da Silva ÉD, Xavier A, Faria MV. Joint Modeling of Genetics and Field Variation in Plant Breeding Trials Using Relationship and Different Spatial Methods: A Simulation Study of Accuracy and Bias. Agronomy. 2021;11: 1397.
  77. 77. Hunt CH, Smith AB, Jordan DR, Cullis BR. Predicting Additive and Non-additive Genetic Effects from Trials Where Traits Are Affected by Interplot Competition. J Agric Biol Environ Stat. 2013;18: 53–63.
  78. 78. Stroup WW, Baenziger PS, Mulitze DK. Removing Spatial Variation from Wheat Yield Trials: A Comparison of Methods. Crop Sci. 1994;34: 62–66.
  79. 79. Ramalho MAP, Ferreira DF, Oliveira AC. Experimentação em genética e melhoramento de plantas. 3rd ed. Lavras: Editora UFLA; 2005.
  80. 80. Andrade MHML, Fernandes Filho CC, Fernandes MO, Bastos AJR, Guedes ML, Marçal T de S, et al. Accounting for spatial trends to increase the selection efficiency in potato breeding. Crop Sci. 2020;60: 2354–2372.
  81. 81. Cobb JN, Juma RU, Biswas PS, Arbelaez JD, Rutkoski J, Atlin G, et al. Enhancing the rate of genetic gain in public-sector plant breeding programs: lessons from the breeder’s equation. Theor Appl Genet. 2019;132: 627–645. pmid:30824972
  82. 82. Arnhold E, Mora F, Silva RG, Good-god PI V, Rodovalho MA. Evaluation of top-cross popcorn hybrids using mixed linear model methodology. Chil J Agric Res. 2009;69: 46–53.