Recommendation of Tahiti acid lime cultivars through Bayesian probability models

Renan Garcia Malikouski; Filipe Manoel Ferreira; Saulo Fabrício da Silva Chaves; Evellyn Giselly de Oliveira Couto; Kaio Olimpio das Graças Dias; Leonardo Lopes Bhering

doi:10.1371/journal.pone.0299290

Abstract

Probabilistic models enhance breeding, especially for the Tahiti acid lime, a fruit essential to fresh markets and industry. These models identify superior and persistent individuals using probability theory, providing a measure of uncertainty that can aid the recommendation. The objective of our study was to evaluate the use of a Bayesian probabilistic model for the recommendation of superior and persistent genotypes of Tahiti acid lime evaluated in 12 harvests. Leveraging the Hamiltonian Monte Carlo sampling algorithm, we calculated the probability of superior performance (superior genotypic value) and the probability of superior stability (reduced variance of the genotype-by-harvest interaction) of each genotype. The probability of superior stability was compared to a measure of persistence estimated from genotypic values predicted using a frequentist model. Our results demonstrated the applicability and advantages of the Bayesian probabilistic model, which yielded parameters similar to those of the frequentist model while providing further information about the probabilities associated with genotype performance and stability. Genotypes G15, G4, G18, and G11 emerged as superior in performance, whereas G24, G7, G13, and G3 were identified as the most stable. This study highlights the usefulness of Bayesian probabilistic models for the recommendation of fruit tree cultivars.

Citation: Malikouski RG, Ferreira FM, Chaves SFdS, Couto EGdO, Dias KOdG, Bhering LL (2024) Recommendation of Tahiti acid lime cultivars through Bayesian probability models. PLoS ONE 19(3): e0299290. https://doi.org/10.1371/journal.pone.0299290

Editor: Mehdi Rahimi, KGUT: Graduate University of Advanced Technology, ISLAMIC REPUBLIC OF IRAN

Received: November 19, 2023; Accepted: February 8, 2024; Published: March 5, 2024

Copyright: © 2024 Malikouski et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data relevant to this study are available from GitHub at: https://github.com/malikouskirg/PONE-D-23-38484.

Funding: This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) - Finance Code 001, Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG), Fundação de Amparo à Pesquisa do Estado do Espírito Santo (FAPES), and Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP). FMF was supported by FAPESP (São Paulo Research Foundation, Grant 2023/04881-3), and LLB was supported by CNPq (Research Productivity Fellowship, Grant 310610/2021-4).

Competing interests: The authors have declared that no competing interests exist.

Introduction

Breeding perennial fruit crops presents a set of challenges. The 3- to 5-year juvenile phase and the variable expression of quantitative traits over time can delay and mislead the selection of superior genotypes [1]. This differential performance over time can reflect the genotype-by-harvest interaction (GHI). The GHI in perennial species refers to the variation in gene expression and, consequently, in the phenotypic traits of a plant due to the different environmental conditions and agricultural practices that occur in each planting cycle [2]. Therefore, in perennial fruit breeding, repeated measures on the same plant over time are important, which increases the costs and the duration of the breeding cycle [3]. In the presence of complex GHI, breeders must consider both the productivity and the stability of the genotypes over harvests [4–6].

In Tahiti acid lime (Citrus latifolia Tanaka L.), several strategies have been proposed to address the challenges related to the extended juvenile phase and the presence of GHI. Grafting, a well-established technique in citrus propagation, influences vegetative growth and shortens the juvenile phase, facilitating earlier evaluations in breeding programs [7, 8]. Furthermore, research has demonstrated that employing repeatability models allows for accurate genotype selection in Tahiti acid lime after just four measurements [9]. To address the interdependence of measurements taken on the same individual over time, approaches like random regression models using Legendre polynomials have been applied [10]. Random regression models enable the estimation of the genotypic trajectory of evaluated treatments over time.

A recent methodology proposed by Dias et al. [11] can help to optimize the recommendation of Tahiti acid lime cultivars, since it uses probability concepts. This method aims to reduce the risk associated with the selection of a given genotype, which is the daily dilemma of farmers, who seek to guide their actions to minimize the risk of low production for a given crop [12]. Furthermore, plant breeding is increasingly focused on developing genotypes capable of coping with the current changes in climate, as the impacts of climate change become an additional pivotal factor to consider in the agricultural sector [13]. Dias et al. [11] proposed using Bayesian probability concepts to assist in the selection of genotypes that gather favorable alleles for performance and stability across environments and harvests. Furthermore, the method allows a straightforward recommendation based on the probability of a given genotype being selected considering its performance and stability, and a pairwise comparison of the probabilities of the evaluated selection candidates. This methodology was proposed for the multiple-location context. Nevertheless, we believe that the same ideas are valid for the multi-harvest context. The objective of our study was to evaluate the use of a Bayesian probabilistic model for the recommendation of superior and persistent genotypes of Tahiti acid lime evaluated in 12 harvests.

Materials and methods

Trial and plant material

We evaluated 24 combinations of rootstock and scion of Tahiti acid lime for fruit yield, expressed in kg of fruit per plant (kg/tree). The hybrids Citrumelo swingle (Citrus paradisi × Poncirus trifoliata) and Citrandarin ‘riverside’ (Citrus sunki × Poncirus trifoliata) were used as rootstocks for scions of 12 clones of Tahiti acid lime (Table 1). The plant materials came from the Active Germplasm Bank of Embrapa Mandioca e Fruticultura [14]. Each combination was considered a different selection candidate. The trial was laid out in a randomized complete block design with four replications. Each plot was composed of three plants. The inter-row and inter-range spacings were 6 m and 3 m, respectively. We performed the statistical analyses described in the next section using the plot means.

Table 1. Codes for 24 combinations of rootstock and scion of Tahiti acid lime.

https://doi.org/10.1371/journal.pone.0299290.t001

The trial was established in July 2015 in the municipality of São Mateus, Espírito Santo, Brazil (18°48’21"S, 39°53’30"W, 35 m altitude). The work was conducted on a farm of the Bello Fruit® company, through a partnership between this fruit production and export company and the Universidade Federal do Espírito Santo. The experimental region has a rainy season in summer and a dry season in winter, and is classified as Aw according to the Köppen classification [15]. The precipitation and temperature during the experimental period are shown in S1 Fig of the Supplementary Material. The data were collected between July 2017 and September 2020 and consisted of 12 harvests carried out at the following numbers of days after planting: 736, 808, 861, 918, 972, 1083, 1200, 1249, 1415, 1568, 1633 and 1867.

Statistical analyses

We applied the probabilistic approach of Dias et al. [11] by fitting two Bayesian models using the rstan package [16] and the ProbBreed package [17]. The first model had a homogeneous residual variance (B-ID) and the following conditional normal likelihood:

$$y_{ijk} \sim N\left(E[y_{ijk}], \sigma\right)$$

where:

$$E[y_{ijk}] = \mu + g_i + r_j + h_k + gh_{ik} + p_{ij}$$

in which E[y_ijk] is the expectation of the phenotype from the i^th genotype, evaluated in the j^th block, at the k^th harvest; μ is the overall mean; g_i is the genotypic effect; r_j is the block effect; h_k is the harvest effect; gh_ik is the genotype-by-harvest interaction; p_ij is the permanent environmental effect; and σ is the residual scale parameter.

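The prior probability distribution of each parameter of the model was defined as:

$$\mu \sim N(0, \sigma_{[\mu]}), \quad g_i \sim N(0, \sigma_{[g]}), \quad r_j \sim N(0, \sigma_{[r]}), \quad h_k \sim N(0, \sigma_{[h]}),$$

$$gh_{ik} \sim N(0, \sigma_{[gh]}), \quad p_{ij} \sim N(0, \sigma_{[p]}), \quad \sigma \sim \text{HalfCauchy}(0, \sigma_{[\sigma]})$$

where N(0,σ_[α]) and HalfCauchy(0,σ_[α]) represent the normal and half-Cauchy distributions, respectively, with mean equal to zero and with different scale parameters. The following hyperpriors were considered for the respective parameters:

$$\sigma_{[\mu]}, \sigma_{[g]}, \sigma_{[r]}, \sigma_{[h]}, \sigma_{[gh]}, \sigma_{[p]}, \sigma_{[\sigma]} \sim \text{HalfCauchy}(0, \phi)$$

where φ represents a predetermined global hyperparameter (φ = 10 × max(y)), defined in such a way that it yields weakly informative second-level hyperpriors. Therefore, the data dominated the posterior distributions [18]. The half-Cauchy distribution is restricted to positive values and is often recommended as a prior distribution when modeling variance parameters [19].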

The second Bayesian model had heterogeneous residual variances (B-DG). This model has the same specification as B-ID, except that each harvest has its own residual scale, σ_k ~ HalfCauchy(0,σ_[σk]). We selected the best-fitting model via the Watanabe-Akaike Information Criterion 2 (WAIC2) [18]. We used the Hamiltonian Monte Carlo algorithm with four Markov chains with 4000 samples, a thinning interval of 1, and 50% burn-in.
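To illustrate the model comparison step, the sketch below computes WAIC with the variance-based penalty (the "2" in WAIC2) from a matrix of pointwise log-likelihood values, following the definition in [18]. The function names and the layout of the `log_lik` input are assumptions made for this example and are not taken from the authors' scripts; in practice the matrix would be extracted from the fitted Stan model.

```python
import math
from statistics import variance


def log_mean_exp(values):
    """Numerically stable log of the mean of exp(values)."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values) / len(values))


def waic2(log_lik):
    """WAIC on the deviance scale.

    log_lik[s][n] is the log-likelihood of observation n under
    posterior draw s (hypothetical layout: draws in rows).
    """
    n_obs = len(log_lik[0])
    lppd = 0.0      # log pointwise predictive density
    p_waic2 = 0.0   # effective number of parameters (variance form)
    for n in range(n_obs):
        column = [draw[n] for draw in log_lik]
        lppd += log_mean_exp(column)
        p_waic2 += variance(column)  # sample variance across draws
    return -2.0 * (lppd - p_waic2)
```

Lower values indicate better expected predictive fit, which is the criterion used here to choose between B-ID and B-DG.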

Convergence diagnostics.

The potential scale reduction factor (R̂) was used to assess the convergence of the Markov chain Monte Carlo (MCMC) sampling. This metric indicates whether the chains have mixed sufficiently and the estimated model parameters have reached a stable distribution. The closer R̂ is to 1, the better the mixing and convergence [18]. Greater values imply that more iterations are needed.
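The idea behind this diagnostic can be sketched as follows: each chain is split in half, and the between-chain variance is compared with the within-chain variance, along the lines of the split-chain formulation described in [18]. This is a simplified illustration with a hypothetical function name; the value reported by Stan may be computed with a more refined variant.

```python
from statistics import mean, variance


def split_rhat(chains):
    """Potential scale reduction factor for one parameter.

    chains is a list of equally long lists of post-warmup draws.
    """
    half = len(chains[0]) // 2
    halves = []
    for chain in chains:
        halves.append(chain[:half])
        halves.append(chain[half:2 * half])
    n = half
    chain_means = [mean(h) for h in halves]
    within = mean([variance(h) for h in halves])   # W
    between = n * variance(chain_means)            # B
    var_plus = (n - 1) / n * within + between / n  # pooled variance estimate
    return (var_plus / within) ** 0.5
```

Chains exploring different regions inflate the between-chain term and push the statistic well above 1, which is the situation the text describes as requiring more iterations.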

We also conducted a graphical analysis to visually assess how well the data generated by our model align with the true generative process of the observed data. This involved creating samples (referred to as "y_gen") from the fitted models using ancestral sampling from the conditional joint distribution and then plotting these samples against the observed data. Additionally, we employed posterior predictive p-values to gauge how closely the summary statistics (maximum, minimum, median, mean, and standard deviation) of the data generated by the fitted models resembled those of the observed data [18]. For instance, when considering the maximum statistic, we defined the Bayesian p-value as follows: P_max = Pr(T(y_gen,θ) ≥ T(y,θ)|y), where T is the test statistic. The closer the Bayesian p-values are to 0.5, the greater the similarity between the statistics derived from the generated data and those from the observed data.
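A minimal sketch of this check, assuming the replicated datasets are available as one list per posterior draw: the Bayesian p-value is the share of replicated datasets whose statistic is at least as large as the observed one. The function name and input layout are hypothetical, and the statistics used here depend on the data only.

```python
def bayesian_p_value(y_obs, y_gen, stat):
    """Posterior predictive p-value Pr(T(y_gen) >= T(y) | y).

    y_obs: observed values.
    y_gen: list of replicated datasets, one per posterior draw.
    stat:  function computing the test statistic (e.g. max, min).
    """
    t_obs = stat(y_obs)
    hits = sum(1 for rep in y_gen if stat(rep) >= t_obs)
    return hits / len(y_gen)
```

For example, `bayesian_p_value(y, y_gen, max)` corresponds to P_max above, and passing `min` or `statistics.median` gives the analogous checks for the other statistics.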

Probability of superior performance and genotypic stability.

The probability metrics for both performance and stability used in this study were proposed by Dias et al. [11] and are implemented in the ProbBreed R package [17]. Aiming to select the top four genotypes, we sampled the posterior distribution of the marginal genotypic values given the observed phenotypic values. In each sample, we ranked the genotypes in descending order of posterior genotypic values (g_i^s). Then, we counted the number of events where a given genotype appeared in the subset of superior genotypes (Ω), i.e., among the top four (4 of 24 candidates, a selection proportion of approximately 17%). The choice of four genotypes reflects a breeding program that aims to select multiple superior materials to be recommended across various regions. Moreover, diversifying varieties on a single farm is important for the sustainability of Tahiti acid lime cultivation. The probability of superior performance of a genotype is given by the number of samples in which it appeared in Ω divided by the total number of samples. In summary:

$$\Pr(g_i \in \Omega \mid y) = \frac{1}{S} \sum_{s=1}^{S} I(g_i^{s} \in \Omega \mid y)$$

where S (s = 1,2,…,S) is the number of samples and I(g_i^s ∈ Ω|y) is an indicator variable that maps failure (0, if g_i^s ∉ Ω) or success (1, if g_i^s ∈ Ω).
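The counting procedure described above can be sketched in a few lines. This is an illustration only, not the ProbBreed implementation; the function name and the `samples` layout (one list of genotypic values per posterior draw) are assumptions.

```python
def prob_superior_performance(samples, top_n):
    """Share of posterior draws in which each genotype ranks in the top_n.

    samples[s][i] is the genotypic value of genotype i in draw s.
    """
    n_gen = len(samples[0])
    counts = [0] * n_gen
    for draw in samples:
        # Rank genotype indices by descending genotypic value.
        ranked = sorted(range(n_gen), key=lambda i: draw[i], reverse=True)
        for i in ranked[:top_n]:
            counts[i] += 1  # indicator equals 1: genotype i is in the subset
    return [c / len(samples) for c in counts]
```

With 24 candidates and `top_n=4`, the returned probabilities sum to 4, since exactly four genotypes are selected in every draw.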

We calculated the stability across harvests based on the variance of the GHI effects. Genotypes with lower GHI variance (var[gh_ik]) tend to be more stable, showing less fluctuation in their performance between harvests. One can draw a parallel between this metric and the frequentist persistence (see the next section). Following the same idea described in the previous paragraph, we calculated the probability of a given candidate belonging to the subset of the four genotypes with the smallest var[gh_ik], using the indicator I(var[gh_ik]^s ∈ Ω|y), where Ω now denotes the subset of the four most stable genotypes. The probability of superior stability was given as follows:

$$\Pr(\mathrm{var}[gh_{ik}] \in \Omega \mid y) = \frac{1}{S} \sum_{s=1}^{S} I(\mathrm{var}[gh_{ik}]^{s} \in \Omega \mid y)$$
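The stability counterpart differs from the performance calculation in two ways: the quantity ranked in each draw is the variance of the interaction effects across harvests, and smaller is better. The sketch below assumes a hypothetical nested layout `gh_samples[s][i][k]` (draw, genotype, harvest) and uses the sample variance; with the same number of harvests for every genotype, the choice of variance estimator does not change the ranking.

```python
from statistics import variance


def prob_superior_stability(gh_samples, top_n):
    """Share of draws in which each genotype is among the top_n most stable.

    gh_samples[s][i] is the list of genotype-by-harvest interaction
    effects of genotype i across harvests in posterior draw s.
    """
    n_gen = len(gh_samples[0])
    counts = [0] * n_gen
    for draw in gh_samples:
        variances = [variance(draw[i]) for i in range(n_gen)]
        # Ascending order: the smallest variance is the most stable.
        ranked = sorted(range(n_gen), key=lambda i: variances[i])
        for i in ranked[:top_n]:
            counts[i] += 1
    return [c / len(gh_samples) for c in counts]
```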

We used the ideas previously described for the two probabilities to perform pairwise comparisons between selection candidates. The goal is to investigate the chances of a given genotype being superior, whether in performance or stability, to its peers. The pairwise probabilities of superior performance and the pairwise probabilities of superior stability were given by, respectively:

$$\Pr(g_i > g_{i'} \mid y) = \frac{1}{S} \sum_{s=1}^{S} I(g_i^{s} > g_{i'}^{s} \mid y)$$

and

$$\Pr(\mathrm{var}[gh_{ik}] < \mathrm{var}[gh_{i'k}] \mid y) = \frac{1}{S} \sum_{s=1}^{S} I(\mathrm{var}[gh_{ik}]^{s} < \mathrm{var}[gh_{i'k}]^{s} \mid y)$$

where I(g_i^s > g_i′^s|y) is an indicator variable mapping success if g_i has a higher genotypic value than g_i′, or failure otherwise; and I(var[gh_ik]^s < var[gh_i′k]^s|y) is another indicator variable mapping success if gh_ik has lower variance than gh_i′k, or failure otherwise.
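A pairwise comparison reduces to counting, for each ordered pair of genotypes, the draws in which the first has the larger value. The sketch below is an illustration under the same assumed `samples` layout (one list of values per posterior draw); the function name is hypothetical.

```python
def pairwise_prob_superior(samples):
    """Matrix of Pr(value of i > value of j | y) over posterior draws.

    samples[s][i] is the value of genotype i in draw s.
    Diagonal entries are left as None (a genotype is not compared
    with itself).
    """
    n_gen = len(samples[0])
    n_draws = len(samples)
    probs = [[None] * n_gen for _ in range(n_gen)]
    for i in range(n_gen):
        for j in range(n_gen):
            if i != j:
                wins = sum(1 for draw in samples if draw[i] > draw[j])
                probs[i][j] = wins / n_draws
    return probs
```

For the stability version, the same function can be applied to the negated per-draw variances, so that "greater" means "less variable". When ties do not occur, the entries for (i, j) and (j, i) sum to 1, which is why the pairwise heat map has two complementary halves.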

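The probability of superior performance within harvests and the pairwise probability of superior performance within harvests can be obtained by the following equations, respectively:

$$\Pr(g_{ik} \in \Omega_k \mid y) = \frac{1}{S} \sum_{s=1}^{S} I(g_{ik}^{s} \in \Omega_k \mid y)$$

and

$$\Pr(g_{ik} > g_{i'k} \mid y) = \frac{1}{S} \sum_{s=1}^{S} I(g_{ik}^{s} > g_{i'k}^{s} \mid y)$$

where g_ik is the genotypic value of the i^th genotype in harvest k; Ω_k is the subset of superior genotypes in harvest k; I(g_ik^s > g_i′k^s|y) is an indicator variable mapping success if g_ik has a higher genotypic value than g_i′k in harvest k, or failure otherwise; and I(var[gh_ik]^s < var[gh_i′k]^s|y) is another indicator variable mapping success if gh_ik has lower variance than gh_i′k in harvest k, or failure otherwise.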

Stability in the frequentist context.

To compute the stability, we first fitted the following frequentist model (F-DG):

$$y = 1\mu + X_1 r + X_2 h + Z_1 g + Z_2 gh + Z_3 p + e$$

where y is the vector of phenotypic data; 1 is a vector of ones; μ is the intercept of the model; r is the vector of replication effects (assumed to be fixed); h is the vector of harvest effects (assumed to be fixed); g is the vector of genotypic effects (assumed to be random), g ~ N(0, σ²_g I); gh is the vector of the effects of the genotype-by-harvest interaction (assumed to be random), gh ~ N(0, σ²_gh I); p is the vector of permanent environmental effects (assumed to be random), p ~ N(0, σ²_p I); and e is the vector of residuals associated with the phenotypic observations (random), e ~ N(0, R), where R is a residual covariance matrix. X₁ and X₂ are the incidence matrices for the fixed effects, and Z₁, Z₂ and Z₃ are the incidence matrices for the respective random effects.

We estimated the variance components using residual maximum likelihood (REML) [20] and predicted the genetic values using the best linear unbiased predictor (BLUP) [21]. The significance of the random effects was tested via the likelihood ratio test (LRT) [22].

A concept analogous to stability, called “persistence”, is used by breeders of perennial forages and refers to the ability to survive and keep producing dry matter for long periods [23, 24]. In perennial fruit plants, this concept can be adapted as the ability to maintain a high fruit yield for several years [2]. Therefore, the persistence of a genotype (P_i) is analogous to ecological stability and is based on the distance between each genotype and the ideotype. The ideotype (g_max) was defined as the maximum genotypic value estimated in each harvest. We used the following expression to estimate the persistence [24]:

We compared the genotype rankings from the Bayesian probabilistic model and from the persistence obtained via the frequentist model using the Spearman correlation, following the expression below [25]:

$$\rho = 1 - \frac{6 \sum d^2}{n(n^2 - 1)}$$

where ρ is the Spearman correlation, d is the difference between the rank positions of the genotypes in each methodology, and n is the number of genotypes.
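As a worked illustration of this expression, the sketch below computes ρ directly from two lists of rank positions. The function name is hypothetical, and the formula in this form assumes that there are no tied ranks.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman correlation from two rankings of the same genotypes.

    rank_a[i] and rank_b[i] are the rank positions of genotype i
    under each methodology (no ties assumed).
    """
    n = len(rank_a)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))
```

Identical rankings give ρ = 1 and fully reversed rankings give ρ = -1; for instance, reversing four ranks yields Σd² = 20 and ρ = 1 - 120/60 = -1.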

We performed the analysis using R software environment, version 4.2.1 [26]. The Bayesian models were fitted using the probabilistic programming language Stan [27] from the rstan package [16], and the ProbBreed R package [17]. We fitted the linear mixed models using ASReml-R (version 4.1) [28].

Results

Probability of superior performance of genotypes

The Bayesian models (B-ID and B-DG) displayed a mean value of the R̂ statistic close to 1, indicating convergence of the model parameters (Table 2). Among these models, the B-DG model exhibited the best fit, as evidenced by its lower WAIC2 value (Table 2). The density of the data generated by the B-DG model follows the same trend as the density of the observed data (Fig 1A). This indicates that the model replicates the distribution of the observed data and supports its reliability in capturing the underlying patterns. Considering the findings from the B-DG model, the posterior distributions of the genotypic values of the 24 genotypes exhibited a variable overlapping pattern among their highest posterior density intervals (Fig 1B).

Fig 1. Bayesian distribution of the observed and generated data of the Tahiti acid lime dataset (A). Caterpillar plot of the posterior genotypic effects (and their 95% and 97.5% HPDs, represented by the thick and thin lines, respectively) of the 24 genotypes (B). Marginal probability of superior performance of the 24 genotypes (C). Probability of superior stability of the 24 genotypes (D).

https://doi.org/10.1371/journal.pone.0299290.g001

Table 2. Comparative statistics of Bayesian models.

https://doi.org/10.1371/journal.pone.0299290.t002

Genotypes G15, G4, G18 and G11 presented the highest posterior genotypic values (Fig 1B) and the highest probabilities of superior performance (Fig 1C). Genotypes G15, G4, G18, G11, G3, G23, G22, G1, G19, G14, G2, G13 and G16 were selected in at least a few samples, while the remaining 11 genotypes did not appear among the selected in any sample (Fig 1C). Genotypes G15 and G4 offer a low risk of poor performance if selected (probability of superior performance equal to or higher than 75%). Genotypes G24 and G7 had the highest probabilities of superior stability, meaning that they have the least variable performance across harvests (Fig 1D).

The pairwise comparison graph presents two symmetrical sides, which indicate the probability of success (lower diagonal) and failure (upper diagonal) of the genotypes on the x-axis being superior to those on the y-axis (Fig 2A). G15, for example, has a high probability of beating all genotypes. On the other hand, G9 is beaten by all its peers, except for G21, which wins two-thirds of the time. The greenish color indicates genotypes that tie in their performance (probability close to 50%), like G18 and G11, G2 and G14, and G10 and G24 (Fig 2A).

Fig 2. Pairwise probability of superior performance among genotypes (A). Probabilities of superior performance within harvests (B).

https://doi.org/10.1371/journal.pone.0299290.g002

The probability of a given genotype belonging to the selected group in each harvest varies for high-performance genotypes (Fig 2B). Genotypes G15, G4, G18, and G11 consistently displayed probabilities exceeding 50% across nearly all harvests. Conversely, genotypes G9 and G21 exhibited null probabilities throughout all harvests, implying that they are not recommended due to their consistently poor performance (Fig 2B). According to the probability of superior performance within the harvests, high production by a given genotype in one harvest did not guarantee the same level of performance in subsequent harvests (Supplementary material, S2 Fig).

Probability of genotypic stability

The probability of encountering a Tahiti acid lime genotype with minimal variation for stability was generally low. Only G24 exhibited values exceeding 0.3, indicating that it is the most stable genotype among those under evaluation (Fig 1D). The frequentist model (F-DG) demonstrated the significance of GHI according to the LRT. Also, the variance components exhibited similar magnitudes for both models (Table 3). The overshadowed the and , with values of 2.21 for the F-DG model and 2.44 for the B-DG. The residual variance displayed varying values across different harvests, ranging from 0.88 to 412.17 in the F-DG and from 0.63 to 384.17 in the B-DG (Table 3).

Table 3. Variance components estimates for a frequentist heterogeneous (diagonal) residual variance model (F-DG) and a Bayesian heterogeneous (diagonal) residual variance model (B-DG).

For the Bayesian model, the table includes the lower (L) and upper (U) bounds of the 95% highest posterior density (HPD) interval (α = 0.05).

https://doi.org/10.1371/journal.pone.0299290.t003

Persistence in the frequentist sense presented different results from the probability of superior stability of the Bayesian model. However, in both contexts, the values were low. Genotypic persistence in the frequentist context ranged from 3.4 to 5.7 (Fig 3). Except for G3, which presented the highest value, well above the others, the genotype values fell within a range of 1.1, showing the low ability of the F-DG model to distinguish the persistence of these genotypes. The Spearman correlation between the stability rankings in the Bayesian context (Fig 1D) and the persistence rankings in the frequentist context (Fig 3) was 0.69. This indicates a statistically significant correlation between the classifications provided by the two methods at a significance level of α = 0.05.

Fig 3. Persistence of 24 genotypes of Tahiti acid lime via the frequentist model with diagonal residual variance.

https://doi.org/10.1371/journal.pone.0299290.g003

Discussion

The consideration of the GHI is very important in the genetic evaluation of perennial species. This is because gene expression varies in response to environmental factors across different harvests [29]. The model selection based on WAIC2, coupled with the observed increase in residual variance over successive harvests, provides robust evidence in favor of the suitability of the heterogeneous residuals model for data fitting over the homoscedastic model. These findings underscore the critical importance of appropriate modeling and accounting for diverse sources of variation in repeated measures datasets.

Comparing genotypic values together with their posterior probabilities gives greater confidence in the analysis of the performance of the Tahiti acid lime genotypes. Bayesian probability measures allow breeders to assess the probability of a particular genotype surpassing others, including scenarios where a candidate genotype may outperform a widely adopted cultivar [11]. This probability-based approach aids decision-making, especially when the difference between the genotypes’ performances is small. These probabilities depend on the selection intensity, a value that is often predefined in breeding programs according to the stage [30]. Hence, probabilities are an intuitive metric and offer enhanced reliability for recommendations, since they provide information about the risks. Their application simplifies decision-making processes and opens avenues for application in various domains beyond plant breeding [11]. Defining a probability value as a threshold would be very useful for practical purposes, as it would make it easy to classify comparisons as significant or non-significant. However, due to the different selection intensities that can be employed and the diversity of selection candidates, the threshold depends on the context and dataset to which the analysis is applied.

Genotypes with the highest probability of superior performance, namely G15 (Iconha x Citrandarin riverside), G4 (Itarana x Citrumelo swingle), G18 (Bearss Lime x Citrandarin riverside), and G11 (BRS Passos x Citrumelo swingle), emerged as strong candidates for recommendation. These genotypes, as determined through probabilistic methods, possess alleles that impart adaptation to the changes in environmental conditions encountered through the harvests, maintaining consistently good performance. Indeed, the probability of superior performance is a measure of stability in an agronomic sense. The presence of GHI poses challenges for the selection of superior genotypes based on a single or a few harvests. Therefore, considering the genotype’s performance across multiple harvests is advisable for making well-informed decisions when selecting genetically superior candidates.

Initially employed in forage species to assess the maintenance of productivity levels through multiple cuts [24], the concept of persistence is also relevant in the context of perennial fruit crops [31], given the perennial behavior of both. We can draw a parallel between persistence and the probability of superior stability, as both represent ecological stability, i.e., invariance of performance. In the Bayesian framework, the probability of being selected among the four most stable genotypes was, in general, low (values below 0.4). This metric agreed with the frequentist persistence on 50% of the four most persistent genotypes: both approaches selected G24 and G3.

Certain advantages of the Bayesian model deserve attention. The incorporation of priors enhances confidence in selecting materials with varying levels of persistence [11, 32]. Furthermore, Bayesian models provide variance components with associated highest posterior density intervals. These credible intervals provide a more intuitive means of quantifying the uncertainty of the components. Also, from an asymptotic perspective, Bayesian credible intervals outperform frequentist confidence intervals [18], since frequentist confidence intervals may prove inaccurate for small or moderate sample sizes and may, in certain instances, fail to converge to the true parameter value as the sample size increases [33]. Like mixed models, Bayesian models work well in situations common in plant breeding, such as unbalanced data and heterogeneous residual variances [6, 34, 35].

Conclusion

By applying probabilistic Bayesian models to the genetic evaluation of Tahiti acid lime, we estimated the probability of superior performance of each genotype and the pairwise probabilities of superior performance between genotypes, both across and within harvests. Genotypes G15, G4, G18 and G11 were considered superior, and genotypes G24, G7, G13 and G3 were considered the most stable. Therefore, we believe that Bayesian probabilistic models can support more accurate recommendations in perennial fruit crops evaluated over many harvests, since they allow a more direct and precise interpretation of the performance and persistence of the candidate genotypes.

Supporting information

S1 Fig. Climatic data of precipitation (mm) and temperature (°C) from July 2017 to July 2020 in the field trial location.

https://doi.org/10.1371/journal.pone.0299290.s001

(TIF)

S2 Fig. Pairwise probability of superior performance among genotypes within harvest.

https://doi.org/10.1371/journal.pone.0299290.s002

(TIF)

Acknowledgments

We appreciate the Federal University of Espírito Santo, the Brazilian Agricultural Research Corporation, and the Bello Fruit Company for making possible the execution of these experiments. Also, we thank the Federal University of Viçosa, which provided infrastructure and human training for data analysis.

References

1. Gray SB, Brady SM. Plant developmental responses to climate change. Dev Biol. 2016;419: 64–77. pmid:27521050
2. Chaves SFS, Evangelista JSPC, Alves RS, Ferreira FM, Dias LAS, Alves RM, et al. Application of linear mixed models for multiple harvest/site trial analyses in perennial plant breeding. Tree Genet Genomes. 2022;18: 44.
3. Chaves SFS, Dias LAS, Alves RS, Alves RM, Evangelista JSPC, Dias KOG. Leveraging multi-harvest data for increasing genetic gains per unit of time for fruit yield and resistance to witches’ broom in Theobroma grandiflorum. Euphytica. 2022;218: 171.
4. Zhang S, Liang M, Wang N, Xu Q, Deng X, Chai L. Reproduction in woody perennial Citrus: an update on nucellar embryony and self-incompatibility. Plant Reprod. 2018;31: 43–57. pmid:29457194
5. Rai MK, Shekhawat NS. Recent advances in genetic engineering for improvement of fruit crops. Plant Cell, Tissue and Organ Culture (PCTOC). 2014;116: 1–15.
6. Malosetti M, Ribaut J-M, van Eeuwijk FA. The statistical analysis of multi-environment data: modeling genotype-by-environment interaction and its genetic basis. Front Physiol. 2013;4: 37433. pmid:23487515
7. García-Muñoz MC, Henao-Rojas JC, Moreno-Rodríguez JM, Botina-Azain BL, Romero-Barrera Y. Effect of rootstock and environmental factors on fruit quality of Persian lime (Citrus latifolia Tanaka) grown in tropical regions. Journal of Food Composition and Analysis. 2021;103: 104081.
8. Hassanzadeh Khankahdani H, Rastegar S, Golein B, Golmohammadi M, Aboutalebi Jahromi A. Effect of rootstock on vegetative growth and mineral elements in scion of different Persian lime (Citrus latifolia Tanaka) genotypes. Sci Hortic. 2019;246: 136–145.
9. Malikouski RG, Peixoto MA, de Morais AL, Elizeu AM, Zucoloto M, Bhering LL. Repeatability coefficient estimates and optimum number of harvests in graft/rootstock combinations for ‘Tahiti’ acid lime. Acta Sci Agron. 2021;43: e51740.
10. Malikouski RG, Alves RS, Peixoto MA, Ferreira FM, do Nascimento EF, de Morais AL, et al. Selection index based on random regression model in ‘Tahiti’ acid lime. Euphytica. 2022;218: 153.
11. Dias KOG, dos Santos JPR, Krause MD, Piepho H-P, Guimarães LJM, Pastina MM, et al. Leveraging probability concepts for cultivar recommendation in multi-environment trials. Theoretical and Applied Genetics. 2022;135: 1385–1399. pmid:35192008
12. Eskridge KM, Byrne PF, Crossa J. Selection of stable varieties by minimizing the probability of disaster. Field Crops Res. 1991;27: 169–181.
13. Henry RJ. Innovations in plant genetics adapting agriculture to climate change. Curr Opin Plant Biol. 2020;56: 168–173. pmid:31836470
14. Santos MG, Passos OS, Soares Filho WS, Girardi EA, Gesteira AS, Ferreira CF. Variability analysis of “Persian” acid lime tree selections using agronomic and molecular markers. Gen and Mol Res. 2013;12: 4604–4614. pmid:24222236
15. Alvares CA, Stape JL, Sentelhas PC, Gonçalves JL de M. Modeling monthly mean air temperature for Brazil. Theor Appl Climat. 2013;113: 407–427.
16. Stan Development Team. RStan: the R interface to Stan. R package version 2.17.3. 2018.
17. Chaves SF, Krause MD, Dias LA, Garcia AA, Dias KO. ProbBreed: A novel tool for calculating the risk of cultivar recommendation in multi-environment trials. G3 (Bethesda). 2024 Jan 18. pii: jkae013. pmid:38243647
18. Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian Data Analysis, Chapman & Hall/CRC Texts in Stat Scien. Vol. 3. 2013.
19. Gelman A. Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper). Bayesian Anal. 2006;1.
20. Patterson HD, Thompson R. Recovery of inter-block information when block sizes are unequal. Biometrika. 1971;58: 545–554.
21. Henderson CR. Best Linear Unbiased Estimation and Prediction under a Selection Model. Biometrics. 1975;31: 423. pmid:1174616
22. Wilks SS. The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses. The Annals of Math Stat. 1938;9: 60–62.
23. Parsons AJ, Edwards GR, Newton PCD, Chapman DF, Caradus JR, Rasmussen S, et al. Past lessons and future prospects: plant breeding for yield and persistence in cool-temperate pastures. Grass and For Sci. 2011;66: 153–172.
24. Rocha JRASC, Machado JC, Carneiro PCS. Multitrait index based on factor analysis and ideotype-design: proposal and application on elephant grass breeding for bioenergy. GCB Bioenergy. 2018;10: 52–60.
25. Spearman C. The Proof and Measurement of Association Between Two Things. Studies in individual differences: The search for intelligence. East Norwalk: App Cent Crofts. 1961;1:45–58.
26. R Development Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2020.
27. Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, et al. Stan: A Probabilistic Programming Language. J Stat Softw. 2017;76. pmid:36568334
28. Butler DG, Cullis BR, Gilmour AR, Gogel BJ, Thompson R. ASReml-R reference manual version 4. VSN International Ltd, Hemel Hempstead, HP1 1ES, UK. 2017.
29. Ferreira FM, de Carvalho Rocha JR do AS, Alves RS, Elizeu AM, Benites FRG, de Resende MDV, et al. Estimates of repeatability coefficients and optimum number of measures for genetic selection of Cynodon spp. Euphytica. 2020;216: 70.
30. Silva ÉDB da, Xavier A, Faria MV. Impact of Genomic Prediction Model, Selection Intensity, and Breeding Strategy on the Long-Term Genetic Gain and Genetic Erosion in Soybean Breeding. Front Genet. 2021;12. pmid:34539725
31. Evangelista JSPC, Peixoto MA, Coelho IF, Ferreira FM, de Souza Marçal T, Alves RS, et al. Modeling covariance structures and optimizing Jatropha curcas breeding. Tree Genet Geno. 2023;19: 21.
32. Silva JO, Bruckner CH, Carneiro PC, de Resende MD, Alves RS, Matias RG. Estimates of genetic parameters and repeatability coefficients in S2 peach genotypes grown in a subtropical environment. EMBRAPA, 2020.
33. Kruschke J. Doing Bayesian data analysis: A tutorial with R, JAGS, and Stan. 2014.
34. Smith AB, Cullis BR, Thompson R. The analysis of crop cultivar breeding and evaluation trials: an overview of current mixed model approaches. J Agric Sci. 2005;143: 449–462.
35. Crossa J. From genotype x environment interaction to gene x environment interaction. Curr Genomics. 2012;13: 225–244.

