Figures
Abstract
Maize (Zea mays L.) productivity in Sub-Saharan Africa is increasingly constrained by recurrent drought linked to climate change. Improving yield stability under contrasting moisture conditions remains challenging, especially when breeding materials are derived from parental lines within the same heterotic group (e.g., Group A × Group A), where genetic divergence is limited. We hypothesized that doubled haploid (DH) lines derived from biparental populations still harbor sufficient within-population genetic variation to generate exploitable phenotypic diversity and adaptive differentiation. Furthermore, crossing each DH line with a single-cross tester from the opposite heterotic group provides an effective framework to capture this variation through hybrid performance. In the present work therefore, 855 DH testcross hybrids and six commercial checks were evaluated under optimal and managed drought conditions using an alpha-lattice design. Drought stress was imposed two weeks before flowering until harvest. The mean grain yield under optimal conditions were between 3.66 to 10.36 t ha-1, while 0.16 to 6.13 t ha-1 under drought condition. The mean of top 10 hybrids (≈1.2% selection intensity) outperformed the mean of commercial checks by 60% under optimal conditions and 161% under drought stress. Under optimum conditions, 629 and 286 hybrids exhibited higher heterosis over the mean of checks and the best check, respectively. Similarly, 536 and 235 hybrids surpassed the mean of checks and the best check, respectively, in terms of standard heterosis. The mean and best check yields under drought stress were 1.97 and 3.10 while, 5.91 and 7.09 t ha-1 under optimum condition, respectively. The multi-trait clustering grouped the hybrids into four distinct adaptive categories. Cluster 3, defined primarily by grain yield, integrated temperate introgression with elite tropical backgrounds consistently expressed superior yield performance along with desirable secondary traits. These results demonstrate that DH lines derived from within-group parental crosses can generate functional diversity and predictable adaptive clusters. Thus supports a cluster-based selection strategy to improve drought tolerance, yield potential, and adaptation in maize breeding programs.
Citation: Mekasha GM, Yigezu ZD, Teklewold A, Gowda M, Burgueño J, Beyene Y (2026) Performance of doubled haploid maize (Zea mays L.) testcross hybrids under optimal and drought-stressed environments. PLoS One 21(6): e0350861. https://doi.org/10.1371/journal.pone.0350861
Editor: C. Anilkumar, ICAR - National Rice Research Institute, INDIA
Received: March 10, 2026; Accepted: May 18, 2026; Published: June 3, 2026
Copyright: © 2026 Mekasha et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The relevant data are within the manuscript.
Funding: Bill and Melinda Gates Foundation (B&MGF), and the United States Agency for International Development (USAID) through the Stress Tolerant Maize for Africa (STMA, B & MGF Grant # OPP1134248).
Competing interests: No.
1 Introduction
Drought is a major constraint to maize production and a significant threat to global food security [1,2]. In Sub-Saharan Africa, the challenge is increasingly driven by the rising frequency of mid-season drought episodes, particularly around flowering, which are often simulated as managed drought in breeding programs to mimic field conditions and identify tolerant genotypes [3,4]. As global temperatures rise, drought intensity and variability are expected to increase, further exacerbating yield instability [1]. At the same time, the global population is projected to reach nearly 10 billion by 2050, requiring a 60–70% increase in food production [5,6].
Maize (Zea mays L.) plays a central role in global agriculture as a source of food, feed, and industrial raw materials [7–10]. In eastern and southern Africa, it is the dominant staple crop for rural households and a cornerstone of food security [11]. However, maize productivity in Africa remains low (<2.1 t ha-1) compared to the global average (~5.8 t ha-1) [8,12], largely due to climate-induced stresses. Among these, drought is the most critical, causing yield losses ranging from 23% to 70%, particularly when it occurs near flowering [13,14]. In the face of climate change, rapidly developing maize germplasm with resistance to key abiotic and biotic stresses is crucial for the resilience of Africa’s maize-based cropping systems [15]. The International Maize and Wheat Improvement Center (CIMMYT), in collaboration with various institutions, has successfully developed and deployed stress-tolerant maize cultivars in sub-Saharan Africa, Asia, and Latin America through extensive stress screening and on-farm testing. To further enhance genetic gains in tropical maize breeding, integrating doubled haploidy, high-throughput phenotyping, and genomic-assisted breeding with effective data management is essential [16].
Plant breeding remains a key strategy for improving drought tolerance and yield stability. Recent advances such as DH technology enable the rapid development of completely homozygous lines within 1–2 generations, compared to 6–8 generations required in conventional inbreeding, thereby accelerating genetic gain and varietal development [17,18]. This is particularly important for responding to climate variability and shortening breeding cycles. Genetic diversity is fundamental for effective crop improvement [19]. Understanding diversity within breeding materials supports the identification of superior parental lines and the formation of heterotic groups, which are essential for hybrid development [20–22]. However, most hybrid breeding strategies rely on crosses between distinct heterotic groups (e.g., Group A × Group B) to exploit heterosis, while crosses within the same heterotic group are generally considered to have limited genetic divergence and, therefore, lower potential for hybrid vigor [23,24].
Despite this assumption, the extent to which populations derived from within-group crosses can generate useful phenotypic diversity, particularly under drought conditions, remains insufficiently explored [24,25]. This represents a critical knowledge gap in maize breeding, especially when breeding programs are constrained by limited heterotic pools. We therefore hypothesized that DH lines derived from within-heterotic group crosses (e.g., Group A × Group A) maintain sufficient genetic variation to generate exploitable phenotypic diversity and adaptive differentiation under contrasting moisture conditions, and that this variation can be effectively revealed through testcrossing with a single-cross tester from the opposite heterotic group. Accordingly, the present study aimed to evaluate the phenotypic performance of DH maize testcross hybrids under optimal and managed drought conditions, with the objective of identifying superior hybrids and assessing their adaptive patterns across environments.
2 Materials and methods
The maize (Zea mays L.) DH lines were developed from 16 single-cross and three bulk populations formed using marker-assisted recurrent selection techniques. Details of the base population characteristics and the genetic sources used to develop the 834 DH lines are presented in Table 1. These lines were generated through in vivo haploid induction following the standard CIMMYT DH induction procedure [17]. Each of the 855 DH lines was crossed with a single cross-tester from the opposite heterotic group, yielding 855 hybrids. This work did not require specific permission. One of the mandates of the International Maize and Wheat Improvement Center (CIMMYT) is to develop new lines and hybrids and evaluate them across different biotic and abiotic stresses to identify the best-adapted hybrids for commercialization across various parts of sub-Saharan Africa.
2.1 Test crosses performance evaluation
The 855 newly developed hybrids were organized into 17 trial sets connected through a common check strategy according to Lado et al. [26], enabling valid comparisons across sets. Each trial included four to six shared commercial check hybrids, which served as references to account for environmental and design differences among sets. In total, 861 hybrids (including checks) were evaluated using an alpha-lattice design with two replications. Field evaluations were conducted in Kenya during the 2019 growing season under two well-watered environments and one managed drought stress environment, allowing all hybrids to be assessed on a common performance scale across trials. Drought stress was imposed by withholding irrigation approximately two weeks before flowering and maintaining water deficit conditions until harvest, allowing progressive soil moisture depletion. In contrast, well-watered trials received supplemental irrigation whenever moisture stress was observed, ensuring near-optimal soil moisture throughout the growing period. Although the exact amount of water applied was not quantified in millimeters, the drought treatment was characterized. The experiments for optimal management were conducted at two locations in Kenya: Kakamega and Kiboko, while moisture stress experiments were performed solely at Kiboko. Kakamega is situated at 34°46’E and 0°17’N, with an elevation of 1,585 meters above sea level (masl). On the other hand, Kiboko is located at 37°44’E and 2°13’S, with an elevation of 975 masl.
Each plot consisted of a single row measuring 5 m in length, with a spacing of 0.75 m between rows and 0.75 m between plants. Initially, two seeds per hill were planted, and after three weeks, plants were thinned to one per hill, achieving a density of 53,333 plants per hectare. Fertilizer was applied at 60 kg of nitrogen and 60 kg of phosphorus per hectare, with nitrogen split into applications at planting and six weeks later. Weeds were controlled by hand weeding.
Plant aspect (PAS) and ear aspect (EAS), which are breeder-assessed traits, were evaluated using a visual scoring system. Both traits were rated on a 1–5 scale, where 1 indicates excellent and 5 indicates poor performance. Plant aspect (PAS) was assessed based on overall plant appearance, including vigor, uniformity, plant type, and absence of disease or lodging. Ear aspect (EAS) was evaluated considering ear characteristics such as size, uniformity, filling, and freedom from pest or disease damage. Scoring was conducted at physiological maturity across all plots to ensure consistency in evaluation for PAS and during harvesting time for EAS. The data recorded for each plot included days to 50% silking and anthesis, anthesis-silking interval (days), plant and ear heights measured from the base to the first tassel branch and from the base to the node bearing upper ear, respectively, root and stalk lodging percentages (plants leaning more than 30 degrees from vertical, and plants broken at or below the highest ear node, respectively) were also assessed. In optimum and stressed conditions, grain moisture was measured after harvesting and shelling the ears. For well-watered conditions, ear weight was used to estimate grain yield, adjusted to 12.5% moisture content.
2.2 Phenotypic data analysis
Before data analysis, the anthesis-silking interval (ASI) was normalized using ln as suggested by Bolafios and Edmeades [27]. The analysis of variance for all traits was done separately for each location and combined across locations using DeltaGen online software applying mixed model procedures [28]. Trait BLUPs were computed using DeltaGen online software (https://www.deltagen.agr.nz) [28]. Genotypes, replication, and blocks within replications were considered random effects for individual location analysis. In combined analysis, location was considered as fixed effects whereas genotypes, replication within location, blocks within replication and locations, and genotypes by location interaction were considered random effects. Under both individual and combined analysis, the variance was partitioned into relevant sources of variation to test for the difference among genotypes and genotypes by environment interaction. The following model was used for individual locations.
where yjkb represents the phenotypic trait analysis on the kth replicate of the jth test cross in the bth incomplete block, μ is an intercept (Grand Mean), Lj is the random effect of the jth maize test cross, rk is the random effect of the kth replicate that is independently, identically, and normally distributed (iid), ib(r)b denotes the random effect of the bth incomplete block within the kth replicate σ2 𝑖𝑏(𝑟) being the variance of the incomplete block within each replicate, and ejkb is error variance. For multi-location data, the following GLM-mixed model will be used:
where Yijrk is the mean value of genotype i at location j in replicate r within the block k; µ is the general mean; Lj is the fixed effect of the location j; Rr (Lj) is the random effect of the replicate r within location j; Bk [Rr (Lj)] is the random effect of the incomplete block k within replicate r and location j; Gi is the random effect of genotype i; GLij is the random and effect of the genotype × location interaction; and εijrk is the random residual error assumed independent and identically normally distributed with mean zero and variance σ2ε. Standard heterosis (SH) of new genotypes over the best check and mean of the checks (SH) was calculated as SH = 100 × (F1–best check)/best check, and SH = 100 × (F1– mean of check)/mean of checks, respectively. Genotypes exceeding the respective critical difference (CD) values were considered significantly superior to the checks.
Narrow-sense heritability (h2) was estimated as the ratio of additive genetic variance to total phenotypic variance (h2 = V_A/V_P), representing the proportion of phenotypic variation attributable to additive effects. In this study, additive genetic variance was approximated as one-quarter of the among-family variance (¼σ²_A), following standard half-sib family assumptions [28]. This measure excludes non-additive components such as dominance and epistasis. Genetic variance was estimated from 855 new doubled haploid (DH) crosses evaluated as half-sib families alongside six standard checks. Heritability estimates for combined analyses were based on entry means across environments and replications, while estimates for individual locations were based on entry means across replications. The contribution of each source of variation to total variability was computed using R software (Version 4.4.1) [29].
2.3 Cluster analysis
For clustering analysis, both single-trait and multivariate approaches were employed. Single-trait clustering, particularly for grain yield, used mean BLUEs (Best Linear Unbiased Estimates) from each location as input in the DeltaGen online tool [28], enabling identification of genotypes with consistent performance across environments. In contrast, multivariate analysis incorporated combined data across locations and management conditions (optimum and drought), allowing simultaneous evaluation of multiple traits and broader phenotypic variation. K-means clustering was then applied in R to group genotypes based on overall similarity, minimizing within-cluster variation through iterative reassignment of cluster centroids. This approach produced clusters of genotypes with similar performance patterns, facilitating selection and diversity assessment across environments.
2.4 Genotype ranking and population mean
Mean grain yield (GY; t ha-1) and percentage yield advantage relative to the mean of the check varieties were calculated for the top 20 genotypes from each breeding population evaluated across three environments (Kakamega_OPT, Kiboko_OPT, and Kiboko_DT). Genotypes were ranked based on mean best linear unbiased estimates (BLUEs) across environments. Population means were calculated as the arithmetic mean of the mean BLUEs of the top 20 genotypes within each population, except for Population 17, Population 18, and the check group, for which the mean was calculated using all contributing genotypes. Accordingly, pop 17 and pop 18 were represented by two and four genotypes, respectively, while the check group included six commercial varieties.
3 Results
3.1 Additive genetic variance component and narrow-sense heritability
The variance components, narrow-sense heritability estimates, and their corresponding standard errors (in brackets) are presented in Table 2. Under optimal management at Kiboko, narrow-sense heritability for grain yield (GY) was moderate (0.33). In contrast, anthesis date (AD) showed a higher heritability (0.54), while plant height (PH) and ear height (EH) each had heritability estimates of 0.59. The result showed that heritability estimates for phenological and plant architectural traits (AD, PH, and EH) were consistently higher than those for GY across environments, indicating greater genetic control and stability for these traits. Under moisture stress management at Kiboko, narrow-sense heritability for GY was 0.46. The heritability of AD was 0.65, whereas PH and EH each exhibited heritability estimates of 0.63.
The coefficient of variation (CV) for grain yield was highest under the Kiboko moisture-stressed condition (37.73%) compared to the optimal environments, indicating substantially greater relative variability under drought stress (Table 2). This elevated CV is expected under high-intensity drought conditions (just before two weeks of flowering and onward), particularly in drought-prone environments such as Kiboko, where water limitation amplifies differences among genotypes. Under stress, small differences in drought tolerance mechanisms such as root architecture, stomatal regulation, water-use efficiency, problem of synchrony of male and female flowering which enforce limited fertility can lead to large differences in grain yield, thereby increasing variability relative to the mean. Furthermore, drought stress typically reduces the overall mean yield (2.38 t ha-1 in this case), which inherently inflates the CV since it is expressed as a percentage of the mean. The combination of reduced mean performance and heterogeneous genotype responses to stress contributes to the observed high CV.
The present work also revealed the presence of substantial variation in genotypes across various conditions and locations. Among genotypes, significant variance was observed for traits such as GY, AD, ear position height (EPH), ear aspect score (EAS), PH, EH, silking date (SD), and anthesis-silking interval (ASI) at a high significance level (α = 0.01). Differences for ear per plant (EPP) and plant aspect score (PAS) were significant at α = 0.05, while no significant variance was noted for grain moisture content (MOI). The interaction between genotype and location was highly significant for most traits, including GY, AD, EPH, EAS, PAS, MOI, PH, EH, and SD, highlighting the varying performance of genotypes across different locations. However, this interaction was not significant for EPP and ASI.
Location-specific results further underscore the variability. At Kakamega, Kiboko, and Kiboko_DT, the genotypic variance was highly significant for traits such as GY, AD, EPH, EAS, MOI, PH, EH, EPP and SD. Variance in PAS was significant at Kakamega and Kiboko under optimal conditions.
In combined analysis, blocking effects within replication and location were notably significant for GY, MOI, PH, EH, and SD, indicating the importance of experimental design with inclusion of block. Under optimal conditions, blocking effects were significant for GY, MOI, PH, and EH at Kakamega and Kiboko, and for AD at Kiboko. The blocking effect was also significant for EAS and PAS at Kakamega. Furthermore, the blocking effects under moisture stressed condition, at Kiboko, were significant for GY, AD, EPP, MOI, PH, and EH. These results confirmed the importance of considering both genotype and environmental conditions in breeding programs to optimize yield performance across diverse settings.
3.2 Variance based on sum square contribution
For traits such as GY, PH, and EH, the percent sum square contribution is notably higher under water-stressed conditions by 49, 62, and 44%, respectively, indicating that genotype variance for these traits was more pronounced when plants were subjected to water stress (Table 3). Similarly, for the ASI and other phenological parameters, the percent sum square contribution was significantly higher under water stress, ranging from 55 to 93%, further emphasizing the impact of stress on genotype variance for these traits (Table 3). Even though the variance for genotype × location interaction was significant for most traits, the proportion of total variability explained by this interaction relative to that of genotype was low to moderate across all traits. This finding aligns with the report by Govindaraj et al. [30] who showed that genotype × environment interactions are significant and their contribution to the total variability is often less than the variability attributed to genotype alone.
Conversely, the residual variance showed different patterns. For GY and ASI, residual variance was higher under well-watered conditions by 22 and 38%, respectively, when compared with water-stressed conditions (Table 3). This suggests that while genotype variance was more pronounced under stress, the residual variance, or unexplained variance, was higher in well-watered environments. For PH and EH, the sum square contribution was greater under water stress, by 43 and 8%, respectively, as compared to well-watered conditions (Table 3). Notably, the residual variance for AD and SD was significantly higher under water-stressed conditions by 84 and 87%, respectively, as compared to well-watered conditions, indicating that the unexplained variability for these phenological traits were increased under stressed condition (Table 3). The findings showed that while water stress amplifies the genetic differences for various traits, it also increases the unexplained variance for certain parameters.
3.3 Means and ranges under well-watered and drought-stressed conditions
Under moisture-stressed conditions at Kiboko, the mean grain yield was 2.38 t/ha with a heritability estimate of 0.46. The moderate heritability indicates substantial genetic variability among genotypes, suggesting good prospects for selection under drought stress. Although experimental variability was relatively high, the presence of heritable variation supports effective genetic improvement in stress environments (Table 4). Grain yield under moisture-stressed condition was drastically lower by 179.65%, than in non-stressed environments. Water stress also led to reduction in plant height by 15.64% and ear height (EH) by 7.88%, while increasing the anthesis-silking interval (ASI) by 86.14% (Table 5). However, moisture content did not significantly differ between the two conditions (Table 5).
In well-watered environments, GY of the test crosses ranged from 3.66 to 10.36 t/ha, with a mean of 6.63 t/ha. The top ten test crosses exceeded the best check, PH30G19, by 2.10–3.27 t/ha (Table 6). Overall, the mean yield of all test crosses was 12% higher than the checks, and the top ten DH crosses showed a 60% yield advantage. Anthesis date (AD) ranged from 59.48 to 71.70 (mean 65.75 days), while silking days ranged from 59.89 to 72.11 days, with a short mean ASI of 0.4 days. Plant aspect (PAS) (1.74–4.01; mean 2.72) and ear aspect (1.74–3.75; mean 2.88) indicated generally good plant and ear quality. Plant height (PH) ranged from 209.45 to 283.74 cm (mean 247.31 cm), and ear height from 96.16 to 155.90 cm (mean 124.94 cm). Ears per plant (EPP) varied from 0.65 to 1.48, with a mean of 1.0, indicating limited prolificacy under well-watered conditions (Table 6).
Under water stress, the grain yields of test crosses were ranged from 0.16 to 6.13 t/ha (Table 7). The top ten test crosses yield were 1.57 and 3.03 t/ha which was more than the best check (DK777) (Table 7). On average, the new DH test crosses exceeded the mean grain yield of the checks by 21%, and the top ten performing DH test crosses showed a remarkable 161% yield increase over the mean of the checks (Table 7). The performance metrics for the tested genotypes revealed notable variations. The mean for AD ranged from 60.49 to 75.10, with an overall average of 67.48 days, while the SD varied from 62.45 to 80.95 days. The ASI showed a range from −1.44 to 13.97 days, resulting in an overall mean of 2.89 days. For EAS, the overall mean was 3.01, with a range of 1.0 to 5.0. The mean PH ranged from 163.58 to 265.20 cm, averaging 213.93 cm. EH had a mean value of 115.79 cm, with a range from 75.15 to 153.94 cm. These results indicate that the genotypes exhibit significant variation in performance under water stress conditions, particularly in terms of growth metrics.
3.4 Standard heterosis for grain yield
Under optimum condition significant variation in standard heterosis for grain yield was observed among genotypes when evaluated over the best check (PH30G19) and the mean of checks. Several genotypes expressed positive and statistically significant heterosis, indicating superior yield performance relative to the checks. Genotype G135 recorded the highest heterosis over the best check (46.18%) and over the mean of checks (75.48%) (Table 8). Other genotypes including G849, G596, G113, G583, G585, and G123 also exhibited significant positive heterosis over both standards heterosis. Heterosis values ranged from −48.32% to 46.18% over the best check and from −37.96% to 75.48% over the mean of checks, reflecting wide genetic variability for grain yield.
Under drought conditions, standard heterosis for GY over the best check (DK777) and the mean of checks varied widely among genotypes and showed high levels of statistical significance. Several genotypes demonstrated pronounced positive heterosis, indicating strong yield superiority under moisture stress. Genotype G576 exhibited the highest heterosis over the best check (97.71%) and over the mean of checks (210.97%), both highly significant. Genotypes G500, G792, G707, G643, and G533 also showed substantial and significant positive heterosis over both standards. The heterosis estimates ranged from −105.21% to 97.71% over the best check and from −108.20% to 210.97% over the mean of checks (Table 8), indicating considerable genetic variation among genotypes for GY expression. Several genotypes surpassed the critical difference thresholds, confirming their statistical superiority.
3.5 Grouping of genotypes using GY BLUEs estimate
Cluster and pattern analyses based on grain yield classified the 861 genotypes into three distinct clusters: C1 (239 genotypes), C2 (294 genotypes), and C3 (328 genotypes) (Fig 1). The dendrograms revealed clear relationships among genotypes and test environments based on yield performance. The two optimal environments, Kiboko_OPT and Kakamega_OPT, clustered closely together, indicating similar genotype responses. In contrast, Kiboko_DT was clearly separated, highlighting the strong impact of imposed drought stress and the resulting phenotypic variation among genotypes. The heatmap further showed that genotypes with good performance under Kiboko_DT were mainly grouped in Clusters 2 and 3, while high-performing genotypes at Kiboko_OPT were also concentrated in Clusters 2 and 3. Superior performance at Kakamega was predominantly associated with Cluster 2.
Rows represent individual genotypes, while columns represent the three environments (Kiboko_DT, Kiboko_OPT, and Kakamega). The dendrograms illustrate similarity patterns among genotypes and locations derived from standardized grain yield values. Color intensity reflects relative grain yield performance, where red indicates higher-than-average yield, blue indicates lower-than-average yield, and white represents values close to the overall mean, as shown in the color key. This heatmap highlights genotype-by-environment differentiation and reveals contrasting yield responses of genotypes across drought-stressed, optimal, and high-rainfall environments.
Clustering based solely on grain yield revealed clear and consistent differences in mean performance across environments. These differences closely matched the population composition within each cluster (Table 9). The yield-based clusters showed distinct response patterns across Kiboko_DT, Kiboko_OPT, and Kakamega_OPT, indicating a strong association between cluster mean performance and population background. Differences in slope among clusters in Fig 2 further demonstrate contrasting responses to environmental and management conditions, highlighting variation in adaptability and yield stability.
The plot shows cluster mean yields under drought stress at Kiboko (Kiboko_DT), optimal conditions at Kiboko (Kiboko_OPT), and optimal conditions at Kakamega. Differences in slope among clusters indicate contrasting yield responses, stability, and levels of environmental adaptation under stress and favorable conditions.
Genotyping in Cluster 1 are dominantly drought tolerant and broadly adapted populations. Major contributors included pop4 (16.7%), pop1 (14.2%), pop2 (13.8%), pop5 (10.9) and pop12 (10.0%), together accounting for more than 40% of the cluster, along with 23.4% of genotypes of unknown background (Table 9). This composition explains the relatively stable but generally low mean yield across environments (Fig 1) based on the heatmap. It also explains the comparatively better performance under Kiboko_DT and the limited yield response under favorable or Kakamega conditions (Fig 2).
Cluster 2 contained lines derived from temperate introgression populations. The main contributors pop13 (15.3%), pop16 (13.9%), pop3 (13.6%), and pop11 (9.9%) represented nearly half of the cluster (Table 9). These elite backgrounds explain the strong yield increase from drought to optimal conditions and the highest mean yield at Kakamega. The small contribution of the drought-tolerant populations suggests a limited representation or expression of drought-tolerance traits in their genetic background, as evidenced by their low yield performance under Kiboko_DT environment.
Cluster 3 was the largest, with 328 genotypes, and included populations combining temperate introgression, stress tolerance, and Molecular marker-based population improvement of elite lies. Pop14 (12.5%), pop8 (11.3%), pop7 (10.1%), pop15 (9.1%), and pop19 (10.1%) together contributed about 50% of the cluster, with additional representation from recurrent cycle genomic selection (RCGS) derived populations. This balanced composition explains the consistently strong performance across all environments. It also explains the improved drought tolerance relative to Cluster 2 and the modest yield reduction under optimal conditions.
Pairwise comparisons of the top 15% (128) selected genotypes across environments, based on BLUE mean values (Table 10), revealed low agreement among locations, confirming strong genotype × environment interaction. The highest overlap occurred between Kakamega_OPT and Kiboko_OPT, with 27 common genotypes (21.1%). Kakamega_OPT and Kiboko_DT shared 15 genotypes (11.7%). Only seven genotypes (5.5%) were common across all environments further emphasizes the strong genotype × environment interaction and, suggests limited stability of genotypic performance across diverse testing conditions.
Overall, yield-based clustering clearly differentiated genotypes by environmental adaptation. Cluster 1 showed low yield stability with better performance under drought. Cluster 2 displayed specific adaptation to favorable conditions at Kakamega. Cluster 3 showed strong and consistent performance across both optimal and drought environments at Kiboko. The close alignment among population composition, cluster mean yield, and selection overlap confirms that population formation strategy strongly influences genotype response, highlighting the complementary value of these clusters for breeding stress-tolerant, high-yielding, and broadly adapted maize.
3.6 Cluster-wise summary of top-performing genotypes
Cluster 1 was ranked using the mean yield across the three locations. Cluster 2 was ranked based on performance at Kakamega. Cluster 3 was ranked based on performance under drought stress at Kiboko_DT. This summary is based on the top five genotypes selected from each cluster according to yield performance across environments (Table 11).
In Cluster 2, evaluation under Kakamega conditions identified G135, G233, G849, G780, and G613 as the top-performing genotypes. These genotypes showed high grain yield and mainly originated from high-yielding, temperate-introgressed, and drought-tolerant populations. This indicates good adaptation to high-potential environments.
In Cluster 3, selection under Kiboko_DT conditions identified G576, G500, G707, G643, and G533 as the best-performing genotypes. These genotypes were largely derived from temperate-introgressed and elite-line backgrounds. Their performance indicates improved drought adaptation and yield stability under stress-prone conditions.
Overall, the top five genotypes from each cluster showed clear and distinct adaptation patterns. Cluster 2 was better suited to Kakamega environments. Cluster 3 showed stronger adaptation to Kiboko conditions. These genotypes provide valuable parental material for targeted breeding strategies.
3.7 Population-wise comparison based on top performing genotypes
The plot shows clear differences in grain yield and relative yield advantage among breeding populations under optimal and drought-stressed environments. Several populations derived from temperate introgressions and drought-tolerant × high-yielding crosses consistently outperformed the commercial checks. In some cases, yield advantages exceeded 30–53% (Fig 3). This indicates substantial genetic gain from targeted parental combinations. In contrast, populations with lower mean grain yield and little yield advantage show limited adaptation and narrower genetic potential across environments.
3.8 Multivariate consideration for clustering genotypes across environments
K-means clustering based on multi-trait data was evaluated using the elbow method (Fig 4). The within-cluster sum of squares declined sharply as the number of clusters increased from one to three, indicating a strong improvement in cluster compactness. Beyond three to four clusters, the reduction in within-cluster variation became small and gradual. The curve flattened after four clusters, showing diminishing returns from adding more clusters. The clear elbow at K = 4 suggests an optimal balance between explained variability and model simplicity. Increasing cluster numbers beyond this point did not meaningfully improve within-cluster homogeneity. Therefore, the data are best represented by four distinct clusters, which were used for downstream interpretation and biological inference.
The figure shows the relationship between the number of clusters (K) and the within-cluster sum of squares. A steep reduction in within-cluster variation is observed as K increases from 1 to 3, indicating improved cluster compactness. The curve begins to level off at K = 4, forming a clear elbow point, beyond which additional clusters result in only marginal reductions in within-cluster sum of squares. This pattern indicates that four clusters adequately capture the underlying structure of the data and were therefore selected for subsequent clustering and interpretation.
Multivariate diversity analysis was performed using combined data across locations and management conditions (optimum and drought). Cluster and pattern analyses grouped the 861 maize genotypes into four distinct clusters: C1 (267 genotypes), C2 (125), C3 (245), and C4 (224) (Fig 5). The dendrogram revealed strong associations among key traits. Grain yield (GY) was closely associated with ears per plant (EPP). Silking date (SD) showed a strong relationship with AD, indicating shared genetic control and coordinated trait expression (Fig 5).
Rows correspond to individual genotypes, while columns represent measured traits (where PAS = Plant Aspect, EAS = Ear Aspect, ASI = Anthesis Silking Interval, SD Silking Date, AD Anthesis Date, MOI = Grain Moisture Content, GY = Grain Yield, EPP = Ear Per Plant, PH = Plant Height, EH = Ear Height, EPH = Ear Position Height, SL = Stalk Lodging and BHC = Bad Husk Cover, TLB = Turcicum Leaf Blight). The dendrograms illustrate similarity relationships among genotypes and traits derived from standardized phenotypic values across environments. Color intensity reflects relative trait expression, where red indicates above-average values, blue indicates below-average values, and white represents values close to the overall mean, as shown in the color key. This heatmap reveals trait associations, genotype differentiation, and contrasting multi-trait performance patterns, supporting the identification of favorable trait combinations for selection.
3.9 Integration of PCA clustering and multi-trait heatmap
The combined interpretation of the PCA clustering table (Table 12) and the multi-trait heatmap (Fig 5) provides a clear understanding of how source population characteristics shaped the genetic structure of the germplasm. Table 12 quantified the number and percentage of genotypes contributed by each population to the four clusters. The heatmap visually displayed the trait profiles underlying this classification. Together, these results provide numerical and graphical validation of the clustering pattern.
The PCA separated genotypes mainly along PC1 and PC2. PC1 was driven by yield and plant architecture traits. PC2 captured phenology and stress-related traits. This structure was reflected in the heatmap, where hierarchical clustering divided genotypes into four major blocks with contrasting color patterns. These blocks closely corresponded to Clusters C1- C4, confirming that genotypes with similar breeding histories showed similar multivariate trait performance.
Cluster 1 contained 267 genotypes and was dominated by populations with temperate introgression and elite tropical backgrounds, including pop6, pop7, pop10 and pop11. In the heatmap, this cluster showed strong expression of GY, EPP, PH, and EH. This consistency indicates that Cluster 1 represents broadly adapted, high-yielding germplasm suitable for mainstream hybrid development.
Cluster 2 comprised 125 genotypes and was mainly represented by Pop2 and Pop3, each contributing 26 genotypes (20.8%). These populations were developed for high yield and MLN tolerance under optimal conditions. The heatmap supported this pattern by showing lower expression of yield-related traits and weaker responses for stress-related traits, such as ASI and SD, together with higher values for stem lodging (SL) and bare husk cover (BHC).
Cluster 3 included 245 genotypes with major contributions from pop8, pop5, pop13, pop14, and pop15, with pop15 contributing the most. The heatmap showed reduced yield and plant architecture traits but stronger expression of stress-adaptive traits, including shorter ASI and earlier flowering and maturity. The presence of known drought-tolerant populations, such as pop1 and pop12, indicates that this cluster represents a diverse stress-adapted group for drought-prone environments.
Cluster 4 contained 224 genotypes and was enriched with drought-targeted particularly pop1, pop12, pop19 populations. In the heatmap, this cluster formed a distinct block with strong stress tolerance traits but reduced performance for optimum yield traits. This pattern indicates that Cluster 4 represents highly drought-resilient germplasm selected under low-moisture conditions and is valuable for breeding in marginal environments. Genotypes from pop17 and 18, each represented by very few genotypes (n = 2 and n = 4, respectively) clustered only under cluster 4, showing limited distribution across clusters (Table 12), which likely reflects their small sample size and reduced representation of within-population variability.
Finally, the broad distribution of the uncharacterized mixed population across all clusters in Table 12 was reflected by its scattered pattern in the heatmap. This highlights the high genetic diversity of the panel. It also demonstrates the effectiveness of combining PCA and heatmap analyses for organizing complex maize germplasm.
3.10 Cluster mean performance and variability
The clustering of genotypes revealed important patterns for breeding (Table 13; Fig 5). Clusters differed for yield-related traits, with some clusters showing higher performance than others. Variation among clusters for GY, PH, and EH confirms sufficient genetic diversity to support multi-trait selection. Traits with minimal differences, such as EAS, PAS, and TLB, showed low variability, indicating high uniformity within clusters.
Days to anthesis differed slightly among clusters. Some clusters flowered earlier, while others were later. The ASI increased across clusters, with the shortest interval observed in the earliest-flowering cluster, which is desirable for reproductive synchrony and stress tolerance. Low variability in phenological traits suggests consistency within clusters.
Plant height and EH contributed noticeably to cluster differentiation. Some clusters had taller plants with higher ear placement, while others were shorter. Structural traits, including stalk lodging and BHC, showed minor differences, with certain clusters exhibiting slightly better structural attributes. Yield-related and quality traits were generally uniform across clusters.
The heatmap (Fig 5) visualizes these patterns, showing clear clustering of traits and genotypes. Higher values for key traits appear prominently, while lower values are distinguishable, highlighting trait differentiation among clusters.
Overall, clusters differed in agronomic performance. Some combined high yield, early flowering, and short ASI, making them desirable for breeding. Others showed balanced performance or distinct structural traits. Low within-cluster variability confirms effective clustering, and the observed inter-cluster differences provide opportunities for parent selection and hybrid development in maize breeding programs.
3.11 Biplot pattern analysis (PCA) in multivariate
In the PCA summary, to achieve 100% cumulative variance, 11 principal components were needed. The first three components accounted for 61.6% of the variation, with their standard deviations/eigenvalues greater than one. In contrast, the remaining eight components had standard deviation values below one, indicating that they were less significant and explain less variation. Specifically, the first two components together accounted for 43.7% of the total variation, with PC1 explaining 25.4% and PC2 explaining 18.3%. The biplot from the multivariate analysis revealed four distinct genotype clustering patterns (Fig 6).
The first two principal components explained 25.4% (PC1) and 18.3% (PC2) of the total phenotypic variation. Each point represents a genotype, colored according to its K-means cluster membership (Clusters 1–4), while labels denote individual genotypes. The arrows indicate trait loadings and their direction and length reflect the magnitude and contribution of each trait to the observed variation. Genotypes located close to the origin exhibit average performance, whereas those farther from the center show stronger associations with specific traits. The four quadrants highlight distinct patterns of trait association and genotype grouping, demonstrating clear multivariate differentiation among clusters and confirming the effectiveness of clustering based on combined agronomic and phenological traits. Where GY = Grain Yield, AD = Anthesis date, SD = Silking date, ASI = Anthesis Silking interval, PH = Plant Height, EH = Ear Height, EPH = Ear Position Height, EPP = Ear Per Plant, BHC = Bad Husk Cover, SL = Stalk Lodging, EAS = Ear Aspect, MOI = Grain Moisture Content.
The length of trait vectors in the PCA biplot reflects their contribution to the total variation explained by the first two principal components. Traits with longer vectors in Fig 6, particularly GY and key plant architectural traits, contributed most to genotype discrimination and were the primary drivers of phenotypic diversity. Specifically, GY exhibited a longer vector than EPP along both PC1 and PC2, indicating that GY has stronger influence on the variation explained by the first two principal components. In contrast, EPP and other traits with relatively shorter vectors contributed less to genotype separation, reflecting lower variability or weaker association with PC1 and PC2. Notably, the MOI displayed the shortest vector length, indicating that it contributed the least to the variation captured by PC1 and PC2.
3.12 Pearson correlation analysis
The Pearson correlation matrix (Fig 7) illustrates the relationships among GY and the evaluated agronomic traits. Grain yield exhibited moderate positive correlations with PH; r ≈ 0.39, EH; r ≈ 0.39, EPP; r ≈ 0.52, and MOI; r ≈ 0.22, indicating that increases in these traits tend to be associated with higher grain yield. In contrast, GY showed negative correlations with AD; r ≈ −0.09, SD; r ≈ −0.21, ASI; r ≈ −0.25, EAS; r ≈ −0.45, and PAS; r ≈ −0.46, suggesting that delayed flowering, wider ASI, and poorer ear and plant appearance are associated with reduced yield.
Diagonal panels display the frequency distributions of individual traits, lower panels show pairwise scatterplots with fitted trend lines, and upper panels present Pearson’s correlation coefficients. Positive and negative values indicate the direction and strength of associations between traits, highlighting key relationships influencing grain yield performance across environments. Where GY = Grain Yield, AD = Anthesis date, SD = Silking date, ASI = Anthesis Silking interval, PH = Plant Height, EH = Ear Height, EPH = Ear Position Height, EPP = Ear Per Plant, EAS = Ear Aspect, PAS = Plant Aspect, MOI = Grain Moisture Content.
Anthesis date was strongly positively correlated with SD (r ≈ 0.89) and moderately correlated with ASI, EH, EPH, EAS, PAS, and MOI, while showing a weak negative association with EPP. ASI displayed negative correlations with PH, EH, EPH, EPP, EAS, and PAS, indicating that a longer ASI tends to coincide with reduced plant vigor and yield-related traits. Plant height was positively correlated with EH, EPH, and EPP, whereas negative associations were observed with EAS and PAS. Ear height similarly showed positive correlations with EPH, EPP, and MOI, and negative correlations with EAS and PAS. In general, the correlation pattern highlights that traits such as ears per plant, ear placement traits, plant height, and moisture index are favorably associated with grain yield, while delayed flowering, extended ASI, and poor ear and plant aspects show unfavorable relationships with yield. These associations underscore the importance of jointly considering yield-enhancing and yield-limiting traits in selection decisions.
4 Discussion
4.1 Phenotypic variation and heritability
Phenotypic analysis revealed substantial variation among maize genotypes under both optimum and moisture-stressed conditions. Significant differences in grain yield across locations indicated inconsistent genotype performance. Similar results have been reported under drought conditions by Poudel et al. [31] and Dube et al. [32]. Narrow-sense heritability for GY, AD, PH, and EH was consistently higher under drought conditions. This indicates stronger additive genetic control in stress environments. The higher heritability observed under moisture stress at Kiboko suggests that genetic variance was more effectively expressed relative to environmental variance. As a result, genetic differences among genotypes became more detectable, improving selection efficiency.
Similar findings have been reported in tropical maize, where moderate-to-high heritability under managed drought enabled effective selection for grain yield and stress tolerance [33–35]. These studies highlight that well-managed stress conditions can enhance the expression of additive genetic effects and accelerate genetic gain.
However, contrasting evidence also exists. Some studies report lower heritability under drought due to strong genotype × environment interaction and the complex genetic basis of stress tolerance [36–38]. Under severe or highly variable stress, environmental variance may dominate and reduce the expression of genetic potential. Similar trends were observed by [39] and Beyene et al. [40], where higher heritability was reported under optimal conditions due to more stable environments.
These differences indicate that heritability under drought is highly context-dependent. It is influenced by stress intensity, environmental uniformity, and population structure. In the present study, drought conditions at Kiboko were sufficiently discriminative to reveal additive genetic variation without masking phenotypic expression.
From a breeding perspective, these results emphasize the importance of selection under target stress environments. This approach improves the identification of genotypes with true drought tolerance. At the same time, multi-environment testing remains essential to ensure stable performance across diverse conditions. The CV for grain yield was highest under the Kiboko drought-stressed condition (37.73%), indicating increased variability under water-limited environments. This is consistent with previous studies in maize, where the coefficient of variation (CV) for grain yield is typically higher under drought stress than under well-watered conditions and can range from approximately 25.6% to 48.08% across different testing locations under water-deficit environments reported by Ningning et al. [41]. Such high variability reflects differential genotypic responses to stress and the reduction in mean yield under drought conditions.
4.2 Genotype variance and environmental effects
Under optimum condition significant genotypic variance and moderate heritability estimates observed for grain yield and related traits under optimum conditions suggest the presence of exploitable additive genetic variation. This aligns with recent studies showing that substantial genetic variability for yield and agronomic traits exists among maize genotypes under well‑watered environments [42]. Such variability offers opportunities for effective phenotypic selection and genetic gain. The significant genotype × location interaction effects indicate environmental influence on trait expression, consistent with evidence that both genotype and environment contribute to yield variation in maize breeding evaluations [43]. These results reaffirm that stable genetic differences under favorable conditions can guide breeders in selecting high‑performing genotypes.
Under moisture stress, the presence of notable genotypic variance and moderate to high heritability for yield and related traits highlights the potential for genetic improvement even in challenging environments. Recent research has similarly reported that genetic variance and heritable differences can be detected for key traits under water deficit, supporting the feasibility of selection under drought [42]. Significant genotype × location interaction under stress further underscores how genotype responses vary with environmental conditions, a pattern consistently observed in maize evaluations across contrasting moisture regimes [44]. The notable residual variance for some traits suggests complex stress responses that may require careful evaluation across seasons to reliably select superior genotypes.
The residual variance for GY and ASI was higher under well-watered (optimum conditions), by 22 and 38%, respectively, as compared to under drought stress condition. The increase might be associated with the greater environmental heterogeneity in the combined optimum analysis. The optimum dataset included both Kakamega and Kiboko, which differ markedly in temperature, humidity, and evaporative demand. Such contrasts likely increased unexplained variability in ASI, a trait known to be highly sensitive to environmental fluctuations [27,45]. In contrast, the managed drought stress at Kiboko provided a more uniform environment, reducing environmental noise and resulting in lower residual variance. This suggests that ASI is more environmentally plastic under heterogeneous optimum conditions than under controlled stress.
4.3 Performance of DH hybrids and standard heterosis
The mean of top 10 hybrids showed strong grain yield performance compared with the mean of commercial checks, with average superiority of 161% under drought and 60.4% under optimum conditions. When compared with previous tropical doubled haploid (DH) studies, these gains appear exceptionally high. For instance, DH-derived maize hybrids evaluated in East Africa typically showed yield advantages ranging from about 23% under optimum conditions to approximately 43% under drought stress relative to commercial checks [46]. Similarly, other tropical hybrid improvement studies reported yield gains of 21–23% under optimum conditions and 51–52% under drought stress for top-performing entries [15]. Therefore, the 161% superiority observed under drought in the present study substantially exceeds the range commonly reported, while the 60.4% gain under optimum conditions is also notably higher than typical values. Recent studies also report that newly developed maize hybrids can achieve high yield and wide adaptability across different environments [47]. The variation in heterosis among crosses indicates the presence of genotype × environment interaction. This interaction is widely reported in maize breeding [48]. Molecular studies further show that heterotic expression can change with environmental conditions. Over dominant gene expression contributes to hybrid vigor in specific environments [49]. These findings highlight the importance of multi-environment testing to identify stable and high-performing hybrids.
Under optimum conditions, several hybrids showed positive standard heterosis for grain yield. This confirms the effectiveness of heterosis breeding in maize. The superiority of these hybrids reflects non-additive gene action and good parental complementarity [50,51]. Hybrids with high standard heterosis under favorable conditions are suitable for further multi-location evaluation.
Under drought conditions, high and significant standard heterosis was also observed in selected hybrids. This indicates that heterosis plays an important role in stress adaptation and yield improvement under moisture stress. Similar results have been reported in maize breeding programs targeting drought-prone environments [52]. Recent molecular evidence suggests that over dominant gene expression and stress-responsive pathways contribute to drought-related heterosis [49]. Therefore, the identified genotypes are valuable resources for drought-focused maize breeding programs.
4.4 Multivariate analysis and clustering
Principal component analysis captured most of the phenotypic variation and grouped genotypes into four distinct clusters, consistent with [53]. Multivariate clustering provided better resolution of diversity than single-trait analysis and helped to identify genotypes suitable for both high yield and stress tolerance. The clustering analysis, considering multiple agronomic and yield-related traits simultaneously, revealed genetically distinct groups with specific trait combinations. This pattern aligns with studies showing significant phenotypic and genetic divergence among maize genotypes for traits such as grain yield, anthesis-silking interval (ASI), plant height, and other yield components, where distinct clusters often reflect biologically meaningful diversity useful for breeding [32,36,54].
The ASI ranged narrowly from 1.02 to 1.35 days across all clusters (Table 13), indicating strong synchrony between pollen shed and silk emergence among the evaluated hybrids. This consistently short ASI is physiologically important and widely recognized as a critical driver of drought resilience in maize. Under water stress, silk emergence is typically delayed more than anthesis, resulting in an extended ASI, reduced pollination success, and poor kernel set. In contrast, genotypes that maintain a short ASI, as observed in this study, preserve reproductive synchrony and ensure effective fertilization even under moisture-limited conditions [27]. Recent studies further confirm that ASI is a key indicator of stress sensitivity, where increased ASI reflects disrupted reproductive development and consequent yield reduction under drought stress [45,55]. This reinforces ASI as a robust physiological marker linking reproductive failure to yield loss under water-limited environments. The consistently short ASI identified in this study expressed alongside other favorable agronomic and physiological traits that contribute to the overall adaptation. This suggests that drought resilience in these hybrids is not governed by a single trait, rather by a coordinated suite of traits that collectively enhance reproductive stability and yield performance.
4.5 Population performance and breeding implications
Our clustering and PCA analyses revealed clear differences in population performance across environments, emphasizing the influence of population origin and breeding history on yield and stress adaptation. Based on GY grouping, Cluster 1 dominated by drought-oriented and broadly adapted populations, exhibited stable but moderate yields, performing relatively better under stress, whereas Cluster 2, enriched with temperate introgression and elite yield-focused populations, showed strong performance under optimal conditions but limited drought tolerance. Cluster 3 combined populations with stress tolerance and advanced selection strategies, maintaining high performance across all environments, indicating broader adaptability. These patterns are consistent with previous studies showing that population-specific breeding strategies shape genotype × environment responses, with temperate introgression and elite tropical populations often driving higher yields under favorable conditions, while stress-adapted populations maintain performance under drought [56,57].
The low overlap of top-performing genotypes across environments highlights substantial genotype × environment interaction, reinforcing the need for targeted selection based on population-specific performance rather than single-genotype evaluation. This aligns with prior findings that multilocation evaluation and multivariate analysis provide better resolution of both broadly and specifically adapted germplasm, enabling the identification of populations with high yield potential or stress resilience [58,59]. From a breeding perspective, these results underscore the value of combining population performance data, clustering, and PCA to guide parental selection and hybrid development. Specifically, the complementary strengths of clusters broad adaptability, stress tolerance, or high yield under optimal conditions can inform crossing strategies to maximize genetic gain and develop hybrids suitable for both marginal and favorable environments.
4.6 Integrated genetic diversity, heterotic structure, and cluster-based selection for enhanced hybrid breeding efficiency
The distribution of doubled haploid (DH)-derived testcrosses across clusters highlights the importance of both within- and between-heterotic group diversity in maize breeding. The dispersion of lines from the same biparental populations into multiple clusters indicates substantial within-population genetic variation, confirming that DH populations can capture recombination-driven diversity and generate broad phenotypic variation. The finding agrees with previous reports [32,36,60,61].
Genotypes from the same heterotic group were distributed across different clusters, suggesting considerable within-group diversity. This is consistent with reports that recombination, introgression, and selection history can create sub-structuring within heterotic groups [36,62,63]. The presence of contrasting genetic backgrounds among clusters indicates that testcrossing DH lines with a tester from the opposite heterotic group remains effective for capturing combining ability and heterosis [32,62]. Hybrids derived from inter-group crosses are often associated with superior grain yield and heterotic expression compared to within-group crosses.
The clustering also reflects adaptive differentiation, with specific groups enriched for drought tolerance, high yield, or introgressed backgrounds. Similar results have been reported in multi-trait clustering studies [32,36]. Integrating genomic and phenotypic data further enhances the identification of adapted and high-performing germplasm [61]. By considering multiple agronomic and adaptive traits simultaneously, cluster-based selection improves the identification of well-balanced genotypes and enhances selection efficiency under complex environments [32,60].
The relationship between genetic distance and heterosis remains complex. While several studies confirmed a positive association particularly for inter-heterotic group crosses, some reports indicates that genetic distance alone is not a reliable predictor [64,65]. The present findings suggested that genetic divergence among DH lines exists but is not fully captured by conventional heterotic grouping. The wide dispersion of lines across clusters indicates complex and admixed diversity. Therefore, even without clear heterotic group separation, sufficient divergence between DH lines and testers can still generate heterotic responses. Furthermore, it has been confirmed that the DH lines derived from biparental populations harbor substantial genetic variation, and that testcrossing with an opposite heterotic group tester remains an effective strategy to exploit this diversity. Integrating DH technology, heterotic grouping, and multi-trait clustering provides a robust framework for improving selection accuracy, maximizing heterosis, and accelerating genetic gain in maize breeding, particularly under drought-prone conditions.
4.7 Association of traits
In this study, which shows that grain yield is controlled by a coordinated network of flowering, architectural, and ear-related traits, agrees with both earlier and recent findings. Grain yield is a complex trait influenced by multiple interrelated components rather than by single traits acting independently [66]. The positive associations of grain yield with ears per plant, plant height, and ear height support earlier reports that these traits enhance sink capacity and assimilate partitioning in maize [4,67]. These traits have therefore been widely recommended as useful indirect selection criteria for yield improvement. In contrast, negative relationships between grain yield and flowering delay traits, particularly the anthesis–silking interval (ASI), are well documented. Extended ASI has been associated with poor reproductive success and reduced grain yield, especially under drought and low-nitrogen conditions [4,68]. Recent studies confirm that genotypes with shorter ASI show greater yield stability across environments [66,69,70]. The central role of ASI identified in this study is supported by physiological and molecular evidence. Reduced ASI improves pollen–silk synchrony and reproductive efficiency, leading to higher grain yield under both stress and non-stress conditions [71]. Overall, these findings reinforce that simultaneous selection for short ASI, favorable plant architecture, and ear productivity is more effective than selection based on grain yield alone. This integrated trait-based approach provides a practical and sustainable framework for yield improvement under variable and stress-prone environments.
5 Conclusions
This study confirms the hypothesis that doubled haploid (DH) lines from biparental maize populations retain sufficient genetic variability. This variability generates exploitable phenotypic diversity and supports adaptation across environments. Genotypes were widely distributed across clusters, indicating strong adaptive differentiation. Heterotic grouping alone did not fully explain genetic diversity or performance. Multivariate approaches, such as PCA and clustering, more effectively captured functional variation linked to adaptation. Clusters acted as adaptive breeding units by integrating genetic background with environmental response. Strong genotype × environment interaction and limited stability of superior genotypes highlighted the need for environment-specific selection. Multi-trait selection is also essential due to the complex control of grain yield. Scientifically, the study demonstrates how DH-derived variability can be structured and exploited using multivariate tools. For breeding, it provides a practical framework for parent selection and hybrid development. Integrating DH technology, heterotic grouping, and cluster-based approaches enhances heterosis and accelerates genetic gain. This strategy supports the development of high-yielding and drought-resilient maize hybrids for variable and stress-prone environments.
Key Message
- Doubled haploid lines created useful diversity within the same heterotic group
- Some hybrids strongly outperformed commercial checks under both optimal and drought conditions
- Strong genotype × environment interaction affected stability
- Cluster-based selection can improve drought tolerance and yield stability
Acknowledgments
The authors are grateful to the International Maize and Wheat Improvement Center (CIMMYT) scientists and technicians who generated the germplasm, and highly appreciate the technical support received from the staff members affiliated to CIMMYT maize research station in Kiboko, Kenya.
References
- 1. Naumann G, Alfieri L, Wyser K, Mentaschi L, Betts RA, Carrao H, et al. Global changes in drought conditions under different levels of warming. Geophysical Research Letters. 2018;45:3285–96.
- 2. Azeem M, Haider MZ, Javed S, Saleem MH, Alatawi A. Drought stress amelioration in maize (Zea mays L.) by inoculation of Bacillus spp. strains under sterile soil conditions. Agriculture. 2022;12.
- 3. Cairns JE, Sonder K, Zaidi PH, Verhulst N, Mahuku G, Babu R. Maize production in a changing climate. impacts, adaptation, and mitigation strategies. Advances in Agronomy. 2012;114:1–58.
- 4.
Bänziger M, Edmeades GO, Beck D, Mauricio RB. Breeding for drought and nitrogen stress tolerance in maize: from theory to practice. Mexico, DF: CIMMYT; 2000.
- 5. Munaweera TIK, Jayawardana NU, Rajaratnam R, Dissanayake N. Modern plant biotechnology as a strategy in addressing climate change and attaining food security. Agric Food Security. 2022; 11:1–28.
- 6.
Bharadwaj DN. Advanced molecular plant breeding: meeting the challenge of food security. Taylor & Francis; 2019.
- 7. Yali W. Review of the genetic variability in maize genotypes. Plant. 2022;10(1).
- 8. Erenstein O, Jaleta M, Sonder K, Mottaleb K, Prasanna BM. Global maize production, consumption and trade: trends and R&D implications. Food Security. 2022;14:1295–319.
- 9. Haruna A, Adu GB, Buah SS, Kanton RAL, Kudzo AI, Seidu AM, et al. Analysis of genotype by environment interaction for grain yield of intermediate maturing drought tolerant top-cross maize hybrids under rain-fed conditions. Cogent Food & Agriculture. 2017;3(1):1333243.
- 10. Kamweru I, Beyene Y, Bruce AY, Makumbi D, Adetimirin VO, Pérez-Rodríguez P, et al. Genetic analyses of tropical maize lines under artificial infestation of fall armyworm and foliar diseases under optimum conditions. Front Plant Sci. 2023;14:1086757. pmid:36743507
- 11. Shiferaw B, Prasanna BM, Hellin J, Bänziger M. Crops that feed the world 6. Past successes and future challenges to the role played by maize in global food security. Food Sec. 2011;3(3):307–27.
- 12.
USDA. World Agricultural Production. United States Department of Agriculture (USDA). 2022.
- 13. Adebayo MA, Menkir A. Assessment of hybrids of drought tolerant maize (Zea mays L.) inbred lines for grain yield and other traits under stress managed conditions. Nigerian Journal of Genetics. 2014;28(2):19–23.
- 14. Yami M, Meyer F, Hassan R. The impact of production shocks on maize markets in Ethiopia: implications for regional trade and food security. Agric Econ. 2020;8(1).
- 15. Beyene Y, Gowda M, Olsen M, Robbins KR. Empirical comparison of tropical maize hybrids selected through genomic and phenotypic selections. Front Plant Sci. 2019;10:1–11.
- 16. Prasanna BM, Cairns JE, Zaidi PH, Beyene Y, Makumbi D, Gowda M, et al. Beat the stress: breeding for climate resilience in maize for the tropical rainfed environments. Theor Appl Genet. 2021;134(6):1729–52. pmid:33594449
- 17.
Prasanna BM, Vijay Chaikam GM. Doubled haploid technology in maize breeding: theory and practice. Mexico, DF: CIMMYT. 2012.
- 18. Dwivedi SL, Britt AB, Tripathi L, Sharma S, Upadhyaya HD, Ortiz R. Haploids: Constraints and opportunities in plant breeding. Biotechnol Adv. 2015;33(6 Pt 1):812–29. pmid:26165969
- 19. Kanaka KK, Sukhija N, Goli RC, Singh S, Ganguly I, Dixit SP, et al. On the concepts and measures of diversity in the genomics era. Current Plant Biology. 2023;33:100278.
- 20. Luqman T, Qamar Z, Tabasum A, El-Kallawy WaelH, Nazir T, Attacha S, et al. Genetic characterization of coarse and basmati rice (Oryza sativa L.) through microsatellite markers and morpho-agronomic traits. Genet Resour Crop Evol. 2023;70(8):2307–20.
- 21. Soliman ERS, El-Shazly HH, Börner A, Badr A. Genetic diversity of a global collection of maize genetic resources in relation to their subspecies assignments, geographic origin, and drought tolerance. Breed Sci. 2021;71(3):313–25. pmid:34776738
- 22. Madankar K, Shahi JP, Singh PK, Kr Y, Singamsetti A, Nair SK, et al. Elucidating molecular diversity and grouping of Indian maize (Zea mays L.) inbred lines using SNP markers. Cereal Research Communications. 2023;52(2):475–87.
- 23. Melchinger AE, Gumber RK. Concepts and Breeding of Heterosis in Crop Plants. 1998.
- 24. Reif JC, Hallauer AR, Melchinger AE. Heterosis and heterotic patterns in maize. Maydica. 2005;50:215–23.
- 25. Hallauer R, Carena MJ, Miranda Filho JB. Quantitative Genetics in Maize Breeding. Springer. 2010.
- 26. Lado B, Barrios PG, Quincke M, Silva P, Gutiérrez L. Modeling genotype × environment interaction for genomic selection with unbalanced data from a wheat breeding program. Crop Science. 2016;56(5):2165–79.
- 27. Bolaños J, Edmeades GO. The importance of the anthesis-silking interval in breeding for drought tolerance in tropical maize. Field Crops Research. 1996;48(1):65–80.
- 28. Jahufer MZZ, Luo D. DeltaGen: a comprehensive decision support tool for plant breeders. Crop Science. 2018;58(3):1118–31.
- 29.
R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing; 2025.
- 30. Govindaraj M, Rai KN, Kanatti A, Upadhyaya HD, Shivade H, Rao AS. Exploring the genetic variability and diversity of pearl millet core collection germplasm for grain nutritional traits improvement. Sci Rep. 2020;10(1):21177. pmid:33273504
- 31. Poudel R, Kandel BP, Poudel A, Tripathi A, Gurung R, Ghimire U. Performance evaluation and multivariate analysis of maize accessions against drought stress in Lamjung district, Nepal. Discov Agric. 2024;2(1).
- 32. Dube SP, Sibiya J, Kutu F. Genetic diversity and population structure of maize inbred lines using phenotypic traits and single nucleotide polymorphism (SNP) markers. Sci Rep. 2023;13(1):17851. pmid:37857752
- 33. Singamsetti A, Zaidi PH, Seetharam K, Vinayan MT, Olivoto T, Mahato A, et al. Genetic gains in tropical maize hybrids across moisture regimes with multi-trait-based index selection. Front Plant Sci. 2023;14:1147424. pmid:36938016
- 34. Tarekegne A, Wegary D, Cairns JE, Zaman-Allah M, Beyene Y, Negera D, et al. Genetic gains in early maturing maize hybrids developed by the International Maize and Wheat Improvement Center in Southern Africa during 2000–2018. Frontiers in Plant Science. 2023;14.
- 35. Idehen EO, Abegunde IO, Kehinde OB. Genetic variability of yield and yield-related traits among 27 drought-tolerant maize genotypes. Agricultura Tropica et Subtropica. 2023;56(1):81–91.
- 36. Alam MA, Rahman M, Ahmed S, Jahan N, Khan MAA, Islam MR. Genetic Variation and Genotype by Environment Interaction for Agronomic Traits in Maize (Zea mays L.) Hybrids. Plants. 2022;11.
- 37. Chen S, Dang D, Liu Y, Ji S, Zheng H, Zhao C, et al. Genome-wide association study presents insights into the genetic architecture of drought tolerance in maize seedlings under field water-deficit conditions. Front Plant Sci. 2023;14:1165582. pmid:37223800
- 38. Amadu MK, Beyene Y, Chaikam V, Tongoona PB, Danquah EY, Ifie BE, et al. Genome-wide association mapping and genomic prediction analyses reveal the genetic architecture of grain yield and agronomic traits under drought and optimum conditions in maize. BMC Plant Biol. 2025;25(1):135. pmid:39893411
- 39. Elmyhun M, Abate E, Abate A, Teklewold A, Menkir A. Genetic analysis of tolerance to combined drought and heat stress in tropical maize. PLoS One. 2024;19(6):e0302272. pmid:38900753
- 40. Beyene Y, Mugo S, Semagn K, Asea G, Trevisan W, Tarekegne A, et al. Genetic distance among doubled haploid maize lines and their testcross performance under drought stress and non-stress conditions. Euphytica. 2013;192(3):379–92.
- 41. Ningning Z, Binbin L, Fan Y, Jianzhong C, Yuqian Z, Yejian W, et al. Molecular mechanisms of drought resistance using genome-wide association mapping in maize (Zea mays L.). BMC Plant Biol. 2023;23(1):468. pmid:37803273
- 42. Sedhom YSA, Rabie HA, Awaad HA, Alomran MM, Alshamrani SM, Mansour E, et al. Genetic potential of newly developed maize hybrids under different water-availability conditions in an arid environment. Life (Basel). 2024;14(4):453. pmid:38672723
- 43. Dube SP, Shayanowako AIT, Kutu F, Sibiya J. Performance of tropical and subtropical maize inbred lines under well-watered and drought-stressed environments. Euphytica. 2024;220(9).
- 44. Manigben KA, Beyene Y, Chaikam V, Tongoona PB, Danquah EY, Ifie BE, et al. Testcross performance and combining ability of intermediate maturing drought tolerant maize inbred lines in Sub-Saharan Africa. Front Plant Sci. 2024;15:1471041. pmid:39678012
- 45. Cairns JE, Crossa J, Zaidi PH, Grudloyma P, Sanchez C, Araus JL, et al. Identification of drought, heat, and combined drought and heat tolerant donors in maize. Crop Science. 2013;53(4):1335–46.
- 46. Sserumaga JP, Beyene Y, Pillay K, Kullaya A, Oikeh SO, Mugo S, et al. Grain-yield stability among tropical maize hybrids derived from doubled-haploid inbred lines under random drought stress and optimum moisture conditions. Crop Pasture Sci. 2018;69(11):691–712. pmid:33312239
- 47. Khan AM, Ali Z, Mohsin A ul, Soomro AA, Hussain K, Rafiq M, et al. Genotype × environment interactions analysis for chickpea grain yield and related traits by a mixed model approach. Discov Agric. 2024;2(1).
- 48. Bal SS, Adak A, Erdal Ş. Exploring the interplay between drought resilience and yield in maize hybrids for drought-prone regions. Tropical Plant Biol. 2025;18(1).
- 49. Dai K, Zhang Z, Wang S, Yang J, Wang L, Jia T, et al. Molecular mechanisms of heterosis under drought stress in maize hybrids Zhengdan7137 and Zhengdan7153. Front Plant Sci. 2024;15:1487639. pmid:39439513
- 50. Taye WE, Abebe TD, Tiruneh MA. Evaluation of standard heterosis for grain yield and yield-related traits in maize (Zea mays L.) hybrids in West Gojam, Ethiopia. IJ-FANRES. 2025;6(3):60–6.
- 51. Mideksa LY, Alamerew S, Tadesse B. Hybrid performance and heterosis for yield and agronomic traits of quality protein maize (Zea mays L.) inbred lines adapted to mid-altitude agroecology of Ethiopia. Agro Bali Agric J. 2022;5(2):219–39.
- 52. Sang ZQ, Zhang ZQ, Yang YX, Li ZW, Liu XG, Xu YB, et al. Heterosis and heterotic patterns of maize germplasm revealed by a multiple-hybrid population under well-watered and drought-stressed conditions. Journal of Integrative Agriculture. 2022;21:2477–91.
- 53. Naveen A, Hathiram D, Supriya P, Danakumara T, Mishra VK, Sinha B, et al. Multivariate principal component analysis and clustering methods for assessing genetic diversity in bread wheat (Triticum aestivum L.) genotypes. IJBSM. 2024;15(Nov, 11):01–8.
- 54. Azrai M, Aqil M, Efendi R, Andayani NN, Suwardi, Zainuddin B, et al. Integrating multi-trait and multi-index approaches for identifying drought tolerant tropical maize genotypes. Front Sustain Food Syst. 2025;9.
- 55. Zhuang L, Wang C, Hao H, Song W, Guo X. Maize anthesis-silking interval estimation via image detection under field rail-based phenotyping platform. Agronomy. 2024;14(8):1723.
- 56. Bocianowski J, Nowosad K, Rejek D. Genotype-environment interaction for grain yield in maize (Zea mays L.) using the additive main effects and multiplicative interaction (AMMI) model. J Appl Genet. 2024;65(4):653–64. pmid:39115751
- 57. Talefe W. Cluster and principal component analysis of maize inbred lines in low moisture stress areas in Ethiopia. Int J Agric Sc Food Technol. 2023;9(3):059–63.
- 58. Bai L, Wang K, Zhang Q, Zhang Q, Wang X, Pan S, et al. A study of maize genotype–environment interaction based on deep K-means clustering neural network. Agriculture. 2025;15(4):358.
- 59. Hudson AI, Odell SG, Dubreuil P, Tixier M-H, Praud S, Runcie DE, et al. Analysis of genotype-by-environment interactions in a maize mapping population. G3 (Bethesda). 2022;12(3):jkac013. pmid:35134181
- 60. Cheng D, Li J, Guo S, Wang Y, Xu S, Chen S, et al. Genomic prediction for germplasm improvement through inter-heterotic-group line crossing in maize. Int J Mol Sci. 2025;26(6):2662. pmid:40141304
- 61. Li Z, Li C, Zhang R, Duan M, Tian H, Yi H, et al. Genomic analysis of a new heterotic maize group reveals key loci for pedigree breeding. Front Plant Sci. 2023;14:1213675. pmid:37636101
- 62. Adejumobi II, Mengesha W, Gedil M, Meseka S, Abubakar AM, Bonkoungou OT, et al. Genomic-assisted refinement of heterotic groups in short-duration maturing tropical yellow and orange maize inbred lines. BMC Agric. 2026;2(1):9. pmid:41835110
- 63. Rajasekhar D, Kl N, Kalita S, Tatiparthi H, Rai M, Sen D. Heterotic grouping of maize (Zea mays L.) inbreds derived from diverse indigenous landraces of the Northeast Hill Region of India. Plant Genet Resour. 2025;23(4):252–61.
- 64. Sun J, Yin X, Ma Y, Jiang F, Guo R, Shaw RK, et al. Heterotic group classification and heterosis prediction in recombinant maize inbred lines. J Sci Food Agric. 2026;106(7):4017–27. pmid:41622696
- 65. Akinyosoye ST, Balogun MO, Olakojo SA. Heterosis of quality protein maize inbred lines for agronomic traits and association with genetic distances based on SSR and phenotypic markers. Plant Gene. 2023;36:100435.
- 66. Masuka B, Magorokosho C, Olsen M, Atlin GN, Bänziger M, Pixley KV. Gains in maize genetic improvement in eastern and southern Africa: II. CIMMYT open-pollinated variety breeding pipeline. Crop Science. 2017;57:180–91.
- 67. Singh D, Kumar A, Kumar R, Singh SK, Kushwaha N, Mohanty TA. Correlation and path coefficient analysis for ‘Yield contributing’ traits in quality protein maize (Zea mays L.). CJAST. 2020:91–9.
- 68. Kamara AY, Kling JG, Menkir A, Ibikunle O. Agronomic performance of maize (Zea mays L.) breeding lines derived from a low nitrogen maize population. J Agric Sci. 2003;141:221–30.
- 69. Valkova V, Vulchinkov Z. The anthesis–silking interval and its relation to certain yield components in maize hybrids from the mid-early FAO group. Poljoprivreda. 2025;31(3):3–11.
- 70. Silva PC, Sánchez AC, Opazo MA, Mardones LA, Acevedo EA. Grain yield, anthesis-silking interval, and phenotypic plasticity in response to changing environments: Evaluation in temperate maize hybrids. Field Crops Research. 2022;285:108583.
- 71. Tao K, Li Y, Hu Y, Li Y, Zhang D, Li C, et al. Overexpression of ZmEXPA5 reduces anthesis-silking interval and increases grain yield under drought and well-watered conditions in maize. Mol Breed. 2023;43(12):84. pmid:38009100