Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genome-wide association analysis and genomic selection for leaf-related traits of maize

  • Yukang Zeng ,

    Contributed equally to this work with: Yukang Zeng, Xiaoming Xu

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Project administration, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation College of Agronomy, Xinjiang Agricultural University, Urumqi, Xinjiang, China

  • Xiaoming Xu ,

    Contributed equally to this work with: Yukang Zeng, Xiaoming Xu

    Roles Conceptualization, Data curation, Formal analysis

    Affiliation College of Agronomy, Xinjiang Agricultural University, Urumqi, Xinjiang, China

  • Jiale Jiang,

    Roles Data curation, Formal analysis, Investigation, Software

    Affiliation College of Agronomy, Xinjiang Agricultural University, Urumqi, Xinjiang, China

  • Shaohang Lin,

    Roles Data curation, Investigation, Methodology, Supervision

    Affiliation College of Agronomy, Xinjiang Agricultural University, Urumqi, Xinjiang, China

  • Zehui Fan,

    Roles Data curation, Formal analysis, Methodology

    Affiliation College of Agronomy, Xinjiang Agricultural University, Urumqi, Xinjiang, China

  • Yao Meng,

    Roles Data curation, Formal analysis

    Affiliation College of Agronomy, Xinjiang Agricultural University, Urumqi, Xinjiang, China

  • Atikaimu Maimaiti,

    Roles Validation

    Affiliation College of Agronomy, Xinjiang Agricultural University, Urumqi, Xinjiang, China

  • Penghao Wu ,

    Roles Funding acquisition, Methodology, Resources, Supervision, Validation, Visualization, Writing – review & editing

    craie788@126.com (PW); renjiaojiao789@sina.com (JR)

    Affiliation College of Agronomy, Xinjiang Agricultural University, Urumqi, Xinjiang, China

  • Jiaojiao Ren

    Roles Funding acquisition, Methodology, Resources, Supervision, Validation, Visualization, Writing – review & editing

    craie788@126.com (PW); renjiaojiao789@sina.com (JR)

    Affiliation College of Agronomy, Xinjiang Agricultural University, Urumqi, Xinjiang, China

Abstract

Maize is an important food crop worldwide. The length, width, and area of leaves are crucial traits of plant architecture and further influencing plant density, photosynthesis, and crop yield. To dissect the genetic architecture of leaf length, leaf width, and leaf area, a multi-parents doubled haploid (DH) population was used for genome-wide association study (GWAS) and genomic selection (GS). The length, width, and area of the first leaf above the uppermost ear, the uppermost ear leaf, and the first leaf below the uppermost ear were evaluated in multi-environment trials. Using BLINK and FarmCPU for GWAS, 19 significant single nucleotide polymorphisms (SNPs) on chromosomes 1, 2, 5, 6, 8, 9, and 10 were associated with leaf length, 49 SNPs distributed over all 10 chromosomes were associated with leaf width, and 37 SNPs distributed on all 10 chromosomes except for chromosome 3 were associated with leaf area. The phenotypic variation explained (PVE) by each QTL ranged from 0.05% to 27.46%. Fourteen pleiotropic SNPs were detected by at least two leaf-related traits. A total of 57 candidate genes were identified for leaf-related traits, of which 44 were annotated with known functions. Candidate genes Zm00001d032866, Zm00001D022209, and Zm00001d001980 are involved in leaf senescence. Zm00001d026130, Zm00001d002429, Zm00001d023225, and Zm00001d046767 play important roles in leaf development. GS analysis showed that when 60% of the total genotypes was used as the training population and 3000 SNPs were used for prediction, moderate prediction accuracy was obtained for leaf length, leaf width, and leaf area. The prediction accuracy would be improved by using top significantly associated SNPs for GS. The current study provides a better understanding of the genetic basis of leaf length, leaf width, and leaf area, and valuable information for improving plant architecture by implementing GS.

1. Introduction

Maize is an important food and feed crop worldwide. Several studies have shown that increasing the planting density of maize leads to higher crop yields [1]. In 1968, Donald [2] introduced the concept of the ideal plant type, which refers to a specific plant structure that promotes growth and photosynthesis while minimizing competition between individuals. Optimized plant morphology can effectively reduce competition within the population and achieve higher planting densities for maize [3,4]. Plant morphology plays a crucial role in breeding high-yielding maize varieties. Leaves are essential components of plant morphology and the main organ of photosynthesis. Adequate leaf size can enhance photosynthetic efficiency and ultimately increase crop yield [5]. Genetic analysis of leaf-related traits can provide a deeper understanding of the mechanisms underlying maize leaf development.

Numerous studies have been reported for the genetic analysis of leaf-related traits using linkage mapping, genome-wide association study (GWAS), or other methods. The RIL population derived from a cross between B73 and SICAU1212 was used to identify the leaf length, leaf width, and leaf area of eight consecutive leaves below the tassel [6]. Thirty-four QTL located on chromosomes 1, 2, 3, 4, 5, 6, and 8 were detected for the leaf length of eight leaves, explaining 4.19% to 13.03% of the phenotypic variance individually. Eighty-three QTL located on all chromosomes except for chromosome 9 were associated with the leaf width of eight leaves, and the phenotypic variance explained (PVE) by a single QTL ranged from 3.05% to 24.77%. Fifty-three QTL located on all chromosomes except for chromosome 7 were identified for leaf area of eight leaves, and the PVE value of each QTL ranged from 4.19% to 20.76%. Wang et al. [7] identified 23 QTL for leaf length and 25 QTL for leaf width using a RIL population derived from a cross between Z58 and HD568. A total of nine QTL related to leaf length, 17 QTL related to leaf width, and 16 QTL related to leaf area were detected in an IBM Syn10 DH maize population [8]. In the study of Guo et al. [9], 46 QTL related to the width of leaves at different positions above the uppermost ear were detected in four RIL populations, which were derived from four crosses between Yu82 and Yu87–1, Yu82 and Shen137, Zong3 and Yu87–1, and Yu537A and Shen137. Three common QTL were identified for leaf width at all four evaluated positions, indicating that leaf width at different leaf positions may be controlled by one or several identical QTL.

In a collection of 300 maize inbred lines, five SNPs significantly associated with leaf width were identified by GWAS and 10 genes were annotated [10]. Dai et al. [11] conducted GWAS and linkage analysis to dissect the genetic basis of 25 leaf-related traits. In the IBM Synl0 DH population, 130 QTL with the value of each QTL ranged from 3.46% to 10.02% were detected for leaf length of eight leaves, 202 QTL with the PVE value of each QTL ranged from 3.40% to 9.87% were detected for leaf width of eight leaves, and 166 QTL with the PVE value of each QTL ranged from 3.65% to 11.28% were detected for leaf area of eight leaves. Using an association panel consisting of 334 diverse maize inbred lines, 79, 52, and 40 SNPs were significantly related to leaf length, leaf width, and leaf area of eight leaves, respectively. Five unique SNPs related to leaf length were identified by both linkage mapping and association analysis. Although numerous genetic loci controlling leaf length, leaf width, and leaf area have been detected, few loci were co-located due to the use of different methods, populations, and environments.

Numerous genes and proteins affecting leaf growth and development have been identified. ZmNL4 is a kelch-repeat superfamily gene that controls leaf width in maize [12]. Knockout of ZmNL4 by CRISPR/Cas9 editing significantly reduced the leaf length by reducing the cell number. NAD kinase (NADK) can catalyze the conversion of NAD+ into NADP+ and affect multiple metabolic pathways by adjusting phosphorylation levels [13]. In particular, NADK2, which is localized in chloroplasts, plays a pivotal role in the energy conversion of photosynthesis and can affect chlorophyll content and leaf size [14]. Stress-associated proteins (SAPs), as a subclass of zinc finger proteins (ZFPs), are widely recognized to play a crucial regulatory role in plant development and stress response [15].

Genomic selection (GS) using genome-wide markers to predict the genomic estimated breeding values (GEBVs) of individuals with only genotype data can accelerate genetic gains in maize breeding [16]. The prediction model is established by a training population set of individuals with both genotype and phenotype data [17]. Selection is conducted based on GEBVs. GS has been reported in various studies on different traits of maize, such as plant height [18,19], stem strength [20], root related traits [21], shell related traits [22], husk tightness [23], and resistance to biological [24] and abiotic stress [25]. The prediction accuracy of different traits varies greatly. Pace et al. [26] conducted GS analysis on root seedling traits and found that the prediction accuracy of total root length (TRL) ranged from 0.10 to 0.56, with an average accuracy of 0.42, the prediction accuracy of primary root length (PRL) ranged from 0.10 to 0.58, with an average accuracy of 0.44, and the prediction accuracy of the secondary root length (SEL) ranged from 0.30 to 0.56, with an average accuracy of 0.43. The prediction accuracy of resistance to common rust was 0.61 in a GWAS panel based on genotyping-by-sequencing (GBS) SNPs and the five-fold cross-validation method [27]. Limited information is available on GS of leaf-related traits of maize.

The prediction accuracy of GS was affected by many factors [23,2830]. Zhang et al. [29] studied the effect of trait heritability, training population size, and marker density on prediction accuracy in 22 bi-parental tropical maize populations. Moderate prediction accuracy was achieved for grain yield, anthesis data, and plant height, when using 50% of the total genotypes as the training population and about 200 SNPs for prediction. Liu et al. [23] used six models, rrBLUP, BayesA, BayesB, BayesC, Bayesian LASSO (BL), and Bayesian ridge regression to evaluate the effect of prediction models for GS. The results showed that the rrBLUP model was the optimal prediction strategy for husk tightness. For rind penetrometer resistance (RPR), a high prediction accuracy was observed when using a multi-variable model including QTL as fixed effects [20]. Zhou et al. [30] also achieved high prediction accuracy by including kernel moisture content (KMC)-related QTLs as a fixed effect in GS for KMC at harvest time in maize. In the study of Guo et al. [31], GBS and repeat amplification sequencing (rAmpSeq) markers were used for GS for Zn concentration, the prediction accuracy with GBS markers was significantly higher than that with rAmpSeq markers.

In this study, a multi-parent DH population consisted of 379 DH lines was developed for GWAS and GS. The DH population was phenotyped in multi-environment trials and genotyped using the 48K liquid-phase hybridization probe capture technique. The main objectives of this study were to: (1) detect the significant SNPs associated with leaf-related traits; (2) predict candidate genes; (3) estimate GS prediction accuracy of leaf-related traits, and (4) explore the impact of marker density, training population size, and significantly associated SNPs on prediction accuracy.

2. Materials and methods

2.1. Plant materials and field experiment

A DH population consisting of 379 DH lines was used in this study. The DH population was developed from 21 hybrids, including XY1466, XY1366, XY1266, XY1148, XY1140, XY047, XY1225, XY335, XY1224, J1652, DK653, C6361, Lidan771, Lidan638, Lidan618, DK516, Lihe869, DK159, ZD958, Lidan295 and JK968. The DH population was planted at the Sangong Town experimental station (87° 12′ 57″ E, 43° 56′ 54″ N) and Dafeng Town experimental station (86° 34′ 49″ E, 44° 10′ 47″ N), Xinjiang, in the summer season of 2022; Ledong experimental station (108° 57′ 14″ E, 18° 27′ 14″ N), Hainan, in the winter season of 2022; and Qitai County experimental station (89° 44′ 19″ E, 44° 5′ 77″ N), Xinjiang, in the summer season of 2023. Xinjiang is located in the northwest of China, with a typical temperate continental arid climate. Hainan is located at the most southern part of China, with a tropical monsoon maritime climate.The experiments were conducted in a randomized complete block design with two replications. Eleven seeds were planted in a single-row plot, with a row length of 2.5 m and a row spacing of 0.60 m. Ten days after pollination, five plants showing consistent growth were measured. The length and the width of the first leaf above the uppermost ear, the uppermost ear leave, and the first leaf below the uppermost ear were measured. Leaf length was measured from the ligule to the tip of the blade. Leaf width was measured at the widest point of the leaf. Leaf area was calculated as follows: .

2.2. Phenotype analysis

The phenotypic data of leaf length, leaf width, and leaf area was analyzed using R3.4.4. The lme4 [32] package was used to obtain the best linear unbiased prediction (BLUP) values, which were used for GWAS and GS analyses. The mixed linear model was as follows:

where is the phenotype value of the jth genotype in the kth replicate of the ith environment, is the over all mean, is the effect of ith environment, is the effect of jth genotype, is the effect of kth replicate in ith environment, is the error. All the factors in the model were considered as random effects.

To calculate the broad-sense heritability (H2) [33], we employed the following formula:

where, represents genotypic represents genotype- represents residual variance, i represents the number of environments, and k represents the number of replications. The correlation of leaf-related traits were estimated by the Pearson’s correlation analysis.

2.3. Genotyping and genotypic data analysis

Young leaves of DH lines were sampled. Sequencing was performed using the 48K liquid-phase hybridization probe capture technique at China Golden Marker Biotech Co. (Beijing, China). Trimmomatic-0.36 [34] was used for quality control. Clean reads were aligned to the Maize B73_RefGen_v4 reference genome by BWA 0.7.17 software [35]. A total of 1,583,425 SNPs were obtained and filtered using the GATK software [36] with the basic filtering parameters recommended on the GATK website (https://gatk.broadinstitute.org/hc/en-us/articles/360035531112--How-to-Filter-variants-either-with-VQSR-or-by-hard-filtering). After filtering, 1,322,279 SNPs were obtained and further filtered using vcftools [37]. The SNPs with missing rate (MR) >20% and minor allele frequency (MAF) <0.05 were deleted. Finally, 134,768 high-quality SNPs distributed on 10 maize chromosomes were obtained for GWAS and GS analyses.

2.4. Population structure and genome-wide association analysis

STRUCTURE V2.3.4 [38] was used for population structure analysis. The number of populations (K) was set from 1 to 11 with run lengths of 10,000 and replications of 50,000. The number of iterations was set to 5. Structure Harvester V0.6.91 (http://alumni.soe.ucsc.edu/~dearl/software/structureHarvester/) [39] was used to visualize the STRUCTURE output and implement the Evanno method. Delta K was estimated using the following formula:

where is the absolute value of the 2nd order rate of change of the mean likelihood of K, is the likelihood of K.

The number of subgroups K was determined by delta K. The Q matrix was obtained by clumpp V1.1.2 (http://rosenberglab.bioinformatics.med.umich.edu/clumpp.html) [40]. TASSEL V5.0 [41] was used for linkage disequilibrium (LD) analysis. The sliding window size was set as 50 SNPs. GWAS was conducted using the bayesian-information and linkage-disequilibrium iteratively nested keyway (BLINK) and fixed and random model circuitous probability unification (FarmCPU) [42] of GAPIT (https://zzlab.net/GAPIT/GAPIT.library.R) package in R. The first three principal components (PCs) were estimated for population structure assessment. The significant threshold P value was 7.42 × 10−6 estimated by 1/ the number of effective markers.

2.5. Candidate gene analysis and validation by quantitative RT-PCR

Within the local LD blocks of significant SNPs, candidate genes were retrieved and annotated according to B73 maize genome reference V4.0. Genes that may be involved in leaf development were identified as the putative candidate genes.

Two genotypes, 20NP15396 with long wide leaves and 20NP15299 with short narrow leaves were utilized for quantitative real-time PCR (qRT-PCR) analysis. The first leaf above the uppermost ear, the uppermost ear leaf, and the first leaf below the uppermost ear were sampled with three biological replicates at the tasseling stage. Total RNA was isolated using TRIzol® reagent (Invitrogen). The PrimeScript™ RT reagent kit (TAKARA) with the gDNA Eraser was used to eliminate genomic DNA contamination and synthesize cDNA.

Two candidate genes Zm00001d002034 and Zm00001d011174 were selected for expression analysis. The Primer-Premier 5.0 program was used to design quantitative primers (S1 Table). The glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene was used as the internal reference gene for quantification. qRT-PCR was performed using the TB Green® Premix Ex Taq™ Kit (TAKARA) on an ABI 7500 Real-Time PCR System (Applied Biosystems). The relative expression of candidate genes was calculated using the 2-ΔΔCT equation methods [43].

2.6. Genomic selection

Genomic selection analysis of the length, width, and area of the first leaf above the uppermost ear, the uppermost ear leaf, and the first leaf below the uppermost ear was conducted using the rrBLUP package [44] in R3.4.4. The analysis was conducted by a five-fold cross-validation method with 100 replications, where 80% of the population was randomly selected as the training population and the remaining 20% was used as the prediction population. BLUP values across environments were used as the phenotype values of the training population. The GEBVs of the prediction population were predicted. The prediction accuracy was estimated as the Pearson’s correlation coefficient between the BLUP values across environments and GEBVs.

The effect of marker density on prediction accuracy was estimated by setting the number of markers to 10, 30, 50, 100, 300, 500, 1,000, 3,000, 5,000, and 10,000. The markers were selected randomly. The prediction accuracy was then estimated using a five-fold cross-validation method with 100 replications. To study the effect of training population size on prediction accuracy, the training population varied from 10% to 90% of the total population was used for GS, whereas the remaining population was used as the prediction population. All the markers were used for GS with 100 replications. To explore the effect of significantly associated markers on prediction accuracy, SNPs were sorted in ascending order of P-value, which was obtained by GWAS and represented the significant association levels between SNPs and target traits. The number of SNPs with the lowest P-value was set at 1, 3, 5, 10, 30, 50, 100, 300, and 500.

3. Results

3.1. Phenotypic variation

The descriptive statistics of leaf-related traits are shown in Table 1. The length of the first leaf above the uppermost ear (LL1), the uppermost ear leaf (LL2), and the first leaf below the uppermost ear (LL3) ranged from 71.70 cm to 77.10 cm. LL1 was the shortest and LL3 was the longest. The width of the first leaf above the uppermost ear (LW1), the uppermost ear leaf (LW2), and the first leaf below the uppermost ear (LW3) were similar, ranging from 9.31 cm to 9.44 cm. The area of the first leaf above the uppermost ear (Lar1), the uppermost ear leaf (Lar2), and the first leaf below the uppermost ear (Lar3) ranged from 502.00 cm2 to 543.00 cm2. The coefficient of variation (CV) of leaf-related traits varied from 12.77% to 16.61%, and LW3 showed the strongest variation. All the traits showed sufficient phenotypic variations and exhibited a normal distribution.

thumbnail
Table 1. Phenotypic performance, variance components, and broad-sense heritability (H2) of leaf-related traits in maize.

https://doi.org/10.1371/journal.pone.0323140.t001

The analysis of variance revealed highly significant differences (P < 0.001) among genotypic variance and genotype × environment interaction variance. The broad-sense heritability of LL1, LL2, LL3, LW1, LW2, LW3, Lar1, Lar2, and Lar3 was 0.75, 0.74, 0.73, 0.80, 0.78, 0.76, 0.77, 0.75, and 0.75, respectively. These relatively high heritability values indicated that the phenotypic variation was mainly caused by genetic variation. Correlation analysis showed significant (P < 0.01) correlations between every two leaf-related traits (Fig 1A and 1B).

thumbnail
Fig 1. Pattern diagram of maize plant and correlation analysis for leaf-related traits.

(A) The orange leaves are the first leaf above the uppermost ear, the uppermost ear leaf, and the first leaf below the uppermost ear; (B) Pearson’s correlation analysis of leaf-related traits. ** represents p < 0.01 and *** represents p < 0.001.

https://doi.org/10.1371/journal.pone.0323140.g001

3.2. Genotyping

When r2 = 0.25, the LD decay distance for the 10 chromosomes ranged from 8.73 to 38.71 kb, with an average of 16.18 kb (Fig 2A). The results of population structure analysis are presented in Fig 2B and 2C. When the K value was 2, delta K reached a peak, indicating that the DH population can be divided into two subgroups. The number of DH lines in subgroup 1 and subgroup 2 was 37 and 342, respectively. Principal component analysis (PCA) revealed two subgroups that were consistent with the results of STRUCTURE analysis (Fig 2D).

thumbnail
Fig 2. Genetic diversity analysis of the DH population.

(A) Linkage disequilibrium decay for the 10 chromosomes of maize; (B) Plot of delta K; (C) Genetic structure of the DH population at K = 2; (D) Principal component analysis.

https://doi.org/10.1371/journal.pone.0323140.g002

3.3. Significantly associated SNPs of leaf-related traits

The GWAS results of BLINK are shown in Table 2 and Fig 3. The quantile–quantile (Q-Q) plots had a straight line with a sharp upward deviated tail, indicating that the false positive and false negative were well controlled by the BLINK model. A total of 11 SNPs were significantly associated with leaf length. Five SNPs on chromosomes 1, 5, 6, 8, and 10 were significantly associated with LL1. The most significant SNP 8_141809639 with the lowest P value of 1.46 × 10-11 was located on chromosome 8. It had a MAF of 0.26, a SNP effect of -1.79, and a PVE of 5.51%. Two SNPs on chromosomes 8 were significantly associated with LL2. The most significant SNP 8_98628012 with the lowest P value of 1.38 × 10-7 had a MAF of 0.14, a SNP effect of 1.48, and a PVE of 15.28%. Four SNPs on chromosomes 2, 8, 9, and 10 were significantly associated with LL3. The most significantly associated SNP 8_141809639 was located on chromosomes 8. It had the lowest P-value of 8.94 × 10−12. It had a MAF of 0.26, a SNP effect of -1.70, and a PVE of 6.89%.

thumbnail
Table 2. Significant SNPs and candidate genes of leaf-related traits identified by BLINK.

https://doi.org/10.1371/journal.pone.0323140.t002

thumbnail
Fig 3. Manhattan and QQ plots of leaf-related traits in maize using BLINK.

The dashed lines represent the threshold at P = 7.42 × 10−6. A, B, and C represent LL1, LL2, and LL3, respectively. D, E, and F represent LW1, LW2, and LW3, respectively. H, I, and G represent Lar1, Lar2, and Lar3.

https://doi.org/10.1371/journal.pone.0323140.g003

A total of 24 SNPs were significantly associated with leaf width. Eleven SNPs on chromosomes 1, 2, 5, 6, 7, 8, 9, and 10 were significantly associated with LW1. The most significant SNP 1_3881916 with the lowest P value of 2.43 × 10-16 was located on chromosome 1. It had a MAF of 0.13, a SNP effect of 0.35, and a PVE of 15.54%. Six SNPs on chromosomes 1, 2, 5, and 9 were significantly associated with LW2. The most significant SNP 1_3881916 with the lowest P value of 1.10 × 10-08 was located on chromosome 1. It had a MAF of 0.13, a SNP effect of 0.25, and a PVE of 10.42%. Seven SNPs on chromosomes 1, 5, 7, 9, and 10 were significantly associated with LW3. The most significant SNP 9_96557394 with the lowest P value of 2.16 × 10-11 was located on chromosome 9. It had a MAF of 0.28, a SNP effect of -0.25, and a PVE of 8.99%.

A total of 24 SNPs were significantly associated with leaf area. Seven SNPs on chromosomes 1, 2, 4, 5, 8, and 9 were significantly associated with Lar1. The most significant SNP 8_122873056 with the lowest P value of 1.25 × 10-12 was located on chromosome 8. It had a MAF of 0.29, a SNP effect of -24.39, and a PVE of 13.00%. Nine SNPs on chromosomes 1, 2, 4, 8, and 9 were significantly associated with Lar2. The most significant SNP 1_5522631 with the lowest P value of 1.62 × 10-08 was located on chromosome 1. It had a MAF of 0.20, a SNP effect of -18.18, and a PVE of 6.66%. Eight SNPs on chromosomes 1, 2, 7, and 8 were significantly associated with Lar3. The most significant SNP 8_127153607 with the lowest P value of 2.64 × 10-10 was located on chromosome 8. It had a MAF of 0.22, a SNP effect of -22.36, and a PVE of 8.08%.

The GWAS results of FarmCPU are shown in S2 Table and S1 Fig. The Q-Q plots for LL1, LL2, LL3, Lar1, and Lar2 had a straight line with a slightly downward deviated tail, indicating that the FarmCPU model did not effectively control the false negatives. The FarmCPU was too conservative and was not able to identify any associated SNPs for LL1, LL2, LL3, Lar1, and Lar2. A total of 13, 6, 6, and 13 SNPs were significantly associated with LW1, LW2, LW3, and Lar3, respectively. The average PVE of these 38 associations was 2.75%, with a range from 0.05% to 7.99%. Six SNPs 1_71164006, 1_243650459, 3_150675344, 4_30716310, 5_178448074, and 7_164523818 had a PVE greater than 5%.

The overlapping SNPs identified by different models are presented in Table 3. A total of nine SNPs were detected by both BLINK and FarmCPU models, including five SNPs 1_71164006, 2_7546743, 6_38092952, 9_96557394, and 10_827617 for LW1, one SNP 7_172900884 for LW3, and three SNPs 1_57846004, 2_4813184, and 8_122873056 for Lar3. The allele effects of SNP 9_96557394 for LW1 was the most significant, with the smallest p-value of 1.80 × 10-7 (Fig 4).

thumbnail
Table 3. The overlapping SNPs identified by different models for leaf-related traits.

https://doi.org/10.1371/journal.pone.0323140.t003

thumbnail
Fig 4. Allele effects of the overlapping SNPs identified by different models.

(A) Allele effect of SNP 1_71164006 for LW1; (B) Allele effect of SNP 2_7546743 for LW1; (C) Allele effect of SNP 6_38092952 for LW1; (D) Allele effect of SNP 9_96557394 for LW1; (E) Allele effect of SNP 10_827617 for LW1; (F) Allele effect of SNP 7_172900884 for LW3; (G) Allele effect of SNP 1_57846004 for Lar3; (H) Allele effect of SNP 2_4813184 for Lar3; (I) Allele effect of SNP 8_122873056 for Lar3. * represents p < 0.05, ** represents p < 0.01, *** represents p < 0.001, and **** represents p < 0.0001.

https://doi.org/10.1371/journal.pone.0323140.g004

Fourteen SNPs with pleiotropic effects were detected on all ten maize chromosomes except for chromosomes 3 and 6 (Table 4). The SNP 9_96557394 were found to be co-localized with LL1, LL2, LL3, Lar2, and Lar3. Two SNPs 1_3881916 and 8_122873056 were associated with four traits. SNP 1_3881916 was co-detected for LW1, LW2, LW3, and Lar2. SNP 8_122873056 was co-detected for LW1, Lar1, Lar2, and Lar3. Four SNPs 1_114123179, 1_5522631, 2_7546743, and 4_33127506 were identified for three traits. Seven SNPs distributed on chromosomes 1, 2, 4, 5, 6, 7, and 10 were identified by two different traits.

thumbnail
Table 4. The pleiotropic SNPs identified for leaf-related traits.

https://doi.org/10.1371/journal.pone.0323140.t004

3.4. Candidate gene analysis of leaf-related traits

Based on the significantly associated SNPs identified by BLINK, 50 candidate genes were annotated (Table 2). Eleven candidate genes were related to leaf length, and the function of nine genes has been annotated. Nineteen candidate genes were related to leaf width, and the function of 15 genes has been annotated. Twenty candidate genes were related to leaf area, and the function of 16 genes has been annotated.

Based on the significantly associated SNPs identified by FarmCPU, 34 candidate genes were identified. Twenty-three candidate genes including 16 genes with known functions were annotated for leaf width. Eleven candidate genes including eight genes with known functions were annotated for leaf area.

Zm00001d002034 associated with the overlapping SNP 2_4813184 and Zm00001d011174 associated with the pleiotropic SNP 8_141809639 were validated using qRT-PCR (Fig 5). These two genes were differentially expressed in the genotype 20NP15299 with short narrow leaf and the genotype 20NP15396 with long wide leaf. Zm00001d002034 exhibited higher relative expression levels in genotype 20NP15396 than in genotype 20NP15299 for the uppermost ear leaf and the first leaf below the uppermost ear, indicating that Zm00001d002034 may have a positive regulatory effect on leaf growth and development. The opposite expression pattern was observed for Zm00001d011174. Zm00001d011174 exhibited significantly lower relative expression levels in genotype 20NP15396 than in genotype 20NP15299 for all three leaves, indicating that Zm00001d011174 may have a negative regulatory effect on leaf growth and development. These findings provide valuable insights into the differential expression dynamics of the candidate genes across three key leaves and genotypes, contributing to a better understanding of their roles in maize development and physiology.

thumbnail
Fig 5. Relative expression levels of candidate genes.

(A–C) Relative expression levels of Zm00001d002034; (D and E) Relative expression levels of Zm00001d011174.

https://doi.org/10.1371/journal.pone.0323140.g005

3.5. The impact of different factors on prediction accuracy

The prediction accuracy estimated by the five-fold cross-validation method with all the markers ranged from 0.30 to 0.43. The prediction accuracies of LL1, LL2, LL3, LW1, LW2, LW3, Lar1, Lar2, and Lar3 were 0.38, 0.31, 0.33, 0.43, 0.42, 0.42, 0.33, 0.32, and 0.30, respectively. Lar3 showed the lowest prediction accuracy among all traits, and LW1 showed the highest prediction accuracy.

Fig 6A illustrates the effect of marker density on the prediction accuracy. All the traits exhibited the similar trend, that the prediction accuracy increased as marker density increased. When the number of markers increased from 10 to 3,000, there was a significant improvement in the prediction accuracy, which then reached a plateau. When 3,000 SNPs used for GS, the mean prediction accuracies of LL1, LL2, LL3, LW1, LW2, LW3, Lar1, Lar2, and Lar3 were 0.36, 0.29, 0.30, 0.42, 0.41, 0.41, 0.31, 0.31, and 0.30, respectively.

thumbnail
Fig 6. Genomic prediction accuracy for leaf-related traits when the number of SNPs ranged from 10 to 10,000, the training population size ranged from 10% to 90% of the total population size, and the number of significantly associated SNPs varied from 1 to 500.

(A) GS with different marker density; (B) GS with different training population sizes; (C) GS with different number of significantly associated SNPs identified by BLINK model; (D) GS with different number of significantly associated SNPs identified by FarmCPU model.

https://doi.org/10.1371/journal.pone.0323140.g006

The effect of population size on prediction accuracy is illustrated in Fig 6B. All the traits exhibited the same trend, that was the prediction accuracy increased as the training population size increased. There was a substantial increase in prediction accuracy as the training population size increased from 10% to 60% of the total genotypes, after which it leveled off. When 60% of the total genotypes were used as the training polpulation, the mean prediction accuracies of LL1, LL2, LL3, LW1, LW2, LW3, Lar1, Lar2, and Lar3 were 0.35, 0.27, 0.28, 0.41, 0.40, 0.39, 0.30, 0.28, and 0.28.

Fig 6C and 6D shows the impact of significantly associated markers on prediction accuracy. All traits showed a consistent trend, that is the prediction accuracy increased as the number of significantly associated markers increased. When the number of significantly associated markers increased from 1 to 300, the prediction accuracy increased rapidly and then reached a plateau. Compared to using all the markers for GS, the mean prediction accuracy was improved from 0.38 to 0.68 for LL1, from 0.31 to 0.63 for LL2, from 0.33 to 0.67 for LL3, from 0.43 to 0.71 for LW1, from 0.42 to 0.65 for LW2, from 0.42 to 0.68 for LW3, from 0.33 to 0.78 for Lar1, from 0.32 to 0.74 for Lar2, and from 0.30 to 0.73 for Lar3 when using the top 300 significant markers with the lowest p-value identified by BLINK. Compared to using all the markers for GS, the mean prediction accuracy was improved from 0.38 to 0.55 for LL1, from 0.31 to 0.55 for LL2, from 0.33 to 0.58 for LL3, from 0.43 to 0.74 for LW1, from 0.42 to 0.71 for LW2, from 0.42 to 0.72 for LW3, from 0.33 to 0.60 for Lar1, from 0.32 to 0.59 for Lar2, and from 0.30 to 0.66 for Lar3 when using the top 300 significant markers with the lowest p-value identified by FarmCPU. The overall prediction accuracy was improved by at least 0.16. Except for the three leaf width traits, GS using significant SNPs identified by BLINK showed higher prediction accuracy than the FarmCPU for leaf-related traits.

4. Discussion

4.1. Genetic architecture of leaf-associated traits

Leaf morphology plays a crucial role in the architecture of maize plants, influencing their photosynthetic performance and density tolerance [4548]. The development and breeding of maize varieties with narrower leaf types can enhance planting density and ultimately increase yield. Therefore, to gain a deeper understanding of the genetic basis of leaves, we conducted GWAS and GS for the length, width, and area of the first leaf above the uppermost ear, the uppermost ear leaf, and the first leaf below the uppermost ear of maize. Our study showed that leaf length, leaf width, and leaf area were significantly affected by environments, genotypes, and genotype by environment interaction. Heritabilities ranged from 0.73 to 0.80, indicating that genetic variation is the main cause of phenotypic variation.

A total of 63 unique SNPs were identified to be significantly associated with leaf-related traits and 14 SNPs showed pleiotropic effects. The PVE of each SNP ranged from 0.05% to 24.46%. The results were consistent with previous reports [6,7,11,49,50]. Tian et al. [49] identified 36 QTL explaining 77.7% of the phenotypic variation for leaf length and 34 QTL explaining 80.3% of the phenotypic variation for leaf width by joint linkage mapping in a nested association mapping (NAM) population. Low levels of pleiotropy between leaf length and leaf width was obtained. In a large population consisting of 866 maize-teosinte BC2S3 recombinant inbred lines, 17 QTL were detected for leaf length and 14 QTL were detected for leaf width [50]. The PVE of each locus ranged from 1.2% to 12.0% for leaf length and from 1.2% to 9.4% for leaf width. Only one QTL was detected for both leaf length and leaf width. In conclusion, leaf-related traits are controlled by multiple QTL with little pleiotropy.

4.2. Significant SNPs for leaf-related traits

In this study, BLINK and FarmCPU were used for GWAS to dissect the genetic basis of leaf-related traits in maize. According to the Q-Q plots, the FarmCPU did not effectively control the false negatives for LL1, LL2, LL3, Lar1, and Lar2, resulting in missed detections of some truly associated SNPs. BLINK was the best-fitted model by effectively controlling the false positives and false negatives for all the leaf-related traits.

A total of 19 SNPs distributed on chromosomes 1, 2, 5, 6, 8, 9, and 10 were associated with leaf length at a P-value of 7.42 × 10−6. Some loci have been reported in previous studies [11,45,49]. The SNP 10_89498418 located on chromosome 10 was mapped in the same region as the leaf length QTL Chr10:87-96.1Mb detected in a K22 × BY815 RIL population [45] and the leaf length QTLm1106 (Chr10:89.3–101.9Mb) detected in a NAM population [49]. The SNP 2_1167190 was 323,087 bp away from the SNP PUT-163a-74244798–3692, which was significantly associated with leaf area [11]. The overlapping SNP 6_156292657 was 366,932 bp away from the SNP PZE06152684254 related to leaf length [49]. The pleiotropic SNP 10_89498418 associated with LL1 and LL3 was 998,574 bp away from the SNP PZE1088191117 associated with leaf length. The pleiotropic SNP 8_141809639 identified for LL1, LL2, and LL3 was 456,457 bp away from the SNP PZE08136896381 related to leaf length [49].

A total of 49 SNPs distributed over all 10 chromosomes were associated with leaf width. SNPs 1_3881916 and 10_12117124 were located in the same region of the leaf width related QTL on Chromosome 1 (3–6.7Mb) and Chromosome 10 (121.6–126.6Mb), respectively [45]. SNP 1_243735414 was coincident with QTL qLW1b (Chr1:240.69–245.71Mb) detected in a RIL population by Wang et al. [51]. The SNP 8_5225233 was 53,613 bp away from the SNP PZE0804976376 associated with leaf width [49]. The pleiotropic SNP 2_3876362 associated with LW2 and LW3 was 177,955 bp away from the SNP PZE-102007336 related to leaf length [11]. The pleiotropic SNP 1_3881916 associated with LW1, LW2, and LW3 was 450,792 bp away from the SNP PZE0104222883 related to leaf width [49].

A total of 37 SNPs distributed on all 10 chromosomes except for chromosome 3 were associated with leaf area. SNPs 1_5522631, 1_92219160, and 8_127153607 were located in the same region of the leaf area related QTL qLAE1–1 (Chr1:5.30–6.90MB), qLAE1–2 (Chr1:88.60–92.38MB), and qLAE8–2 (Chr8:125.33–131.38MB), respectively [11]. SNP 5_178448074 was mapped in the same region of QTL qLAr-5–2 (Chr5:178.400–179.725MB) [8]. The SNP 8_168008436 was 483,730 bp away from the SNP SYN29288 related to leaf width [11]. These common loci detected in different studies are stable QTL for leaf length and deserve further investigation.

A total of 14 SNPs distributed on all ten maize chromosomes except for chromosomes 3 and 6 were associated with at least two leaf-related traits, which were regarded as pleiotropic SNPs. The SNP 9_96557394 showed a pleiotropic effect on five traits LL1, LL2, LL3, Lar2, and Lar3. SNP 1_3881916 and SNP 8_122873056 were associated with four leaf-related traits. These results suggested that these SNPs might be crucial for leaf growth and development. Therefore, our study provides abundant loci for accumulating favorable alleles that affect leaf growth in maize breeding programs.

4.3. Candidate genes for leaf-related traits

The candidate gene analysis can be performed for a better understanding of the genetic architecture of leaf-related traits. Based on the GWAS results, 57 unique candidate genes were identified for leaf-related traits, and 44 were annotated with known functions. Some functional genes that may control leaf growth and development were analyzed. SNP 10_139299756 associated with LW1 is located within the gene model of Zm00001d026130, encoding the BZIP transcription factor, which is an important regulatory factor for plant development and abiotic resistance. In Arabidopsis, transcription factor bZIP3 affects leaf shape [52]. Overexprssion of bZIP3 leads to aberrant shaped cotyledons with hyponastic bending. SNP 2_12493180 associated with Lar2 is located within the gene model Zm00001d002429, encoding the growth-regulating factor 6-like. The growth-regulating factors (GRFs) are plant-specific transcription factors that play an important role in regulating the growth of plant roots, leaves, and floral organ [53]. Overexprssion of AtGRFs in Arabidopsis leads to leaf enlargement, which is consistent in rice and sugarcane [54]. Overexpression of SsGRF7 promotes the growth and expansion of leaves in sugarcane, leading to larger leaf length. SNP 1_240601681 associated with Lar3 is located within the gene model of Zm00001d032866, encoding the UDP-glycosyltransferase 86A2. UDP-glycosyltransferases (UGTs) play important roles in plant growth and development by participating in many metabolic processes [55]. In cotton, UGTs has been shown to affect leaf senescence [56]. SNP 9_104957251 associated with LW1 is located within the gene model of Zm00001d046767, encoding a Zinc finger A20 and AN1 domain-containing stress-associated protein 8. It plays a key role in regulating plant growth, development, and abiotic stress [5760].

The pleiotropic SNP 7_172900884 associated with LW1 and LW3 was identified by both BLINK and FarmCPU models, and located in LD with Zm00001D022209. Zm00001D022209 encodes a developmentally-regulated GTP-binding protein 1, which plays a key role in plant life and leaf senescence by regulating ribosomal biology [61]. GTP-binding proteins are required for leaf development and the establishment of leaf polarity [62]. The co-localized SNP 10_827617 for LW1 is located in the gene model of Zm00001d023225. It encodes a protein trichome birefringence-like 34, which involved in cell wall modification and leaf development by mediating xylan acetylation [63]. Xylan acetylation affects the structural integrity of the cell wall. In rice, two trichome birefringence-like proteins are essential for leaf blight resistance [64]. SNP 2_3876559 associated with Lar1 and the pleiotropic SNP 2_3876362 accociated with LW2 and LW3 are located in the gene model of Zm00001d001980. It encodes a protein kinase-like, which involved in the regulation of leaf senescence [65]. The information of candidate genes showed in this study can provide a reference for the cloning of maize leaf-related genes.

4.4. Genomic selection for leaf-related traits

GS is a selection method using high-density markers, which can speed up the breeding process and has been widely used in animals and plants [66,67]. Prediction accuracy is influenced by various factors, such as heritability, marker density, marker quality, training population size, the relationship between the training population and the prediction population, prediction models, genotyping platforms, and significant markers [68,69]. The factors affecting the estimation accuracy will affect the prediction ability of GS. The prediction accuracy increases when the relationship between the training population and the prediction population is close [70]. Incorporating significantly associated markers for target trait as fixed effects in GS model has the potential to improve prediction accuracy [29,71,72]. The prediction accuracy also increases as the value of heritability, training population size, and marker density increase [73].

In this study, the heritability of leaf-related traits was relatively high, but the prediction accuracy estimated by the five-fold cross-validation method with all the markers was low. The contradiction between high heritability and low prediction accuracy may be caused by the model used for GS. In the rrBLUP model, it is assumed that the marker effects are normally distributed with the same variance [16,74]. The rrBLUP model is suited for traits controlled by multiple small effect QTL [23,67,75]. BayesA, BayesB, BayesC, and Bayesian LASSO assign different prior distributions to marker effects [23]. GS for leaf-related traits using BayesA, BayesB, BayesC, and Bayesian LASSO models deserves further research.

The effect of marker density, training population size, and significantly associated markers on prediction accuracy of leaf-related traits was studied. The results indicated that the prediction accuracy of leaf-related traits increased when the marker density and population size increased. When the marker density reached to 3000, the prediction accuracy reached a plateau, which is consistent with previous studies. Cao et al. [70] reported that the prediction accuracy of resistant to tar spot complex reached a plateau, with a marker density of 3000 in the GWAS panel and 500 in a bi-parental DH population. The number of markers required for a moderate prediction accuracy may depend on the LD decay distance of the population and the population type [31]. When the training population size reached to 60% of the total population, the prediction accuracy of leaf-related traits reached a plateau, which is consistent with previous studies [27,29]. A relatively high prediction accuracy with small standard error was obtained when the training population was 50% to 60% of the total genotypes. To evaluate the impact of significant markers on prediction accuracy, nine marker densities were used for GS. The top 300 markers with the lowest P-values were enough to obtain a high prediction accuracy for leaf-related traits. The prediction accuracy estimated by significant markers was higher than that estimated by the same number of randomly selected markers. However, the P-values based on the GWAS of the entire population may inadvertently inflate prediction accuracy, as the test set’s phenotypic data are utilized in the GWAS. A more stringent approach involves utilizing GWAS results derived from the training set of each cross-validation iteration can mitigate potential biases. In conclusion, our results have shown the potential of GS for leaf-related traits and provided a better understanding of how marker density, population size, and significant markers affect the prediction accuracy.

The BLUP values evaluated by treating lines as random effects are shrunk back toward the population mean compared to the best linear unbiased estimators (BLUEs) [76]. Since BLUP values reduce the effects of individual genetic variants, BLUEs are more recommended for GS analysis. Therefore, GS was also conducted with BLUEs and the results are shown in S3 Table. The GS prediction accuracy estimated by BLUEs showed the same trend as that estimated by BLUP values. As the marker density and training population size increased, the prediction accuracy increased. When the marker density reached to 3000, or the training population size reached to 60% of the total population, the prediction accuracy reached a plateau. The prediction accuracy was significantly improved by using the top 300 significant markers identified by GWAS.

5. Conclusion

In this study, two GWAS models, BLINK and FarmCPU, were used to investigate the genetic basis of leaf-related traits in a multi-parents DH population. According to the Q-Q plots, BLINK performs better than FarmCPU in terms of false negative controlling. A total of 19, 49, and 37 SNPs were significantly associated with leaf length, leaf width, and leaf area, respectively. Fourteen pleiotropy SNPs were identified, which is consistent with the significantly positive correlations between leaf-related traits. Two candidate genes, Zm00001d002034 and Zm00001d011174 were validated using qRT-PCR. The GS prediction accuracy estimated by the five-fold cross-validation method with all the markers was relatively low, ranging from 0.30 to 0.43. The prediction accuracy of GS can be improved by using the GWAS-related markers, indicating that the GWAS information helps to increase the prediction accuracy. This study gives a better understanding of GS for leaf-related traits and promotes the application of GS for improving plant architecture in maize breeding programs.

Supporting information

S1 Fig. Manhattan and QQ plots of leaf-related traits in maize using FarmCPU.

The dashed lines represent the threshold at P = 7.42 × 10 − 6. A, B, and C represent LL1, LL2, and LL3, respectively. D, E, and F represent LW1, LW2, and LW3, respectively. H, I, and G represent Lar1, Lar2, and Lar3.

https://doi.org/10.1371/journal.pone.0323140.s001

(TIFF)

S1 Table. The primer sequences for qRT-PCR.

https://doi.org/10.1371/journal.pone.0323140.s002

(XLSX)

S2 Table. Significant SNPs and candidate genes of leaf-related traits identified by FarmCPU.

https://doi.org/10.1371/journal.pone.0323140.s003

(XLSX)

S1 Raw Data. Multi-environment phenotypic data and genotypic data.

https://doi.org/10.1371/journal.pone.0323140.s005

(ZIP)

Acknowledgments

The authors thank the research team for their help and support in experimental design, data collection, and data analysis.

References

  1. 1. Duvick DN. The contribution of breeding to yield advances in maize (Zea mays L.). Adv Agronomy. 2005:83–145.
  2. 2. Donald CM. The breeding of crop ideotypes. Euphytica. 1968;17(3):385–403.
  3. 3. Mock JJ, Pearce RB. An ideotype of maize. Euphytica. 1975;24(3):613–23.
  4. 4. Li H, Ribaut J-M, Li Z, Wang J. Inclusive composite interval mapping (ICIM) for digenic epistasis of quantitative traits in biparental populations. Theor Appl Genet. 2008;116(2):243–60. pmid:17985112
  5. 5. Stewart DW, Costa C, Dwyer LM, Smith DL, Hamilton RI, Ma BL. Canopy Structure, Light Interception, and Photosynthesis in Maize. Agronomy Journal. 2003;95(6):1465–74.
  6. 6. Yang C, Tang D, Qu J, Zhang L, Zhang L, Chen Z, et al. Genetic mapping of QTL for the sizes of eight consecutive leaves below the tassel in maize (Zea mays L.). Theor Appl Genet. 2016;129(11):2191–209. pmid:27550554
  7. 7. Wang H, Liang Q, Li K, Hu X, Wu Y, Wang H, et al. QTL analysis of ear leaf traits in maize (Zea mays L.) under different planting densities. The Crop Journal. 2017;5(5):387–95.
  8. 8. Ma L, Guan Z, Zhang Z, Zhang X, Zhang Y, Zou C, et al. Identification of quantitative trait loci for leaf‐related traits in an IBM Syn10 DH maize population across three environments. Plant Breeding. 2018;137(2):127–38.
  9. 9. Guo S, Ku L, Qi J, Tian Z, Han T, Zhang L, et al. Genetic analysis and major quantitative trait locus mapping of leaf widths at different positions in multiple populations. PLoS One. 2015;10(3):e0119095. pmid:25756495
  10. 10. Guo S, Zhu M, Du J, Wang J, Lu X, Jin Y, et al. Accurate Phenotypic Identification and Genetic Analysis of the Ear Leaf Veins in Maize (Zea mays L.). Agronomy. 2023;13(3):753.
  11. 11. Dai W, Yu H, Liu K, Chengxu Y, Yan J, Zhang C, et al. Combined linkage mapping and association analysis uncovers candidate genes for 25 leaf-related traits across three environments in maize. Theor Appl Genet. 2023;136(1):12. pmid:36662253
  12. 12. Gao L, Yang G, Li Y, Sun Y, Xu R, Chen Y, et al. A kelch-repeat superfamily gene, ZmNL4, controls leaf width in maize (Zea mays L.). Plant J. 2021;107(3):817–30. pmid:34009654
  13. 13. Grose JH, Joss L, Velick SF, Roth JR. Evidence that feedback inhibition of NAD kinase controls responses to oxidative stress. Proc Natl Acad Sci U S A. 2006;103(20):7601–6. pmid:16682646
  14. 14. Chai M-F, Chen Q-J, An R, Chen Y-M, Chen J, Wang X-C. NADK2, an Arabidopsis chloroplastic NAD kinase, plays a vital role in both chlorophyll synthesis and chloroplast protection. Plant Mol Biol. 2005;59(4):553–64. pmid:16244906
  15. 15. Billah SA, Khan NZ, Ali W, Aasim M, Usman M, Alezzawi MA, et al. Genome-wide in silico identification and characterization of the stress associated protein (SAP) gene family encoding A20/AN1 zinc-finger proteins in potato (Solanum tuberosum L.). PLoS One. 2022;17(8):e0273416. pmid:35998181
  16. 16. Meuwissen TH, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001;157(4):1819–29. pmid:11290733
  17. 17. Crossa J, Pérez-Rodríguez P, Cuevas J, Montesinos-López O, Jarquín D, de Los Campos G, et al. Genomic selection in plant breeding: methods, models, and perspectives. Trends Plant Sci. 2017;22(11):961–75. pmid:28965742
  18. 18. Kadam DC, Potts SM, Bohn MO, Lipka AE, Lorenz AJ. Genomic prediction of single crosses in the early stages of a maize hybrid breeding pipeline. G3 (Bethesda). 2016;6(11):3443–53. pmid:27646704
  19. 19. Dang D, Guan Y, Zheng H, Zhang X, Zhang A, Wang H, et al. Genome-wide association study and genomic prediction on plant architecture traits in sweet corn and waxy corn. Plants (Basel). 2023;12(2):303. pmid:36679015
  20. 20. Liu X, Hu X, Li K, Liu Z, Wu Y, Wang H, et al. Genetic mapping and genomic selection for maize stalk strength. BMC Plant Biol. 2020;20(1):196. pmid:32380944
  21. 21. Sharma S, Pinson SRM, Gealy DR, Edwards JD. Genomic prediction and QTL mapping of root system architecture and above-ground agronomic traits in rice (Oryza sativa L.) with a multitrait index and Bayesian networks. G3 (Bethesda). 2021;11(10):jkab178. pmid:34568907
  22. 22. Cui Z, Dong H, Zhang A, Ruan Y, He Y, Zhang Z. Assessment of the Potential for Genomic Selection To Improve Husk Traits in Maize. G3 (Bethesda). 2020;10(10):3741–9. pmid:32816916
  23. 23. Liu Y, Ao M, Lu M, Zheng S, Zhu F, Ruan Y, et al. Genomic selection to improve husk tightness based on genomic molecular markers in maize. Front Plant Sci. 2023;14:1252298. pmid:37828926
  24. 24. Gesteiro N, Ordás B, Butrón A, de la Fuente M, Jiménez-Galindo JC, Samayoa LF, et al. Genomic versus phenotypic selection to improve corn borer resistance and grain yield in maize. Front Plant Sci. 2023;14:1162440. pmid:37484478
  25. 25. Shikha M, Kanika A, Rao AR, Mallikarjuna MG, Gupta HS, Nepolean T. Genomic selection for drought tolerance using genome-wide SNPs in maize. Front Plant Sci. 2017;8:550. pmid:28484471
  26. 26. Pace J, Yu X, Lübberstedt T. Genomic prediction of seedling root length in maize (Zea mays L.). Plant J. 2015;83(5):903–12. pmid:26189993
  27. 27. Ren J, Li Z, Wu P, Zhang A, Liu Y, Hu G, et al. Genetic dissection of quantitative resistance to common rust (Puccinia sorghi) in tropical maize (Zea mays L.) by combined genome-wide association study, linkage mapping, and genomic prediction. Front Plant Sci. 2021;12:692205. pmid:34276741
  28. 28. Wang X, Xu Y, Hu Z, Xu C. Genomic selection methods for crop improvement: Current status and prospects. The Crop Journal. 2018;6(4):330–40.
  29. 29. Zhang A, Wang H, Beyene Y, Semagn K, Liu Y, Cao S, et al. Effect of Trait Heritability, Training Population Size and Marker Density on Genomic Prediction Accuracy Estimation in 22 bi-parental Tropical Maize Populations. Front Plant Sci. 2017;8:1916. pmid:29167677
  30. 30. Zhou G, Zhu Q, Mao Y, Chen G, Xue L, Lu H, et al. Multi-Locus Genome-Wide Association Study and Genomic Selection of Kernel Moisture Content at the Harvest Stage in Maize. Front Plant Sci. 2021;12:697688. pmid:34305987
  31. 31. Guo R, Dhliwayo T, Mageto EK, Palacios-Rojas N, Lee M, Yu D, et al. Genomic Prediction of Kernel Zinc Concentration in Multiple Maize Populations Using Genotyping-by-Sequencing and Repeat Amplification Sequencing Markers. Front Plant Sci. 2020;11:534. pmid:32457778
  32. 32. Bates D, Mächler M, Bolker B, Walker S. Fitting Linear Mixed-Effects Models Usinglme4. J Stat Soft. 2015;67(1).
  33. 33. Knapp SJ, Stroup WW, Ross WM. Exact Confidence Intervals for Heritability on a Progeny Mean Basis1. Crop Science. 1985;25(1):192–4.
  34. 34. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. pmid:24695404
  35. 35. Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26(5):589–95. pmid:20080505
  36. 36. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–303. pmid:20644199
  37. 37. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–8. pmid:21653522
  38. 38. Hubisz MJ, Falush D, Stephens M, Pritchard JK. Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour. 2009;9(5):1322–32. pmid:21564903
  39. 39. Earl DA, vonHoldt BM. Structure harvester: a website and program for visualizing structure output and implementing the Evanno method. Conservation Genet Resour. 2011;4(2):359–61.
  40. 40. Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23(14):1801–6. pmid:17485429
  41. 41. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23(19):2633–5. pmid:17586829
  42. 42. Wang J, Zhang Z. GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction. Genomics Proteomics Bioinformatics. 2021;19(4):629–40. pmid:34492338
  43. 43. Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001;25(4):402–8. pmid:11846609
  44. 44. Endelman JB. Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP. The Plant Genome. 2011;4(3):250–5.
  45. 45. Pan Q, Xu Y, Li K, Peng Y, Zhan W, Li W, et al. The Genetic Basis of Plant Architecture in 10 Maize Recombinant Inbred Line Populations. Plant Physiol. 2017;175(2):858–73. pmid:28838954
  46. 46. Kusmec A, Srinivasan S, Nettleton D, Schnable PS. Distinct genetic architectures for phenotype means and plasticities in Zea mays. Nat Plants. 2017;3(9):715–23. pmid:29150689
  47. 47. Jalilian J, Delkhoshi H. How Much, Leaves Near the Ear Contribute on Yield and Yield Components of Maize?. Cercetari Agronomice in Moldova. 2014;47(2):5–12.
  48. 48. Li Y, Ma X, Wang T, Li Y, Liu C, Liu Z, et al. Increasing Maize Productivity in China by Planting Hybrids with Germplasm that Responds Favorably to Higher Planting Densities. Crop Science. 2011;51(6):2391–400.
  49. 49. Tian F, Bradbury PJ, Brown PJ, Hung H, Sun Q, Flint-Garcia S, et al. Genome-wide association study of leaf architecture in the maize nested association mapping population. Nat Genet. 2011;43(2):159–62. pmid:21217756
  50. 50. Fu Y, Xu G, Chen H, Wang X, Chen Q, Huang C, et al. QTL mapping for leaf morphology traits in a large maize-teosinte population. Mol Breeding. 2019;39(7).
  51. 51. Wang B, Zhu Y, Zhu J, Liu Z, Liu H, Dong X, et al. Identification and Fine-Mapping of a Major Maize Leaf Width QTL in a Re-sequenced Large Recombinant Inbred Lines Population. Front Plant Sci. 2018;9:101. pmid:29487604
  52. 52. Sanagi M, Lu Y, Aoyama S, Morita Y, Mitsuda N, Ikeda M, et al. Sugar-responsive transcription factor bZIP3 affects leaf shape in Arabidopsis plants. Plant Biotechnol (Tokyo). 2018;35(2):167–70. pmid:31819719
  53. 53. Wang W, Cheng M, Wei X, Wang R, Fan F, Wang Z, et al. Comprehensive evolutionary analysis of growth-regulating factor gene family revealing the potential molecular basis under multiple hormonal stress in Gramineae crops. Front Plant Sci. 2023;14:1174955. pmid:37063175
  54. 54. Wang Q, Li Y, Lin D, Feng X, Wang Y, Wang T, et al. A growth-regulating factor 7 (GRF7)-mediated gene regulatory network promotes leaf growth and expansion in sugarcane. The Crop Journal. 2024;12(2):422–31.
  55. 55. Li Y, Li P, Wang Y, Dong R, Yu H, Hou B. Genome-wide identification and phylogenetic analysis of Family-1 UDP glycosyltransferases in maize (Zea mays). Planta. 2014;239(6):1265–79. pmid:24647682
  56. 56. Chen Y, Fu M, Li H, Wang L, Liu R, Liu Z. Genome-wide characterization of the UDP-glycosyltransferase gene family reveals their potential roles in leaf senescence in cotton. Int J Biol Macromol. 2022;222(Pt B):2648–60. pmid:36228815
  57. 57. Lai W, Zhou Y, Pan R, Liao L, He J, Liu H, et al. Identification and Expression Analysis of Stress-Associated Proteins (SAPs) Containing A20/AN1 Zinc Finger in Cucumber. Plants (Basel). 2020;9(3):400. pmid:32213813
  58. 58. Gao W, Long L, Tian X, Jin J, Liu H, Zhang H, et al. Genome-wide identification and expression analysis of stress-associated proteins (SAPs) containing A20/AN1 zinc finger in cotton. Mol Genet Genomics. 2016;291(6):2199–213. pmid:27681253
  59. 59. Fatima S, Zafar Z, Gul A, Bhatti MF. Genome-Wide Identification of Stress-Associated Proteins (SAPs) Encoding A20/AN1 Zinc Finger in Almond (Prunus dulcis) and Their Differential Expression during Fruit Development. Plants (Basel). 2021;11(1):117. pmid:35009120
  60. 60. Fu Q, Duan H, Cao Y, Li Y, Lin X, Pang H, et al. Comprehensive Identification and Functional Analysis of Stress-Associated Protein (SAP) Genes in Osmotic Stress in Maize. Int J Mol Sci. 2022;23(22):14010. pmid:36430489
  61. 61. Jeon Y, Park Y-J, Cho HK, Jung HJ, Ahn T-K, Kang H, et al. The nucleolar GTPase nucleostemin-like 1 plays a role in plant growth and senescence by modulating ribosome biogenesis. J Exp Bot. 2015;66(20):6297–310. pmid:26163696
  62. 62. Wang X, Xie B, Zhu M, Zhang Z, Hong Z. Nucleostemin-like 1 is required for embryogenesis and leaf development in Arabidopsis. Plant Mol Biol. 2012;78(1–2):31–44. pmid:22058024
  63. 63. Zhu X, Ma X, Hu W, Xing Y, Huang S, Chen Z, et al. Genome-wide identification of TBL gene family and functional analysis of GhTBL84 under cold stress in cotton. Front Plant Sci. 2024;15:1431835. pmid:38957598
  64. 64. Gao Y, He C, Zhang D, Liu X, Xu Z, Tian Y, et al. Two Trichome Birefringence-Like Proteins Mediate Xylan Acetylation, Which Is Essential for Leaf Blight Resistance in Rice. Plant Physiol. 2017;173(1):470–81. pmid:27864442
  65. 65. Yang F, Miao Y, Liu Y, Botella JR, Li W, Li K, et al. Function of Protein Kinases in Leaf Senescence of Plants. Front Plant Sci. 2022;13:864215. pmid:35548290
  66. 66. Xu S, Zhu D, Zhang Q. Predicting hybrid performance in rice using genomic best linear unbiased prediction. Proc Natl Acad Sci U S A. 2014;111(34):12456–61. pmid:25114224
  67. 67. Daetwyler HD, Bansal UK, Bariana HS, Hayden MJ, Hayes BJ. Genomic prediction for rust resistance in diverse wheat landraces. Theor Appl Genet. 2014;127(8):1795–803. pmid:24965887
  68. 68. Combs E, Bernardo R. Accuracy of Genomewide Selection for Different Traits with Constant Population Size, Heritability, and Number of Markers. The Plant Genome. 2013;6(1).
  69. 69. Zhao Y, Gowda M, Liu W, Würschum T, Maurer HP, Longin FH, et al. Accuracy of genomic selection in European maize elite breeding populations. Theor Appl Genet. 2012;124(4):769–76. pmid:22075809
  70. 70. Cao S, Song J, Yuan Y, Zhang A, Ren J, Liu Y, et al. Genomic Prediction of Resistance to Tar Spot Complex of Maize in Multiple Populations Using Genotyping-by-Sequencing SNPs. Front Plant Sci. 2021;12:672525. pmid:34335648
  71. 71. Crossa J, Pérez P, Hickey J, Burgueño J, Ornella L, Cerón-Rojas J, et al. Genomic prediction in CIMMYT maize and wheat breeding programs. Heredity (Edinb). 2014;112(1):48–60. pmid:23572121
  72. 72. Zhang X, Pérez-Rodríguez P, Semagn K, Beyene Y, Babu R, López-Cruz MA, et al. Genomic prediction in biparental tropical maize populations in water-stressed and well-watered environments using low-density and GBS SNPs. Heredity (Edinb). 2015;114(3):291–9. pmid:25407079
  73. 73. Liu X, Wang H, Wang H, Guo Z, Xu X, Liu J, et al. Factors affecting genomic selection revealed by empirical evidence in maize. The Crop Journal. 2018;6(4):341–52.
  74. 74. Ma W, Qiu Z, Song J, Li J, Cheng Q, Zhai J, et al. A deep convolutional neural network approach for predicting phenotypes from genotypes. Planta. 2018;248(5):1307–18. pmid:30101399
  75. 75. Kaler AS, Purcell LC, Beissinger T, Gillman JD. Genomic prediction models for traits differing in heritability for soybean, rice, and maize. BMC Plant Biol. 2022;22(1):87. pmid:35219296
  76. 76. Holland JB, Piepho H-P. Don’t BLUP Twice. G3 (Bethesda). 2024;14(12):jkae250. pmid:39558791