The primary maize (Zea mays L.) production areas are in temperate regions throughout the world and this is where most maize breeding is focused. Important but lower yielding maize growing regions such as the sub-tropics experience unique challenges, the greatest of which are drought stress and aflatoxin contamination. Here we used a diversity panel consisting of 346 maize inbred lines originating in temperate, sub-tropical and tropical areas testcrossed to stiff-stalk line Tx714 to investigate these traits. Testcross hybrids were evaluated under irrigated and non-irrigated trials for yield, plant height, ear height, days to anthesis, days to silking and other agronomic traits. Irrigated trials were also inoculated with Aspergillus flavus and evaluated for aflatoxin content. Diverse maize testcrosses out-yielded commercial checks in most trials, which indicated the potential for genetic diversity to improve sub-tropical breeding programs. To identify genomic regions associated with yield, aflatoxin resistance and other important agronomic traits, a genome wide association analysis was performed. Using 60,000 SNPs, this study found 10 quantitative trait variants for grain yield, plant and ear height, and flowering time after stringent multiple test corrections, and after fitting different models. Three of these variants explained 5–10% of the variation in grain yield under both water conditions. Multiple identified SNPs co-localized with previously reported QTL, which narrows the possible location of causal polymorphisms. Novel significant SNPs were also identified. This study demonstrated the potential to use genome wide association studies to identify major variants of quantitative and complex traits such as yield under drought that are still segregating between elite inbred lines.
Citation: Farfan IDB, De La Fuente GN, Murray SC, Isakeit T, Huang P-C, Warburton M, et al. (2015) Genome Wide Association Study for Drought, Aflatoxin Resistance, and Important Agronomic Traits of Maize Hybrids in the Sub-Tropics. PLoS ONE 10(2): e0117737. https://doi.org/10.1371/journal.pone.0117737
Academic Editor: Lewis Lukens, University of Guelph, CANADA
Received: July 28, 2013; Accepted: December 31, 2014; Published: February 25, 2015
Copyright: © 2015 Farfan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Funding: This work was supported by USDA-NIFA-AFRI. #2010-85117-20539: Improving Drought Tolerance and Aflatoxin Resistance in Maize; Education, Extension, and Translational Breeding via Altered Lipid Metabolism. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Maize (Zea mays L.) is one of the three most important crops of the world along with rice (Oryza sativa) and wheat (Triticum spp.). World production in 2011 was 883 million ton (http://faostat.fao.org [verified 6 May 2013]). Maize is the most important crop in the United States with a production of 301 million tons, and an estimated market value of 77.4 billion U.S. dollars in 2012 (National Agricultural Statistical Service [NASS] 2013). Globally, the most important and highest yielding production areas are in the temperate Midwestern U.S. and other temperate regions throughout the world, which is where the majority of investment in maize breeding is focused [1–3]. Important but lower yielding maize growing regions, such as the sub-tropics, experience unique challenges. Sub-tropical production zones are hotter and drier, and two of the greatest challenges for maize production in these zones are drought stress and aflatoxin contamination [4–12]. Sub-tropical maize production in the U.S. Southern states accounts for approximately 9% of U.S. production . The largest producer in the Southern states is Texas, where summers are hot and dry. There is a strong inter-annual precipitation variation across the state and severe drought episodes have occurred in the last ten years . These challenges may become increasingly common in temperate regions under a changing climate. Texas provides ideal environmental conditions to conduct research in drought tolerance and aflatoxin contamination that major maize production regions could experience in the future.
Improving drought tolerance is important because agriculture is the major user of surface and ground water in the U.S. It has been estimated that water usage in agriculture for the Western states accounts for up to 90% of the total water used in these states . The use restrictions and competition for water by growing urban areas will make drought stress even more common in irrigated agriculture. Drought episodes are likely to increase in the Midwestern U.S. because of a stronger inter-annual variation in precipitation and temperature attributable to a changing climate [16,17]. Drought tolerance is difficult to quantify and improve, as it is clearly quantitative and complex, and regulated by thousands of genes [18–20].The impact of drought stress depends on the severity, timing and length of the stress. Maize is most sensitive to drought stress during flowering and grain fill stages [18,21,22]. Drought stress in maize causes reduction in plant height, leaf rolling, early senescence, asynchronous flowering, kernel abortion and barren plants without ears [18,21,22].
Drought episodes and other stresses such as heat are often followed by pre-harvest aflatoxin contamination. Aflatoxin is a carcinogenic mycotoxin, produced by the soil borne fungus Aspergillus flavus, which thrives under hot and dry conditions in pre-harvest maize. Aflatoxin is federally regulated at 20 ng g−1 for human consumption, and is believed to cause over $200 million dollars of economic losses in the Southern U.S. each year [4,12,23,24]. Aflatoxin susceptibility in plants is a highly complex trait and no complete source of resistance is known for maize [7,25]. Adding to the complexity of this pathogen, both colonization and aflatoxin production appears to have strong host-by-pathogen interactions [26–29]. As a consequence, breeding for aflatoxin resistance is a complex challenge. Despite this complexity a number of breeding lines and germplasm with improved aflatoxin resistance have been released. The lines Mp313E, Mp715, and Mp717 [30,31] were derived from the tropical maize race Tuxpeño after several cycles of selection. The maize inbred lines Tx736, Tx739, Tx740, and Tx772 [7,25,32] were selected using the pedigree method from Argentinean and Bolivian lines. Other sources of resistance such as the line GT603 were selected from temperate elite hybrids produced in the 1970s . This diverse germplasm has already been used in linkage mapping studies to identify the quantitative trait loci (QTL) responsible for conferring resistance [11,23,34–39]. Relatively large QTL for aflatoxin resistance have been reported on chromosomes one, three, four, five, and nine [25,34,39,40] and there has often been little consistency between germplasm and environments. These findings give evidence that diverse germplasm has been the primary and best source for reducing ear rot diseases, and more importantly that decreased susceptibility is indeed heritable [25,30,31,41]. Diverse tropical maize germplasm has also demonstrated the potential to out-yield commercial hybrids in some environments. This germplasm also can exhibit improved drought tolerance and other unique traits unavailable in commercial temperate hybrids when testcrossed to elite temperate lines [42–48]. However, tropical germplasm also may have many undesirable traits, such as delayed flowering time/ photoperiod sensitivity and dry down, lower yield in some cases, poor stalks, and disruption of heterotic patterns which makes it challenging to use in temperate breeding programs [45,46,48].
Diverse germplasm must be further investigated to identify new sources of genetic variation for drought stress and aflatoxin contamination outside of the elite Midwestern germplasm such as the so called exPVP’s . One source of diverse germplasm previously characterized for the maize research community is the 282 Goodman maize association panel [43,49,50]. The 282 maize association panel includes several lines that originated from the dent maize commercial lines in the 1970s and 1980s , and multiple North Carolina and CIMMYT lines of tropical origin. Another diversity source includes the lines that have been bred in the Southern States by public programs, including those selected and released for aflatoxin resistance. The identification of favourable alleles in these sets of germplasm can allow targeted genetic improvement of current germplasm, and the incorporation of those alleles into elite material. In addition, diverse germplasm with differences in phenotype facilitates linkage mapping and association studies . Multiple bi-parental linkage QTL mapping studies on aflatoxin and drought have found many loci which tend to be specific to a narrow set of genetic backgrounds, but not maize as whole. In contrast, genome wide association studies (GWAS) allow evaluating multiple alleles in multiple genetic backgrounds at once. GWAS also captures many more effective recombination events than traditional linkage populations, and therefore has the potential to increase QTL resolution when the assembled germplasm panels have low linkage disequilibrium (LD).
Complex trait analysis can benefit from a number of statistical adjustments that account for data dependencies such as field spatial variation or blocking effects. A number of spatial adjustment methods have been developed, which can decrease the micro-environment non-genetic noise in an experiment, making the results more robust and powerful [53–57]. These spatial adjustment procedures are particularly appropriate in large field trials with heterogeneous soil or furrow irrigation patterns. In addition, the modelling of the genetic by environmental interaction (G x E) can allow detection of QTL specific for certain environments [58,59]. These more complex models maximize the amount of genetic variation explained and model the genetic variance and correlation across environments. As a result, the power of the data is increased, while reducing false positives. Similarly, statistical methods to account for genetic population structure and relatedness have long been used in association mapping to account for spurious associations and provide a more conservative analysis . Finally, a number of multiple testing corrections procedures have been developed, each with various advantages and drawbacks [61–63]. However, despite the various statistical analyses corrections (spatial adjustment, modelling G x E, genetic relatedness, and multiple testing), it has been determined that some associations are still subject to false positives (and negatives) which may only be identified retrospectively . In past work, we have found that attempting multiple analysis methods and ensuring consistency between results suggests the most robust candidates for further analysis [65,66], this can come at the expense of clarity in presentation.
This study used 346 lines from across two diversity panels testcrossed to Tx714, which is a high yielding but aflatoxin susceptible line that is closely related to B73, to evaluate alleles from diverse germplasm in Southern US environments. Specifically, the goals of this study were: 1) to identify diverse lines to improve yield potential, aflatoxin resistance, and agronomic abilities in hybrid combination; and 2) to identify genomic regions that confer these phenotypes in hybrids grown in Southern sub-tropical environments using a genome wide association study (GWAS).
Materials and Methods
Phenotypic data collection
An association mapping panel comprised of subsets of the USDA Goodman maize association panel (302 lines, ) and the diverse Southern subtropical focused Williams/Warburton panel (300 lines, ) was assembled for a total of 400 lines . Of the 400 lines, 346 produced sufficient seed in testcrosses to Tx714  in a summer and fall nursery in College Station, TX and Weslaco, TX, respectively in both 2010 and 2011. The hybrids were evaluated in two separate replicates in a randomized complete block design (RCBD) under irrigated and non-irrigated treatments with commercial checks randomly assigned in the field. Standard production agronomics were followed to provide furrow irrigation to the irrigated trials as needed to support crop growth. Only the well-watered trials were inoculated with A. flavus. A modified colonized kernel technique was used where autoclaved maize kernels were inoculated with A. flavus spores from isolate NRRL 3357  and then incubated 24 to 36 hours to promote A. flavus growth and sporulation. Colonized kernels were manually spread on the soil surface between treatment rows when the maize hybrids reached mid-silk stage [4,71]. The testcross hybrids were grown in College Station (CS) in 2011 (CS11) and 2012 (CS12) at the Texas A&M AgriLife Research Farm in one row plots 7.92 meters long and 76.2 centimetres wide, and measured for all reported traits as explained below. The target plant density was 75,000 plants/ha and the soil type was a Ships clay. Combination of year and treatment were designated trials and the following coding system was adopted in this research: The irrigated trials were inoculated with A. flavus and coded well watered (WW) and the non-irrigated trials were coded as water stress (WS). The water stress was most severe in 2011, especially during flowering and grain filling. Water stress was less severe at these times in 2012 because of moderate temperatures and timely rainfall. An additional well watered location was evaluated in two replicates using a RCBD at Mississippi State, MS in 2012 (MS12) only for aflatoxin accumulation and days to silk. Hybrids in this location were inoculated using the side needle method [71–73].
Plant height was measured from the ground to top of the tassel; ear height was measured to the bottom ear node. Days to silk and days to anthesis were measured as the number of days, at which 50% of the plants in a plot showed silks or pollen shed respectively. Anthesis silking interval (ASI) was calculated using the difference between days to silk and days to anthesis. Because the severe drought in Texas in 2011 substantially reduced ear size, the CS11-WS trial was completely hand-harvested. In contrast, for the CS12-WS, CS11-WW and CS12-WW, 10 ears were hand harvested skipping the first five plants in the plot, and then hand harvesting every other plant. The rest of the plot was harvested using a John Deere 3300 combine with a HM-1000B Grain Gauge (Juniper Systems Inc., Logan, Utah) from which plot weight, moisture and test weight were obtained. For the MS12-WW trial only the 10 inoculated ears were hand harvested and processed for aflatoxin content. Hand harvested ears from each hybrid in CS were photographed and phenotyped for disease, percentage of kernel abortion and pollination. 500-kernel weight was determined after shelling. All yields were adjusted to 15.5% moisture as determined from the combine at harvest or for hand harvested ears after shelling using a Dickey-John mini GAC plus portable moisture tester. Moisture was expressed as percentage of weight. Aflatoxin content was determined by the Vicam Aflatest fluorometer (Vicam, Watertown, MA) following standard procedures [7,37,39]. Aflatoxin values were transformed using the transformation (Log10 [aflatoxin + 10]) to improve normality and constant variance and back-transformed. Raw phenotypic data can be found in S1 Table.
Phenotypic analysis for the GWAS study
The different hybrids within each treatment and each location were laid out using a randomized complete block (RCBD) design with commercial checks randomly assigned to different plots in the trial. Four commercial checks (check 1: BH9014 GENVT3P, check 2: GA26V21 GENVT3P, check 3: GA28V81 GENVT3P, check 4: DKC-805 GENVT3P) were replicated several times in the field and were used to adjust for field spatial variation and to estimate the residual variance. These commercial checks were excluded for estimating variance components and Best Linear Unbiased Predictors (BLUPs) for the GWAS. A combined multi-environment trial (MET) analysis was performed considering three different models: 1) a randomized complete block design (RCBD) model, 2) a spatial model that used autoregressive terms for the row and column effects (AR1 x AR1), and 3) to model the G x E, an unstructured genetic variance-covariance matrix model (VCOV). The unstructured model assumes that every location has its own genetic variance and that the genetic covariance between locations is different. The likelihood ratio test was used to compare between the three different models for each trait, but not all models could be estimated for all traits because of differences in the heritability estimates for the different traits.
An RCBD model was first fit to the data. The phenotypic observation yijk on hybrid i in replicate j of trial k was modelled as: (1) where, μ is the grand mean; ek is the fixed effect of trial k; gi is the random effect of hybrid i and is ∼ NID (0, σ²g), i = 1, …,g; (r/e)jk is the random effect of replication j nested in environment k and is ∼ NID (0, σ²r), r = 1,2; (g*e)ik is the random effect of the interaction between hybrid i and trial k and is NID (0, σ²ge), and εijk is the random residual effect for hybrid i in the replication j of trial k and is NID (0, σ²ε). For the second model the phenotypic observation yijk on hybrid i in k was modelled as: (2) where, μ is the grand mean; ek is the fixed effect of trial k; gi is the random effect of hybrid i and is ∼ NID (0, σ²g), i = 1, …,g; (g*e)ik is the random effect of the interaction between hybrid i and trial k and is NID (0, σ²ge), and εijk is the random residual effect for hybrid i in the replication j of trial k and is NID (0, σ²ε). εijk was further expanded to fit two-dimensional autoregressive first order (AR1) x (AR1) terms for the row and column effects[53–57].
A third model assumes a genetic variance-covariance (VCOV) matrix based on an unstructured model for the random genetic effects where a specific variance was fit for each trial and a specific covariance was fit for each pair of trials. The phenotypic observation Yijk on hybrid i of trial k was modelled as: (3) where, μ is the grand mean; ek is the fixed effect of trial k; ggei represents the hybrid main effect together with the genetic environmental interaction (GEI) for hybrid i in trial k. This model was enhanced by including the significant terms for autoregressive terms for row and column effects. Each of the three different models were fit using restricted maximal likelihood (REML) (Patterson and Thompson, 1975) in ASREML v3.0 . JMP Pro V11.1.1 was used in testing for significance of yield and aflatoxin in hybrids, treated as fixed, using similar models. Heritability (h2) was calculated using the equation: (4)
[55,74]. The predictor error variance (PEV) was calculated as square root of the average of the standard error for each hybrid prediction for a given trait. The standard error was obtained from the output in ASREML.
Genetic diversity, population structure and estimation of kinship matrix
Genome-wide single nucleotide polymorphism (SNP) genotype data from 213 lines from the sub-tropical diverse panel was obtained from the USDA-ARS Corn Host Plant Resistance Research Unit (Mississippi State, MS) using the Genotype By Sequence (GBS) method . All SNP locations are based on the maize genome version AGPV2. Genotype data for 133 lines from the temperate panel was extracted from the maize diversity panel of 282 inbred lines available in Panzea . These genotypes were identical to Warburton et al.  with the additional lines obtained from the 282 association mapping panel . SNPs with a minor allele frequency (MAF) greater than 25% and a low missing data rate (<7.5%) were extracted to perform the genetic diversity and structure analysis only (total 1999 SNPs). The genetic distance was calculated for Nei’s genetic distance  using the software PowerMarker. A principal coordinate analysis (PCoA) was then carried out using the prcomp function in R (R Development Core Team, 2013). Population structure was determined using the software Structure v2.93 . The number of subpopulations was estimated from five independent runs having 5 x 105 burn-in and sampling iterations, and the number of subpopulations was varied between 1 and 15. The ancestry model allowed for population admixture and correlated allele frequencies. The optimum K was estimated using the ad hoc statistic ∆K, which is based on the rate of change in the log probability of the data between successive K values . Based on the estimated K determined, a run of 5 x 106 burn-in and sample iterations was used. A kinship coefficient estimation matrix was created using the VanRaden algorithm as implemented in the software GAPIT .
Genome wide association study (GWAS)
For each trait analysed, up to four different phenotypic inputs were used for GWAS analysis to ensure that quantitative trait variants (QTVs) were robust across models, and to identify QTV’s for specific environments. The phenotypic observations used as input for the four GWAS analyses included 1) the entry mean for the different traits collected for all trials; 2) the BLUPs for the combined MET analysis in Eq. (1); 3) the BLUPs for the MET analysis in Eq. (2) including the row and column (AR1) x (AR1) effects for the spatial analysis; and 4) the BLUPs from the VCOV GEI analysis in Eq. (3). Population structure and relatedness were taken into account in the linear mixed model in all four analyses as described by Yu et al. . The association mapping analysis was conducted using the compressed mixed linear mixed model (CMLM) and the P3D method, as implemented in GAPIT [79–81]. SNPs with a MAF< 0.05 were discarded for the association mapping analysis as they are prone to false positives . Two common correction procedures were used to control for multiple testing, since each procedure has unique advantages and disadvantages [61,83]. First, the p-value was corrected for multiple testing using the modified method proposed by Gao et al. . This approach calculates the Meff, which is the effective number of independent test to correct for multiple testing. Meff is determined by estimating the pair-wise composite LD matrix [62,63,84]. The number of principal components (PCs) required explaining 99% of the variation in the dataset was used to calculate the Bonferroni correction. The Meff calculated for this study was 49,030 independent tests, which is equivalent to 1.01 x 10-6 or 5.99 (-log10[p]). Second, the false discovery rate (FDR) was also used to correct for multiple testing for all the different analysis and traits in this study but can overcorrect, leading to type II error .
Results and Discussion
Variance component estimates and heritability within and across environments
The genetic variance in this experiment was very high. The heritability estimates for the WW and WS trials for grain yield ranged from 0.61 to 0.83 (Table 1). The lowest heritability in the WS trial was reported during the extreme drought of 2011. This result is expected since heritability estimates are expected to decrease under stressed trials [21,86–89]. Comparable heritability estimates for yield under drought stress trials have been reported previously [53,90]. Within each environment, heritability estimates for aflatoxin level ranged from 0.67 to 0.83 for both the transformed and raw aflatoxin level data (Table 1) demonstrating high statistically significant variation between genotypes. However, a lower heritability estimate of 0.59 across all environments was obtained (Table 2). Transformation of aflatoxin is a standard procedure used prior to QTL mapping [7,34,39], and it is needed because the data are often skewed. In this study, data transformation increased the heritability in some, and across all, environments (Table 2).
Across all environments and all reported traits, heritability estimates ranged from 0.59 for aflatoxin to 0.98 for days to anthesis (Table 2), based on the components of variance estimated using the MET model from Eq. (1). These heritability estimates are considerably higher than estimates reported in previous studies [7,39,72,91]. The differences observed in the heritability estimates for grain yield and aflatoxin level between this study and others are partially explained by the high genetic variation present in this study. Based on the heritability estimates observed, it was concluded that sufficient genetic variation was present in this panel to perform an association analysis.
Spatial analysis for row column (AR1 x AR1), and G x E modelling using an unstructured genetic variance-covariance model
The number of hybrids investigated in the experiment resulted in large field tests. Spatial analysis has proven useful in identifying and reducing error in large field studies when checks are replicated throughout the field [54,56,92]. The analysis showed that field spatial variation was most important for for grain yield among traits reported (Table 3). CS11 was the location with the most field spatial variation for grain yield (data not shown). This would be expected with the extreme drought in 2011, as drought typically brings out the highest level of field variation. For other traits such as plant and ear height and 500 kernel weight field spatial variation was observed as well. In contrast, field spatial variation was not observed for aflatoxin contamination, however this could partially be due to the large variance (lower heritability) that would not allow row and column effects to be determined. Fitting these spatial models, when significant, partitions out error allowing more power for subsequent QTV detection.
Genetic correlation across different environments
A higher genetic correlation across environments would be expected in diverse material with a wide range in flowering, but the diversity in this study was moderated by the use of testcrosses. The modelling of the variance-covariance structure for GEI analysis using Eq. (2)  was greater than zero for all environment pairs (Table 4). This analysis showed the lowest genetic correlation for grain yield was 0.63, which correspond to the genetic correlation between CS11-WS and CS12-WW trials (Table 4). In contrast, the genetic correlation between other trials for grain yield ranged from 0.70 to 0.95. Based on these results, it was concluded that hybrid ranking between years and trials was consistent. This result is similar to other drought QTL studies in maize, sorghum and wheat [90,94,95]. The lowest genetic correlation for aflatoxin level was between the CS12-WW and MS12-WW trials (Table 5). This could in part be due to the different inoculation methods used which was confounded between these locations or other environmentally specific differences.
Best performing hybrids
There were multiple hybrids that performed statistically as well as, or better than, the elite commercial checks used in this study for grain yield (Table 6). This was evidence that diverse inbred lines bred in the test environment have the potential to out-yield elite commercial hybrids under water stress, even when combined with an older public tester. Under the severe drought that was experienced in Texas 2011, none of the commercial checks was among the fifteen highest yielding hybrids but three commercial checks were not significantly different from this group. In the CS11-WW and CS12-WS trials, only one of the commercial checks was among and not significantly different from the top fifteen hybrids (Table 6).
For aflatoxin resistance, the best testcrosses (Table 7) were numerically better but generally not significantly different from most of the checks (data not shown) in every environment; this is likely because the tester selected, Tx714, has very high aflatoxin susceptibility, which was anticipated to identify inbreds with non-recessive forms of resistance . However, aflatoxin values for many lines demonstrated an aflatoxin susceptibility statistically greater than the commercial checks; overall BLUPs from Eq. (1) ranged from 215 up to 1068 ng g−1. Testcrosses of the lines bred in tropical and sub-tropical areas generally exhibited decreased aflatoxin susceptibility when compared to testcrosses from temperate materials. These results indicate the presence of favourable alleles for stress tolerance and Aspergillus ear rot disease resistance in exotic material. The presence of favourable genetic variation in exotic germplasm has been previously reported by different authors, including for aflatoxin resistance [43,45–47,96].While the lowest accumulating testcross hybrids cannot be recommended per se, the best performing inbreds are expected to be useful in breeding for decreased aflatoxin susceptibility.
Genetic diversity, population structure and estimation of kinship matrix
Of the 400 lines, only 346 lines were able to successfully produce seed under conditions of the four Texas nurseries. The genetic diversity analysis between the 346 inbred lines indicated that the majority of the lines are quite distantly related to each other. Between all pairs of lines, the mean and median genetic distance was 0.68 and 0.69, respectively. Only 0.1% of the pairs of entries exhibited a genetic distance of less than 0.2, 2.7% of the entry pairs exhibited a genetic distance of less than 0.5, and 64% of the entry pairs exhibited a genetic distance less than 0.7 (S1 Fig.). These results suggested that the majority of the lines are equally distantly related in absolute number of shared alleles, however there were still patterns in allele sharing that resulted in observed population structuring. The population structure was determined using the software Structure [77,97], a preliminary analysis was performed to estimate the optimum K (number of populations) using the rate of change in the log probability, as measured by the ad hoc statistic ∆K. This analysis found that the optimum subpopulation number was K = 4 and these clusters roughly correspond to tropical, temperate, B73, and a mixed group. These results are similar to previous studies [49,67]. However, the Northern U.S./B14 stiff stalk lines clustered with non-stiff stalk lines (Mo17 related lines) , which is uncommon. This likely occurred because the Northern temperate germplasm is poorly represented in the current association panel. In contrast, the B73 group is small but well-defined.
The population structure results were similar to those visualized by the pairwise genetic distance matrix using a PCoA. The first two eigenvectors used to graph the data explained 14% and 13% of the variation, respectively, and separated three general clusters containing tropical lines, B73 types and a mixed group (Fig. 1). The mixed cluster is ill-defined and contains the non-stiff stalks and most southern US inbred lines. To explain 80% of the variation, over 100 PCoA eigenvectors were required, which further corroborated the weak structure between the inbred lines.
GWAS analysis for grain yield and 500 kernel weight
Based on the heritabilities shown in Table 2 for the different traits collected, it is clear that there was enough genetic variation in this study to perform a GWAS study to identify QTVs for most traits collected. For the GWAS analysis for grain yield four different analyses were performed using raw means, and the BLUPs extracted from equations 1 to 3 and the results were compared for consistency. Five QTVs for grain yield on chromosomes two, seven (two variants), nine (not shown in Fig. 2), and ten were identified across most analyses (Fig. 2). The allelic effects for the different QTV ranged from 0.14 to 0.59 ton/ha, and the amount of phenotypic variation explained ranged from 3 to 5% (Table 8).
The phenotypic observation was the BLUPs for the MET analysis, which includes AR1 x AR1 terms to adjust for column and row effects. The red line represents the threshold value after correcting for multiple testing using the Meff or FDR.
QTV1, QTV2, and QTV3 were detected under WW and WS trials in 2011 and 2012, respectively (Table 8). These QTV were also detected in the MET analysis that adjusted for field spatial variation (Fig. 2). QTV1 had the strongest effect in the non-irrigated trial in 2012. No QTV were detected in the non-irrigated trial in 2011, likely due to low overall heritability. QTV4 was only significant in the well-irrigated trial in 2012 and QTV5 was only significant in the MET analysis that adjusted for field spatial variation (Table 8). Several linkage mapping studies have reported multiple QTLs for grain yield, but to our knowledge this has not yet been examined using association approaches or in testcrosses [51,98–105]. These previous studies based on linkage mapping identified one to five linkage QTLs for grain yield, explaining 20 to 35% of phenotypic variation for all QTLs together.
QTV1 found in this study for grain yield is located in bin 2.03, and the exact SNP location can be found within the locus name in Table 9. No QTLs associated with grain yield have been reported in this bin by other authors and this SNP is in the abph1—aberrant phyllotaxy1- gene (Maize B73 RefGen_v2available at www.maizegdb.com/). Gene abph1 is expressed in the shoot apical meristems, and mutations in this gene alters the regular arrangement of leaves and flowers . Despite LD decaying rapidly in this area of the genome, further investigation must be done to confirm that this SNP is only associated with the abph1 locus, and not other linked genes.
QTV2 is located in bin 7.04, the allelic effect ranged from 0.14 to 0.42, and the percentage of explained phenotypic variation ranged from 4.5 to 5% (Table 8). In addition to yield, QTV2 was also detected as associated with plant height, days to anthesis and days to silk, suggesting a pleiotropic effect on multiple traits (Table 9). In a recent meta-analysis of Texas commercial yield trial data, the R2 of plant height and grain yield was determined to be 0.61, indicating the importance of robust tall plants under Southern stressed growing conditions and further supporting the possibility of pleiotropy of this locus . In order to further address this question, an association analysis was performed including plant height as a covariate in Eq. (1) and (2). The QTV2 variant was no longer significant for grain yield, providing an additional line of evidence that this QTV affects grain yield via a positive pleiotropic effect with height and may be enhancing overall plant vigor (Fig. 3). Austin and Lee  found a QTL for grain yield in the same bin that explained 3.9% of the phenotypic variation; however, their sparse map makes it difficult to confirm co-location with QTV2. Schön et al.  using 1000 F4:5 maize testcross progeny developed by Pioneer Hi-Bred in 1995, identified a QTL in the same chromosome bin, which was detected across seven environments in the Corn Belt.
The phenotypic observation was the BLUPs for the MET analysis, which includes AR1 x AR1 terms to adjust for column and row effects. In addition, this model includes a centered and standardized covariance for plant height. The red line represents the threshold value after correcting for multiple testing using the Meff or FDR.
To identify potential candidate genes for QTV2, LD was investigated and found to decay within 0.2 kilobases (kb) upstream and showed no LD with the closest marker 225 (kb) downstream (data not shown). Different authors have reported that LD decays rapidly in maize around 1 to 10 kb; specifically for chromosome seven, linkage disequilibrium has varied from 2 to 5 kb depending on the region and the germplasm [52,108–111]. Assuming that the LD follows the same pattern observed for the upstream area from QTV2, a plausible transcript corresponds to a protein with an unknown function (Table 9) that is expressed in vegetative stage 1 (V1) (Maize eFP browser) [112,113].
QTV3 is located in bin 9.06, the allelic effect ranged from 0.28 to 0.33 ton/ha across environments, and the percentage of explained phenotypic variation ranged from 3.5 to 3.9% (Table 8). No QTLs have been reported in bin 9.06 for grain yield to our knowledge. LD near the QTV3 variant is higher than average, which is consistent to previous reports for chromosome nine [52,110,111]. There are six transcripts upstream from QTV3 within a high level of LD. Downstream from the QTV3 variant, there are two additional transcripts (GRMZM2G150594 GRMZM2G573775) in LD with the variant. Additional work will be needed to narrow down which of the seven possible transcripts (if any) are responsible for the positive association with grain yield. QTV4 (bin 9.07), and QTL5 (bin 10.02) are in locations that have not been previously reported to contain grain yield QTL, and correspond to genes of unknown function (Table 8).
This study did not detect a significant QTV for 500-kernel weight after stringent corrections for multiple testing. However, a peak near significance was consistently observed on bin 9.01 across different trials and both of the MET analysis (data not shown). The effect estimates ranged from 2.3 to 3.6 g per 500 kernels, and explained 3.8 to 4.3% of the phenotypic variation. Although this peak was not significant after correcting for multiple testing using FDR and Meff, this SNP warrants further investigation based on consistency and the fact that other authors have reported a linkage QTL for 300 kernel weight, named q300k21, in the same bin [114–116]. Correction for multiple testing can be too stringent, causing false negatives and loss of information.
The effect sizes of significant QTV for yield, which ranged from 3 to 5%, appeared larger in magnitude than might be expected for a complex quantitative trait; we believe there are a few potential causes. First, heterosis from the tester would be expected to mask phenotypic variation at many alleles. For all QTVs the tester (Tx714) has the allele of positive effect , suggesting theseQTVs show either additive effects or dominant deleterious effects. Second, with such a diverse panel, more extreme deleterious alleles would be expected to be present than in a narrow elite panel. Indeed, we find that the alleles with the positive effect are more common in the population and examining the QTVs in ex-PVP lines , only a few have the alleles of negative effect; these are generally in the oldest lines suggesting they have been selected out of more elite maize . Third, given the moderate population size it is reasonable to expect the ‘Beavis effect’ or ‘winners curse’ to be present in this study and all allelic effects may be overestimated. Fourth, it is possible that population structure is not fully accounted for ; however, this seems less probable after examining the diverse origins of the individuals with these SNPs.
GWAS for plant and ear height
Two QTVs (QTV2 and QTV6) were detected for plant height (Fig. 4). QTV2 was previously described for grain yield (Table 8 and 9). Several studies have reported a QTL for height at bin 7.04 [104,105,107]. The effect of QTV2 ranged from 5.3 to 5.6 centimetres, and explained 4.6 to 5% of the phenotypic variation (Table 10). The other SNP, QTV6 in bin 3.05, had an effect that ranged from 3 to 3.2 centimetres and explained 4.7 to 4.8% of the phenotypic variation (Table 10). A QTL for plant height has been reported previously in bin 3.05 using a bi-parental cross between lines Ki3 and CML139 . Both QTV2 and QTV6 were detected in the non-irrigated and in the irrigated trials in 2011 (Fig. 4).
The phenotypic observation was the BLUPs for the CS11-DTYL trial. The red line represents the threshold value after correcting for multiple testing using the Meff or FDR.
For ear height, only one significant SNP, QTV7 (bin 2.04) was detected after correcting for multiple testing and only in the non-irrigated trial in 2012 (Table 10). It had an estimated effect of 3.9 centimetres and explained is 6.3% of the phenotypic variation. No QTL have been previously reported for ear height in this bin to our knowledge. Three additional peaks were consistently detected across different analyses, and the effects were strongest in the CS12-WW trial (Fig. 5). These SNPs are worthy of additional consideration despite the fact that none of them were significant after adjusting for multiple testing. The SNP S4_62573339 is located in bin 4.05, had an allele effect of 4.4 centimetres, and explained 5% of the phenotypic variation (Table 10). The SNP falls within a putative transcript; however, the LD extends around 100 kb in this area, indicating that this transcript may not be the causal gene. The strong LD in this area may indicate that this region of the genome has been under recent selection. Different authors have reported that LD generally decays around 1 to 5 kb for chromosome four [52,108–111]. A QTL for plant height, but not ear height, has been previously reported in bin 4.05 by different authors [102–105].
The phenotypic observation was the BLUPs for the CS12-WW trial.
Significant peaks associated with SNPs S4_173817044 and S4_173996901 are located in bin 4.07. The effect of the S4_173817044 SNP ranged from 3.8 to 6.5 centimetres, and the percentage of explained phenotypic variation ranged from 4.2 to 5%. The effect of the S4_173996901 SNP ranged from 3.8 to 4.3 centimetres. The amount of phenotypic variation ranged from 4.4 to 4.6%. Similar to the previous QTL for ear height, neither SNP falls within a bin where a QTL has been reported by other authors. Additionally, the LD decay around 100 bp in this area of chromosome four and based on the Maize B73 RefGen_v2 genome, these SNPs seems to be in the promoter regions of different uncharacterized transcripts (Table 9).
GWAS for flowering time traits
For the days to silk and days to anthesis traits, only one GWAS analysis was performed using the entry mean as the phenotype observation. Multiple QTVs were found (Fig. 6) with effects ranging from 0.5 to 1.8 days and the percentage of explained phenotypic variation ranging from 4.2 to 7.4% (Table 11). QTV2, previously reported for grain yield and plant height, was also detected for days to anthesis and days to silk in the well-watered CS11-WW and CS12-WW trials. Buckler et al.  reported three QTLs for days to anthesis (PZA03624, PZA03728, PZA-1744) and four QTLs for days to silk (PHM15501.9, PZA00986.1, PZA02722.1, PZA01044.1) on chromosome seven. However, based on the physical location and genetic distance none of the markers are located in bin 7.04.
The phenotypic observation was the BLUPs for the CS12-WW trial. The red line represents the threshold value after correcting for multiple testing using the Meff or FDR.
QTV8 (bin 8.05) was detected for both days to anthesis (effect of 0.9 days) and days to silk (effect of 1 day), explaining 5.8% of the phenotypic variation for both traits. Two different SNPs were found for QTV9 (bin 8.05), S8_123509373 and S8_123511933, separated by 2.5 kb. The effect for these QTL variants ranges from 0.5 to 0.8 and the phenotypic variation ranges from 4.6 to 4.7% (Table 11). Buckler et al. (2009) reported three QTLs for days to anthesis (PZA00908.2, PZB02155.1, PZA00675.1) and three QTLs for days to silk (PHM4711.14, PZB02155.1, PHM1834.47) on this chromosome across eight environments in the NAM panel. One of the QTL reported is located on chromosome eight in bin 8.05 (locus pzb02155), located between position 123,542,426 and 125,974,265, which is 30 kb downstream from QTV9. LD extends for 3 kb upstream from QTV9 (results not shown); however, downstream from the QTV9, there is gap between the markers of 80 kb. As a consequence, this study cannot definitively determine if QTV9 and locus PBZ02155.1 are the same QTV or not. Finally, QTV10 had an effect of 0.6 for days to anthesis and 0.8 for days to silk, and explaining 4.2% of the phenotypic variation for days to anthesis and 4.3% for days to silk (Table 11).
This study did not find any significant QTL variants for ASI despite multiple analyses run using the average of the raw data, and the MET analysis described in Eq. (1) and (2). Buckler et al.’s  study was the most powerful to date for flowering time QTV detection, and many but not all of the QTV were novel to a particular study. This lack of co-localization raises the question of whether the differences observed were because of the use of different germplasm, different environments, or the use of testcross hybrids. If germplasm caused the observed differences, this suggests that the hypothesis of multiple variants for common genes may play more of a role in temperate than in tropical germplasm. If different environments were the cause, this suggests that local testing is critical for relevance in association mapping. If the use of testcross hybrids were the cause, this suggests that for the most relevance to crop improvement, only testcross hybrids and a relevant tester should be used in GWAS studies conducted in crops grown as hybrids. It seems likely that the true cause is partially due to all these factors and their interactions.
GWAS for aflatoxin resistance
The GWAS analysis did not find any significant QTLs for the transformed aflatoxin data after correcting for multiple testing (Fig. 7) despite moderately high heritability and significant variation between the experimental hybrids. Three peaks appeared consistently for the irrigated trials CS11-WW and CS12-WW, and the combined MET analysis (Table 12). SNP: S3_185272026 (bin 3.06) had an allele effect of 9.4 ng g−1 and explained 6.06% of phenotypic variation (Table 12). This SNP corresponds to transcript GRMZM2G399433, which is highly expressed in the pericarp, embryo and endosperm, the silks and the cob during flowering and post-flowering . These tissues are directly relevant for the infection of A. flavus to proceed under non-wounding inoculation. A QTL for aflatoxin and/or Aspergillus ear rot resistance has been previously reported in bin 3.06 [34,36,39]. This QTV did not have a significant effect on any of the other traits measured (yield, height, flowering time, etc.) indicating that the effect of this QTV is directly on aflatoxin or ear rot resistance, not via a co-varying trait.
The phenotypic observation was the BLUPs for the CS12-WW trial.
The SNP: S4_17376432 (bin 4.03) had allele effects of 9.1 ng g−1 and explained phenotypic variation ranging from 5.3 to 5.7%. The transcript GRMZM2G013546 corresponds to this SNP marker and is highly expressed in the pericarp and husk of maize [36,41,112], and a strong QTL for aflatoxin resistance coming from the resistant line Mp313E has been found in this bin location by multiple authors [5,35]. Further evidence of the presence of a significant QTL in this bin comes from a recent meta-analysis study . After building a consensus map, the study found linked markers extending from bins 4.02 to 4.04, which seems to contain six QTL for the three diseases presented, including aflatoxin and Aspergillus ear rot. Multiple QTL for ear rot resistance have been reported in bin 4.03 for resistance to ear rot diseases such as Fusarium and Gibberella [36,41]. The most likely reason that this QTV was not declared statistically significant in our study is that only a fraction of the lines in the panel have the allele, thus reducing our statistical power. Another possible explanation is the low marker density, which may not have been close enough to detect the causal polymorphism. The peak at SNP: S5_197707198 (bin 5.06) corresponds to the transcript GRMZM2G057789, which is highly expressed in the silks during the R1 stage . Xiang et al.  reported a QTL responsible for ear rot resistance in the same bin 5.06. One reason for the relatively poor detection of QTV for aflatoxin could be the choice of a susceptible tester, allowing only dominant sources of resistance to be found. However it is dominant sources of resistance that would be of most use in commercial breeding, since a recessive source would require improving lines in two heterotic groups.
Major findings of this study
Association mapping using candidate genes or at a whole genome level is now routine in many plant species [64,66,121–126] including maize. These studies have reported on average fewer associations than linkage mapping based studies, even for genes with large phenotypic effects. Although many maize association mapping studies have been conducted [127–129], there have been few previous reports of mapping in a hybrid testcross background which allows dominant alleles to be detected. This study found 10 QTV and other potential associations which were nearly significant after correcting for ∼50,000 tests; several of these QTVs co-located with QTL reported by others, which suggests that association mapping in a diverse set of germplasm using testcrosses is consistent and relevant. For QTV co-located to QTL reported from linkage mapping, this study identifies potential candidate genes, improves genetic resolution and provides independent confirmation of these loci. QTV novel to this study might not have been segregating in previous linkage mapping populations, and detection of these is a benefit of GWAS studies. Further investigation of these novel QTV will rule out possible false positives and validate their use for maize improvement. Detection power of association mapping is affected by several factors including sample size, population structure, the extent of LD, the magnitude of the phenotypic effect, and the quality and density of the SNP markers used [52,60,110,130]. The results here clearly highlight the importance of larger sample size to be able to detect associations with rare SNPs and genes of small effect, likely the most important genetic basis for complex traits such as drought tolerance and aflatoxin resistance. Yan et al.  reported that using a population of 500 individuals in GWAS can detect associations that explain as little as 3% of the phenotypic variation. Increasing the sample size to 1500 genotypes can detect associations that explain 1% of the phenotypic variation, but would not be practical given the resources needed for field analyses with replication. Similar trends have been obtained for QTL mapping, where it has been shown that increasing the number of individuals is more efficient than increasing the number of replications . The importance of large sample size was highlighted by our results obtained for aflatoxin GWAS, where no significant associations were detected after correcting for multiple testing. In the case of maize we feel it best to err slightly towards type I error since it is relatively straight forward to confirm or refute candidates in subsequent studies. Therefore, we used two different methods of multiple testing corrections, with different advantages and disadvantages; improved methods with less type II error for crop species would be desired and is an ongoing area of research [61,83].
This study used a diverse maize association mapping panel to identify genomic regions associated with grain yield, aflatoxin resistance and important agronomic traits in Southern US environments. Useful variation in diverse germplasm for aflatoxin resistance and drought tolerance was identified. Additionally, this diverse germplasm, when testcrossed to an elite, although older, Texas version of B73, has the potential to out-yield commercial hybrids sold in Texas but bred in the best Midwestern environments. This study found 10 QTVs for grain yield, plant and ear height, days to anthesis and days to silk with some co-localizing to previously reported linkage QTL, while others were novel, demonstrating the utility of GWAS to resolve and discover useful variation. Comparing different statistical adjustment models was useful to maximize the power of the data and to detect consistent QTVs. Once these QTVs are validated, they will be useful for molecular improvement of Southern maize germplasm and, if cloning is pursued, for understanding the basic biology of improvement of these traits.
S1 Fig. Heatmap of genetic distances between lines.
The pairwise genetic distances between lines according to Nei’s 1972 genetic distance were plotted in a heat map. The distribution of pairwise distance values in the upper left also shows the color legend. In the main figure, the dendrogram and relatedness are shown on the top and left. The names of the lines are shown on the right and bottom of the main figure.
S1 Table. Raw phenotypic data.
The raw phenotypic data is provided to allow genetic mapping using this panel with SNPs discovered in the future. Column headings are as follows: Rows—The number of rows in a plot; Row Width—the Row width in centimeters; Range Length—the plot length in meters; Loc—the location of the test; Year—the year of the test; Trial:$9—a unique combination of location, year, and treatment; Trial:$92—a unique combination of location, year, and test; Pedigre:$160.—the pedigree of the testcross hybrid inbred lines which were crossed to Tx714; Common name—another version of the inbred pedigree; Entry:$ a coded entry for analysis; Entry_No—the entry number; Panel:$15.—a stock number for the inbred; Inbredtag:$15.—a stock number for the testcross hybrid seed that was planted; Tagtrial:$15.- the unique barcode/ plot location in the field; Lox:$—the knocked out lipoxygenase gene in the Tx714 isogenic tester (must be either lox-4 or lox-5); Plot:$—the plot number, only unique within a trial; Row—the range in the field, only unique within a field location; Column—the row in the field, only unique within a field location; Pos—an identifier of row and column only unique within a field location; Rep—the replication only unique within a trial; DTA—days from planting to anthesis; DTS—days from planting to silk; ASI—anthesis silking interval; PH—plant height in centimeters; EH—ear height in centimeters; STD—the stand count in number of plants; PlotWt_gr—the total plot weight from the combine and hand harvest in grams; Moist—the percentage of moisture in the grain; kernel500Wt—the weight of 500 kernels in grams; Yield_plot—the yield in kilograms per hectare; Yield_BuAc—the yield in bushels per acre; PlotWt_Combine_gr—the plot weight from the combine in grams; Wt_Shelled_grs—the weight from hand shelling hand-harvested ears in grams; EarCount—the number of ears hand-harvested and hand shelled; poln_per—the percentage of pollinated kernels on hand-harvested ears; abortion_per—the percentage of kernel abortion on the ears; flav_per—a visual rating of the amount of Aspergillus flavus sporulation on the ears; NoKerRow1, NoKerRow2, NoKerRow3—the number of kernel rows on each of three randomly selected hand harvested ears; type:$—a visual rating of the kernel type (flint or dent); the amount of Aflatoxin in ng g−1; Log(Aflatoxin+10)—the log of aflatoxin in ng g−1 plus 10.
We thank the many students and staff who assisted with the seed production and phenotypic data collection for this project. We also thank the editor and anonymous reviewers for their comments which greatly helped to improve this manuscript.
Conceived and designed the experiments: IDBF GNDLF SCM TI MK. Performed the experiments: IDBF GNDLF SCM TI PCH MW PW MK GLW. Analyzed the data: IDBF MW SCM. Contributed reagents/materials/analysis tools: IDBF GNDLF SCM TI PCH MW PW MK GLW. Wrote the paper: IDBF SCM MW PW MK TI.
- 1. Frey KJ (1996) National plant breeding study I. Special Report 98. Iowa Agriculture and Home Economics Experiment Station. Ames, IA. 51 p.
- 2. Fuglie KO (2000) Trends in agricultural research expenditures in the United States. Ames, IA. 9–23. p.
- 3. Schimmelpfennig DE, Pray CE, Brennan MF (2004) The impact of seed industry concentration on innovation: a study of US biotech market leaders. Agricultural Economics 30: 157–167.
- 4. Betran J, Odvody G, Mayfield K, Isakeit T (2005) Breeding Corn to Reduce Preharvest Aflatoxin Contamination. Aflatoxin and Food Safety: CRC Press. pp. 353–378.
- 5. Brown RL, Chen Z-Y, Cleveland TE, Russin JS (1999) Advances in the Development of Host Resistance in Corn to Aflatoxin Contamination by Aspergillus flavus. Phytopathology 89: 113–117. pmid:18944783
- 6. Horne CW, Boleman LL, Coffman CG, Denton JH, Lawhorn DB, et al. (1991) Mycotoxins in feed and food-producing crops. College Station: Texas A&M. pmid:25144101
- 7. Mayfield KL, Murray SC, Rooney WL, Isakeit T, Odvody GA (2011) Confirmation Of QTL Reducing Aflatoxin In Maize Testcrosses. Crop Sci 51: 2489–2498.
- 8. Payne G (1998) Process of Contamination by Aflatoxin-Producing Fungi and Their Impact on Crops. In: Sinha K, Bhatnagar D, editors. Mycotoxins in agriculture and food safety. New York: Marcel Dekker Inc. pp. 277–306.
- 9. Robens J, Cardwell K (2003) The Costs of Mycotoxin Management to the USA: Management of Aflatoxins in the United States. Toxin Reviews 22: 139–152.
- 10. Smith R (2011) Southwest farmers and ranchers endure hardships, keep on going. Southwest Farm Press v 38 no 19 (October 6 2011) p 4.
- 11. Widstrom NW, Guo BZ, Wilson DM (2003) Integration of Crop Management and Genetics for Control of Preharvest Aflatoxin Contamination of Corn. Toxin Reviews 22: 195–223.
- 12. Williams WP, Windham GL, Buckley PM (2003) Enhancing Maize Germplasm with Resistance to Aflatoxin Contamination. Toxin Reviews 22: 175–193.
- 13. NASS (2013) USDA National Agricultural Statistics Service. Washington D.C.: USDA-NASS. https://doi.org/10.1097/ACC.0b013e3182973a4f pmid:25611960
- 14. Barrero Farfan ID, Murray SC, Labar S, Pietsch D (2013) A multi-environment trial analysis shows slight grain yield improvement in Texas commercial maize. Field Crops Research 149: 167–176.
- 15. USDA-ERS (2013) Economic reserach sevice. Washington D.C.: USDA-NASS. https://doi.org/10.1097/ACC.0b013e3182973a4f pmid:25611960
- 16. Rosenzweig C, Tubiello FN, Goldberg R, Mills E, Bloomfield J (2002) Increased crop damage in the US from excess precipitation under climate change. Global Environmental Change 12: 197–202.
- 17. Wuebbles D, Hayhoe K (2004) Climate Change Projections for the United States Midwest. Mitigation and Adaptation Strategies for Global Change 9: 335–363.
- 18. Campos H, Cooper M, Habben JE, Edmeades GO, Schussler JR (2004) Improving drought tolerance in maize: a view from industry. Field Crops Research 90: 19–34.
- 19. Kakumanu A, Ambavaram MMR, Klumas C, Krishnan A, Batlang U, et al. (2012) Effects of Drought on Gene Expression in Maize Reproductive and Leaf Meristem Tissue Revealed by RNA-Seq. Plant Physiology 160: 846–867. pmid:22837360
- 20. Rengel D, Arribat S, Maury P, Martin-Magniette M-L, Hourlier T, et al. (2012) A Gene-Phenotype Network Based on Genetic Variability for Drought Responses Reveals Key Physiological Processes in Controlled and Natural Environments. PLoS ONE 7: e45249. pmid:23056196
- 21. Bänziger M, Edmeades GO, Beck D, Bellon M (2000) Drought and Nitrogen Stress Tolerance in Maize: From Theory to Practice. Mexico, D.F.: CIMMYT. pmid:25506959
- 22. Bolaños J, Edmeades GO (1996) The importance of the anthesis-silking interval in breeding for drought tolerance in tropical maize. Field Crops Research 48: 65–80.
- 23. Abbas HK, Williams WP, Windham GL, Pringle HC, Xie W, et al. (2002) Aflatoxin and Fumonisin Contamination of Commercial Corn (Zea mays) Hybrids in Mississippi. Journal of Agricultural and Food Chemistry 50: 5246–5254. pmid:12188638
- 24. Windham GL, Williams WP (2002) Evaluation of Corn Inbreds and Advanced Breeding Lines for Resistance to Aflatoxin Contamination in the Field. Plant Disease 86: 232–234.
- 25. Mayfield K, Betrán FJ, Isakeit T, Odvody G, Murray SC, et al. (2012) Registration of Maize Germplasm Lines Tx736, Tx739, and Tx740 for Reducing Preharvest Aflatoxin Accumulation. J Plant Reg 6: 88–94.
- 26. Amaike S, Keller NP (2011) Aspergillus flavus. Annual Review of Phytopathology 49: 107–133. pmid:21513456
- 27. Kelley RY, Williams WP, Mylroie JE, Boykin DL, Hawkins LK, et al. (2009) Genomic profile of maize response to Aspergillus flavus infection. Toxin Reviews 28: 129–141.
- 28. Scheidegger KA, Payne GA (2003) Unlocking the Secrets Behind Secondary Metabolism: A Review of Aspergillus flavus from Pathogenicity to Functional Genomics. Toxin Reviews 22: 423–459.
- 29. Christensen SA, Kolomiets MV (2011) The lipid language of plant–fungal interactions. Fungal Genetics and Biology 48: 4–14. pmid:20519150
- 30. Scott GE, Zummo N (1990) Registration of Mp313E Parental Line of Maize. Crop Sci 30: 1378–1378.
- 31. Williams WP, Windham GL (2012) Registration of Mp718 and Mp719 Germplasm Lines of Maize. J Plant Reg 6: 200–202.
- 32. Llorente CF, Betrán FJ, Bockholt A, Fojt F (2004) Registration of Tx772 Maize Registration by CSSA. Crop Sci 44: 1036-a-1037.
- 33. Guo BZ, Krakowsky MD, Ni X, Scully BT, Lee RD, et al. (2011) Registration of Maize Inbred Line GT603 J Plant Reg 5: 211–214.
- 34. Paul C, Naidoo G, Forbes A, Mikkilineni V, White D, et al. (2003) Quantitative trait loci for low aflatoxin production in two related maize populations. Theoretical and Applied Genetics 107: 263–270. pmid:12677406
- 35. Willcox M, Davis G, Warburton M, Windham G, Abbas H, et al. (2013) Confirming quantitative trait loci for aflatoxin resistance from Mp313E in different genetic backgrounds. Molecular Breeding: 1–12.
- 36. Xiang K, Zhang ZM, Reid LM, Zhu XY, Yuan GS, et al. (2010) A Meta-analysis of QTL associated with ear rot resistance in maize. Maydica 55: 281–290. pmid:20009198
- 37. Brooks TD, Williams WP, Windham GL, Willcox MC, Abbas HK (2005) Quantitative Trait Loci Contributing Resistance to Aflatoxin Accumulation in the Maize Inbred Mp313E. Crop Sci 45: 171–174.
- 38. Alwala S, Kimbeng CA, Williams WP, Kang MS (2008) Molecular Markers Associated with Resistance to Aspergillus flavus in Maize Grain: QTL and Discriminant Analyses. Journal of New Seeds 9: 1–18.
- 39. Warburton M, Brooks T, Windham G, Paul Williams W (2011) Identification of novel QTL contributing resistance to aflatoxin accumulation in maize. Molecular Breeding 27: 491–499.
- 40. Evans HC, Holmes KA, Phillips W, Wilkinson MJ (2002) What's in a name: Crinipellis, the final resting place for the frosty pod rot pathogen of cocoa? Mycologist 16: 148–152.
- 41. Wisser RJ, Balint-Kurti PJ, Nelson RJ (2006) The Genetic Architecture of Disease Resistance in Maize: A Synthesis of Published Studies. Phytopathology 96: 120–129. pmid:18943914
- 42. Carson ML, Balint-Kurti PJ, Blanco M, Millard M, Duvick S, et al. (2006) Registration of Nine High-Yielding Tropical by Temperate Maize Germplasm Lines Adapted for the Southern USA Registration by CSSA. Crop Sci 46: 1825–1826.
- 43. Flint-Garcia SA, Buckler ES, Tiffin P, Ersoz E, Springer NM (2009) Heterosis Is Prevalent for Multiple Traits in Diverse Maize Germplasm. PLoS ONE 4: e7433. pmid:19823591
- 44. Nelson PT, Coles ND, Holland JB, Bubeck DM, Smith S, et al. (2008) Molecular Characterization of Maize Inbreds with Expired U.S. Plant Variety Protection Crop Sci 48: 1673–1685.
- 45. Nelson PT, Goodman MM (2008) Evaluation of Elite Exotic Maize Inbreds for Use in Temperate Breeding Crop Sci 48: 85–92.
- 46. Nelson PT, Jines MP, Goodman MM (2006) Selecting among available, elite tropical maize inbreds for use in long-term temperate breeding. Maydica 51: 255–262.
- 47. Ortiz R, Taba S, Tovar VHC, Mezzalama M, Xu Y, et al. (2010) Conserving and Enhancing Maize Genetic Resources as Global Public Goods–A Perspective from CIMMYT Crop Sci 50: 13–28.
- 48. Whitehead FC, Caton HG, Hallauer AR, Vasal S, Cordova H (2006) Incorporation of elite subtropical and tropical maize germplasm into elite temperate germplasm. Maydica 51: 43–56.
- 49. Flint-Garcia SA, Thuillet A-C, Yu J, Pressoir G, Romero SM, et al. (2005) Maize association population: a high-resolution platform for quantitative trait locus dissection. The Plant Journal 44: 1054–1064. pmid:16359397
- 50. McMullen MD, Kresovich S, Villeda HS, Bradbury P, Li H, et al. (2009) Genetic Properties of the Maize Nested Association Mapping Population. Science 325: 737–740. pmid:19661427
- 51. Mikel MA, Dudley JW (2006) Evolution of North American Dent Corn from Public to Proprietary Germplasm. Crop Sci 46: 1193–1205.
- 52. Yan J, Warburton M, Crouch J (2011) Association Mapping for Enhancing Maize (Zea mays L.) Genetic Improvement Crop Sci 51: 433–449.
- 53. Cairns JE, Crossa J, Zaidi PH, Grudloyma P, Sanchez C, et al. (2013) Identification of Drought, Heat, and Combined Drought and Heat Tolerant Donors in Maize. Crop Sci 53: 1335–1346.
- 54. Cullis B, Gogel B, Verbyla A, Thompson R (1998) Spatial Analysis of Multi-Environment Early Generation Variety Trials. Biometrics 54: 1–18.
- 55. Cullis BR, Smith AB, Coombes NE (2006) On the design of early generation variety trials with correlated data. Journal of Agricultural, Biological, and Environmental Statistics 11: 381–393.
- 56. Gilmour AR, Cullis BR, Verbyla AP (1997) Accounting for Natural and Extraneous Variation in the Analysis of Field Experiments. Journal of Agricultural, Biological, and Environmental Statistics 2: 269–293.
- 57. Gilmour AR, Gogel BJ, Cullis BR, Thompson R (2009) ASReml User Guide Release 3.0 Hemel Hempstead, HP1 1ES, UK: VSN International Ltd. pmid:25506961
- 58. Malosetti M, Ribaut J-M, van Eeuwijk FA (2013) The statistical analysis of multi-environment data: modeling genotype-by-environment interaction and its genetic basis. Frontiers in physiology 4. pmid:24550833
- 59. Piepho H-P, Pillen K (2004) Mixed modelling for QTL× environment interaction analysis. Euphytica 137: 147–153.
- 60. Yu J, Pressoir G, Briggs WH, Vroh Bi I, Yamasaki M, et al. (2006) A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet 38: 203–208. pmid:16380716
- 61. Han B, Kang HM, Eskin E (2009) Rapid and Accurate Multiple Testing Correction and Power Estimation for Millions of Correlated Markers. PLoS Genet 5: e1000456. pmid:19381255
- 62. Gao X, Starmer J, Martin ER (2008) A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms. Genetic Epidemiology 32: 361–369. pmid:18271029
- 63. Gao X (2011) Multiple testing corrections for imputed SNPs. Genetic Epidemiology 35: 154–158. pmid:21254223
- 64. Larsson SJ, Lipka AE, Buckler ES (2013) Lessons from Dwarf8 on the Strengths and Weaknesses of Structured Association Mapping. PLoS Genetics 9: e1003246. pmid:23437002
- 65. Murray SC, Rooney WL, Mitchell SE, Sharma A, Klein PE, et al. (2008) Genetic improvement of sorghum as a biofuel feedstock: II. QTL for stem and leaf structural carbohydrates. Crop Science 48: 2180–2193. pmid:18956863
- 66. Murray SC, Rooney WL, Hamblin MT, Mitchell SE, Kresovich S (2009) Sweet sorghum genetic diversity and association mapping for brix and height. The Plant Genome 2: 48–62.
- 67. Warburton ML, Williams WP, Windham GL, Murray SC, Xu W, et al. (2013) Characterization of a maize association mapping panel for new sources of Aspergillus flavus and aflatoxin accumulation resistance. In press.
- 68. De La Fuente GN, Murray SC, Isakeit T, Park Y-S, Yan Y, et al. (2013) Characterization of Genetic Diversity and Linkage Disequilibrium of ZmLOX and ZmLOX5 Loci in Maize. PLoS ONE 8: e53973. pmid:23365644
- 69. Betrán FJ, Bockholt A, Fojt F, Mayfield K, Pietsch D (2004) Registration of Tx714 Maize Germplasm Line Crop Sci 44: 1028–1028.
- 70. Wicklow D, Norton R, McAlpin C (1998) β-Carotene inhibition of aflatoxin biosynthesis amongAspergillus flavus genotypes from Illinois corn. Mycoscience 39: 167–172.
- 71. Windham GL, Williams WP, Buckley PM, Abbas HK (2003) Inoculation Techniques Used to Quantify Aflatoxin Resistance in Corn. Toxin Reviews 22: 313–325.
- 72. Walker RD, White DG (2001) Inheritance of Resistance to Aspergillus Ear Rot and Aflatoxin Production of Corn from CI2. Plant Disease 85: 322–327.
- 73. Windham GL, Williams PW, Buckley PM (2005) Techniques used to identify aflatoxin-resistant corn. Aflatoxin and Food Safety: CRC Press. pp. 407–422.
- 74. Oakey H, Verbyla A, Pitchford W, Cullis B, Kuchel H (2006) Joint modeling of additive and non-additive genetic line effects in single field trials. Theoretical and Applied Genetics 113: 809–819. pmid:16896718
- 75. Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, et al. (2011) A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species. PLoS ONE 6: e19379. pmid:21573248
- 76. Nei M (1972) Genetic Distance Between Populations. American Naturalist 106: 283–292.
- 77. Pritchard JK, Stephens M, Rosenberg NA, Donnelly P (2000) Association Mapping in Structured Populations. The American Journal of Human Genetics 67: 170–181.
- 78. Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software structure: a simulation study. Molecular Ecology 14: 2611–2620. pmid:15969739
- 79. Lipka AE, Tian F, Wang Q, Peiffer J, Li M, et al. (2012) GAPIT: genome association and prediction integrated tool. Bioinformatics 28: 2397–2399. pmid:22796960
- 80. Kang HM, Zaitlen NA, Wade CM, Kirby A, Heckerman D, et al. (2008) Efficient Control of Population Structure in Model Organism Association Mapping. Genetics 178: 1709–1723. pmid:18385116
- 81. Zhang Z, Ersoz E, Lai C-Q, Todhunter RJ, Tiwari HK, et al. (2010) Mixed linear model approach adapted for genome-wide association studies. Nat Genet 42: 355–360. pmid:20208535
- 82. Myles S, Peiffer J, Brown PJ, Ersoz ES, Zhang Z, et al. (2009) Association Mapping: Critical Considerations Shift from Genotyping to Experimental Design. The Plant Cell Online 21: 2194–2202. pmid:19654263
- 83. Moskvina V, Schmidt KM (2008) On multiple-testing correction in genome-wide association studies. Genetic Epidemiology 32: 567–573. pmid:18425821
- 84. Gao X, Becker LC, Becker DM, Starmer JD, Province MA (2010) Avoiding the high Bonferroni penalty in genome-wide association studies. Genetic Epidemiology 34: 100–105. pmid:19434714
- 85. Benjamini Y, Hochberg Y (1995) Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological) 57: 289–300.
- 86. Badu-Apraku B, Fakorede MAB, Menkir A, Kamara AY, Adam A (2004) Effects of drought screening methodology on genetic variances and covariances in Pool 16 DT maize population. The Journal of Agricultural Science 142: 445–452.
- 87. Banziger M, Edmeades GO, Lafitte HR (1999) Selection for drought tolerance increases maize yields across a range of nitrogen levels. Crop science 39: 1035–1040.
- 88. Bänziger M, Lafitte HR (1997) Efficiency of Secondary Traits for Improving Maize for Low-Nitrogen Target Environments. Crop Sci 37: 1110–1117.
- 89. Chapman SC, Edmeades GO (1999) Selection Improves Drought Tolerance in Tropical Maize Populations: II. Direct and Correlated Responses among Secondary Traits. Crop Sci 39: 1315–1324.
- 90. Sabadin PK, Malosetti M, Boer MP, Tardin FD, Santos FG, et al. (2012) Studying the genetic basis of drought tolerance in sorghum by managed stress trials and adjustments for phenological and plant height differences. Theoretical and Applied Genetics 124: 1389–1402. pmid:22297563
- 91. Campbell BC, Molyneux RJ, Schatzki TF (2003) Current Research on Reducing Pre‐ and Post‐harvest Aflatoxin Contamination of U.S. Almond, Pistachio, and Walnut. Toxin Reviews 22: 225–266.
- 92. Cullis BR, Gleeson AC (1991) Spatial Analysis of Field Experiments-An Extension to Two Dimensions. Biometrics 47: 1449–1460.
- 93. van Eeuwijk FA, Bink MCAM, Chenu K, Chapman SC (2010) Detection and use of QTL for complex traits in multiple environments. Current Opinion in Plant Biology 13: 193–205. pmid:20137999
- 94. Mathews K, Malosetti M, Chapman S, McIntyre L, Reynolds M, et al. (2008) Multi-environment QTL mixed models for drought stress adaptation in wheat. Theoretical and Applied Genetics 117: 1077–1091. pmid:18696042
- 95. Boer MP, Wright D, Feng L, Podlich DW, Luo L, et al. (2007) A Mixed-Model Quantitative Trait Loci (QTL) Analysis for Multiple-Environment Trial Data Using Environmental Covariables for QTL-by-Environment Interactions, With an Example in Maize. Genetics 177: 1801–1813. pmid:17947443
- 96. Betrán FJ, Ribaut JM, Beck D, de León DG (2003) Genetic Diversity, Specific Combining Ability, and Heterosis in Tropical Maize under Stress and Nonstress Environments. Crop Sci 43: 797–806.
- 97. Pritchard JK, Stephens M, Donnelly P (2000) Inference of Population Structure Using Multilocus Genotype Data. Genetics 155: 945–959. pmid:10835412
- 98. Ajmone-Marsan P, Gorni C, Chittò A, Redaelli R, van Vijk R, et al. (2001) Identification of QTLs for grain yield and grain-related traits of maize (Zeamays L.) using an AFLP map, different testers, and cofactor analysis. Theoretical and Applied Genetics 102: 230–243.
- 99. Ajmone-Marsan P, Monfredini G, Ludwig WF, Melchinger AE, Franceschini P, et al. (1995) In an elite cross of maize a major quantitative trait locus controls one-fourth of the genetic variation for grain yield. Theoretical and Applied Genetics 90: 415–424. pmid:24173932
- 100. Ajmone-Marsan P, Monfredini PG, Ludwig WF, Melchinger AE, Franceschini GP, et al. (1994) Identification of genomic regions affecting plant height and their relationship with grain yield in an elite maize cross. Maydica 39: 133–139.
- 101. Austin DF, Lee M (1996) Comparative mapping in F2∶3 and F6∶7 generations of quantitative trait loci for grain yield and yield components in maize. Theoretical and Applied Genetics 92: 817–826. pmid:24166546
- 102. Beavis WD, Smith OS, Grant D, Fincher R (1994) Identification of Quantitative Trait Loci Using a Small Sample of Topcrossed and F4 Progeny from Maize. Crop Sci 34: 882–896.
- 103. Veldboom LR, Lee M (1994) Molecular-marker-facilitated studies of morphological traits in maize. II: Determination of QTLs for grain yield and yield components. Theoretical and Applied Genetics 89: 451–458. pmid:24177894
- 104. Veldboom LR, Lee M (1996a) Genetic Mapping of Quantitative Trait Loci in Maize in Stress and Nonstress Environments: I. Grain Yield and Yield Components. Crop Sci 36: 1310–1319.
- 105. Veldboom LR, Lee M (1996b) Genetic Mapping of Qunatitative Trait Loci in Maize in Stress and Nonstress Environments: II. Plant Height and Flowering. Crop Sci 36: 1320–1327.
- 106. Lee B-h, Johnston R, Yang Y, Gallavotti A, Kojima M, et al. (2009) Studies of aberrant phyllotaxy1 Mutants of Maize Indicate Complex Interactions between Auxin and Cytokinin Signaling in the Shoot Apical Meristem. Plant Physiology 150: 205–216. pmid:19321707
- 107. Schön CC, Utz HF, Groh S, Truberg B, Openshaw S, et al. (2004) Quantitative Trait Locus Mapping Based on Resampling in a Vast Maize Testcross Experiment and Its Relevance to Quantitative Genetics for Complex Traits. Genetics 167: 485–498. pmid:15166171
- 108. Chia J-M, Song C, Bradbury PJ, Costich D, de Leon N, et al. (2012) Maize HapMap2 identifies extant variation from a genome in flux. Nat Genet 44: 803–807. pmid:22660545
- 109. Gore MA, Chia J-M, Elshire RJ, Sun Q, Ersoz ES, et al. (2009) A First-Generation Haplotype Map of Maize. Science 326: 1115–1117. pmid:19965431
- 110. Remington DL, Thornsberry JM, Matsuoka Y, Wilson LM, Whitt SR, et al. (2001) Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proceedings of the National Academy of Sciences 98: 11479–11484. pmid:11562485
- 111. Yan J, Shah T, Warburton ML, Buckler ES, McMullen MD, et al. (2009) Genetic Characterization and Linkage Disequilibrium Estimation of a Global Maize Collection Using SNP Markers. PLoS ONE 4: e8451. pmid:20041112
- 112. Sekhon RS, Lin H, Childs KL, Hansey CN, Buell CR, et al. (2011) Genome-wide atlas of transcription during maize development. The Plant Journal 66: 553–563. pmid:21299659
- 113. Winter D, Vinegar B, Nahal H, Ammar R, Wilson GV, et al. (2007) An “Electronic Fluorescent Pictograph” Browser for Exploring and Analyzing Large-Scale Biological Data Sets. PLoS ONE 2: e718. pmid:17684564
- 114. Goldman IL, Rocheford TR, Dudley JW (1993) Quantitative trait loci influencing protein and starch concentration in the Illinois Long Term Selection maize strains. Theoretical and Applied Genetics 87: 217–224. pmid:24190215
- 115. Goldman IL, Rocheford TR, Dudley JW (1994) Molecular Markers Associated with Maize Kernel Oil Concentration in an Illinois High Protein × Illinois Low Protein Cross. Crop Sci 34: 908–915.
- 116. Schaeffer M, Byrne P, Coe EH (2006) Consesus quantitative trait maps in maize: A database strategy. Maydica 51: 357–367.
- 117. Romay MC, Millard MJ, Glaubitz JC, Peiffer JA, Swarts KL, et al. (2013) Comprehensive genotyping of the USA national maize inbred seed bank. Genome biology 14: R55. pmid:23759205
- 118. Bohn MM, Khairallah M, Jiang C, González-de-León D, Hoisington DA, et al. (1997) QTL Mapping in Tropical Maize: II. Comparison of Genomic Regions for Resistance to Diatraea spp. Crop Sci 37: 1892–1902.
- 119. Buckler ES, Holland JB, Bradbury PJ, Acharya CB, Brown PJ, et al. (2009) The Genetic Architecture of Maize Flowering Time. Science 325: 714–718. pmid:19661422
- 120. Mideros SX, Warburton ML, Jamann TM, Windham GL, Williams WP, et al. (2014) Quantitative Trait Loci Influencing Mycotoxin Contamination of Maize: Analysis by Linkage Mapping, Characterization of Near-Isogenic Lines, and Meta-Analysis. Crop Science 54: 127–142.
- 121. Atwell S, Huang YS, Vilhjalmsson BJ, Willems G, Horton M, et al. (2010) Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature 465: 627–631. pmid:20336072
- 122. Breseghello F, Sorrells ME (2006) Association Mapping of Kernel Size and Milling Quality in Wheat (Triticum aestivum L.) Cultivars. Genetics 172: 1165–1177. pmid:16079235
- 123. González-Martínez SC, Wheeler NC, Ersoz E, Nelson CD, Neale DB (2007) Association Genetics in Pinus taeda L. I. Wood Property Traits. Genetics 175: 399–409. pmid:17110498
- 124. Neale DB, Savolainen O (2004) Association genetics of complex traits in conifers. Trends in Plant Science 9: 325–330. pmid:15231277
- 125. Pasam R, Sharma R, Malosetti M, van Eeuwijk F, Haseneyer G, et al. (2012) Genome-wide association studies for agronomical traits in a world wide spring barley collection. BMC Plant Biology 12: 16. pmid:22284310
- 126. Quesada T, Gopal V, Cumbie WP, Eckert AJ, Wegrzyn JL, et al. (2010) Association Mapping of Quantitative Disease Resistance in a Natural Population of Loblolly Pine (Pinus taeda L.). Genetics 186: 677–686. pmid:20628037
- 127. Krill AM, Kirst M, Kochian LV, Buckler ES, Hoekenga OA (2010) Association and Linkage Analysis of Aluminum Tolerance Genes in Maize. PLoS ONE 5: e9958. pmid:20376361
- 128. Weber AL, Zhao Q, McMullen MD, Doebley JF (2009) Using Association Mapping in Teosinte to Investigate the Function of Maize Selection-Candidate Genes. PLoS ONE 4: e8227. pmid:20011044
- 129. Wilson LM, Whitt SR, Ibáñez AM, Rocheford TR, Goodman MM, et al. (2004) Dissection of Maize Kernel Composition and Starch Production by Candidate Gene Association. The Plant Cell Online 16: 2719–2733. pmid:15377761
- 130. Pongpanich M, Sullivan PF, Tzeng J-Y (2010) A quality control algorithm for filtering SNPs in genome-wide association studies. Bioinformatics 26: 1731–1737. pmid:20501555