A pearl millet inbred germplasm association panel (PMiGAP) comprising 250 inbred lines, representative of cultivated germplasm from Africa and Asia, elite improved open-pollinated cultivars, hybrid parental inbreds and inbred mapping population parents, was recently established. This study presents the first report of genetic diversity in PMiGAP and its exploitation for association mapping of drought tolerance traits. For diversity and genetic structure analysis, PMiGAP was genotyped with 37 SSR and CISP markers representing all seven linkage groups. For association analysis, it was phenotyped for yield and yield components and morpho-physiological traits under both well-watered and drought conditions, and genotyped with SNPs and InDels from seventeen genes underlying a major validated drought tolerance (DT) QTL. The average gene diversity in PMiGAP was 0.54. The STRUCTURE analysis revealed six subpopulations within PMiGAP. Significant associations were obtained for 22 SNPs and 3 InDels from 13 genes under different treatments. Seven SNP associations from 5 genes were common under irrigated and one of the drought stress treatments. Most significantly, an important SNP in a putative acetyl CoA carboxylase gene showed constitutive association with grain yield, grain harvest index and panicle yield under all treatments. An InDel in a putative chlorophyll a/b binding protein gene was significantly associated with both stay-green and grain yield traits under drought stress. This can be used as a functional marker for selecting high yielding genotypes with ‘stay green’ phenotype under drought stress. The present study identified useful marker-trait associations of important agronomic traits under irrigated and drought stress conditions with genes underlying a major validated DT-QTL in pearl millet. Results suggest that PMiGAP is a useful panel for association mapping.
Expression patterns of genes also shed light on some physiological mechanisms underlying pearl millet drought tolerance.
Citation: Sehgal D, Skot L, Singh R, Srivastava RK, Das SP, Taunk J, et al. (2015) Exploring Potential of Pearl Millet Germplasm Association Panel for Association Mapping of Drought Tolerance Traits. PLoS ONE 10(5): e0122165. https://doi.org/10.1371/journal.pone.0122165
Academic Editor: Swarup Kumar Parida, National Institute of Plant Genome Research (NIPGR), INDIA
Received: November 10, 2014; Accepted: February 7, 2015; Published: May 13, 2015
Copyright: © 2015 Sehgal et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The authors wish to acknowledge Biotechnology and Biological Sciences Research Council (BBSRC) and Department for International Development (DFID) for funding the work via grant number BB/F004133/1. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Pearl millet [Pennisetum glaucum (L.) R. Br.] is the sixth most important global cereal crop and is grown by subsistence farmers in the semi-arid regions of sub-Saharan Africa and the Indian subcontinent. It is the main source of food for 500 million of the poorest people living predominantly in parts of Asia and Africa. It is the hardiest cereal crop that can be grown in a vast range of harsh environmental conditions, for instance, environments with high mean temperatures and frequent droughts and/or with poor soil fertility. Pearl millet grain has relatively high nutritional value compared to wheat, rice and maize in terms of both protein content and amino acid composition [2–4]. It also has superior levels of grain Fe and Zn [5, 6]. Furthermore, pearl millet has relatively high energy density compared to maize, wheat or sorghum.
Tremendous phenotypic variability exists in cultivated pearl millet for many agronomic traits such as flowering time, panicle length, grain and stover characteristics, and for tolerance to many biotic and abiotic stresses [8, 9]. Despite this, narrow gene pools are used for generating pearl millet varieties and hybrids, and the use of wild pearl millets and landrace germplasm, except as donors of specific traits such as apomixis or resistance to pests and diseases, is very limited. Further, most of the allele mining for agronomically important traits including biotic and abiotic stress resistance has been achieved so far using bi-parental mapping populations [12–17]. For example, based on mapping in conventional bi-parental mapping populations, many quantitative trait loci (QTLs) for drought tolerance [14–16], downy mildew resistance, and yield and stover quality traits have been identified over the past twenty years.
While conventional linkage mapping has identified a number of important quantitative trait loci in pearl millet, it has been severely limited by the resolution provided by two (or a few) parent-derived mapping populations. Due to small (79–125) population sizes and the early generations used, resolution has been in the range of 10–30 cM in these studies. Therefore, there is a need to explore improved alternatives for allele mining for improving pearl millet open-pollinated varieties and hybrids. Association mapping, also known as linkage disequilibrium (LD) mapping, offers an alternative means of allele mining, which utilizes ancestral recombination events in germplasm collections or natural populations to make marker-phenotype associations. This approach has four major advantages over conventional QTL mapping. Firstly, a much larger and more diverse gene pool having thousands of recombination events is surveyed. Secondly, it saves the time and labour that go into developing mapping populations and enables the mapping of many traits in a single panel. Thirdly, the high mapping resolution achieved in association mapping results in small confidence intervals of the detected loci and sometimes even resolves the candidate quantitative trait gene (QTG). Finally, it has the potential to identify the causal polymorphism within a gene and/or quantitative trait nucleotide in a QTG linked with the two alternative phenotypes. Association mapping has been successfully used in many crops such as maize, barley, sorghum, rice, and common wheat to detect important markers or genes [20–26].
Recently, a pearl millet germplasm association panel (PMiGAP) comprising 250 inbred lines has been assembled from a large set of 1000 diverse breeding lines and accessions of landraces, elite cultivars and mapping population parents, collected from a wide geographical range in Africa and Asia. It is anticipated that PMiGAP will provide the pearl millet community with a high-resolution platform for fine mapping of QTLs and/or for allele mining of favourable genes of agronomic importance. To test whether PMiGAP is a good panel for association analysis, candidate gene-based association mapping was performed using 17 genes underlying a major QTL for drought tolerance on linkage group 2. For diversity and genetic structure analysis, PMiGAP was genotyped with 37 SSR and CISP markers representing all seven linkage groups. For association analysis, it was phenotyped for yield and yield components and morpho-physiological traits under both well-watered and drought conditions, and genotyped with SNP and InDel markers designed from seventeen genes in the present and our previous study.
Materials and Methods
Permission to carry out field experiments was obtained from the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Hyderabad, India, and phenotyping was carried out in the fields of ICRISAT, Patancheru, Hyderabad, India.
Plant material and DNA extraction
Two hundred and fifty inbred lines of PMiGAP, representing 23 countries across three continents (S1 Table), were used for association analysis. The PMiGAP has been developed from a pearl millet core collection [8, 29], landraces, cultivars and breeding lines representing the entire cultivated global diversity of pearl millet. The selfing programme to develop inbred lines from these open-pollinated entries began in 2007/08 at ICRISAT using single-row plots of 4-m length of each source population under standard agronomic practices to raise a healthy crop. One selfed panicle was harvested from each of three representative plants of each of the 250 accessions. The S1 seeds were harvested on a single-plant basis, and head-to-row progenies from each accession were grown in triplets of single-row plots of 4-m length during the early post-rainy season in 2008. Three selfed panicles were harvested from one of the three representative S1 progenies to produce S2 seeds, with selection for slightly reduced vigour (to accelerate the rate of progress towards homozygosity), adequate selfed seed set, and typical plant architecture. Following the same methodology of selfing and selection, the generation was advanced to S3 (summer, 2009), S4 (late rainy season, 2009), and S5 (summer, 2010). During summer 2010, the 250 PMiGAP entries were testcrossed to one tester, ICMA 843–22, using bulk pollen from individual entries. Seeds of the selfed inbreds and testcrosses were harvested and threshed. In the 2010 late rainy season sowing, all 250 lines were sown as single-row plots of 4-m row length. Leaf tissue from five representative plants of each of the 250 entries was collected at the late seedling stage, bulked for DNA extraction and stored at -80°C. DNA was extracted from frozen leaf tissue using the DNeasy plant DNA kit (Qiagen, Hilden, Germany) and was quantified using a NanoDrop 1000 spectrophotometer.
Phenotyping of PMiGAP under drought stress
The 250 PMiGAP entries were assigned to four precocity groups (61, 63, 63 and 63 entries in early, medium early, medium and late maturity groups, respectively) so that flowering time would not confound the results of drought tolerance. They were phenotyped as testcross hybrid trials during the summer seasons (January to May) of 2011 and 2012 on the experimental farm of the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), located in Patancheru, Telangana, India (altitude 545 m above mean sea level, latitude 17.53° N and longitude 78.27° E). In both seasons, experiments were laid out in alpha-lattice designs with two replications in 3 test environments. Individual plots were 4.0 m long; net (harvested) plot area was two rows by 3.0 by 0.6 m. Four checks (843-22A x ICMR 01004, 843-22A x ICMR 01029, 863B x ICMR 01004 and 841A x D 23) were repeated across trials and used in calculating adjusted means of the traits. The 3 test environments consisted of two terminal drought stress treatments, an early-onset and a late-onset of terminal stress, plus a common, fully-irrigated non-stress treatment. Drought stress in the more severe early-onset treatment was initiated at 50% flowering, by withholding irrigation from about 1 week before flowering. Drought stress in the late-onset treatment was initiated during early grain filling by withholding irrigation from approximately 50% flowering. These entries were evaluated for 16 morphological, morpho-physiological and agronomical traits. The investigated traits were grain yield (GY), panicle yield (PY), panicle harvest index (PHI), time to 75% flowering (FT), plant height (PH), panicle length (PL), panicle diameter (PD), panicle number (PN), number of tillers per plant (TPP), biomass yield (BY), grain harvest index (GHI), thousand grain weight (TGW), grain number per panicle (GNPP), grain number per m² (GNPM), stay green (SG) and leaf rolling (LR). These traits were measured as described in [14, 15].
Briefly, PH, PL, and PD were measured from main stems of five representative plants of each entry in a plot. At harvest, data were recorded from the harvested area on plant numbers (PC), head numbers (HC) and fresh stover yield (FSY). Effective tiller number (ET) was calculated as the ratio HC/PC. PY, GY and TGW were recorded after oven drying for approximately 24 h. Stover dry matter yield (SDMY) was estimated from plot FSY using the fresh and dry weights of a chopped subsample of stover from each plot. BY was calculated as PY + SDMY on a plot basis. Panicle grain number (PGN) was derived from these primary data as (GY/HC)/(TGW/1000). Grain harvest index represents the ratio between grain yield and biomass yield at harvest, and panicle harvest index the ratio between grain weight and panicle weight. Flowering time was recorded as days from seedling emergence to stigma emergence in 75% of the main shoots in a plot. Leaf rolling was scored from the day the first symptom became visible on PMiGAP genotypes (normally about 10 days after the last irrigation) until 80% of the leaves had rolled in the water stress treatment, using a scale of 1 (20% leaf rolled) to 5 (80% and above rolled). Similarly, stay green was scored from the day leaf drying became visible, using a scale of 1 (20% leaf staying green) to 5 (80% leaf staying green).
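The derived traits described above can be illustrated with a minimal Python sketch. The function and argument names are hypothetical, and scaling FSY by the dry fraction of the stover subsample is an assumption about how SDMY was estimated; the remaining formulas follow the text.

```python
def derived_plot_traits(pc, hc, py, gy, tgw, fsy, sub_fresh, sub_dry):
    """Derive plot-level traits from primary harvest data (illustrative only).

    pc, hc             : plant count and head (panicle) count in the harvested area
    py, gy             : oven-dry panicle yield and grain yield of the plot
    tgw                : thousand grain weight, in the same mass unit as gy
    fsy                : fresh stover yield of the plot
    sub_fresh, sub_dry : fresh and dry weights of the chopped stover subsample
    """
    et = hc / pc                        # effective tiller number, HC/PC
    sdmy = fsy * (sub_dry / sub_fresh)  # assumed: FSY scaled by subsample dry fraction
    by = py + sdmy                      # biomass yield, PY + SDMY
    pgn = (gy / hc) / (tgw / 1000.0)    # panicle grain number, (GY/HC)/(TGW/1000)
    ghi = gy / by                       # grain harvest index
    phi = gy / py                       # panicle harvest index
    return {"ET": et, "SDMY": sdmy, "BY": by, "PGN": pgn, "GHI": ghi, "PHI": phi}


# Made-up plot values, used only to show the call.
example = derived_plot_traits(pc=20, hc=40, py=1000.0, gy=700.0, tgw=10.0,
                              fsy=4000.0, sub_fresh=500.0, sub_dry=125.0)
```

All yields must be expressed in the same mass unit for the ratios to be meaningful.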
Phenotypic data analyses
All phenotypic analyses were conducted using GenStat 14th Edition. The minimum, maximum and mean values of each trait in each environment were calculated using the summary statistics option in GenStat. Normality of the data for each trait was checked by drawing normal plots. For each environment, restricted maximum likelihood (REML) analysis was performed with replications as a fixed effect and entries as random effects. For the combined environment analysis, a REML model was used with environments and replications*environments as fixed effects and entries and entries*environments as random effects. The REML model produced best linear unbiased predictors (BLUPs) and variance components for all traits. The variance components were used for calculating broad sense heritability (h²) for traits in each environment and over combined environments. For each environment, h² was calculated as h² = Vg / (Vg + Verr/r), where Vg is the genotypic variance, Verr is the error variance and r is the number of replications for a single environment. For combined environments, h² was calculated as h² = Vg / (Vg + VGxE/n + Verr/(nr)), where Vg is the genotypic variance, VGxE is the genotype x environment interaction variance, Verr is the error variance, n is the number of environments and r is the number of replications. The phenotypic correlations among traits were obtained using Minitab 15 (http://www.minitab.com/en-IT/products/minitab/).
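As a minimal sketch of the two heritability formulas above, the following Python functions take REML variance components as inputs. The function names and the numeric values are hypothetical and serve only to illustrate the arithmetic.

```python
def h2_single_environment(vg, verr, r):
    """Broad sense heritability for one environment: Vg / (Vg + Verr/r)."""
    return vg / (vg + verr / r)


def h2_combined_environments(vg, vgxe, verr, n, r):
    """Broad sense heritability across environments:
    Vg / (Vg + VGxE/n + Verr/(n*r))."""
    return vg / (vg + vgxe / n + verr / (n * r))


# Illustrative variance components (not taken from the study).
h2_env = h2_single_environment(vg=4.0, verr=2.0, r=2)
h2_all = h2_combined_environments(vg=4.0, vgxe=3.0, verr=6.0, n=3, r=2)
```

With these made-up inputs the single-environment estimate is 4/(4 + 1) = 0.8 and the combined estimate is 4/(4 + 1 + 1), about 0.67, showing how genotype x environment variance lowers the across-environment value.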
Genotyping of PMiGAP entries with genome-wide SSR markers was done using M13-tailed (5′-CACGACGTTGTAAAACGAC-3′) forward primers as described previously. The PCR products were resolved on an ABI 3730 DNA Sequencer (Applied Biosystems, CA, USA). The GeneMapper program, version 3.7 (Applied Biosystems, Foster City, CA, USA), was used for reading and scoring alleles.
Genotyping with gene-based conserved intron spanning primers (CISP) and single nucleotide polymorphism (SNP) markers
Genotyping of PMiGAP entries with InDel markers was done as described by Sehgal et al. For SNP genotyping, the seventeen genes underlying the major DT-QTL were amplified and sequenced in 48 randomly selected entries using the protocols described in Sehgal et al. Sequences obtained from the 48 genotypes were aligned using MACAW 2.05 software and the putative SNPs were also verified on the sequence chromatograms as described in Sehgal et al. All the SNPs obtained in individual genes were used for genotyping the 250 PMiGAP entries using the KASPar genotyping system (Kbiosciences, UK).
Model-based population structure analysis
The Bayesian model-based population structure analysis implemented in STRUCTURE v2.2 was used to analyse the population genetic structure in PMiGAP. K (genetic groups) values from 1 to 15 were tested by applying the ‘no admixture’ and the ‘correlated allele frequency’ models. Three independent runs were performed for each K, and the number of iterations was set to 50,000 for both the burn-in and the Markov chain Monte Carlo (MCMC) periods. Once all the runs were finished, a zip archive containing all of the results-f files was created and used as input to the Structure Harvester program (http://taylor0.biology.ucla.edu/structureHarvester/index.php) to estimate ∆K. ∆K is an ad hoc measure which identifies the number of subpopulations by estimating the rate of change in the log probability of data between successive K values.
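The ∆K statistic can be sketched in a few lines of Python. This is a simplified illustration, not the Structure Harvester code: it assumes ∆K is the absolute second-order difference of the mean log probability across runs, divided by the standard deviation of the log probability at that K. The function name and the input values are hypothetical.

```python
from statistics import mean, stdev


def delta_k(lnp_by_k):
    """Compute Delta K for each K that has both neighbours K-1 and K+1.

    lnp_by_k maps K to the list of ln P(D) values from independent runs.
    Delta K(K) = |m(K+1) - 2*m(K) + m(K-1)| / sd(K), where m is the mean
    and sd the sample standard deviation across runs.
    """
    means = {k: mean(v) for k, v in lnp_by_k.items()}
    result = {}
    for k in sorted(lnp_by_k):
        if k - 1 in means and k + 1 in means:
            sd = stdev(lnp_by_k[k])
            if sd > 0:  # Delta K is undefined when runs are identical
                second_diff = abs(means[k + 1] - 2 * means[k] + means[k - 1])
                result[k] = second_diff / sd
    return result


# Made-up log probabilities for three runs per K.
runs = {
    1: [-1000, -1002, -998],
    2: [-900, -901, -899],
    3: [-890, -892, -888],
    4: [-885, -886, -884],
}
dk = delta_k(runs)
best_k = max(dk, key=dk.get)
```

Because the statistic needs both neighbours, it cannot be evaluated at the smallest or largest K tested.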
Genetic diversity assessment and cluster analysis
The assessment of genetic diversity was conducted using the software POPGENE version 1.32. The following genetic diversity parameters were used: average number of alleles per locus, effective number of alleles, observed heterozygosity, expected heterozygosity, Shannon’s information index and Nei’s genetic diversity index. Genetic distances between subpopulations were calculated with Nei’s parameter using the DARWin 5 programme. Principal coordinate analysis (PCA) was also performed using DARWin 5.
Linkage disequilibrium (LD) analysis
Linkage disequilibrium was estimated with the software program TASSEL 3.0 (http://www.maizegenetics.net). LD significance was determined with 100,000 permutations for each locus. The squared correlation coefficients (r²) between loci were used for quantifying LD. LD was calculated for each candidate gene separately.
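For readers unfamiliar with the r² statistic, the following Python sketch computes it for a pair of biallelic loci. It assumes fully homozygous inbred lines coded 0 or 1 per locus, with None for missing calls, and is not a reproduction of TASSEL's implementation; the function name and genotype vectors are hypothetical.

```python
def ld_r2(locus_a, locus_b):
    """Squared correlation (r^2) between two biallelic loci.

    Each locus is a list of 0/1 allele codes, one per inbred line;
    lines with a missing call (None) at either locus are skipped.
    r^2 = D^2 / (pA * (1 - pA) * pB * (1 - pB)), with D = pAB - pA * pB.
    """
    pairs = [(a, b) for a, b in zip(locus_a, locus_b)
             if a is not None and b is not None]
    n = len(pairs)
    p_a = sum(a for a, _ in pairs) / n
    p_b = sum(b for _, b in pairs) / n
    p_ab = sum(a * b for a, b in pairs) / n
    d = p_ab - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return float("nan")  # a monomorphic locus carries no LD information
    return d * d / denom


complete_ld = ld_r2([0, 0, 1, 1], [0, 0, 1, 1])  # identical allele patterns
no_ld = ld_r2([0, 0, 1, 1], [0, 1, 0, 1])        # independent allele patterns
```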
Kinship matrix and marker-trait associations
A pairwise kinship matrix was estimated using the program SPAGeDi. Negative values between pairs of individuals were set to 0 in the resulting matrix. Marker-trait associations were determined using TASSEL version 3.0 (http://www.maizegenetics.net), employing both the general linear model (based on the Q- or PCA-matrix) and the mixed linear model (based on the Q- or PCA-matrix and the kinship matrix).
The significance of marker-trait associations (MTAs) was initially based on FDR-adjusted P-values with the cut-off set at 0.05. However, FDR-adjusted P-values were found to be highly stringent. Hence, considering the potential risk of type II error, we used another criterion as described in Pasam et al. Based on this approach, the P-values within the bottom 0.1 percentile of the distribution are considered significant. A threshold P-value of 0.05 corresponded to the bottom 0.1 percentile in the present study, and was therefore used to declare significant MTAs.
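To illustrate why FDR adjustment can be stringent, the following Python sketch applies the Benjamini-Hochberg step-up procedure to a list of P-values. That the FDR adjustment used here was Benjamini-Hochberg is an assumption, since the cited method is not named in the text; the function name and the P-values are hypothetical.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted P-values (assumed FDR method).

    For the P-value of rank i (1 = smallest) among m tests, the adjusted
    value is the minimum of p * m / rank over that rank and all larger
    ranks, capped at 1.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down so the running minimum
    # enforces monotonicity of the adjusted values.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


raw = [0.01, 0.04, 0.03, 0.005]
adj = bh_adjust(raw)
significant = [p <= 0.05 for p in adj]
```

Adjusted values are never smaller than the raw ones, so with many markers and traits tested, associations with modest raw P-values can fail the 0.05 cut-off after adjustment.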
Semi-quantitative RT-PCR of candidate genes
Seeds of three contrasting genotypes, H 77/833-2 (drought susceptible), PRLT 2/89-33 (drought tolerant) and ICMR 01029 (near isogenic line introgressed with the drought tolerance QTL on LG2 from the drought tolerant parent PRLT 2/89-33), were sown in 8-inch pots filled with compost in a controlled environment set to the following conditions: 10 h light/14 h dark, temperature 20°C (night)/28°C (day) and relative humidity of 60% (day)/80% (night), suitable for pearl millet growth. Three replicates per genotype were sown in the pots and plants were grown till maturity. Water stress was initiated by withholding water supply on the 45th day after sowing (a week after panicle emergence) and continued for 7 days. Leaves were harvested in liquid nitrogen from the plants in both control and stress treatments, and stored at -80°C. Total RNA was isolated from the frozen leaves of the drought-stressed and control materials using the RNeasy Plant Mini Kit (Qiagen, UK) following the manufacturer’s protocol. RNA was treated with RNase-free DNase (Qiagen, UK) to remove genomic DNA contamination, followed by inactivation of DNase with the RNeasy MinElute cleanup kit (Qiagen, UK) according to the manufacturer’s instructions. The purified RNA was quantified at 260 nm in a spectrophotometer. The quality of the RNA was checked by both gel analysis and the 260/280 and 260/230 nm spectrophotometric ratios. The RNA was stored at -80°C before use. One microgram of total RNA was used to synthesize cDNA using the iScript™ cDNA synthesis kit (Biorad, UK) as per the manufacturer’s protocol. Real-time PCR was performed using the SYBR Green PCR master mix kit (Applied Biosystems, Foster City, CA, USA) according to the manufacturer’s recommendations. The following PCR program was used: 10 min at 95°C; 40 cycles of 95°C for 15 s, 54°C for 1 min and 72°C for 40 s; and a final cycle at 72°C for 7 min.
Relative mRNA accumulation of each candidate sequence was compared to the control by using the standard comparative Ct (2^−∆∆Ct) method. The average Ct value was calculated from 3 replicates for the housekeeping actin gene and for each candidate sequence in control and stressed cDNA samples. ∆Ct was then determined as the difference between the two values.
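The comparative Ct calculation can be sketched as follows in Python. The function name and the Ct values are hypothetical; the sketch assumes ∆Ct is the candidate Ct minus the actin Ct within each treatment, and ∆∆Ct is the stressed ∆Ct minus the control ∆Ct.

```python
def relative_expression(ct_gene_stress, ct_actin_stress,
                        ct_gene_control, ct_actin_control):
    """Fold change of a candidate gene under stress by the 2^-ddCt method.

    Each argument is a list of replicate Ct values. The housekeeping gene
    (actin) normalises for cDNA input within each treatment.
    """
    def avg(values):
        return sum(values) / len(values)

    d_ct_stress = avg(ct_gene_stress) - avg(ct_actin_stress)
    d_ct_control = avg(ct_gene_control) - avg(ct_actin_control)
    dd_ct = d_ct_stress - d_ct_control
    return 2 ** (-dd_ct)


# Made-up Ct values for three replicates per sample.
fold = relative_expression(
    ct_gene_stress=[22.0, 22.0, 22.0],
    ct_actin_stress=[18.0, 18.0, 18.0],
    ct_gene_control=[24.0, 24.0, 24.0],
    ct_actin_control=[18.0, 18.0, 18.0],
)
```

In this example the candidate reaches threshold two cycles earlier under stress after normalisation, which corresponds to a fourfold higher relative transcript level, under the usual assumption of near-100% amplification efficiency.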
Genetic diversity in pearl millet germplasm association panel (PMiGAP)
A total of 37 SSR and InDel markers, covering all seven linkage groups, were used to evaluate the genetic diversity in PMiGAP. A total of 259 alleles, ranging from 2 (Xibmsp09/AP, Xibmsp07/AP, Xibmcp11/AP and Xibmsp43/AP) to 16 (Xipes0236) per locus, were detected across the 250 pearl millet accessions, representing an average allele number (A) of 7 per locus (Table 1). When alleles with a frequency lower than 0.05 were excluded, the total reduced to 117 alleles (3.16 alleles per locus). The average effective number of alleles (Ae) for the 37 markers was 2.71. The difference between A and Ae indicates that a large proportion of the alleles (54.8%) had frequencies lower than 5%, with 40.1% of them (57 alleles) present in a single genotype. The Shannon’s information index ranged from 0.20 (Xicmp 3056) to 2.21 (Xpsmp2203), with a mean of 1.06 for all loci. The Nei’s gene diversity ranged from 0.07 (Xibmsp31/AP) to 0.87 (Xpsmp2203), with a mean of 0.54 for all loci.
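The per-locus diversity statistics reported above can be illustrated with a short Python sketch. It uses the standard textbook definitions (Nei's gene diversity as 1 − Σp², effective number of alleles as 1/Σp², Shannon's index as −Σp ln p) and is assumed, not confirmed, to match the POPGENE output; the function name and allele calls are hypothetical.

```python
from collections import Counter
from math import log


def locus_diversity(alleles, min_freq=0.05):
    """Diversity statistics for one locus from a list of allele calls."""
    counts = Counter(alleles)
    total = sum(counts.values())
    freqs = [c / total for c in counts.values()]
    sum_p2 = sum(p * p for p in freqs)
    return {
        "A": len(freqs),                                     # observed alleles
        "A_common": sum(1 for p in freqs if p >= min_freq),  # alleles at >= 5%
        "Ae": 1.0 / sum_p2,                                  # effective alleles
        "H": 1.0 - sum_p2,                                   # Nei's gene diversity
        "I": -sum(p * log(p) for p in freqs),                # Shannon's index
    }


# Made-up allele calls (fragment sizes) at one SSR locus.
stats = locus_diversity([180, 180, 184, 184])
```

Rare alleles add to A but contribute little to Σp², which is why Ae can be much smaller than A when many alleles occur at low frequency.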
The model-based STRUCTURE analysis was performed for K populations varying from 1 to 15. ∆K was highest at K = 6 (Fig 1) and topologically meaningful clustering was captured at K = 6. Notwithstanding the geographic origins, the six subpopulations (A, B, C, D, E and F) identified by STRUCTURE (Fig 2) corroborated pedigree and/or characteristics of the lines. Population A (red) included sixteen accessions from west and east Africa, one accession from southern Africa, one accession from central Africa and two accessions from India. Eight accessions sourced from the ICRISAT gene bank were also included in this population. These eight accessions (PRLT 2/89-33, ICMP 451-P6, ICMP 451-P8, ICMP 85410-P7, GS 156, D2 WS, GB 8735 and P1449-3) shared either a pedigree or geographic origin with other west and east African accessions in population A. For instance, both ICMP 451-P6 and ICMP 451-P8 are derived from LCSN 72-1-2-1-1, a selection made in Burkina Faso (West Africa). The accession ICMP 85410-P7 is a derivative of a cross based on germplasm from east and west Africa. Similarly, PRLT 2/89-33 is an inbred derived from the ICRISAT Bold Seeded Early Composite, based predominantly on Iniadi landrace germplasm from West Africa. Population B (green) was a large group comprising 48 accessions from seemingly diverse geographic origins, including 19 accessions from west Africa, 5 accessions from east Africa, 4 accessions from southern Africa, 2 accessions from central Africa, 9 accessions from India and one accession each from USA, Yemen and South Africa. Six accessions (ICMS 7703, H 77/833-2, 843B, ICMB 89111, IP 8275, GS 154) from the ICRISAT gene bank were also part of this large group. As expected, 843B and its derivative ICMB 89111 (bred at ICRISAT using 843B as one of the parents) were grouped together in population B. Lines with similar traits, but with diverse origins, were generally characteristic of this population.
For example, accessions from Yemen (IP 20349) and USA (IP 21155) are characterized by more than 12 productive tillers. The accessions from India and Niger included in this population are either salinity tolerant (IP 6101, IP 3732) or thermo-tolerant (IP 21517, IP 3175). Similarly, three accessions from east and southern Africa (IP 13363, IP 19386, IP 19448) are characterized by thick panicles (>50 mm). Two early maturing accessions from west Africa and India (IP 13520, IP 9532) were also part of this population. Population C (blue) comprised 9 accessions from west Africa, 2 accessions each from east and central Africa and 5 accessions from Asia (India and Pakistan). Population C also contained a large number (11) of breeding lines released by ICRISAT for different breeding programs. For example, 81B-P6 and ICMB 90111-P6 are downy mildew (DM) resistant selections which have been widely used as mapping population parental lines (MPPLs). Similarly, LGD 1-B-10, P310-17-B, P1449-2-P1 and WSIL-P8 developed by ICRISAT have been widely used as MPPLs. Another two lines, SOSAT-C88 and Raj 171, released by ICRISAT on the west coast of Africa and in India, respectively, were also included in this population. Population D (yellow) comprised 44 accessions, predominantly from west Africa (18), with 4 accessions each from east, central and southern Africa and one accession each from South Africa and India. This population also included a large number of breeding lines/cultivars developed and released by ICRISAT. The lines Okashana 1 (released in Namibia), GICKV 93191 and WC-C75 (released in India), 863B (drought tolerant; MPPL), IP 18293-P152 (MPPL), ICML 1 and ICML 2 (resistant to ergot) and five others were part of this subpopulation. Population E (purple) contained 50 accessions, with 60% of them originating from west (21) and east (8) Africa.
The remaining accessions included 3 accessions from central Africa, 9 accessions from India, 1 accession each from Pakistan and USA and 7 breeding lines/cultivars released by ICRISAT. Similar to population B, lines with similar traits yet diverse origins were grouped in this population. For example, salinity tolerant lines from India and Niger (IP 3757, IP 6102) and early maturing (<37 days) lines from Togo and Pakistan (IP 18132, IP 17720) were part of this population. Population F (turquoise) was the largest group with 51 accessions of diverse origins including 15 accessions from west Africa, 9 accessions each from east and central Africa, 11 accessions from India, 1 accession each from South Africa and Pakistan, 2 Tifton accessions (Tift 186 and Tift 383) from USA and a few breeding lines released by ICRISAT. Again, lines with similar traits, regardless of geographic origins, were grouped in this population. For example, drought tolerant lines from Uganda and Ghana (IP 8955, IP 9406), lines with sweet stalk from India, Cameroon, Burkina Faso and Nigeria (IP 3471, IP 14439, IP 13817, IP 12128), lines with purplish-black seed colour from Sudan (IP 10759, IP 13324), lines with yellow endosperm from Burkina Faso (IP 15536, IP 15533) and lines with thick panicle from Zimbabwe and Burkina Faso (IP 16403, IP 12845) were all included in population F.
Each accession is represented by a thin vertical line, which can be partitioned into six colored segments representing estimated membership probabilities (Q) of the individual to the six clusters.
The distance-based principal coordinate analysis (PCA) showed grouping fairly consistent with the model-based grouping, except for population D, which merged with populations F and A (results not shown). Genetic diversity parameters calculated for the six subpopulations are given in Table 2. The diversity parameters were also estimated in populations from 26 geographic regions (S2 Table). The highest observed (5.2) and effective (2.9) numbers of alleles were found in the population from India. The expected heterozygosity was also highest in the population from India (0.56), followed by Niger and Mali (0.54). When only populations from Africa are compared, the highest number of observed alleles was obtained in the population from Burkina Faso (4.60), followed by Niger and Nigeria (4.26 and 4.24, respectively), and the highest effective number of alleles was obtained in the population from Niger (2.83) (S2 Table).
Moisture environments effects
The effect of the drought environment was evident on all measured traits (S3 and S4 Tables). The average reduction in grain yield due to drought stress was 24.2 and 29.2% in 2011 and 2012, respectively (S3 and S4 Tables). Similar effects were observed on all yield components, but the absolute reductions varied with trait. Panicle yield (19.5 and 22.1% reduction in 2011 and 2012, respectively), biomass yield (23.2 and 19.7% reduction in 2011 and 2012, respectively), and grain number per square meter (15.5 and 16.1% reduction in 2011 and 2012, respectively) were the traits most affected by drought stress (S3 and S4 Tables). Grain harvest index (GHI) represents the ratio of grain to biomass yield, and panicle harvest index (PHI) is the ratio of grain to total panicle weights. GHI has been advocated as an index of the ability to convert biomass to grain and PHI as a measure of the ability to set and fill grains. We observed nearly similar reductions in GHI and PHI under late stress conditions, but greater reductions under early stress conditions (S3 Table). These results suggest a greater effect of the early stress (before flowering) on productive tiller number.
Heritabilities and correlation of traits
The heritability was generally high (>0.75) or moderately high (0.50–0.75) for all the traits under control conditions (S3 Table). The heritability estimates in early and late stress treatments ranged from 0.60 (BY) to 0.94 (FT) and 0.44 (BY) to 0.94 (FT), respectively. Across environments, heritability estimates for panicle and biomass yield were low (<0.50) and were moderate for grain yield and panicle harvest index (≥0.50). For the remaining traits, these estimates were fairly high (>0.70) (S3 Table). The correlation values among traits under control and drought stress conditions are provided in S5 and S6 Tables. In general, GY and FT were negatively correlated under both control and drought stress conditions, but the correlation was not significant under drought stress conditions. A significant negative correlation between GY and LR and a positive correlation between GY and SG under drought stress conditions in both years (S6 Table) suggest that LR and SG have more effect on GY than FT under terminal drought stress in pearl millet. The 3D contour plots confirm this (Fig 3A and 3B). The genotypes with stay green scores above 2.0 showed higher grain yield than the genotypes with scores below 2.0 (Fig 3C). The opposite trend was observed for leaf rolling and grain yield, i.e. genotypes with leaf rolling scores above 2.0 had significantly lower yield than genotypes with scores below 2.0 (Fig 3D).
Effects of stay green (c) and leaf rolling (d) on grain yield under drought stress.
Traits GNPM, PN and GNPP showed significant differences among the six subpopulations under all treatments. Populations C and D had the highest GNPP and GNPM under both control and two drought stress environments while population B had the highest PN under all treatments. Other phenotypic traits were not significantly different among the subpopulations (results not shown).
Frequency of SNPs in candidate genes and linkage disequilibrium
Previously, we mapped 17 genes as SNP and CISP markers on LG2 in the drought tolerance QTL region. In this study, we genotyped PMiGAP with these gene-based markers and also re-sequenced the genes in a subset of 48 randomly selected genotypes of PMiGAP to identify more SNPs and InDels. The PMiGAP was finally genotyped with 39 SNP and 7 InDel markers from 17 genes. A total of 251 SNPs, with an average of one SNP per 38 bp, were identified in the 9487 bp sequenced region from 17 candidate genes. The number of SNPs per gene ranged from one in putative acetyl CoA carboxylase, protein phosphatase 1 regulatory subunit SDS22 and PSI reaction center subunit III to 60 in alanine glyoxylate aminotransferase. The average SNP frequency ranged from one SNP per 10 bp in PSI reaction center subunit III to one SNP per 600 bp in acetyl CoA carboxylase (Table 3). The InDel markers ranged from 1 bp (serine carboxypeptidase III precursor) to 20 bp (protein phosphatase 1 regulatory subunit SDS22) (Table 4).
The extent of LD was assessed among all 1,575 pairs of loci (LD calculated for loci mapped on LG2) for all accessions as well as for the six subpopulations separately. Across all accessions, 28% of the total marker pairs were in LD (based on squared correlation coefficient r2; P<0.01). When LD was calculated within each subpopulation, the frequency of pairs of loci with significant LD (P<0.05) was reduced by more than half.
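For readers unfamiliar with the r2 statistic used here, the following is a minimal sketch of how it can be computed for one pair of biallelic loci. It assumes fully homozygous inbred lines, so that each line contributes one observable haplotype, and it codes alleles as 0/1 with `None` for missing calls. These conventions and the function name `r_squared` are illustrative assumptions, not a description of the software used in the study, and the significance test behind the reported P values is not shown.

```python
def r_squared(locus_a, locus_b):
    """Squared allele-frequency correlation (r^2) between two biallelic loci.

    Each argument is a list with one entry per inbred line: 0 or 1 for the
    two alleles, None for a missing call (an illustrative coding).
    """
    # Keep only lines scored at both loci.
    pairs = [(a, b) for a, b in zip(locus_a, locus_b)
             if a is not None and b is not None]
    n = len(pairs)
    if n == 0:
        raise ValueError("no lines scored at both loci")

    p_a = sum(a for a, _ in pairs) / n            # frequency of allele 1 at locus A
    p_b = sum(b for _, b in pairs) / n            # frequency of allele 1 at locus B
    p_ab = sum(1 for a, b in pairs if a == 1 and b == 1) / n  # 1-1 haplotype

    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0  # a monomorphic locus: r^2 is undefined, reported as 0 here

    d = p_ab - p_a * p_b                          # disequilibrium coefficient D
    return d * d / denom


# Two loci in complete LD, and two statistically independent loci.
print(r_squared([0, 1, 0, 1], [0, 1, 0, 1]))
print(r_squared([0, 0, 1, 1], [0, 1, 0, 1]))
```

Averaging this value over all SNP pairs within a gene gives an intragenic mean of the kind summarized per candidate gene.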
For candidate genes where multiple SNPs were genotyped across all PMiGAP entries, average intragenic LD (r2) values were calculated (Fig 4). The average r2 ranged from 0 in alanine glyoxylate aminotransferase, catalase, serine/threonine protein kinase and serine carboxypeptidase III precursor to 1.0 in putative Chlorophyll a/b binding protein and phytochrome C (Fig 4). The average r2 in putative Zn finger CCCH type, ubiquitin conjugating enzyme, acyl CoA oxidase, phosphoglycerate kinase, uridylate kinase, actin depolymerizing factor, dipeptidyl peptidase IV and photolyase was 0.02, 0.32, 0.17, 0.10, 0.02, 0.29, 0.09 and 0.18, respectively.
The squared correlation coefficient (r2) values are denoted by a color scale from white (0.0) to red (1.0) in the upper triangle. The p values ranging from non-significant (0.01; white) to highly significant (<0.0001; red) are shown in the lower triangle. 1. Uridylate kinase 2. Acyl CoA oxidase 3. Zn finger CCCH type 4. Ubiquitin conjugating enzyme 5. Actin depolymerising factor 6. Phytochrome C 7. Dipeptidyl peptidase IV 8. Serine carboxypeptidase 9. Serine/threonine protein kinase 10. Phosphoglycerate kinase 11. Chl a/b binding protein 12. Catalase 13. Alanine glyoxalate aminotransferase and 14. Photolyase.
Association between genes and traits
The two model approaches, general linear model (GLM) and mixed linear model (MLM), were compared for all traits using both Q-matrix and three PCAs in both models. The results were the same irrespective of the matrix (Q- or PCA-matrix) chosen in the models. The QQ plots of traits (S1 Fig) suggested that the MLM model is superior at accounting for spurious associations resulting from population structure and/or familial relatedness. Here we present results of only MLM model-based associations. Briefly, out of 39 SNPs and 7 InDels genotyped in 250 genotypes of PMiGAP, we found significant (P<0.05) associations for 22 SNPs and 3 InDels from 13 candidate genes with drought tolerance traits. Out of 6 InDel markers developed in the present study (Table 4), we obtained significant associations with two InDel markers, one each in uridylate kinase and phosphoglycerate kinase genes. One InDel marker designed from the chlorophyll a/b binding protein gene (Xibmcp09) in a previous study showed significant association with traits only under drought stress conditions. Table 5 shows SNPs and InDels from 17 genes associated with traits under different environments (irrigated and two drought stress treatments). Fig 5 shows significant allelic effects of various genes on different traits.
Semi-quantitative RT PCR of candidate genes
Semi-quantitative PCR was conducted on four candidate gene sequences (putative Zn finger CCCH type, chlorophyll a/b binding protein, ubiquitin conjugating enzyme and serine/threonine protein kinase) using drought tolerant (PRLT 2/89-33) and susceptible (H77/833-2) parents of a mapping population which was used to identify DT-QTL on LG2 [14, 15], and a near isogenic line (NIL) ICMR 1029 having DT-QTL introgressed from PRLT 2/89-33. All four genes showed differential expression between drought tolerant and sensitive parents under drought stress (Fig 6). In the drought tolerant parent and NIL, transcript accumulation of putative Zn finger CCCH type and chlorophyll a/b binding protein increased significantly, whereas in the drought sensitive parent their levels decreased under drought stress (Fig 6). The transcript levels of putative ubiquitin conjugating enzyme and serine/threonine protein kinase decreased by ≥2-fold in the tolerant parent and NIL as compared to the sensitive parent H77/833-2 (Fig 6).
A pearl millet germplasm association panel (PMiGAP), comprised of 250 landraces, elite cultivars and mapping population parents, collected from a wide geographical range in Africa and Asia, was recently established from the global pearl millet germplasm collection of ICRISAT. The present study represents the first report of genetic diversity in PMiGAP and exploitation of this germplasm panel for association mapping of drought tolerance traits.
Across the 250 PMiGAP entries examined, we obtained a total gene diversity value of 0.54. This value is close to gene diversity estimates obtained previously for a pearl millet world collection from Africa and Asia [38, 39] and higher than gene diversity estimates obtained by Budak et al. and Mariac et al. for a world-wide collection of pearl millet maintained in the Plant Genetic Resources Conservation Unit at the University of Georgia and cultivated pearl millet landraces from Niger, respectively. However, the gene diversity estimates in PMiGAP are lower than those obtained for pearl millet landraces derived from West and Central Africa. The lower gene diversity found here, despite a large representation from West and Central Africa, could be because we used many gene-based CISP (conserved intron spanning primers) markers that were developed previously, in addition to SSRs with different repeat motifs. In contrast, Stich et al. used a higher proportion of dinucleotide SSR markers, which are more variable than both CISPs (which produce two alleles at a locus) and SSRs with longer repeat motifs. The average number of alleles (7.0) in the present study was close to those reported previously in pearl millet with SSR markers [38–41] but lower than that reported by Stich et al., for the same reason.
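As a reminder of what a gene diversity value such as 0.54 measures, the sketch below computes the simple form of Nei's gene diversity, one minus the sum of squared allele frequencies, per locus and averages it over loci. The exact estimator and software used in the study are not stated in this section, so this form is an assumption; the function names and the toy allele calls are hypothetical and are not data from PMiGAP.

```python
from collections import Counter


def gene_diversity(alleles):
    """Gene diversity at one locus: 1 - sum of squared allele frequencies.

    `alleles` holds one allele call per inbred line; None marks missing data.
    """
    counts = Counter(a for a in alleles if a is not None)
    n = sum(counts.values())
    if n == 0:
        raise ValueError("no allele calls at this locus")
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def mean_gene_diversity(loci):
    """Average gene diversity over a dict mapping locus name -> allele calls."""
    values = [gene_diversity(calls) for calls in loci.values()]
    return sum(values) / len(values)


# Toy example: a two-allele marker versus a four-allele marker.
toy = {
    "two_allele_marker": ["A", "A", "B", "B"],
    "four_allele_marker": ["a1", "a2", "a3", "a4"],
}
print(mean_gene_diversity(toy))
```

The toy case illustrates the argument made above: a marker limited to two alleles cannot exceed a diversity of 0.5, whereas a multi-allelic SSR can, so a panel genotyped with many biallelic markers will tend to show a lower average.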
Comparison of genetic diversity parameters, estimated based on twenty-six geographic origins, revealed the highest observed and effective number of alleles in genotypes from India followed by Burkina Faso, Nigeria, Niger and Mali (S2 Table). The expected heterozygosity was also highest in genotypes from India followed by Niger and Mali. Similar trend was also observed for number of private alleles (S2 Table). These results indicate the highest genetic diversity in populations of pearl millet from India followed by West Africa. The results also suggest that pearl millet collections from India can be a useful resource for introducing novel variations into elite germplasm.
The STRUCTURE analysis revealed six sub-populations in PMiGAP (Fig 2). This was consistent with the results of PCA (results not shown). The six groups obtained within PMiGAP did not correspond with country of origin or agro-ecological zone but with pedigrees and/or similar agronomic traits. These results are consistent with previous reports of genetic diversity in pearl millet germplasm from similar geographic areas [38–40, 42]. The present and previous studies, therefore, suggest that genetic diversity in pearl millet has been shaped largely by diversifying human selection rather than geographic origin. The present results also support extensive germplasm exchange among different geographical regions.
The candidate gene-based association mapping is a promising approach to bridge the gap between quantitative and molecular genetic approaches for complex trait dissection. Using this approach, polymorphic sites within candidate genes are linked with phenotypic variation using statistical methods to identify causative polymorphisms. In the present study, 17 genes underlying a major drought tolerance (DT) QTL were sequenced to find SNPs (Table 3) and for studying their association with drought tolerance traits in PMiGAP. We obtained a higher SNP frequency in the present study (Table 3) as compared to our previous study, which is not unexpected considering the different number of genotypes used for SNP discovery in the two studies. In maize, a contrasting variation in SNP frequency was also reported in two different germplasm sets having different numbers of accessions [44, 45]. Our present results of SNP frequency are similar to those obtained in many other outcrossing species such as perennial ryegrass, maize, rye and sugar beet.
The linkage disequilibrium measure r2 varied widely from gene to gene in the present study (Fig 4). For example, it was 0 in putative alanine glyoxylate aminotransferase, catalase, serine/threonine protein kinase and serine carboxypeptidase III precursor and it was 1.0 in putative chlorophyll a/b binding protein and phytochrome C. The r2 in the remaining genes varied from less than 0.10 (putative Zn finger CCCH type, uridylate kinase and dipeptidyl peptidase IV) to as much as 0.32 (phosphoglycerate kinase, acyl CoA oxidase, photolyase, actin depolymerizing factor and ubiquitin conjugating enzyme). These results are not unexpected considering that linkage disequilibrium estimates vary according to the target region in the gene (introns, exons etc.) and the number of polymorphic sites [48, 49]. In sorghum and barley, r2 in the candidate genes (CGs) ranged from 0.024 to 0.21 and 0.0 to 1.0, respectively. The mean r2 was 0.23 for CGs in our study, which is comparable to r2 obtained for CGs in sorghum and Lolium perenne. Since for most of the genes studied here only a part was sequenced (ranging from 180 to 740 bp for different genes; Table 3), we could not study the decay of linkage disequilibrium along each CG. Full-length sequencing of the genes will become possible in future as the whole genome sequencing project reaches completion in pearl millet.
We tested both GLM and MLM models for all traits, and obtained QQ plots to choose the best model for association analysis. It has been reported that directly fitting both Q/PCA (structure) and K (kinship), without testing the model, may overcorrect for population structure and familial relatedness for some traits and result in type II error. No significant deviations were observed between the observed and expected values for the traits under different treatments using the MLM model (S1 Fig). Thus, based on QQ plots, we found that the MLM model is the best for controlling both type I and type II errors. Further, prior to initiating association analysis with CGs, we tested SSR markers underlying DT-QTL for association analysis using Q+K and PCA+K models. We obtained significant associations of Xpsmp2059, Xipes0152, Xicmp3056, Xipes0117 and Xipes0218 with many drought tolerance traits (results not shown), which confirms previous results of association of these markers with DT-QTL based on a bi-parental fine mapping population.
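The model choice described above rests on comparing observed association P values with those expected under the null hypothesis of no association. The sketch below shows one common way to build the points of such a QQ plot and to summarize the departure from the diagonal. The function names, the use of rank/(n+1) as the expected quantile, and the toy P values are illustrative assumptions and are not taken from the study.

```python
import math


def qq_points(p_values):
    """Return (expected, observed) -log10(P) pairs for a QQ plot.

    Under the null hypothesis P values are uniform on (0, 1), so the
    expected value of the k-th smallest of n P values is taken as k/(n+1).
    """
    observed = sorted(p_values)
    n = len(observed)
    if n == 0:
        raise ValueError("no P values supplied")
    if observed[0] <= 0 or observed[-1] > 1:
        raise ValueError("P values must lie in (0, 1]")
    points = []
    for rank, p in enumerate(observed, start=1):
        expected = rank / (n + 1)
        points.append((-math.log10(expected), -math.log10(p)))
    return points


def mean_excess(points):
    """Mean of observed minus expected -log10(P); near 0 means little inflation."""
    return sum(obs - exp for exp, obs in points) / len(points)


# Hypothetical P values: one set close to the null, one set inflated.
well_controlled = [0.25, 0.5, 0.75]
inflated = [0.001, 0.01, 0.1]
print(mean_excess(qq_points(well_controlled)))
print(mean_excess(qq_points(inflated)))
```

A model whose points sit on the diagonal (mean excess near zero) shows little inflation of small P values, and that is the pattern reported for the MLM model in S1 Fig.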
Out of 39 SNPs and 7 InDels from 17 genes genotyped in PMiGAP, significant associations were obtained for 22 SNPs and 3 InDels from 13 genes considering all treatments together (one irrigated and two drought stress treatments) (Table 5). Of these, four genes belong to classes of transcription factors (Zn finger CCCH type), signaling proteins (serine/threonine protein kinase and protein phosphatase 1) and/or regulatory proteins (ubiquitin conjugating enzyme). Zn finger CCCH type belongs to a large and diverse family of transcription factors, whose members play important roles in transcriptional regulation, RNA binding and protein-protein interactions. Phosphorylation and dephosphorylation, catalyzed by a variety of protein kinases and phosphatases, respectively, modulate a wide range of biological processes such as the stability and subcellular localization of target proteins, protein-protein interactions, and regulation of plant K+ channels in guard and mesophyll cells [54–56]. Similarly, ubiquitin conjugating enzyme is part of the ubiquitin proteasome system, which plays a significant role during senescence in remobilization of amino acids to supply the developing organs elsewhere in the plant. The major function of ubiquitination is to select target proteins for proteasomal degradation. The covalent attachment of ubiquitin to a target protein involves an enzyme cascade mediated by three enzymes, ubiquitin activating enzyme E1, ubiquitin conjugating enzyme E2 and ubiquitin ligase E3. Many recent reports have described the important roles played by Zn finger CCCH type transcription factors, serine/threonine protein kinases, protein phosphatases and ubiquitin conjugating enzymes in the regulation of plant response and adaptation to multiple abiotic stresses including drought stress [57–63]. However, whether these genes contribute to the natural variation in plant responses to drought stress is not known yet. Once proven, such a contribution can provide impetus to initiate gene-based breeding in many crops and speed up the development of stress-tolerant varieties. The present study is the first report of association of these important genes with natural variations in traits under drought stress in a germplasm panel through association analysis (Table 5).
Except for protein phosphatase 1 regulatory subunit SDS22, we obtained significant associations of SNPs in the remaining three genes under both irrigated and drought stress conditions. For example, SNPs Xibmsp15/AP1.1 (A/G) and Xibmsp15/AP1.2 (A/G) in Zn finger CCCH type are associated with panicle diameter (PD) and thousand grain weight (TGW), respectively, under both irrigated and late drought stress conditions. Since both PD and TGW are highly correlated with grain yield, these two SNPs can be used for marker-assisted selection (MAS) in pearl millet for improving yield under both irrigated and water-limited environments. The SNP in serine/threonine protein kinase gene, however, was not associated with the same trait under both irrigated and drought stress conditions (Table 5) i.e. it was associated with grain number per m2 (GNPM) under irrigated conditions and with panicle harvest index (PHI) under late stress conditions. Hence, these SNPs can be used for MAS either in specific environment or in combination with other SNPs from genes showing significant associations with GNPM and PHI. Similarly, the SNP in protein phosphatase 1 gene can be used for MAS for grain yield, panicle harvest index and panicle yield under drought stress conditions. For ubiquitin conjugating enzyme, we obtained significantly associated SNPs for grain harvest index, panicle yield and thousand grain weight under irrigated conditions and for stay green, grain number per panicle and grain harvest index under late drought stress conditions. The significant association of SNPs in ubiquitin conjugating enzyme (UBC) gene with stay green trait under terminal drought stress confirms the importance of this gene during leaf senescence [57, 64].
We obtained a positive correlation of grain yield with stay green under terminal drought stress treatments (Fig 3; S6 Table), which suggests that delayed senescence or stay green phenotype during reproductive and ripening stages is important for genetic improvement of grain yield in pearl millet under drought stress. In addition, a significant negative correlation of leaf rolling with grain yield points to the equal importance of high water extraction via high transpiration for reproductive success under terminal drought stress. Our present results are in agreement with recent findings of Kholova and Vadez in pearl millet, who demonstrated that sustained ‘stay green phenotype’ and ‘transpiration’ during grain filling are crucial for grain yield under drought stress. ‘Stay green’ is an important trait reflecting the capacity of the plant to photosynthesize. A strong correlation between the photosynthetic capacity and grain yield has been reported in many cereals including wheat and maize [66, 67]. For example, wheat genotypes with the ability to maintain green leaf area throughout grain filling duration have been suggested as potential candidates for improving yield in arid and semi-arid regions. Also in maize and sorghum, stay green lines have been used in breeding programmes to enhance yield of these important grain crops.
Of the genes that underlie DT-QTL, Zn finger CCCH type has been functionally characterized in rice [62, 70]. Transgenic rice overexpressing Zn finger CCCH type exhibited delayed senescence of leaves and retained more photosynthetic activity even at heading and seed-setting stages. In the present study, semi-quantitative RT-PCR of putative Zn finger CCCH type and chlorophyll a/b binding protein genes in drought tolerant and sensitive lines exposed to terminal drought stress revealed increased expression of both genes in drought tolerant lines and decreased expression in the drought sensitive line (Fig 6), thus suggesting a direct effect of this transcription factor on downstream genes for photosynthesis. These results are similar to those obtained in rice [62, 70] and further confirm the role of Zn finger CCCH type as a negative regulator of leaf senescence. Furthermore, it has been demonstrated that the potential targets of this transcription factor are key genes required to promote the leaf senescence program such as receptor-like kinases, which function as key components in the perception of environmental signals and subsequent phosphorylation cascades. The semi-quantitative RT-PCR carried out for putative serine/threonine protein kinase gene in drought tolerant and sensitive lines under drought stress, as expected, showed a 2–4 fold decreased expression in tolerant lines as compared to the sensitive genotype (Fig 6). In addition, a 2-fold decreased expression of putative UBC gene was observed in drought tolerant lines as compared to the sensitive genotype. These results indicate that key signaling enzymes and the ubiquitin proteasome system are repressed in pearl millet under terminal drought stress in order to keep the photosynthetic machinery intact.
We obtained significant association of an InDel in putative chlorophyll a/b binding protein gene with grain yield, thousand grain weight and panicle yield under both early and late drought stress conditions and with grain harvest index, panicle diameter, panicle harvest index, stay green and leaf rolling under late stress conditions (Table 5). Fig 5 (e, f and g) shows clearly that the genotypes carrying this InDel show stay green phenotype and correspondingly higher grain yield under terminal drought stress conditions. The chloroplast located light-harvesting chlorophyll a/b-binding proteins (LHCP) collect and transfer light energy to photosynthetic reaction centers [71, 72]. Recently, allelic variations in LHCP gene have also been reported to be associated with plant height, spike length, number of grains per spike, thousand grain weight, flag leaf area and leaf color under well-watered conditions in barley. The significant association of an InDel in LHCP gene with grain yield and yield components, and stay green traits under terminal drought stress is an important finding of the present study. This InDel can be used for MAS in pearl millet to select high yielding genotypes having ‘stay green’ phenotype under terminal drought stress.
The role of phytochromes in sensing moisture conditions has been reported. For example, PhyE has been associated with local adaptation to dry environments like alpine habitats. PhyB is reported to be involved in gas exchange rates [74, 75] and in the regulation of abscisic acid (ABA) metabolism. Recently, Boggs et al. demonstrated the roles of PhyA, PhyB and PhyE in influencing stomatal conductance plasticity via ABA regulation under drought stress. In the present study, we found significant associations of two SNPs in PhyC with many traits including stay green, panicle diameter, panicle harvest index and panicle length under both well-watered and drought stress conditions. Previously, PhyC has been shown to be associated with agronomic traits in pearl millet under well-watered conditions.
Acetyl CoA carboxylase (ACoC) is a key enzyme in fatty acid biosynthesis. Fatty acid biosynthesis involves the cyclic condensation of two-carbon units derived from acetyl-CoA. The first committed step in the pathway is the synthesis of malonyl-CoA from acetyl-CoA and CO2 by the enzyme ACoC. The tight regulation of ACoC controls the overall rate of fatty acid synthesis. It has been shown in barley that ceasing activity of ACoC inhibits growth and development of seedlings as there are not enough fatty acids to contribute to membrane structures. We genotyped a single SNP in ACoC gene in PMiGAP entries and found significant association of this SNP with many traits (Table 5), most importantly with grain yield, grain harvest index and panicle yield under all conditions, i.e. fully-irrigated non-stressed condition as well as early and late drought stress treatments. This SNP marker can be used for MAS in pearl millet for improving yield under both irrigated and terminal drought stress conditions (Table 5). Further, under late drought stress condition this SNP was associated with many other yield components, thus indicating the importance of this gene as the stress progresses.
Like the ubiquitin proteasome system, the prolyl oligopeptidase family plays a significant role in storage protein mobilization during defense system activity and senescence. The prolyl oligopeptidase family of serine proteases includes four members: prolyl oligopeptidase (POP), dipeptidyl peptidase IV (DPPIV), oligopeptidase B (OPB), and acylaminoacyl peptidase (ACPH). Of these, POP has been demonstrated to confer drought stress tolerance. The roles of other members of this family are not known yet. In the present study, we obtained evidence of the role of another member of this family, DPPIV, under drought stress. We identified significant associations of DPPIV with grain yield, grain number per panicle, stay green and grain harvest index under late drought stress conditions (Table 5).
Similar to the results demonstrated in this study, many genes have been reported to be associated with drought tolerance using association mapping [82–84], thus reinforcing the complexity of the trait. Of all the marker-trait associations obtained in the present study, 7 associations from putative genes Zn finger CCCH type, uridylate kinase, ubiquitin conjugating enzyme, acetyl CoA carboxylase and phytochrome C were common under irrigated and one (or both) of the drought stress treatments, indicating the potential of these associations to be applicable for MAS (Table 5). Of these, the most significant was the SNP in putative acetyl CoA carboxylase gene, which was significantly associated with grain yield, grain harvest index and panicle yield under all water stress treatments (Table 5). We did not obtain any significant association of putative catalase gene with any of the traits in our study. These results support those obtained previously in pearl millet, suggesting that the anti-oxidant machinery does not play a direct causal role in the terminal drought tolerance of pearl millet that is conferred by DT-QTL.
Correlation among traits is commonly used by breeders for either simultaneous improvement of correlated traits or for reducing undesirable side effects when improving only one of the correlated traits. A negative correlation is undesirable for breeders specifically when the goal is to simultaneously increase the trait values of two negatively correlated characters. Many examples can be cited for such negatively correlated traits. For example, a negative correlation exists between grain yield and protein content in maize, durum wheat, and soybean. In the same way, positive correlations can also be undesirable. For example, a positive correlation between high oil and beta-glucan concentrations in oats is highly undesirable for breeders targeting high oil yield with less beta-glucan. On the other hand, such positive correlations have been successfully exploited to improve difficult traits or traits involving costly measurements. For instance, in maize the anthesis-silking interval is positively correlated with grain yield and it has been used as an indirect selection criterion for improving grain yield under drought stress.
The two major reasons for such genetic trait correlations at the level of QTL are pleiotropy and/or close linkage of genes. In the present study, we chose a major DT-QTL on LG2 that is basically a drought-tolerance grain yield QTL with QTLs for grain harvest index, panicle harvest index, biomass yield and other yield components, and stay green/delayed senescence co-mapping to the same interval [14–16, 28, 65]. The results of the present study helped to understand the cause of co-mapping of these traits in this interval or alternatively the underlying genetic cause of these multiple-trait associations. We conclude that both tight linkage of genes and intragenic linkage of quantitative trait polymorphisms are the molecular basis of these multiple-trait associations. For instance, UBC, LHCP and PhyC are the key regulatory and downstream genes in the DT-QTL interval found to be associated with the stay green trait in the present study (Table 5). Association analysis revealed unambiguously that each of the SNPs (or InDel) in each of these three genes is also significantly associated with grain yield and/or yield components. From a plant breeding perspective, these results are desirable and encouraging. For example, a functional marker derived from InDel in LHCP, affecting stay green and grain yield simultaneously (Fig 5), can be fixed in the breeding materials.
In the present study, we investigated the genetic diversity and structure in PMiGAP and explored its potential for association analysis. We obtained high genetic diversity in PMiGAP and a moderate genetic structure, which is ideal for conducting association analysis. Using the MLM model, many significant marker-trait associations were obtained, some of which can be used as functional markers for selecting high yielding plants under irrigated and drought stress conditions. In particular, a SNP in putative acetyl CoA carboxylase gene and an InDel in putative chlorophyll a/b binding protein gene were the most important associations, worth using for MAS in pearl millet. Results also confirmed that PMiGAP is a useful panel for association mapping.
S1 Fig. QQ plots of GY, GHI, PHI and GNPM using (a) MLM and (b) GLM models.
GY = Grain yield, GHI = Grain harvest index, PHI = Panicle harvest index, GNPM = Grain number per m2, MLM = Mixed linear model, GLM = General linear model.
S1 Table. List of pearl millet accessions used in the study, their origin and characteristics, and estimated fraction of the accession’s genome that originates from six inferred subpopulations (subpopulations A, B, C, D, E and F).
S2 Table. Genetic diversity parameters in populations from 26 geographic origins.
S3 Table. Mean and heritability of the traits.
S4 Table. Reductions in growth and yield parameters due to drought stress in 2011 and 2012.
S5 Table. Correlation among traits under control conditions.
The authors wish to thank all support staff who contributed to the development of the PMiGAP, as well as to seed multiplication of the PMiGAP testcrosses and phenotyping of these testcrosses. The authors are also thankful to Mrs. Kirsten Skot for genotyping PMiGAP with SSRs and InDel markers at in-house facility at IBERS and P. Maheshwar Rao for phenotyping PMiGAP for leaf rolling and senescence.
Conceived and designed the experiments: DS RSY. Performed the experiments: DS RS. Analyzed the data: DS LS SPD. Contributed reagents/materials/analysis tools: CTH RKS PCS AGBR JT RP. Wrote the paper: DS RSY LS CTH.
- 1. Haussmann BIG, Fred Rattunde H, Weltzien-Rattunde E, Traoré PSC, vom Brocke K, Parzies HK. Breeding strategies for adaptation of pearl millet and sorghum to climate variability and change in West Africa. J Agron Crop Sci. 2012; 198: 327–339.
- 2. Goswami AK, Sharma KP, Sehgal KL. Nutritive value of proteins of pearl millet of high-yielding varieties and hybrids. British J Nutr. 1969; 23: 913–916. pmid:5357056
- 3. Sawaya WN, Khalil JK, Safi WJ. Nutritional quality of pearl millet flour and bread. Plant Foods Hum Nutr. 1984; 34: 117–125.
- 4. Ejeta G, Hassen MM, Mertz ET. In vitro digestibility and amino acid composition of pearl millet (Pennisetum typhoides) and other cereals. Proc Natl Acad Sci USA. 1987; 84: 6016–6019. pmid:3476923
- 5. Velu G, Rai KN, Muralidharan V, Kulkarni VN, Longvah T, Raveendran TS. Prospects of breeding biofortified pearl millet with high grain iron and zinc content. Plant Breed. 2007; 126: 182–185.
- 6. Govindaraj M, Rai KN, Shanmugasundaram P, Dwivedi SL, Sahrawat KL, Muthaiah AR, et al. Combining ability and heterosis for grain Iron and Zinc densities in pearl millet. Crop Sci. 2013; 53: 507–517.
- 7. Hill GM, Hanna WW. Nutritive characteristics of pearl millet grain in beef cattle diets. J Animal Sci. 1990; 68: 2061–2066. pmid:2384397
- 8. Bhattacharjee R, Khairwal IS, Bramel P, Reddy KN. Establishment of a pearl millet [Pennisetum glaucum (L.) R. Br.] core collection based on geographical distribution and quantitative traits. Euphytica. 2007; 155: 35–45.
- 9. Amadou I, Gounga ME, Le G-W. Millets, nutritional composition, some health benefits and processing. Emirate J Food Agric. 2013; 25: 501–508.
- 10. Ozias-Akins P, Roche D, Hanna WW. Tight clustering and hemizygosity of apomixis-linked molecular markers in Pennisetum squamulatum implies genetic control of apospory by a divergent locus that may have no allelic form in sexual genotypes. Proc Natl Acad Sci USA. 1998; 95: 5127–5132. pmid:9560240
- 11. Wilson JP, Hess DE, Hanna WW, Kumar KA, Gupta SC. Pennisetum glaucum subsp. monodii accessions with Striga resistance in West Africa. Crop Prot. 2004; 23: 865–870.
- 12. Jones ES, Liu CJ, Gale MD, Hash CT, Witcombe JR. Mapping quantitative trait loci for downy mildew resistance in pearl millet. Theor Appl Genet. 1995; 91: 448–456. pmid:24169834
- 13. Jones ES, Breese WA, Liu CJ, Singh SD, Shaw DS, Witcombe JR. Mapping quantitative trait loci for resistance to downy mildew in pearl millet field and glasshouse screens detect the same QTL. Crop Sci. 2002; 42: 1316–1323.
- 14. Yadav RS, Hash CT, Bidinger FR, Cavan GP, Howarth CJ. Quantitative trait loci associated with traits determining grain and stover yield in pearl millet under terminal drought-stress conditions. Theor Appl Genet. 2002; 104: 67–83. pmid:12579430
- 15. Yadav RS, Hash CT, Bidinger FR, Devos KM, Howarth CJ. Genomic regions associated with grain yield and aspects of post-flowering drought tolerance in pearl millet across stress environments and tester background. Euphytica. 2004; 136: 265–277.
- 16. Bidinger FR, Nepolean T, Hash CT, Yadav RS, Howarth CJ. Identification of QTLs for grain yield of pearl millet [Pennisetum glaucum (L.) R. Br.] in environments with variable moisture during grain filling. Crop Sci. 2007; 47: 969–980.
- 17. Gulia SK, Hash CT, Thakur RP, Breese WA, Sangwan RS, Singh DP, et al. Mapping new QTLs for downy mildew [Sclerospora graminicola (Sacc) J Schroet] resistance in pearl millet [Pennisetum glaucum (L) R. Br.]. In: Crop production in stress environments–genetic and management option. Jodhpur, India, Agrobios Publishers; 2007. pp. 373–386.
- 18. Nepolean T, Blummel M, Raj AB, Rajaram V, Senthilvel S, Hash CT. QTLs controlling yield and stover quality traits in pearl millet. International Sorghum Millets Newsletter. 2006; 47: 149–152.
- 19. Zhu C, Gore M, Buckler ES, Yu J. Status and prospects of association mapping in plants. The Plant Genome. 2008; 1: 5–20.
- 20. Kraakman AT, Niks RE, Van den Berg PM, Stam P, Van Eeuwijk FA. Linkage disequilibrium mapping of yield and yield stability in modern spring barley cultivars. Genetics. 2004; 168: 435–446. pmid:15454555
- 21. Agrama H, Eizenga G, Yan W. Association mapping of yield and its components in rice cultivars. Mol Breed. 2007; 19: 341–356.
- 22. Yao J, Wang LX, Liu LH, Zhao CP, Zheng YL. Association mapping of agronomic traits on chromosome 2A of wheat. Genetica. 2009; 137: 67–75. pmid:19160058
- 23. Jin L, Lu Y, Xiao P, Sun M, Corke H, Bao J. Genetic diversity and population structure of a diverse set of rice germplasm for association mapping. Theor Appl Genet. 2010; 121: 475–487. pmid:20364375
- 24. Neumann K, Kobiljski B, Denčić S, Varshney R, Börner A. Genome-wide association mapping: a case study in bread wheat (Triticum aestivum L.). Mol Breed. 2011; 27: 37–58.
- 25. Bhosale SU, Stich B, Rattunde HFW, Weltzien E, Haussmann BI, Hash CT, et al. Association analysis of photoperiodic flowering time genes in west and central African sorghum [Sorghum bicolor (L.) Moench]. BMC Plant Biol. 2012; 12: 32. pmid:22394582
- 26. Yu X, Bai G, Liu S, Luo N, Wang Y, Richmond DS, et al. Association of candidate genes with drought tolerance traits in diverse perennial ryegrass accessions. J Exp Bot. 2013; 64: 1537–1551. pmid:23386684
- 27. Yadav RS, Sehgal D, Vadez V. Using genetic mapping and genomics approaches in understanding and improving drought tolerance in pearl millet. J Exp Bot. 2011; 62: 397–408. pmid:20819788
- 28. Sehgal D, Rajaram V, Armstead IP, Vadez V, Yadav YP, Hash CT, et al. Integration of gene-based markers in a pearl millet genetic map for identification of candidate genes underlying drought tolerance quantitative trait loci. BMC Plant Biol. 2012; 12: 9. pmid:22251627
- 29. Upadhyaya HD, Gowda CLL, Reddy KN, Singh S. Augmenting the pearl millet core collection for enhancing germplasm utilization in crop improvement. Crop Sci. 2009; 49: 573–580.
- 30. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000; 155: 945–959. pmid:10835412
- 31. Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics. 2003; 164: 1567–1587. pmid:12930761
- 32. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005; 14: 2611–2620. pmid:15969739
- 33. Yeh FC, Boyle T. POPGENE version 1.3.2, Microsoft Windows-based freeware for population genetic analysis. 1999. Accessed: http://www.ualberta.ca/_fyeh/index.htm.
- 34. Perrier X, Jacquemoud-Collet JP. DARwin software. 2006. Accessed: http://darwin.cirad.fr.
- 35. Hardy OJ, Vekemans X. SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Notes. 2002; 2: 618–620.
- 36. Storey JD. A direct approach to false discovery rates. J R Stat Soc Series B Stat Methodol. 2002; 64: 479–498.
- 37. Pasam RK, Sharma R, Malosetti M, van Eeuwijk FA, Haseneyer G, Kilian B, et al. Genome-wide association studies for agronomical traits in a world wide spring barley collection. BMC Plant Biol. 2012; 12: 16. pmid:22284310
- 38. Oumar I, Mariac C, Pham J-L, Vigouroux Y. Phylogeny and origin of pearl millet (Pennisetum glaucum [L.] R. Br.) as revealed by microsatellite loci. Theor Appl Genet. 2008; 117: 489–497. pmid:18504539
- 39. Saïdou AA, Mariac C, Luong V, Pham JL, Bezançon G, Vigouroux Y. Association studies identify natural variation at PHYC linked to flowering time and morphological variation in pearl millet. Genetics. 2009; 182: 899–910. pmid:19433627
- 40. Budak H, Pedraza F, Cregan P, Baenziger P, Dweikat I. Development and utilization of SSRs to estimate the degree of genetic relationships in a collection of pearl millet germplasm. Crop Sci. 2003; 43: 2284–2290.
- 41. Mariac C, Luong V, Kapran I, Mamadou A, Sagnard F, Deu M, et al. Diversity of wild and cultivated pearl millet accessions (Pennisetum glaucum [L.] R. Br.) in Niger assessed by microsatellite markers. Theor Appl Genet. 2006; 114: 49–58. pmid:17047913
- 42. Stich B, Haussmann BI, Pasam R, Bhosale S, Hash CT, Melchinger AE, et al. Patterns of molecular and phenotypic diversity in pearl millet [Pennisetum glaucum (L.) R. Br.] from West and Central Africa and their relation to geographical and environmental parameters. BMC Plant Biol. 2010; 10: 216. pmid:20925912
- 43. Sehgal D, Yadav RS. Molecular marker based approaches for drought tolerance. In: Mohan JS, Brar DS, editors. Molecular techniques in crop improvement. 2nd ed. Springer; 1991. pp. 207–230.
- 44. Tenaillon MI, Sawkins MC, Long AD, Gaut RL, Doebley JF, Brandon S. Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp. mays L.). Proc Natl Acad Sci USA. 2001; 98: 9161–9166. pmid:11470895
- 45. Ching A, Caldwell KS, Jung M, Dolan M, Smith OS, Tingey S, et al. SNP frequency, haplotype structure and linkage disequilibrium in elite maize inbred lines. BMC Genet. 2002; 3: 19. pmid:12366868
- 46. Varshney RK, Beier U, Khlestkina EK, Kota R, Korzun V, Graner A, et al. Single nucleotide polymorphisms in rye (Secale cereale L.): discovery, frequency, and applications for genome mapping and diversity studies. Theor Appl Genet. 2007; 114: 1105–1116. pmid:17345059
- 47. Schneider K, Kulosa D, Soerensen TR, Möhring S, Heine M, Dursewitz G, et al. Analysis of DNA polymorphisms in sugar beet (Beta vulgaris L.) and development of an SNP-based map of expressed genes. Theor Appl Genet. 2007; 115: 601–615. pmid:17622508
- 48. Akey JM, Zhang K, Xiong M, Jin L. The effect of single nucleotide polymorphism identification strategies on estimates of linkage disequilibrium. Mol Biol Evol. 2003; 20: 232–242. pmid:12598690
- 49. Ke XY, Hunt S, Tapper W, Lawrence R, Stavrides G, Ghori J, et al. The impact of SNP density on fine-scale patterns of linkage disequilibrium. Hum Mol Genet. 2004; 13: 577–588. pmid:14734624
- 50. Matthies IE, Weise S, Förster J, Korzun V, Stein N, Röder MS. Nitrogen-metabolism related genes in barley-haplotype diversity, linkage mapping and associations with malting and kernel quality parameters. BMC Genet. 2013; 14: 77. pmid:24007272
- 51. Xing Y, Frei U, Schejbel B, Asp T, Lübberstedt T. Nucleotide diversity and linkage disequilibrium in 11 expressed resistance candidate genes in Lolium perenne. BMC Plant Biol. 2007; 7: 43. pmid:17683574
- 52. Zhu C, Yu J. Nonmetric multidimensional scaling corrects for population structure in association mapping with different sample types. Genetics. 2009; 182: 875–888. pmid:19414565
- 53. Ciftci-Yilmaz S, Mittler R. The zinc finger network of plants. Cell Mol Life Sci. 2008; 65: 1150–1160. pmid:18193167
- 54. Li W, Luan S, Schreiber SL, Assmann SM. Evidence for protein phosphatase 1 and 2A regulation of K+ channels in two types of leaf cells. Plant Physiol. 1994; 106: 963–970. pmid:7824661
- 55. Schliebner I, Pribil M, Zühlke J, Dietzmann A, Leister D. A survey of chloroplast protein kinases and phosphatases in Arabidopsis thaliana. Curr Genomics. 2008; 9: 184–190.
- 56. Chen Y-E, Zhao Z-Y, Zhang H-Y, Zeng X-Y, Yuan S. The significance of CP29 reversible phosphorylation in thylakoids of higher plants under environmental stresses. J Exp Bot. 2013; 64: 1167–1178. pmid:23349136
- 57. Lyzenga WJ, Stone SL. Abiotic stress tolerance mediated by protein ubiquitination. J Exp Bot. 2011; 63: 599–616. pmid:22016431
- 58. Akkasaeng C, Tantisuwichwong N, Chairam I, Prakrongrak N, Jogloy S, Pathanothai A. Isolation and identification of peanut leaf proteins regulated by water stress. Pak J Bot. 2007; 10: 1611–1617.
- 59. Huang XY, Chao DY, Gao JP, Zhu MZ, Shi M, Lin HX. A previously unknown zinc finger protein, DST, regulates drought and salt tolerance in rice via stomatal aperture control. Genes Dev. 2009; 23: 1805–1817. pmid:19651988
- 60. Mao X, Zhang H, Tian S, Chang X, Jing R. TaSnRK2.4, an SNF1-type serine/threonine protein kinase of wheat (Triticum aestivum L.), confers enhanced multistress tolerance in Arabidopsis. J Exp Bot. 2010; 61: 683–696. pmid:20022921
- 61. Wan X, Mo A, Liu S, Yang L, Li L. Constitutive expression of a peanut ubiquitin-conjugating enzyme gene in Arabidopsis confers improved water-stress tolerance through regulation of stress-responsive gene expression. J Biosci Bioeng. 2011; 111: 478–484. pmid:21193345
- 62. Jan A, Maruyama K, Todaka D, Kidokoro S, Abo M, Yoshimura E, et al. OsTZF1, a CCCH-tandem zinc finger protein, confers delayed senescence and stress tolerance in rice by regulating stress-related genes. Plant Physiol. 2013; 161: 1202–1216. pmid:23296688
- 63. Chung E, Cho C-W, So H-A, Kang J-S, Chung YS, Lee J-H. Overexpression of VrUBC1, a mung bean E2 ubiquitin-conjugating enzyme, enhances osmotic stress tolerance in Arabidopsis. PLoS One. 2013; 8.
- 64. Belknap WR, Garbarino JE. The role of ubiquitin in plant senescence and stress responses. Trends Plant Sci. 1996; 1: 331–335.
- 65. Kholová J, Vadez V. Water extraction under terminal drought explains the genotypic differences in yield, not the anti-oxidant changes in leaves of pearl millet (Pennisetum glaucum). Funct Plant Biol. 2013; 40: 44–53.
- 66. Zhu XG, Long SP, Ort DR. Improving photosynthetic efficiency for greater yield. Annu Rev Plant Biol. 2010; 61: 235–261. pmid:20192734
- 67. Parry MAJ, Reynolds M, Salvucci ME, Raines C, Andralojc PJ, Zhu X-G, et al. Raising yield potential of wheat. II. Increasing photosynthetic capacity and efficiency. J Exp Bot. 2011; 62: 453–467. pmid:21030385
- 68. Adu MO, Sparkes DL, Parmar A, Yawson DO. Stay green in wheat: comparative study of modern bread wheat and ancient wheat cultivars. ARPN J Agric Biol Sci. 2011; 6: 16–24.
- 69. Thomas H, Smart CM. Crops that stay green. Ann Appl Biol. 1993; 123: 193–219.
- 70. Kong Z, Li M, Yang W, Xu W, Xue Y. A novel nuclear-localized CCCH-type zinc finger protein, OsDOS, is involved in delaying leaf senescence in rice. Plant Physiol. 2006; 141: 1376–1388. pmid:16778011
- 71. Paulsen H, Dockter C, Volkov A, Jeschke G. Folding and Pigment Binding of Light-Harvesting Chlorophyll a/b Protein (LHCIIb). In: The Chloroplast; 2010. pp. 231–244.
- 72. Xia Y, Ning Z, Bai G, Li R, Yan G, Siddique KHM, et al. Allelic variations of a light harvesting chlorophyll a/b-binding protein gene (Lhcb1) associated with agronomic traits in barley. PLoS One. 2012; 7: e37573. pmid:22662173
- 73. Ikeda H, Fujii N, Setoguchi H. Molecular evolution of phytochromes in Cardamine nipponica (Brassicaceae) suggests the involvement of PHYE in local adaptation. Genetics. 2009; 182: 603–614. pmid:19363127
- 74. Boccalandro HE, Ploschuk EL, Yanovsky MJ, Sánchez RA, Gatz C, Casal JJ. Increased phytochrome B alleviates density effects on tuber yield of field potato crops. Plant Physiol. 2003; 133: 1539–1546. pmid:14605224
- 75. Sokolskaya SV, Sveshnikova NV, Kochetova GV, Solovchenko AE, Gostimski SA, Bashtanova OB. Involvement of phytochrome in regulation of transpiration: red-/far red-induced responses in the chlorophyll-deficient mutant of pea. Funct Plant Biol. 2003; 30: 1249–1259.
- 76. González CV, Ibarra SE, Piccoli PN, Botto JF, Boccalandro HE. Phytochrome B increases drought tolerance by enhancing ABA sensitivity in Arabidopsis thaliana. Plant Cell Environ. 2012; 35: 1958–1968. pmid:22553988
- 77. Boggs JZ, Loewy K, Bibee K, Heschel MS. Phytochromes influence stomatal conductance plasticity in Arabidopsis thaliana. Plant Growth Regul. 2010; 60: 77–81.
- 78. Saïdou A-A, Mariac C, Luong V, Pham J-L, Bezançon G, Vigouroux Y. Association studies identify natural variation at PHYC linked to flowering time and morphological variation in pearl millet. Genetics. 2009; 182: 899–910. pmid:19433627
- 79. Rawsthorne S. Carbon flux and fatty acid synthesis in plants. Prog Lipid Res. 2002; 41: 182–196. pmid:11755683
- 80. Vahdatirad A, Esfandiari E. Effect of activity ceasing of acetyl CoA-carboxylase on growth and antioxidant system in seedling stage of barley. J Biol Sci. 2013; 13: 250–256.
- 81. Tan C-M, Chen R-J, Zhang J-H, Gao X-L, Li L-H, Wang P-R, et al. OsPOP5, a prolyl oligopeptidase family gene from rice confers abiotic stress tolerance in Escherichia coli. Int J Mol Sci. 2013; 14: 20204–20219. pmid:24152437
- 82. Edae EA, Byrne PF, Manmathan H, Haley SD, Moragues M, Lopes MS, et al. Association mapping and nucleotide sequence variation in five drought tolerance candidate genes in spring wheat. The Plant Genome. 2013; 6.
- 83. Setter TL, Yan J, Warburton M, Ribaut J-M, Xu Y, Sawkins M, et al. Genetic association mapping identifies single nucleotide polymorphisms in genes that affect abscisic acid levels in maize floral tissues during drought. J Exp Bot. 2010; 62: 701–716. pmid:21084430
- 84. González-Martínez SC, Ersoz E, Brown GR, Wheeler NC, Neale DB. DNA sequence variation and selection of tag single-nucleotide polymorphisms at candidate genes for drought-stress response in Pinus taeda L. Genetics. 2006; 172: 1915–1926. pmid:16387885
- 85. Kholová J, Hash CT, Kočová M, Vadez V. Does a terminal drought tolerance QTL contribute to differences in ROS scavenging enzymes and photosynthetic pigments in pearl millet exposed to drought? Environ Exp Bot. 2011; 71: 99–106.
- 86. Duvick DN, Cassman KG. Post–green revolution trends in yield potential of temperate maize in the North-Central United States. Crop Sci. 1999; 39: 1622–1630.
- 87. Rharrabti Y, Elhani S, Martos-Nunez V, García del Moral LF. Protein and lysine content, grain yield, and other technological traits in durum wheat under Mediterranean conditions. J Agric Food Chem. 2001; 49: 3802–3807. pmid:11513670
- 88. Rotundo JL, Borrás L, Westgate ME, Orf JH. Relationship between assimilate supply per seed during seed filling and soybean seed composition. Field Crops Res. 2009; 112: 90–96.
- 89. Yan W, Frégeau-Reid J. Breeding line selection based on multiple traits. Crop Sci. 2008; 48: 417–423.
- 90. Magorokosho C, Tongoona P. Selection for drought tolerance in two tropical maize populations. Afr Crop Sci J. 2004; 11: 151–161.