Carotenoid Content and Root Color of Cultivated Carrot: A Candidate-Gene Association Study Using an Original Broad Unstructured Population

Accumulated in large amounts in carrot, carotenoids are an important product quality attribute and therefore a major breeding trait. However, the knowledge of carotenoid accumulation genetic control in this root vegetable is still limited. In order to identify the genetic variants linked to this character, we performed an association mapping study with a candidate gene approach. We developed an original unstructured population with a broad genetic basis to avoid the pitfall of false positive detection due to population stratification. We genotyped 109 SNPs located in 17 candidate genes – mostly carotenoid biosynthesis genes – on 380 individuals, and tested the association with carotenoid contents and color components. Total carotenoids and β-carotene contents were significantly associated with genes zeaxanthin epoxydase (ZEP), phytoene desaturase (PDS) and carotenoid isomerase (CRTISO) while α-carotene was associated with CRTISO and plastid terminal oxidase (PTOX) genes. Color components were associated most significantly with ZEP. Our results suggest the involvement of the couple PDS/PTOX and ZEP in carotenoid accumulation, as the result of the metabolic and catabolic activities respectively. This study brings new insights in the understanding of the carotenoid pathway in non-photosynthetic organs.


Introduction
Carotenoid compounds play an essential role in human health, preventing disease thanks to their antioxidant capacity and serving as provitamin A precursors. As humans cannot synthesize carotenoids, these compounds have to be provided by a plant-based diet [1]. Carrot is one of the most important vegetables in the world, and a critical source of carotenoids as a large amount is accumulated in root tissues [2]. Moreover, genetic resources exhibit a large range of colors and carotenoid content patterns [3], raising the question of the genetic control of carotenoid accumulation in carrot.
Carotenoid biosynthesis is today well established (Fig. 1) and genes encoding carotenoid enzymes have been characterized in many species [4][5][6][7]. Multiple steps in the pathway have been identified as controlling the carotenoid diversity and amount in various plant organs. Substrate availability (isopentenyl diphosphate and dimethylallyl diphosphate) is generally considered a limiting factor, as is catabolic activity [4,8]. Accumulation of phytoene, controlled by phytoene synthase and phytoene desaturase, has emerged as a key regulatory step in the accumulation of carotenoids in various storage organs [9][10][11][12][13].
Many studies have shown that carotenoid biosynthetic genes are involved in the genetic control of carotenoid content (maize [14], tomato [15], wheat [12,16], pepper [17]). Depending on the species, all carotenoid biosynthetic genes may be involved in the genetic basis of carotenoid content and are therefore meaningful candidate genes [4]. In some species, engineering the pathway using biosynthetic genes is now possible for crop enhancement of the carotenoid content. Golden Rice is such an example of metabolic pathway engineering for quality enhancement [18].
However, little is known about the genetic control of carotenoid accumulation in carrot. Heritability of carotenoid content in carrot roots has been estimated by [19] and ranges from 28% to 98% depending on the compound and the investigated genetic background. Two major loci, Y and Y2, governing the orange intensity of xylem/phloem were identified [20]. The Y locus may block the synthesis of carotene and xanthophyll, whereas the Y2 locus determines the accumulation of carotene but not that of xanthophyll [21,22]. A path analysis showed that phytoene accumulation may be one key step limiting carotenoid accumulation in white roots [9]. This was confirmed by [11], who turned a white-rooted carrot orange by overexpressing a phytoene synthase gene. Recently, a polymorphism of carotene hydroxylase CYP97A3 controlling the α-carotene content was identified [23] and the authors suggested a negative feedback regulation on PSY determining the carotenoid flux. Only two studies [21,24] have investigated the genetic determinism of carotenoid content in carrot roots by linkage mapping, using a cross between an orange cultivated carrot and a white wild one. Moreover, almost all biosynthetic genes have been sequenced and mapped in carrot [25]. Two major QTLs governing carotenoid accumulation were localized, with some carotenoid biosynthetic genes (zeaxanthin epoxydase, carotene hydroxylase and carotenoid dioxygenase families) mapped in the confidence interval or near these two QTLs.
As QTLs might be population-specific, association mapping has emerged in the last decade as an alternative to linkage analysis to dissect the basis of quantitative traits in plants. Such studies address the relationship between marker-based polymorphism and phenotypic variation in a diversified population. Using a diversified population may increase the resolution of such a study by using all ancestral recombination events [26]. One major interest of such a population is also the opportunity to study many alleles compared to a bi-parental cross study [27]. Association mapping targeting candidate genes has proven successful in many instances [28][29][30][31] and might bring new insights for carotenoid content as the genetic pathway has already been dissected through forward and reverse genetics in many organisms. However, one pitfall in association mapping is the lack of power when performed in structured panels. Structure can lead to an increase of the false discovery rate. Indeed, false positives can be detected when phenotypic traits are correlated with underlying population structure at non-causal loci [32]. Daucus carota L. genetic resources are known to be structured into two distinct genetic groups [33][34][35] according to their geographical origin. Moreover, the carotenoid content pattern is closely linked to these genetic groups: cultivars with high lycopene content belong mostly to one genetic group. In this case, association mapping can typically lead to false positive detection.
Fig. 1. Abbreviations: PTOX: plastid terminal oxidase, CRTISO: carotene isomerase, ZDS: ζ-carotene desaturase, LCYE: ε-lycopene cyclase, LCYB: β-lycopene cyclase, CHXB: β-carotene hydroxylase, CHXE: ε-carotene hydroxylase, ZEP: zeaxanthin epoxydase, VDE: violaxanthin de-epoxydase, NXS: neoxanthin synthase, NCED: nine-cis-epoxycarotenoid dioxygenase, ABA: abscisic acid. Adapted from [25] and [58].
Population stratification can be estimated in different ways and added as a covariate in association models, limiting the detection of false positives. The first proposed model [36], called the Q model, estimates population stratification with a Bayesian model, now implemented in the Structure software [37,38]. A principal component analysis was then proposed to correct for structure [39]. The use of both a kinship matrix (K) and population structure in a unified mixed model approach to account for relatedness between individuals was proposed by [27].
In order to overcome the structure bias, we have created a specific unstructured population with a broad genetic basis to perform an association mapping study for carrot root carotenoid content.
The aim of this study was to investigate the implication of biosynthetic genes in carrot root carotenoid content and related color traits using a broad unstructured population in a candidate-gene association approach. This work will offer new insights in the global understanding of the carotenoid biosynthetic pathway in carrot. This will allow us to identify favorable alleles with associated markers usable in marker-assisted selection (MAS) for product quality enhancement.

Plant material
The discovery population consisted of 380 individuals from the third generation of intercrossing of an initial panel of 67 cultivars, each represented by 6 individuals. This panel represents a large diversity of cultivated carrot: three white (Europe and Middle-east), eight yellow (Europe, Central Asia, Asia), two red (Asia), 45 orange (Europe, South & North America, Australia, Madagascar, central Asia, Asia) and eight purple ones (Europe & Middle East). Pollinators were introduced at maximum blooming to avoid genetic drift and seeds were harvested from each plant in a balanced way as suggested by [40]. At each generation, 150 seeds were randomly chosen and sown for the next one. The third intercrossing generation was sown in the field in Agrocampus-Ouest (Angers, France) and grown following standard practices. Roots were harvested at 97 days after sowing. At harvesting, 380 individuals were randomly chosen.

Phenotypic data
Color evaluation by spectrocolorimetry
Since color is a quality attribute and may be considered as an indirect measure of carotenoid content [41][42], color was evaluated with a CM2600d Minolta (Japan) spectrocolorimeter equipped with a 5 mm measuring area. The illuminant used was D65, calibration was done with a white standard, and the specular component of light was excluded. Two measures were taken for epidermis and secondary phloem and one for secondary xylem.
CIELAB color space coordinates (L*, a*, b*, C* and h) were recorded. Data represent the mean of measures per tissue type.
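As an illustration of how the chroma and hue coordinates relate to the two chromatic axes, the sketch below applies the standard CIELAB definitions (chroma as the Euclidean norm of the two chromatic coordinates, hue as their angle). The function name and the example values are hypothetical and are not taken from the study; the spectrocolorimeter reports these coordinates directly.

```python
import math


def chroma_and_hue(a_star: float, b_star: float) -> tuple[float, float]:
    """Return (C*, h) from CIELAB a* and b*; h is in degrees within [0, 360)."""
    chroma = math.hypot(a_star, b_star)  # C* = sqrt(a*^2 + b*^2)
    hue = math.degrees(math.atan2(b_star, a_star)) % 360.0  # hue angle
    return chroma, hue


# Hypothetical reading: positive a* (red) and positive b* (yellow).
c, h = chroma_and_hue(30.0, 40.0)
```

A more orange root shifts the hue angle towards lower values while a more yellow root shifts it towards 90 degrees, which is one reason the hue angle can serve as an indirect color index.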
Root extremities were then cut off and roots were immediately ground, frozen in liquid nitrogen and stored at −80°C. Both carotenoids and DNA were extracted from the same root sample. Dry matter was determined after drying a sample of approximately 10 g at 55°C for 7 days.

Carotenoid quantification by HPLC
The procedure was adapted from [43]. Extraction was done on approximately 500 mg of crushed frozen material to which 50 µL of β-apo-8'-carotenal at 0.1 g/L was first added as an internal standard. Samples were mixed with 7 mL of methanol containing 0.57% MgCO3 and 0.1% 3,5-di-tert-butyl-4-hydroxytoluene (BHT), then vortexed, and mixed with 7 mL of chloroform containing 0.1% BHT. After 15 min incubation in darkness, 7 mL of ultrapure water were added, and samples were centrifuged at 236 g for 10 min. One milliliter from the lower layer was concentrated under vacuum evaporation, and the dry extract was dissolved in 200 µL of acetonitrile/dichloromethane (50:50, v/v) containing 0.1% BHT. Samples were kept at 4°C and protected from direct light during the whole procedure. Extraction was carried out in duplicate.
The analyses for carotenoid quantification were done on a Shimadzu (Shimadzu Corporation, Kyoto, Japan) HPLC equipped with a ternary pump (LC-10AT VP), a thermostated autosampler (SIL-10AD VP), a photodiode array detector (SPD-M10A VP), a controller (SCL-10A VP), an on-line degasser (Degasys DG-1310), and a temperature controller (Crococil). Data acquisition and processing were done using the LC workstation Class-VP (Shimadzu). The procedure was adapted from [44]. Carotenoids were separated along a YMC C30 (YMC, Japan) column (150 × 4. Carotenoid compounds were identified according to their elution order and UV-visible spectrum in comparison with their authentic standards (analysed individually and in combination in the conditions used for samples), and with data from the literature [45] when standards were unavailable. Quantification was done at 296 nm (phytoene), 348 nm (phytofluene), 450 nm (lutein, β-apo-8'-carotenal, α-carotene, β-carotene), and 472 nm (lycopene), based on internal calibration using β-apo-8'-carotenal, a β-carotene calibration curve and extraction yield. Data represent the mean of two assays per individual and are expressed as β-carotene equivalents in mg per 100 g of dry matter (DM).

Genotypic data
SNP discovery, haplotype-tagging SNP selection and genotyping
Based on results from previous studies, we chose carotenoid biosynthetic genes as candidate genes likely to explain the observed variation in root carotenoid content.
In order to identify polymorphism in other candidate genes, 48 lines representing a large diversity were sequenced for GGPS2, ZDS1, PTOX, PSY1, PSY2, NCED1-2-3, LCYB2 and ABA2 gene fragments. Primers were designed based on publicly available databases (S1 Table). PCR reactions, cycling conditions and sequencing protocol were identical to those in [35]. All sequences were aligned using Geneious software and haplotypes were inferred with DnaSP.
The sequence for a marker associated to the Y2 locus responsible for carotenoid accumulation was obtained from [46].

SNP Genotyping
DNA was isolated and purified with a modified CTAB protocol [47] and DNA concentration was adjusted to 15 ng/µL. SNP genotyping was carried out by KASPar Assay (KBioscience - LGC Genomics). This technology is based on a proprietary competitive allele-specific PCR system. Allele detection relies on a FRET quencher cassette which allows bi-allelic discrimination of known SNPs and InDels.
A total of 470 SNPs were found over all the 12,351 bp sequences from 17 genes. We identified 169 haplotypes over the 17 genes. Haplotype-tagging SNPs (HtSNPs) were chosen to maximize the number of haplotypes. We were able to design primers for 109 SNPs out of the 120 predicted (context sequences are provided in S2 Table). Finally, 93 SNPs were used after removing SNPs with Minor Allele Frequency (MAF) lower than 5% and missing data higher than 20%.
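The marker filtering step described above can be sketched as follows. This is a minimal illustration, assuming bi-allelic genotype calls stored as strings such as "C:T" and missing calls stored as None; the function name, the data layout and the example genotypes are hypothetical and do not reproduce the software actually used.

```python
from collections import Counter


def passes_filters(genotypes, maf_min=0.05, missing_max=0.20):
    """Keep a SNP if its minor allele frequency is at least maf_min and
    its share of missing calls is at most missing_max.

    genotypes: list of calls such as "C:T", with None for a missing call.
    """
    n = len(genotypes)
    called = [g for g in genotypes if g is not None]
    if n == 0 or not called:
        return False
    missing_rate = 1 - len(called) / n
    if missing_rate > missing_max:
        return False
    # Count alleles over all called genotypes (two alleles per individual).
    alleles = Counter(a for g in called for a in g.split(":"))
    if len(alleles) < 2:
        return False  # monomorphic in this sample
    maf = min(alleles.values()) / sum(alleles.values())
    return maf >= maf_min


# Hypothetical SNP: 8 homozygotes and 2 heterozygotes, no missing data.
example = ["C:C"] * 8 + ["C:T"] * 2
keep = passes_filters(example)
```

In the example the minor allele T is carried on 2 of 20 chromosomes (frequency 0.10), so the marker would be retained under the 5% threshold.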
For each gene, haplotypes were reconstructed using PHASE implemented in DnaSP [48]. LD within and between candidate genes was assessed with r² generated by TASSEL [49].
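To make the LD statistic concrete, the sketch below computes r² between two SNPs as the squared Pearson correlation of allele dosages (0, 1 or 2 copies of a reference allele). This genotype-based estimate is an assumption chosen for illustration and may differ from the haplotype-based computation performed by TASSEL; the function name and the example data are hypothetical.

```python
def r_squared(dosage_x, dosage_y):
    """Squared Pearson correlation between two allele-dosage vectors.

    Individuals with a missing call (None) at either SNP are skipped.
    Returns NaN if either SNP is monomorphic among the retained individuals.
    """
    pairs = [(x, y) for x, y in zip(dosage_x, dosage_y)
             if x is not None and y is not None]
    n = len(pairs)
    if n == 0:
        return float("nan")
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    if sxx == 0 or syy == 0:
        return float("nan")
    return (sxy * sxy) / (sxx * syy)


# Two hypothetical SNPs with identical genotypes across individuals.
complete_ld = r_squared([0, 1, 2, 0, 2], [0, 1, 2, 0, 2])
# Two hypothetical SNPs whose genotypes vary independently.
no_ld = r_squared([0, 0, 2, 2], [0, 2, 0, 2])
```

Values close to 1 indicate that one SNP carries essentially the same information as the other, while values close to 0 indicate independent markers.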

SSR Genotyping
For SSR markers, 15 primers used in [35] were chosen for their genome coverage and reproducibility [50]. PCR reactions, capillary electrophoresis and fragment sizing were identical to those in [35].

Population structure and relatedness
Both SSR and SNP datasets were used to investigate population stratification and relatedness between individuals as they can lead to false positive detection during association analysis.
Population stratification was first investigated on the SSR dataset with the Bayesian model-based STRUCTURE software [37], which is used to infer distinct populations and to assign individuals to the identified populations. The model allowed admixture and correlated allele frequencies and was run with a burn-in period of 10⁵ and a run length of 10⁶ iterations. Ten independent runs were performed for each putative cluster number (K). The range of possible K tested was from one to ten. Evanno's method [51] was used to estimate the most probable K number.
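The ΔK statistic used in Evanno's method can be sketched as below. This is an illustrative implementation, assuming the second-order difference is taken on the mean LnP(K) over replicate runs and divided by the standard deviation of LnP(K) across those runs; the function name and the log-likelihood values are hypothetical and are not results from this study.

```python
from statistics import mean, stdev


def delta_k(lnp_by_k):
    """Compute Evanno's delta K for every K that has both neighbours.

    lnp_by_k: dict mapping K to the list of LnP(K) values from replicate runs.
    Returns a dict mapping K to delta K.
    """
    result = {}
    for k in sorted(lnp_by_k):
        if (k - 1) not in lnp_by_k or (k + 1) not in lnp_by_k:
            continue  # the first and last K have no second-order difference
        m_prev = mean(lnp_by_k[k - 1])
        m_here = mean(lnp_by_k[k])
        m_next = mean(lnp_by_k[k + 1])
        sd_here = stdev(lnp_by_k[k])
        if sd_here > 0:
            result[k] = abs(m_next - 2 * m_here + m_prev) / sd_here
    return result


# Hypothetical LnP(K) values, two runs per K.
runs = {1: [-100.0, -102.0], 2: [-80.0, -82.0],
        3: [-78.0, -80.0], 4: [-77.0, -79.0]}
dk = delta_k(runs)
best_k = max(dk, key=dk.get)
```

With these made-up values the likelihood rises sharply between K = 1 and K = 2 and then flattens, so ΔK peaks at K = 2. ΔK is undefined for the smallest and largest K tested.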
Population stratification was also studied by PCA analysis by using SNP dataset on TASSEL software [49].
In order to study the relatedness between individuals, two kinship matrices were calculated with TASSEL, based on SSR markers (K-SSR) and SNP markers (K-SNP) respectively.

Association tests
Association analysis was performed using TASSEL software [49]. Polymorphisms with a MAF below 5% and/or more than 20% missing data were removed from the analysis. Marker-trait associations were calculated using six models to evaluate the effects of population stratification and kinship: first a naive model without correction, then models correcting for population stratification estimated by STRUCTURE (Q) and by PCA (P), and finally mixed linear models accounting for relatedness between individuals estimated with SNPs (K-SNP) and SSRs (K-SSR) and for population structure (Q+K and P+K).
Because of multiple testing, SNPs were declared significantly associated at a p value threshold of 5.3×10⁻⁴, as corrected with the standard Bonferroni procedure. The amount of variation explained by a SNP (r²) was calculated for each significant association using a simple general linear model. Haplotypes were declared significantly associated at an arbitrary p value threshold of 0.05 because of the small number of tested genes.
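The Bonferroni correction divides the nominal significance level by the number of tests. The short sketch below assumes, for illustration only, a nominal level of 0.05 and a correction over the 93 SNPs retained after filtering; the text states the threshold but not the denominator, so both inputs are assumptions, and the function name is hypothetical.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under the Bonferroni procedure."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Assumed inputs: nominal alpha of 0.05 and 93 tested SNPs.
threshold = bonferroni_threshold(0.05, 93)
```

Under these assumptions the threshold is about 5.4×10⁻⁴, of the same order as the value used for SNP-trait associations.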
Model fit was evaluated by plotting the cumulative distribution of p values as described by [52]. A uniform distribution of p values indicates an ideal model.
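The idea behind the cumulative p value plot can be sketched as follows: under an ideal model, the fraction of tests with a p value at or below p is approximately p itself, so the points fall on the diagonal. The helper names, the summary statistic (largest gap to the diagonal) and the simulated p values below are illustrative assumptions, not the procedure of [52] nor data from this study.

```python
def cumulative_pvalue_points(p_values):
    """Return (p value, cumulative fraction of tests) pairs, sorted by p value."""
    ordered = sorted(p_values)
    n = len(ordered)
    return [(p, (rank + 1) / n) for rank, p in enumerate(ordered)]


def max_deviation_from_uniform(p_values):
    """Largest vertical gap between the cumulative curve and the diagonal."""
    return max(abs(frac - p) for p, frac in cumulative_pvalue_points(p_values))


# Hypothetical well-behaved model: p values evenly spread over (0, 1).
uniform_like = [(i + 0.5) / 100 for i in range(100)]
# Hypothetical poorly fitting model: half of the tests have very low p values.
inflated = [0.001] * 50 + [(i + 0.5) / 50 for i in range(50)]

gap_uniform = max_deviation_from_uniform(uniform_like)
gap_inflated = max_deviation_from_uniform(inflated)
```

A curve that rises well above the diagonal at low p values, as in the second example, signals an excess of small p values and therefore a risk of spurious associations.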

Phenotypic variation for carotenoid and color
A first visual observation of all 380 individuals showed no white or red roots. The whole population was composed of a gradient of yellow to orange roots. Because very few individuals contained lycopene and the content variability for this compound was limited, association tests for lycopene content were not performed.
On average, β-carotene represented almost half of total carotenoid content and α-carotene represented about a third of β-carotene content (Table 1). Lutein content was relatively low, as was the content of precursor compounds like phytoene and phytofluene. The unstructured population exhibited a large variation for all carotenoid compounds as shown by the high standard deviation.
Color components were measured for epidermis, secondary phloem and secondary xylem. Phenotypic variations were quite similar between tissues (Table 2). Table 3 shows the correlation between carotenoid compounds. All carotenoid compounds except lutein were highly correlated with β-carotene, with a highly significant r² between 0.78 and 0.99. In contrast, lutein was poorly correlated with the other compounds (r² between 0.22 and 0.49). As secondary phloem represents the largest part of the roots, correlations between color components of this tissue and carotenoid content were investigated (Table 3). Except for lutein, all carotenoid compounds were mainly correlated with a* and h (r² between 0.70 and 0.77 and between 0.60 and 0.69 respectively). Lutein was not correlated with any other trait.

Population Structure
All 15 SSR markers were polymorphic and 132 alleles were identified with a mean of 8.8 alleles per locus.
Population stratification was first investigated with STRUCTURE based on SSR markers. The LnP(K) plot (Fig. 2A) did not reach the plateau that would be expected in the presence of structure in the sample. The number of genetic groups was also investigated with Evanno's method [51]. As shown in Fig. 2B, the most probable K was 2, 3 or 5. However, as shown in Fig. 2 (C, D, E), all individuals were admixed and none of them was clearly assigned to one group. In addition, the proportion of the sample assigned to each group was roughly symmetric (~1/K). All these elements showed an absence of structure in the population. This conclusion is reinforced by the principal component analysis performed with SNP data (Fig. 3), in which no group was clearly defined.

Linkage Disequilibrium
No intergenic LD was observed: all r² values between SNPs located in different genes were below 0.2. Within genes, LD between SNPs was high or low depending on the gene considered. ZDS1, LCYB1, IPI, LCYE and ZEP showed relatively high LD between SNPs, as observed by [33]. In contrast, other genes such as the NCED family or PTOX exhibited no intragenic LD (S1 Fig.).

Model choice
As Structure did not converge and PCA did not show any structure in our population, models with Q and PCA covariates are not presented. To assess the goodness of fit of each model (naive, K-SNP or K-SSR), cumulative p value plots are presented in Fig. 4 for carotenoid traits and SNP-based associations. For all traits, the naive model did not perform well and an excess of low p values was found. The K-SSR model performed better than the naive one for SNP-trait associations as well as for haplotype-trait associations. The K-SNP model showed a uniform distribution of p values and therefore minimized the chance of spurious association. The presented association results are thus all based on the K-SNP model. Results were similar for color component traits: the K-SNP model performed better than any other model.

Association results based on SNPs
With the K-SNP model, 93 SNPs were tested against 21 traits. Among the 1953 marker-trait pairs, 23 significant associations were found with a Bonferroni corrected threshold of 5.3×10⁻⁵ (Fig. 5). Two SNPs (ZEP-117 and ZEP-361) in the zeaxanthin epoxydase gene were associated with total carotenoids (R² = 0.21 for ZEP-117), β-carotene (R² = 0.22 for ZEP-117), phytoene (R² = 0.22 for ZEP-117) and phytofluene (R² = 0.23 for ZEP-117) content. These results were similar when carotenoid traits were expressed on a fresh matter basis (data not shown). These two SNPs were also associated with color components a* and h for all three tested tissues (Table 4). These two SNPs were in complete LD (r² = 1) and therefore redundant. These SNPs were located in a non-coding region. Fig. 6 shows the distribution of carotenoid content for each genotype at the ZEP-117 SNP (results were similar for the ZEP-361 SNP, data not shown). For all associated compounds, C:C, C:T and T:T genotype means were significantly different from each other (p < 0.05, Kruskal-Wallis test). This reveals a typical dominant action for this locus.
Table 3. Pearson correlation between root content of carotenoid compounds and color components of the secondary phloem.
One polymorphism in the carotenoid isomerase gene (CRTISO) was associated with total carotenoids, β-carotene and α-carotene. At least one SNP in the plastid terminal oxidase (PTOX), a cofactor of the phytoene desaturase, was associated with α-carotene (Fig. 5). No association was detected between the Y2 related marker and carotenoid content.
Association results based on haplotypes
When testing haplotypes against carotenoid content, we detected 32 associations with a p value lower than 0.05 (Table 5). Among these 32 significant associations, 27 involved three genes. Total carotenoids, β-carotene, phytoene and phytofluene content were associated with the ZEP gene. This gene was also associated with color components for the three investigated tissues. The phytoene desaturase gene was associated with lutein, phytoene, phytofluene and total carotenoid content and with the color saturation C* of inner root. The plastid terminal oxidase gene (PTOX) was associated with α-carotene, phytoene, phytofluene and total carotenoid content. It was also associated with the color component b* for both epidermis and secondary phloem and with C* for epidermis. Finally, PSY2, ZDS1 and NCED1 were associated with color components and PSY1 with lutein content.

Unstructured population and consequences for association study
Carrot genetic resources often exhibit a strong stratification [33][34][35] which can lead to false positive detection when association mapping is performed. Here we conducted a candidate gene association study on root carotenoid content by using an unstructured population. Both SNPs and SSRs confirmed the absence of population stratification in this population. The same set of SSR markers was used in [35] and revealed a strong population stratification in a diversified panel of carrot accessions. Therefore, neither the Q matrix nor PCA components were used in the association mapping model. However, the naive model showed an excess of low p values and therefore did not fit well. As all individuals descended from the same parents, relatedness between individuals was relatively high and the association model had to be corrected for relatedness: a kinship matrix was still needed. Auzanneau et al. [53] already demonstrated in ryegrass the value of using a panmictic population in association mapping studies to overcome the population stratification bias. The choice of parents appeared to be essential. In our original set of cultivars, red-rooted carrot was under-represented and we were not able to observe red roots after three generations. As population stratification was disrupted after three intercross generations, the choice of the parents should be based more on how well they represent phenotypic variation than on the original stratification. In order to reach a large variation for the targeted traits after several intercross generations, the first generation must exhibit a large range of phenotypic variation but also a balanced representation of phenotypic patterns. Indeed, as mentioned by [53], detected associations may differ depending on the population parents.

Major role of the catabolic gene zeaxanthin epoxydase
Zeaxanthin epoxydase gene polymorphism was associated with total carotenoid, β-carotene, phytoene and phytofluene content. Moreover, associations with color components a* and h were also detected. As the two polymorphisms associated with traits were in a non-coding region, we were unable to detect the causal polymorphism of the phenotypic variation. But as this gene exhibited a moderate LD as shown in S1 Fig. and in [33], the causal polymorphism may be located somewhere else in the LD block.
ZEP is one of the major steps in the carotenoid pathway. Nevertheless, carotenoid accumulation in various organs is known to be the result of biosynthesis, degradation and storage [4,8]. An impaired function of the zeaxanthin epoxydase protein may result in an accumulation of β-carotene. As β-carotene, phytoene and phytofluene were highly and positively correlated, a high level of β-carotene was often associated with a high level of the precursors phytoene and phytofluene. A high level of phytoene and phytofluene may also be the result of a large β-carotene accumulation due to reduced degradation. It also seems that the zeaxanthin epoxydase gene may drive the biosynthesis pathway towards the β-branch.
In a previous study, [21] performed a QTL detection in a biparental cross between a wild and an orange cultivated carrot. One major QTL was detected for α- and β-carotene, phytoene, ζ-carotene and total carotenoid content but not for lutein content. As the marker Y for the Y2 locus related to carotenoid accumulation was mapped in the confidence interval of this QTL and the STS (Sequence-Tagged Site) marker ZEP was mapped outside of the confidence interval, the authors concluded that the role of ZEP was not obvious to explain the Y2 locus. According to our results, we suggest that ZEP may be one probable candidate gene underlying the Y2 locus. Previous results on potato tubers also showed an association between ZEP and flesh color [54]. A specific allele only present in orange-fleshed genotypes was identified. This allele showed a reduced level of expression, probably due to a large retrotransposon in the first intron. Similar observations were made in maize, where carotenoid accumulation in the kernel was inversely associated with ZEP transcript levels [55]. However, no significant variation of ZEP transcript abundance between cultivars was found during carrot root development [43]. This suggests that a putative transcription difference would be at the allele level, which needs to be confirmed by a further study on ZEP alleles.
In tomato, the mutation high pigment 3 (hp3) occurred in the ZEP gene, leading to a 30% increase of carotenoid accumulation in the mature fruit [56]. These results, with related effects on ABA content and carotenoid storage capacity, reinforce our conclusion that ZEP is a major candidate gene governing carotenoid content in carrot roots.

Limiting role of the biosynthetic gene phytoene desaturase
We identified PDS and its cofactor PTOX [57] as associated with total carotenoid, phytoene, phytofluene, α-carotene and lutein content. This suggests that these genes are involved in the global carotenoid accumulation. An early regulation of the pathway may explain the large number of associations detected for these genes. These results are consistent with previous results which identified phytoene accumulation as a major regulatory step in the carotenoid pathway in plants [4,6]. Moreover, a putative signature of selection for PDS after domestication of carrot was shown by [58]. This may be related to an early control of the metabolic pathway. PTOX has been identified as playing a major role in carotenoid accumulation. The Arabidopsis mutant IMMUTANS [59,60] and the tomato mutant ghost [61] exhibit an impaired function of PTOX leading to a deficient phytoene desaturation and an accumulation of the carotenoid precursor phytoene.
Just et al. [21] also identified CHXE, NCED2 and PDS as potential candidate genes linked to another major carotenoid QTL, proposed as the Y locus. But they could not conclude on the effect of each gene in this QTL region. As neither SNPs nor haplotypes from CHXE and NCED2 were associated with carotenoid traits, our results suggest that PDS may be one probable candidate explaining the Y locus.
Unfortunately, PTOX is not yet mapped into the carrot genome, which would be needed to better explain its role. A recent study suggests a complicated role for the PTOX which is also involved in chloroplast biogenesis and in photosystem II photoprotection [62].
Towards a particular mechanism driving the metabolic flux through the α-branch to lutein
The orientation of the pathway towards the α-branch or the β-branch results in different patterns of carotenoid content. However, the underlying mechanism remains unclear in carrot. The metabolic node is known to be a major regulation step in the pathway [4]. For example, variation in LCYE in maize explained 58% of the variation in the two branches [63]. This was also observed in Arabidopsis thaliana [64], Brassica napus [65] and Solanum tuberosum [66]. But we did not detect any association for α-carotene and lutein content with the ε-lycopene cyclase gene. This suggests a more complex regulation orienting the pathway towards one of the branches, controlled by a genetic factor different from the tested carotenoid biosynthetic genes. Indeed, Arango et al. [23] have shown the specific role of the carotene hydroxylase CYP97A3 gene in the α-carotene level.
The study of correlation between all traits showed the independence of lutein content. All carotenoid compounds except lutein were correlated with each other. This also suggests a particular mechanism in lutein accumulation from α-carotene.

Conclusion
For the first time, we performed an association analysis on Daucus carota L. Moreover, we developed an original unstructured population with a broad genetic basis, limiting the risk of spurious association. However, as all individuals descended from the same parents, relatedness estimated with kinship matrices had to be included in the association model. We identified several SNPs and genes associated with carotenoid content and color components. Our results bring evidence that zeaxanthin epoxydase and phytoene desaturase are candidate genes involved in carotenoid accumulation in non-photosynthetic organs. Our study brings new insight into the functioning of the carotenoid pathway by highlighting two major steps in carotenoid metabolism and catabolism in a storage organ. Functional validation and dissection of the regulation of ZEP expression may clarify the mechanisms involved in carotenoid accumulation. The involvement of PTOX also remains to be clarified. A mechanism explaining both the accumulation of xanthophylls and the pathway orientation towards the α-branch as well as lutein accumulation still remains to be specified. However, the genes identified in this study as associated with color components and carotenoid content may be useful in marker-assisted selection for carotenoid content enhancement in a breeding program.
Supporting Information S1