Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Conserved amino acid residues and gene expression patterns associated with the substrate preferences of the competing enzymes FLS and DFR

  • Nancy Choudhary,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Institute of Plant Biology & BRICS, Plant Biotechnology and Bioinformatics, TU Braunschweig, Braunschweig, Germany

  • Boas Pucker

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    b.pucker@tu-braunschweig.de

    Affiliation Institute of Plant Biology & BRICS, Plant Biotechnology and Bioinformatics, TU Braunschweig, Braunschweig, Germany

Abstract

Background

Flavonoids, an important class of specialized metabolites, are synthesized from phenylalanine and present in almost all plant species. Different branches of flavonoid biosynthesis lead to products like flavones, flavonols, anthocyanins, and proanthocyanidins. Dihydroflavonols form the branching point towards the production of non-colored flavonols via flavonol synthase (FLS) and colored anthocyanins via dihydroflavonol 4-reductase (DFR). Despite the wealth of publicly accessible data, there remains a gap in understanding the mechanisms that mitigate competition between FLS and DFR for the shared substrate, dihydroflavonols.

Results

An angiosperm-wide comparison of FLS and DFR sequences revealed the amino acids at positions associated with the substrate specificity in both enzymes. A global analysis of the phylogenetic distribution of these amino acid residues revealed that monocots generally possess FLS with Y132 (FLSY) and DFR with N133 (DFRN). In contrast, dicots generally possess FLSH and DFRN, DFRD, and DFRA. DFRA, which restricts substrate preference to dihydrokaempferol, previously believed to be unique to strawberry species, is found to be more widespread in angiosperms and has evolved independently multiple times. Generally, angiosperm FLS appears to prefer dihydrokaempferol, whereas DFR appears to favor dihydroquercetin or dihydromyricetin. Moreover, in the FLS-DFR competition, the dominance of one over the other is observed, with typically only one gene being expressed at any given time.

Conclusion

This study illustrates how almost mutually exclusive gene expression and substrate-preference determining residues could mitigate competition between FLS and DFR, delineates the evolution of these enzymes, and provides insights into mechanisms directing the metabolic flux of the flavonoid biosynthesis, with potential implications for ornamental plants and molecular breeding strategies.

Introduction

Flavonoids comprise a range of different subclasses including flavones, flavonols, anthocyanins, and proanthocyanidins, all of which have their own biological functions [1]. Flavonoids exhibit a remarkable diversity, encompassing a vast array of compounds that possess distinct properties, conferring specific evolutionary benefits [2]. The biosynthesis of these compounds involves a complex network of enzymatic reactions. Mutants with observable phenotypic variations in flavonoid biosynthesis have served as valuable model systems for studying the intricacies of biosynthetic pathways and their regulatory mechanisms [36].

Anthocyanins are renowned for their role as red, purple, or blue pigments, contributing to the vibrant colors seen in flowers and fruits [1]. These color patterns serve to attract animals for important biological processes such as pollination and seed dispersal [7]. Flavonols have been described as co-pigments that contribute to the overall coloration of various plant structures [8,9]. On their own, flavonols are colorless or pale yellow and have been described as important factors in the UV stress response [10]. Additionally, flavonoid biosynthesis produces other compounds such as flavones and proanthocyanidins [11,12]. Flavones play a crucial role in signaling and pathogen defense [11], whereas proanthocyanidins are key contributors to seed coat pigmentation [13]. Beyond their pigmentation functions, flavonoids have diverse roles in response to environmental factors like drought [14], cold [1517], and high light intensities [18,19], playing a vital physiological role in protecting against UV-induced oxidative stress [20,21], detoxifying reactive oxygen species (ROS) [22,23] and toxic metals [24], defending against pathogens [25], regulating symbiotic associations [26,27], and influencing auxin flux [28,29]. A theory says that the cytochrome P450 enzymes (like C4H, FNSII, F3’H, and F3’5’H) are anchored to the endoplasmic reticulum membrane and associate with other soluble enzymes in the cellular environment forming a “metabolon” [30]. Supported by biochemical experiments, flavonoid metabolons have been reported in a few plant species including soybean [31], Arabidopsis [32], kumquat [33], snapdragon, and torenia [34].

A range of different colors is provided by three classes of anthocyanins: pelargonidin (orange to bright red), cyanidin (red to pink), and delphinidin (purple or blue) glycosides [1]. These three classes are characterized by different numbers of hydroxyl groups at the B phenyl ring: pelargonidin (one), cyanidin (two), and delphinidin (three) [35]. The biosynthesis of anthocyanins involves the enzymatic conversion of colorless dihydroflavonols into leucoanthocyanidins by DFR [36]. ANS further converts leucoanthocyanidins into anthocyanidins. Recent research suggests the involvement of an anthocyanin-related glutathione S-transferase (arGST) in anthocyanin biosynthesis [37,38]. Anthocyanidins are transformed into anthocyanins through the addition of a sugar moiety by glycosyl transferases [39]. Subsequent modifications such as the addition of other sugar moieties [40] or acyl groups [41] are catalyzed by a large number of often promiscuous enzymes.

Dihydroflavonols are not exclusively channeled into anthocyanin biosynthesis but are also the substrate of the flavonol synthase (FLS) that produces kaempferol, quercetin, or myricetin depending on the available dihydroflavonol [42]. This leads to a competition for the dihydroflavonol substrates between DFR and FLS (Fig 1). Previous studies suggest that the ratio of FLS to DFR activity is an important factor in color determination [4246]. A study across multiple flowering plants discovered that high FLS activity can result in a lack of anthocyanin pigmentation despite a functional anthocyanin biosynthesis machinery being present [45]. The flavonoid 3’-hydroxylase (F3’H) converts dihydrokaempferol into dihydroquercetin by adding one hydroxyl group in the C3’ position of the B ring, while the flavonoid 3’,5’-hydroxylase (F3’5’H) converts dihydrokaempferol into dihydromyricetin by adding hydroxyl groups at the C3’ and C5’ positions [47]. Differences in the stability of the different dihydroflavonols pose a challenge for quantitative in vitro studies [48].

thumbnail
Fig 1. Simplified illustration of the flavonoid biosynthesis in plants showing the competition between FLS and DFR for common substrate, dihydroflavonols.

CHS, chalcone synthase; CHI, chalcone isomerase; F3H, flavanone 3-hydroxylase; F3′H, flavonoid 3′-hydroxylase; F3′5′H, flavonoid 3′,5′-hydroxylase; FLS, flavonol synthase; DFR, dihydroflavonol 4-reductase; ANS, anthocyanidin synthase; arGST, anthocyanin-related glutathione S-transferase; UFGT, UDP-glucose flavonoid 3-O-glucosyl transferase; MT, methyltransferase.

https://doi.org/10.1371/journal.pone.0305837.g001

The activity of different branches of flavonoid biosynthesis is controlled by transcriptional activation of the involved genes. FLS gene expression is activated by MYB transcription factors of the subgroup7 and this mechanism appears to be generally conserved across plant species with minor lineage-specific variations [49,50]. Recently, a pollen-specific activation of FLS by sugroup19 MYBs was described [51,52]. Transcriptional activation of DFR is controlled by an MBW transcription factor complex that is named after the three components: MYB, bHLH, and WD40 [53]. While research on A. thaliana mutants suggests that positive regulation dominates the gene expression control of flavonoid biosynthesis genes, several negative regulators have been identified through research on many different plant species [54].

Apart from the overall activity of FLS and DFR, substrate selectivity plays a crucial role in determining the production of flavonols and anthocyanins [55,56]. This selectivity is evident in species like Freesia hybrida, in which the presence of abundant kaempferol glycosides and delphinidin derivatives [57] indicates the separation of flavonol and anthocyanin biosynthesis based on the contrasting substrate preferences of DFR and FLS [57]. Despite testing various DFR copies in Freesia hybrida, none of them exhibited a preference for DHK, while all favored DHM as the substrate [57]. Some DFRs have been found to accept all three potential dihydroflavonols, indicating that the availability of specific substrates determines the class of anthocyanins produced. Other DFRs are incapable of accepting dihydrokaempferol as a substrate, resulting in the absence of orange anthocyanin-based pigmentation in species like petunia, apple, pear, hawthorn, rose, and cranberry [48,56,58]. The substrate specificity determining region was predicted by a multiple alignment of polypeptide sequences [59]. This region was further investigated based on the Gerbera hybrida DFR which can accept DHK and leads to orange flower pigmentation [60]. Two important amino acid residues implicated in the substrate specificity of DFR were identified based on chimeric Gerbera hybrida and Petunia hybrida DFRs followed by site-directed mutation experiments [56]. The relevance of all positions in a region of 26 amino acid residues was evaluated by permutation experiments. The mutation E145L of the G. hybrida DFR resulted in a non-functional enzyme while the N134L mutation restricted the substrate preference to dihydrokaempferol and excluded dihydroquercetin or dihydromyrecitin [56]. Many following studies confirmed the particular relevance of this N134 residue (position 133 in Arabidopsis thaliana DFR) and identified additional amino acid residues with specific properties [55,6163]. In summary, N allows the acceptance of all dihydroflavonols, L restricts the substrate to DHK, D enables acceptance of DHQ and DHM, and A leads to a high DHK affinity (Fig 2).

thumbnail
Fig 2. Illustration of the branching point in the flavonoid biosynthesis leading to anthocyanins and flavonols, respectively.

Asterisk (*) shows that the reactions from dihydroflavonols to anthocyanins also required the additional enzymes anthocyanidin synthase (ANS), anthocyanin-related glutathione S-transferase (arGST) and UDP-dependent anthocyanidin 3-O-glucosyltransferase (UGT) downstream of DFR. DFR and FLS can process substrates with different hydroxylation patterns leading to three distinct products each. The characters at the arrows represent the previously reported substrate-preference-determining residues of FLS and DFR. The DFR residues, represented as purple characters depict the 133 position while the FLS residues, represented as yellow characters depict the 132 position in FLS (position according to AtDFR and AtFLS1).

https://doi.org/10.1371/journal.pone.0305837.g002

While the substrate preference of DFR received substantial attention, little is known about the substrate preferences of FLS. Five specific residues, namely H132, F134, K202, F293, and E295, have been reported to be involved in the binding of the DHQ substrate [64]. Previous research in A. thaliana studied the impact of amino acid substitutions in AtFLS through incubation with DHQ. The mutants, H132F and H132Y exhibited 124% and 83% activity, respectively, compared to the wild-type (WT) [64]. The KM values of mutants H132F and H132Y displayed an approximately 2-fold decrease compared to the WT counterpart [64]. Conversely, mutations in the other four residues led to a significant decrease in FLS activity. Interestingly, the four highly conserved residues were found to be crucial for substrate binding and catalytic activity. The H132 residue was found to be variable across different flavonol synthases, with some having F residues and others having H residues. These findings suggested that the H132 position may play a role in determining substrate preferences among different flavonol synthases [64]. It is postulated that the position of H132 interacts via a hydrogen bond with the B-ring hydroxyl group of the dihydroflavonol substrates [65]. Given the previous reports about the relevance of N133 in DFR and H132 in FLS, we will refer to them as substrate-preference determining residues throughout this manuscript, while not ruling out the contribution of additional amino acid residues.

While the model organism A. thaliana has only a single DFR gene [66], multiple DFR copies have been reported in many other plant species. For example, studies in buckwheat [67] and strawberry [68] have identified DFR copies that show substantial differences in their substrate preference and consequently also in the produced anthocyanin classes. Genetic analysis of DFR1 and DFR2 in buckwheat suggested that both genes are genetically linked, but the genetic distance does not suggest a recent tandem duplication [67]. Buckwheat DFR2 differs from DFR1 by an asparagine to valine substitution at a position that corresponds to 134 in the Gerbera hybrida sequence and the presence of an additional glycine between positions 138 and 139. Enzyme assays suggest that DFR1 prefers DHK and DHM, while DFR2 prefers DHQ. DFR1 is expressed in most tissues, while DFR2 expression is restricted to seeds and roots. Strawberry DFR1 has a high preference for DHK and does not accept DHQ and DHM (‘DHK specialist’), while DFR2 does not accept DHK as a substrate, but works on DHQ and DHM (‘DHK rejector’) [68].

The objective of this study was to reveal mechanisms that control and mitigate the substrate competition of FLS and DFR. A systematic analysis of substrate-preference-determining amino acid residues in the polypeptide sequences of FLS and DFR revealed differences between monocots and dicots. Cross-species gene expression revealed an almost mutually exclusive activity of FLS and DFR, which could be another mitigation mechanism.

Methods

Analyzed species and data sets

The coding sequences for 211 plant species were retrieved from Phytozome [69] and NCBI/GenBank [70] (S1 File). The selection of plant species encompassed a diverse taxonomic range, including 20 Thallophyta (algae and fungi), 4 Bryophyta (mosses), 1 Pteridophyta (ferns and fern allies), 11 Gymnosperms (naked seed plants), and 175 Angiosperms (flowering plants). A Python script [71] was used to translate the coding sequences into corresponding polypeptide sequences.

Flavonoid biosynthesis gene identification

The polypeptide sequences were analyzed with KIPEs3 v0.35 [72,73] to identify flavonoid biosynthesis genes in each plant species. KIPEs3 itself supplied the bait sequences and essential residue information required for the analysis (flavonoid biosynthesis dataset v3); leading to the identification of candidate genes for FLS (flavonol synthase) and DFR (dihydroflavonol 4-reductase). In addition, flavanone 3-hydroxylase (F3H), flavonoid 3’-hydroxylase (F3’H), and flavonoid 3’5’-hydroxylase (F3’5’H) genes were identified in the investigated plant species. KIPEs3 also enabled the identification of FLS-like and DFR-like genes within the datasets.

Multiple sequence alignment and phylogenetic tree construction

The DFR and DFR-like polypeptide sequences and the FLS and FLS-like polypeptide sequences were aligned separately using MAFFT v7.310 [74] with default penalties for gaps and the protein weight matrix of BLOSUM62 [75]. The alignments were generated using the L-INS-i accuracy-oriented method and incorporated local pairwise alignment. The amino acid alignments were translated back to codon alignments using pxaa2cdn [76]. The alignments were cleaned using pxclsq [76] to remove alignment columns with very low occupancy (<0.1). Multiple ANR and CCR sequences retrieved from GenBank were used as an outgroup for DFR tree construction. Previously characterized DFRs were also included. Multiple F3H and ANS sequences obtained from GenBank were used as an outgroup for FLS tree construction to facilitate a comprehensive investigation in the context of the 2-oxoglutarate-dependent dioxygenase (2-ODD) family. Previously described FLS sequences were also included. The best model of evolution was inferred using ModelFinder [77]. Maximum likelihood trees were constructed for deciphering DFR and FLS evolution using IQ-TREE v2.0.3 [78] with 1,000 bootstrap replicates and the GTR+F+ASC+R10 nucleotide model. Additional phylogenetic trees were constructed based on MAFFT v7.310 and Muscle v5.1 (super5 algorithm) with IQ-TREE v2.0.3, FastTree v2.1.10 [79] using the GTR+CAT model, and MEGA v11 [80] using the neighbor-joining method, 1,000 bootstrap replicates and the Maximum Composite Likelihood model. The topologies of resulting trees were manually compared to validate the crucial nodes supporting major conclusions. Phylogenetic trees were visualized using iTOL v6.8 [81].

Transcriptomic analyses

For transcriptome analyses, a subset of 43 species was selected based on specific criteria: (1) the presence of FLS, DFR, F3H, and F3’H genes in the species, (2) the availability of sufficient transcriptomic datasets in the Sequence Read Archive (SRA) and (3) the presence of clear labels for RNA-seq samples with a minimum of three accession numbers corresponding to at least a single plant organ (leaf, root, stem, flower, seed, or fruit). To generate the count tables, all available paired-end RNA-seq datasets of the selected species were retrieved from the SRA (www.ncbi.nlm.nih.gov/sra) [82] using fastq-dump (https://github.com/ncbi/sra-tools). Kallisto v0.44 [83] was then employed to quantify transcript abundance based on the coding sequences of the respective plant species. The previously developed Python script kallisto_pipeline3.py [84,85] served as a wrapper to run kallisto across all input files. Next, the Python script merge_kallisto_output3.py [84,85] was executed to merge the individual kallisto output files into one count table per species. The count tables were processed to create a unified count table, encompassing the expression values of FLS and DFR across all 43 species. The values of close paralogs were replaced by their sum if these paralogs had the same substrate-preference determining residue at the most important position. RNA-seq based transcript abundance is taken as a proxy for gene expression and thereafter, in the expression analysis, referred to as ‘gene expression’.

To explore the expression patterns of FLS and DFR, a 2D density heatmap with marginal histograms was generated using the Python script Coexp_plot.py available at https://github.com/bpucker/DFR_vs_FLS.

Results

Substrate-preference determining amino acid residues in FLS and DFR and their pattern of occurrence in land plants

From a vast range of diverse taxonomic plants, DFR was identified only in angiosperms. Among the 175 angiosperm datasets analyzed, 129 exhibited at least one most likely functional DFR, characterized by the presence of essential residues associated with DFR activity. Overall, 207 DFR sequences were identified, accounting for cases where multiple DFRs were present in certain species. Multiple sequence alignments of the identified DFR sequences revealed potential NADPH binding and substrate binding sites in N-terminal regions (Fig 3). Analysis based on the conservation of the amino acid residue at position 133, i.e., 3rd position within the 26-amino acid-long substrate binding domain of DFR led to the classification of DFR proteins into three types: (i) DFR with asparagine (N) at the 133rd position which can recognize all three dihydroflavonols as substrates, (ii) DFR with aspartic acid (D) at the 133rd position which shows higher preference for dihydroquercetin and dihydromyricetin, and (iii) DFR characterized by alanine (A) at the 133rd position which exhibits a preference for dihydrokaempferol and a higher likelihood of rejecting dihydromyricetin. Henceforth, these three DFR types will be referred to as DFRN, DFRD, and DFRA, respectively.

thumbnail
Fig 3. Selected parts of multiple sequence alignment of DFR sequences restricted to 10 species to reduce the complexity.

Monocot and dicot species are highlighted by light pink and light green color, respectively. The dark violet, mid violet, and light violet colors indicate 100%, >75%, and >50% similarity among the sequences, respectively. Important domains associated with DFR functionality are highlighted and some columns are masked with three dots (…). Red boxes highlight the NADPH-binding domain and the 26 amino acid long substrate-binding domain. The red-star labeled residue at the 3rd position within this region is crucial for dihydroflavonol recognition (position 133 in the A. thaliana DFR). MAFFTv7 was applied to generate the alignment.

https://doi.org/10.1371/journal.pone.0305837.g003

Sequences of FLS enzymes were successfully identified in the datasets of 143 out of 175 angiosperm species. The inability to identify FLS sequences in some of the plant species may be attributed to technical reasons, i.e., incompleteness of the analyzed data sets. In total, 247 FLS sequences were identified, including instances where multiple copies of FLS were present within a species. FLS enzymes belong to the 2-oxoglutarate-dependent dioxygenase (2-ODD) superfamily. A comprehensive analysis of the multiple sequence alignment revealed the presence of characteristic conserved ferrous iron-binding motif (HX(D/E)XnH) and the 2-oxoglutarate binding residues (RXS) critical for FLS functionality (Fig 4). Furthermore, sequences of functional FLSs exhibited additional residues (G68, H75, P207, and G261) responsible for the proper folding of the 2-ODD protein. Specific motifs, such as "PxxxIRxxxEQP’’ and "SxxTxLVP’’ which were initially considered to be unique to FLS and distinguished it from other plant 2-ODDs [86] were also identified. However, a previous study identified two BnaFLS1 homologs which also exhibited F3H activity [87]. It was demonstrated that despite having these reportedly FLS-specific motifs, they could still have additional activities. FLS sequences also possessed five key amino acids (H132, F134, K202, F293, and E295) identified as potential DHQ binding site residues, with the latter four exhibiting notable conservation across diverse species. These residues were also found to be conserved in anthocyanidin synthase (ANS) sequences wherein they had a strongly conserved Y132 residue as well. Analysis based on the conservation of residue 132 led to the classification of FLS into three types: FLSH (histidine), FLSF (phenylalanine), and FLSY (tyrosine). The positions of amino acids are assigned based on the Arabidopsis thaliana sequences (AtDFR, AT5G42800, and AtFLS1, AT5G08640) for consistency and reference purposes.

thumbnail
Fig 4. Selected parts of an FLS multiple sequence alignment restricted to 10 species to reduce the complexity.

Monocot and dicot species are highlighted by light pink and light green color, respectively. The dark orange, mid orange, and light orange colors indicate 100%, >75%, and >50% similarity among the sequences, respectively. Important domains and residues associated with FLS functionality are highlighted and some columns are omitted as indicated by three dots (…). The conserved 2-ODD domain and the Fe2+ binding sites are indicated by black asterisks and red arrowheads, respectively. The black boxes highlight the FLS-specific motifs. The black arrowheads indicate potential DHQ-binding sites where the first residue (position 132) is thought to be critical for the substrate preference of FLS. The alignment was generated by MAFFT.

https://doi.org/10.1371/journal.pone.0305837.g004

The 129 diverse plant species harboring DFRs represented 28 orders out of the 64 orders recognized in the current Angiosperm Phylogeny Group (APG IV;2016) classification of angiosperms [88]. Interestingly, all monocot plants investigated displayed DFRN, with only a few minor exceptions which displayed neither of the three types. Among dicots, some species exclusively possessed either DFRN or DFRD, while others exhibited a combination of either DFRN and DFRD or DFRN and DFRA. Remarkably, DFRA, which is known to have a strong preference for DHK as a substrate, previously reported only in Fragaria, was also identified in Spirodela polyrhiza, Spirodela intermedium, and Begonia peltatifolia (S2 File). A total of 143 angiosperm plant species encompassing 25 orders were analyzed for functional FLS genes. Among these species, monocots exhibited either FLSF or FLSY, while all dicots possessed FLSH with a histidine residue at position 132. However, certain orders such as Fabales, Fagales, Malpighiales, and Malvales displayed multiple FLS candidates in some species with both FLSH and FLSY, characterized by histidine and tyrosine residues, respectively. (S3 File). The overall pattern of amino acid residues at position 133 in DFR sequences and position 132 in FLS sequences at the order level is summarized in Fig 5.

thumbnail
Fig 5. The patterns of commonly occurring amino acid residues at substrate-preference determining positions observed in different orders of angiosperms for FLS and DFR.

Orders are sorted by branching in the evolution of angiosperms according to the Angiosperm Phylogeny Group (2016). The residues were investigated across 172 angiosperm species and are represented here at the order level. Monocot and dicot species are highlighted by light pink and light green color, respectively. Rosales II consists of 2 species: Fragaria x ananassa and Fragaria vesca. n represents the number of analyzed species harboring DFR and FLS, respectively, within each order.

https://doi.org/10.1371/journal.pone.0305837.g005

Evolution of DFR

To understand the evolutionary relationships among multiple DFR sequences, a phylogenetic tree based on DFR protein-encoding sequences was constructed. Given that DFR belongs to the short-chain dehydrogenase (SDR) family, the closely related ANR, CCR, and DFR-like sequences from this family were included as outgroups. In the resulting analysis, the ANRs, and DFRs clustered together, distinct from the CCRs, as depicted in S4 File. The two clades placed as sisters to the CCR clade had A. thaliana tetraketide α-pyrone reductase (AtTKPR1 and AtTKPR2) in each clade, essential for pollen wall development and male fertility. TKPR2, known to be more active than TKPR1, clustered closer to the CCR clade. The TKPR1-like, TKPR2-like, CCRs and a large clade with an unknown function clustered together and were placed as a sister clade to the DFR and ANR clade. Adjacent to the DFR clade, another clade featured AtBEN1-encoding a DFR-like protein involved in the brassinosteroid metabolism in A. thaliana. This clade did not contain any sequences from the gymnosperms or monocots, only dicots were found. The proximity to the DFR clade and the absence of monocots suggests its evolutionary origin from a DFR predecessor gene after the split of the monocot and dicot lineages.

We did not identify any Gymnosperm DFR in our analysis, possibly due to their early-branching nature and distinct conserved residues compared to angiosperms. Previously characterized DFR sequences (S5 File), marked with red asterisks, were generally located within the DFR clade, except for fern Dryopteris erythrosora, DeDFR1 and DeDFR2, Ginkgo biloba, GbDFR1, GbDFR2, and GbDFR3, and Camellia sinensis, CsDFRc. The placement of these sequences could suggest an independent origin of DFR activity or enzymatic promiscuity in closely related clades. Fern DFRs, placed outside the ANR and DFR clade, may exhibit distinct characteristics due to their early-branching nature or possess promiscuous functions. Ginkgo DFRs within TKPR-like clades likely deviate from bona fide DFRs. The phylogenetic analysis of functional DFRs highlights distinct clades specific to monocots and dicots (Fig 6). Monocots mostly exhibit DFRN, while dicots generally display various DFR types—DFRN, DFRD, and DFRA. Notably, DFRD appears to have originated from a duplication event of DFRN, as observed in the Fabales and Malvales clades. DFRA is restricted in the Rosales and Alismatales clades. These findings suggest possible scenarios where dicots experienced a duplication event of DFRN, followed by a subsequent substitution event resulting in the emergence of DFRD. Furthermore, in the Rosales and Alismatales clade, the duplication could have been accompanied by a substitution from N to A. Upon examining the codons of the DFRA coding sequence at position 133 in F. vesca and F. ananassa and comparison against closely related DFRN sequences, we found that the codons did not exhibit any predisposition towards the emergence of DFRA in strawberries (AAT to GCC) (S6 File).

thumbnail
Fig 6. Phylogenetic analysis of DFR sequences in diverse plant species, highlighting amino acid residue diversity at position 133 associated with substrate specificity.

Gymnosperm, monocot, and dicot species are denoted by light blue, light pink, and light green color stripes, respectively. Non-DFR sequences are represented by dashed gray branches while the functional DFRs are indicated by solid black branches. The color-coded scheme represents different residues: Asparagine (light purple), Aspartic acid (periwinkle blue), Alanine (deep purple), and other amino acids (gray) at position 133. The preferred substrate of the DFR type is written in brackets: DHK, dihydrokaempferol; DHQ, dihydroquercetin and DHM, dihydromyricetin. Distinct clusters of DFRs from major plant orders are labeled for reference. DFR sequences identified in previous studies are highlighted by an asterisk at the start of the terminal branch, with asterisks of functional DFR genes colored in red (S5 File). Leaf labels are hidden to reduce the complexity. The outgroup comprises SDR members like ANRs, CCRs, and other DFR-like sequences (S4 File).

https://doi.org/10.1371/journal.pone.0305837.g006

Deep duplication events during the evolution of FLS

Through a phylogenetic tree generated with FLS protein-encoding sequences, we observed deep duplication events during the evolution of FLS enzymes. Considering their classification within the 2-oxoglutarate-dependent dioxygenase (2-ODD) family, closely related ANS, F3H, and FNSI sequences from the same family and FLS-like sequences lacking crucial residues for FLS functioning were included as outgroups. The phylogenetic tree revealed a largely distinct clustering of F3H, ANS, and FLS sequences. ANS and FLS formed a common cluster, while F3H grouped with FNSI sequences (S7 File).

The non-Apiaceae FNSI sequences clustered outside the F3H clade while the liverworts FNSI and Apiaceae FNSI were a part of a single clade with phylogenetically distinct F3H sequences. The liverworts FNSI are present at the root of this clade suggesting that plant F3H evolved from liverworts FNSI. An independent lineage of Apiaceae FNSI is located nested within the Apiaceae F3H sequences further confirming the evolution of Apiaceae FNSI from F3H [84,89]. Gymnosperm, monocot, and dicot sequences formed distinct clades within larger clades corresponding to the three 2-ODD members, i.e., F3H, ANS, and FLS. The FLS clade was divided into four primary clusters: a Gymnosperm clade branching first, a second clade which we hereafter call aFLS (ancestral FLS), a third monocot clade, and lastly, a large dicot clade. The aFLS clade encompasses FLS-like sequences from basal angiosperms and dicots and a few dicot FLS sequences. Only nine FLS sequences, one each from Linum usitatissimum, Lotus japonicus, Cajanus cajan, Quercus suber, Betula platyphylla, Acacia confusa AcFLS (JN812062), Camellia sinensis CsFLSa (KY615705), Fagopyrum tataricum FtFLS2 (JX401285), and Vitis vinifera FLS5 (AB213566) were in this clade. Although these FLS-like sequences covered a wide range of angiosperms, no monocots were represented in this clade. The presence of this aFLS clade sister to the distinct monocot and dicot clade implies an FLS duplication event preceding the monocot-dicot split. The large dicot cluster further separates into two distinct subclades: one including sequences exclusively belonging to the Brassicales order, and the other including sequences from the rest of the dicot plants (Fig 7). In the Brassicales subclade, a monophyletic group of Brassicales FLS and FLS-like sequences is evident, with AtFLS1 and sequences of other functional Brassicales FLSs clustered together. The presumed non-functional FLS sequences from the Brassiclaes order group together and are classified based on their phylogenetic relationship to their most likely A. thaliana orthologs (AtFLS2-AtFLS6). Within the latter subclade which excludes Brassicales, some FLS sequences from the same order appear in distinct clusters as observed in the Fagales, Sapindales, Malpighiales, and Malvales order, suggesting gene duplication events in the evolutionary history of FLS in some common ancestor of these orders.

thumbnail
Fig 7. Phylogenetic diversity of FLS sequences in diverse plant species, highlighting amino acid residue diversity at position 132 associated with DHQ-substrate binding.

Gymnosperm, monocot, and dicot species are denoted by light blue, light pink, and light green color stripes, respectively. Non-FLS sequences are represented by dashed gray branches while the functional FLSs are indicated by solid black branches. The background color highlights different presumably substrate-preference-determining amino acid residues: histidine (pale yellow), phenylalanine (dark orange), and tyrosine (dark golden) at position 132. The hypothesized preferred substrate of the FLS type is written in brackets: DHK, dihydrokaempferol; DHQ, dihydroquercetin, and DHM, dihydromyricetin. Distinct clusters of FLSs from major plant orders are labeled for reference. FLS sequences identified in previous studies are highlighted by an asterisk at the start of the terminal branch with asterisks of functional FLS genes colored in red (S5 File). The branches of Arabidopsis thaliana AtFLS1-AtFLS6 are labeled. Individual leaf labels are hidden to reduce the complexity. The outgroup comprises members of 2-ODD like F3H, ANS, and other FLS-like sequences (S7 File).

https://doi.org/10.1371/journal.pone.0305837.g007

While the first and last residues of the FLS substrate binding residues (H132, F134, K202, F293, and E295) exhibit diverse residues, the central three are strongly conserved in both FLS and ANS sequences. Notably, gymnosperm FLSs and aFLSs feature a Y132 residue, consistent with ANS sequences. Monocotyledonous plants display an F/Y(85%/15%) residue at position 132, whereas the majority of dicots consistently possess an H residue at position 132, except for multiple Gossypium species and Herrania umbratica from the Malvales order, which have a tyrosine residue at this position. The E295 residue is conserved in ANS sequences as well as in monocot and dicot FLSs. However, sequences from the gymnosperm, aFLSs, AtFLS4, and AtFLS5 clades exhibit non-conserved amino acid residues at position 295. The aFLSs exclusively feature aliphatic amino acids at this position (G/A/V/L/I).

Divergent expression patterns of FLS and DFR

Generally, either DFR or FLS is expressed in a given sample, but almost never both simultaneously (Fig 8a). This observation held true across various FLS and DFR types, underscoring a general phenomenon. However, the pattern intriguingly deviates in some cases.

thumbnail
Fig 8. 2D density heatmap with marginal histograms showing divergent expression patterns of FLS vs DFR across samples from 43 species.

Each sample shows only the expression of FLS or DFR, but not both. The sample size is indicated by n. Colorbar depicts the logarithmic density of points in the plot. (a) Combined expression of all FLS and DFR types, (b-g) specific combinations of FLS and DFR types.

https://doi.org/10.1371/journal.pone.0305837.g008

The almost mutually exclusive expression patterns were apparent for all combinations of FLS and DFR types (Fig 8b–8g). These patterns were particularly pronounced in the DFRA vs FLSH expression and DFRN vs FLSY expression plots. Contrastingly, in cases such as DFRN vs FLSH and DFRD vs FLSH, instances of co-expression of FLS and DFR were observed in some samples. Theoretically, there could be 9 different FLS-DFR combinations, given three FLS and three DFR types were examined. The absence of certain combinations raises the possibility that not all FLS and DFR combinations may occur in nature. For instance, to the best of our knowledge, FLSF is restricted to monocots, exclusively paired with DFRN. Hence, combinations involving DFRD and DFRA alongside FLSF might not exist in nature.

Besides FLS and DFR, the major genes influencing the dihydroflavonol levels are F3H, F3’H, and F3’5’H. These genes play a crucial role in determining the hydroxylation patterns of dihydroflavonols, which are common substrates for FLS and DFR (Fig 1). We analyzed the tissue-specific expression of F3H, F3’H, and F3’5’H in 43 species and correlated the expression data with different DFR and FLS types (S8 and S9 Files). The results suggest that the production of specific dihydroflavonols is more active in some species compared to others. The heatmap analysis reveals distinct expression patterns: plants like Cicer arietinum and Citrus sinensis exhibit high FLS activity, while Vitis vinifera and Dianthus caryophyllus show higher DFR activity. The red leaf of Lactuca sativa displayed higher expression of F3H, F3’H, and DFRN compared to the green leaf, indicating a greater substrate availability in the red leaf. Differences in gene expression are linked to substrate preferences and pigment production, impacting plant coloration (S8 File).

Discussion

Factors influencing the competition of the pivotal enzymes FLS and DFR for the common substrate, dihydroflavonols, remain poorly understood. The equilibrium between these enzymes can be modulated by several mechanisms including (i) the unique substrate specificity exhibited by FLS and DFR for different dihydroflavonols, (ii) the existence of multiple FLS/DFR copies with narrow substrate preferences, or (iii) the expression of specific FLS/DFR copies in specific cell types or under specific conditions, (iv) the availability of substrates, i.e., different dihydroflavonols, determined by the activity of flavonoid 3’-hydroxylase and flavonoid 3’,5’-hydroxylase, and (v) the channeling of the substrate through metabolons.

Mitigating substrate competition between FLS and DFR

A well-studied 26-amino acid region determines substrate specificity in DFR [55,90] and the grape crystal structure further emphasized the significance of the N133 amino acid in binding the 3’ and 4’ hydroxyl groups of DHQ [63]. Compared to DFR, the substrate specificity determining residues of FLS has received less attention. The substrate-binding residues of FLS (H132, F134, K202, F293, E295) show high conservation among all plant FLSs, except the H132 and E295 residues. Through the examination of A. thaliana fls1 mutant, it was hypothesized that the H132 residue may play a critical role in governing the diverse substrate preferences of FLS enzymes [64]. Numerous previous studies have employed enzyme assays to investigate the substrate preferences of plant FLSs and DFRs, while many others identified the type of anthocyanins and flavonols in various plants. Metabolite accumulation within a system could be influenced by factors beyond substrate specificity like enzyme abundance, cellular compartmentalization, and metabolic regulation. Hence, to evaluate the general substrate specificity of an enzyme, it is best to compare kcat/KM values rather than metabolite accumulation. Due to the unavailability of enzyme kinetics data in previous studies, we had to utilize metabolite accumulation as a proxy to derive substrate preferences of DFR and FLS. For instance, AcFLS from the monocot Allium cepa, characterized by the Y132 residue, exhibits a preference for DHQ over DHK [91], while monocots such as OsFLS [92] and ZmFLS [93], with the F132 residue, favor DHK over DHQ. In contrast, DFRN in maize and Dendrobium officinale predominantly produces cyanidin derivatives [94,95]. DFRN of barley displays in vitro reduction of DHK and DHQ whereas DHM was not tested [96]. Reports of procyanidin and prodelphinidin in testa tissue and a lack of detection of leucopelargonidin and its derivatives [97] do not align with a DHK and DHQ specificity of DFR in barley. DFRN of grape hyacinth and Cymbidium hybrida exhibit high activity towards DHM and DHQ [90,98], and Anthurium andraeanum DFR, featuring an uncommon S133 residue, displays a marked specificity for DHK [99].

Previous studies and our analyses identified multiple DFR copies in dicots. For example, in the Fabales order, M. truncatula DFR1 and L. japonicus DFR2 and DFR3 with an N133 residue reduces DHK more readily than DHQ, while M. truncatula DFR2 and L. japonicus DFR5 with a D133 residue show a higher activity towards DHQ [61,62]. In these studies, the relative DFR activity was demonstrated by HPLC and spectrophotometric assays, respectively. In the Rosales order, FLSH from Fragaria ananassa [100], Malus domestica [101], and Rosa hybrida [102] reduces both DHK and DHQ, with strawberry FLS favoring DHK and rose FLS favoring DHQ. Conversely, their DFRN exhibits a higher affinity for DHQ than DHK [68,103,104]. The strawberry species, Fragaria ananassa and Fragaria vesca have another DFR-type, DFRA which has a very high specificity for DHK [68]. Other examples are L. japonicus and M. truncatula, where a DFRD is present with a high preference for DHQ, while the DFRN prefers DHK.

This observation seems to contradict previous reports about the substrate preferences of DFRN. A potential explanation might be sub-/neofunctionalization upon duplication of DFR. We speculate that through a yet unknown mechanism, DFRN might become more specific for the substrate that is not favored by the additional DFR copy. Waki et al. reported that chalcone isomerase-like proteins (CHILs) can bind to chalcone synthase (CHS) and rectify its promiscuous activity making CHS more specific [105]. In a conceptually similar mechanism, multiple DFR copies with distinct specificities could be influenced by external factors to adjust their substrate preferences, i.e., to make the DFRN type accept the substrate not favored by the additional DFR copy.

In the Brassicales, FLSH from A. thaliana and Matthiola incana prefer DHK [106108], while their DFRN reduces both DHK and DHQ but favors DHQ [109111]. Citrus unshiu FLSH has higher DHK activity [112], whereas Zanthoxylum bungeanum from the same order, Sapindales, has DFRD and prefers DHM followed by DHQ and DHK [113]. In Caryophyllales, FLSH in Dianthus caryophyllus, Fagopyrum dibotrys, and Fagopyrum tataricum catalyze both DHK and DHQ [114116], while DFRN in Dianthus reduces DHQ more effectively than DHK [115]. In Fagopyrum esculentum, DFRN, active in all tissues, reduces both DHK and DHM at the same rate, while the second DFR enzyme with a V134 residue, expressed only in roots and seeds, exhibits high specificity towards DHQ [67]. In Solanales, Petunia hybrida and Nicotiana tabacum FLSH catalyze both DHK and DHQ with minimal activity towards DHM [58,117], whereas their DFRD is specific in reducing 3,4 di- and 3,4,5 tri-hydroxylated substrates but apparently cannot reduce DHK [55,56]. Gentiana triflora predominantly produces blue flowers and generally has cyanidin, delphinidin, and quercetin derivatives [118].

In our study, we observed that all monocots were mostly FLSF type, and some were FLSY type, while their DFR were predominantly DFRN type. Enzyme assay studies suggest that in monocots, FLSF generally prefers DHK while DFRN prefers double or triple-hydroxylated dihydroflavonol substrates. This difference in substrate preferences could help in avoiding competition between the two enzymes. In dicots, FLSH is observed in the majority of plants, but in fewer plants with multiple FLS sequences, FLSY is also observed. While DFR is more diverse in dicots, some have only DFRN or only DFRD, while others have DFRN along with either DFRD or DFRA candidates. Prior studies collectively suggest that FLSH in dicots can utilize both DHK and DHQ as substrates but may have a preference for one over the other in different plants, implying that the H132 residue remains more neutral between DHK and DHQ. The biosynthesis of DHM requires the presence of flavonoid-3’,5’-hydroxylase (F3’5’H) activity, which is rare. Dyer et al suggested that the competition for pollinators might have driven the evolution of blue flower color [119,120]. DHQ and DHM (if F3’5’H is present) appear to be the preferred substrate of DFR in most cases. Both in monocots and dicots, double and triple-hydroxylated substrates seem to be preferred by DFR, aligning with the notion that cyanidin is the most abundant and widely distributed floral anthocyanin [121]. While substrate-specifying residues may play a role in identifying the specificity of FLS and DFR enzymes, it is plausible that substrate preference may not be exclusively determined by the H132 and N133 residues, respectively, in all plants. Given the rapid progress in protein structure prediction [122], it might be possible to quantify the contribution of additional residues in the near future through the integration of all available data.

Functional divergence during DFR evolution

Short-chain dehydrogenases/reductases (SDR) is an NADP-dependent enzyme superfamily ubiquitous across all life forms. DFR, belonging to the SDR108E enzyme family, converts dihydroflavonols (DHK, DHQ, and DHM) into colorless leucoanthocyanidins. These leucoanthocyanidins subsequently undergo further modifications to produce a wide array of pigments such as anthocyanins and proanthocyanidins (also known as tannins). Cinnamoyl-CoA reductases (CCRs), involved in lignin biosynthesis, likely originated early in land plants and diverged distinctly from other SDR108E family [123], as seen by the distinct clustering of CCRs with DFR and ANR in our phylogenetic analysis. DeDFR1 and DeDFR2 in Dryopteris erythrosora, GbDFR1, GbDFR2, and GbDFR3 in Ginkgo biloba, and CsDFRc in Camellia sinensis were positioned outside the clade comprising functionally characterized DFRs. Fern DFRs, showing both in vitro and in vivo DFR activity, displayed a unique R residue at position 133, exhibiting substrate specificity for DHK and DHQ but an inability to catalyze DHM [124]. The positioning of these fern DFRs outside the DFR and ANR clade could suggest the independent evolution of DFRs in ancient land plants. Notably, ginkgo DFRs [125], questioned by Halbwirth et al (2019) [48] for their authenticity, were found within TKPR-like clades in our analysis, suggesting their classification as DFR-like sequences rather than bona fide DFRs. CsDFRc capable of restoring Arabidopsis tt3 mutant phenotypes [126] supports its DFR activity. At first glance, the positioning of CsDFRc outside the functional DFR clade could suggest that it might have evolved independently. Further investigation revealed that the clade containing CsDFRc also harbored an Arabidopsis sequence, AT4G27250, named ABA HYPERSENSITIVE 2 (ABH2) in a study by Weng et al (2016) [127]. This ABH2 is the long-sought-after phaseic acid reductase (PAR), which is a DFR-like enzyme involved in abscisic acid catabolism. This suggests potential promiscuous functions of the sequences in this clade where they have a PAR functionality but could also catalyze DFR-like reactions, warranting further exploration in future studies. A thorough examination of the clade comprising functionally characterized DFRs revealed primarily three distinct DFR types determined by the amino acid residue at position 133—DFRN, DFRD, and DFRA. Our phylogenetic analysis indicates that DFR underwent duplication events, likely associated with N133D and N133A substitutions. These duplications and substitutions led to the acquisition of novel or enhanced functions, characterized by narrower substrate specificity, compared to the non-specific DFRN. Until now, DFRA was considered exclusive to the Fragaria genus, but our detailed exploration has revealed DFRA in the Spirodela genus (S. intermedium and S. polyrhiza) and Begonia peltafolia. This discovery suggests a more widespread distribution of DFRA and multiple independent origins, although the power of this study is limited by available plant sequences in public repositories.

While we could not identify DFR in gymnosperms, the DFR sequences of monocots and dicots formed distinct clades. Notably, two monocot clades were observed—one dominated by DFRN sequences and another smaller one situated at the root of the dicot clade, harboring non-DFRN sequences of monocots. These sequences exhibited amino acid diversity at position 133, such as cysteine (C) in Musa acuminata, alanine (A) in Spirodela polyrrhiza, and threonine (T) in Dioscorea alata and D. rotundata. While cysteine and threonine share similar properties with alanine, predictions about their performance, such as a preference for DHK, remain speculative. Given the lack of enzyme assay studies for these proteins, the functional status of this clade remains an open question.

Our analysis identified DFRs that do not conform to DFRN, DFRD, or DFRA types, albeit in limited numbers. Few such DFRs have already been described, like Lotus japonicus DFR1 [62], and Ipomoea nil DFR-A [128], featuring residues S and H at position 133, respectively, were found to lack DFR activity. It remains unknown whether these have undergone neo-functionalization or are pseudogenes. Others, like monocot Anthurium andraeanum DFR, with a S133 residue, exhibit high activity towards DHK [99], while Vaccinium macrocarpon DFR1-1, DFR2-1 [129], and Fagopyrum esculentum FeDFR2 [67], featuring a V133 residue, prefer DHQ over other dihydroflavonols. Additionally, we identified DFR sequences with residues not studied by Johnson et al, including Musa acuminata (C133), Malus domestica (I133), Carica papaya (S133), Fagus sylvatica, and Quercus suber (R133). Investigating residues beyond N, D, and A at position 133 could unveil further insights into DFR functionality and substrate specificity.

DFRs with known functional activity and specificity towards DHK, as observed in strawberries where DFRA is active in a ’false fruit,’ and in A. andraeanum where DFRS is active in spathe leaf—a modified leaf—suggest that unconventional or modified structures could prompt the evolution of different DFRs. Additional DFR types may be discovered in the future within specific organs of plants that assume the function of fruit, flower, or leaves.

Functional divergence during FLS evolution

In the plant kingdom, the 2-oxoglutarate-dependent dioxygenase (2-ODD) superfamily is one of the largest enzyme families only second to cytochrome P450s (CYPs). Within this superfamily, the plant 2-ODD family can be categorized into three classes: DOXA, DOXB, and DOXC, with all enzymes involved in flavonoid biosynthesis falling into DOXC [130]. These 2-ODD enzymes are believed to have originated from a common ancestor before the emergence of land plants, and then underwent species-specific evolution in response to diverse environmental conditions [130]. The evolution of FLS and ANS is likely to have occurred after the emergence of F3H during seed plant evolution [130]. A noteworthy observation similar to that reported by Wang et al. (2019) [131], is the placement of ginkgo 2-ODD genes (GbFLS, GbANS, and GnF3H) into FLS, ANS, and F3H clades, respectively, indicating the divergence of F3H, FLS, and ANS preceded the separation of gymnosperms and angiosperms.

A comprehensive analysis of the functional FLS clade revealed a deep duplication in the evolutionary history of the FLS gene. We found that the FLS could be split into four different subgroups, gymnosperm sequences, aFLS, monocot sequences, and dicot sequences. The aFLS sequences consisting of dicot sequences were more similar to gymnosperm FLSs than to the monocot and dicot FLS sequences in the distinct latter clades suggesting the duplication of ancestral FLS before the divergence of monocots and dicots. The aFLS clade contains FLS-like sequences from basal angiosperms and dicots and some previously characterized dicot FLS sequences. They have a Y132 residue and an aliphatic amino acid residue at position 295, mostly valine. Some of the FLS sequences in the aFLS clade are previously described. For example, the Fagopyrum tataricum FtFLS2 is mainly expressed in roots, flowers, and immature seeds and is upregulated by exogenous application of salicylic acid and NaCl and not affected by abscisic acid [132134]. However, activity analysis of FtFLS2 was not performed. Camellia sinensis CsFLSa has high expression in buds but the buds mostly accumulate catechins. Heterologous expression of CsFLSa in tobacco failed, hence, FLS functionality remains unknown [135]. In Vitis vinifera, FLS5 has very high transcript levels but the quercetin accumulation coincides with the transcription of FLS4 (H132 residue) and not FLS5. The shaded berries accumulate as much FLS5 mRNA as the control berries while the FLS4 (H132 residue) mRNA is not accumulated at all [126]. AcFLS is expressed in almost all plant parts and the highest abundance is seen in flowers. The maximum level of AcFLS mRNA level was recorded six hours after wounding [136]. We observed in our study that Lotus japonicus FLS4 has a low overall transcript level. No previously described aFLS sequence has been tested to show in vivo FLS activity. The position of aFLS clade separate from the canonical FLS clade could indicate that members of the ancestral FLS clade might not be canonical FLS, but have undergone potential neofunctionalization since separation from the canonical FLS lineage.

The large dicot clade comprises a Brassicales order subclade and another clade comprising the rest of the dicots. Both clades show multiple duplication patterns at a shallower level. We presume that sequences in the Brassicales clade have functions similar to their A. thaliana AtFLS1-AtFLS6 orthologs. Preuss et al (2009) demonstrated that only AtFLS1 and AtFLS3 exhibit FLS activity, with other AtFLSs identified as non-functional and AtFLS3 only showing activity under extended assay conditions [137]. However, other studies only found strong evidence for FLS activity in the Brassicales FLS1 clade [87,106]. The presence of the FLS2-FLS6 clade across various Brassicales species suggests a potential undiscovered function for these enzymes. Distinct E295 residue patterns are observed among the FLS clades; AtFLS1, AtFLS2, AtFLS3, and AtFLS6 have E/D295 residue, whereas AtFLS4 and AtFLS5 have other non-conserved residues at this position. The AtFLS5 clade features aliphatic amino acids at this position, and AtFLS5 shows expression primarily in the roots—especially in the roots of seedlings [106]. Enzyme assay results by Chua et al. (2018) indicate that the mutations H132Y and H132F show higher specific activity than wild-type Arabidopsis FLS1, with E295L mutation resulting in only 7% of WT activity. However, the study only used DHQ as a substrate, and DHK was not tested. Additional studies testing the enzyme activity of mutant FLS (H132Y, H132F, and E295(V/L)) with different substrates would help to confirm the function of these aFLSs and the substrate preferences of FLSs with different residues.

In the second subclade, an interesting evolutionary pattern is observed, where FLS sequences from some plant orders appear in two different clades, particularly noticeable in the Fagales, Malpighiales, Malvales, and Sapindales orders. This suggests a gene duplication during the evolution of land plants probably before the split of the aforementioned orders followed by a significant divergence in FLS sequences in these orders due to functional requirements, leading to the formation of distinct FLS clades. Within the Malvales order, including cotton species, an additional duplication event was observed, originating from FLSH duplication, followed by a subsequent substitution leading to the re-emergence of FLSY (H132Y). The exclusive presence of the Y residue at position 132 in the Malvales order indicates a potential evolutionary step to enhance substrate affinity. Other orders like Solanales, Rosales, and Fagales are only represented in a single FLS clade.

Although the Citrus unshiu FLS within the Sapindales I clade exhibits a significantly greater affinity for DHK compared to DHQ, it predominantly accumulates quercetin 3-O-rutinoside (rutin) [138]. The authors of this study proposed the presence of more than one FLS in C. unshiu [112]. In other Citrus species, like C. sinensis, C. trifoliata, and C. clementina, we observed two FLS candidates, one each in the Sapindales I and II clade. Our analysis and enzyme assay data from previous studies suggest that FLSs within the first dicot subclade (Fagales I, Sapindales I, Malpighiales I, and Malvales I) display a pronounced substrate specificity for DHK. In contrast, FLS sequences within the second dicot subclade show a preference for both DHK and DHQ and may have a higher specificity for DHQ [100,102,114,115,126,135,139].

ANS/LDOX, which catalyzes the step immediately downstream of DFR, is closely related to FLS and shows 50–60% polypeptide sequence similarity. Besides the conserved 2-ODD family residues, ANS/LDOX and FLS have similar amino acid residues that have been implicated in the DHQ binding of FLS (Y132, F134, K202, F293, and E295 conserved ANS/LDOX residues based on position in AtFLS1) and show the same substrate/ascorbate binding residue, E230 in AtLDOX. Bifunctional ANS enzymes with FLS-like activity have been discovered in G. biloba [140], O. sativa [141], A. thaliana [65], and M. truncatula [142]. In vitro studies have shown that ANS can convert leucocyanidin to cyanidin and also dihydroquercetin to quercetin. AtLDOX [137] converted DHQ and DHK into quercetin and kaempferol, respectively, in the ratio of 1:0.7 suggesting a higher affinity for DHQ at least in A. thaliana. In the case of flavonols, FLS might be specific for DHK and ANS might be responsible for converting DHQ and DHK (to a lesser extent) to respective flavonols. This could be the reason behind different patterns of kaempferol and quercetin accumulation observed in different plants. In the case of anthocyanidins, DFR is generally more likely to accept DHQ and the ANS also might be more adapted to produce leucocyanidins.

FLS and DFR expression pattern contributes to competition mitigation across angiosperms

The exploration of gene expression across 43 angiosperm species revealed global patterns and offered insights into the competition mitigation between FLS and DFR. There is a predominant concentration of data points along the x and y axes (Fig 8), indicating that co-expression of FLS and DFR almost never occurred. This observation suggests that in the competition between FLS and DFR, only one of them is active, directing the substrate either towards flavonol or anthocyanins. The separation might even take place at the single-cell level, which could explain samples that show some activity of both genes. The analyzed RNA-seq samples are mixtures of different cell types thus FLS and DFR expression could be divergent, but would appear as co-expression because RNA from multiple different cell types is mixed during extraction. Transcription factors might be responsible for causing an almost mutual exclusion of FLS and DFR expression. For example, the presence of DFRA, known for preferring DHK, alongside FLSH, also inclined towards DHK, was associated with the exclusive expression of only one of the corresponding genes within each sample. Similar dynamics were observed for DFRN and FLSY, where both might favor DHQ, providing empirical evidence for substrate competition dynamics. In other cases, like FLSH and DFRN or FLSH and DFRD, there are some samples where both genes are co-expressed. The preference of the above-mentioned DFRs for DHQ and DHM, juxtaposed with FLSH’s affinity for DHK emphasizes the intricate substrate-specific regulation between the genes, thereby mitigating the competition.

Power and limitations of big data upcycling

The wealth of available transcriptomic data offers unprecedented opportunities for efficient data upcycling and comprehensive cross-species comparison. However, it is crucial to acknowledge that many datasets in the Sequence Read Archive (SRA) lack comprehensive metadata, and in some instances, samples may suffer from cryptic or mislabeling issues, posing challenges for data upcycling [143]. Therefore, careful consideration, filtering, and validation of sample tissue information are imperative to ensure the reliability and robustness of data interpretation and subsequent analyses. Nonetheless, the benefits of data reuse in biology outweigh the challenges, especially with the surge in publicly available data and the affordability of sequencing [144]. In this study, we harnessed transcriptomic and genomic data from 43 species distributed across the angiosperms to perform expression analyses. We also utilized vast amounts of genomic data available for a large number of species and employed tools like KIPEs3 to identify functional flavonoid pathway genes with high specificity and accuracy [73]. Achieving this through wet lab methods would not only be laborious, time-consuming, and costly, but also impossible due to the unavailability of the biological material. This underscores the importance of open data to enable efficient exploration of complex biological processes and to facilitate the discovery of novel insights.

Conclusion

The branching point of the flavonoid biosynthesis into flavonols, catalyzed by FLS, and anthocyanins/proanthocyanidins, catalyzed by DFR, is a pivotal step for metabolic flux control. The availability of massive plant sequence datasets and gene expression resources enabled the investigation of FLS and DFR across a wide spectrum of plant species. This transformative approach revealed global patterns that cannot be identified by analyzing individual gene functions in one species through classical wet lab methods. Amino acid residues associated with substrate preferences, evolutionary patterns, and potential competition mitigation mechanisms were explored. These insights into competing branches responsible for producing colored anthocyanins and non-colored flavonols can guide molecular breeding or engineering of ornamental plants.

Supporting information

S1 File. References of the coding sequence data sets that were used in this study.

https://doi.org/10.1371/journal.pone.0305837.s001

(TXT)

S2 File. Collection of ANR, CCR, DFR, and DFR-like sequences that were used for the phylogenetic analysis of DFR.

https://doi.org/10.1371/journal.pone.0305837.s002

(TXT)

S3 File. Collection of F3H, FNSI, ANS, FLS, and FLS-like sequences that were used for the phylogenetic analyses of FLS.

https://doi.org/10.1371/journal.pone.0305837.s003

(TXT)

S4 File. Phylogenetic trees of DFR.

(1) DFR tree showing outgroups constructed by IQ-TREE based on an MAFFT alignment, (2a-f) DFR-specific trees with collapsed outgroups. Gymnosperm, monocot, and dicot species are denoted by light blue, light pink, and light green color stripes, respectively. Non-DFR sequences are represented by dashed gray branches while the functional DFRs are indicated by solid black branches. The color-coded scheme represents different substrate-preference-determining residues at position 133: Asparagine (light purple), Aspartic acid (periwinkle blue), Alanine (deep purple), and other amino acids (gray). The preferred substrate of the DFR type is written in brackets, DHK, dihydrokaempferol; DHQ, dihydroquercetin and DHM, dihydromyricetin. Distinct clusters of DFRs from major plant orders are labeled for reference. DFR sequences identified in previous studies are highlighted by an asterisk at the start of the terminal branch, with asterisks of functional DFR genes colored in red. (2a) Constructed by IQ-TREE based on a MAFFT alignment, (2b) constructed by IQ-TREE based on a Muscle5 alignment, (2c) constructed by FastTree2 based on a MAFFT alignment, (2d) constructed by FastTree2 based on a Muscle5 alignment, (2e) constructed by MEGA based on a MAFFT alignment, and (2f) constructed by MEGA based on a Muscle5 alignment.

https://doi.org/10.1371/journal.pone.0305837.s004

(PDF)

S6 File. Examination of codons at position 133 within the DFRA coding sequence in Fragaria vesca and Fragaria ananassa, along with a comparison against closely related DFRN sequences.

The codons and corresponding amino acids at position 133 are highlighted in red. The color-coded scheme represents different residues: Asparagine (light purple), Alanine (deep purple), and other amino acids (gray) at position 133.

https://doi.org/10.1371/journal.pone.0305837.s006

(PDF)

S7 File. Phylogenetic trees of FLS.

(1) FLS tree showing outgroups constructed by IQ-TREE based on an MAFFT alignment, (2a-f) FLS-specific trees with collapsed outgroups. Gymnosperm, monocot, and dicot species are denoted by light blue, light pink, and light green color stripes, respectively. Non-FLS sequences are represented by dashed gray branches while the functional FLSs are indicated by solid black branches. The background color highlights different presumably substrate-preference-determining amino acid residues at position 132: histidine (pale yellow), phenylalanine (dark orange), and tyrosine (dark golden). The hypothesized preferred substrate of the FLS type is written in brackets, DHK, dihydrokaempferol; DHQ, dihydroquercetin and DHM, dihydromyricetin. Distinct clusters of FLSs from major plant orders are labeled for reference. FLS sequences identified in previous studies are highlighted by an asterisk at the start of the terminal branch, with asterisks of functional FLS genes colored in red. (2a) Constructed by IQ-TREE based on a MAFFT alignment, (2b) constructed by IQ-TREE based on a Muscle5 alignment, (2c) constructed by FastTree2 based on a MAFFT alignment, (2d) constructed by FastTree2 based on a Muscle5 alignment, (2e) constructed by MEGA based on a MAFFT alignment, and (2f) constructed by MEGA based on a Muscle5 alignment.

https://doi.org/10.1371/journal.pone.0305837.s007

(PDF)

S8 File. Approximation of dihydroflavonol availability based on gene expression of F3H, F3’H, and F3’5’H in different plant species.

https://doi.org/10.1371/journal.pone.0305837.s008

(PDF)

S9 File. Extended heatmap of 43 species depicting the transcript abundance of analyzed genes in different tissues.

DFR* refers to DFR sequences where the residue 133 is not N, D, or A. In Musa acuminata C133 residue, Lotus japonicus S133, and Carica papaya S133.

https://doi.org/10.1371/journal.pone.0305837.s009

(PDF)

References

  1. 1. Winkel-Shirley B. Flavonoid Biosynthesis. A Colorful Model for Genetics, Biochemistry, Cell Biology, and Biotechnology. Plant Physiol. 2001 Jun 1;126(2):485–93. pmid:11402179
  2. 2. Koes R, Verweij W, Quattrocchio F. Flavonoids: a colorful model for the regulation and evolution of biochemical pathways. Trends Plant Sci. 2005 May 1;10(5):236–42. pmid:15882656
  3. 3. Paz-Ares J, Ghosal D, Saedler H. Molecular analysis of the C1-I allele from Zea mays: a dominant mutant of the regulatory C1 locus. EMBO J. 1990 Feb;9(2):315–21. pmid:2303027
  4. 4. Martin C, Prescott A, Mackay S, Bartlett J, Vrijlandt E. Control of anthocyanin biosynthesis in flowers of Antirrhinum majus. Plant J. 1991 Jul;1(1):37–49. pmid:1844879
  5. 5. Buer CS, Muday GK. The transparent testa4 Mutation Prevents Flavonoid Synthesis and Alters Auxin Transport and the Response of Arabidopsis Roots to Gravity and Light[W]. Plant Cell. 2004 May 12;16(5):1191–205.
  6. 6. Mo Y, Nagel C, Taylor LP. Biochemical complementation of chalcone synthase mutants defines a role for flavonols in functional pollen. Proc Natl Acad Sci. 1992 Aug;89(15):7213–7. pmid:11607312
  7. 7. Winkel-Shirley B. Biosynthesis of flavonoids and effects of stress. Curr Opin Plant Biol. 2002 Jun;5(3):218–23. pmid:11960739
  8. 8. Asen S, Stewart RN, Norris KH. Co-pigmentation of anthocyanins in plant tissues and its effect on color. Phytochemistry. 1972 Mar;11(3):1139–44.
  9. 9. Asen S, Stewart RN, Norris KH. Anthocyanin, flavonol copigments, and pH responsible for larkspur flower color. Phytochemistry. 1975 Dec;14(12):2677–82.
  10. 10. Emiliani J, Grotewold E, Falcone Ferreyra ML, Casati P. Flavonols protect Arabidopsis plants against UV-B deleterious effects. Mol Plant. 2013 Jul;6(4):1376–9. pmid:23371934
  11. 11. Jiang N, Doseff AI, Grotewold E. Flavones: From Biosynthesis to Health Benefits. Plants. 2016 Jun;5(2):27. pmid:27338492
  12. 12. Dixon RA, Sarnala S. Proanthocyanidin Biosynthesis—a Matter of Protection. Plant Physiol. 2020 Oct 1;184(2):579–91. pmid:32817234
  13. 13. Todd JJ, Vodkin LO. Pigmented Soybean (Glycine max) Seed Coats Accumulate Proanthocyanidins during Development. Plant Physiol. 1993 Jun 1;102(2):663–70.
  14. 14. Nakabayashi R, Yonekura‐Sakakibara K, Urano K, Suzuki M, Yamada Y, Nishizawa T, et al. Enhancement of oxidative and drought tolerance in Arabidopsis by overaccumulation of antioxidant flavonoids. Plant J. 2014 Feb;77(3):367–79. pmid:24274116
  15. 15. Schulz E, Tohge T, Zuther E, Fernie AR, Hincha DK. Natural variation in flavonol and anthocyanin metabolism during cold acclimation in Arabidopsis thaliana accessions: Flavonoids and cold acclimation. Plant Cell Environ. 2015 Aug;38(8):1658–72.
  16. 16. Becker C, Klaering HP, Kroh LW, Krumbein A. Cool-cultivated red leaf lettuce accumulates cyanidin-3-O-(6″-O-malonyl)-glucoside and caffeoylmalic acid. Food Chem. 2014 Mar;146:404–11.
  17. 17. Schulz E, Tohge T, Zuther E, Fernie AR, Hincha DK. Flavonoids are determinants of freezing tolerance and cold acclimation in Arabidopsis thaliana. Sci Rep. 2016 Sep 23;6(1):34027. pmid:27658445
  18. 18. SU W, Zhang G, LI X, GU F, Shi B. Effect of light intensity and light quality on growth and total flavonoid accumulation of Erigeron breviscapus. Chin Tradit Herb Drugs. 1994;
  19. 19. Idris A, Linatoc AC, Bakar MFA, Ibrahim ZT, Audu Y. Effect of light quality and quantity on the accumulation of flavonoid in plant species. J Sci Technol. 2018;10(3).
  20. 20. Feild TS, Lee DW, Holbrook NM. Why Leaves Turn Red in Autumn. The Role of Anthocyanins in Senescing Leaves of Red-Osier Dogwood. Plant Physiol. 2001 Oct 1;127(2):566–74. pmid:11598230
  21. 21. falcone ferreyra M, Serra P, Casati P. Recent advances on the roles of flavonoids as plant protective molecules after UV and high light exposure. Physiol Plant. 2021 Aug 1;173.
  22. 22. Hanasaki Y, Ogawa S, Fukui S. The correlation between active oxygens scavenging and antioxidative effects of flavonoids. Free Radic Biol Med. 1994 Jun 1;16(6):845–50. pmid:8070690
  23. 23. Agati G, Azzarello E, Pollastri S, Tattini M. Flavonoids as antioxidants in plants: Location and functional significance. Plant Sci. 2012 Nov 1;196:67–76. pmid:23017900
  24. 24. Michalak A. Phenolic Compounds and Their Antioxidant Activity in Plants Growing under Heavy Metal Stress. Pol J Environ Stud. 2006 Jan 1;15:523–30.
  25. 25. Samanta A, Das G, Das S. Roles of flavonoids in Plants. Int J Pharm Sci Technol. 2011 Jan 1;6:12–35.
  26. 26. Subramanian S, Stacey G, Yu O. Distinct, crucial roles of flavonoids during legume nodulation. Trends Plant Sci. 2007 Jul;12(7):282–5. pmid:17591456
  27. 27. Liu CW, Murray JD. The Role of Flavonoids in Nodulation Host-Range Specificity: An Update. Plants Basel Switz. 2016 Aug 11;5(3):33. pmid:27529286
  28. 28. Brown DE, Rashotte AM, Murphy AS, Normanly J, Tague BW, Peer WA, et al. Flavonoids act as negative regulators of auxin transport in vivo in arabidopsis. Plant Physiol. 2001 Jun;126(2):524–35. pmid:11402184
  29. 29. Besseau S, Hoffmann L, Geoffroy P, Catherine L, Pollet B, Legrand M. Flavonoid Accumulation in Arabidopsis Repressed in Lignin Synthesis Affects Auxin Transport and Plant Growth. Plant Cell. 2007 Feb 1;19:148–62. pmid:17237352
  30. 30. Winkel BSJ. Metabolite Channeling and Multi-enzyme Complexes. In: Osbourn AE, Lanzotti V, editors. Plant-derived Natural Products: Synthesis, Function, and Application [Internet]. New York, NY: Springer US; 2009 [cited 2024 May 15]. p. 195–208.
  31. 31. Dastmalchi M, Bernards MA, Dhaubhadel S. Twin anchors of the soybean isoflavonoid metabolon: evidence for tethering of the complex to the endoplasmic reticulum by IFS and C4H. Plant J. 2016;85(6):689–706. pmid:26856401
  32. 32. Crosby KC, Pietraszewska-Bogiel A, Gadella TWJ, Winkel BSJ. Förster resonance energy transfer demonstrates a flavonoid metabolon in living plant cells that displays competitive interactions between enzymes. FEBS Lett. 2011 Jul 21;585(14):2193–8.
  33. 33. Tian S, Yang Y, Wu T, Luo C, Li X, Zhao X, et al. Functional Characterization of a Flavone Synthase That Participates in a Kumquat Flavone Metabolon. Front Plant Sci. 2022 Mar 2;13:826780. pmid:35310637
  34. 34. Fujino N, Tenma N, Waki T, Ito K, Komatsuzaki Y, Sugiyama K, et al. Physical interactions among flavonoid enzymes in snapdragon and torenia reveal the diversity in the flavonoid metabolon organization of different plant species. Plant J. 2018;94(2):372–92. pmid:29421843
  35. 35. Halbwirth H. The Creation and Physiological Relevance of Divergent Hydroxylation Patterns in the Flavonoid Pathway. Int J Mol Sci. 2010 Feb 4;11(2):595–621. pmid:20386656
  36. 36. Yoder JI, Belzile F, Tong Y, Goldsbrough A. Visual markers for tomato derived from the anthocyanin biosynthetic pathway. Euphytica. 1994 Jan;79(3):163–7.
  37. 37. Shao D, Li Y, Zhu Q, Zhang X, Liu F, Xue F, et al. GhGSTF12, a glutathione S-transferase gene, is essential for anthocyanin accumulation in cotton (Gossypium hirsutum L.). Plant Sci. 2021 Apr 1;305:110827. pmid:33691961
  38. 38. Eichenberger M, Schwander T, Hüppi S, Kreuzer J, Mittl PRE, Peccati F, et al. The catalytic role of glutathione transferases in heterologous anthocyanin biosynthesis. Nat Catal. 2023 Oct;6(10):927–38. pmid:37881531
  39. 39. Zhao ZC, Hu GB, Hu FC, Wang HC, Yang ZY, Lai B. The UDP glucose: flavonoid-3-O-glucosyltransferase (UFGT) gene regulates anthocyanin biosynthesis in litchi (Litchi chinesis Sonn.) during fruit coloration. Mol Biol Rep. 2012 Jun;39(6):6409–15. pmid:22447536
  40. 40. Francis FJ. Food colorants: anthocyanins. Crit Rev Food Sci Nutr. 1989;28(4):273–314. pmid:2690857
  41. 41. Tanaka Y, Sasaki N, Ohmiya A. Biosynthesis of plant pigments: Anthocyanins, betalains and carotenoids. Plant J Cell Mol Biol. 2008 Jun 1;54:733–49. pmid:18476875
  42. 42. Davies KM, Schwinn KE, Deroles SC, Manson DG, Lewis DH, Bloor SJ, et al. Enhancing anthocyanin production by altering competition for substrate between flavonol synthase and dihydroflavonol 4-reductase. Euphytica. 2003 Jun 1;131(3):259–68.
  43. 43. Holton TA, Brugliera F, Tanaka Y. Cloning and expression of flavonol synthase from Petunia hybrida. Plant J. 1993;4(6):1003–10. pmid:7904213
  44. 44. Davies K, Winefield C, Lewis D, Nielsen K, Bradley M, Schwinn K, et al. Research into control of flower colour and flowering time in Eustoma grandiflorum (Lisanthus). Flower Newsl. 1997 Jan 1;23:24–32.
  45. 45. Luo P, Ning G, Wang Z, Shen Y, Jin H, Li P, et al. Disequilibrium of Flavonol Synthase and Dihydroflavonol-4-Reductase Expression Associated Tightly to White vs. Red Color Flower Formation in Plants. Front Plant Sci [Internet]. 2016 [cited 2023 Jul 7];6. Available from: https://www.frontiersin.org/articles/10.3389/fpls.2015.01257
  46. 46. Nielsen K, Deroles SC, Markham KR, Bradley MJ, Podivinsky E, Manson D. Antisense flavonol synthase alters copigmentation and flower color in lisianthus. Mol Breed. 2002 Dec 1;9(4):217–29.
  47. 47. Tanaka Y, Brugliera F. Flower colour and cytochromes P450. Philos Trans R Soc B Biol Sci. 2013 Feb 19;368(1612):20120432. pmid:23297355
  48. 48. Halbwirth H, Miosic S, Milosevic M, Nitarska D, Thill J, Stich K, et al. Re-investigating substrate specificity of dihydroflavonol 4-reductase with respect to the B-ring hydroxylation pattern of substrates. Acta Hortic. 2019 Jul;(1242):889–98.
  49. 49. Stracke R, Ishihara H, Huep G, Barsch A, Mehrtens F, Niehaus K, et al. Differential regulation of closely related R2R3-MYB transcription factors controls flavonol accumulation in different parts of the Arabidopsis thaliana seedling: TFs of A. thaliana R2R3-MYB subgroup 7. Plant J. 2007 Apr 25;50(4):660–77.
  50. 50. Schilbert HM, Glover BJ. Analysis of flavonol regulator evolution in the Brassicaceae reveals MYB12, MYB111 and MYB21 duplications and MYB11 and MYB24 gene loss. BMC Genomics. 2022 Aug 19;23(1):604. pmid:35986242
  51. 51. Battat M, Eitan A, Rogachev I, Hanhineva K, Fernie A, Tohge T, et al. A MYB Triad Controls Primary and Phenylpropanoid Metabolites for Pollen Coat Patterning. Plant Physiol. 2019 May;180(1):87–108. pmid:30755473
  52. 52. Zhang X, He Y, Li L, Liu H, Hong G. Involvement of the R2R3-MYB transcription factor MYB21 and its homologs in regulating flavonol accumulation in Arabidopsis stamen. Hancock R, editor. J Exp Bot. 2021 May 28;72(12):4319–32.
  53. 53. Xu W, Dubos C, Lepiniec L. Transcriptional control of flavonoid biosynthesis by MYB—bHLH—WDR complexes. Trends Plant Sci. 2015 Mar;20(3):176–85. pmid:25577424
  54. 54. LaFountain AM, Yuan Y. Repressors of anthocyanin biosynthesis. New Phytol. 2021 Aug;231(3):933–49. pmid:33864686
  55. 55. Johnson ET, Ryu S, Yi H, Shin B, Cheong H, Choi G. Alteration of a single amino acid changes the substrate specificity of dihydroflavonol 4-reductase: Substrate specificity of DFR. Plant J. 2001 Dec 23;25(3):325–33.
  56. 56. Forkmann G, Ruhnau B. Distinct Substrate Specificity of Dihydroflavonol 4-Reductase from Flowers of Petunia hybrida. Z Für Naturforschung C. 1987 Oct 1;42(9–10):1146–8.
  57. 57. Li Y, Liu X, Cai X, Shan X, Gao R, Yang S, et al. Dihydroflavonol 4-Reductase Genes from Freesia hybrida Play Important and Partially Overlapping Roles in the Biosynthesis of Flavonoids. Front Plant Sci [Internet]. 2017 Mar 28 [cited 2023 Jul 7];8. Available from: http://journal.frontiersin.org/article/10.3389/fpls.2017.00428/fullpmid:28400785
  58. 58. Gerats AGM, De Vlaming P, Doodeman M, Al B, Schram AW. Genetic control of the conversion of dihydroflavonols into flavonols and anthocyanins in flowers of Petunia hybrida. Planta. 1982 Aug;155(4):364–8. pmid:24271874
  59. 59. Beld M, Martin C, Huits H, Stuitje AR, Gerats AGM. Flavonoid synthesis in Petunia hybrida: partial characterization of dihydroflavonol-4-reductase genes. Plant Mol Biol. 1989 Nov;13(5):491–502. pmid:2491667
  60. 60. Helariutta Y, Elomaa P, Kotilainen M, Griesbach RJ, Schröder J, Teeri TH. Chalcone synthase-like genes active during corolla development are differentially expressed and encode enzymes with different catalytic properties in Gerbera hybrida (Asteraceae). Plant Mol Biol. 1995 Apr;28(1):47–60. pmid:7787187
  61. 61. Xie DY, Jackson LA, Cooper JD, Ferreira D, Paiva NL. Molecular and Biochemical Analysis of Two cDNA Clones Encoding Dihydroflavonol-4-Reductase from Medicago truncatula. Plant Physiol. 2004 Mar 1;134(3):979–94.
  62. 62. Shimada N, Sasaki R, Sato S, Kaneko T, Tabata S, Aoki T, et al. A comprehensive analysis of six dihydroflavonol 4-reductases encoded by a gene cluster of the Lotus japonicus genome. J Exp Bot. 2005 Sep 1;56(419):2573–85. pmid:16087700
  63. 63. Petit P, Granier T, d’Estaintot BL, Manigand C, Bathany K, Schmitter JM, et al. Crystal Structure of Grape Dihydroflavonol 4-Reductase, a Key Enzyme in Flavonoid Biosynthesis. J Mol Biol. 2007 May 18;368(5):1345–57. pmid:17395203
  64. 64. Chua CS, Biermann D, Goo KS, Sim TS. Elucidation of active site residues of Arabidopsis thaliana flavonol synthase provides a molecular platform for engineering flavonols. Phytochemistry. 2008 Jan;69(1):66–75. pmid:17719613
  65. 65. Wilmouth RC, Turnbull JJ, Welford RWD, Clifton IJ, Prescott AG, Schofield CJ. Structure and Mechanism of Anthocyanidin Synthase from Arabidopsis thaliana. Structure. 2002 Jan;10(1):93–103. pmid:11796114
  66. 66. Østergaard L, Lauvergeat V, Næsted H, Mattsson O, Mundy J. Two differentially regulated Arabidopsis genes define a new branch of the DFR superfamily. Plant Sci. 2001 Feb;160(3):463–72. pmid:11166433
  67. 67. Katsu K, Suzuki R, Tsuchiya W, Inagaki N, Yamazaki T, Hisano T, et al. A new buckwheat dihydroflavonol 4-reductase (DFR), with a unique substrate binding structure, has altered substrate specificity. BMC Plant Biol. 2017 Dec;17(1):239. pmid:29228897
  68. 68. Miosic S, Thill J, Milosevic M, Gosch C, Pober S, Molitor C, et al. Dihydroflavonol 4-Reductase Genes Encode Enzymes with Contrasting Substrate Specificity and Show Divergent Gene Expression Profiles in Fragaria Species. PLOS ONE. 2014 Nov 13;9(11):e112707. pmid:25393679
  69. 69. Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012 Jan;40(D1):D1178–86. pmid:22110026
  70. 70. Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res. 2016 Jan 4;44(Database issue):D67–72.
  71. 71. Pucker B. script_collection [Internet]. 2023 [cited 2023 Jul 21]. https://github.com/bpucker/script_collection
  72. 72. Pucker B, Reiher F, Schilbert HM. Automatic Identification of Players in the Flavonoid Biosynthesis with Application on the Biomedicinal Plant Croton tiglium. Plants. 2020 Sep;9(9):1103. pmid:32867203
  73. 73. Rempel A, Choudhary N, Pucker B. KIPEs3: Automatic annotation of biosynthesis pathways. PLOS ONE. 2023 Nov 16;18(11):e0294342. pmid:37972102
  74. 74. Katoh K, Standley DM. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol Biol Evol. 2013 Apr;30(4):772–80. pmid:23329690
  75. 75. Henikoff S, Henikoff JG. Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A. 1992 Nov 15;89(22):10915–9. pmid:1438297
  76. 76. Brown JW, Walker JF, Smith SA. Phyx: phylogenetic tools for unix. Kelso J, editor. Bioinformatics. 2017 Jun 15;33(12):1886–8. pmid:28174903
  77. 77. Kalyaanamoorthy S, Minh BQ, Wong TKF, Von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017 Jun;14(6):587–9. pmid:28481363
  78. 78. Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, Von Haeseler A, et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Teeling E, editor. Mol Biol Evol. 2020 May 1;37(5):1530–4. pmid:32011700
  79. 79. Price MN, Dehal PS, Arkin AP. FastTree 2 –Approximately Maximum-Likelihood Trees for Large Alignments. Poon AFY, editor. PLoS ONE. 2010 Mar 10;5(3):e9490. pmid:20224823
  80. 80. Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Battistuzzi FU, editor. Mol Biol Evol. 2021 Jun 25;38(7):3022–7. pmid:33892491
  81. 81. Letunic I, Bork P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021 Jul 2;49(W1):W293–6. pmid:33885785
  82. 82. Leinonen R, Sugawara H, Shumway M. The Sequence Read Archive. Nucleic Acids Res. 2011 Jan;39(Database issue):D19–21. pmid:21062823
  83. 83. Bray NL, Pimentel H, Melsted P, Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016 May;34(5):525–7. pmid:27043002
  84. 84. Pucker B, Iorizzo M. Apiaceae FNS I originated from F3H through tandem gene duplication. Remington DL, editor. PLOS ONE. 2023 Jan 19;18(1):e0280155. pmid:36656808
  85. 85. Pucker B. CoExp [Internet]. 2022 [cited 2023 Nov 8]. https://github.com/bpucker/CoExp
  86. 86. Stracke R, De Vos RCH, Bartelniewoehner L, Ishihara H, Sagasser M, Martens S, et al. Metabolomic and genetic analyses of flavonol synthesis in Arabidopsis thaliana support the in vivo involvement of leucoanthocyanidin dioxygenase. Planta. 2009 Jan 1;229(2):427–45. pmid:18998159
  87. 87. Schilbert HM, Schöne M, Baier T, Busche M, Viehöver P, Weisshaar B, et al. Characterization of the Brassica napus Flavonol Synthase Gene Family Reveals Bifunctional Flavonol Synthases. Front Plant Sci [Internet]. 2021 [cited 2023 Oct 20];12. Available from: https://www.frontiersin.org/articles/10.3389/fpls.2021.733762
  88. 88. The Angiosperm Phylogeny Group, Chase MW, Christenhusz MJM, Fay MF, Byng JW, Judd WS, et al. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV. Bot J Linn Soc. 2016 May 1;181(1):1–20.
  89. 89. Gebhardt Y, Witte S, Forkmann G, Lukačin R, Matern U, Martens S. Molecular evolution of flavonoid dioxygenases in the family Apiaceae. Phytochemistry. 2005 Jun 1;66(11):1273–84. pmid:15913674
  90. 90. Johnson ET, Yi H, Shin B, Oh BJ, Cheong H, Choi G. Cymbidium hybrida dihydroflavonol 4-reductase does not efficiently reduce dihydrokaempferol to produce orange pelargonidin-type anthocyanins. Plant J. 1999;19(1):81–5. pmid:10417729
  91. 91. Park S, Kim DH, Lee JY, Ha SH, Lim SH. Comparative Analysis of Two Flavonol Synthases from Different-Colored Onions Provides Insight into Flavonoid Biosynthesis. J Agric Food Chem. 2017 Jul 5;65(26):5287–98. pmid:28537403
  92. 92. Park S, Kim DH, Park BR, Lee JY, Lim SH. Molecular and Functional Characterization of Oryza sativa Flavonol Synthase (OsFLS), a Bifunctional Dioxygenase. J Agric Food Chem. 2019 Jul 3;67(26):7399–409. pmid:31244203
  93. 93. Falcone Ferreyra ML, Rius S, Emiliani J, Pourcel L, Feller A, Morohashi K, et al. Cloning and characterization of a UV-B-inducible maize flavonol synthase. Plant J. 2010;62(1):77–91. pmid:20059741
  94. 94. Žilić S, Serpen A, Akıllıoğlu G, Gökmen V, Vančetović J. Phenolic Compounds, Carotenoids, Anthocyanins, and Antioxidant Capacity of Colored Maize (Zea mays L.) Kernels. J Agric Food Chem. 2012 Feb 8;60(5):1224–31. pmid:22248075
  95. 95. Aoki H, Kuze N, Kato Y, Ei S, Inc F. Anthocyanin isolated from purple corn (Zea mays L). Foods Food Ingred J Jpn. 2001 Jan 1;199.
  96. 96. Jende-Strid B. Gene-enzyme relations in the pathway of flavonoid biosynthesis in barley. Theor Appl Genet. 1991 May 1;81(5):668–74. pmid:24221384
  97. 97. Nyegaard Kristiansen K, Rohde W. Structure of the Hordeum vulgare gene encoding dihydroflavonol-4-reductase and molecular analysis of ant18 mutants blocked in flavonoid synthesis. Mol Gen Genet MGG. 1991 Nov 1;230(1):49–59.
  98. 98. Liu H, Lou Q, Ma J, Su B, Gao Z, Liu Y. Cloning and Functional Characterization of Dihydroflavonol 4-Reductase Gene Involved in Anthocyanidin Biosynthesis of Grape Hyacinth. Int J Mol Sci. 2019 Jan;20(19):4743. pmid:31554290
  99. 99. Leonard E, Yan Y, Chemler J, Matern U, Martens S, Koffas MAG. Characterization of dihydroflavonol 4-reductases for recombinant plant pigment biosynthesis applications. Biocatal Biotransformation. 2008 Jan 1;26(3):243–51.
  100. 100. Almeida JRM, D’Amico E, Preuss A, Carbone F, de Vos CHR, Deiml B, et al. Characterization of major enzymes and genes involved in flavonoid and proanthocyanidin biosynthesis during fruit development in strawberry (Fragaria ×ananassa). Arch Biochem Biophys. 2007 Sep 1;465(1):61–71.
  101. 101. Li P, Lei K, Liu L, Zhang G, Ge H, Zheng C, et al. Identification and functional characterization of a new flavonoid synthase gene MdFLS1 from apple. Planta. 2021 Apr 15;253(5):105. pmid:33860366
  102. 102. Suzuki K ichi, Tsuda Sh, Fukui Y, Fukuchi-Mizutani M, Yonekura-Sakakibara K, Tanaka Y, et al. Molecular Characterization of Rose Flavonoid Biosynthesis Genes and Their Application in Petunia. Biotechnol Biotechnol Equip. 2000 Jan 1;14(2):56–62.
  103. 103. Fischer TC, Halbwirth H, Meisel B, Stich K, Forkmann G. Molecular cloning, substrate specificity of the functionally expressed dihydroflavonol 4-reductases from Malus domestica and Pyrus communis cultivars and the consequences for flavonoid metabolism. Arch Biochem Biophys. 2003 Apr 15;412(2):223–30. pmid:12667486
  104. 104. Tanaka Y, Fukui Y, Fukuchi-Mizutani M, Holton TA, Higgins E, Kusumi T. Molecular Cloning and Characterization of Rosa hybrida Dihydroflavonol 4-reductase Gene. Plant Cell Physiol. 1995 Sep 1;36(6):1023–31. pmid:8528604
  105. 105. Waki T, Mameda R, Nakano T, Yamada S, Terashita M, Ito K, et al. A conserved strategy of chalcone isomerase-like protein to rectify promiscuous chalcone synthase specificity. Nat Commun. 2020 Feb 13;11(1):870. pmid:32054839
  106. 106. Owens DK, Alerding AB, Crosby KC, Bandara AB, Westwood JH, Winkel BSJ. Functional Analysis of a Predicted Flavonol Synthase Gene Family in Arabidopsis. Plant Physiol. 2008 Jul 8;147(3):1046–61. pmid:18467451
  107. 107. Spribille R, Forkmann G. Conversion of Dihydroflavonols to Flavonols with Enzyme Extracts from Flower Buds of Matthiola incana R. Br. Z Für Naturforschung C. 1984 Aug 1;39(7–8):714–9.
  108. 108. Turnbull JJ, Nakajima Jichiro, Welford RWD, Yamazaki M, Saito K, Schofield CJ. Mechanistic Studies on Three 2-Oxoglutarate-dependent Oxygenases of Flavonoid Biosynthesis: ANTHOCYANIDIN SYNTHASE, FLAVONOL SYNTHASE, AND FLAVANONE 3β-HYDROXYLASE*. J Biol Chem. 2004 Jan 9;279(2):1206–16.
  109. 109. Bloor SJ, Abrahams S. The structure of the major anthocyanin in Arabidopsis thaliana. Phytochemistry. 2002 Feb 1;59(3):343–6. pmid:11830144
  110. 110. Martens S, Teeri T, Forkmann G. Heterologous expression of dihydroflavonol 4-reductases from various plants. FEBS Lett. 2002 Nov 20;531(3):453–8. pmid:12435592
  111. 111. Saito N, Tatsuzawa F, Nishiyama A, Yokoi M, Shigihara A, Honda T. Acylated cyanidin 3-sambubioside-5-glucosides in Matthiola incana. Phytochemistry. 1995 Mar;38(4):1027–32. pmid:7766384
  112. 112. Wellmann F, Lukačin R, Moriguchi T, Britsch L, Schiltz E, Matern U. Functional expression and mutational analysis of flavonol synthase from Citrus unshiu: Flavonol synthase. Eur J Biochem. 2002 Aug;269(16):4134–42.
  113. 113. Aiguo Z, Ruiwen D, Cheng W, Cheng C, Dongmei W. Insights into the catalytic and regulatory mechanisms of dihydroflavonol 4-reductase, a key enzyme of anthocyanin synthesis in Zanthoxylum bungeanum. Tree Physiol. 2023 Jan 1;43(1):169–84. pmid:36054375
  114. 114. Li C, Bai Y, Li S, Chen H, Han X, Zhao H, et al. Cloning, Characterization, and Activity Analysis of a Flavonol Synthase Gene FtFLS1 and Its Association with Flavonoid Content in Tartary Buckwheat. J Agric Food Chem. 2012 May 23;60(20):5161–8.
  115. 115. Stich K, Eidenberger T, Wurst F, Forkmann G. Flavonol Synthase Activity and the Regulation of Flavonol and Anthocyanin Biosynthesis during Flower Development in Dianthus caryophyllus L. (Carnation). Z Für Naturforschung C. 1992 Aug 1;47(7–8):553–60.
  116. 116. Jiang J. Cloning of flavonol synthase gene from Fagopyum dibotrys and its expression in Escherichia coli. Chin Tradit Herb Drugs. 2013;1974–8.
  117. 117. McCarthy EW, Landis JB, McCoy AG, Lawhorn AJ, Kurti A, Xu Y, et al. Homeologue differential expression in the flavonoid biosynthetic pathway underlies flower colour variation in natural and synthetic polyploids of Nicotiana tabacum (Solanaceae). Bot J Linn Soc. 2023 Oct 24;boad052.
  118. 118. Tanaka Y, Yonekura K, Fukuchi-Mizutani M, Fukui Y, Fujiwara H, Ashikari T, et al. Molecular and Biochemical Characterization of Three Anthocyanin Synthetic Enzymes from Gentiana triflora. Plant Cell Physiol. 1996 Jul 1;37(5):711–6. pmid:8819318
  119. 119. Gottlieb OR. Micromolecular Evolution, Systematics and Ecology: An Essay into a Novel Botanical Discipline. Springer Science & Business Media; 2012. 183 p.
  120. 120. Dyer AG, Jentsch A, Burd M, Garcia JE, Giejsztowt J, Camargo MGG, et al. Fragmentary Blue: Resolving the Rarity Paradox in Flower Colors. Front Plant Sci [Internet]. 2021 Jan 15 [cited 2024 May 18];11. Available from: https://www.frontiersin.org/journals/plant-science/articles/10.3389/fpls.2020.618203/full
  121. 121. Harborne JB, Williams CA. Advances in flavonoid research since 1992. Phytochemistry. 2000 Nov 1;55(6):481–504. pmid:11130659
  122. 122. Kuhlman B, Bradley P. Advances in protein structure prediction and design. Nat Rev Mol Cell Biol. 2019 Nov;20(11):681–97. pmid:31417196
  123. 123. Moummou H, Kallberg Y, Tonfack LB, Persson B, van der Rest B. The Plant Short-Chain Dehydrogenase (SDR) superfamily: genome-wide inventory and diversification patterns. BMC Plant Biol. 2012 Nov 20;12(1):219.
  124. 124. Chen X, Liu W, Huang X, Fu H, Wang Q, Wang Y, et al. Arg-type dihydroflavonol 4-reductase genes from the fern Dryopteris erythrosora play important roles in the biosynthesis of anthocyanins. PLOS ONE. 2020 May 1;15(5):e0232090. pmid:32357153
  125. 125. Hua C, Linling L, Shuiyuan C, Fuliang C, Feng X, Honghui Y, et al. Molecular Cloning and Characterization of Three Genes Encoding Dihydroflavonol-4-Reductase from Ginkgo biloba in Anthocyanin Biosynthetic Pathway. PLOS ONE. 2013 Aug 26;8(8):e72017. pmid:23991027
  126. 126. FUJITA A, GOTO-YAMAMOTO N, ARAMAKI I, HASHIZUME K. Organ-Specific Transcription of Putative Flavonol Synthase Genes of Grapevine and Effects of Plant Hormones and Shading on Flavonol Biosynthesis in Grape Berry Skins. Biosci Biotechnol Biochem. 2006 Jan 1;70(3):632–8. pmid:16556978
  127. 127. Weng JK, Ye M, Li B, Noel JP. Co-evolution of Hormone Metabolism and Signaling Networks Expands Plant Adaptive Plasticity. Cell. 2016 Aug;166(4):881–93. pmid:27518563
  128. 128. Des Marais DL, Rausher MD. Escape from adaptive conflict after duplication in an anthocyanin pathway gene. Nature. 2008 Aug;454(7205):762–5. pmid:18594508
  129. 129. Polashock JJ, Griesbach RJ, Sullivan RF, Vorsa N. Cloning of a cDNA encoding the cranberry dihydroflavonol-4-reductase (DFR) and expression in transgenic tobacco. Plant Sci. 2002 Aug 1;163(2):241–51.
  130. 130. Kawai Y, Ono E, Mizutani M. Evolution and diversity of the 2–oxoglutarate-dependent dioxygenase superfamily in plants. Plant J. 2014;78(2):328–43. pmid:24547750
  131. 131. Wang Z, Wang S, Wu M, Li Z, Liu P, Li F, et al. Evolutionary and functional analyses of the 2-oxoglutarate-dependent dioxygenase genes involved in the flavonoid biosynthesis pathway in tobacco. Planta. 2019 Feb 1;249(2):543–61. pmid:30293202
  132. 132. Yao Y, Sun L, Wu W, Wang S, Xiao X, Hu M, et al. Genome-Wide Investigation of Major Enzyme-Encoding Genes in the Flavonoid Metabolic Pathway in Tartary Buckwheat (Fagopyrum tataricum). J Mol Evol. 2021 Jun 1;89(4):269–86. pmid:33760965
  133. 133. Li X, Kim YB, Kim Y, Zhao S, Kim HH, Chung E, et al. Differential stress-response expression of two flavonol synthase genes and accumulation of flavonols in tartary buckwheat. J Plant Physiol. 2013 Dec 15;170(18):1630–6. pmid:23859559
  134. 134. Li X, Park NI, Kim YB, Kim HH, Park CH, Wu Q, et al. Accumulation of flavonoids and expression of flavonoid biosynthetic genes in tartary and rice-tartary buckwheat. Process Biochem. 2012 Dec 1;47(12):2306–10.
  135. 135. Jiang X, Shi Y, Fu Z, Li WW, Lai S, Wu Y, et al. Functional characterization of three flavonol synthase genes from Camellia sinensis: Roles in flavonol accumulation. Plant Sci. 2020 Nov;300:110632. pmid:33180711
  136. 136. Toh HC, Wang SY, Chang ST, Chu FH. Molecular cloning and characterization of flavonol synthase in Acacia confusa. Tree Genet Genomes. 2013 Feb 1;9(1):85–92.
  137. 137. Preuss A, Stracke R, Weisshaar B, Hillebrecht A, Matern U, Martens S. Arabidopsis thaliana expresses a second functional flavonol synthase. FEBS Lett. 2009 Jun 18;583(12):1981–6. pmid:19433090
  138. 138. Moriguchi T, Kita M, Ogawa K, Tomono Y, Endo T, Omura M. Flavonol synthase gene expression during citrus fruit development. Physiol Plant. 2002;114(2):251–8. pmid:11903972
  139. 139. Kou M, Li C, Song W, Shen Y, Tang W, Zhang Y, et al. Identification and functional characterization of a flavonol synthase gene from sweet potato [Ipomoea batatas (L.) Lam.]. Front Plant Sci. 2023 May 10;14:1181173. pmid:37235006
  140. 140. Xu F, Cheng H, Cai R, Li LL, Chang J, Zhu J, et al. Molecular Cloning and Function Analysis of an Anthocyanidin Synthase Gene from Ginkgo biloba, and Its Expression in Abiotic Stress Responses. Mol Cells. 2008 Dec 31;26(6):536–47. pmid:18779661
  141. 141. Reddy AM, Reddy VS, Scheffler BE, Wienand U, Reddy AR. Novel transgenic rice overexpressing anthocyanidin synthase accumulates a mixture of flavonoids leading to an increased antioxidant potential. Metab Eng. 2007 Jan 1;9(1):95–111. pmid:17157544
  142. 142. Pang Y, Peel GJ, Wright E, Wang Z, Dixon RA. Early Steps in Proanthocyanidin Biosynthesis in the Model Legume Medicago truncatula. Plant Physiol. 2007 Nov;145(3):601–15. pmid:17885080
  143. 143. Sielemann K, Hafner A, Pucker B. The reuse of public datasets in the life sciences: potential risks and rewards. PeerJ. 2020 Sep 22;8:e9954. pmid:33024631
  144. 144. Pucker B, Irisarri I, de Vries J, Xu B. Plant genome sequence assembly in the era of long reads: Progress, challenges and future directions. Quant Plant Biol. 2022 Jan;3:e5. pmid:37077982
  145. 145. Pucker B, Choudhary N. Gene expression data sets of selected plant species [Internet]. Universitätsbibliothek Braunschweig; 2023 [cited 2023 Nov 9]. https://doi.org/10.24355/dbbs.084-202306231402-0