Figures
Abstract
ATP-binding cassette (ABC) transporters actively transport various substances across membranes, while uridine diphosphate (UDP) glycosyltransferases (UGTs) are proteins that catalyse the chemical modification of various organic compounds. Both of these protein superfamilies have been associated with conferring herbicide resistance in weeds. Little is known about the evolutionary history of these protein families in the Archaeplastida. To infer the evolutionary histories of these protein superfamilies, we compared protein sequences collected from 10 species which represent distinct lineages of the Archaeplastida–the lineage including glaucophyte algae, rhodophyte algae, chlorophyte algae and the streptophytes–and generated phylogenetic trees. We show that ABC transporters were present in the last common ancestor of the Archaeplastida which lived 1.6 billion years ago, and the major clades identified in extant plants were already present then. Conversely, we only identified UGTs in members of the streptophyte lineage, which suggests a loss of these proteins in earlier diverging Archaeplastida lineages or arrival of UGTs into a common ancestor of the streptophyte lineage through horizontal gene transfer from a non-Archaeplastida eukaryote lineage. We found that within the streptophyte lineage, most diversification of the UGT protein family occurred in the vascular lineage, with 17 of the 20 clades identified in extant plants present only in vascular plants. Based on our findings, we conclude that ABC transporters and UGTs are ancient protein families which diversified during Archaeplastida evolution, which may have evolved for developmental functions as plants began to occupy new environmental niches and are now being selected to confer resistance to a diverse range of herbicides in weeds.
Citation: Caygill S, Dolan L (2023) ATP binding cassette transporters and uridine diphosphate glycosyltransferases are ancient protein families that evolved roles in herbicide resistance through exaptation. PLoS ONE 18(9): e0287356. https://doi.org/10.1371/journal.pone.0287356
Editor: Yanbin Yin, University of Nebraska-Lincoln, UNITED STATES
Received: June 4, 2023; Accepted: August 25, 2023; Published: September 21, 2023
Copyright: © 2023 Caygill, Dolan. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This research was supported by a European Research Council (ERC) Advanced Grant De Novo-P (project number 787613) to LD from the European Commission. SC was supported by a British Biological Sciences Research Council (BBSRC) Scholarship through a doctoral training partnership (BB/M011224/1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: L. D. is a co-founder of moa Technology Ltd. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Introduction
Populations of weeds are continually evolving resistance to herbicides [1]. Herbicide resistance mechanisms can be classified into 2 general groups: target site resistance (TSR) and non-target site resistance (NTSR). Target site resistance (TSR) occurs as a result of mutations in the gene coding for the herbicide’s target protein, which causes reduced inhibition of the herbicide target. Non-target site resistance (NTSR) can occur due to a variety of mechanisms, but ultimately results in a reduction in the amount of herbicide reaching its target protein, or by reducing damage caused by the herbicide to the plant [2]. Some of the NTSR mechanisms reported include reduction in the uptake or translocation of the herbicide, metabolic detoxification of the herbicide and sequestration of the herbicide to a location where it is not active, such as the vacuole [3–6]. Diverse genetic changes affecting NTSR genes can lead to their encoded proteins contributing to herbicide resistance. For example, mutations leading to gene overexpression, the expression of a hyperactive form of the protein, loss-of-function mutations or mutations leading to changes in substrate specificity, can all result in NTSR [4, 7, 8]. The genetic basis of NTSR is poorly understood, but changes in the expression of both ATP-binding cassette (ABC) transporters and uridine diphosphate (UDP) glycosyltransferases (UGTs) have been shown to confer resistance to herbicides in weeds [3–5].
ABC transporters are one of the largest protein families in plants and are conserved across all domains of life [9]. ABC transporter proteins use the energy of ATP hydrolysis to drive active transport of substances across membranes [10, 11]. In plants, ABC transporters transport nutrients such as sugars, amino acids and metal ions; secondary metabolites such as flavonoids, terpenoids and alkaloids; phytohormones such as auxins, cytokinins and gibberellins; proteins and peptides; and xenobiotics [12–14]. Plant ABC transporters are also involved in the translocation of toxic compounds, such as herbicides, into the vacuole and extracellular spaces, where they are no longer toxic [15, 16]. Numerous ABC transporters have been identified to compartmentalise herbicides in this way, as part of a detoxification process that has evolved in some weeds with NTSR to at least one herbicide.
Most ABC transporters are membrane bound and consist of four subunits–two transmembrane domains (TMDs) and two nucleotide-binding domains (NBDs) (S1 Fig) [17]. Generally, the substrate binds to the TMDs of the transporter on one side of the membrane. Binding of ATP to the NBDs then causes a conformational change in the transporter, closing the TMDs around the substrate. Subsequently, the hydrolysis of ATP causes another conformational change in the transporter that opens up the TMDs on the opposite side of the membrane, releasing the substrate [10, 18]. It is with this mechanism that ABC transporters are able to transport herbicides as part of a herbicide-resistance-conferring detoxification process.
Agricultural fields have been treated with herbicides since the 1940s, exposing plants to chemical diversity that they would not have previously encountered. Some ABC transporters catalyse the transport of a wide variety of synthetic compounds across membranes that they would not have encountered in their evolutionary history. ABC transporters are one of the most diverse protein families in plants, exhibiting significant variation in sequence, structure, and function. In plants, ABC transporters have been classified into 8 subfamilies (ABCA, ABCB, ABCC, ABCD, ABCE, ABCF, ABCG and ABCI) [10, 11, 19, 20]. Previous phylogenetic analyses in plants have shown all ABC transporter subfamilies, excluding ABCI, constitute monophyletic clades [10, 11, 19]. However, previous phylogenetic analyses of these protein families focused primarily on angiosperms. To gain a more comprehensive understanding of the evolution of this protein family, here we carry out a phylogenetic analysis of ABC transporters from across the Archaeplastida.
Uridine diphosphate (UDP) glycosyltransferases (UGTs) are part of the largest family (Family 1) of the glycosyltransferase superfamily, and this subset of UGTs have been widely associated with herbicide resistance [21, 22]. UGTs are present in bacteria, fungi, plants, and animals and catalyse the transfer of glycosyl residues from activated nucleotide sugars, such as uridine diphosphate glucose, to acceptor molecules [23]. Acceptor molecules utilised by UGTs include flavonoids, phenolic acids, plant hormones such as IAA, xenobiotics, and other small molecules such as alkaloids, terpenoids and cytokinins [24]. While UGTs often transfer glycosyl residues to various endogenous compounds, a variety of exogenous xenobiotics, including herbicides, can also act as the acceptor molecules of UGTs. The addition of a glycosyl residue to a herbicide has the general effect of making it more hydrophilic and less toxic. Therefore, mutations in genes coding for UGTs that enable them to glycosylate herbicides, or mutations that cause overexpression of UGTs that can already glycosylate herbicides, can contribute to NTSR in weeds.
UGTs are classified by the presence of a C-terminal consensus sequence, the UDP-glycosyltransferase signature, known as the ‘PSPG’ motif (Plant Secondary Product Glycosyltransferase). The PSPG motif is made up of 44-amino acids and is thought to be the site of binding of the UDP moiety of the sugar molecule to the enzymes [25–27]. On the other hand, the N-terminal regions of UGTs is the domain which is involved in the recognition and binding of a diverse array of acceptor molecules. Therefore, the N-terminal domain is generally more variable than the C-terminus [28]. UGTs catalyse O-, S- and N-linkages and can also form different linkages with the same acceptor molecule [29]. For example, A. thaliana UGT72B1 has been reported to form an N-glucosidic bond with 3, 4-dichloroaniline and an O-glucosidic bond with 3,4-dihydroxybenzoic acid [30]. Previous phylogenetic analyses of plant UGTs has found this protein family to diverge into at least 20 distinct monophyletic groups [25, 26]. However, the taxon sampling of these studies has been limited to angiosperms, and so conclusions cannot be made regarding the evolutionary history of this protein family in the Archaeplastida lineage.
Here we report the phylogenetic relationships, of both the ATP-binding cassette transporter and uridine diphosphate glycosyltransferase protein superfamilies within the Archaeplastida lineage. We discovered that the current clade diversity of the ABC transporter superfamily existed in the last common ancestor of the Archaeplastida. This indicates that the ABC transporters diversified into the extant sub-families before the evolution of the last common ancestor of the Archaeplastida, extant 1.6 billion years ago [31–33]. By contrast UGT proteins are only present in the streptophyte lineage among the Archaeplastida and diversified in the vascular plant lineages. On the other hand, we show that UGTs are present in the streptophyte lineage of the Archaeplastida but absent from non-streptophyte lineages. This suggests either that this protein family, which exists in other eukaryotic lineages, was lost from the glaucophyte, rhodophyte and chlorophyte lineages, or that these proteins originated through horizontal gene transfer into the last common ancestor of the streptophytes, extant approximately 1.1–1.2 billion years ago in the Proterozoic era [34–39]. Furthermore, we show that the UGT family subsequently diversified throughout streptophyte evolution, mostly in the vascular lineage. This analysis suggests that gene superfamilies which evolved in the Proterozoic era, are now undergoing natural selection caused by herbicide application which is leading to the evolution of herbicide resistant weeds in agricultural systems.
Materials and methods
Data resources
Amino acid sequences of proteins from A. thaliana were collected from TAIR10 [40] (https://www.arabidopsis.org/). Sequences from O. sativa were retrieved from the rice genome annotation project [41]. Sequences from the liverwort M. polymorpha were collected from MarpolBase (http://marchantia.info/). Sequences from the hornwort Anthoceros agrestis were collected from [42] (https://www.hornworts.uzh.ch/en/download.html). Sequences from the streptophyte alga Klebsormidium nitens were obtained from the K. nitens genpme webpage [43]. (http://www.plantmorphogenesis.bio.titech.ac.jp/~algae_genome_project/klebsormidium/). Sequences from the lycophyte Selaginella moellendorffii, the moss Physcomitrium patens and the chlorophyte alga Chlamydomona reinhardtii were retrieved from Phytozome 12 [44] (https://phytozome.jgi.doe.gov/pz/portal.html). Protein sequences from the rhodophyte alga Cyanidioschyzon merolae were collected from the C. merolae genome webpage [45] (https://www.genome.jp/kegg-bin/show_organism?org=cme). Sequences from the glaucophyte alga Cyanophora paradoxa were collected from PhycoCosm [46] (https://phycocosm.jgi.doe.gov/Cyapar1/Cyapar1.home.html).
Sequence collection
Family 1 UGT and ABC transporter protein sequences from A. thaliana and O. sativa [40, 41], were used to perform BLASTP searches against the predicted proteomes of S. moellendorffii, M. polymorpha, A. agrestis, P. patens, K. nitens, C. reinhardtii and C. merolae. A minimum E-value threshold of 1e-10 was employed. The initial list of sequences for each species collected in this way was then used to ‘self-blast’ the predicted proteomes of that species in order to retrieve sequences belonging to species-specific Family 1 UGT and ABC transporter sub-families. Using InterProScan 92.0 [47], each Family 1 UGT sequence was checked for the UDP-glycosyltransferase domain (SSF53756), while each ABC transporter was checked for the presence of the ABC transporter pfam domain (PF12848). All sequences that did not contain the searched-for domains were removed before further analysis.
Sequence alignment
Collected amino acid sequences were aligned using the programme MAFFT [48] and its FFT-NS-2 algorithm. Alignments were then visualised using Unipro ugene v. 45.1 [49]. Sequences which could then be seen to be lacking important functional residues were then removed. Sequences were trimmed manually, with large non-conserved regions being removed, leaving only mostly conserved regions without gaps.
Phylogenetic analysis
The final trimmed amino acid sequence alignments were subsequently used in a maximum-likelihood phylogenetic analysis using the software PHyML 3.0 [50]. The parameters used were an estimated gamma distribution, the LG+G+F model of amino acid substitution, and a Chi2-based approximate likelihood ratio test (aLRT). The resulting phylogenies were visualised in FigTree v1.4.4 [51] and annotated in Inkscape v1.0.2 [52].
Results
960 ABC transporter and 552 UGT sequences were identified in the genomes of 10 species of Archaeplastida
We first set out to investigate the evolution of ATP-binding cassette (ABC) transporter and uridine diphosphate (UDP) glycosltransferase (UGT) gene families in the Archaeplastida lineage. The Archaeplastida is the lineage which includes glaucophyte algae, rhodophyte algae, chlorophyte algae and the streptophytes. ABC-transporter protein-encoding sequences were identified from 10 selected species (Table 1), representing distinct Archaeplastida lineages. UGT sequences were identified in the genomes of 6 of these 10 Archaeplastda species. There were 60 ABCs and 3 UGTs in the genome of the streptophyte alga Klebsormidium nitens; 100 ABCs and 50 UGTs in the liverwort Marchantia polymorpha; 85 ABCs and 15 UGTs in the moss Physcomitrium patens; 166 ABCs and 163 UGTs in the lycophyte Selaginella moellendorffii; 124 ABCs and 200 UGTs in the monocot Oryza sativa; and 120 ABCs and 121 UGTs in the eudicot Arabidopsis thaliana. UGT sequences were not recovered from the genomes of the glaucophyte algae Cyanophora paradoxa, the rhodophyte alga Cyanidioschyzon merolae, the chlorophyte alga Chlamydomonas reinhardtii, or the hornwort Anthoceros agrestis. While no UGT encoding genes were identified in these species, 111, 17, 47, and 130 ABC transporter coding sequences were identified in these species respectively. Taken together these data indicate that the ABC transporters were present in the genome of the last common ancestor of the Archaeplastida, while UGT encoding genes have been present in the Archaeplastida lineage since the divergence of the streptophytes from the last common ancestor shared with the chlorophytes.
Plant ABC transporters are ancient, and all 12 groups existed in the last common ancestor of the Archaeplastida
To investigate the evolutionary history of the ABC transporter protein family, we generated a multiple sequence alignment of 960 ABC transporter protein sequences, which we used to construct a Maximum Likelihood tree (Fig 1). Alignments were trimmed manually to remove large gaps and only a nucleotide-binding domain (NBD) of each sequence was retained.
Unrooted cladogram of a maximum likelihood (ML) analysis of Archaeplastida ABC transporter proteins conducted by PhyML 3.0 [50] using an estimated gamma distribution parameter, the LG_G+F model of amino acid substitution, and a Chi2-based approximate likelihood ratio (aLRT) test. Support values from the aLRT test for each clade are presented in parentheses. Protein sequences were aligned using MAFFT with the L-INS-I algorithm. ABC transporter classes are indicated by blue highlighting and letters. Coloured dots indicate the presence of sequences from different species in each clade. A. thaliana (yellow); O. sativa (dark green); S. moellendorffii (light blue); P. patens (light green); A. agrestis (dark purple); M. polymorpha (blue); K. nitens (light purple); C. reinhardtii (pink); C. merolae (maroon); C. paradoxa (orange). Confirmed NTSR genes or their A. thaliana homolog reported in Table 3 are indicated.
The topology of the phylogenetic tree shows that plant ABC transporters diverged into 6 previously identified, monophyletic clades. Each of these monophyletic clades has been classified by a single sub-family of ABC-transporters, with each subfamily designated a letter. For example, the monophyletic clade comprising of ABCA proteins is designated clade A. However, ABC transporter sub-families ABCE and ABCF are sister clades, and together form a single monophyletic clade, designated clade E/F. Our analysis revealed the previously defined monophyletic clades A, B, C, D, E/F, G (Fig 1). We also identified 6 previously unreported monophyletic clades that comprise sequences from a single species (C. paradoxa–Cp) or two species (C. paradoxa and C. merolae–CpCm)–Cp1, Cp2, Cp3, Cp4, Cp5, and CpCm1 (Fig 1). ABCI proteins are the only previously reported sub-family that do not form a monophyletic clade. We found ABCI proteins to comprise two distantly related monophyletic groups (and therefore constitute a polyphyletic group) (Fig 1). Sequences from the 4 clades B, C, E/F and G are encoded in the genomes of all 10 species of Archaeplastida used in this analysis. This suggests that these four clades were present in the last common ancestor of the Archaeplastida (Fig 2).
Cladogram of Archaeplastida showing ABC transporter clade origins and losses during plant evolution. Blue circles represent first appearances of a clade, black circles represent the loss of a clade in a particular lineage. Also shown are the relative distributions of ABC transporters into their respective clades for each species included in our analysis, where each distinct coloured block represents a separate ABC transporter protein clade.
The presence of an ABCA protein sequences in the rhodophyte C. merolae and the glaucophyte C. paradoxa suggests clade A is ancient and was present in the last common ancestor of the Archaeplastida. However, ABCA proteins are missing from the genomes of 4 species–C. reinhardtii, A. agrestis, P. patens and S. moellendorffii. The absence of ABCA proteins from these lineages likely represent independent gene loss events, following the divergence of these lineages from other land plants. There are no ABCA proteins encoded in the genome of the chlorophyte alga C. reinhardtii, but ABCA proteins are encoded by the streptophyte alga K. nitens. This suggests a loss of these proteins in the chlorophyte algae lineage sometime after its divergence from the last common ancestor of the chlorophytes and streptophytes. ABCA proteins are present in the genome of the liverwort M. polymorpha but absent from the other two bryophyte species included in the analysis–the hornwort A. agrestis and the moss P. patens. If hornworts are sister to the mosses and liverworts [42, 59], this topology suggests ABCA proteins were lost from the hornwort lineage following its divergence from the moss and liverwort lineages. ABCA proteins were then likely subsequently lost in the moss lineage following its divergence from the liverworts. ABCA proteins were found to be absent from the genome of S. moellendorffii but present in the other 2 vascular plants included in the analysis–A. thaliana and O. sativa. This indicates a loss of ABCA proteins in the lycophyte lineage following its divergence from the angiosperm lineage (Fig 2). The absence of ABCA proteins in some lineages within the Archaeplastida is likely due to ABCA proteins being secondarily lost in these lineages, and hence ABCA proteins were likely already present in the last common ancestor of the Archaeplastida.
ABCD proteins were present in 5 lineages of the Archaeplastida, the dicots, monocots, liverworts, streptophyte algae and glaucophyte algae. The presence of ABCD proteins in the genome of the glaucophyte C. paradoxa suggest that ABCD proteins had already diverged from other ABC transporters in the last common ancestor of the Archaeplastida. However, ABCD sequences were not found in C. merolae or C. reinhardtii, suggesting loss of the ABCD subfamily in the rhodophyte and chlorophyte algal lineages. ABCD proteins were missing from the genomes of A. agrestis, P. patens and S. moellendorffii, as observed for ABCA proteins. Their absence from these lineages suggests recent gene losses in the hornworts, mosses, and lycophytes respectively. Angiosperm genomes were found to encode ABCD proteins, however, the other vascular plant included in the analysis, S. moellendorffii, does not encode any ABCD proteins, suggesting that this clade was lost in this lineage following the divergence of the lycophyte lineage from the other vascular plants (Fig 2). ABCD proteins were therefore likely present in the last common ancestor of the Archaeplastida lineage, but were subsequently lost in the rhodophyte, chlorophyte, hornwort, moss and lycophyte lineages.
Many putative ABCI encoding genes were identified in the genomes of the plant species in this analysis. However, uniquely amongst ABC transporter proteins, each subunit of an ABCI protein is encoded by a separate gene. Therefore, many ABCI protein sequences identified were removed from the alignments generated as they did not align with the NBD of other ABC-transporters. In the phylogenetic analysis of the retained sequences, 2 monophyletic ABCI clades were defined. One was sister to the monophyletic ABCA protein clade, while the other is sister to a monophyletic clade that includes ABCD and ABCB proteins. Each of the ABCI clades comprise sequences from A. thaliana, O. sativa, M. polymorpha and C. paradoxa. ABCI proteins-encoding genes in the genome of C. paradoxa and the 3 previously mentioned land plant species is evidence that they were present in the last common ancestor of the Archaeplastida. However, absence of ABCI protein-encoding genes in the genomes of C. merolae, C. reinhardtii and K. nitens suggests that ABCI protein-encoding genes have been lost in these algal lineages. Furthermore, the tree topology shows that losses of ABCI proteins have occurred in the land plants. The absence of ABCI proteins-encoding genes in A. agrestis, P. patens and S. moellendorffii suggest a loss of ABCI proteins in the hornwort, moss and lycophyte lineages (Fig 2). However, since many putative ABCI proteins were removed from the analysis, we cannot be entirely certain that members of this subfamily have been entirely lost from these lineages mentioned. The evolution of the ABCI gene clade may be more difficult to resolve compared to other ABC transporter clades, due to their unique property of individual genes encoding each protein subunit.
Taken together the data show that ABC transporters are conserved among the Archaeplastida. There are sequences from the 10 sampled species from the Archaeplastida that constitute four previously identified monophyletic clades, B, C, E/F and G (Fig 1). This suggests that ABC transporters had diverged into these four subfamilies before the divergence of the Archaeplastida lineages and were therefore present in the last common ancestor of all species included in the analysis. We also identified 6 other ancient monophyletic clades of ABC transporter which constituted sequences that were present in either glaucophyte, or both glaucophyte and rhodophyte algae (Cp1, Cp2, Cp3, Cp4, Cp5 and CpCm1). ABCA, ABCD and ABCI proteins are not present in the genomes of every species of the Archaeplastida analysed here. Our phylogenetic analysis revealed that although ABCA proteins were present in the last common ancestor of the Archaeplastida, members of this clade were subsequently lost in the chlorophyte, hornwort, moss and lycophyte lineages. We show that ABCD proteins also had an ancient origin, likely already having diverged in the last common ancestor of the Archaeplastida. ABCD proteins were subsequently lost in the rhodophyte, chlorophyte, hornwort, moss and lycophyte lineages. The ABCI proteins included in our analysis comprise 2, distantly related clades. Conservation of ABCI proteins included in our analysis was limited, however, their presence in C. paradoxa suggests an ancient origin of ABCI proteins. No ABCI protein-encoding sequences were found in the rhodophyte, chlorophyte, streptophyte algae, hornwort, moss or lycophyte lineages, which is consistent with the loss of genes encoding these proteins from these lineages. This suggests that ABCI proteins are ancient but have been lost from many lineages during Archaeplastida evolution.
Plant UGTs diversified recently, 14/20 identified groups are only present in Angiosperms
There are currently 45 known families of glycosyltransferases (GTs) in plants [60]. Uridine diphosphate glycosyltransferases (UGTs) are the largest glycosyltransferase family and are the only glycosyltransferase family to have been reported to contribute to NTSR. They comprise a monophyletic clade, known as Family 1 GTs. Family 28 GTs are a distinct glycosyltransferase protein family that are related to UGTs [23, 61–63]. Family 28 GTs have not been reported to contribute to NTSR. Family 28 GTs were included as an outgroup for our phylogenetic analysis. To investigate the evolutionary history of UGTs in Archaeplastida, phylogenetic trees were constructed using maximum likelihood statistics (Fig 3). This analysis showed that UGTs are only present in the streptophyte lineage of the Archaeplastida. UGTs are present in other eukaryotes, suggesting they were either lost in the glaucophyte, rhodophyte and chlorophyte lineages, or were introduced into the last common ancestor of the streptophyte lineage by horizontal gene transfer. Evidence of horizontal gene transfer of a glycosyltransferase gene has been previously reported from a Verrucomicrobium into the green alga Picochlorum SENEW3 [64]. Our phylogenetic analysis also shows UGT sequences comprise 20 major monophyletic groups, 16 of which were defined in previous analyses [25, 26].
Unrooted cladogram of a maximum likelihood (ML) analysis of Archaeplastida UGT proteins conducted by PhyML 3.0 [50] using an estimated gamma distribution parameter, the LG_G+F model of amino acid substitution, and a Chi2-based approximate likelihood ratio (aLRT) test. Support values from the aLRT test for each clade are presented in parentheses. Protein sequences were aligned using MAFFT with the L-INS-I algorithm. UGT classes are indicated by pink highlighting and letters. Coloured dots indicate the presence of sequences from different species in each group. A. thaliana (yellow); O. sativa (dark green); S. moellendorffii (light blue); P. patens (light green); A. agrestis (dark purple); M. polymorpha (blue); K. nitens (light purple); C. reinhardtii (pink); C. merolae (maroon); C. paradoxa (orange). Confirmed NTSR genes or their A. thaliana homolog reported in Table 5 are indicated.
34 Family 28 GT sequences were identified in the genomes of the 10 Archaeplastida species, and these were included in the phylogenetic analysis. Family 28 GT sequences were found in 9 of the 10 Archaeplastida species but are missing from the rhodophyte alga C. merolae. This is consistent with Family 28 GTs being present in the last common ancestor of the Archaeplastida but subsequently lost in the rhodophyte algae lineage (Fig 4). This likely represents separate origins of Family 28 GTs and UGTs in the Archaeplastida, the latter of which is only present in the streptophyte lineage.
Cladogram of Archaeplastida phylogeny showing UGT clade origins and losses during plant evolution. The origins and losses of Family 28 glycosyltransferases are also shown. Blue circles represent first appearances of a clade, black circles represent the absence of a clade in a particular lineage. Also shown are the relative distributions of UGTs into their respective clades for each species included in our analysis, where each distinct coloured block represents either a separate UGT clade or a group of clades.
The UGT protein family has gone through many diversification events since the divergence of Family 28 GTs and UGTs from a last common ancestor. We identified 20 distinct monophyletic groups of UGTs, 16 of which had been previously identified. Previously reported groups are named alphabetically. Each of the 4 new groups identified in this analysis are comprised of UGT sequences from a single species and are named according to that species. The streptophyte algae K. nitens is the earliest diverging member of the Archaeplastida found with UGT protein-encoding genes in its genome. Three UGTs were identified in K. nitens and these formed a single monophyletic group (Kn1), sister to all other UGTs identified in the analysis. This is consistent with UGTs being present in the last common ancestor of the streptophytes since K. nitens is the earliest diverging member of the streptophyte lineage included in this analysis. The group O clade comprises 83 UGTs from 4 species of the Archaeplastida–O. sativa, S. moellendorffii, P. patens and M. polymorpha. Conservation of these sequences in these 4 land plant species and their absence from the genomes of the streptophyte algae indicates that this clade originated after the divergence of land plants from the streptophyte algae. Group O UGTs sequences are not present in the genomes of A. thaliana and A. agrestis, suggesting a loss of genes in the eudicot lineage after the divergence of O. sativa and A. thaliana from their last common ancestor, and in the hornwort lineage respectively. UGTs sequences from O. sativa, S. moellendorffii, P. patens, M. polymorpha and also A. thaliana constitute a large monophyletic clade consisting of UGT protein from groups F, G, H, I, J, K, L, N, P and Pp1. This suggests that this clade originated at approximately the same stage as group O UGTs, and later diverged into each of the 10 groups [25, 26]. This evidence suggests few divergence events of UGT groups occurred during early streptophyte evolution and then more prolific diversification in more recently diverging lineages.
13 of the 20 UGT groups–A, B, C, D, E, G, H, I, J, K, L, M, N–are only present in the genomes of A. thaliana and O. sativa. This is consistent with these groups being present in the last common ancestor of the angiosperms (Fig 4). Since these sequences are not present in the genome of S. moellendorffii, it suggests that they originated after the divergence of the seed plant lineage from the lycophytes. There are group F UGT sequences in each of the 3 vascular plant species included in the analysis, suggesting this group originated in the last common ancestor of the vascular plant lineage. Group F is sister to groups N and I, which comprise only angiosperm sequences. This suggests groups N and I diverged from a common ancestor which they shared with group F. These data show most UGT diversification in extant taxa occurred after the divergence of the seed plants from the lycophytes.
Five groups were defined by sequences from a singles species in our analysis. Group P comprises UGT sequences from only the monocot angiosperm O. sativa. There are sequences from only the lycophyte S. moellendorffii in groups Sm1 and Sm2, sequences from only the streptophyte algae K. nitens in the group Kn1, and there are sequences only from the moss P. patens in group Pp1. The formation of these monospecific clades is consistent with recent origins of these sequences but could also be an artifact of poor taxon sampling in this analysis. Group P is only found in O. sativa and not the other angiosperm A. thaliana, suggesting that group P originated after the divergence of the monocots and the eudicots or was lost in the dicot lineage. Group Sm1 is the largest clade identified in our analysis and comprises 141 UGT sequences. Sm1 sequences were only found in the S. moellendorffii genome. This is consistent with this group originating in the lycophyte lineage, following the divergence of lycophytes from other vascular plants. The large number of sequences in this group suggest a prolific expansion of the Sm1 UGTs in the lycophyte lineage. Sequences from group Sm2 were also identified only in S. moellendorffii, indicating this group also originated in the lycophyte lineage. However, with 6 sequences in this group, Sm2 did not go through the same extensive expansion as Sm1. Streptophyte algae sequences from K. nitens form a monophyletic clade (Kn1) which is sister to all other UGTs. Kn1 likely represents the earliest diverging UGTs identified in this analysis. Two moss sequences form a monophyletic clade (Pp1). Pp1 is sister to a larger monophyletic clade consisting of groups F, G, H, I, P and N. This suggests Pp1 genes share a common ancestor with sequences from groups F, G, H, I, P and N which that existed before the divergence of bryophytes from other land plants. However, the absence of Pp1 genes in M. polymorpha and A. agrestis suggest that sequences from this group were lost in these lineages. The origins and losses of these groups in different Archaeplastida lineages are summarised in Fig 4.
In summary, we show UGT sequences are present in streptophyte algae but not in glaucophytes, rhodophytes or chlorophytes; UGTs are encoded in the K. nitens genome and in land plants except the hornworts where we propose they were secondarily lost. This indicates that UGTs first arose after the divergence of the streptophyte lineage and continued to diversify following the colonisation of land.
Six ABC-transporter sub-families have been associated with non-target site resistance to herbicides
ABC transporters have been implicated in non-target site herbicide resistance (NTSR) of crops and weeds. To identify the clades ABC transporters that are associated with NTSR belong, a literature survey was conducted. This survey identified genes that have been associated with NTSR because the mRNA levels of the ABC transporter mRNA are higher in resistant than sensitive populations. These genes are classified as “candidate NTSR genes” and are listed in (Table 2). ABC transporters demonstrated to increase herbicide resistance when expressed at high levels as a transgene or by other experimental methods are classified as “confirmed NTSR genes” and are listed in (Table 3).
We identified a total of 41 ABC transporter candidate NTSR genes in a variety of weed and non-weed species from the literature (Table 2). They include the grass weeds, Italian ryegrass (Lolium multiflorum) [65], red sprangletop (Leptochloa chinensis) [66], American slough grass (Beckmannia syzigachne) [67], shortawn foxtail (Alopecurus aequalis) [68] and jungle rice (Echinochloa colona) [69], the broad leaf (eudicot) weeds flixweed (Descurainia Sophia) [70], hairy fleabane (Conyza bonariensis) [71], giant chickweed (Myosoton aquaticum) [66, 72], and Palmer amaranth (Amarnthus palmeri) [73] and the crop rice (Oryza sativa) [74]. The candidate NTSR genes were identified in populations resistance to 5 different HRAC mode of action herbicide classes, including acetolactate synthase inhibitors, acetyl coA carboxylase inhibitors, enolpyruvyl shikimate phosphate synthase inhibitors, glutamine synthetase inhibitors and auxin mimic/cellulose synthesis inhibitors.
There are 12 monophyletic clades of ABC transporters (A, B, C, D, E/F, G, Cp1, Cp2, Cp3, Cp4, Cp5 and CpCm1). Candidate NTSR genes encoding ABC transporters are found in sub-families ABCA, ABCB, ABCC, ABCF, ABCG and ABCI (Table 2). Thirteen ABCB and 18 ABCC members were identified as candidate NTSR genes. ABCG proteins constitute by far the largest monophyletic clade we identified in our phylogenetic analysis, representing approximately 2/3 of all the ABC transporters included. However, relatively few ABCG transporters were identified as candidate NTSR genes. From 4 species, we identified 6 distinct ABCG candidate NTSR genes–ABCG7, ABCG8, ABCG11, ABCG14, ABCG39 and ABCG48. A single candidate NTSR gene was found in the literature from each of the groups ABCA, ABCF and ABCI. Taken together, these data suggest candidate NTSR genes encoding ABC transporters belong to clades that were present in the last common ancestor of the Archaeplastida and that the frequency of candidate NTSR genes is not proportional to ABC transporter clade size.
Six ABC transporter proteins have been experimentally confirmed to confer NTSR in plants (Table 3). These 6 genes belong to 3 different clades–ABCB, ABCC and ABCG. These clades comprise sequences from all 10 species of Archaeplastida included in the analysis, and therefore had already diverged in the last common ancestor of the Archaeplastida. It can be concluded that ABC transporters that have been confirmed to contribute to NTSR belong to clades which were already present in the common ancestor of the Archaeplastida. The ABC transporters confirmed to confer resistance to herbicides did so to a variety of HRAC mode of action groups, including very long-chain fatty acid inhibitors, enolpyruvyl shikimate phosphate synthase inhibitors, auxin mimics, microtubule assembly inhibitors, PSI inhibitors, and a herbicide with an unknown mode of action (MSMA). We therefore show that ABC transporter proteins within the same monophyletic clade confer resistance to herbicides with different modes of action.
UGTs groups associated with herbicide resistance evolved after the divergence of Angiosperms from the Archaeplastida lineage
Many UGT genes have also been reported to contribute to NTSR in populations of resistant weeds in the field. We identified a total of 34 candidate NTSR UGT genes in a variety of weed and non-weed species from the literature (Table 4). They include the grass weeds, Italian ryegrass (Lolium multiflorum) [65], American slough grass (Beckmannia syzigachne) [67, 74, 80], Brachypodium hybridum [81], shortawn foxtail (Alopecurus aequalis) [68], Tausch’s goat grass (Aegilops tauschii) [82], and jungle rice (Echinochloa colona) [69], the broad leaf (eudicot) weeds flixweed (Descurainia Sophia) [70], hairy fleabane (Conyza bonariensis) [71], and giant chickweed (Myosoton aquaticum) [66, 72] and the crop rice (Oryza sativa) [74]. The candidate NTSR genes were identified in populations resistant to 4 classes of HRAC mode of action herbicides, including acetolactate synthase inhibitors, acetyl coA carboxylase inhibitors, enolpyruvyl shikimate phosphate synthase inhibitors and auxin mimic/cellulose synthesis inhibitors.
There are 16 monophyletic groups of UGT sequences in angiosperms. We identified candidate NTSR genes from 11 of these monophyletic groups in angiosperm weeds and crops (Table 4). There are 8 candidate NTSR group D UGT genes in 7 different species. These represent 7 distinct genes, UGT73B3, UGT73B5, UGT73C, UGT73C1, UGT73C5, UGT73D1 and UGT73E1. UGT73B5 was identified in both B. hybridum and A. aequalis. A single candidate NTSR gene was identified in flixweed, UGT71B1, a group E UGT. Group E is the second largest UGT group. No candidate NTSR UGT sequences were identified from group O, which is the largest UGT group in which angiosperm sequences were identified. These data indicate UGT proteins that detoxify herbicides belong to clades that were present in the last common ancestor of the Angiosperms.
Seven UGT genes, from 4 groups–A, D, E and G–have been experimentally confirmed to confer NTSR in plants (Table 5). There are genes from only two species, the angiosperms A. thaliana and O. sativa, in each of these groups in our phylogenetic analysis. This suggests that these groups were present in the last common ancestor and evolved following the divergence of the angiosperms from the lycophytes. 6 of the 7 genes identified confer NTSR to 3 HRAC mode of action classes, HPPD inhibitors, acetyl coA inhibitors and PSI inhibitors.
Discussion
Herbicide use has increased continuously since the 1940s, leading to the evolution of herbicide resistance in weeds through selection of naturally occurring alleles that confer herbicide resistance. Overexpression of genes encoding some members of the ATP-binding cassette (ABC) transporter family and uridine diphosphate (UDP) glycosyltransferase (UGT) family in weeds confers herbicide non-target site resistance (NTSR) [2, 4, 5]. ABC transporters are ATP-driven pumps that transport molecules across membranes, while UGT enzymes catalyse the metabolism of a diversity of compounds by transferring a sugar moiety to an acceptor molecule. By carrying out phylogenetic analyses, we set out to resolve the evolutionary relationships of these protein families, and to classify those genes associated with NTSR. We compared sequences from 10 species which represent distinct lineages of the Archaeplastida and generated phylogenetic trees. From the topology of these trees, it can be concluded that ABC transporters were present in the last common ancestor of the Archaeplastida. There are 12 monophyletic clades of ABC transporters (A, B, C, D, E/F, G, Cp1, Cp2, Cp3, Cp4, Cp5, and CpCm1) and one polyphyletic clade (I), all of which existed in the last common ancestor of the Archaeplastida. UGT enzymes existed in the last common ancestor of the streptophytes and have since diversified into 20 monophyletic clades (A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, Kn1, Pp1, Sm1, and Sm2). Overall, the phylogenetic analysis shows that ABC transporters were present in the last common ancestor of the Archaeplastida, while UGTs proteins are only present in the streptophyte lineage among the Archaeplastida.
In plants, ABC transporters transport of a variety of molecules across membranes [10, 80]. ABC transporters confer herbicide resistance by sequestering herbicides and their metabolites in the vacuole or extracellular spaces, where they cannot interact with their target [16]. We hypothesised that a high diversity of ABC transporters among the Archaeplastida may facilitate the high diversity of substrates which can be recognised and transported by ABC transporters. However, we have shown that closely related ABC transporters, within the same clade, can be associated with resistance to a diverse range of herbicide mode of actions. For example, proteins from clade A are associated with resistance to 6 different herbicides, from 4 different HRAC mode of action groups (HRAC group 1, 2, 9 and 10). This suggests that ABC transporter diversity is not necessary for a diversity of molecules which can be recognised and transported. Nonetheless, this diversity may play a role diversity of ABC transporter localisation, and hence in herbicide detoxification by allowing ABC transporters to sequester chemicals in various locations. These locations include the vacuole and extracellular spaces, where herbicides can be sequestered and are no longer toxic to the plant.
ABC transporter clades in extant plants are ancient and we show that the common ancestor of the Archaeplastida, which existed around 1.6 billion years ago in the Proterozoic era, contained all clades of ABC transporters identified in our analysis [38, 86, 87]. These ABC transporters evolved during the Proterozoic to carry out various cellular functions, unrelated to herbicide resistance. However, certain ABC transporter genes are now being selected in modern agricultural systems for a new purpose–conferring resistance to herbicides. For example, ABCB1, a P-glycoprotein, which has been shown to cause resistance to dicamba, oryzalin, pendimethalin and MSMA when endogenously overexpressed in A. thaliana has been characterised [78]. ABCB1 has been shown to be involved in the transport of auxin, a crucial plant hormone essential for numerous developmental processes. ABCB1 mediates auxin efflux, regulating its distribution and concentration in different plant tissues. ABCB1 mutants have altered auxin distribution and as a result, show various developmental defects [88–90]. Herbicide resistance evolving through the altered function of genes that evolved to carry out an entirely different function, can be considered exaptation [91]. Therefore, we argue that the involvement of ABC transporters in herbicide resistance is an example of exaptation.
UGT enzymes catalyse the conjugation of sugars moieties to a variety of acceptor molecules, including herbicides which can lead to detoxification [22]. High sequence diversity among UGTs genes leads to protein structural heterogeneity, which in turn facilitates a wide range of acceptor molecules, including herbicides [24]. We have shown that a large proportion of UGT gene diversity evolved in the angiosperm lineage, and this may account for the ability of UGTs to detoxify herbicides. However, we have also shown that UGTs from closely related clades, or even those from within the same clade, may confer resistance to herbicides with a variety of different chemistries and modes of action. For example, UGT proteins from group D have been associated with, or confer resistance to, 7 different herbicides from 5 different HRAC mode of action groups (HRAC group 1, 2, 4/29, 9 and 22). This suggests sufficient variability has also evolved within UGT clades in extant plants to give UGTs the capacity to conjugate sugars to a diverse range of acceptor molecules, which includes herbicides.
UGTs proteins were present in the last common ancestor of the streptophytes, which existed 1.1–1.2 billion years ago, in the Proterozoic era [92–94]. However, UGTs are not present in the earlier diverging glaucophyte, rhodophyte or chlorophyte lineages. Since UGTs are present in other eukaryotes and some bacteria, their absence in non-streptophyte algal lineages suggests either a loss of UGT genes in these lineages or that UGTs were acquired in the last common ancestor of the streptophyte lineage through horizontal gene transfer. Our data suggest that there was little diversification of UGT sequences until after the vascular plants diverged from other streptophytes in the Silurian period of the Paleozoic era (420–450 million years ago) [92–95]. 17 of the 20 UGT clades we identified, evolved in the vascular lineage. 14 of these groups are present only in angiosperms, suggesting they evolved after the divergence of seed plants from lycophytes, which occurred in the Devonian period of the Paleozoic era (360–400 million years ago) [22, 92]. This diversification of UGTs likely accompanied an increase in morphological complexity of plants and the colonisation of a wider range of terrestrial habitats. UGTs were likely selected to carry out a variety of cellular functions in streptophytes, during their colonisation of land, and subsequent evolution of the vascular plants. Over the last 50 years, they have been selected for herbicide resistance in the agricultural environment.
We have shown that both ABC transporters and UGT enzymes evolved in the Proterozoic era. ABC transporters genes were present in the last common ancestor of the Archaeplastida and had already diversified into many clades in this lineage. Conversely, our data shows that UGTs genes are only present in the streptophyte lineage and diversified mainly in the vascular lineage among plants. The clade diversity of ABC transporters and UGT enzymes in extant plants may contribute to their ability to confer herbicide resistance. We propose that ABC transporters and UGTs were selected to carry out a variety of cellular functions in the Proterozoic, and now some members of these protein families are being selected to carry out an entirely different function–conferring resistance to herbicides. ABC transporters and UGTs conferring herbicide resistance is therefore an example of exaptation.
Supporting information
S1 Fig. Protein features of ABC transporters and UDP-glycosyltransferases in plants.
Diagrams of the typical ABC transporter (A) and UDP-glycosyltransferases (B). A shows the domain architecture of ABC transporters from each of the major sub-families in plants. The diagram is adapted from https://doi.org/10.1038/srep16724. B shows α-helices (green), β-strands (purple) and residues reported to form part of the acceptor pocket (red dashed lines). The PSPG motif reported to interact with the UDP-sugar donor is highlighted in grey. The structure is based on that of AtUGT72B1 reported in https://doi.org/10.1016/j.phytochem.2008.12.009.
https://doi.org/10.1371/journal.pone.0287356.s001
(TIF)
S2 Fig. Number of ABC proteins identified in each clade.
https://doi.org/10.1371/journal.pone.0287356.s002
(TIF)
S3 Fig. Number of UGT proteins identified in each clade.
Number of Family 28 glycosyltransferases identified in each species is also given.
https://doi.org/10.1371/journal.pone.0287356.s003
(TIF)
S1 Text. Untrimmed alignment of all ABC transporter sequences used in the phylogenetic analysis.
https://doi.org/10.1371/journal.pone.0287356.s004
(TXT)
S2 Text. Untrimmed alignment of all UGT sequences used in the phylogenetic analysis.
https://doi.org/10.1371/journal.pone.0287356.s005
(TXT)
S3 Text. Manually trimmed alignment of all ABC transporter sequences used in the phylogenetic analysis.
https://doi.org/10.1371/journal.pone.0287356.s006
(TXT)
S4 Text. Manually trimmed alignment of all UGT sequences used in the phylogenetic analysis.
https://doi.org/10.1371/journal.pone.0287356.s007
(TXT)
References
- 1. Heap I. The International Herbicide-Resistant Weed Database 2023. http://www.weedscience.org/Home.aspx (accessed January 26, 2023).
- 2. Powles SB, Yu Q. Evolution in Action: Plants Resistant to Herbicides. Annu Rev Plant Biol 2010;61:317–47. pmid:20192743
- 3. Yuan JS, Tranel PJ, Stewart CN. Non-target-site herbicide resistance: a family business. Trends Plant Sci 2007;12:6–13. pmid:17161644
- 4. Gaines TA, Duke SO, Morran S, Rigon CAG, Tranel PJ, Küpper A, et al. Mechanisms of evolved herbicide resistance. J Biol Chem 2020;295:10307–30. pmid:32430396
- 5. Délye C, Duhoux A, Pernin F, Riggins CW, Tranel PJ. Molecular Mechanisms of Herbicide Resistance. Weed Sci 2015;63:91–115. https://doi.org/10.1614/WS-D-13-00096.1.
- 6. Ghanizadeh H, Harrington KC. Non-target Site Mechanisms of Resistance to Herbicides. CRC Crit Rev Plant Sci 2017;36:24–34. https://doi.org/10.1080/07352689.2017.1316134.
- 7. Ito H, Gray WM. A gain-of-function mutation in the Arabidopsis pleiotropic drug resistance transporter PDR9 confers resistance to auxinic herbicides. Plant Physiol 2006;142:63–74. pmid:16877699
- 8. Xi J, Xu P, Xiang C-B. Loss of AtPDR11, a plasma membrane-localized ABC transporter, confers paraquat tolerance in Arabidopsis thaliana. The Plant Journal 2012;69:782–91. pmid:22026747
- 9. Henikoff S, Greene EA, Pietrokovski S, Bork P, Attwood TK, Hood L. Gene Families: The Taxonomy of Protein Paralogs and Chimeras. Science (1979) 1997;278:609–14. pmid:9381171
- 10. Kang J, Park J, Choi H, Burla B, Kretzschmar T, Lee Y, et al. Plant ABC Transporters. The Arabidopsis Book / American Society of Plant Biologists 2011;9:e0153. pmid:22303277
- 11. Theodoulou FL. Plant ABC transporters. Biochimica et Biophysica Acta (BBA)—Biomembranes 2000;1465:79–103. pmid:10748248
- 12. Theodoulou FL, Kerr ID. ABC transporter research: going strong 40 years on. Biochem Soc Trans 2015;43:1033–40. pmid:26517919
- 13. Borghi L, Kang J, Ko D, Lee Y, Martinoia E. The role of ABCG-type ABC transporters in phytohormone transport. Biochem Soc Trans 2015;43:924–30. pmid:26517905
- 14. Martinoia E, Klein M, Geisler M, Bovet L, Forestier C, Kolukisaoglu Ü, et al. Multifunctionality of plant ABC transporters–more than just detoxifiers. Planta 2002;214:345–55. pmid:11855639
- 15. Martinoia E, Grill E, Tommasini R, Kreuz K, Amrhein N. ATP-dependent glutathione S-conjugate “export” pump in the vacuolar membrane of plants. Nature 1993;364:247–9. https://doi.org/10.1038/364247a0.
- 16. Ge X, d’Avignon DA, Ackerman JJ, Sammons RD. Rapid vacuolar sequestration: the horseweed glyphosate resistance mechanism. Pest Manag Sci 2010;66:345–8. pmid:20063320
- 17. Higgins CF. ABC TRANSPORTERS: From Microorganisms to Man. vol. 8. 1992.
- 18. Locher KP. Mechanistic diversity in ATP-binding cassette (ABC) transporters. Nat Struct Mol Biol 2016;23:487–93. pmid:27273632
- 19. Verrier PJ, Bird D, Burla B, Dassa E, Forestier C, Geisler M, et al. Plant ABC proteins–a unified nomenclature and updated inventory. Trends Plant Sci 2008;13:151–9. pmid:18299247
- 20. Saha J, Sengupta A, Gupta K, Gupta B. Molecular phylogenetic study and expression analysis of ATP-binding cassette transporter gene family in Oryza sativa in response to salt stress. Comput Biol Chem 2015;54:18–32. pmid:25531538
- 21. Yu J, Hu F, Dossa K, Wang Z, Ke T. Genome-wide analysis of UDP-glycosyltransferase super family in Brassica rapa and Brassica oleracea reveals its evolutionary history and functional characterization. BMC Genomics 2017;18:474. pmid:28645261
- 22. Dimunová D, Matoušková P, Podlipná R, Boušová I, Skálová L. The role of UDP-glycosyltransferases in xenobioticresistance. Drug Metab Rev 2022;54:282–98. pmid:35635097
- 23. Yonekura-Sakakibara K, Hanada K. An evolutionary view of functional diversity in family 1 glycosyltransferases. The Plant Journal 2011;66:182–93. pmid:21443631
- 24. Bowles D, Isayenkova J, Lim E-K, Poppenberger B. Glycosyltransferases: managers of small molecules. Curr Opin Plant Biol 2005;8:254–63. pmid:15860422
- 25. Caputi L, Malnoy M, Goremykin V, Nikiforova S, Martens S. A genome-wide phylogenetic reconstruction of family 1 UDP-glycosyltransferases revealed the expansion of the family during the adaptation of plants to life on land. The Plant Journal 2012;69:1030–42. pmid:22077743
- 26. Ross J, Li Y, Lim E-K, Bowles DJ. Higher plant glycosyltransferases. Genome Biol 2001;2:reviews3004.1. pmid:11182895
- 27. Gloster TM. Advances in understanding glycosyltransferases from a structural perspective. Curr Opin Struct Biol 2014;28:131–41. pmid:25240227
- 28. Meech R, Mackenzie PI. STRUCTURE AND FUNCTION OF URIDINE DIPHOSPHATE GLUCURONOSYLTRANSFERASES. Clin Exp Pharmacol Physiol 1997;24:907–15. pmid:9406655
- 29. Tiwari P, Sangwan RS, Sangwan NS. Plant secondary metabolism linked glycosyltransferases: An update on expanding knowledge and scopes. Biotechnol Adv 2016;34:714–39. pmid:27131396
- 30. Loutre C, Dixon DP, Brazier M, Slater M, Cole DJ, Edwards R. Isolation of a glucosyltransferase from Arabidopsis thaliana active in the metabolism of the persistent pollutant 3,4-dichloroaniline. The Plant Journal 2003;34:485–93. pmid:12753587
- 31. HACKETT JD, SU YOON H, BUTTERFIELD NJ, SANDERSON MJ, BHATTACHARYA D. Plastid Endosymbiosis: Sources and Timing of the Major Events. Evolution of Primary Producers in the Sea 2007:109–32. https://doi.org/10.1016/B978-012370518-1/50008-4.
- 32. Yoon HS, Hackett JD, Ciniglia C, Pinto G, Bhattacharya D. A Molecular Timeline for the Origin of Photosynthetic Eukaryotes. Mol Biol Evol 2004;21:809–18. pmid:14963099
- 33. Butterfield NJ. Organic preservation of non-mineralizing organisms and the taphonomy of the Burgess Shale. Paleobiology 1990;16:272–86. https://doi.org/10.1017/S0094837300009994.
- 34. Nie Y, Foster CSP, Zhu T, Yao R, Duchêne DA, Ho SYW, et al. Accounting for Uncertainty in the Evolutionary Timescale of Green Plants Through Clock-Partitioning and Fossil Calibration Strategies. Syst Biol 2020;69:1–16. pmid:31058981
- 35. Morris JL, Puttick MN, Clark JW, Edwards D, Kenrick P, Pressel S, et al. The timescale of early land plant evolution. Proceedings of the National Academy of Sciences 2018;115:E2274–83. pmid:29463716
- 36. Becker B. Snow ball earth and the split of Streptophyta and Chlorophyta. Trends Plant Sci 2013;18:180–3. pmid:23102566
- 37. Herron MD, Hackett JD, Aylward FO, Michod RE. Triassic origin and early radiation of multicellular volvocine algae. Proceedings of the National Academy of Sciences 2009;106:3254–8. pmid:19223580
- 38. Hedges S, Blair JE, Venturi ML, Shoe JL. A molecular timescale of eukaryote evolution and the rise of complex multicellular life. BMC Evol Biol 2004;4:2. pmid:15005799
- 39. Douzery EJP, Snell EA, Bapteste E, Delsuc F, Philippe H. The timing of eukaryotic evolution: Does a relaxed molecular clock reconcile proteins and fossils? Proceedings of the National Academy of Sciences 2004;101:15386–91. pmid:15494441
- 40.
The Arabidopsis Information Resource (TAIR) n.d. https://www.arabidopsis.org/Blast/index.jsp.
- 41.
Rice Genome Annotation Project n.d. http://rice.plantbiology.msu.edu/ (accessed June 24, 2020).
- 42. Li F-W, Nishiyama T, Waller M, Frangedakis E, Keller J, Li Z, et al. Anthoceros genomes illuminate the origin of land plants and the unique biology of hornworts. Nat Plants 2020;6:259–72. pmid:32170292
- 43. Hori K, Maruyama F, Fujisawa T, Togashi T, Yamamoto N, Seo M, et al. Klebsormidium flaccidum genome reveals primary factors for plant terrestrial adaptation. Nat Commun 2014;5. pmid:24865297
- 44.
Phytozome v12.1: Search for genes, families and sequences n.d. https://phytozome.jgi.doe.gov/pz/portal.html#!search?show=BLAST (accessed June 24, 2020).
- 45. Matsuzaki M, Misumi O, Shin-I T, Maruyama S, Takahara M, Miyagishima SY, et al. Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D. Nature 2004;428:653–7. pmid:15071595
- 46. Price DC, Goodenough UW, Roth R, Lee J-H, Kariyawasam T, Mutwil M, et al. Analysis of an improved Cyanophora paradoxa genome assembly. DNA Res 2019;26:287–99. pmid:31098614
- 47. Blum M, Chang H-Y, Chuguransky S, Grego T, Kandasaamy S, Mitchell A, et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res 2021;49:D344–54. pmid:33156333
- 48. Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform 2019;20:1160–6. pmid:28968734
- 49. Okonechnikov K, Golosova O, Fursov M, team the U. Unipro UGENE: a unified bioinformatics toolkit. Bioinformatics 2012;28:1166–7. pmid:22368248
- 50. Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, Gascuel O. New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0. Syst Biol 2010;59:307–21. pmid:20525638
- 51.
FigTree n.d. http://tree.bio.ed.ac.uk/software/figtree/ (accessed January 24, 2023).
- 52.
Draw Freely | Inkscape n.d. https://inkscape.org/ (accessed January 24, 2023).
- 53. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 2000;408:796–815. pmid:11130711
- 54. Kawahara Y, de la Bastide M, Hamilton JP, Kanamori H, Mccombie WR, Ouyang S, et al. Improvement of the oryza sativa nipponbare reference genome using next generation sequence and optical map data. Rice 2013;6:3–10. https://doi.org/10.1186/1939-8433-6-4.
- 55. Banks JA, Nishiyama T, Hasebe M, Bowman JL, Gribskov M, DePamphilis C, et al. The selaginella genome identifies genetic changes associated with the evolution of vascular plants. Science (1979) 2011;332:960–3. pmid:21551031
- 56. Rensing SA, Lang D, Zimmer AD, Terry A, Salamov A, Shapiro H, et al. The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants. Science (1979) 2008;319:64–9. pmid:18079367
- 57. Bowman JL, Kohchi T, Yamato KT, Jenkins J, Shu S, Ishizaki K, et al. Insights into Land Plant Evolution Garnered from the Marchantia polymorpha Genome. Cell 2017;171:287–304.e15. pmid:28985561
- 58. Merchant SS, Prochnik SE, Vallon O, Harris EH, Karpowicz SJ, Witman GB, et al. The Chlamydomonas genome reveals the evolution of key animal and plant functions. Science (1979) 2007;318:245–51. pmid:17932292
- 59. Zhang J, Fu X-X, Li R-Q, Zhao X, Liu Y, Li M-H, et al. The hornwort genome and early land plant evolution. Nat Plants 2020;6:107–18. pmid:32042158
- 60. Hansen SF, Harholt J, Oikawa A, Scheller H V. Plant Glycosyltransferases Beyond CAZy: A Perspective on DUF Families. Front Plant Sci 2012;3:59. pmid:22629278
- 61. Drula E, Garron M-L, Dogan S, Lombard V, Henrissat B, Terrapon N. The carbohydrate-active enzyme database: functions and literature. Nucleic Acids Res 2022;50:D571–7. pmid:34850161
- 62. Fan B, Chen T, Zhang S, Wu B, He B. Mining of efficient microbial UDP-glycosyltransferases by motif evolution cross plant kingdom for application in biosynthesis of salidroside n.d. https://doi.org/10.1038/s41598-017-00568-z.
- 63. Zhang P, Zhang Z, Zhang L, Wang J, Wu C. Glycosyltransferase GT1 family: Phylogenetic distribution, substrates coverage, and representative structural features. Comput Struct Biotechnol J 2020;18:1383. pmid:32637037
- 64. Foflonker F, Price DC, Qiu H, Palenik B, Wang S, Bhattacharya D. Genome of the halotolerant green alga P icochlorum sp. reveals strategies for thriving under fluctuating environmental conditions. Environ Microbiol 2015;17:412–26. https://doi.org/10.1111/1462-2920.12541.
- 65. Cechin J, Piasecki C, Benemann DP, Kremer FS, Galli V, Maia LC, et al. Transcriptome Analysis Identifies Candidate Target Genes Involved in Glyphosate-Resistance Mechanism in Lolium multiflorum. Plants (Basel) 2020;9. pmid:32481698
- 66. Chen K, Peng Y, Zhang L, Wang L, Mao D, Zhao Z, et al. Whole transcriptome analysis resulted in the identification of Chinese sprangletop (Leptochloa chinensis) genes involved in cyhalofop-butyl tolerance. BMC Genomics 2021;22:521. pmid:34238252
- 67. Bai S, Zhao Y, Zhou Y, Wang M, Li Y, Luo X, et al. Identification and expression of main genes involved in non‐target site resistance mechanisms to fenoxaprop‐p‐ethyl in Beckmannia syzigachne. Pest Manag Sci 2020;76:2619–26. https://doi.org/10.1002/ps.5800.
- 68. Zhao N, Li W, Bai S, Guo W, Yuan G, Wang F, et al. Transcriptome Profiling to Identify Genes Involved in Mesosulfuron-Methyl Resistance in Alopecurus aequalis. Front Plant Sci 2017;8:1391. pmid:28848590
- 69. Rangani G, Rouse CE, Saski C, Noorai RE, Shankar V, Lawton-Rauh AL, et al. High Resistance to Quinclorac in Multiple-Resistant Echinochloa colona Associated with Elevated Stress Tolerance Gene Expression and Enriched Xenobiotic Detoxification Pathway. Genes (Basel) 2022;13:515. pmid:35328069
- 70. Yang Q, Deng W, Li X, Yu Q, Bai L, Zheng M. Target-site and non-target-site based resistance to the herbicide tribenuron-methyl in flixweed (Descurainia sophia L.). BMC Genomics 2016;17:551. pmid:27495977
- 71. Piasecki C, Yang Y, Benemann DP, Kremer FS, Galli V, Millwood RJ, et al. Transcriptomic Analysis Identifies New Non-Target Site Glyphosate-Resistance Genes in Conyza bonariensis. Plants 2019;8:157. pmid:31181629
- 72. Bai S, Liu W, Wang H, Zhao N, Jia S, Zou N, et al. Enhanced Herbicide Metabolism and Metabolic Resistance Genes Identified in Tribenuron-Methyl Resistant Myosoton aquaticum L. J Agric Food Chem 2018;66:9850–7. https://doi.org/10.1021/acs.jafc.8b02740.
- 73. Salas-Perez RA, Saski CA, Noorai RE, Srivastava SK, Lawton-Rauh AL, Nichols RL, et al. RNA-Seq transcriptome analysis of Amaranthus palmeri with differential tolerance to glufosinate herbicide. PLoS One 2018;13:1–33. https://doi.org/10.1371/journal.pone.0195488.
- 74. Zhao Y, Li W, Sun L, Wu R, Xu H, Su W, et al. Candidate Genes Involved in Tolerance to Fenoxaprop-P-Ethyl in Rice Induced by Isoxadifen-Ethyl Hydrolysate. Agronomy 2023;13:225. https://doi.org/10.3390/agronomy13010225.
- 75. Liu W, Bai S, Zhao N, Jia S, Li W, Zhang L, et al. Non-target site-based resistance to tribenuron-methyl and essential involved genes in Myosoton aquaticum (L.). BMC Plant Biol 2018;18:225. pmid:30305027
- 76. Lu Y-P, Li Z-S, Rea PA. AtMRP1 gene of Arabidopsis encodes a glutathione S -conjugate pump: Isolation and functional definition of a plant ATP-binding cassette transporter gene. Proceedings of the National Academy of Sciences 1997;94:8243–8. https://doi.org/10.1073/pnas.94.15.8243.
- 77. Pan L, Yu Q, Wang J, Han H, Mao L, Nyporko A, et al. An ABCC-type transporter endowing glyphosate resistance in plants. Proceedings of the National Academy of Sciences 2021;118:e2100136118. pmid:33846264
- 78. Windsor B, Roux SJ, Lloyd A. Multiherbicide tolerance conferred by AtPgp1 and apyrase overexpression in Arabidopsis thaliana. Nat Biotechnol 2003;21:428–33. pmid:12640467
- 79. Mentewab A, biotechnology CS-N, 2005 undefined. Overexpression of an Arabidopsis thaliana ABC transporter confers kanamycin resistance to transgenic plants. NatureCom n.d. pmid:16116418
- 80. Pan L, Gao H, Xia W, Zhang T, Dong L. Establishing a herbicide-metabolizing enzyme library in Beckmannia syzigachne to identify genes associated with metabolic resistance. J Exp Bot 2016;67:1745–57. https://doi.org/10.1093/jxb/erv565.
- 81. Matzrafi M, Shaar-Moshe L, Rubin B, Peleg Z. Unraveling the Transcriptional Basis of Temperature-Dependent Pinoxaden Resistance in Brachypodium hybridum. Front Plant Sci 2017;8:1064. pmid:28680434
- 82. Zhang D, Li X, Bei F, Jin T, Jia S, Bu R, et al. Investigating the Metabolic Mesosulfuron-Methyl Resistance in Aegilops tauschii Coss. By Transcriptome Sequencing Combined with the Reference Genome. J Agric Food Chem 2022;70:11429–40. https://doi.org/10.1021/acs.jafc.2c04529.
- 83. Huang X, Zhao S, Zhang Y, Li Y, Shen H, Li X, et al. A novel UDP-glycosyltransferase 91C1 confers specific herbicide resistance through detoxification reaction in Arabidopsis. Plant Physiology and Biochemistry 2021;159:226–33. pmid:33387851
- 84. Brazier-Hicks M, Edwards R. Functional importance of the family 1 glucosyltransferase UGT72B1 in the metabolism of xenobiotics in Arabidopsis thaliana. The Plant Journal 2005;42:556–66. pmid:15860014
- 85. Lira CE, Ahn JH, Lim J. Molecular Genetic Analysis of Tandemly Located Glycosyltransferase Genes, UGT73B1, UGT73B2, and UGT73B3, in Arabidopsis thaliana. vol. 49. 2006.
- 86. Archibald JM. Endosymbiosis and Eukaryotic Cell Evolution. Current Biology 2015;25:R911–21. pmid:26439354
- 87. Keeling PJ. The Number, Speed, and Impact of Plastid Endosymbioses in Eukaryotic Evolution. Annu Rev Plant Biol 2013;64:583–607. pmid:23451781
- 88. Geisler M, Murphy AS. The ABC of auxin transport: the role of p-glycoproteins in plant development. FEBS Lett 2006;580:1094–102. pmid:16359667
- 89. Blakeslee JJ, Bandyopadhyay A, Lee OR, Mravec J, Titapiwatanakun B, Sauer M, et al. Interactions among PIN-FORMED and P-Glycoprotein Auxin Transporters in Arabidopsis. Plant Cell 2007;19:131–47. https://doi.org/10.1105/tpc.106.040782.
- 90. Geisler M, Blakeslee JJ, Bouchard R, Lee OR, Vincenzetti V, Bandyopadhyay A, et al. Cellular efflux of auxin catalyzed by the Arabidopsis MDR/PGP transporter AtPGP1. The Plant Journal 2005;44:179–94. pmid:16212599
- 91. Gould SJ, Vrba ES. Exaptation—a Missing Term in the Science of Form. Paleobiology 1982;8:4–15. https://doi.org/10.1017/S0094837300004310.
- 92. Delwiche CF, Cooper ED. The Evolutionary Origin of a Terrestrial Flora. Current Biology 2015;25:R899–910. pmid:26439353
- 93. Becker B, Marin B. Streptophyte algae and the origin of embryophytes. Ann Bot 2009;103:999–1004. pmid:19273476
- 94. Leliaert F, Smith DR, Moreau H, Herron MD, Verbruggen H, Delwiche CF, et al. Phylogeny and Molecular Evolution of the Green Algae. CRC Crit Rev Plant Sci 2012;31:1–46. https://doi.org/10.1080/07352689.2011.615705.
- 95. Kenrick P, Crane PR. The origin and early evolution of plants on land. Nature 1997;389:33–9. https://doi.org/10.1038/37918.