Genome-wide identification of ABCC gene family and their expression analysis in pigment deposition of fiber in brown cotton (Gossypium hirsutum)

  • Na Sun,

    Roles Conceptualization, Data curation, Methodology, Software, Validation, Writing – original draft

    Affiliation School of Life Sciences, Anhui Agricultural University, Hefei, PR China

  • Yong-Fei Xie,

    Roles Formal analysis, Software, Validation

    Affiliation School of Life Sciences, Anhui Agricultural University, Hefei, PR China

  • Yong Wu,

    Roles Formal analysis, Methodology, Validation

    Affiliation School of Life Sciences, Anhui Agricultural University, Hefei, PR China

  • Ning Guo,

    Roles Data curation

    Affiliation School of Life Sciences, Anhui Agricultural University, Hefei, PR China

  • Da-Hui Li,

    Roles Project administration, Writing – review & editing

    Affiliation School of Life Sciences, Anhui Agricultural University, Hefei, PR China

  • Jun-Shan Gao

    Roles Conceptualization, Funding acquisition, Project administration, Writing – review & editing

    gaojsh@ahau.edu.cn

    Affiliation School of Life Sciences, Anhui Agricultural University, Hefei, PR China

Abstract

ABC (ATP-binding cassette) transporters are a superfamily of transmembrane proteins that are commonly observed in natural organisms. The ABCC (ATP-binding cassette C subfamily) protein belongs to a subfamily of the ABC protein family and is a multidrug resistance-associated transporter that localizes to the tonoplast and plays a significant role in pathogenic microbial responses, heavy metal regulation, secondary metabolite transport, and plant growth. Recent studies have shown that the ABCC protein is also involved in the transport of anthocyanins/proanthocyanidins (PAs). To clarify the types and numbers of ABCC genes involved in PA transport in Gossypium hirsutum, the phylogenetic evolution, physical location, and structure of ABCC genes were analyzed by bioinformatic methods in the upland cotton genome, and the expression levels of these genes were analyzed at different developmental stages of the cotton fiber. The results showed that 42 ABCC genes were initially identified in the whole genome of upland cotton; they were designated GhABCC1-42. The gene structure and phylogenetic analysis showed that the closely related ABCC genes were structurally identical. The analysis of chromosomal localization demonstrated that there were no ABCC genes on chromosomes AD/At2, AD/At5, AD/At6, AD/At10, AD/At12, AD/At13, AD/Dt2, AD/Dt6, AD/Dt10, and AD/Dt13. ABCC genes were present on the other chromosomes, and gene clusters appeared on the two chromosomes AD/At11 and AD/Dt8. Phylogenetic tree analysis showed that some ABCC proteins in G. hirsutum were clustered with those of Arabidopsis thaliana, Vitis vinifera and Zea mays, which are known to function in anthocyanin/PA transport. The protein structure prediction indicated that the GhABCC protein structure is similar to the AtABCC protein in A. thaliana, and most of these proteins have a transmembrane domain. At the same time, a quantitative RT-PCR analysis of 42 ABCC genes at different developmental stages of brown cotton fiber showed that the relative expression levels of GhABCC24, GhABCC27, GhABCC28, GhABCC29 and GhABCC33 were consistent with the trend of PA accumulation, suggesting that these genes may play a role in PA transport. These results provide a theoretical basis for further analysis of the function of the cotton ABCC genes and their role in the transport of PA.

Introduction

Cotton (Gossypium spp.) is an economically important crop that is cultivated worldwide. Cotton fiber is a necessity for daily life and is an important raw material for the textile industry. Colored cotton is an eco-friendly textile raw material that does not need to be dyed, bleached or otherwise treated to obtain a certain color in the process of making textiles. The common varieties of colored cotton are brown cotton and green cotton, and brown cotton is the primary cultivated variety [1]. However, colored cotton has a number of disadvantages, such as low yield, short fiber length, low color and genetic instability of the pigment [2], which limits its application and marketing. Therefore, it is urgently important to elucidate the molecular mechanism governing colored cotton fiber pigmentation formation. At present, there are in-depth studies on the metabolic pathways of brown cotton fiber pigments. The key functional genes and regulatory factors in the proanthocyanidin (PA) biosynthesis pathway have been well studied [3], but the mechanism underlying transport and oxidative polymerization has not been elucidated to date.

Studies have found that PAs in plant cells are stored in the large central vacuole; therefore, the polymerization of PAs may occur in the vacuole [4]. Some researchers have shown that some key enzymes in the PA metabolism pathway, such as ANR, ANS, and DFR, are located in the cytoplasm [5], and hypothesized that these three enzymes may perform anthocyanin and flavan-3-ol biosynthesis in a certain area of the cytoplasm [6, 7]. In Arabidopsis thaliana seed coat cells, epicatechin and cyanidin are initially synthesized in the cytoplasm, are transported to the vacuole by transporters, and finally aggregate to form multimers in the vacuole [8]. During the transport of epicatechin and catechin to the vacuole, four types of proteins are involved: ABCC protein, MATE protein (TT12), GST protein (TT19) and P-ATPase protein (TT13) [9]. A study of A. thaliana seed coat PAs found that TT12 is involved in the transport of epicatechin glycosides and anthocyanin glycosides, while TT19 is involved in the transport of anthocyanin glycosides and epicatechin, and the proton pump encoded by TT13 provides a concentration gradient for the transport of epicatechin and catechin [10–12]. The ABCC protein may also be involved in the transport of flavonoids in some plants and can cotransport PA precursor substances with the GST protein [9].

ABC (ATP-binding cassette) transporters are an ancient and large family of transmembrane proteins that are commonly observed in natural organisms. Most ABC transporters are active in vivo and rely on the energy generated by ATP hydrolysis to achieve transmembrane transport of substrates inside and outside the cell, which include amino acids, liposomes, polysaccharides, peptides, heavy metal chelates, alkaloids and drugs [13, 14]. The multidrug resistance (MDR) protein was the first ABC transporter identified in eukaryotes and is involved in the excretion of intracellular drugs to prevent their excessive accumulation in cells [15, 16]. Fifty-six and 53 ABC transporters have been identified in Drosophila and Bombyx mori, respectively, in which the Bmwh3 protein of B. mori and the white and brown ABC proteins of Drosophila have been shown to have pigment transport functions [17–21]. The corn bronze-2 (bz2) mutant lacks a glutathione S-transferase encoded by the bz2 gene, resulting in the inability of anthocyanins to accumulate in vacuoles. Because glutathione S-transferase has a very important effect on the activity of MRP transporter binding substrates, it is speculated that in the maize bz2 mutant, the MRP-type ABC transporter is likely to participate in the anthocyanin transport process [22]. Studies have found that sodium orthovanadate, an inhibitor of ABC transporters, can significantly reduce the secretion of flavonoids, so it is speculated that ABC transporters may be related to the secretion of flavonoids from soybean roots [23].

Plant ABC transporters were first discovered during plant detoxification, and subsequently, a large number of ABC transporters were identified in plants. To date, the known functions of ABC transporters have exceeded the scope of the detoxification mechanism. Some studies have confirmed that ABC transporters play important roles in plant pathogenic microbial responses, regulation of heavy metals, and transport of secondary metabolites [24]. With the sequencing of plant genomes, ABC transporters have been extensively identified and studied in plants. At present, the number of ABC transporters identified in plants is considerably higher than that in animals or microorganisms; for instance, there are 123 ABC transporters in Oryza sativa, 127 ABC transporters in A. thaliana, and 89 ABC transporters in leguminous plants [25–27]. The large number of ABC transporters in plants may be involved in complex metabolic activities.

The ABCC protein (ATP-binding cassette C subfamily) belongs to a subfamily of the ABC protein family and is a multidrug resistance-associated transporter. Most ABCC transporters have a transmembrane domain consisting of 3 to 5 transmembrane helices and are involved in many physiological processes, such as intracellular detoxification, transport of chlorophyll metabolites, and regulation of ion channels [28]. In addition, plant ABCC transporters play an important role in the process of storing glycosides and pigment metabolites in vacuoles. Previous studies have shown that ABCG10 regulates the expression level of isoflavones in sputum [29]; MRP3 is transformed into the leaves by constructing an interference vector and changes the color of the leaves in maize [30]; and VvABCC1 is involved in anthocyanins in grape skins [31].

Cotton is the most important fiber crop worldwide and exhibits a wide range of varieties, among which upland cotton (Gossypium hirsutum) is the most widely cultivated. With the completion of whole-genome sequencing of upland cotton [32], the genome-wide database can be used for systematic screening, identification and comparative genomics research and provides a rich resource for research on the biological functions of ABCC gene family members. At present, the ABCC genes of A. thaliana, Vitis vinifera, Zea mays and other species have been identified, and there are many studies on the function of the model plant ABCC gene family. In this study, the ABCC gene family of upland cotton was identified by bioinformatics. The number, sequence characteristics and evolutionary relationship of ABCC genes in upland cotton are analyzed at the genetic level. The function of these genes is predicted by qRT-PCR and homology comparison, and the role of the genes in the transport and accumulation of pigment in brown cotton fiber is further discussed to provide a theoretical basis for breeding pigment-stable varieties of brown cotton.

Results

Identification of the ABCC gene family in upland cotton

Using bioinformatic methods, ABCC family members were screened from upland cotton. At the same time, the basic information of all ABCC genes was retrieved using the ExPASy online website, and the physical and chemical properties of protein length, molecular weight and isoelectric point were obtained and analyzed. The results showed that 42 ABCC genes were obtained in upland cotton (Table 1). The ABCC protein sequences showed significant differences, and their amino acid lengths ranged from 435 aa (GhABCC39) to 1624 aa (GhABCC24). The molecular weights ranged from 48.52 kD (GhABCC39) to 182.95 kD (GhABCC24). The isoelectric points ranged from 5.4 (GhABCC39) to 8.96 (GhABCC40). These basic characteristics show that the gene family varies considerably in gene length and protein properties, indicating that the members of the gene family have different characteristics and potentially play different biological roles.

Table 1. Characteristics of GhABCC genes identified in G. hirsutum.

https://doi.org/10.1371/journal.pone.0246649.t001

Phylogenetic analysis of the ABCC gene family

To better clarify the genetic relationship between monocotyledonous and dicotyledonous species, a phylogenetic tree was constructed according to the ABCC sequences from Z. mays (19), G. hirsutum (42), A. thaliana (94) and V. vinifera (23) (Fig 1). According to the classification method described by Sun et al. [33], the GhABCC gene family was divided into four subfamilies: I, II, III, and IV. The results indicated that more ABCC genes in cotton and A. thaliana are clustered together, and the evolutionary relationship between them is closely related. The AtABCC1 and AtABCC2 genes in A. thaliana have been demonstrated to be involved in the transport of PAs [34, 35]; therefore, we hypothesize that members of subfamily III in G. hirsutum may be involved in the transport of anthocyanins in vacuoles.

Fig 1. Phylogenetic tree of ABCC proteins from G. hirsutum, A. thaliana, V. vinifera and Z. mays.

The tree was generated with MEGA 7.0 software (1000 bootstrap replicates) using the neighbor-joining method. Different colors indicate different subfamilies of ABCC.

https://doi.org/10.1371/journal.pone.0246649.g001

Characteristics of GhABCC gene structures

In this study, to better understand the evolutionary relationship of GhABCC genes, a phylogenetic tree was constructed with the ABCC proteins from G. hirsutum (Fig 2). As shown in Fig 2, there are 15 homologous pairs among the GhABCC genes, of which 11 pairs have bootstrap values of 100, indicating that these 11 pairs of GhABCC genes are closely related. The gene sequences of members of the same subfamily are very similar, which suggests that their functions may be similar and reflects the conservation of the GhABCC genes during evolution.

Fig 2. Phyletic evolution and gene structure of the ABCC gene family in G. hirsutum.

The exon-intron structure map of the upland cotton ABCC family was obtained using the GSDS online tool. Cotton ABCC family phylogenetic tree (left) and gene structure (right). Yellow boxes, blue boxes and black lines represent exons, 5’ or 3’ untranslated region (UTR) and introns, respectively.

https://doi.org/10.1371/journal.pone.0246649.g002

Distribution of conserved motifs of GhABCCs

Twenty conserved sequences of ABCC protein in upland cotton were identified by the online software MEME. Also, it can be observed that all GhABCCs contain motifs 1, 2, 3 and 7 (Fig 3), and the conserved regions of different subfamilies are different, indicating that these motifs may have some specific functions (Fig 4). The similarity and difference of gene structure and conserved motifs reflect the relative conservation of the GhABCC gene family in the lengthy evolutionary process and the diversity generated for adapting to the environment.

Fig 3. Conserved motif compositions of ABCC genes from upland cotton.

Twenty putative conserved motifs were elucidated using MEME with complete protein sequences. All motifs have been labeled by different colors.

https://doi.org/10.1371/journal.pone.0246649.g003

Fig 4. Distribution of conserved motifs of ABCC proteins in G. hirsutum.

The x-axis indicates the conserved sequences of the domain. The height of each letter indicates the conservation of each residue across all proteins. The y-axis is a scale of the relative entropy, which reflects the conservation rate of each amino acid.

https://doi.org/10.1371/journal.pone.0246649.g004

Chromosome localization of GhABCC genes

To further study the effect of gene evolution on the GhABCC gene family, chromosome localization of ABCC genes was analyzed using MapInspect software in upland cotton. The results showed that ABCC genes were primarily distributed on At1, Dt1, At3, Dt3, At4, Dt4, Dt5, At7, Dt7, At8, Dt8, At9, Dt9, At11, Dt11 and Dt12 chromosomes in upland cotton, and most of them were in the middle and lower parts of chromosomes; only a small number were in the upper part (Fig 5). Forty-two ABCC genes were randomly distributed, among which 5 ABCC genes were distributed on the Dt8 chromosome, which carried the most. It is generally believed that a 200-kb region containing more than three genes is considered a gene cluster. There is a gene cluster on chromosome Dt8 in upland cotton, which may encode structural genes that catalyze different steps of the same metabolic pathway. Tandem gene duplication is a process in which a DNA segment is duplicated into one or more adjacent copies, which drives the evolution of gene families through frequent gene birth and death. According to the definition of tandem gene duplication, GhABCC14, 16, 26; GhABCC14, 15, 17; GhABCC4, 5; GhABCC6, 40; GhABCC24, 41; GhABCC27, 32; and GhABCC29, 31 are tandemly duplicated.
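The gene cluster rule quoted above (more than three genes within 200 kb) can be sketched as a simple window scan. This is a minimal illustration, not the procedure used in the study: the function name, the input layout (name, chromosome, start position in bp) and the example gene names are hypothetical, and "more than three" is read literally as at least four genes.

```python
def find_gene_clusters(genes, window_bp=200_000, min_genes=4):
    """Return (chromosome, [gene names]) for each maximal window of at most
    window_bp that holds at least min_genes genes.

    genes: iterable of (name, chromosome, start_bp) tuples (hypothetical layout).
    """
    by_chrom = {}
    for name, chrom, start in genes:
        by_chrom.setdefault(chrom, []).append((start, name))

    clusters = []
    for chrom, items in by_chrom.items():
        items.sort()
        last_end = -1  # index of the last gene in the most recently reported window
        for i in range(len(items)):
            j = i
            # Extend the window while the next gene starts within window_bp of gene i.
            while j + 1 < len(items) and items[j + 1][0] - items[i][0] <= window_bp:
                j += 1
            # Report only windows that are not contained in the previous one.
            if j - i + 1 >= min_genes and j > last_end:
                clusters.append((chrom, [name for _, name in items[i:j + 1]]))
                last_end = j
    return clusters


# Hypothetical coordinates, for illustration only.
example_genes = [
    ("g1", "Dt8", 1_000_000), ("g2", "Dt8", 1_050_000),
    ("g3", "Dt8", 1_120_000), ("g4", "Dt8", 1_190_000),
    ("g5", "Dt8", 5_000_000),
    ("h1", "At1", 100), ("h2", "At1", 50_000),
]
example_clusters = find_gene_clusters(example_genes)
```

Setting min_genes=3 gives the looser "three or more genes" reading of the rule.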

Fig 5. Chromosomal localization of ABCC genes in G. hirsutum.

The chromosome numbers are indicated at the top of each bar, while the size of a chromosome is indicated by its relative length. The unit on the left scale is Mb, and the short line indicates the approximate position of the GhABCC gene on the corresponding chromosome. Segmental duplication gene pairs are connected with color lines.

https://doi.org/10.1371/journal.pone.0246649.g005

Protein secondary structure prediction and subcellular localization analysis of GhABCCs

Using the online analysis tool SOPMA to make predictions, it was determined that the secondary structure of ABCC protein is mainly alpha helix and random coils, and the proportion of extended strand and beta turn is relatively small (Table 2). Using Cell-PLoc 2.0 software, it was found that ABCC proteins were mainly located on the cell membrane, and only GhABCC24, 27, 28, 29, 31, 32, 33 were located on the vacuole membrane (Table 2). It can be seen from the evolutionary tree that most of these 7 genes are in subfamily III (Fig 1); therefore, it is hypothesized that genes in subfamily III may play important roles in the transport of anthocyanins/PAs.

Table 2. Protein secondary structure prediction and subcellular localization analysis.

https://doi.org/10.1371/journal.pone.0246649.t002

Accumulation of PAs and expression analysis of ABCC genes in brown cotton

The different developmental stages of fiber in white cotton and brown cotton were observed. Before the boll stage, there is no difference in appearance between brown cotton and white cotton; during the period of boll opening, the color of brown cotton fiber gradually darkens, and white cotton is still white (Fig 6). It can be seen that environmental conditions may promote the accumulation of PAs in the brown cotton fiber and lead to the deposition of pigments, thereby darkening the color. Therefore, the contents of PAs at different development stages of brown cotton fiber were measured. The contents of PAs increased gradually with the development of fiber, reached the highest level at 12 DPA, and thereafter decreased gradually (Fig 7). The main reason for this phenomenon may be related to the expression level of related genes in the procyanidin biosynthesis pathway.

Fig 6. The phenotypes of white fiber and brown fiber at different stages of pigment deposition.

The photographs are taken from the budding stage to the end of the boll opening stage during different growth and development periods of cotton.

https://doi.org/10.1371/journal.pone.0246649.g006

Fig 7. PA content at different development stages of brown cotton fibers.

The abscissa indicates days post anthesis (DPA) of cotton fibers, and the ordinate indicates PA content. The error bars indicate SE.

https://doi.org/10.1371/journal.pone.0246649.g007

To analyze the expression levels of GhABCC genes related to PA accumulation during the development stages of brown cotton fiber, fluorescence quantitative primers were designed according to the GhABCC gene sequences (S1 Table), and quantitative RT-PCR was performed using RNA from fibers at 6 DPA, 12 DPA, 18 DPA, 24 DPA and 30 DPA. Since the content of PA in brown cotton fiber changes significantly during these five periods [36], these periods were chosen to determine the relative expression of ABCC genes. The results showed that the relative expression levels of most GhABCCs were inconsistent with the accumulation of PAs in brown cotton fibers, while the relative expression levels of GhABCC24, GhABCC27, GhABCC28, GhABCC29 and GhABCC33 were consistent with the trend of PA accumulation (Fig 8). These GhABCC genes belong to subfamily III, and it was also shown that the GhABCC genes related to PA transport are primarily located in subfamily III according to the analyses of subcellular localization. Therefore, it is hypothesized that one or several genes of GhABCC24, GhABCC27, GhABCC28, GhABCC29 and GhABCC33 may be involved in the accumulation of PAs. In addition, these genes were compared with A. thaliana ABCC1 through DNAMAN software. Among these genes, GhABCC27 and AtABCC1 had the highest homology, reaching 56.19%, and the evolutionary relationship between GhABCC27 and AtABCC1 was the closest in the composite phylogenetic tree (Fig 1). This result suggests that GhABCC27 may have the same function as AtABCC1, which is involved in the transport of PAs.
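The text does not state how "consistent with the trend of PA accumulation" was assessed. One possible way to quantify such agreement, shown here only as a sketch, is the Pearson correlation between a gene's relative expression and the PA content across the five sampling points. The function name and all numeric values below are hypothetical and are not taken from Figs 7 or 8.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length numeric series."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("series must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a constant series has no defined correlation")
    return sxy / sqrt(sxx * syy)


# Hypothetical values at 6, 12, 18, 24 and 30 DPA (illustration only).
pa_content = [2.0, 5.0, 4.0, 3.0, 1.5]
gene_expression = [1.0, 6.0, 4.5, 2.5, 1.0]
r = pearson(pa_content, gene_expression)  # close to 1 means the trends agree
```

A coefficient near 1 would indicate that expression rises and falls together with PA content; with only five time points, such a value is descriptive rather than a statistical test.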

Fig 8. Expression patterns of ABCC genes in G. hirsutum.

Relative expression levels of the ABCC genes at different development stages of brown cotton fiber. The relative expression level was calculated using the 2^−ΔΔCt method. The color scale from 0 to 12 indicates the expression level; red indicates high expression, and blue indicates low expression.

https://doi.org/10.1371/journal.pone.0246649.g008

Analysis of cis-acting elements of GhABCC27 promoter

The online promoter element prediction tool PlantCARE was used to analyze the cis-acting elements of GhABCC27. It was found that in addition to the basic core elements of the promoter such as CAAT-box and TATA-box, the promoter also includes a large number of elements involved in the light response (AE-box, ATCT-motif, Box 4, GT1-motif, LAMP-element, TCT-motif and chs-CMA1a), as well as plant abiotic stress inducing elements (ARE and MBS) and gibberellin response elements (GARE-motif and P-box). It has the same cis-acting elements as the GST transporter GhTT19, so it is speculated that GhABCC27 may have the same transport function as GhTT19. In addition, it also includes a cis-acting regulatory element related to endosperm expression (GCN4_motif) and a cold-responsive cis-acting element and other cis-acting elements (Table 3). These results indicate that the expression of GhABCC27 gene in brown cotton may be regulated by external environmental conditions such as light, plant hormones, and adversity stress.

Table 3. Functional prediction of cis acting elements of GhABCC27 promoter.

https://doi.org/10.1371/journal.pone.0246649.t003

Discussion

In recent years, there have been many reports on the structure and function of each member of the ABC gene family, but few reports on cotton have been published. Plant ABCC transporter plays an important role in the process of vacuolar storage of glycosides and pigment metabolites, and is generally named MRP in related research reports [37]. Studies have shown that both ZmMrp3 and ZmMrp4 of corn are involved in the accumulation of pigments in vacuoles [30]. Among the 129 ABC genes, 15 ABC genes encoding MRP proteins have been identified [38], but only the structure and function of the MRP gene family has been studied in A. thaliana. The initial discovery of the plant MRP gene was due to the observation that the entry of glutathione compounds into the vacuole was dependent on ATP for energy, rather than the proton potential difference inside and outside the membrane [39]. Both AtMRP1 and AtMRP2 have glutathione-conjugated transport activity, and AtMRP2 is more active than AtMRP1 in Arabidopsis [40]. There is evidence that AtABCC1 functions as an anthocyanin transporter that depends on GSH without the formation of an anthocyanin-GSH conjugate [41].

In this study, the ABCC gene families of Arabidopsis, grape, maize and upland cotton were identified and analyzed at the genomic level. 42 ABCC genes were identified in upland cotton. The high number of family members determines the diversity and specificity of ABCC gene family functions. According to the homology and domains of the conserved sequences of ABCC transporters, GhABCCs are divided into 4 subfamilies, and the members of each subfamily are named systematically. The increase in the number of genes among species is considered to be the way to promote the evolution of species. The main way to increase the number of gene families is gene replication. Gene replication can be divided into intragene replication and intergene replication. The upland cotton genome undergoes several gene duplication events in the process of replication, and the copies of these genes are usually free from selection pressure [42]. This phenomenon not only guarantees the evolution of upland cotton but also enriches the diversity of the upland cotton gene family. Previous studies have shown that epicatechin is synthesized in the cytoplasm, transported by transporters to the vacuole, aggregates and accumulates in the vacuole. Previous studies have found that the AtABCC1 and AtABCC2 genes play an important role in the accumulation of PAs in Arabidopsis seed coats [34, 35]. VvABCC17 has also been shown to be located in the vacuolar membrane and participate in the transport of glycosylated anthocyanins. From the evolutionary tree, we can clearly observe that AtABCC1, AtABCC2 and VvABCC17 [41] and G. hirsutum are clustered in subfamily III. The subcellular location predicts that the genes of subfamily III are primarily located on the vacuole membrane (Table 2). Therefore, we hypothesize that members of subfamily III may also play an important role in the synthesis of brown cotton fiber PAs. 
Through fluorescence quantitative RT-PCR analysis, it was found that the relative expression levels of GhABCC24, GhABCC27, GhABCC28, GhABCC29 and GhABCC33 were consistent with the trend of PA accumulation in brown cotton fibers (Figs 7 and 8). Therefore, we hypothesize that these genes may be related to the transport of PAs. Sequence alignment analysis of these genes with Arabidopsis ABCC1 through DNAMAN software shows that GhABCC27 and AtABCC1 have the highest homology, and the evolutionary relationship was the closest in the phylogenetic tree (Fig 1). This result suggests that GhABCC27 may have the same function as AtABCC1, which is involved in the transport of PAs.

The online promoter element prediction tool PlantCARE was used to analyze the cis-acting elements of GhABCC27. The results showed that in addition to the basic core elements of the promoter such as CAAT-box and TATA-box, the promoter also includes a large number of elements involved in the light response (AE-box, ATCT-motif, Box 4, GT1-motif, LAMP-element, TCT-motif and chs-CMA1a), as well as plant abiotic stress inducing elements (ARE and MBS) and gibberellin response elements (GARE-motif and P-box) (Table 3). These results indicate that the expression of GhABCC27 gene in brown cotton may be regulated by external environmental conditions such as light, plant hormones, and adversity stress. In addition, because GhABCC27 and GST transporter GhTT19 have the same cis-acting elements, GhABCC27 may have the same function of transporting PAs as GhTT19.

Materials and methods

Experimental material and genome databases

Zongcaixuan 1, which has natural brown fiber, is a kind of upland cotton line bred by our laboratory. This line is planted in the high-tech agricultural park of Anhui Agricultural University (Hefei, PR China) in accordance with normal field management. The genome-wide database for upland cotton is available from the website (http://mascotton.njau.edu.cn) [32]. The A. thaliana genome data are from the database (http://www.arabidopsis.org/).

Identification of ABCC gene in upland cotton genome

A local database of the whole genome sequence of G. hirsutum, V. vinifera, A. thaliana and Z. mays was established using DNATOOLS software. Using the amino acid sequence of A. thaliana AtABCC1 (PF00005.27) as a query, TblastN (E-value = 0.001) sequence alignment was performed on the established local database of amino acid sequences of four species [42, 43], and the ABCC family genes were initially screened. The results were tested in the Pfam [44] database (http://pfam.xfam.org/) and CDD [45] (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) to screen the ABCC sequences for the gene signature domains (ABC_membrane, ABC_tran). ExPASy [46] (http://www.expasy.org/) was used to analyze the amino acid sequences online to determine the isoelectric point (pI), the molecular weight (MW) and the instability coefficient of each protein.
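The initial screening step can be illustrated with a short filter over BLAST tabular output. This sketch assumes the search was exported in the standard 12-column tabular format (BLAST+ "-outfmt 6"), which the text does not specify; the function name and the hit identifiers in the example are hypothetical.

```python
def filter_blast_hits(lines, max_evalue=1e-3):
    """Keep the best (lowest) E-value per subject among hits with E-value <= max_evalue.

    lines: iterable of tab-separated BLAST tabular rows with the default columns
    qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend, sstart,
    send, evalue, bitscore.
    """
    best = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        subject_id = fields[1]
        evalue = float(fields[10])
        if evalue <= max_evalue:
            if subject_id not in best or evalue < best[subject_id]:
                best[subject_id] = evalue
    return best


# Hypothetical rows, for illustration only.
example_rows = [
    "AtABCC1\tsubject_A\t62.5\t1500\t500\t10\t1\t1500\t1\t4500\t0.0\t1800",
    "AtABCC1\tsubject_B\t25.0\t80\t60\t2\t10\t90\t5\t245\t0.5\t30.1",
    "AtABCC1\tsubject_C\t40.0\t300\t170\t5\t1\t300\t1\t900\t1e-50\t210",
    "AtABCC1\tsubject_C\t35.0\t100\t60\t3\t1\t100\t1\t300\t1e-6\t55.0",
]
candidates = filter_blast_hits(example_rows)
```

Candidates that pass this threshold would still need the Pfam and CDD domain checks described above before being accepted as ABCC family members.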

Construction of the phylogenetic tree of the GhABCC gene family

The ClustalW tool in the MEGA 7.0 [47] software was used to perform multiple sequence alignment of the amino acid sequences of the upland cotton ABCC proteins, and the phylogenetic tree was subsequently constructed by the neighbor-joining (NJ) method. Branch support values are nonparametric bootstrap values (percentages of 1000 replicates). Meanwhile, a total of 178 ABCC protein sequences from A. thaliana, V. vinifera and Z. mays were obtained from the NCBI database (https://www.ncbi.nlm.nih.gov/). A comprehensive analysis was performed, and a composite phylogenetic tree of the ABCC proteins was drawn by the above method.

Analyses of chromosomal localization, gene structure and conserved motifs

The position information of each ABCC gene on the chromosome was obtained from the cotton genome database, and the physical position of these genes on the chromosome was mapped using MapInspect (http://mapinspect.software.informer.com) software. The exon-intron structure map of the upland cotton ABCC family was obtained using the GSDS [48] online tool (http://gss.cbi.pku.edu.cn/). According to the obtained protein sequence, the MEME [49] online analysis tool (http://meme.sdsc.edu/) was employed to analyze the motif pattern of the upland cotton ABCC family protein.

Signal peptide detection and subcellular localization prediction

The secondary structures of the ABCC proteins were predicted using the online analysis tool SOPMA (https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_sopma.html). The Cell-PLoc 2.0 online analysis platform can perform subcellular localization of proteins of eukaryotes, humans, plants, gram-positive bacteria, gram-negative bacteria and viruses. The subcellular localizations of these proteins were predicted by the Cell-PLoc 2.0 [50] algorithm (http://www.csbio.sjtu.edu.cn/bioinf/Cell-PLoc-2/).

Gene expression analysis of GhABCC

The primers for fluorescent quantitative RT-PCR were designed based on the selected ABCC subfamily gene sequences in upland cotton (S1 Table). The RNA of the fibers of different developmental stages of brown cotton was extracted, and the RNA was reverse transcribed into cDNA and subjected to qRT-PCR analysis. The qRT-PCR volume was 20 μL, including 10 μL of SYBR premix Ex Taq enzyme, 2 μL of cDNA, and 0.8 μL of upstream and downstream primers. The reaction procedure was as follows: 50°C for 2 min; 40 cycles of 95°C for 30 s, 95°C for 5 s, and 60°C for 20 s followed by 72°C for 10 min. The UBQ7 gene was used as an internal reference [51]. Each sample was subjected to three biological replicates, and the relative expression levels were calculated using the 2^−ΔΔCt method [52].
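The 2^−ΔΔCt calculation can be written out as a short sketch. Here ΔCt is the mean Ct of the target gene minus the mean Ct of the reference gene (UBQ7 in this study), and ΔΔCt is the ΔCt of the sample minus the ΔCt of a calibrator sample. The choice of calibrator (for example, the earliest time point) and all Ct values below are assumptions for illustration; the function names are hypothetical.

```python
from statistics import mean


def delta_ct(target_cts, reference_cts):
    """Mean Ct of the target gene minus mean Ct of the reference gene."""
    return mean(target_cts) - mean(reference_cts)


def fold_change(sample_target, sample_ref, calib_target, calib_ref):
    """Relative expression of a sample versus a calibrator by the 2^-ddCt method.

    Each argument is a list of replicate Ct values.
    """
    dd_ct = delta_ct(sample_target, sample_ref) - delta_ct(calib_target, calib_ref)
    return 2.0 ** (-dd_ct)


# Hypothetical Ct values from three replicates (illustration only).
expression = fold_change(
    sample_target=[22.0, 22.0, 22.0],  # target gene, sample of interest
    sample_ref=[18.0, 18.0, 18.0],     # reference gene, sample of interest
    calib_target=[24.0, 24.0, 24.0],   # target gene, calibrator sample
    calib_ref=[18.0, 18.0, 18.0],      # reference gene, calibrator sample
)
```

In this example the sample ΔCt is 4 and the calibrator ΔCt is 6, so ΔΔCt is −2 and the relative expression is 2^2 = 4. The method assumes roughly equal amplification efficiency for the target and reference genes.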

Determination of PA content

According to methods from Ikegami [53], the soluble and insoluble PAs of brown cotton fibers at different developmental stages (6 DPA, 12 DPA, 18 DPA, 24 DPA, and 30 DPA) were extracted, and the content of PAs was determined by spectrophotometry according to the standard curve of catechins, which were used as controls [54]. For each experiment, three biological replicates were performed.
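Reading a concentration off a standard curve can be sketched as a least-squares line fitted to the absorbance of known catechin standards, then inverted for an unknown sample. The sketch assumes a linear response over the measured range, which the text does not state; the concentrations, absorbances, units and function names are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def concentration_from_absorbance(absorbance, slope, intercept):
    """Invert the standard curve to estimate concentration from absorbance."""
    return (absorbance - intercept) / slope


# Hypothetical catechin standards (concentration units are arbitrary here).
standard_conc = [0.0, 10.0, 20.0, 40.0]
standard_abs = [0.02, 0.12, 0.22, 0.42]
slope, intercept = fit_line(standard_conc, standard_abs)
sample_conc = concentration_from_absorbance(0.32, slope, intercept)
```

With these illustrative standards the fitted line has slope 0.01 and intercept 0.02, so an absorbance of 0.32 corresponds to a concentration of 30 in the same units as the standards.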

Analysis of promoter cis-acting elements

The sequence of about 2000 bp upstream of the ATG start codon of the GhABCC27 gene was obtained from the upland cotton genome and treated as the promoter sequence. PlantCARE [55] (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) was used to analyze the GhABCC27 promoter and predict its main cis-acting elements.
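Extracting an upstream region of this kind can be sketched as follows. This is a minimal illustration, assuming the chromosome sequence is available as a string and the position of the A of the ATG is given as a 1-based forward-strand coordinate; the function names and coordinate convention are assumptions of this sketch, not the procedure or tooling used in the study.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    table = str.maketrans("ACGTNacgtn", "TGCANtgcan")
    return seq.translate(table)[::-1]


def upstream_region(chrom_seq, atg_pos, strand, length=2000):
    """Return up to `length` bp immediately upstream of the start codon.

    atg_pos: 1-based forward-strand coordinate of the A of the ATG.
             For a minus-strand gene this is the highest coordinate
             of the start codon.
    strand:  '+' or '-'.
    The result is reported 5'->3' on the gene's own strand.
    """
    if strand == "+":
        start = max(0, atg_pos - 1 - length)
        return chrom_seq[start:atg_pos - 1]
    if strand == "-":
        end = min(len(chrom_seq), atg_pos + length)
        return reverse_complement(chrom_seq[atg_pos:end])
    raise ValueError("strand must be '+' or '-'")
```

The region is truncated when the gene lies closer than `length` bp to the end of the sequence, which is why the text's "about 2000 bp" is treated here as an upper bound.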

Conclusions

In this study, the ABCC gene family was analyzed by bioinformatic methods in different species, and 23, 19, 94 and 42 ABCC genes were identified in V. vinifera, Z. mays, A. thaliana and G. hirsutum, respectively. These genes were analyzed to determine their phylogenetic evolution, chromosome localization, gene structure, orthologous genes and gene expression patterns. The ABCC genes of upland cotton could be divided into 4 major subfamilies; members of the same subfamily had the same or similar gene structure, whereas gene structure differed markedly between subfamilies. Based on homology comparison with A. thaliana in the phylogenetic tree and on the prediction of subcellular location, the genes related to PA transport were preliminarily assigned to subfamily III. By comparing the expression of ABCC subfamily genes with PA content at different developmental stages of brown cotton fiber, five candidate genes related to PA transport were screened out. These bioinformatic analyses provide an initial exploration of the role of ABCC family genes in pigment transport in upland cotton, which may help to establish a theoretical foundation for further research on the function of ABCC genes and on the molecular mechanism of PA transport across membranes in brown cotton.

Supporting information

S1 Table. Sequences of primers used for RT-PCR in this study.

https://doi.org/10.1371/journal.pone.0246649.s001

(DOCX)

Acknowledgments

We thank master's student Anane G. Owusu for assistance with English writing and Prof. Yan Meng (College of Life Sciences, Anhui Agricultural University) for critical comments on the manuscript.

References

  1. Cui S.F.; Zhang H.N.; Li J.L.; Jin W.P.; Wang G.E. Research Progress on Naturally-Colored Cotton. China Cotton. 2011, 38(2): 2–5.
  2. Ru Z.L.; Wang G.X.; He S.P.; et al. The difference of fiber quality and fiber ultrastructure in different natural colored cotton. Cotton Science. 2013, 25(2): 184–188.
  3. Wisman E.; Hartmann U.; Sagasser M.; et al. Knock-Out Mutants from an En-1 Mutagenized Arabidopsis thaliana Population Generate Phenylpropanoid Biosynthesis Phenotypes. Proceedings of the National Academy of Sciences of the United States of America. 1998, 95(21): 12432–12437. pmid:9770503
  4. Janecki A.; Kolodziej H. Anti-adhesive activities of flavan-3-ols and proanthocyanidins in the interaction of group A-streptococci and human epithelial cells. Molecules. 2010, 15(10): 7139–7152. pmid:20953158
  5. Pang Y.; Peel G.J.; Wright E.; et al. Early steps in proanthocyanidin biosynthesis in the model legume Medicago truncatula. Plant Physiology. 2007, 145(3): 601–615. pmid:17885080
  6. Winkel B.S. Metabolic channeling in plants. Annual Review of Plant Biology. 2004, 55(55): 85–107.
  7. Jørgensen K.; Rasmussen A.V.; Morant M.; et al. Metabolon formation and metabolic channeling in the biosynthesis of plant natural products. Current Opinion in Plant Biology. 2005, 8(3): 280–291. pmid:15860425
  8. Issa R. B. A plasma membrane H+-ATPase is required for the formation of proanthocyanidins in the seed coat endothelium of Arabidopsis thaliana. Proceedings of the National Academy of Sciences of the United States of America. 2005, 102(7): 2649–2654. pmid:15695592
  9. Zhao J. Flavonoid transport mechanisms: how to go, and with whom. Trends in Plant Science. 2015, 20(9): 576–585. pmid:26205169
  10. Zhao J.; Pang Y.; Dixon R.A. The mysteries of proanthocyanidin transport and polymerization. Plant Physiology. 2010, 153(2): 437–443. pmid:20388668
  11. Marinova K.; Pourcel L.; Weder B.; et al. The Arabidopsis MATE transporter TT12 acts as a vacuolar flavonoid/H+-antiporter active in proanthocyanidin-accumulating cells of the seed coat. Plant Cell. 2007, 19(6): 2023. pmid:17601828
  12. Kitamura S.; Shikazono N.; Tanaka A. TRANSPARENT TESTA 19 is involved in the accumulation of both anthocyanins and proanthocyanidins in Arabidopsis. Plant Journal. 2004, 37(1): 104–114.
  13. Dean M.; Rzhetsky A.; Allikmets R. The human ATP-binding cassette (ABC) transporter superfamily. Genome Res. 2001, 11(7): 1156–1166. pmid:11435397
  14. Higgins C.F. ABC transporters: from microorganisms to man. Annu Rev Cell Biol. 1992, 8: 67–113. pmid:1282354
  15. Dean M. ABC transporters, drug resistance, and cancer stem cells. J Mammary Gland Biol Neoplasia. 2009, 14(1): 3–9. pmid:19224345
  16. Gottesman M.M.; Fojo T.; Bates S.E. Multidrug resistance in cancer: role of ATP-dependent transporters. Nat Rev Cancer. 2002, 2(1): 48–58. pmid:11902585
  17. Xie X.; Cheng T.; Wang G. Genome-wide analysis of the ATP-binding cassette (ABC) transporter gene family in the silkworm, Bombyx mori. Mol Biol Rep. 2012, 39(7): 7281–7291. pmid:22311044
  18. Abraham E.G.; Sezutsu H.; Kanda T. Identification and characterisation of a silkworm ABC transporter gene homologous to Drosophila white. Mol Gen Genet. 2000, 264(1–2): 11–19. pmid:11016828
  19. Mackenzie S.M.; Brooker M.R.; Gill T.R. Mutations in the white gene of Drosophila melanogaster affecting ABC transporters that determine eye colouration. Biochim Biophys Acta. 1999, 1419(2): 173–185. pmid:10407069
  20. Komoto N.; Quan G.X.; Sezutsu H. A single-base deletion in an ABC transporter gene causes white eyes, white eggs, and translucent larval skin in the silkworm w-3(oe) mutant. Insect Biochem Mol Biol. 2009, 39(2): 152–156. pmid:18996197
  21. Croop J.M.; Tiller G.E.; Fletcher J.A. Isolation and characterization of a mammalian homolog of the Drosophila white gene. Gene. 1997, 185(1): 77–85. pmid:9034316
  22. Marrs K.A.; Alfenito M.R.; Lloyd A.M. A glutathione S-transferase involved in vacuolar transfer encoded by the maize gene Bronze-2. Nature. 1995, 375(6530): 397–400. pmid:7760932
  23. Sugiyama A.; Shitan N.; Yazaki K. Involvement of a soybean ATP-binding cassette-type transporter in the secretion of genistein, a signal flavonoid in legume-Rhizobium symbiosis. Plant Physiol. 2007, 144(4): 2000–2008. pmid:17556512
  24. Kovalchuk A.; Kohler A.; Martin F.; et al. Diversity and evolution of ABC proteins in mycorrhiza-forming fungi. BMC Evol Biol. 2015, 15: 1–19.
  25. Sanchez-Fernandez R.; Davies T.G.; Coleman J.O. The Arabidopsis thaliana ABC protein superfamily, a complete inventory. J Biol Chem. 2001, 276(32): 30231–30244. pmid:11346655
  26. Sugiyama A.; Shitan N.; Sato S. Genome-wide analysis of ATP-binding cassette (ABC) proteins in a model legume plant, Lotus japonicus: comparison with Arabidopsis ABC protein family. DNA Res. 2006, 13(5): 205–228. pmid:17164256
  27. Garcia O.; Bouige P.; Forestier C. Inventory and comparative analysis of rice and Arabidopsis ATP-binding cassette (ABC) systems. J Mol Biol. 2004, 343(1): 249–265. pmid:15381434
  28. Shao R.X.; Shen Y.K.; Zhou W.B.; Fang J.; Zheng B.S. Recent advances for plant ATP-binding cassette transporters. Journal of Zhejiang A&F University. 2013, 30(5): 761–768.
  29. Banasiak J.; Biała W.; Staszków A.; et al. A Medicago truncatula ABC transporter belonging to subfamily G modulates the level of isoflavonoids. Journal of Experimental Botany. 2013, 64(4): 1005. pmid:23314816
  30. Goodman C.D.; Casati P.; Walbot V. A Multidrug Resistance–Associated Protein Involved in Anthocyanin Transport in Zea mays. Plant Cell. 2004, 16(7): 1812–1826.
  31. Francisco R.M.; Regalado A.; Ageorges A.; et al. ABCC1, an ATP Binding Cassette Protein from Grape Berry, Transports Anthocyanidin 3-O-Glucosides. Plant Cell. 2013, 25(5): 1840–1854. pmid:23723325
  32. Huang G.; Wu Z.; Percy R.G.; et al. Genome sequence of Gossypium herbaceum and genome updates of Gossypium arboreum and Gossypium hirsutum provide insights into cotton A-genome evolution. Nat Genet. 2020, 52(5): 516–524. pmid:32284579
  33. Sun R.; Wang K.; Guo T.; et al. Genome-wide identification of auxin response factor (ARF) genes and its tissue-specific prominent expression in Gossypium raimondii. Functional & Integrative Genomics. 2015, 15(4): 1–13. pmid:25809690
  34. Park J.; Song W.Y.; Ko D.; et al. The phytochelatin transporters AtABCC1 and AtABCC2 mediate tolerance to cadmium and mercury. The Plant Journal. 2012, 69: 278–288. pmid:21919981
  35. Behrens C.E.; Smith K.E.; Iancu C.V.; et al. Transport of Anthocyanins and other Flavonoids by the Arabidopsis ATP-Binding Cassette transporter AtABCC2. Scientific Reports. 2019, 9: 437. pmid:30679715
  36. Chen W.; Si G.Y.; Zhao G.; et al. Genomic Comparison of the P-ATPase Gene Family in Four Cotton Species and Their Expression Patterns in Gossypium hirsutum. Molecules. 2018, 23(5): 1092. pmid:29734726
  37. Klein M.; Burla B.; Martinoia E. The multidrug resistance-associated protein (MRP/ABCC) subfamily of ATP-binding cassette transporters in plants. FEBS Lett. 2006, 580: 1112–1122. pmid:16375897
  38. Gaillard S.; Jacquet H.; Vavasseur A.; et al. AtMRP6/AtABCC6, an ATP-binding cassette transporter gene expressed during early steps of seedling development and up-regulated by cadmium in Arabidopsis thaliana. BMC Plant Biol. 2008, 8: 22. pmid:18307782
  39. Martinoia E.; Grill E.; Tommasini R.; et al. ATP-dependent glutathione S-conjugate export pump in the vacuolar membrane of plants. Nature. 1993, 364: 247–249.
  40. Lu Y.P.; Li Z.S.; Drozdowicz Y.M.; et al. AtMRP2, an Arabidopsis ATP binding cassette transporter able to transport glutathione S-conjugates and chlorophyll catabolites: functional comparisons with AtMRP1. Plant Cell. 1998, 10(2): 267–282. pmid:9490749
  41. Francisco R.M.; Regalado A.; Ageorges A.; Burla B.J.; et al. ABCC1, an ATP Binding Cassette Protein from Grape Berry, Transports Anthocyanidin 3-O-Glucosides. The Plant Cell. 2013, 25(5): 1840–1854. pmid:23723325
  42. Li F.; Fan G.; Wang K.; et al. Genome sequence of the cultivated cotton Gossypium arboreum. Nature Genetics. 2014, 46(6): 567–572. pmid:24836287
  43. Paterson A.H.; Wendel J.F.; Gundlach H.; et al. Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres. Nature. 2012, 492(7429): 423–427. pmid:23257886
  44. Punta M.; Coggill P.C.; Eberhardt R.Y.; et al. The Pfam protein families database. Nucleic Acids Research. 2008, 36 (Database issue): 263–266.
  45. Marchler-Bauer A.; Lu S.; Anderson J.B.; et al. CDD: A conserved domain database for the functional annotation of proteins. Nucleic Acids Research. 2011, 39 (Database issue): D225–229. pmid:21109532
  46. Wilkins M.R.; Gasteiger E.; Bairoch A.; et al. Protein identification and analysis tools in the ExPASy server. Methods Mol Biol. 1999, 112: 531–552. pmid:10027275
  47. Kumar S.; Stecher G.; Tamura K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Molecular Biology & Evolution. 2016, 33(7): 1870.
  48. Guo A.Y.; Zhu Q.H.; Chen X.; et al. GSDS: A gene structure display server. Hereditas. 2007, 29(8): 1023. pmid:17681935
  49. Bailey T.L.; Boden M.; Buske F.A.; et al. MEME Suite: Tools for motif discovery and searching. Nucleic Acids Research. 2009, 37 (Web Server issue): W202–W208. pmid:19458158
  50. Chou K.C.; Shen H.B. Cell-PLoc: an improved package of web-servers for predicting subcellular localization of proteins in various organisms. Natural Science. 2010, 1090–1103.
  51. Wang M.; Wang Q.; Zhang B. Evaluation and selection of reliable reference genes for gene expression under abiotic stress in cotton (Gossypium hirsutum L.). Gene. 2013, 530(1): 44–50. pmid:23933278
  52. Livak K.J.; Schmittgen T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2^−ΔΔCT method. Methods. 2001, 25: 402–408. pmid:11846609
  53. Ikegami A.; Akagi T.; Potter D.; et al. Molecular identification of 1-Cys peroxiredoxin and anthocyanidin/flavonol 3-O-galactosyltransferase from proanthocyanidin-rich young fruits of persimmon (Diospyros kaki Thunb.). Planta. 2009, 230: 841–855. pmid:19641937
  54. Li Y.; Tanner G.; Larkin P.J. The DMACA-HCl protocol and the threshold proanthocyanidin content for bloat safety in forage legumes. J. Sci. Food Agric. 1996, 70: 89–101.
  55. Lescot M.; Déhais P.; Thijs G.; et al. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Research. 2002, 30(1): 325–327. pmid:11752327