Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genome-wide identification, phylogenetic, and expression analysis under abiotic stress conditions of LIM gene family in Medicago sativa L.

  • Lili Nian,

    Roles Writing – original draft

    Affiliation College of Forestry, Gansu Agricultural University, Lanzhou, China

  • Xuelu Liu ,

    Roles Conceptualization, Supervision

    liuxl@gsau.edu.cn (XL); yyb_17929@163.com (YY)

    Affiliations College of Forestry, Gansu Agricultural University, Lanzhou, China, College of Resources and Environmental Sciences, Gansu Agricultural University, Lanzhou, China

  • Yingbo Yang ,

    Roles Funding acquisition, Investigation

    liuxl@gsau.edu.cn (XL); yyb_17929@163.com (YY)

    Affiliation College of Resources and Environmental Sciences, Gansu Agricultural University, Lanzhou, China

  • Xiaolin Zhu,

    Roles Resources

    Affiliation College of Agronomy, Gansu Agricultural University, Lanzhou, China

  • Xianfeng Yi,

    Roles Formal analysis

    Affiliation The Animal Husbandry Research Institute of Guangxi Zhuang Autonomous Region, Nanning, China

  • Fasih Ullah Haider

    Roles Writing – review & editing

    Affiliation College of Resources and Environmental Sciences, Gansu Agricultural University, Lanzhou, China

Abstract

The LIM (Lin-11, Isl-1 and Mec-3 domains) family is a key transcription factor widely distributed in animals and plants. The LIM proteins in plants are involved in the regulation of a variety of biological processes, including cytoskeletal organization, the development of secondary cell walls, and cell differentiation. It has been identified and analyzed in many species. However, the systematic identification and analysis of the LIM genes family have not yet been reported in alfalfa (Medicago sativa L.). Based on the genome-wide data of alfalfa, a total of 21 LIM genes were identified and named MsLIM01-MsLIM21. Comprehensive analysis of the chromosome location, physicochemical properties of the protein, evolutionary relationship, conserved motifs, and responses to abiotic stresses of the LIM gene family in alfalfa using bioinformatics methods. The results showed that these MsLIM genes were distributed unequally on 21 of the 32 chromosomes in alfalfa. Gene duplication analysis showed that segmental duplications were the major contributors to the expansion of the alfalfa LIM family. Based on phylogenetic analyses, the LIM gene family of alfalfa can be divided into four subfamilies: αLIM subfamily, βLIM subfamily, γLIM subfamily, and δLIM subfamily, and approximately all the LIM genes within the same subfamily shared similar gene structure. The 21 MsLIM genes of alfalfa contain 10 Motifs, of which Motif1 and Motif3 are the conserved motifs shared by these genes. Furthermore, the analysis of cis-regulatory elements indicated that regulatory elements related to transcription, cell cycle, development, hormone, and stress response are abundant in the promoter sequence of MsLIM genes. Real-time quantitative PCR demonstrated that MsLIM gene expression is induced by low temperature and salt. The present study serves as a basic foundation for future functional studies on the alfalfa LIM family.

1 Introduction

Transcription factors, also known as trans-acting elements, regulate the expression of stress-inducible genes by directly or indirectly binding specifically the cis-elements in promoter regions. The family of LIM proteins possesses a cysteine-rich zinc-binding domain named as LIM domain [1]. The name of the LIM domain is derived from the acronyms of LIN11, ISL1, and MEC3, which are identified from animals containing LIM domain proteins [24]. The LIM domain is widely distributed and highly conserved in eukaryotes. The universal make-up of the LIM domain comprises two zinc fingers linked together with a short two-amino acid spacer [5]. In plants, HaP-LIM1 was first isolated from sunflower (Helianthus annuus L.) pollen [6], and then in tobacco (Nicotiana tabacum L.) [7], lily (Lilium longiflorum Thunb L.) [8], Arabidopsis (Arabidopsis thaliana L.) [9], cotton (Gossypium hirsutum L.) [10], tomato (Solanum lycopersicum L.) [11, 12], and foxtail millet (Setaria italica L.) [13], and other species have successively discovered LIM protein.

The LIM in plants comprises two sub-families based on the number of LIM domains: single LIM sub-family (DA1/DAR) and double LIM sub-family (2LIMs). The 2LIM protein contains two LIM domains separated by 40–50 amino acid residues. Structurally, the 2LIM subfamily in the plant is similar to the domains of cysteine-rich protein (CRP) family members in animals, but there are some differences. Such as the plant 2LIM protein contains a variable short C-terminus and lacks a Glycine region. Based on their expression mode, 2LIM proteins are divided into two types: WLIM (WLIM1 and WLIM2), which is widely expressed in various tissues, and PLIM (PLIM1 and PLIM2), which is widely expressed in pollen tubes [14]. The DA1 & DAR sub-family is present only in plants and contains a conserved LIM domain [15]. According to the amino acid sequence, DA1 & DAR proteins are divided into two types: Class I and Class II. So far, DA1 & DAR LIM proteins have been identified only in land plants. The subcellular localization of plant LIM protein is the same as that of animals, with three simultaneous localizations of cytoplasm, nucleus, and nucleoplasm. In animals, the LIM protein located in the nucleus is mainly involved in the transcriptional regulation of developmental genes [16], while the LIM protein located in the cytoplasm is mainly involved in the regulation of actin cytoskeleton formation as actin-binding protein (ABP) [17]. Tobacco NtWLIM1, sunflower WLIM1, and Arabidopsis LIM protein families all show the characteristics of simultaneous localization in the nucleus and cytoplasm [18]. Tobacco NtWLIM1 protein can specifically bind to PAL-box to activate key enzymes in the phenylpropane metabolic pathway and participate in the regulation of lignin biosynthesis [19]. NtWLIM2 participates in the regulation of cell proliferation and cell cycle progression by activating the expression of histone genes [20]. In addition, it was found that the DA1 & DAR subfamily determines organ size and plant autoimmune response [21, 22]. In recent years, studies have shown that up-regulation of LIM gene expression can respond to abiotic stresses, including drought, salinity, and hormones, indicating that LIM genes may play a key regulatory role in the resistance of plants to various stress responses [15, 23, 24].

As a perennial legume forage of the genus Medicago, alfalfa has the characteristics of high yield, good palatability, and strong adaptability, and has a long cultivation history, and is widely planted [25]. Alfalfa can not only be used as feed, but also has the functions of maintaining water and soil, improving soil, and protecting the ecological environment. Therefore, cultivating tolerant alfalfa varieties is an economic and effective way to resist adversity environment. This experiment uses bioinformatics analysis to identify the alfalfa LIM gene family at the genome-wide level, and further analyzes the gene structure, chromosome distribution, promoter cis-acting elements. Using qRT-PCR analyze the expression of LIM in the leaves of alfalfa at different treatment time points under salt and low-temperature stress. This study can provide a theoretical basis for future research on the function of the alfalfa LIM gene.

2 Materials and methods

2.1 Identification and data collection of alfalfa LIM transcription factor family genes

The genome-wide data of alfalfa is obtained from the “Alfalfa Breeder’s Toolbox” (https://www.alfalfatoolbox.org/). In order to identify all members of the LIM family genes in alfalfa, we downloaded the Arabidopsis LIM gene family sequence and annotation information from the TAIR (http://www.arabidopsis.org/) database. Using the TB tools software [26], based on the reference sequence of the Arabidopsis LIM gene family, the possible LIM family sequence in alfalfa was retrieved through Pfam (http://pfam.xfam.org/family), NCBI-CDD (https://www.ncbi.nlm.nih.gov/cdd/) and SMART (http://smart.embl-heidelberg.de/) online tools to predict protein conserved domains.

2.2 Basic physical and chemical properties and evolutionary analysis of LIM protein

The online software ExPASy (https://web.expasy.org/protparam/) was used to analyze the basic physical and chemical properties of the protein encoded by the alfalfa LIM gene, including the number of amino acids, molecular weight, theoretical isoelectric point (pI), and the average value of hydrophilicity (GRAVY), subcellular location is predicted by Psort-Prediction (http://psort1.hgc.jp/form.html) and Cell-PLoc (http://www.csbio.sjtu.edu.cn/bioinf/Cell-PLoc-2/).

We used 10 species to study the evolutionary relationship between alfalfa LIM genes and other plant LIM genes, including Arabidopsis (Arabidopsis thaliana L.), tobacco (Nicotiana tabacum L.), barrel medic (Medicago truncatula L.), soybean (Glycine max L.), tomato (Solanum lycopersicum L.), quinoa (Chenopodium quinoa L.), maize (Zea mays L.), rice (Oryza sativa L.), wheat (Triticum aestivum L.), and identified alfalfa LIM genes. Fast, scalable generation of LIM protein multiple sequence alignments using Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/). At the same time, MEGA 7 software was used to construct the phylogenetic tree with the Maximum Likelihood method (ML), and the bootstrap was repeatedly set to 1000. After the phylogenetic tree is constructed, the members of the family are classified according to the classification standards of Arabidopsis, tobacco, and rice.

2.3 Chromosome location and gene duplication analysis

Using the annotation information of the LIM gene in the alfalfa genome database, the distribution of the members of the alfalfa LIM family on the 32 alfalfa chromosomes was analyzed. On the plant genome duplication database server (http://chibba.agtec.uga.edu/duplication/index/locket), the duplicate gene pairs are detected. The amino acid sequence of the partially repeated MsLIM gene was predicted using Clustalw software. Synonymous substitution rate (Ks) and non-synonymous substitution rate (Ka) and Ka/Ks are calculated using online tools Clustal Omega and PAL2NAL (http://www.bork.embl.de/pal2nal/).

2.4 Gene structure and conservative motif analysis

According to the GFF annotation file information of the whole genome of alfalfa, use the online website Gene Structure Display Server (GSDS) (http://gsds.cbi.pku.edu.cn/) to obtain the visualized picture of exon-intron [27]. The conserved motifs of alfalfa LIM protein were analyzed using Multiple Expectation Maximization for Motif Elicitation (MEME Suite) (http://meme-suite.org/), and the number of Motifs was set to 10 [28].

2.5 Analysis of promoter cis-acting elements

The 2000bp sequence upstream of the LIM gene was used as the promoter of the alfalfa LIM gene. The Plant CARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) database was used to predict and organize promoter cis-acting elements, and display them in the form of graphs.

2.6 Construction of protein interaction network diagram and three-dimensional structure prediction

The model plant Arabidopsis thaliana was used as a background to predict the protein network structure interaction of alfalfa LIM.STRING (http://STRINGdb.org/) software is used to construct a protein network structure diagram (confidence limit is 0.4) [29]. At the same time, we used SWISS-MODEL [30] (https://swissmodel.expasy.org/interactive) provided by the online software ExPaSy to model the three-dimensional structural homology of the protein spatial model of the MsLIM gene family of alfalfa.

2.7 Planting and stress treatment of alfalfa material

Select alfalfa seeds with the same shape and full grains, cut the seed coat and allowed to germinate in an artificial climate chamber. The culture temperature is 24°C, the relative humidity is 80%, and the photoperiod is 16/8 h. The seedlings 5 days after germination were transferred to a plastic box and cultured with MS solid medium under the same conditions. Four-week-old alfalfa with consistent growth status was subjected to stress treatment, and 100mmol/L NaCl and 4°C were used for salt stress and low-temperature stress, respectively. The treatment time was 0 h, 3 h, 6 h, 9 h, and 12 h. The mature leaves were sampled quickly and put into liquid nitrogen, 3 copies for each treatment, and then placed in -80°C cryopreservation for subsequent quantitative experiments by following protocol mentioned by.

2.8 RNA extraction and qRT-PCR detection

Use Shenggong’s UNIQ-10 column Trizol total RNA extraction kit to extract total RNA from each sample, and use Nano-Drop 2000 UV spectrophotometer to detect RNA quality and concentration. Use M-Mu LV first-strand cDNA synthesis kit reverse transcription RNA to obtain cDNA. After detecting the concentration, uniformly dilute to 100 ng/ul as the q RT-PCR reaction template. Specific primers were designed using Perl Primer v1.1.21 software [31]. The primers used in the experiment are shown in the S1 Table. Use Shenggong’s 2x SG Fast q PCR Master Mix kit, the reaction system is 20 μl, the PCR reaction program is 95°C pre-denaturation for 10 min, and then 40 cycles including 95°C denaturations for 15 s and 60°C annealings for 1 min, the instrument used For Applied Biosystems 7500, the experimental results are processed by the 2-△△Ct method [32]. Each experiment was repeated three times with independent RNA samples. Analysis of variance (ANOVA) of the relative expression level of each gene at different sampling points under each abiotic stress treatment was carried out following a generalized linear model using SPSS22 statistical software. Significant differences in mean values at different sampling times were determined by Tukey’s pairwise comparison tests, as indicated by different letters in the figures. The graphical representation of the experimental findings was produced by using Graphpad.

3 Results

3.1 Basic physical and chemical properties

Using the Arabidopsis LIM protein sequence (data in S2 Table) and the alfalfa genome sequence to perform BLAST comparison to extract the CDS (data in S1 File) and protein sequence (data in S2 File) of the alfalfa LIM gene, after de-redundancy and SMART identification [26], 21 alfalfa LIM genes were finally identified and named as MsLIM01-MsLIM21(data in S2 Table). The size of the coding sequences (CDSs) of the members of the MsLIM gene family is between 433–1113 nucleotides. The amino acid lengths of the proteins encoded by these genes vary from 154aa to 212aa, the molecular weight ranges from 17.14kDa to 23.73 kDa, and the PI variation range of the encoded protein from 6.20–9.16, the GRAVY value of all MsLIM proteins is less than 0, indicating that these proteins are hydrophilic. Prediction of the subcellular location showed that most MsLIM genes were located in the cytoplasm (16 MsLIM), and a few were located in mitochondria and peroxisomes.

3.2 The construction of phylogenetic tree

To explore the evolutionary relationship between the LIM gene family and predict its classification in alfalfa, the LIM conserved protein sequences of 9 different species (Arabidopsis thaliana, tobacco, barrel medic, soybean, tomato, quinoa, corn, rice, and wheat (data in S3 Table) are used for phylogenetic tree construction to study the evolutionary relationship between this family and other species LIM. According to the evolutionary relationship of LIM and the classification of Arabidopsis and rice, alfalfa LIM can be divided into four sub-families, namely αLIM, βLIM, γLIM, and δLIM (Fig 1). The subfamily δLIM includes 10 LIM proteins, which is the largest subfamily in LIM, while the numbers in the other three subfamilies are 4 (αLIM), 4 (βLIM), and 3 (γLIM). It can be seen from Fig 1 that the closest relationship with the alfalfa LIM gene family is the barrel medic and soybean, which belong to the legume family, indicating that alfalfa, barrel medic, and soybean are highly conserved during the evolution process, which can be passed through the studied species. The function of the LIM gene family is used to speculate the function of alfalfa LIM. At the same time, it was found that the LIM genes of monocotyledonous plants clustered into small branches separately, indicating that there are certain differences in the evolutionary process of monocotyledonous and dicotyledonous LIM gene families. Different evolutionary rates lead to the diversity of LIM gene functions, and the subfamily formed later produces new subfamily-specific functions after differentiation. Therefore, the functions of LIM genes in different subfamilies are different.

thumbnail
Fig 1. Phylogenetic tree showing the relatedness of alfalfa LIM proteins to that of Arabidopsis, tobacco, barrel medic, soybean, tomato, quinoa, maize, rice, and wheat.

A species acronym was added before each LIM protein name: At, Arabidopsis thaliana; Nt, Nicotiana tabacum; Mt, Medicago truncatula; Gm, Glycine max; Sl, Solanum lycopersicum; Cq, Chenopodium quinoa; Zm, Zea mays; Os, Oryza sativa; Ta, Triticum aestivum. (The information of the LIM family members is listed in the supporting information (S2 Table, Among them, the LIM amino acid sequence of Arabidopsis is from TAIR (http://www.arabiodpsis.org). The LIM amino acid sequences of tobacco, barrel medic, soybean, and tomato are from NCBI (https://www.ncbi.nlm.nih.gov/). The LIM amino acid sequences of quinoa, corn, rice, and wheat are from Phytozome (https://phytozome.jgi.doe.gov/pz/portal.html)).

https://doi.org/10.1371/journal.pone.0252213.g001

3.3 Chromosome location and gene duplication analysis

According to the annotation information in the alfalfa genome (data in S3 File), we found that 21 alfalfa LIM genes are distributed on 17 of the 32 chromosomes of alfalfa. There are 4 LIM genes on each of chromosomes 1, 2, and 4, which are located on chromosomes 1.1, 3.1, 7.1, 8.1, 1.2, 3.2, 4.2, 8.2, 1.4, 4.4, 7.4, and 8.4 respectively, and on chromosomes of 3 groups. There are 5 LIM genes located on chromosomes 1.3, 3.3, 4.3, 7.3, and 8.3, and no distribution of LIM genes is found on other chromosomes. In addition, except for the four homologous chromosomes of chromosome 8 which contain 2 LIM genes, all the other chromosomes contain 1 LIM gene. Since alfalfa is a 4-ploid homologous, the distribution of some homologous genes on homologous chromosomes is almost the same. Five or fewer genes located within 100 kb of the same chromosome are usually considered as tandem repeat genes. So there are no tandem repeat gene pairs in this study. The online tools Clustal Omega and PAL2NAL were used to analyze the MsLIM gene replication events, and a total of 8 replicated gene pairs were identified. These gene pairs are all fragment duplications, and their Ka/Ks values vary from 0.0903 to 0.2191 (Table 1), which are all less than 1, indicating that they are subject to purification selection during the evolution process.

3.4 Gene structure and basic motif analysis

To determine the distribution of exons and introns of MsLIM genes, this study used the online tool GSDS2.0 to analyze the structure of these genes. The 21 genes of alfalfa are divided into four subfamilies. The number of introns in all members of the αLIM, βLIM, and γLIM subfamily is 4, and the introns of MsLIM11, MsLIM12, and MsLIM13 in the δLIM subfamily are 3 (Fig 2). There are also 4 introns for all 7 members. Although the introns of the LIM gene are different, the members with the highest homology have similar gene structures, with the same number of exons and intron lengths. The motif owned by the gene family or shared by most members is likely to be an indispensable part of the family to perform important functions or structure. For example, it may be the binding site of some sequence-specific proteins (transcription factors) or involve important RNA initiation, termination, shearing, etc. of biological processes. Identifying the common motif of a gene family can understand the characteristics of the gene family so that these characteristics can be used to discover new members of the gene family. It can be seen from Fig 2 that although the conserved motifs of the 21 alfalfa LIM family genes are different in composition, they all contain motif 1 and motif 3 in the same order, with motif 3 first and motif 1 behind, so there is no duplication. MsLIM11, MsLIM12, MsLIM13 contain 5 motifs, MsLIM08, MsLIM09, MsLIM10, MsLIM18, MsLIM19, MsLIM20, and MsLIM21 contain 7 motifs, and the other 11 MsLIM genes all contain 6 motifs. In the alfalfa LIM gene family, 18 genes are containing motif2, motif4 and motif6. Only MsLIM08, MsLIM09, and MsLIM10 contain motif9. The analysis results indicate that the MsLIM gene family should all contain motif1 and motif3. MsLIM08, MsLIM09, and MsLIM10 are genes with specific functions in the MsLIM gene family. This prediction helps to discover new members of the MsLIM gene family.

thumbnail
Fig 2. Structural analysis of MsLIM genes in alfalfa.

(A) The distribution of motif in LIM proteins. (B) The exon-intron structure of LIM gene. (C) The amino acid composition of each motif (S4 Table).

https://doi.org/10.1371/journal.pone.0252213.g002

3.5 Analysis of promoter cis-acting elements

Promoter cis-acting elements are important binding regions for transcription initiation of transcription initiation factors and play an important role in regulating gene expression. To further analyze the possible biological functions of LIM, the 2.0 kb sequence upstream of the alfalfa LIM gene promoter (data in S4 Table) was used to predict cis-acting regulatory elements through the online website Plant CARE. It is predicted that there are many cis-regulatory elements related to transcription, cell cycle, development, hormone, and stress response in the promoter region of the alfalfa LIM gene, and some elements are specific to root specificity, leaf morphology, seed specificity, and meristem specificity (Fig 3). Sex and endosperm specificity are closely related. In addition, we also found many elements related to the hormone signaling pathway, such as methyl jasmonate (MeJA), abscisic acid (ABA), salicylic acid (SA), gibberellin (GA), and auxin (IAA). Among them, 21 LIMs have methyl jasmonate response elements (TGACG-motif and CGTCA-motif), and 37 LIMs have abscisic acid response elements ABRE, which indicates that most LIMs can participate in JA and ABA-mediated signaling pathways. The prediction also found that some elements can interact with various abiotic stresses (drought, salt, heat, cold, light, etc.). It needs to be pointed out that all LIMs contain light response-related components.

3.6 Protein interaction network diagram and three-dimensional structure prediction analysis

The use of protein network interactions to connect unknown functional proteins into protein interaction networks will help to further understand the protein biological functions enriched by protein network interactions and the dynamic network of regulation between various biological molecules in the cell. Therefore, this study used the model plant Arabidopsis as a background to predict the physicochemical properties of LIM protein and the potential interaction proteins related to its function (Fig 4). The expected edge number of our interaction network graph is 10, the average local clustering coefficient is 0.803, and the protein-protein interaction enrichment p-value is <0.00769, so we think the result is reasonable. We identified 3 LIM functioners and 10 potential interacting proteins directly related to the MsLIM protein. They are WLIM1, WLIM2a, PLIM2b, and AT1G01770, AT5G19230, AT5G19250, AT5G19240, DAR7, AT5G52950, AT5G06770, AT4G36860, AT3G06035, and EXPA13. Both WLIM1 and WLIM2a are GATA-type zinc finger transcription factor family proteins. WLIM1 contains 2 LIM domains and WLIM2a contains 1 LIM domain. Both of them can bind, stabilize and bind actin filaments, which indicate that it is involved in the construction and dynamics of actin cytoskeleton structure [9]. Its actin regulating activity is not regulated by pH and Ca2+. PLIM2b contains two LIM domains and is also a GATA-type zinc finger transcription factor family protein, which can bind to actin filaments and promote cross-linking into thick bundles, and has an actin stabilizing effect. It inhibits actin regulating activity at pH>6.8 but does not depend on Ca 2+. From this, we can infer that the functions of 21 LIMS transcription factors of alfalfa are similar to the functions of the above three Arabidopsis transcription factors. Among them, MsLIM01, MsLIM02, MsLIM03, MsLIM04, MsLIM11, MsLIM12, and MsLIM13 have similar functions to PLIM2b, MsLIM05, MsLIM06, MsLIM07, MsLIM08, MsLIM09, and MsLIM10 have similar functions to WLIM2a, MsLIM14, MsLIM15, MsLIM20, MsLIM16, MsLIM17, MsLIM19, and MsLIM21 has similar functions to WLIM1. In this study, the three-dimensional structural homology modeling of the amino acid sequences of 21 MsLIM gene family members of alfalfa was conducted, and the online software Swiss-Model analysis showed that the tertiary structure similarity of the amino acid sequences of 21 members was high. However, the tertiary structure is not exactly the same, which may be related to the length of α-helix, β-sheet, and the length of random coils. These similarities or differences may be the reason for their similar or different functions (Fig 5).

thumbnail
Fig 4. Protein-protein interactions network structure in Medicago sativa L.

The color scales represent the relative signal intensity scores.

https://doi.org/10.1371/journal.pone.0252213.g004

thumbnail
Fig 5. Tertiary structure of predicted LIM proteins in alfalfa.

https://doi.org/10.1371/journal.pone.0252213.g005

3.7 q-PCR analysis

To further determine the expression pattern of the alfalfa MsLIM gene under abiotic stress (low temperature and salt) treatment, q-PCR was used to quantitatively detect the MsLIM gene expression under low temperature and salt stress (S5 Table). In general, the MsLIM genes of the four alfalfa families showed different expression patterns, and the expression of these 21 genes could be induced by low temperature and salt treatment. Compared with the control (0h), under low-temperature stress (Fig 6A), the expression levels of MsLIM01, MsLIM02, MsLIM04, and MsLIM09 were significantly up-regulated at two-time points (9h and 12h). However, MsLIM03, MsLIM08, MsLIM10, MsLIM11, MsLIM12, MsLIM13, MsLIM15, MsLIM16, MsLIM17, MsLIM18, MsLIM19, MsLIM20, and MsLIM21 showed the highest expression levels at 3h induced by low temperature, indicating that the stress response of these genes was strong. The expression of MsLIM05, MsLIM06, and MsLIM07 reached the maximum after 6 hours of low-temperature induction. The expression of MsLIM14 at each time point of low-temperature stress was lower than that of the control (0h), indicating that MsLIM14 had a clear response to low-temperature stress. Under salt stress (Fig 6B), MsLIM01-06, MsLIM10, MsLIM12, MsLIM15, and MsLIM18-21 all reached their highest expression levels at 9h. Among them, the expression patterns of MsLIM02, MsLIM04, MsLIM05, MsLIM10, MsLIM12 and MsLIM20 in alfalfa are similar, and the expression level gradually increases from 0-9h to the highest point at 9h, and the expression level decreases in the subsequent experimental period. The expression of MsLIM08, MsLIM09, MsLIM13, MsLIM14, and MsLIM16 peaked at 2h, indicating that these genes had a strong response to salt stress. MsLIM11 has a clear response to salt stress. In addition, the expression level of MsLIM07 was lower than the control (0h) at 3h and 6h, but the expression level increased within 6-12h and reached the maximum at 12h.

thumbnail
Fig 6. qPCR expression analysis of LIM genes in Medicago sativa L.

q-PCR expression analysis of 21alfalfa LIM genes in response to a) cold and b) NaCl treatment at 0, 3, 6, 9, 12 h of treatment. The vertical bars represent the standard error of the means of three independent replicates. Different letters indicate mean values at different sampling points after Tukey’s pair-wise comparison test.

https://doi.org/10.1371/journal.pone.0252213.g006

4 Discussion

The LIM gene family has been studied in many plants, mainly focusing on structure, evolution, growth, and development, etc [33, 34]. However, in alfalfa, there is still a lack of comprehensive and systematic research on the LIM gene family. In 2020, the alfalfa genome is completed [35], which makes it possible to identify all LIM genes of alfalfa and analyze their functions.

In this study, 21 LIM genes were identified from the alfalfa genome. The number of LIM genes is significantly more than that of Arabidopsis (6), rice (6), and poplar (12), which may be related to the size of the genome. And multiplication has a certain relationship. The 21 alfalfa LIM gene family members are unevenly distributed on 32 chromosomes (Fig 7), and there is homology among different genes, indicating that the LIM gene duplication or recombination event occurred during the evolution of alfalfa. To understand the gene duplication events of the MsLIM gene in alfalfa, we analyzed the tandem and fragment duplication of the MsLIM gene and found that there are 8 pairs of fragment duplication genes in alfalfa (Table 2), and there is no tandem duplication. MsLIM causes its structure to change during the process of replication, which has a strong influence on the function of genes, indicating that fragment duplication plays an important role in the diversity of MsLIM protein [36].

thumbnail
Table 2. Ka/Ks analysis of the MsLIM gene pairs duplication.

https://doi.org/10.1371/journal.pone.0252213.t002

The phylogenetic tree results show that the closer the clustering relationship, the greater the possibility of having similar functions [37]. In the phylogenetic tree constructed in this study, LIM proteins can be divided into four subfamilies, and the αLIM, βLIM, γLIM and δLIM subfamilies have 4, 4, 3, and 10 genes respectively (Fig 1). In poplar, the LIM gene family can be divided into four sub-families, while in tomato, it is divided into two sub-families. It can be seen that the number of sub-families is different in different species. Through the subcellular localization analysis of alfalfa LIM protein, it was found that MsLIM01-MsLIM16 is located in the cytoplasm, MsLIM17 is located in the peroxisome, and MsLIM18-MsLIM21 is located in the mitochondrial matrix (Table 1). Studies have shown that the LIM protein located in the cytoplasm is mainly involved in the formation of the cytoskeleton as a regulator of actin [17]. Studies have also shown that the location of LIM proteins in different sub-cells may mean that their functions are also quite different. This conclusion has not yet been confirmed [14]. Gene classification, phylogeny, and subcellular localization analysis help to more accurately and conveniently study the functions of similar gene families. Gene structure analysis showed that most alfalfa MsLIM genes have a similar gene structure, generally having 4 introns and 4–5 exons (Fig 2). This result is similar to the results of previous studies [38], indicating that the family genes are evolutionarily conservative. The structure of individual genes in the alfalfa LIM family is different, and different exon-intron structures also contribute to the diversification of its gene functions [39]. MsLIM protein motifs are highly conserved in alfalfa. These conserved motifs determine the relative conservation of the MsLIM gene function. The difference in gene structure will inevitably lead to the diversity of protein and the difference of function, which is also the result of genetic evolution. In particular, some genes are missing some motifs. This motif difference may be one of the reasons for the functional diversity of MsLIM genes [40].

Analysis of promoter cis-acting elements showed that MSLIM may be involved in a variety of important biological processes, such as transcription, cell cycle, development, hormones, and biotic/abiotic stress (Fig 3). The MsLIM gene of alfalfa is closely related to hormone pathways and stress response. The main hormones are MEJA, ABA, GA, SA, and IAA. Common stresses include injury, drought, and low-temperature. Tobacco LIM participates in the regulation of cell proliferation and cell cycle processes through gene expression [20]. In addition to pollen-specific expression, the sunflower (Helianthus annuus L.) LIM gene also has specific expression in other organs [14]. Research on tomato and olive rape has also proved that LIM can perform multiple functions in tomato [23] and olive rape [41]. In addition, the LIM gene also plays an important role in mediating lignin metabolism in poplar (Populus alba L.) [42] and pear (Pyrus pyrifolia L.) [24]. It can be seen that LIM is a multifunctional gene family, which is essential for plant growth and development and stress defense response.

The q-PCR quantitative data show under low temperature and salt stress (Fig 6). MsLIM gene can participate in the response process of alfalfa to different stresses. When alfalfa is exposed to low temperature and salt stress, a large number of the MsLIM genes have a strong stress response after 3h of stress. After that, these MsLIM genes are all down-regulated, and some MsLIM genes only respond to one stress. MsLIM14 and MsLIM11 have an obvious response to low-temperature and salt stress. Most of the homologous genes showed similar expression patterns under the above two stresses, indicating that these genes may have similar physiological functions. Some alfalfa MsLIMs exhibit different expression patterns under stress, which may be because they participate in different defense mechanisms against the stress, and more experiments are needed to determine.

Transcription factors play an important role in the process of plant growth and development. By regulating one transcription factor, it is possible to regulate multiple genes with related functions, to achieve the purpose of improving plant traits. In this paper, by analyzing the alfalfa LIM transcription factor and studying its expression under two stresses i.e., low-temperature stress and salt stress, we have a preliminary understanding of the response of this gene in alfalfa after adversity stress, which lays a foundation for its functional verification and will be useful for future cultivation of alfalfa. New varieties of alfalfa and the development of the alfalfa industry are of great significance.

Supporting information

S2 Table. The MsLIM protein sequences information.

https://doi.org/10.1371/journal.pone.0252213.s002

(XLSX)

S3 Table. The LIM protein sequence information in this study.

https://doi.org/10.1371/journal.pone.0252213.s003

(XLSX)

S5 Table. Relative expression of 21 genes by qRT-PCR.

https://doi.org/10.1371/journal.pone.0252213.s005

(XLSX)

S1 File. Sequence coding for aminoacids in protein of alfalfa.

https://doi.org/10.1371/journal.pone.0252213.s006

(ZIP)

S2 File. Protein sequence file of alfalfa.

https://doi.org/10.1371/journal.pone.0252213.s007

(ZIP)

References

  1. 1. Sanchez-Garcıá I, Rabbits TH. The LIM domain: a new structural motif found in zinc-finger-like proteins. Trends Genet.1994;10(9):315–320. pmid:7974745.
  2. 2. Freyd G, Kim SK, Horvitz HR. Novel cysteine-rich motif and homeodomain in the product of the Caenorhabditis elegans cell lineage gene lin-II. Nature.1990;344(6269):876–879. pmid:1970421.
  3. 3. Karlsson O, Thor S, Norberg T, Ohlsson H, Edlund T. Insulin gene enhancer binding protein Isl-1 is a member of a novel class of proteins containing both a homeo- and a Cys-His domain. Nature. 1990;344(6269):879–882. pmid:1691825.
  4. 4. Way JC, Chalfie M. mec-3, a homeobox-containing gene that specifies differentiation of the touch receptor neurons in C. elegans. Cell. 1988;54(1):5–16. pmid:2898300.
  5. 5. Arnaud D, Déjardin A, Leplé JC, Lesage-Descauses MC, Pilate G. Genome-wide analysis of LIM gene family in Populus trichocarpa, Arabidopsis thaliana, and Oryza sativa. DNA Res.2007;14(3):103–116. pmid:17573466.
  6. 6. Baltz R, Domon C, Pillay DT, Steinmetz A. Characterization of a pollen-specific cDNA from sunflower encoding a zinc finger protein. Plant J. 1992;2(5):713–721. pmid:1302629.
  7. 7. Thomas C, Hoffmann C, Dieterle M, Van Troys M, Ampe C, Steinmetz A. Tobacco WLIM1 is a novel F-actin binding protein involved in actin cytoskeleton remodeling. Plant Cell. 2006;18(9):2194–2206. pmid:16905656.
  8. 8. Wang HJ, Wan AR, Jauh GY. An actin-binding protein, LlLIM1, mediates calcium and hydrogen regulation of actin dynamics in pollen tubes. Plant Physiol. 2008;147(4):1619–1636. pmid:18480376.
  9. 9. Papuga J, Hoffmann C, Dieterle M, Moes D, Moreau F, Tholl S, et al. Arabidopsis LIM proteins: a family of actin bundlers with distinct expression patterns and modes of regulation. Plant Cell. 2010;22(9):3034–3052. pmid:20817848.
  10. 10. Li Y, Jiang J, Li L, Wang XL, Wang NN, Li DD, et al. A cotton LIM domain-containing protein (GhWLIM5) is involved in bundling actin filaments. Plant Physiol Biochem. 2013;66:34–40. pmid:23466745.
  11. 11. Li L, Li Y, Wang NN, Li Y, Lu R, Li XB. Cotton LIM domain-containing protein GhPLIM1 is specifically expressed in anthers and participates in modulating F-actin. Plant Biol (Stuttg). 2015;17(2):528–534. pmid:25294521.
  12. 12. Khatun K, Robin AH, Park JI, Kim CK, Lim KB, Kim MB, et al. Genome-Wide Identification, Characterization and Expression Profiling of ADF Family Genes in Solanum lycopersicum L. Genes (Basel). 2016;7(10):79. pmid:27690110.
  13. 13. Yang R, Chen M, Sun JC, Yu Y, Min DH, Chen J, et al. Genome-Wide Analysis of LIM Family Genes in Foxtail Millet (Setaria italica L.) and Characterization of the Role of SiWLIM2b in Drought Tolerance. Int J Mol Sci. 2019;20(6):1303. pmid:30875867.
  14. 14. Eliasson A, Gass N, Mundel C, Baltz R, Krauter R, Evrard JL, et al. Molecular and expression analysis of a LIM protein gene family from flowering plants. Mol Gen Genet. 2000;264(3):257–267. pmid:11085265.
  15. 15. Zhao M, He L, Gu Y, Wang Y, Chen Q, He C. Genome-wide analyses of a plant-specific LIM-domain gene family implicate its evolutionary role in plant diversification. Genome Biol Evol. 2014;6(4):1000–1012. pmid:24723730.
  16. 16. Thaler JP, Lee SK, Jurata LW, Gill GN, Pfaff SL. LIM factor Lhx3 contributes to the specification of motor neuron and interneuron identity through cell-type-specific protein-protein interactions. Cell. 2002;110(2):237–249. pmid:12150931.
  17. 17. Tran TC, Singleton C, Fraley TS, Greenwood JA. Cysteine-rich protein 1 (CRP1) regulates actin filament bundling. BMC Cell Biol. 2005;6:45. pmid:16336664.
  18. 18. Brière C, Bordel AC, Barthou H, Jauneau A, Steinmetz A, Alibert G, et al. Is the LIM-domain protein HaWLIM1 associated with cortical microtubules in sunflower protoplasts? Plant Cell Physiol. 2003;44(10):1055–1063. pmid:14581630.
  19. 19. Kawaoka A, Kaothien P, Yoshida K, Endo S, Yamada K, Ebinuma H. Functional analysis of tobacco LIM protein Ntlim1 involved in lignin biosynthesis. Plant J. 2000;22(4):289–301. pmid:10849346.
  20. 20. Moes D, Gatti S, Hoffmann C, Dieterle M, Moreau F, Neumann K, et al. A LIM domain protein from tobacco involved in actin-bundling and histone gene transcription. Mol Plant. 2013;6(2):483–502. pmid:22930731.
  21. 21. Li Y, Zheng L, Corke F, Smith C, Bevan MW. Control of final seed and organ size by the DA1 gene family in Arabidopsis thaliana. Genes Dev. 2008;22(10):1331–1336. pmid:18483219.
  22. 22. Xu F, Zhu C, Cevik V, Johnson K, Liu Y, Liu K, et al. Autoimmunity conferred by chs3-2D relies on CSA1, its adjacent TNL-encoding neighbour. Sci Rep. 2015;5:8792. pmid:25740259.
  23. 23. Khatun K, Robin AHK, Park JI, Ahmed NU, Kim CK, Lim KB, et al. Genome-wide identification, characterization and expression profiling of LIM family genes in Solanum lycopersicum L. Plant Physiology and Biochemistry. 2016;108:177–190. pmid:27439220.
  24. 24. Cheng X, Li G, Muhammad A, Zhang JY, Jiang TS, Jin Q, et al. Molecular identification, phylogenomic characterization and expression patterns analysis of the LIM (LIN-11, Isl1 and MEC-3 domains) gene family in pear (Pyrus bretschneideri) reveal its potential role in lignin metabolism. Gene. 2019;686:237–249. pmid:30468911.
  25. 25. Shi SL, Nan LL, Smith KF. The Current Status, Problems, and Prospects of Alfalfa (Medicago sativa L.) Breeding in China. Agronomy.2017;7:1.
  26. 26. Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He YH, et al. TBtools—an integrative toolkit developed for interactive analyses of big biological data. Mol Plant. 2020;13(8):1194–1202. pmid:32585190.
  27. 27. Hu B, Jin J, Guo AY, Zhang H, Luo J, Gao G. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics. 2015;31(8):1296–7. pmid:25504850.
  28. 28. Bailey TL, Johnson J, Grant CE, Noble WS. The MEME Suite. Nucleic acids research. 2015;43(W1):W39–49. pmid:25953851.
  29. 29. Franceschini A, Szklarczyk D, Frankild S, Kuhn K, Simonovic M, Roth A, et al. STRING v9.1: Protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Research. 2013, 41(D1):D808–D815. pmid:23203871.
  30. 30. Schwede T, Kopp J, Guex N, Peitsch MC. SWISS-MODEL: an automated protein homology-modeling server. Nucleic Acids Research.2003;31(13):3381–3385. pmid:12824332.
  31. 31. Marshall OJ. PerlPrimer: cross-platform, graphical primer design for standard, bisulphite and real-time PCR. Bioinformatics. 2004;20(15):2471–2. pmid:15073005.
  32. 32. Livak KJ, Schmittgen TD. Analysis of Relative Gene Expression Data using Real-Time Quantitative PCR. Method. 2001;25(4):402–8. pmid:11846609.
  33. 33. Weiskirchen R, Günther K. The CRP/MLP/TLP family of LIM domain proteins: acting by connecting. Bioessays. 2003;25(2):152–62. pmid:12539241.
  34. 34. Srivastava M, Larroux C, Lu DR, Mohanty K, Chapman J, Degnan BM, et al. Early evolution of the LIM homeobox gene family. BMC Biol.2010;8: 4. pmid:20082688.
  35. 35. Chen H, Zeng Y, Yang Y, Huang L, Tang B, Zhang H, et al. Allele-aware chromosome-level genome assembly and efficient transgene-free genome editing for the autotetraploid cultivated alfalfa. Nat Commun. 2020;11(1):2494. pmid:32427850.
  36. 36. Christiaens JF, Van Mulders SE, Duitama J, Brown CA, Ghequire MG, De Meester L, et al. Functional divergence of gene duplicates through ectopic recombination. EMBO Rep. 2012;13(12):1145–1151. pmid:23070367.
  37. 37. Zhou Z, Zhang CX, Zhang LY, Wang Q, Li WX, Tian Y, et al. Bioinformatics and Expression Analysis of the LysM Gene Family in Apple. Scientia Agricultura Sinica. 2014;47 (13):2602–2612.
  38. 38. Yang Y, Li B, Hu W R, Zhang J B and Fan L. Evolution and Expression Analysis of the LIM Protein Gene Family in Cotton. Plant Physiology Journal.2015;51(12):2133–2142.
  39. 39. Mattick JS. Introns: evolution and function. Current Opinion in Genetics & Development.1994; 4(6): 823–831. pmid:7888751.
  40. 40. Blanc G, Wolfe KH. Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell. 2004;16(7):1667–78. pmid:15208399.
  41. 41. Park JI, Ahmed NU, Jung HJ, Arasan SK, Chung MY, Cho YG, et al. Identification and characterization of LIM gene family in Brassica rapa. BMC Genomics. 2014;15(1):641. pmid:25086651.
  42. 42. Arnaud D, Déjardin A, Leplé JC, Lesage-Descauses MC, Boizot N, Villar M, et al. Expression analysis of LIM gene family in poplar, toward an updated phylogenetic classification. BMC Res Notes. 2012;5:102. pmid:22339987.