Figures
Abstract
The divalent cation, Magnesium (Mg2+), is an essential mineral element for plant growth and development. Magnesium transporter (MGT) plays a vital role in maintaining Mg2 + homeostasis within plant cells. Although extensive research has been conducted in several crop species, no comprehensive study has yet been carried out on the MGT gene family in soybean (Glycine max L.), an economically valuable oil crop species. In this study, a total of 29 MGT genes encoding proteins (GmMGT) were identified in the soybean genome through comprehensive bioinformatics analysis. The GmMGT proteins were subsequently categorized into MRS2, CorA, and NIPA groups, with the majority predicted to be localized to the plasma membrane. Analyses of gene structures, conserved domains, and motifs indicated strong structural and functional similarities across the subgroups. Gene duplication, selection pressure, and synteny analyses demonstrated that GmMGT genes had undergone purifying selection, with only 12 segmentally duplicated gene pairs being identified. Gene ontology (GO) analysis revealed the involvement of all GmMGT genes in organism localization and ion transportation. Cis-acting regulatory element (CAREs) analysis identified 53 CAREs involved in light responsiveness, tissue-specific, phytohormone, and stress responses. Notably, nine major CAREs were abundantly found in the promoter regions of GLYMA.06G159100 and GLYMA.10G180200. Through the promoter analysis, we identified 81 miRNAs and 29 transcription factor families (TFFs), overrepresented under different stress conditions. RNA-seq data from 14 different soybean tissues demonstrated higher expression in flower tissue and lower levels in nodules with GLYMA.05G168200, GLYMA.10G180200, GLYMA.12G030100, GLYMA.12G168000, GLYMA.16G003900, GLYMA.20G210300 exhibiting elevated expression patterns. Transcriptomic analysis further revealed that, 10 GmMGTs were upregulated under biotic stress at 4h, 8h, 24h, and 48h after aphid infestation, with GLYMA.02G285600 and GLYMA.13G368400 being the most upregulated genes. Under abiotic stresses, GLYMA.03G159400, GLYMA.05G196600, and GLYMA.15G125900 were upregulated in response to dehydration, while GLYMA.04G005200, GLYMA.08G126600, GLYMA.10G180200 were induced at 1h, 6h, and 12h under salinity stress. These findings shed light on the versatile roles of GmMGT genes in plant growth, development and stress response, and they may serve as a valuable resource for further functional characterization of GmMGT genes within the soybean genome.
Citation: Akter N, Islam MSU, Zohra FT, Ratno RUR, Rahman MS, Rahman SM, et al. (2025) Genome-wide identification and functional characterization of magnesium transporter (MGT) gene family in soybean (Glycine max L.) and their expression profiles in response to aphid infestation, dehydration, and salt stresses. PLoS One 20(8): e0330440. https://doi.org/10.1371/journal.pone.0330440
Editor: Mojtaba Kordrostami, Nuclear Science and Technology Research Institute, IRAN, ISLAMIC REPUBLIC OF
Received: October 18, 2024; Accepted: July 31, 2025; Published: August 29, 2025
Copyright: © 2025 Akter et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Magnesium (Mg2+) plays significant roles in plant developmental processes, including nucleic acid synthesis, protein synthesis, membrane stability, catalytic activity, and photosynthesis. It activates more than 300 enzymes [1,2]. Despite being an essential nutrient, recent research has revealed reduced Mg2 + concentrations in soil used for cereal crop cultivation [3]. Heavy rainfall can result in acidic soils saturated with Ca2 + , Mn2 + , Al2 + , K + , and H + , reducing Mg2 + availability for plant roots [4]. Increasing Mg2 + might significantly mitigate the detrimental effects of these ions. Although Mg2 + performs essential functions in plants, excessive concentrations can also be harmful [5]. In plants, Mg channels regulate Mg2 + concentration in the tonoplast and facilitate accumulation in the mesophyll tissues through vacuoles [1]. Indeed, MGTs, comprising NIPA, CorA, and MRS2 groups of proteins, are essential for maintaining Mg2 + homeostasis in plants [6,7]. The secondary structure pattern in NIPA, Cor-A, and MRS2 proteins is highly similar and includes one acidic N-terminal periplasmic domain and two C-terminal transmembrane (TM) domains [8–12]. A conserved GMN tripeptide motif comprising the Gly-Met-Asn residues has been identified in the hydrophobic portion of the protein, which is expected to represent the catalytic region of the Mg2 + transporter [13]. However, slight sequence modifications were observed in maize [14] and rice [15]. Mutation analyses revealed that any specific amino acid alteration within the GMN motif leads to a loss of Mg2 + transport functionality by the MGT protein [15].
The MGT gene family has been extensively studied in various plant species, including arabidopsis (Arabidopsis thaliana) [16], maize (Zea mays) [14], rice (Oryza sativa L.) [15], sugarcane (Saccharum officinarum) [17], tomato (Solanum lycopersicum L.) [18], banana (Musa paradisiaca Linn.) [19], and pear (Pyrus communis) [20]. MGT proteins have primarily been found in arabidopsis among plant species, followed by rice [15,16]. Beyond sequence variations, different MGT genes in plants have shown diverse expression patterns. For example, in arabidopsis, AtMGT1 is responsible for Mg2 + absorption [16], while AtMGT4, AtMGT5, and AtMGT9 play role in pollen formation [21,22]. AtMGT10 is involved in Mg2 + transport within the chloroplast [23], and AtMGT2 and AtMGT3 regulate vacuolar Mg2 + levels [24]. In rice, OsMGT1 is responsible for Mg2 + uptake by roots [25]. ZmMGT12 in maize has been identified as a circadian rhythmic transporter responsive to light [26], while ZmMGT10 facilitates Mg2 + absorption from soil [27]. In Dendrobium officinale, DoMGT1 is expressed in root, leaf, and steam tissues [28]. MGTs enhance plant endurance to environmental stresses. For instance, OsMGT1 contributes to salt stress and aluminum stress tolerance in rice [25]. The diversity in expression profiles among MGT genes reflects their functional complexity in plants.
Soybean (G. max L.) is one of the most economically valuable legumes, native to East Asia, and a significant source of oil (16–21%) and protein (37–48%) for both animal and human consumption. Soybeans are also rich in nutrients, minerals, and other beneficial compounds, including isoflavones, which provide multiple health benefits such as preventing age-related disorders, heart diseases, cancer, and osteoporosis [29]. However, soybean productivity, growth, and seed quality can be adversely affected by various environmental factors, such as drought, cold, salinity, toxic metals, and nutrient deficiencies, causing yield loss of up to 70% [30]. Therefore, developing stress-resistant soybean varieties is essential for ensuring future food security. Modern soybean varieties require effective nutritional management and Mg2 + is one of the most critical macronutrients for soybean seed development, productivity, and oil production [31].
To date, the comprehensive study of soybean (G. max L.) MGT gene family and their expression analysis under biotic and abiotic stress conditions remain unknown. The availability of the soybean (William 82) genome sequence has provided an opportunity to identify and characterize important gene families [32]. However, identification and functional characterization of targeted gene family members in wet lab conditions are challenging due to limitations in research grants, laboratory facilities, technical expertise, time constraints, and ethical considerations. Thus, comprehensive bioinformatics approaches were conducted to identify and characterize the MGT genes in the soybean genome. These analyses included phylogenetic analysis, gene structure, domain and motif analysis, Ka/Ks analysis, collinearity analysis, synteny analysis, chromosomal distribution analysis, duplication analysis, subcellular localization analysis, CARE analysis, GO analysis, transcription factor (TF) analysis, RNA-seq analysis of different tissues and expression analysis under various biotic and abiotic stresses. Our findings may provide valuable insights into the detailed functional roles of soybean MGT genes and uncover potential strategies to address Mg2 + deficiency and enhance soybean seed production efficiency.
2. Materials and methods
2.1. Database search and retrieval of MGT protein sequences in the soybean genome
The MGT DNA-binding domains of arabidopsis (A. thaliana), rice (O. sativa L.), and chickpea (Cicer arietinum) were initially utilized to retrieve MGT gene-encoded proteins from the G. max L. genome via Phytozome v13 (https://phytozome-next.jgi.doe.gov/), using BLAST-P (Protein-basic local alignment search tool) [33]. The Hidden Markov Model (HMM) profiles for the Mg transporter domains, PF05653 and PF01544, were obtained from the Pfam database [34]. These HMM profiles were subsequently used to screen the soybean proteomes through the HMMER package v3.0. Proteins lacking the MGT conserved domain (PF05653 and PF01544) were removed from the candidate list (https://pfam.xfam.org/). The retrieved amino acid sequences were then analyzed for conserved MGT domains using the SMART (Simple Modular Architecture Research Tool, http://smart.embl-heidelberg.de/) [35] and the NCBI CDD (Conserved Domain Database) (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi/) [36].
2.2 Determination of physicochemical properties of GmMGT proteins
The ProtParam online tool (http://web.expasy.org/protparam/) [37] was used to predict basic physicochemical properties, including amino acid residues (aa), molecular weight (MW), isoelectric point (pI), instability index, aliphatic index, and Grand Average of Hydropathicity (GRAVY) of MGT proteins.
2.3. Phylogenetic analysis of MGT proteins of arabidopsis, rice, soybean, and chickpea
MGT protein sequences from A. thaliana, O. sativa L., G. max L., and C. arietinum (S1 Data) were retrieved from Phytozome v13 (https://phytozome.jgi.doe.gov/pz/portal.html/) and a phylogenetic tree was generated with MEGA11 software [38]. The ClustalW program was used to align the sequence with the Maximum Likelihood (ML) technique with default parameters [39], except for a 1000 bootstrap value and Pearson correction. The constructed tree was visualized with the Chiplot online tool (https://www.chiplot.online/).
2.4. Gene structure analysis
The coding sequences (CDS), genomic DNA sequences in FASTA format (S2 Data and S3 Data), and soybean gf3 files were obtained from Phytozome and submitted to the online tool Gene Structure Display Server, GSDS v2.0 (http://gsds.gao-lab.org/) [40] to evaluate the gene structure of MGTs.
2.5. Conserved domain and motif analysis
Conserved MGT domains were identified using the Pfam database (http://pfam.xfam.org), and the results were visualized using the TBtools software version-v2.0 [41]. The structural motifs of GmMGT protein sequences (S4 Data) were investigated using the Multiple EM for Motif Elicitation (MEME) (http://meme.nbcr.net/meme/), with default parameters and a fixed number of 20 motifs [42]. The motifs were visualized using MEME and the motif scanning method (MSA), available through its interface.
2.6. Gene duplication analysis and synonymous (Ks) and non-synonymous (Ka) substitution ratios calculation
The Ka/Ks Calculation tool (http://services.cbu.uib.no/tools/kaks) was used to estimate the substitution ratios of the soybean MGT gene family using duplicated MGT CDS sequences. The rates of molecular evolution were predicted using the Ka/Ks ratios for each pair of paralogous genes. The period of duplication and divergence (millions of years ago/MYA) was estimated using a synonymous mutation rate of substitutions per synonymous site per year as T = Ks/2λ × 10−6 (λ = 6.5 × 10−9) [43].
2.7. Collinearity and synteny analysis
To confirm gene duplication, collinearity and synteny analyses were conducted by using the Plant Genome Duplication Database (http://chibba.agtec.uga.edu/duplication/index/locus) [44]. Furthermore, TBtools version-v2.0 was used to illustrate the obtained soybean MGT collinear and syntenic pairs with arabidopsis, rice, and chickpea. The Circos program (http://circos.ca/) was used to illustrate the figures for collinearity and synteny analyses [45].
2.8. Analysis of chromosomal location
Chromosomal length and location of the 29 GmMGT genes were obtained from the Phytozome v13 database. Their genomic locations were mapped using MG2C (http://mg2c.iask.in/mg2c_v2.0/) [46].
2.9. Prediction of the subcellular localization and Gene Ontology (GO) analyses of GmMGTs
The subcellular localization of GmMGT proteins was predicted using the Wolf PSORT program (https://wolfpsort.hgc.jp/) [47]. To identify the functional relationships of the soybean MGT genes with various biological processes and molecular functional pathways, GO analysis was performed using the Plant Transcription Factor Database (PlantTFDB), PlantTFDB 4.0 (http://planttfdb.cbi.pku.edu.cn//) [48] and the Chiplot online tool (https://www.chiplot.online/) was used to visualize the analyzed data.
2.10. Cis-acting regulatory elements (CAREs) analysis of soybean MGT gene promoters
A 2000 bp sequences from 5′ untranslated region (5′ UTR) of GmMGT genomic sequences were extracted from the Phytozome v13. The cis-elements were analyzed using the PlantCARE online tool (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) [49] and verified in the PLACE databases (http://www.dna.afrc.go.jp/PLACE/) [50]. The obtained data were visualized using the R program (version R-4.2.1) [51].
2.11. Putative microRNA target site analysis
Soybean micro-RNA (miRNA) datasets were obtained from the plant microRNA encyclopedia (http://pmiren.com/) [52]. To identify miRNAs potentially targeting GmMGTs, the CDS sequences were examined for sequences complementary to miRNAs using the default parameters of psRNATarget (https://plantgrn.noble.org/psRNATarget/analysis?function=3) [53].
2.12. Transcription factor (TF) analysis of GmMGTs
To identify the important TFs associated with the identified GmMGT genes, PlantTFDB 4.0 was used. A regulatory network was constructed illustrating the interaction between MGT genes and predicted TFs using Cytoscape 3.9.1 [54].
2.13. Expression pattern analysis of GmMGT genes in different tissues
RNA-seq expression data for the identified MGT genes in various tissues were obtained from the RNA-Seq Atlas on SoyBase (https://www.soybase.org/) [55], and the BAR database (https://bar.utoronto.ca/). RPKM (reads per kilobase million) values on a log2 transformed scale were used to represent the expression value. The retrieved data were visualized using the R program (version R-4.2.1).
2.14. Transcriptomic analysis of GmMGT genes in response to soybean aphid infestation
Previously generated transcriptomic data in response to aphid infestation between susceptible and resistant soybean cultivars at five different sampling time points: 0h (no aphids), 4h, 8h, 24h, and 48h after aphid infestation were analyzed to study the transcriptomic profiling of identified GmMGT genes [56]. The fragments per kilobase million (FPKM) values were transformed to log2 format and compared with the control. The heatmap of transcriptomic data was visualized using the R program (version R-4.2.1).
2.15. Transcriptomic analysis of GmMGT genes under dehydration and salt stress
Transcriptomic data of GmMGTs in response to abiotic stresses (drought and salt) at three different time points (1h, 6h, and 12h) with control (0h) were used to analyze the specific expression patterns of GmMGT genes [57]. The FPKM values were transformed to log2 format and compared with the control. Heatmaps were generated using the R program (version R-4.2.1).
3. Results and discussions
Subsections 3.1 to 3.4 represent the general characterization of the studied genes; subsections 3.5 and 3.6 emphasize their evolutionary aspects; subsections 3.7 and 3.8 provide the information on gene localization; subsections 3.9 to 3.12 highlight their functional characteristics; and subsections 3.13 to 3.15 describe their RNA-Seq expression profiles.
3.1. Identification of GmMGT proteins and determination of their physicochemical properties
In this study, a total of 29 MGT gene-encoded proteins of soybean (G. max L.) were identified through genome-wide analysis using MGT protein sequences of arabidopsis, rice, and chickpea as query sequences. The predicted number of MGT genes varied across plant species, with some species possessing a higher number compared to soybean, while others had lower counts (Table 1).
The basic physicochemical properties of GmMGT proteins, including size, MW, pI, protein instability index, aliphatic index, and GRAVY, were analyzed (Table 2). The length of amino acid sequences varied from 127 aa (GLYMA.06G208700) to 556 aa (GLYMA.15G125900) with relative MW 14.05 kDa (GLYMA.06G208700) to 62.27 kDa (GLYMA.15G125900). The pI values varied from 4.66 (GLYMA.13G368400) to 9.91 (GLYMA.06G053100). It was found that 48.28% of GmMGT proteins were acidophilic (pI values < 7.00), in comparison to 61% of OsMGT in the rice genome showed an acidophilic nature [59]. The instability index indicated that 16 GmMGTs (55.17%) were stable proteins (instability index < 40). Moreover, the aliphatic index ranged from 86.10 (GLYMA.09G019600) to 135.24 (GLYMA.06G208700), suggesting that most GmMGT proteins were thermally stable. The GRAVY values ranged from −0.09 (GLYMA.02G117100) to 1.37 (GLYMA.06G208700), with 19 GmMGT proteins (65.42%) exhibiting positive values, suggesting their hydrophobic nature. A similar trend was observed in the rice genome, where 56.52% of OsMGT proteins are also hydrophobic [59]. This finding suggest that the MGT gene family has significantly expanded in soybeans compared to other species.
3.2. Phylogenetic analysis of arabidopsis, rice, soybean, and chickpea MGT proteins
To elucidate the evolutionary relationship of MGT protein family members, a phylogenetic tree was constructed using 102 MGT protein sequences comprising 29 GmMGTs, 23 AtMGTs, 23 OsMGTs, and 27 CaMGTs proteins in soybean, arabidopsis, rice, and chickpea (Fig 1). Based on the phylogenetic analysis, the MGT proteins were categorized into three groups (A, B, and C), including six subgroups (A1, A2, B1, B2, C1, and C2). Subgroups A1 comprised 7 MRS2-type GmMGT proteins, B1 contained 3 CorA-type GmMGT proteins. The subgroups C1 and C2 included 11 and 8 NIPA-type GmMGT proteins (S5 Data). Arabidopsis and rice MGT family proteins have previously been categorized into six clusters [59]. MGT proteins in durum wheat and camelina were clustered into nine categories, indicating increased diversity in MGT isoforms [11]. Four subgroups (except A2 and B2) contained MGT proteins from the above three species, whereas no MGT proteins of soybean were clustered in the A2 and B2 subgroups. Group C, consisting of 47 proteins, was the largest, followed by Group A (44 proteins), while Group B was the smallest, containing only 11 MGT proteins. Notably, the number of NIPA-type GmMGT clade-related proteins was much larger than the others, indicating a certain duplication event in this clade during the evolution process, which may be highly useful in soybean stress adaptation. The pronounced diversity of NIPA-type GmMGTs further implies that MRS2 and CorA proteins may have evolved from ancestral NIPA proteins.
MGT proteins were classified into 3 groups A for MRS2, B for CorA, and C for NIPA type MGTs including 6 subfamilies (A1, A2, B1, B2, C1, and C2) and marked by various colors. The GmMGTs are marked by a red star, the AtMGTs are marked by a blue triangle, the OsMGTs are marked by a pink rectangle, and the CaMGTs are marked by orange circle.
3.3. Gene structure analysis
The gene structures of GmMGTs were analyzed to determine their clade-specific characteristics and evolutionary significance. This analysis revealed that 25 GmMGT genes have both 5′ UTR and 3′ UTR, while 3 GmMGT genes (GLYMA.11G105300, GLYMA.18G091200, and GLYMA.09G019600) lack UTR sequences, and GLYMA.06G208700 has only 5′ UTR sequences (Fig 2; S6 Data and S7 Data). The number of exons ranged from 3 (GLYMA.06G208700) to 13 (GLYMA.02G068000 and GLYMA.16G149500), with GLYMA.02G117100 having the largest gene sequence of 8 kb long. The exon number in a gene may enhance the diversity of the coding gene by influencing post-transcriptional modifications such as alternative splicing [62,63]. Genes with fewer introns are known to respond quickly to stress and contribute significantly to environmental adaptation [64]. Interestingly, in most genes, the first exon and the last exon were relatively longer than the middle exons, implying a complex gene structure. These findings suggest that the GmMGT genes exhibit structural diversity, resulting in a wide range of gene activities.
In GmMGT gene structure, black lines represent introns, red-bolled lines represent exons and blue-bolled lines represent UTRs. The exon/intron structure of each GmMGT gene is proportionally displayed according to the scale at the bottom.
3.4. Conserved domain and motif analysis
For definitive categorization, GmMGT protein sequences were subjected to conserved domains, and motifs identification based on phylogenetic tree topologies. Three types of signature domains were determined, such as MRS2 in A1, A2, and CorA, along with silic_trans domain in B1, and NIPA, along with the Eama additional domain in subgroups C1 and C2 (Fig 3). The NIPA domain is responsible for various cellular processes, including magnesium homeostasis, metal detoxification, vesicle trafficking, plant development, and stress responsiveness. The CorA domain is involved in cell division, stress responses, and regulating cellular osmotic pressure [65]. The MRS2 domain facilitates the transport of Mg2+ from cytosol to the mitochondrial matrix to regulate mitochondrial metabolism and enhance inter-organellar communication [66].
The relative positions of each domain are shown in color boxes.
Twenty conserved motifs were determined, ranging in length from 11 aa to 50 aa in length (Fig 4 and Fig 5; Table 3). GmMGT proteins in group C had the highest number of motifs, including motif 1, motif 2, motif 3, motif 4, motif 5, motif 9, motif 10, motif 12, motif 13, motif 16, motif 17, and motif 19. Groups A and B had 7 and 5 conserved motifs, respectively. Motif 16 and motif 17 were found across all groups, suggesting their potential role in functional divergence. Although there were some variations, the motif numbers and the arrangements of start and end motifs were highly conserved within each subgroup, which align with the motif organizations in grape (V. vinifera L) [61] and orange (Citrus sinensis) [67]. Overall, the conserved domains and motifs act as markers of the GmMGT family and can be used to verify the candidate genes using various approaches.
Each colored box aligned on the right side of the figure indicates a specific motif. Different colors represented various motifs distributed in the domains of the proteins.
The motif logo illustrates the conserved amino acid positions across the aligned protein sequences. The overall height of each stack represents the information content of that position in the motif in bits. The height of the individual letters indicates the relative frequency of the corresponding amino acid at that position.
3.5. Ka/Ks analysis of GmMGT gene family
To determine the selective pressure and possible evolutionary relationship of the GmMGT genes, the Ka/Ks ratios of 12 segmental duplicated gene pairs of GmMGT were evaluated (Fig 6 and S8 Data). The substitution pressure of GmMGT duplicated gene pairs ranged from 0.05 (GLYMA.06G005000-GLYMA.04G005200) to 0.55 (GLYMA.13G368400-GLYMA.02G117100), revealing their purifying selection (Ka/Ks < 1). These segmentally duplicated genes were found to undergo strong environmental pressure and maintain stable protein functions [68]. This finding is consistent with other gene families in soybeans, such as SWEET [69], NHX [70], GRAS [71], CBF/DREB1 [72], Aux/IAA [73], IQD [74], and NOX [75]. However, positive selection (Ka/Ks > 1) was observed in the majority of the duplicated GmPOD40 genes [76]. As natural selection has no effect on the Ks values, the Ks values were utilized to determine the divergence period of the duplicated gene pairs using a clockwise rate of 6.5 × 10–9 mutations per synonymous site per year. Here, the divergence period varied from 2.33 MYA (Glyma.12G168000-Glyma.16G003900) to 36.51 MYA (Glyma.13G368400-Glyma.02G117100) with an average of 10.26 MYA. This analysis implies that the GmMGT genes primarily evolved through purifying selection which was a major driving force for GmMGT evolution at the protein level.
The number of nonsynonymous substitutions per nonsynonymous site is presented by Ka and the number of synonymous substitutions per synonymous site is represented by Ks. Ka/Ks represents the ratio of nonsynonymous (Ka) versus synonymous (Ks) changes.
3.6. Collinearity and synteny analyses
Collinearity and synteny analyses were conducted to determine the changes in evolution and replication events in the GmMGT gene family. Collinearity, a form of synteny, requires conserved gene order. A total of 12 GmMGT collinear gene pairs were identified, with the most collinear genes (4 genes) located on chromosome 2 (Fig 7). Conversely, the lowest number of collinear genes (only 1 gene) was identified on Chr4, Chr8, Chr10, Chr13, Chr15, Chr17, and Chr20. Synteny relationships were established across soybean, arabidopsis, rice, and chickpea, identifying 9, 7, 6, and 3 syntenic gene pairs, respectively (Fig 8). The number of synteny gene pairs between chickpea and arabidopsis, rice and arabidopsis, chickpea and soybean were 1 (Ca_11902-AT3G19640), 3 (OS01G0708300-AT5G11960, OS03G0684400-AT5G22830, and OS12G0566400-AT5G03345), and 6 (CA_12201-GLYMA.03G159400, CA_15254-GLYMA.02G285600, CA_04249-GLYMA.06G159100, CA_16924-GLYMA.05G196600, CA_11407-GLYMA.11G255400, CA_08544-GLYMA.05G153000), respectively. The highest number of syntenic genes (6 genes) was identified on Chr1 in rice. Interestingly, no syntenic gene pairs were identified in soybean with rice, and arabidopsis, implying a closer genetic relationship with chickpea than arabidopsis and rice. The significant synteny predicted between rice-maize and arabidopsis-rice MGT genes indicated close evolutionary relations due to chromosomal arrangement and duplication processes [15,77]. These collinearity and synteny analyses at the gene level can reveal complex evolutionary relations and chromosomal distribution of soybeans over genome evolution.
Chromosomes 1-20 are represented by various colored rectangles. Different colored lines indicate collinear blocks in the soybean genome, while colored lines GmMGT collinear gene pairs.
Red color blocks represent the syntenic blocks of soybean and light green, pink, and aqua color blocks represent the syntenic blocks of arabidopsis, rice, and chickpea respectively. The colored lines represent the syntenic gene pairs of MGTs.
3.7. Chromosomal location and gene duplication events for GmMGTs
The chromosomal mapping demonstrated that GmMGT genes was unevenly distributed over 17 of the 20 chromosomes within 60 Mb chromosomal length (Fig 9). The highest number of GmMGT genes were identified on chromosomes Chr02, and Chr06 (4 genes each), followed by Chr5 (3 genes), Chr11, Chr12, Chr14, and Chr16 (2 genes each). Only one gene was found on Chr03, Chr04, Chr08, Chr09, Chr10, Chr13, Chr15, Chr17, Chr18, and Chr20. In general, no relationship was found between chromosomal length and gene number,. The majority of these genes may not have evolved from a similar evolutionary subclade. The duplication analysis identified only 12 segmental duplications and no tandem duplication in the GmMGT genes, similar to the findings in banana [19] and maize [14]. Segmental duplication enhances the rapid adaptation of plant genomes to new environmental conditions [78]. Therefore, it is hypothesized that the expansion of the MGT genes through segmental duplication plays an essential role in soybean adaptation to environmental changes.
The chromosome number is indicated above each bar. The chromosome scale is in millions of bases (Mb), indicating chromosomal length on the left. Each gene pair of segmental duplication is indicated by a similar color.
3.8. Prediction of subcellular localization of GmMGT family members
Proteins are distributed throughout cells to participate in various physiological processes. According to web-based predictions, GmMGT protein signals were identified in the nucleus, mitochondria, cytoplasm, chloroplast, cytoskeleton, peroxisome, Golgi, vacuole, endoplasmic reticulum (E.R), plasma membrane (P.M), and extracellular organelles (Fig 10 and Fig 11; S9 Data). The maximum protein signals were detected on P.M (100%), followed by 75.86% (22 GmMGTs) in both E.R and vacuole, 41.38% (12 GmMGTs) in both Golgi and chloroplast. In banana and orange, maximum protein signals (70% MaMGT and 71.43% CsMGT) were also identified in the P.M [19,67]. Therefore, a better understanding of the proteins present in the plasma membrane could help in exploring techniques to enhance plants’ natural defenses [79]. GmMGT proteins are assumed to be important in glycosylation, transporting and storing nutrients and metabolites, as they were also predominantly found in E.R. and vacuole. In wheat, cytoplasmic proteins TaMGT3D.1 and TaMGT4D were upregulated after stripe rust pathogen infection [60]. Thus, GmMGT proteins predicted in the cytoplasm might be upregulated under biotic stresses. This study demonstrated that the GmMGT proteins are organelle-specific, with most GmMGTs functioning intracellularly in various microenvironments.
Sub-cellular localization of GmMGT proteins in different cellular organelles is represented by a heatmap. The name of each GmMGT protein is shown on the left side of the heatmap and the name of each corresponding cellular organelle is shown at the bottom of the heatmap. The color pink in the heatmap represents presence and the color white-absence. Here endoplasmic reticulum is represented by E.R., and the plasma membrane is represented by P.M.
The percentage of GmMGT protein that appeared in different cellular organelles is shown on the left side of the bar diagram. The cellular organelles including the nucleus, mitochondria, cytoplasm, chloroplast, cytoskeletal, peroxisome, Golgi, vacuole, E.R., P.M., and extracellular organelles, are presented at the bottom of the bar diagram.
3.9. Gene Ontology (GO) analysis of GmMGT genes
To specifically identify the gene functions, GO analysis was performed for 29 GmMGTs (Fig 12 and S10 Data). GO provides a comprehensive framework for categorizing genes based on their biological processes, molecular functions, and cellular components in which their respective gene products are active. The biological process is the most diverse and largest group among these three GO categories [80]. The GmMGT genes were annotated with two fundamental GO categories, including biological processes and cellular components. A total of 16 GO terms were associated with the biological processes category, and 13 with cellular components. The most significantly enriched biological process GO terms include transmembrane transport (GO:0055085), metal ion transport (GO:0030001), magnesium ion transmembrane transport (GO:1903830), and divalent inorganic cation transport (GO:0072511), with p-value: 1.00E-30. In terms of cellular components, the most enriched GO term was the early endosome (GO:0005769, p-value: 1.00E-30). Notably, GmMGTs were mostly related to membrane cellular components, including intrinsic component of membrane (GO:0031224, p-value: 1.20E-10), and integral component of membrane (GO:0016021, p-value: 8.30E-11). Previous studies have reported that the prevalent biological process GO category like the “cellular process” is enhanced during abiotic stresses [81]. This annotation analysis results indicate that GmMGT genes are enriched in the membrane and involved in various ion, and organism transportation.
GO enrichment analysis of GmMGT is represented by a diagram. GO terms of GmMGT genes associated with biological processes, and cellular components are denoted by blue and red triangles, respectively. The number of the corresponding GmMGT genes (19 to 29) is shown inside of the circle by different colors.
3.10. Cis-acting regulatory elements (CAREs) analysis of GmMGT gene promoters
CAREs are essential for regulating the expression and functions of genes. To evaluate the regulatory activity of GmMGT genes, a total of 53 CAREs were identified using 2000 bp of 5’ UTR (Fig 13; S11 Data and S12 Data). GmMGT contained 24 cis-acting elements related to light responsiveness, 14 to tissue-specific expression, 11 to phytohormone responsiveness, and 4 to stress responsiveness, revealing the potential of these genes during the diverse developmental and stimulus processes [58]. Nine major cis-elements were dominantly associated with four different functions, including light-responsive elements: Box 4, G-box, and glycosyltransferase 1 (GT1-motif); tissue-specific elements: AU-rich element (ARE); phytohormone responsive elements: abscisic acid-responsive element (ABRE), CGTCA-motif, and TGACG-motif; and stress responsive elements: long terminal repeat (LTR), and thymine-cytosine-rich repeats (TC-rich repeats).
The name of each GmMGT gene is represented on the left side of the heatmap. The number of putative CAREs of each GmMGT gene is shown on the right side of the heatmap and represented by four different colors (black = 0, green = 1-4, red = 5-8, pink = 9-12, and purple = 13-16). Light responsiveness, tissue-specific expression, phytohormone responsiveness, and stress responsiveness functions associated with CAREs of the corresponding genes are shown at the bottom of the heatmap and denoted by red, yellow, green, and blue-colored bolled lines, respectively.
The most abundant light-responsive element, Box 4, was present in all GmMGT genes, specifically enriched in GLYMA.09G019600 and GLYMA.11G255400. Moreover, G-box and GT1-motif were abundantly identified in 21 and 19 GmMGTs gene promoters, respectively. These four cis-elements are effective in gene regulation by light signals and photosynthetic mechanisms, ultimately stimulating genes involved in defense processes [82]. Tissue-specific elements were abundant in GLYMA.06G15900, GLYMA.08G126600, and GLYMA.10G126600. Furthermore, the highly detected tissue-specific element, ARE, and phytohormone-responsive element, ABRE, are involved in numerous biological functions linked to soybean growth and development. Phytohormone-responsive elements were identified abundantly in the GLYMA.02G068000 gene promoter, while stress-responsive CAREs were abundant in GLYMA.12G168000 and GLYMA.20G210300. LTR and TC-rich repeats were found abundantly in only 9 and 10 GmMGT genes, respectively. These two motifs regulate the expression of genes involved in stress response, allowing plants to grow and adapt to adverse conditions [83,84]. Similar findings have been reported in rice [59], orange [67], and banana [19]. Moreover, nine major cis-elements were identified in GLYMA.06G159100, and GLYMA.10G180200 promoter regions, and GLYMA.10G180200 contained the highest number of motifs (18) followed by GLYMA.02G285600 (17), GLYMA.06G159100 (17), indicating their higher efficiency in several biological processes, including soybean growth, development and stress resistance.
3.11. Putative microRNA target site analysis
A total of 81 different miRNAs were found, ranging from 20–24 nucleotides in length, targeting 27 out of the 29 GmMGT genes (S13 Data). In this analysis, the number of miRNAs per gene ranged from 1–28, and the lowest number of miRNAs was identified in GLYMA.03G159400, GLYMA.06G005000, and GLYMA.06G208700. GLYMA.17G227100 was targeted by the highest number (28 miRNAs) of miRNA, followed by GLYMA.14G097400 (20 miRNAs), GLYMA.02G117100 (16 miRNAs). Gma-miR169 was highly abundant (31), comprising 23 members (a-h, j-m, p, q, s, u-z, aa, and ab). Gma-miR169 targeted GLYMA.02G117100, GLYMA.10G180200, GLYMA.14G097400, and GLYMA.17G227100. Previous studies have demonstrated that the miR169 is susceptible to several abiotic conditions such as salinity, drought, and cold stresses in various plant species [85]. In Arabidopsis, miR169 regulates stress-induced flowering by suppressing the transcription factor AtNFYA [86]. This suggests that GmMGTs are involved in promoting vegetative growth as well as stress responses in soybeans.
3.12. Transcription factor (TF) analysis of GmMGT genes
TFs are proteins characterized by the presence of at least one DNA-binding domain and are responsible for regulating transcriptional pathways in plant cells. Based on network and sub-network analysis, we identified 29 TFFs in the promoter region of targeted 29 GmMGT genes in which the transcription factor binding sites (TFBSs) varied in number and distribution (Figs 14–16 and S14 Data). For instance, TFBFs were more numerous in GLYMA.08G126600, while only one TFBF was identified in GLYMA.11G105300. According to our findings, 7 TFFs, including ERF, MYB, NAC, C2H2, MIKC-MADS, LBD, and BBR-BPC were considered as major TFFs, comprising 74.22% of the total TFs.
The name of each GmMGT gene is shown on the left side of the heatmap. The major seven TFs are shown on the bottom of the heatmap and represented by seven different colors (ERF = Red, MYB = Yellow, NAC = Green, C2H2=Light blue, MIKC_MADS = Purple, LBD = Dark blue, BBR_BPC = Orange). Black colored boxes represent the absence and green colored boxes- the presence of TFs, aligned on the right side of the figure.
The name of each GmMGT gene is shown at the bottom of the bubble chart. The transcription factors of each GmMGT gene are shown on the left side of the bubble chart. The number of transcription factors binding sites (count = 0-200) of each GmMGT gene is shown on the right side of the bubble chart.
Regulatory network nodes of interaction between GmMGT genes and their regulatory transcription factors are represented by the TF network. The GmMGT genes and major TF families are shown on the right side of the figure with particular shapes.
Among the TFs, the ERF family contained the highest number of TFs (36), followed by MYB (25), NAC (17), C2H2 (11), MIKC-MADS (9), LBD (7), and BBR-BPC (2). ERF-associated TFBSs accounted for 52.9% of all binding sites and were overrepresented in 65.6% of GmMGT genes. The ERF regulates plant resistance to various abiotic stresses such as drought, salt, cold, and other adversities [87]. On the other hand, MYB is engaged in various biological activities, including circadian rhythm, stress responsiveness, cell identity, seed and floral development, and metabolic control in plants [88]. The NAC family has been demonstrated to influence a wide range of developmental processes, and the overexpression or knockdown of NAC gene expression impacts plant defense and signaling systems [89]. The abundance of various TFs in the GmMGT gene promoters suggests that GmMGT genes are complexly integrated in various signal-transduction mechanisms with a potential reservoir of functional diversity.
3.13. Expression pattern analysis of GmMGT genes in different tissues
To analyze the tissue-specific expressions of GmMGT genes, available RNA-seq data were collected from SoyBase and Bar databases. Expression profiles were visualized using two heatmaps encompassing 14 tissues, including four common types (leaf, flower, root, and nodule). According to SoyBase data, the maximum GmMGT genes exhibited different expression patterns in young leaf, flower, one cm pod, pod shell 10 DAF (Days after flowering), pod shell 14 DAF, seed 14 DAF, seed 21 DAF, and seed 35 DAF (Fig 17 and S15 Data). Additionally, GLYMA.05G168200, GLYMA.05G196600, GLYMA.06G159100, GLYMA.06G208700, GLYMA.10G180200, GLYMA.12G030100, GLYMA.12G168000, GLYMA.15G125900, GLYMA.16G003900, GLYMA.16G149500, GLYMA.20G210300 were differentially expressed in each tissue type. Conversely, no expression was detected for GLYMA.06G208700, GLYMA.13G368400, and GLYMA.18G091200. Notably, GLYMA.12G030100, GLYMA.12G168000, GLYMA.16G003900, and GLYMA.20G210300 exhibited elevated expression patterns in various tissues. Particularly, GLYMA.16G003900 was highly induced in young leaf, flower, one cm pod, pod shell 10 DAF, pod shell 14 DAF, seed 10 DAF, and root. The induced expression of 25 GmMGTs was observed in flower tissue, with GLYMA.11G255400 having the highest expression profiles among them. In contrast, maximum genes exhibited the lowest expression pattern in nodules, except for GLYMA.02G285600 which expressed highly in nodules.
RPKM normalized values of expressed genes were log2-transformed and visualized as a heatmap. The name of each GmMGT gene is shown on the left side and the tissue types are represented at the bottom of the heatmap based on data retrieved from the SoyBase database. The expression values were mapped using a color gradient from white to red (count = 0-30) shown on the right side of the heatmap.
BAR database analysis revealed that most GmMGT genes were expressed in flower, root hair 12 HAI, and in the shoot apical meristem (SAM) tissues in Soybean (Fig 18 and S16 Data). GLYMA.02G280800, GLYMA.02G068000, GLYMA.02G117100, GLYMA.03G159400, GLYMA.05G168200, GLYMA.05G196600, GLYMA.06G053100, GLYMA.08G126600, GLYMA.10G180200, GLYMA.12G030100, GLYMA.12G168000, GLYMA.12G030100, GLYMA.14G033700, GLYMA.15G125900, GLYMA.16G003900, GLYMA.16G149500, and GLYMA.20G210300 were expressed differentially in each tissues whereas no expression profile was identified in GLYMA.06G208700, and GLYMA.18G091200. Six GmMGTs, including GLYMA.05G168200, GLYMA.10G180200, GLYMA.12G030100, GLYMA.12G168000, GLYMA.16G003900, and GLYMA.20G210300, showed elevated expression in different types of tissues. GLYMA.16G003900 exhibited higher expression patterns in flower, green pods, leaf, roots, root hair 48 HAI, root hair 48 HAI stripped, root tip, root hair 48 HAI mock, and in SAM, being selected as a highly expressed candidate gene. The highest expression profiles of the largest number of genes (25 GmMGTs) were detected in flower tissue. Nevertheless, all of the GmMGT genes showed the lowest expression pattern in nodules, while GLYMA.02G285600 was highly induced in nodules compared to others. Based on the findings from both databases, the majority of genes showed higher expression in flower tissues and lower expression in nodules. This variation in expression patterns indicates that GmMGT genes may have diverse roles, with tissue-specific regulation. Similar variations in expression patterns have been observed in arabidopsis [90] and rice [91], suggesting that the expression of GmMGT genes during different developmental stages is highly tissue-specific.
RPKM normalized values of expressed genes were log2-transformed and visualized as a heatmap. The name of each GmMGT gene is shown on the left side and the tissue types are represented at the bottom of the heatmap based on data retrieved from the Bar uTorrent database. The expression values were mapped using a color gradient from white to red (count = 0-75) shown on the right side of the heatmap. The abbreviations “HAI” and “SAM” on the tissue label represent “Hours after inoculation” and “Shoot apical meristem”, respectively.
3.14. Transcriptomic analysis of GmMGT genes in response to soybean aphid infestation
In this study, previously observed gene expression levels (log2 fold change) between susceptible and resistant GmMGTs were further studied using top leaves of soybean cultivars, sampled at five different sampling times: 0h (no aphids), 4h, 8h, 24h, and 48h with 15 adult aphids [56] (Fig 19 and S17 Data). NIPA-type GLYMA.11G255400 and GLYMA.18G091200 showed no expression at any time point in both susceptible and resistant soybean genotypes. In susceptible varieties, only MRS2-type GLYMA.03G159400 and GLYMA.05G168200, and NIPA-type GLYMA.05G153000 were expressed at 4h after infestation. In resistant varieties, 17 GmMGTs were upregulated at 4h, while 26 GmMGTs were upregulated at 8h after infestation. Among them, 27 GmMGT genes were induced at 24h, but only 10 GmMGTs remained upregulated after 48h of aphid infestation. Notably, 10 GmMGT genes (NIPA-type GLYMA.02G068000, GLYMA.02G280800, MRS2-type GLYMA.03G159400, GLYMA.05G168200, GLYMA.08G126600, GLYMA.10G180200, GLYMA.13G368400, GLYMA.20G210300 and CorA-type GLYMA.02G285600, GLYMA.15G125900) exhibited higher expression level under all four time point (4h, 8h, 14h, 48h) in resistant genotype. The expression pattern of MRS2-type GLYMA.13G368400 was highest after 24h and 48h of infestation, while CorA-type GLYMA.02G285600 exhibited an induced expression pattern in 8h, 24h, and 48h after aphid infestation in the resistant soybean genotype. Aphids have a major impact on the photosystem pathways of susceptible soybean cultivars, which also explains the adverse effects of aphid infestation on soybean yield. Differentially expressed gene clusters have been identified in various aphid-plant systems studies, such as soybean [56] and sugarcane [92], which provide insights into genes and processes responsible for resistance to aphids. An aphid-resistant line of sorghum (Sorghum bicolor) displayed greater upregulation of lipid and protein-regulating genes, cellular catabolic processes, and enhanced transcriptional initiation after sugarcane aphid infestation than the susceptible line, supporting our findings [92]. Therefore, the upregulated genes may serve as potential candidates for conferring aphid resistance provided by the resistant line of the soybean MGT gene family.
Clustering of GmMGT genes from three clades (NIPA, CorA, and MRS2) are shown according to their expression profiles between susceptible and resistant soybean cultivars in different periods (4h, 8h, 24h, 48h) with control (0h; no aphid). The expression values were mapped using a color gradient from low to high (green to red color) shown on the right side of the heatmap. NIPA, MRS2, and CorA type MGT genes were denoted by light blue, yellow, and red color respectively. Separators have been used to distinguish between different conditions on the heatmaps.
3.15. Transcriptomic analysis of GmMGT genes under dehydration and salt stress
To identify GmMGT genes responding to abiotic stresses, RNA-seq data were analyzed at three different time points (1h, 6h, 12h) with control (0h) under dehydration and salt stresses [57] (Fig 20 and S18 Data). Among 29 GmMGTs, NIPA-type GLYMA.02G068000, GLYMA.05G196600, GLYMA.06G053100, MRS2-type GLYMA.03G159400, and CorA-type GLYMA.15G125900 exhibited elevated expression in response to dehydration. GmMGTs are engaged in both early and late responses to abiotic stresses. For instance, the NIPA-type GLYMA.05G196600 and MRS2-type GLYMA.03G159400 consistently upregulated at all three time points, while CorA-type GLYMA.15G0125900 was upregulated after 1h of treatment. These genes could be positive regulators for enhancing dehydration stress endurance in soybeans. It has been observed that Mg2 + deficiency may trigger oxidative stress and stomatal closure, which can reduce plant gene transpiration [93]. Moreover, MGTs can modify water usage efficiency and drought tolerance by regulating antioxidant enzymes, photosynthesis, and stomatal closure [4,94].
Clustering of GmMGT genes from three GmMGT clades (NIPA, CorA, and MRS2) are shown according to their expression profiles under dehydration and salt stress treatments in different periods (1h, 6h, 12h) with control (0h). The expression values were mapped using a color gradient from low to high (green to red color) shown on the right side of the heatmap. NIPA, MRS2, and CorA type MGT genes were denoted by orange, light blue, and red color, respectively. Separators have been used to distinguish between different conditions on the heatmaps.
Under salt stress, NIPA-type GLYMA.02G068000, GLYMA.04G005200, GLYMA.06G053100, GLYMA.12G030100, MRS2-type GLYMA.08G126600, GLYMA.10G180200 were highly induced. The expression levels of NIPA-type GLYMA.04G005200 and MRS2-type GLYMA.08G126600, GLYMA.10G180200 showed the highest expression levels after 12h of salt stress treatment, considered as strong candidates under salt stress conditions. Notably, CorA-type GmMGT genes showed no significant differences in expression under salt stress, while NIPA genes were more expressed than MRS2 genes. The OsMGT1 gene may significantly boost OsHKT1 activity, which decreases excess Na+ in rice tissues and enhances resistance under salinity stress [93]. In summary, this transcriptomic study indicates that GmMGT genes with differential expression levels under various abiotic stresses may be involved in the Mg2 + transporting system and many cellular pathways.
5. Conclusion
In this study, 29 GmMGT genes were identified and characterized in the soybean genome for the first time, using comprehensive bioinformatics tools. These genes were found to be unevenly distributed across 17 chromosomes, with 12 segmental duplicated gene pairs. Segmental duplication was found to be the main driving force for the expansion of GmMGT genes. The structural variabilities in gene sequences confirm the diverse roles of these genes in plant developmental processes. Notably, most of the GmMGT genes in the same subgroups exhibited highly similar gene structure, conserved domains, and motif organization. The collinearity and synteny analyses lay a foundation for understanding crop evolution. Furthermore, our exploration of GO terms associated with various biological processes suggested that the majority of GmMGT genes were involved in the ion transportation process. The abundance of light-responsive cis-elements in GmMGT gene promoters indicates their potential involvement in photosynthesis. The presence of TFs and miRNA molecules in the promoter region indicates the signal transduction process of GmMGTs involved in response to various environmental stresses. At the same time, the expression profiles in various tissues and stress conditions confirm the overlapping functions of these genes. Highly induced genes, GLYMA.02G285600, and GLYMA.13G368400 in aphid infestation, GLYMA.03G159400, and GLYMA.05G196600, GLYMA.15G0125900 in dehydration, GLYMA.04G005200, GLYMA.08G126600, and GLYMA.10G180200 in salt stress, can be used as potential targets for developing stress-resistant soybean cultivars. This study provides the theoretical groundwork for further functional exploration of GmMGT genes in stress responses and plant developmental processes.
Supporting information
S1 Data. Protein sequences of AtMGT, OsMGT, GmMGT, and CaMGT.
https://doi.org/10.1371/journal.pone.0330440.s001
(S1 Data.DOCX)
S3 Data. Genomic sequences of GmMGT genes.
https://doi.org/10.1371/journal.pone.0330440.s003
(S3 Data.DOCX)
S4 Data. Protein sequences of GmMGT genes.
https://doi.org/10.1371/journal.pone.0330440.s004
(S4 Data.DOCX)
S5 Data. Distribution of GmMGT genes among groups based on phylogenetic analysis with Arabidopsis, rice, and chickpea MGT genes.
https://doi.org/10.1371/journal.pone.0330440.s005
(S5 Data.DOCX)
S6 Data. In silico predicted the number of introns and exons in GmMGT genes.
https://doi.org/10.1371/journal.pone.0330440.s006
(S6 Data.DOCX)
S7 Data. Exon-intron sequences of GmMGT genes to identify gene structure.
https://doi.org/10.1371/journal.pone.0330440.s007
(S7 Data.XLSX)
S8 Data. Time of gene duplication estimatsed for different paralogous pairs of GmMGT genes based on Ka and Ks values to demonstrate gene evolution.
https://doi.org/10.1371/journal.pone.0330440.s008
(S8 Data.XLSX)
S9 Data. Predicted protein signals of GmMGTs in subcellular organelles.
https://doi.org/10.1371/journal.pone.0330440.s009
(S9 Data.XLSX)
S10 Data. The details GO analysis of the predicted GmMGT genes was performed using the Plant TFDB.
https://doi.org/10.1371/journal.pone.0330440.s010
(S10 Data.XLSX)
S11 Data. Promoter region of GmMGT genes.
https://doi.org/10.1371/journal.pone.0330440.s011
(S11 Data.DOCX)
S12 Data. The predicted CAREs of the upstream promoter region (2.0 kb genomic sequences) of GmMGT gene family members are predicted to identify the gene response to light, tissue, hormone, and stress.
https://doi.org/10.1371/journal.pone.0330440.s012
(S12 Data.XLSX)
S13 Data.
miRNA targeted prediction of GmMGTs. The miRNA data was downloaded from the plant micro RNA encyclopedia (http://pmiren.com/).
https://doi.org/10.1371/journal.pone.0330440.s013
(S13 Data.DOCX)
S14 Data. Identified main 7 TF families associated with the regulation of identified GmMGT genes in G. max genome.
https://doi.org/10.1371/journal.pone.0330440.s014
(S14 Data.XLSX)
S15 Data. Tissue-specific expression profiles of GmMGT genes retrieved from SoyBase database.
https://doi.org/10.1371/journal.pone.0330440.s015
(S15 Data.XLSX)
S16 Data. Tissue-specific expression profiles of GmMGT genes retrieved from Bar uTorrent database.
https://doi.org/10.1371/journal.pone.0330440.s016
(S16 Data.XLSX)
S17 Data. Expression profiles of GmMGT genes under biotic stress like Aphid infestation.
https://doi.org/10.1371/journal.pone.0330440.s017
(S17 Data.XLSX)
S18 Data. Expression profiles of GmMGT genes under abiotic stress like dehydration and salt stresses.
https://doi.org/10.1371/journal.pone.0330440.s018
(S18 Data.XLSX)
Acknowledgments
The authors are very grateful to the Laboratory of Functional Genomics and Proteomics, Department of Genetic Engineering and Biotechnology, Faculty of Biological Science and Technology, Jashore University of Science and Technology, Jashore 7408, Bangladesh for providing the opportunity to conduct this research. The authors wish to thank Mr. Tanzir Ahmed, Assistant Professor, Department of English, Faculty of Arts and Social Science, Jashore University of Science and Technology, Jashore 7408, Bangladesh for extensively editing the manuscript to avoid grammatical errors. The authors acknowledge and appreciate the reviewers and the members of the editorial panel for their valuable comments and critical suggestions for improving the quality of this manuscript.
References
- 1. Chen ZC, Peng WT, Li J, Liao H. Functional dissection and transport mechanism of magnesium in plants. Semin Cell Dev Biol. 2018;74:142–52. pmid:28822768
- 2. Franken GAC, Huynen MA, Martínez-Cruz LA, Bindels RJM, de Baaij JHF. Structural and functional comparison of magnesium transporters throughout evolution. Cell Mol Life Sci. 2022;79(8):418. pmid:35819535
- 3. Senbayram M, Gransee A, Wahle V, Thiel H. Role of magnesium fertilisers in agriculture: plant–soil continuum. Crop Pasture Sci. 2015;66(12):1219.
- 4. Niu Y, Jin G, Zhang YS. Root development under control of magnesium availability. Plant Signal Behav. 2014;9(9):e29720. pmid:25763706
- 5. Guo W, Chen S, Hussain N, Cong Y, Liang Z, Chen K. Magnesium stress signaling in plant: just a beginning. Plant Signal Behav. 2015;10(3):e992287. pmid:25806908
- 6. Kobayashi NI, Tanoi K. Critical Issues in the Study of Magnesium Transport Systems and Magnesium Deficiency Symptoms in Plants. Int J Mol Sci. 2015;16(9):23076–93. pmid:26404266
- 7. Shaul O. Magnesium transport and function in plants: the tip of the iceberg. Biometals. 2002;15(3):309–23. pmid:12206396
- 8. Kehres DG, Lawyer CH, Maguire ME. The CorA magnesium transporter gene family. Microb Comp Genomics. 1998;3(3):151–69. pmid:9775386
- 9. Knoop V, Groth-Malonek M, Gebert M, Eifler K, Weyand K. Transport of magnesium and other divalent cations: evolution of the 2-TM-GxN proteins in the MIT superfamily. Mol Genet Genomics. 2005;274(3):205–16. pmid:16179994
- 10. Bui DM, Gregan J, Jarosch E, Ragnini A, Schweyen RJ. The bacterial magnesium transporter CorA can functionally substitute for its putative homologue Mrs2p in the yeast inner mitochondrial membrane. J Biol Chem. 1999;274(29):20438–43. pmid:10400670
- 11. Faraji S, Ahmadizadeh M, Heidari P. Genome-wide comparative analysis of Mg transporter gene family between Triticum turgidum and Camelina sativa. Biometals. 2021;34(3):639–60. pmid:33783656
- 12. Svidová S, Sponder G, Schweyen RJ, Djinović-Carugo K. Functional analysis of the conserved hydrophobic gate region of the magnesium transporter CorA. Biochim Biophys Acta. 2011;1808(6):1587–91. pmid:21074514
- 13. Smith RL, Banks JL, Snavely MD, Maguire ME. Sequence and topology of the CorA magnesium transport systems of Salmonella typhimurium and Escherichia coli. Identification of a new class of transport protein. J Biol Chem. 1993;268(19):14071–80. pmid:8314774
- 14. Li H, Du H, Huang K, Chen X, Liu T, Gao S, et al. Identification, and Functional and Expression Analyses of the CorA/MRS2/MGT-Type Magnesium Transporter Family in Maize. Plant Cell Physiol. 2016;57(6):1153–68. pmid:27084594
- 15. Saito T, Kobayashi NI, Tanoi K, Iwata N, Suzuki H, Iwata R, et al. Expression and functional analysis of the CorA-MRS2-ALR-type magnesium transporter family in rice. Plant Cell Physiol. 2013;54(10):1673–83. pmid:23926064
- 16. Li L, Tutone AF, Drummond RS, Gardner RC, Luan S. A novel family of magnesium transport genes in Arabidopsis. Plant Cell. 2001;13(12):2761–75. pmid:11752386
- 17. Wang Y, Hua X, Xu J, Chen Z, Fan T, Zeng Z, et al. Comparative genomics revealed the gene evolution and functional divergence of magnesium transporter families in Saccharum. BMC Genomics. 2019;20(1):83. pmid:30678642
- 18. Regon P, Chowra U, Awasthi JP, Borgohain P, Panda SK. Genome-wide analysis of magnesium transporter genes in Solanum lycopersicum. Comput Biol Chem. 2019;80:498–511. pmid:31176140
- 19. Tong M, Liu W, He H, Hu H, Ding Y, Li X, et al. Identification and functional analysis of the CorA/MGT/MRS2-type magnesium transporter in banana. PLoS One. 2020;15(10):e0239058. pmid:33001980
- 20. Zhao Z, Wang P, Jiao H, Tang C, Liu X, Jing Y, et al. Phylogenetic and expression analysis of the magnesium transporter family in pear, and functional verification ofPbrMGT7in pear pollen. The Journal of Horticultural Science and Biotechnology. 2017;93(1):51–63.
- 21. Chen J, Li L, Liu Z, Yuan Y, Guo L, Mao D, et al. Magnesium transporter AtMGT9 is essential for pollen development in Arabidopsis. Cell Res. 2009;19(7):887–98. pmid:19436262
- 22. Li L-G, Sokolov LN, Yang Y-H, Li D-P, Ting J, Pandy GK, et al. A mitochondrial magnesium transporter functions in Arabidopsis pollen development. Mol Plant. 2008;1(4):675–85. pmid:19825572
- 23. Schock I, Gregan J, Steinhauser S, Schweyen R, Brennicke A, Knoop V. A member of a novel Arabidopsis thaliana gene family of candidate Mg2+ ion transporters complements a yeast mitochondrial group II intron-splicing mutant. Plant J. 2000;24(4):489–501. pmid:11115130
- 24. Lenz H, Dombinov V, Dreistein J, Reinhard MR, Gebert M, Knoop V. Magnesium deficiency phenotypes upon multiple knockout of Arabidopsis thaliana MRS2 clade B genes can be ameliorated by concomitantly reduced calcium supply. Plant Cell Physiol. 2013;54(7):1118–31. pmid:23628997
- 25. Chen ZC, Yamaji N, Motoyama R, Nagamura Y, Ma JF. Up-regulation of a magnesium transporter gene OsMGT1 is required for conferring aluminum tolerance in rice. Plant Physiol. 2012;159(4):1624–33. pmid:22732245
- 26. Li H, Liu C, Zhou L, Zhao Z, Li Y, Qu M, et al. Molecular and functional characterization of the magnesium transporter gene ZmMGT12 in maize. Gene. 2018;665:167–73. pmid:29702186
- 27. Li H, Wang N, Ding J, Liu C, Du H, Huang K, et al. The maize CorA/MRS2/MGT-type Mg transporter, ZmMGT10, responses to magnesium deficiency and confers low magnesium tolerance in transgenic Arabidopsis. Plant Mol Biol. 2017;95(3):269–78. pmid:28871377
- 28. Zhang GJCT. Cloning and expression analysis of a magnesium transporter gene in Dendrobium officinale. Drugs H. 2014;:3443–8.
- 29. Bagale S. Nutrient Management for Soybean Crops. International Journal of Agronomy. 2021;2021:1–10.
- 30. Kumar S. Abiotic Stresses and Their Effects on Plant Growth, Yield and Nutritional Quality of Agricultural Produce. IJFSA. 2020;4(4):367–78.
- 31. Gerendás J, Führs H. The significance of magnesium for crop quality. Plant Soil. 2013;368(1–2):101–28.
- 32. Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, et al. Genome sequence of the palaeopolyploid soybean. Nature. 2010;463(7278):178–83. pmid:20075913
- 33. Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012;40(Database issue):D1178-86. pmid:22110026
- 34. Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, et al. Pfam: the protein families database. Nucleic Acids Res. 2014;42(Database issue):D222-30. pmid:24288371
- 35. Letunic I, Khedkar S, Bork P. SMART: recent updates, new developments and status in 2020. Nucleic Acids Res. 2021;49(D1):D458–60. pmid:33104802
- 36. Lu S, Wang J, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, et al. CDD/SPARCLE: the conserved domain database in 2020. Nucleic Acids Res. 2020;48(D1):D265–8. pmid:31777944
- 37.
Gasteiger E, Hoogland C, Gattiker A, Duvaud S, Wilkins MR, Appel RD, et al. Protein identification and analysis tools on the ExPASy server: Springer; 2005.
- 38. Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol Biol Evol. 2021;38(7):3022–7. pmid:33892491
- 39. Thompson JD, Gibson TJ, Higgins DG. Multiple sequence alignment using ClustalW and ClustalX. Curr Protoc Bioinformatics. 2002;Chapter 2:Unit 2.3. pmid:18792934
- 40. Hu B, Jin J, Guo A-Y, Zhang H, Luo J, Gao G. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics. 2015;31(8):1296–7. pmid:25504850
- 41. Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y, et al. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol Plant. 2020;13(8):1194–202. pmid:32585190
- 42. Bailey TL, Johnson J, Grant CE, Noble WS. The MEME Suite. Nucleic Acids Res. 2015;43(W1):W39-49. pmid:25953851
- 43. Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science. 2000;290(5494):1151–5. pmid:11073452
- 44. Lee T-H, Tang H, Wang X, Paterson AH. PGDD: a database of gene and genome duplication in plants. Nucleic Acids Res. 2013;41(Database issue):D1152-8. pmid:23180799
- 45. Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19(9):1639–45. pmid:19541911
- 46. Chao J, Li Z, Sun Y, Aluko OO, Wu X, Wang Q, et al. MG2C: a user-friendly online tool for drawing genetic maps. Mol Hortic. 2021;1(1):16. pmid:37789491
- 47. Horton P, Park K-J, Obayashi T, Fujita N, Harada H, Adams-Collier CJ, et al. WoLF PSORT: protein localization predictor. Nucleic Acids Res. 2007;35(Web Server issue):W585-7. pmid:17517783
- 48. Jin J, Tian F, Yang D-C, Meng Y-Q, Kong L, Luo J, et al. PlantTFDB 4.0: toward a central hub for transcription factors and regulatory interactions in plants. Nucleic Acids Res. 2017;45(D1):D1040–5. pmid:27924042
- 49. Rombauts S, Déhais P, Van Montagu M, Rouzé P. PlantCARE, a plant cis-acting regulatory element database. Nucleic Acids Res. 1999;27(1):295–6. pmid:9847207
- 50. Higo K, Ugawa Y, Iwamoto M, Higo H. PLACE: a database of plant cis-acting regulatory DNA elements. Nucleic Acids Res. 1998;26(1):358–9. pmid:9399873
- 51.
%J RCT R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. 2021.
- 52. Guo Z, Kuang Z, Wang Y, Zhao Y, Tao Y, Cheng C, et al. PmiREN: a comprehensive encyclopedia of plant miRNAs. Nucleic Acids Res. 2020;48(D1):D1114–21. pmid:31602478
- 53. Samad AFA, Sajad M, Nazaruddin N, Fauzi IA, Murad AMA, Zainal Z, et al. MicroRNA and Transcription Factor: Key Players in Plant Regulatory Network. Front Plant Sci. 2017;8:565. pmid:28446918
- 54. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504. pmid:14597658
- 55. Severin AJ, Woody JL, Bolon Y-T, Joseph B, Diers BW, Farmer AD, et al. RNA-Seq Atlas of Glycine max: a guide to the soybean transcriptome. BMC Plant Biol. 2010;10:160. pmid:20687943
- 56. Brechenmacher L, Nguyen THN, Zhang N, Jun T-H, Xu D, Mian MAR, et al. Identification of Soybean Proteins and Genes Differentially Regulated in Near Isogenic Lines Differing in Resistance to Aphid Infestation. J Proteome Res. 2015;14(10):4137–46. pmid:26350764
- 57. Belamkar V, Weeks NT, Bharti AK, Farmer AD, Graham MA, Cannon SB. Comprehensive characterization and RNA-Seq profiling of the HD-Zip transcription factor family in soybean (Glycine max) during dehydration and salt stress. BMC Genomics. 2014;15:950. pmid:25362847
- 58. Zhang L, Wen A, Wu X, Pan X, Wu N, Chen X, et al. Molecular identification of the magnesium transport gene family in Brassica napus. Plant Physiol Biochem. 2019;136:204–14. pmid:30690277
- 59.
Mohamadi SF, Jelodar NB, Bagheri N, Gharakhili GAN, Hashemi-Petroudi SH. Genome-wide computational assay of the Mg transporter (MGT) gene family in rice (Oryza sativa L.). 2022.
- 60. Tang Y, Yang X, Li H, Shuai Y, Chen W, Ma D, et al. Uncovering the role of wheat magnesium transporter family genes in abiotic responses. Front Plant Sci. 2023;14:1078299. pmid:36844102
- 61. Ge M, Zhong R, Sadeghnezhad E, Hakeem A, Xiao X, Wang P, et al. Genome-wide identification and expression analysis of magnesium transporter gene family in grape (Vitis vinifera). BMC Plant Biol. 2022;22(1):217. pmid:35477360
- 62. Iñiguez LP, Hernández G. The Evolutionary Relationship between Alternative Splicing and Gene Duplication. Front Genet. 2017;8:14. pmid:28261262
- 63. Koralewski TE, Krutovsky KV. Evolution of exon-intron structure and alternative splicing. PLoS One. 2011;6(3):e18055. pmid:21464961
- 64. Jeffares DC, Penkett CJ, Bähler JJ. Rapidly regulated genes are intron poor. Tig. 2008;24(8):375–8.
- 65. Heidari P, Abdullah, Faraji S, Poczai PJA. Magnesium transporter gene family: genome-wide identification and characterization in Theobroma cacao, Corchorus capsularis, and Gossypium hirsutum of family Malvaceae. Plants. 2021;11(8):1651.
- 66. He Z, Tu Y-C, Tsai C-W, Mount J, Zhang J, Tsai M-F, et al. Structure and function of the human mitochondrial MRS2 channel. bioRxiv. 2023;:2023.08.12.553106. pmid:37645897
- 67. Bin M, Yi G, Zhang XJ. Discovery and characterization of magnesium transporter (MGT) gene family in Citrus sinensis and their role in magnesium deficiency stress. PGR. 2023;:1–14.
- 68. Li N, Xu R, Wang B, Wang J, Huang S, Yu Q, et al. Genome-Wide Identification and Evolutionary Analysis of the SRO Gene Family in Tomato. Front Genet. 2021;12:753638. pmid:34621298
- 69. Patil G, Valliyodan B, Deshmukh R, Prince S, Nicander B, Zhao M, et al. Soybean (Glycine max) SWEET gene family: insights through comparative genomics, transcriptome profiling and whole genome re-sequence analysis. BMC Genomics. 2015;16(1):520. pmid:26162601
- 70. Akram U, Song Y, Liang C, Abid MA, Askari M, Myat AA, et al. Genome-Wide Characterization and Expression Analysis of NHX Gene Family under Salinity Stress in Gossypium barbadense and Its Comparison with Gossypium hirsutum. Genes (Basel). 2020;11(7):803. pmid:32708576
- 71. Wang L, Ding X, Gao Y, Yang S. Genome-wide identification and characterization of GRAS genes in soybean (Glycine max). BMC Plant Biol. 2020;20(1):415. pmid:32891114
- 72. Park S, Shi A, Mou B. Genome-wide identification and expression analysis of the CBF/DREB1 gene family in lettuce. Sci Rep. 2020;10(1):5733. pmid:32235838
- 73. Singh VK, Jain M. Genome-wide survey and comprehensive expression profiling of Aux/IAA gene family in chickpea and soybean. Front Plant Sci. 2015;6:918. pmid:26579165
- 74. Feng L, Chen Z, Ma H, Chen X, Li Y, Wang Y, et al. The IQD gene family in soybean: structure, phylogeny, evolution and expression. PLoS One. 2014;9(10):e110896. pmid:25343341
- 75. Zhang Z, Zhao Y, Feng X, Luo Z, Kong S, Zhang C, et al. Genomic, molecular evolution, and expression analysis of NOX genes in soybean (Glycine max). Genomics. 2019;111(4):619–28. pmid:29621573
- 76. Aleem M, Riaz A, Raza Q, Aleem M, Aslam M, Kong K, et al. Genome-wide characterization and functional analysis of class III peroxidase gene family in soybean reveal regulatory roles of GsPOD40 in drought tolerance. Genomics. 2022;114(1):45–60. pmid:34813918
- 77. Magadum S, Banerjee U, Murugan P, Gangapur D, Ravikesavan R. Gene duplication as a major force in evolution. J Genet. 2013;92(1):155–61. pmid:23640422
- 78. Fraser JA, Huang JC, Pukkila-Worley R, Alspaugh JA, Mitchell TG, Heitman J. Chromosomal translocation and segmental duplication in Cryptococcus neoformans. Eukaryot Cell. 2005;4(2):401–6. pmid:15701802
- 79. Komatsu S, Konishi H, Hashimoto M. The proteomics of plant cell membranes. J Exp Bot. 2007;58(1):103–12. pmid:16804056
- 80. Thomas PD. The Gene Ontology and the Meaning of Biological Function. Methods Mol Biol. 2017;1446:15–24. pmid:27812932
- 81. Yao L-M, Wang B, Cheng L-J, Wu T-L. Identification of key drought stress-related genes in the hyacinth bean. PLoS One. 2013;8(3):e58108. pmid:23472143
- 82. Biłas R, Szafran K, Hnatuszko-Konka K, Kononowicz AK. Cis-regulatory elements used to control gene expression in plants. Plant Cell Tiss Organ Cult. 2016;127(2):269–87.
- 83. Liu J, Wang F, Yu G, Zhang X, Jia C, Qin J, et al. Functional Analysis of the Maize C-Repeat/DRE Motif-Binding Transcription Factor CBF3 Promoter in Response to Abiotic Stress. Int J Mol Sci. 2015;16(6):12131–46. pmid:26030672
- 84. Chen W, Provart NJ, Glazebrook J, Katagiri F, Chang H-S, Eulgem T, et al. Expression profile matrix of Arabidopsis transcription factor genes suggests their putative functions in response to environmental stresses. Plant Cell. 2002;14(3):559–74. pmid:11910004
- 85. Xu M, Zhu J, Zhang M, Wang L. Advances on plant miR169/NF-YA regulation modules. Yi Chuan. 2016;38(8):700–6. pmid:27531608
- 86. Xu MY, Zhang L, Li WW, Hu XL, Wang M-B, Fan YL, et al. Stress-induced early flowering is mediated by miR169 in Arabidopsis thaliana. J Exp Bot. 2014;65(1):89–101. pmid:24336445
- 87. Wu Y, Li X, Zhang J, Zhao H, Tan S, Xu W, et al. ERF subfamily transcription factors and their function in plant responses to abiotic stresses. Front Plant Sci. 2022;13:1042084. pmid:36531407
- 88. Dubos C, Stracke R, Grotewold E, Weisshaar B, Martin C, Lepiniec L. MYB transcription factors in Arabidopsis. Trends Plant Sci. 2010;15(10):573–81. pmid:20674465
- 89. Collinge M, Boller T. Differential induction of two potato genes, Stprx2 and StNAC, in response to infection by Phytophthora infestans and to wounding. Plant Mol Biol. 2001;46(5):521–9. pmid:11516145
- 90. Schmid M, Davison TS, Henz SR, Pape UJ, Demar M, Vingron M, et al. A gene expression map of Arabidopsis thaliana development. Nat Genet. 2005;37(5):501–6. pmid:15806101
- 91. Chen ZC, Ma JF. Magnesium transporters and their role in Al tolerance in plants. Plant Soil. 2012;368(1–2):51–6.
- 92. Serba DD, Meng X, Schnable J, Bashir E, Michaud JP, Vara Prasad PV, et al. Comparative Transcriptome Analysis Reveals Genetic Mechanisms of Sugarcane Aphid Resistance in Grain Sorghum. Int J Mol Sci. 2021;22(13):7129. pmid:34281180
- 93. Chen ZC, Yamaji N, Horie T, Che J, Li J, An G, et al. A Magnesium Transporter OsMGT1 Plays a Critical Role in Salt Tolerance in Rice. Plant Physiol. 2017;174(3):1837–49. pmid:28487477
- 94. Tränkner M, Jákli B, Tavakol E, Geilfus C-M, Cakmak I, Dittert K, et al. Magnesium deficiency decreases biomass water-use efficiency and increases leaf water-use efficiency and oxidative stress in barley plants. Plant Soil. 2016;406(1–2):409–23.