Figures
Abstract
N6-methyladenosine (m6A) is one of the most prevalent mRNA modifications in eukaryotes, playing a crucial role in plant development and stress responses. The m6A modification is regulated by three key components: writers (which add methyl groups), erasers (which remove them), and readers (which interpret the modification). Despite its significance, the role of m6A regulatory genes in plants, particularly in soybean, remains largely unexplored. This study identified 42 m6A regulatory genes in soybean through a comprehensive genome-wide analysis. Structural analysis revealed diverse gene architectures and functional variations across different subgroups. A total of 18 gene duplication events were identified, predominantly evolving under purifying selection. Network analysis and hub gene identification suggested weaker interactions among eraser proteins. Moreover, interaction analysis between miRNAs and soybean m6A regulatory genes indicated a stronger miRNA-mediated regulation of writer components compared to erasers and readers. Expression profiling under various stress conditions highlighted distinct regulatory patterns, where GmECT9, GmECT13, and GmECT17 were highly expressed in roots and nodules, while GmMTB2 and GmALKBH9B2 exhibited maximum upregulation and downregulation, respectively, under combined water deficit and heat stress. Additionally, in the case of mosaic virus infection, GmALKBH9B4 showed significant downregulation, whereas GmMTB1, GmMTB2, and GmALKBH9B1 were upregulated. This study provides a foundational framework for understanding m6A regulatory genes in soybean. The genome-wide insights presented here contribute to a deeper understanding of the molecular mechanisms governing m6A regulation and its potential implications for stress responses and plant development.
Citation: Sayed SB, Rabbi MA, Debnath JP, Hossen K, Ghosh A (2025) Genome-wide identification and characterization of m6A regulatory genes in Soybean: Insights into evolution, miRNA interactions, and stress responses. PLoS One 20(7): e0328773. https://doi.org/10.1371/journal.pone.0328773
Editor: Kai Huang, University of Pennsylvania, UNITED STATES OF AMERICA
Received: May 25, 2025; Accepted: July 7, 2025; Published: July 24, 2025
Copyright: © 2025 Sayed et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: ac, amino acid; bp, base pair; CDS, Coding DNA sequence; KDa, kilo Dalton; pI, Iso electric point; m6A, N6-methyladenosine; LFC, Log Fold Change; CC, cellular component; BP, biological process; MF, molecular function; FDR, False discovery rate; FPKM, fragments per kilobase of exon per million; CRE, cis-acting regulatory element; PPI, protein-protein interaction; EN, emerging nodules; MN, mature nodules; ELR, emerging lateral roots; YLR, young lateral roots; SMV, soybean mosaic virus; 3D, three dimensions; Ka, Nonsynonymous rate; Ks, Synonymous rate; Mya, Million years ago
Introduction
Plants have continuously suffered from various biotic and abiotic stressors such as pathogens, nutritional imbalance, salt, light intensity, and drought. To cope with these environmental variations, plants must evolve multiple mechanisms [1]. Epigenetic modification is one of them, which is associated with gene expression regulation under stress [2]. Epigenetics refers to stimuli-triggered changes in gene expression without altering the underlying DNA sequence that can be passed to offspring [3]. This process is mainly involved in DNA/RNA methylation, histone modifications, chromatin remodeling, and noncoding RNAs.
To date, RNA modification is an emerging major epigenetic modification in the plant life cycle and plays a critical role in the regulation of gene expression, both at the transcriptional and post-transcriptional levels [4]. Like DNA and protein, RNA goes through 160 types of post-translational modifications, where mRNA modification is found to be less common in comparison to tRNA and rRNA. Among mRNA modifications, N6-methyl-adenosine (m6A) is one of the most abundant, inner, dynamic, and reversible posttranscriptional modifications in eukaryotes and serves as a novel epigenetic marker that is involved in various biological processes [5]. Furthermore, three enzyme complexes, namely methyltransferases (writers), demethylases (erasers), and m6A-binding proteins (readers), are found to be involved in introducing, deleting, and interpreting specific methylation marks on mRNAs, respectively [6].
The writer complex was first discovered in mammals that included methyltransferase-like 3 (METTL3) [7], methyltransferase-like 14 (METTL14) [8], wilms tumor 1-associated protein (WTAP) [9], RNA binding motif protein 15/15B (RBM15/RBM15B) [10], vir-like m6A methyltransferase associated (VIRMA or KIAA1429) [11], zinc finger CCCH-type containing 13 (ZC3H13) [12], methyltransferase-like 16 (METTL16) [13], methyltransferase-like 4 (METTL4) [14], methyltransferase-like 5 (METTL5) [15], and zinc finger CCHC-type containing 4 (ZCCHC4) [16]. The orthologs of these complexes have also been identified in Arabidopsis, such as MTA (ortholog of METTL3), MTB (ortholog of METTL14), VIR (ortholog of VIRMA), and FIP37 (ortholog of WTAP) [17–20]. Moreover, E3 ubiquitin ligase HAKAI, an additional m6A writer component, has been identified in Arabidopsis [21]. METTL3 (S-adenosyl-methionine-binding protein) is the key catalytic component of the m6A methyltransferase complex. METTL14 and WTAP, making complex with METTL3, are found to be involved in M6A installation and modification, respectively [8]. Besides this, VIRMA is a regulatory subunit of m6A methyltransferase, recruiting the m6A complex to the special RNA site and facilitating m6A installation [11].
Conversely, the m6A demethylase erases the methyl group from the mRNA, indicating the reversibility of m6A modification. FTO (fat mass and obesity-associated protein) is the first eraser complex discovered in mammals that facilitates in restoration of the methylated base to the adenine base [22]. Subsequently, ALKBH5 (alkB homolog 5), the second mammalian m6A demethylase facilitating the removal of m6A, potentially affects different subsets of target mRNAs [23]. Arabidopsis contains several putative m6A eraser ALKBH family proteins [20]. ALKBH9B and ALKBH10B are the only two eraser proteins found in Arabidopsis that have been functionally investigated concerning viral infection and floral transition [24].
While writers and erasers are actively engaged in their dynamic action, the reader family regulates mRNA nuclear export, splicing, degradation, translation, and stability. The YT521-B homology (YTH) domain family is the first identified mammalian reader complex, which includes YTHDF1, YTHDF2, YTHDF3, YTHDC1, and YTHDC2 [25]. Different reader proteins have different functions on m6A-modified RNAs. For example, YTHDF2 regulates the degradation [26], YTHDF1 modulates translation [27], FMRP enhances nuclear export and stability [28], and YTHDC1 modulates nuclear export and splicing [29]. Furthermore, YTHDC2 and Eukaryotic initiation factor 3 (eIF3) are also crucial in mRNA translation [27,30]. Notably, three evolutionarily conserved C-terminal regions (ECT) family proteins have recently been functionally characterized in Arabidopsis as YTHD homologs [31,32].
The m6A modification is a crucial regulator performing three main functions: mRNA processing, plant growth and development, and stress response [33]. Researchers suggest that m6A regulatory writer proteins are essential for shoot and root growth, leaf, cotyledon, and floral development [18]. Partial loss of FIP37 causes huge over-proliferation of shoot meristems. Moreover, depletion of FIP37 results in embryo lethality [17]. The m6A modification-associated genes are expressed differentially in response to biotic and abiotic stresses, including drought [34], salt [35], cold [36], and bacterial infections [37]. Nevertheless, m6A modifications have been studied in limited plant species, highlighting the need for future investigation in other plants.
Soybean (Glycine max), a Leguminosae family member, has become one of the main economic oilseed beans [38]. Currently, researchers are shedding light on pinpointing stress-responsive writer as well as eraser genes in soybean [39,40]. Nevertheless, comprehensive or systematic analysis of all the m6A gene families remains scarce. Therefore, we were encouraged to conduct genome-wide identification of writers, erasers, and readers genes of the m6A pathway in soybean. A total of 12 writers, 11 erasers, and 19 readers were identified. A comprehensive analysis of the m6A regulatory genes in soybeans, including classification, evolution, gene structure, and potential interaction network, was performed. Expression patterns in response to a range of biotic and abiotic stresses were also studied.
Materials and methods
Genome-wide identification of the m6A gene family in Soybean
To identify m6A components and their protein families in soybeans, the amino acid sequences of m6A-related proteins reported in Arabidopsis thaliana [41], including writers, erasers, and readers, were used as queries to perform BLASTP against the soybean genomic sequences in the plant genomics resource database phytozome [42]. Putative genes and corresponding protein sequences were retrieved from the same database, and the presence of conserved domains was confirmed by NCBI Conserved Domain (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) and Pfam (http://pfam.xfam.org/). The identity of the “writer” family was confirmed by the presence of the MT-A70 domain (PF05063), Wtap domain (PF17098), and Vir-N domain (PF15912). The presence of 2-OG Fe (II) oxygenase superfamily (PF13532) and YTH domain (PF04146) confirmed the identity of the “eraser” and “reader” families, respectively. Candidates without the m6A conserved and typical domains were not further considered. The prefix “Gm” for Glycine max was added to all members, followed by subclass IDs (MT, FIP37, VIRILIZER, ECT, CPSF, and ALKBH). Chromosomal location, strand position, CDS coordinate (5’ to 3’), gene length, CDS length, and protein length were retrieved from the Phytozome database [42]. Furthermore, the physicochemical properties of the identified proteins, such as molecular weight and theoretical isoelectric point, were calculated using the ExPASy-ProtParam tool [43]. Subcellular localization of proteins was predicted using BUSCA [44].
Chromosomal localization and gene duplication events
The GFF (General Feature Format) file of soybean was retrieved from the phytozome database [42]. Chromosomal locations were visualized by TBtool [45]. Using the Plant Genome Duplication Database (PGDD, http://pdgd.njau.edu.cn:8080/) [46], paralogous genes were found by calculating the number of duplication events among soybean genes. From the same database, the synonymous substitution rate (Ks) and non-synonymous substitution rate (Ka) were obtained. Using the Ka/Ks ratio, the selection pressure of duplicated genes was computed. Ka/Ks < 1 denotes purifying selection, Ka/Ks = 1 implies neutral drifting, and Ka/Ks > 1 signifies positive or Darwinian selection for a pair of genes [47]. Using T = Ks/ (2 × 6.1 × 10−9) × 10−6 Mya formula, the approximate date (Mya, million years ago) of each duplication event was estimated [48].
Gene structure, conserved domain, and conserved motif analysis
Genomic and CDS sequences were retrieved from the phytozome database [42]. Subsequently, the physical mapping of the exon-intron sequence was plotted using GSDS (Gene Structure Display Server 2.0, http://gsds.gao-lab.org/). The conserved domains of m6A regulatory protein sequences were analyzed in the Batch CD-Search program [49], and then the output file was submitted to TBtools for visualizing analyses. The conserved motifs were analyzed using TBtools with a maximum number of motifs of 10 [45].
Phylogenetic analysis
Protein sequences of writer, eraser, and reader of 13 species, namely Linum usitatissimum, Acorus americanus, Anacardium occidentale, Aquilegia coerulea, Brachypodium hybridum, Coffea arabica, Gossypium barbadense, Oryza sativa, Panicum hallii, Solanum lycopersicum, Triticum aestivum, Zea mays, and Porphyra umbilicalis were retrieved from the phytozome database. Multiple sequence alignment was performed using the MUSCLE algorithm [50], and phylogenetic trees for the protein families were constructed using Molecular Evolutionary Genetics Analysis (MEGA) 11 software [51]. The maximum likelihood method was employed with node reliability assessed through bootstrap analysis based on 1000 replicates. The resulting phylogenetic trees were further refined and graphically represented using the Interactive Tree of Life (iTOL) platform (https://itol.embl.de).
Promoter analysis in Soybean m6A regulatory genes
The 1000 bp upstream sequences from the transcription start site were collected and analyzed through the PlantCare database [52] for cis-acting element analysis. The presence of identified cis-acting elements on the putative promoter of every gene had been depicted using TBtool [45].
Functional enrichment visualization of m6A regulatory genes
Gene ontology enrichment in three categories, such as cellular component (CC), biological process (BP), and molecular function (MF), was executed using the STRING v12 database (https://string-db.org/). FDR (False discovery rate) =<0.05 was set. The number of terms was shown as 10. The results were shown as a bar chart and a bubble chart.
Protein-protein interactions, cluster prediction, and hub gene identifications
The STRING v12 database (https://string-db.org/) was used to construct the protein-protein interaction network among m6A regulatory genes in soybeans. The output file from STRING was visualized by Cytoscape software [53]. The network’s co-expressing network clusters (highly linked areas) were examined using the Molecular Complex Detection (MCODE) (v2.0.2) plugin [54]. Hub genes were recognized using the cytoHubba plugin [55] in Cytoscape [53] with the shortest path and ranked by degree.
Structure construction by homology modeling
The homology modeling approach was used to predict the three-dimensional (3D) structures of the m6A regulatory proteins. The amino acid sequence was queried against the SWISS-MODEL server (https://swissmodel.expasy.org/) [56] to search for templates, and the best templates with a similar amino acid sequence and known three-dimensional structures were used to build the models. These proteins’ structures were examined with the UCSF ChimeraX [57] visualization tools.
microRNA targets prediction
Mature miRNAs were downloaded from the miRBase database [58]. The psRNAT-target database [59] was used to search the regulatory relationship between miRNAs and soybean m6A regulatory genes, with an Expectation threshold of < 5 and other parameters at their default values. Cytoscape [53] was used to visualize the interaction network.
Expression analysis of m6A regulatory genes
RNAseq datasets (GSE137263 [60], GSE129509 [61], and GSE186317 [62]) were downloaded from the GEO database (https://www.ncbi.nlm.nih.gov/geo/) of the NCBI (https://www.ncbi.nlm.nih.gov/) to observe the expression pattern of m6A genes in different organs, biotic, and abiotic stress. Normalized FPKM (fragments per kilobase of exon per million) data were used to observe expression levels in emerging nodules, mature nodules, emerging lateral roots, and mature lateral roots. To identify the expression patterns in abiotic stress, including salt stress, heat stress, dehydration, water deficit, and combined water deficit and heat stress, Log Fold Change (LFC) values were extracted using the R package DESeq2 [63]. The expression pattern of m6A regulatory genes in soybean mosaic virus (SMV) infection was also observed. The heatmaps were plotted using TBtool with normalized FPKM (roots and nodules) and LFC values (abiotic and biotic stress).
Results
Genome‑wide identification and characterization of m6A regulatory genes in Soybean
After a comprehensive screening of the soybean genome, 12 m6A writers, 11 m6A erasers, and 19 m6A readers were identified. All m6A regulatory genes were named according to their subclass identifier. The amino acid sequence length, relative molecular weights (MWs), isoelectric points (pI), subcellular localization, and gravy were calculated (Table 1). It was found that the soybean writer’s CDS length ranged from 624 bp to 6561 bp. GmVIRILIZER2 is the biggest member of the writer family and has a polypeptide length of 2187 aa and a molecular weight of 240.88 kDa. Meanwhile, GmFIP37c is the smallest, having a polypeptide length of 208 aa and a molecular weight of 23.13 kDa. The pI values of the writer family differed widely, ranging from 5.04 (GmFIP37a) to 8.81 (GmMTC2). Among the 12 writer members, ten exhibit acidic pI, while the remaining two show basic features. Likewise, the CDS length of the m6A eraser varies from 957 bp to 2052 bp. GmALKBH9B3 is the longest eraser protein with a 684 aa long and a molecular weight of 73.46 kDa. On the other hand, the smallest one, GmALKBH10B4, is 957 aa long with a molecular weight of 35.39 kDa. In terms of pI, eight of eleven erasers have acidic pI, whereas three others possess basic pI. The predicted CDS length of the m6A reader also varies from 387 bp (GmECT15) to 2127 bp (GmECT9). The largest one (GmECT9) is 77.17 kDa in size and 709 aa long. The smallest one (GmECT15) is 129 aa long with a molecular weight of 14.59 kDa. In terms of localization, most m6A regulatory proteins were in the nucleus, and only several proteins were in the plasma membrane (GmECT13, GmALKBH10B5, GmECT9) and chloroplast (GmMTC2, GmECT6). The Grand Average of Hydropathicity (GRAVY) values for all m6A proteins are less than zero, indicating that all soybean m6A proteins are hydrophilic.
Chromosomal localization and gene duplication
The identified putative 42 genes have been distributed throughout the 17 chromosomes of soybeans (Table 1 and S1 Fig). The distribution of genes across the chromosomes can be summarized as follows: Chromosomes 17 and 8 each have the highest count, with four genes. Following closely, chromosomes 16, 10, 14, 5, and 2 each have three genes. Chromosomes 20 and 19 have two genes each. Finally, chromosomes 4, 6, 3, 1, 12, 9, and 15 each have a single gene in their respective location. Different duplication events of m6A regulatory genes in soybeans were identified through PGDD [46], as gene duplication is an important mechanism for gene family expansion. A total of 18 duplication events were observed (Table 2 and S1 Fig). Six writer’s sister gene pairs (GmCPSF30a-GmCPSF30b, GmMTA1-GmMTA2, GmMTB1-GmMTB2, GmFIP37a-GmFIP37c, GmFIP37b-GmFIP37d, GmVIRILIZER1-GmVIRILIZER2) were observed to be duplicated. Alongside, five erasers were discovered to be duplicated: GmALKBH10B7-GmALKBH9B1, GmALKBH10B1-GmALKBH10B3, GmALKBH10B4-GmALKBH10B5, GmALKBH10B2-GmALKBH10B6, and GmALKBH9B2-GmALKBH9B3. In addition, sister gene pairs were shown to be duplicated in seven readers (GmECT4-GmECT11, GmECT15-GmECT16, GmECT5-GmECT12, GmECT9-GmECT13, GmECT8-GmECT17, GmECT1-GmECT3, and GmECT10-GmECT14). All the duplication events of m6A regulatory genes were segmental duplications (Table 2). To explore the evolutionary history and direction of selection, non-synonymous to synonymous substitution (Ka/Ks) ratios were calculated. Purifying selection, neutral selection, and positive selection were indicated by Ka/Ks ratios below 1, equal to 1, and above 1, respectively. In this study, all analyzed duplication events yielded Ka/Ks ratios that were less than 1, indicating the influence of purifying selection on these genes throughout their evolutionary history. The time of divergence of these gene pairs varied from 5.71 to 15.50 Mya (Table 2).
Structural analysis of writer, eraser, and reader members
Using amino acid sequences, maximum likelihood phylogenetic trees were built to investigate the structure and sequencing properties of the writer, eraser, and reader genes in soybeans (Fig 1A). All the writer genes, such as GmMTA, GmMTB, GmMTC, GmVIRILIZER, and GmFIP37 clustered according to their group. Likewise, all the GmALKBH10B genes clustered together except GmALKBH10B7, which formed a cluster with GmALKBH9B1. In the reader group, most of the GmECT genes clustered in the same clade. However, GmECT15 and GmECT16 were placed away from those clusters. Besides, GmECT10 and GmECT14 were clustered with GmCPSF30 genes rather than GmECT genes. The exon-intron distribution indicates the diversity of gene functions. According to the gene structure, the exon numbers of soybean m6A writers, erasers, and readers varied from 6–27, 6–8, and 2–9, respectively (Fig 1B). Amongst the m6A writers, the highest number of exons was observed in GmVIRILIZER1 (27), and the lowest number was seen in GmMTB1 (6) and GmMTB2 (6). In terms of eraser, all the genes contain only 6 to 8 exons, indicating the lowest number of exons among the three m6A regulatory proteins. Within readers, GmECT15 contains only two exons.
A. Using MEGA11, three groups’ phylogenetic trees were constructed. B. The gene structures were plotted by using GSDS. The exon-intron length was proportionately shown and could be approximated using the scale below. Exons are shown as blue boxes, upstream/downstream sequences are represented as dark red boxes, and introns are shown as grey lines. C. Functional domains of m6A regulatory genes in soybean. The YTH reader domains were marked in lavender, while the Eraser domain 2OG-Fe (II) oxy and superfamily domains were indicated by sky blue and cyan color, respectively. The MTA-70, MTA-70 superfamily, WTAP, and VIR-N writer domains were visually distinguished by light grey, dark grey, red, and yellow, respectively.
Group-specific conserved domains analysis indicates MT-A70, Wtap, HAKAI, and VIR_N as m6A writers. MT-70 domain was found in MTA, while the MT-70 superfamily was found in both MTB and MTC proteins (Fig 1C). In addition, GmFIP37a, GmFIP37b, GmFIP37c, and GmFIP37d were characterized with the WTAP superfamily. The VIR_N superfamily was found in GmVIRILIZER1 and GmVIRILIZER2. We found GmHAKAI according to phytozome. However, Pfam analysis did not show a HAKAI-related domain for GmHAKAI. Therefore, this gene was excluded from downstream analysis. The eraser family is characterized by the 2OG-FeII_Oxy superfamily (Fe (II) dependent Oxygenase superfamily) and 2OG-FeII_Oxy, all the identified putative eraser genes containing these conserved domains (Fig 1C). YTH (YT-521-B-like domain) was used to identify reader proteins, and all the identified readers contained that domain (Fig 1C). In the motif analysis, MTC1 and MTC2 contain only one motif (S2A Fig). Other members of the writer contain 4 to 6 motifs. All the eraser families contain 5–9 conserved motifs, while 2–5 motifs were observed in the reader family (S2B and S2C Fig).
The phylogenetic classifications of m6A regulatory genes in soybean
Demonstrating phylogenetic relationships is crucial as it gives insight into deducing the gene origins, examining their molecular adaptation, learning about the evolution of morphological traits, and rebuilding demographic variations in recently differentiated species [64]. Phylogenetic trees for m6A writers, erasers, and readers were constructed to study the evolutionary relationships of m6A modifiers. All the writer family, MT, FIP37(WTAP), VIR_N, and HAKAI formed four groups named I.II.III and IV (Fig 2A). The phylogeny revealed that GmVIRILIZER, GmMTA, GmMATB, and GmMTC have three common closest orthologs of Gossypium barbadense, Linum usitatissimum, and Anacardium occidentale. Another writer family, GmFIP37, has the closest ortholog of Coffea arabica and Solanum lycopersicum, while Linum usitatissimum is the distantly related ortholog. Zea mays is the most distantly related ortholog of GmVIRILIZER, demonstrating the plant kingdom’s evolutionary divergence. On the other hand, the red algae species Porphyra umbilicalis has the most distant association with the orthologs of GmMTA, suggesting a wider evolutionary range. The orthologs of GmMTB and GmMTC are most distantly linked to Coffea arabica, highlighting the evolutionary connection among flower species.
A. Phylogenetic tree of writer genes. B. Phylogenetic tree of m6A eraser genes. C. Phylogenetic tree of m6A reader genes.
The YTH domain-containing eraser protein orthologs also form 4 different groups named I, II, III, and IV (Fig 2B). The various species’ readers are categorized according to their class (Fig 2C). Most of these readers’ orthologs of Oryza sativa are out-grouped from their respective clades. While monocot-monocot and dicot-dicot couples are found in certain sister taxa, monocot-dicot pairs are found in others. The closely related species A. americanus and A. coerulea, for example, show strong evidence for their nodes and a recent shared ancestor. The implication here is that monocots and dicots may have split out from a common ancestor. These three evolutionary trees demonstrate how m6A dynamics are conserved across a variety of kingdoms. The presence and functional relevance of the m6A alteration in different species within these categories have been demonstrated by these phylogenies.
Identification of cis-regulatory elements in the putative promoter region
The spatiotemporal specific expression of protein-coding genes can be uncovered by analyzing the cis-acting regulatory elements (CREs) in the promoter region. Using the PlantCare database [52], CREs in the promoter regions (1000 bp upstream of the transcription start site) of m6A regulatory genes were predicted to better understand the transcriptional regulation and potential biological functions of soybean m6A regulatory genes (Fig 3). These CRE elements were divided into four groups, including light-responsive, phytohormone-responsive, stress-responsive, and development-related. Light-responsive elements were predominant in all the m6A regulatory genes, suggesting that these gene sets may be regulated by light signaling. ABRE and TGA are predominant in phytohormone elements. This substantial number of phytohormone-responsive elements indicated the possible activation or inhibition of the expression level of the m6A regulatory gene by the hormone signals. In terms of development, CAT-box, GCN4-motif, ARE, and O2-site were enriched in the promoter regions. These CREs were respectively associated with meristem expression, endosperm expression, anaerobic induction, and metabolism regulation. MBS (related to drought response), TC-rich repeat (related to defense and stress), and LTR (related to cold response) elements were also highly enriched.
The 1000 bp 5′ upstream region of all the identified m6A genes was retrieved and analyzed using the PlantCARE database to determine the presence and number of cis-acting regulatory elements. The presence of selected motifs (x-axis) in the promoter of corresponding genes (Y-axis) was represented by the number of motifs in the heatmap.
Functional analysis of m6A regulatory genes
The GO analysis of the m6A regulatory genes revealed that the biological process was mostly engaged in the regulation of gene expression (Fig 4A). Nucleic acid metabolic process, regulation of mRNA metabolic process, negative regulation of gene expression, and mRNA metabolic process were the dominant molecular processes. RNA binding, mRNA binding, Nucleic acid binding, and binding were observed in molecular function (Fig 4B). Intracellular anatomical structure was seen in the cellular component (Fig 4C). These results highlight the critical function of m6A regulatory genes in coordinating cellular and molecular processes that influence the dynamics of genes and RNA.
A. Bubble diagram illustrating the enrichment of the gene ontology for the category of biological processes. B. A bubble chart for the Gene Ontology enrichment analysis term for the category of molecular function C. Bubble diagram for the term cellular component category from the analysis of Gene Ontology enrichment. Bubble size is proportional to the number of genes, and bubble color represents the value of False Discovery Rate (FDR). Lighter green bubbles indicate lower FDR values, meaning the results are statistically significant.
Protein-protein interaction and cluster identification
A multi-protein complex performs together in m6A modification, highlighting the prediction of a potential protein-protein interaction network in this regard. There were 192 edges and 42 nodes in the protein-protein interaction (PPI) network (Fig 5). Two clusters were identified by using MCODE [54]. Cluster 1 contains writer proteins, including MTA, MTB, and VIRILLIZER (S3A Fig). Cluster 2 contains reader and writer proteins named GmFIP37a, GmFIP37b, GmFIP37c, GmFIP37d, MTC1, MTC 2, CPSF30a and ECT14 (S3B Fig). The cluster formation gives valuable insight into the biological mechanism of m6A regulatory proteins. These proteins are vital for plant growth and development, as well as for how well plants adapt to their surroundings. These findings shed light on the molecular mechanism and possible biological purposes of soybean m6A regulatory genes.
PPI networks of writer eraser and reader genes were generated using the STRING database. Nodes represent the proteins, and edges represent the protein-protein associations.
Identification of hub genes and their 3D structure modeling
The top ten hub genes were extracted by the cytohubba plugin [55] in Cytoscape [53]. Notably, there was no eraser member in this hub gene (S3C Fig). GmVIRILIZER1 may interact with GmVIRILIZER2, GmECT10, GmECT14, GmMTB1, and GmMTB2, according to the network. GmECT10 and GmECT14 interact with GmMTB1. Furthermore, GmMTA1 engages in interactions with every hub gene in this network (S3C Fig). However, GmMTA2 has no connection with GmMTA1 or GmCPSF30a. Furthermore, GmMTB2 engages in interactions with GmMTB1, GmECT14, and GmECT10. No interaction was found between GmCPSF30a and reader hub genes. However, GmFIP37d interacts with two reader families, including GmECT10 and GmECT14. GmVIRILIZER2 interacts with GmMTB1, GmECT10, GmECT14 and GmMTB2. The protein structure of 10 hub members was predicted using SWISS-MODEL [56]. Protein structure has a polypeptide chain of amino acids specific to each protein type. The chain is folded into a secondary structure characterized by alpha-helices and beta-sheets [65] (Fig 6).
Blue and green colors represent the alpha helix and beta sheets, respectively.
MicroRNA target prediction
Plant microRNAs (miRNAs), small non-coding RNAs which are usually form duplexes with their targets. They play a crucial role in regulating a wide range of biological metabolic activities, including hormone signaling, plant growth, flowering time, leaf morphogenesis, and reactions to environmental stressors. Therefore, potential miRNA binding sites of m6A regulatory genes were examined in this study by using psRNATarget [59]. After filtering interactions with an Expectation penalty score below 5, it was found that 10 writers had 81 miRNAs, 10 erasers had 31 miRNAs, and 18 readers had 60 miRNAs (S4 Fig). It is noteworthy that soybean m6A writers (n = 81) seemed to be more regulated by miRNA than m6A readers (n = 60) and erasers (n = 31). Interestingly, our research did not identify any miRNAs that regulate the two writers (GmMTC1 and GmMTC2), one eraser (ALKBH9B2), and one reader (ECT16) in soybeans. However, more research is required to fully understand the complex interactions between m6A regulatory genes and miRNA.
Expression analysis of m6A regulatory genes in roots and nodules of Soybean
Symbiotic legume nodules and lateral roots arise away from the root meristem via dedifferentiation events in soybean [61]. These organs share some morphological and developmental similarities. We explored the expression level of m6A in emerging nodules (EN), mature nodules (MN), emerging lateral roots (ELR), and young lateral roots (YLR) of soybeans. Most of the genes displayed a broad expression range across all the samples, indicating that they were extensively involved in the growth and development of roots and nodules (Fig 7). GmMTA1 displayed vast expression levels in mature nodules. Notably, GmMTB1, GmFIP37a, GmFIP37d, GmALKBH10B6, GmECT1, and GmECT12 showed a medium level of expression in all the selected samples (EN, MN, ELR, and YLR). GmALKBH10B5 showed higher expression levels in mature nodules compared to EN, ELR, and YLR. Additionally, GmECT9 was expressed tremendously in all the selected tissues, while GmECT13 exhibited immense expression in EN and ELR compared to MN and YLR. Importantly, GmECT17 showed higher expression in ELR but displayed medium expression in the other three tissues.
Soybean RNA writer, eraser, and reader gene’s genome-wide RNA-Seq data were obtained from the GEO of NCBI. Expression levels in emerging nodules, mature nodules, emerging lateral roots, and mature lateral roots were observed by using normalized FPKM (fragments per kilobase of exon per million). The color scale provided at the right of the figure represents the level of expression.
Expression patterns of m6A regulatory genes under various abiotic and biotic stress
The expression pattern of m6A regulatory genes was observed under various abiotic and biotic stresses (Figs 8A and 8B). The majority of m6A regulatory genes remained unresponsive when subjected to salt stress and dehydration. GmMTB2 and GmECT9 exhibited significant upregulations in heat stress and combined water deficit and heat stress (Fig 8A). On the contrary, GmALKBH9B2 showed downregulation in combined water deficit and heat stress. The remaining soybean’s m6A genes displayed a medium level of expression except GmECT4 and GmECT2. These two genes did not show any changes across all the abiotic stresses (Fig 8A).
A. The expression pattern of 42 genes was analyzed in response to five abiotic stress conditions, including salt, heat, dehydration, water deficit, and combined water deficit and heat stress. B. The expression profile of writer, eraser, and reader was analyzed in response to SMV (Soybean Mosaic Virus). Heatmaps were generated using TBtool with the LFC (Log Fold Change), which delineates the expression pattern of 42 m6A regulatory genes. The color scale provided at the right of the figure represents the level of expression. The stress-induced upregulation and downregulation of all the genes are indicated by the red and blue colors, respectively.
The soybean mosaic virus (SMV) treatment resulted in the significantly higher up-regulation of multiple genes, such as GmMTB1, GmMTB2, and GmALKBH9B1 (Fig 8B). In the writer’s family GmMTA2, GmCPSF30a, GmVIRILIZER1, GmMTA1, GmVIRILIZER2, and GmCPSF30b showed upregulation compared to control. In the ALKBH group, three genes, such as GmALKBH10B3, GmALKBH10B7, and GmALKBH10B6 upregulated in SMV treatment. Additionally, in the readers group, GmECT9, GmECT2, GmECT17, GmECT15, and GmECT7 displayed upregulation. GmALKBH9B4 was the only member of the m6A regulatory gene that displayed significant downregulation in SVM treatment. Overall, the expression patterns of m6A regulatory genes in soybeans under different stress conditions indicated that they might play critical biological roles in various stress responses.
Discussion
RNA N6-methyladenosine plays a crucial regulatory role in plant growth and development. The methylation levels of target transcripts are dynamically regulated by three types of m6A regulatory proteins. A total of 42 candidate m6A regulatory genes were identified in soybean, including 12 writers, 11 erasers, and 19 readers (Table 1). The gene sets were confirmed by the conserved domains of writer, reader, and eraser. Soybean (42) has more m6A regulatory genes than others, such as Arabidopsis (33), tomato (25), grape (40), rice (33), S. moellendorffii (22), M. polymorpha (16), P. patens (18), and Chinese pine (36) [66]. On the other hand, soybean falls short of tobacco (52) and upland cotton (75), common wheat (85), and maize (55) [66]. According to Su et al [66], a low number of m6A regulatory genes were found in Chinese pine (36) compared to species such as common wheat and upland cotton, though Chinese pine has the largest genome size (25.4 Gb). The plants that have experienced polyploidization events, such as common wheat (2n = 6x = 42), upland cotton (2n = 4x = 52), and tobacco (2n = 4x = 48), exhibit a higher abundance of m6A regulatory genes [67–69]. Likewise, soybean (2n = 4x = 40) [70], a polyploid species, also possesses a higher number of m6A regulatory genes. Additionally, maize, a segmental allopolyploid, also shows a higher number of m6A regulatory genes [71]. Using PGDD, a total of 18 duplication events were identified in this study. All the duplication events were segmental duplication (Table 2, and S1 Fig). From the Ka and Ks analysis, duplicated gene pairs went through purifying selection, a finding consistent with studies on tomatoes [72], tea plants [73], and tobacco [66].
To explore the evolutionary patterns of m6A writer, eraser, and reader genes, phylogenetic trees were constructed across 13 plant species, which included Rhodophyta (red algae) and Angiosperms (comprising eudicots and monocots) (Fig 2). Monocot-dicot, monocot-monocot, and dicot-dicot pairs were spotted in this study. For instance, Acorus americanus and Aquilegia coerulea share a recent ancestor, implying shared monocot-dicot divergence. In the previous study on m6A genes of Oryza sativa, a similar evolutionary pattern was observed [74]. Cis-regulatory elements in the promoter region were identified. According to this analysis, CREs were associated with light response, hormone response, plant growth and development, and stress response. Similar findings were also observed in other plant species such as tobacco [66], tea [73], tomato [72], and poplar [75]. Light-responsive CREs were predominant in soybean m6A regulatory genes (Fig 3).
GO terms in three categories, including biological process, cellular components, and molecular function, were documented. These results indicate the involvement in RNA modification of writer, eraser, and reader genes of soybean (Fig 4). The PPI network revealed that MTA1, MTA2, MTB1, and MTB2 are highly interconnected genes, indicating their importance as the catalytic core (Fig 5). Most of the erasers did not interact with the writer and the reader. On the other side, CPSF30a and CPSF30b, two readers, had stronger interaction than any other reader components (Fig 5). Cluster analysis showed two cluster formations of m6A regulatory genes of soybeans (S3 Fig). Six writer components formed cluster 1, highlighting their dynamic action in RNA modification (S3A Fig). Writer components also predominated in cluster 2; nevertheless, no erasers were found in any cluster (S3A and S3B Fig). Previous studies also found similar results that there were no interactions among eraser proteins in tobacco [66]. Furthermore, hub gene identification exhibited that there were no eraser genes in the top 10 hub genes (S3C Fig). Using homology modeling, the 3D structure of 10 hub genes was predicted as these genes might be the core catalytic component in m6A methylation (Fig 6). The secondary structures, such as alpha helices and beta sheets, are important to maintain protein stability and functionality [76]. To explore the mutual interaction of miRNA and m6A soybean, miRNA target prediction was performed. Interestingly, the writer component had greater miRNA regulation compared to readers and erasers (S4 Fig). These findings indicate that soybean writers may prefer to be regulated by miRNA.
Researchers have demonstrated that altering the m6A system may cause irregular growth and development. Many previous studies have mentioned the association of m6A modification with reproductive cell development [77] and fruit ripening [78]. In this study, soybean m6A expression was observed in roots and nodules. Readers and erasers have a higher expression than writers regarding roots and nodules (Fig 7). Two reader genes named GmECT9 and GmECT13 are highly expressed in the respective samples. According to prior studies, CPSF30L in Arabidopsis might regulate APA by binding to m6A-modified RNAs that are involved in nitrate signaling, particularly WRKY1 and NRT1.1 [79]. As GmECT9 and GmECT13 were highly expressed in roots and nodules, it can be hypothesized that these genes have a role in nitrogen absorption. However, further investigation is needed in this regard.
CRE elements such as MBS (related to drought response), TC-rich repeat (related to defense and stress), and LTR (related to cold response) were highly enriched in soybean m6A genes according to Promoter analysis (Fig 3). This result indicates the involvement of soybeans’ m6A genes in environmental stimuli. Salt stress has been shown to have a major impact on the m6A methylation levels of mRNA in Arabidopsis [35]. The expression patterns of GmMTAs and GmMTBs under various abiotic stressors have been documented in earlier research, suggesting their possible role in stress tolerance, particularly in the reaction to darkness or alkalinity [39]. According to another research, GmALKB10B genes were probably triggered by alkalinity, cold, and dryness [40]. In this study, five abiotic stress responses, such as salt stress, dehydration, heat stress, water deficit, and combined water and heat have been reported. Most of the soybean’s m6A genes did not show any changes in salt and dehydration stress, indicating their stress tolerance possibility (Fig 8A). Notably, two genes named GmMTB2 and GmALKBH9B2 showed upregulation and downregulation, respectively, in combined water and heat stresses. Interestingly, GmECT4 and GmECT2 did not show any changes across all the abiotic stresses. Therefore, we assumed that these two genes might be stress tolerance m6A genes in soybeans. Validating these findings requires a wet lab inquiry. Distinct biotic stresses also have a great impact on the plant life cycle. NtALKBH10 and NtVIR1 of tobacco exhibited robust responses to specific biotic stresses (R. solanacearum and black shank) but showed insensitivity to the Cucumber Mosaic Virus (CMV) [66]. In this study, biotic stress responses (soybean mosaic virus) have been documented. Most of the soybean’s m6A genes displayed upregulation in SMV treatment (Fig 8B). Nevertheless, one member called GmALKBH9B4 was downregulated in this regard. In conclusion, the m6A regulatory genes of soybeans showed a variety of expression patterns under stress, nodules, and roots, suggesting that these genes have a multitude of roles. The precise processes of the putative m6A regulatory genes in soybeans require further investigation.
Conclusion
This study presents a comprehensive and systematic investigation of m6A regulatory genes in soybean, and a total of 42 m6A regulatory genes were identified. We analyzed the features of these genes in terms of gene structure, conserved domains, and motifs. Phylogenetic study and duplication events were also examined. Eighteen duplication events that evolved through purifying selection were found. Functional analysis of m6A regulatory genes in cis-elements and interaction networks demonstrated their crucial roles in soybeans. The altered expression pattern of these genes had been profiled in roots, nodules, and under distinct stresses. For instance, the three readers—GmECT9, GmECT13, and GmECT17—were highly expressed in roots and nodules. GmMTB2 exhibited the highest upregulation in response to combined water deficit and heat stress, while GmALKBH9B2 showed the greatest downregulation. Furthermore, GmALKBH9B4 was significantly downregulated, whereas GmMTB1, GmMTB2, and GmALKBH9B1 were upregulated under soybean mosaic virus treatment. This study gives a reference framework for investigating the functional variety of soybean m6A regulatory genes at the epigenetic level.
Supporting information
S1 Fig. Chromosomal distribution of m6A regulatory genes.
All m6A regulatory genes are found to be located on different chromosomes of soybeans. The relative size of the corresponding chromosomes and the position of the respective genes could be estimated by using the scale provided left side of the figure. The chromosome numbers are provided in the middle of each bar. The straight lines connect the duplicated gene pairs.
https://doi.org/10.1371/journal.pone.0328773.s001
(PDF)
S2 Fig. Organization and distribution of the conserved motifs.
The organization and distribution of the conserved motifs in the m6A writer (A), eraser (B), and reader (C) genes. The squares in the motif represent the positions of conserved domains.
https://doi.org/10.1371/journal.pone.0328773.s002
(PDF)
S3 Fig. Cluster and hub genes identification in soybeans.
Two clusters (A and B) were identified using MCODE. C. The top 10 hub genes were extracted by the Density of Maximum Neighborhood Component method in the cytohubba plugin in Cytoscape.
https://doi.org/10.1371/journal.pone.0328773.s003
(PDF)
S4 Fig. Genome-wide network of soybean miRNA.
Genome-wide miRNA-regulated networks of the writer (A), eraser (B), and reader (C). Green nodes: miRNAs, Pink nodes: genes that may be miRNA targets, and Black edges: correlations.
https://doi.org/10.1371/journal.pone.0328773.s004
(PDF)
Acknowledgments
The authors extend their gratitude to the Department of Biochemistry and Molecular Biology at Shahjalal University of Science and Technology, Sylhet, Bangladesh, for providing lab facilities and logistic support.
References
- 1. Nawaz M, Sun J, Shabbir S, Khattak WA, Ren G, Nie X, et al. A review of plants strategies to resist biotic and abiotic environmental stressors. Sci Total Environ. 2023;900:165832. pmid:37524179
- 2. Thiebaut F, Hemerly AS, Ferreira PCG. A Role for Epigenetic Regulation in the Adaptation and Stress Responses of Non-model Plants. Front Plant Sci. 2019;10:246. pmid:30881369
- 3. Deans C, Maggert KA. What do you mean, “epigenetic”?. Genetics. 2015;199(4):887–96. pmid:25855649
- 4. Tang J, Chen S, Jia G. Detection, regulation, and functions of RNA N6-methyladenosine modification in plants. Plant Commun. 2023;4(3):100546. pmid:36627844
- 5. Wu R, Jiang D, Wang Y, Wang X. N (6)-Methyladenosine (m(6)A) Methylation in mRNA with A Dynamic and Reversible Epigenetic Modification. Mol Biotechnol. 2016;58(7):450–9. pmid:27179969
- 6. Fang X, Li M, Yu T, Liu G, Wang J. Reversible N6-methyladenosine of RNA: The regulatory mechanisms on gene expression and implications in physiology and pathology. Genes Dis. 2020;7(4):585–97. pmid:33335958
- 7. Bokar JA, Shambaugh ME, Polayes D, Matera AG, Rottman FM. Purification and cDNA cloning of the AdoMet-binding subunit of the human mRNA (N6-adenosine)-methyltransferase. RNA. 1997;3(11):1233–47. pmid:9409616
- 8. Liu J, Yue Y, Han D, Wang X, Fu Y, Zhang L, et al. A METTL3-METTL14 complex mediates mammalian nuclear RNA N6-adenosine methylation. Nat Chem Biol. 2014;10(2):93–5. pmid:24316715
- 9. Ping X-L, Sun B-F, Wang L, Xiao W, Yang X, Wang W-J, et al. Mammalian WTAP is a regulatory subunit of the RNA N6-methyladenosine methyltransferase. Cell Res. 2014;24(2):177–89. pmid:24407421
- 10. Patil DP, Chen C-K, Pickering BF, Chow A, Jackson C, Guttman M, et al. m(6)A RNA methylation promotes XIST-mediated transcriptional repression. Nature. 2016;537(7620):369–73. pmid:27602518
- 11. Yue Y, Liu J, Cui X, Cao J, Luo G, Zhang Z, et al. VIRMA mediates preferential m6A mRNA methylation in 3’UTR and near stop codon and associates with alternative polyadenylation. Cell Discov. 2018;4:10. pmid:29507755
- 12. Wen J, Lv R, Ma H, Shen H, He C, Wang J, et al. Zc3h13 Regulates Nuclear RNA m6A Methylation and Mouse Embryonic Stem Cell Self-Renewal. Mol Cell. 2018;69(6):1028-1038.e6. pmid:29547716
- 13. Barone S, Cerchia C, Summa V, Brindisi M. Methyl-Transferase-Like Protein 16 (METTL16): The Intriguing Journey of a Key Epitranscriptomic Player Becoming an Emerging Biological Target. J Med Chem. 2024;67(17):14786–806. pmid:39150226
- 14. Luo Q, Mo J, Chen H, Hu Z, Wang B, Wu J, et al. Structural insights into molecular mechanism for N6-adenosine methylation by MT-A70 family methyltransferase METTL4. Nat Commun. 2022;13(1):5636. pmid:36163360
- 15. van Tran N, Ernst FGM, Hawley BR, Zorbas C, Ulryck N, Hackert P, et al. The human 18S rRNA m6A methyltransferase METTL5 is stabilized by TRMT112. Nucleic Acids Res. 2019;47(15):7719–33. pmid:31328227
- 16. Pinto R, Vågbø CB, Jakobsson ME, Kim Y, Baltissen MP, O’Donohue M-F, et al. The human methyltransferase ZCCHC4 catalyses N6-methyladenosine modification of 28S ribosomal RNA. Nucleic Acids Res. 2020;48(2):830–46. pmid:31799605
- 17. Shen L, Liang Z, Gu X, Chen Y, Teo ZWN, Hou X, et al. N(6)-Methyladenosine RNA Modification Regulates Shoot Stem Cell Fate in Arabidopsis. Dev Cell. 2016;38(2):186–200. pmid:27396363
- 18. Bodi Z, Zhong S, Mehra S, Song J, Graham N, Li H, et al. Adenosine Methylation in Arabidopsis mRNA is Associated with the 3’ End and Reduced Levels Cause Developmental Defects. Front Plant Sci. 2012;3:48. pmid:22639649
- 19. Zhong S, Li H, Bodi Z, Button J, Vespa L, Herzog M, et al. MTA is an Arabidopsis messenger RNA adenosine methylase and interacts with a homolog of a sex-specific splicing factor. Plant Cell. 2008;20(5):1278–88. pmid:18505803
- 20. Hu J, Manduzio S, Kang H. Epitranscriptomic RNA Methylation in Plant Development and Abiotic Stress Responses. Front Plant Sci. 2019;10:500. pmid:31110512
- 21. Růžička K, Zhang M, Campilho A, Bodi Z, Kashif M, Saleh M, et al. Identification of factors required for m6 A mRNA methylation in Arabidopsis reveals a role for the conserved E3 ubiquitin ligase HAKAI. New Phytol. 2017;215(1):157–72. pmid:28503769
- 22. Jia G, Fu Y, Zhao X, Dai Q, Zheng G, Yang Y, et al. N6-methyladenosine in nuclear RNA is a major substrate of the obesity-associated FTO. Nat Chem Biol. 2011;7(12):885–7. pmid:22002720
- 23. Zheng G, Dahl JA, Niu Y, Fedorcsak P, Huang C-M, Li CJ, et al. ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. Mol Cell. 2013;49(1):18–29. pmid:23177736
- 24. Duan H-C, Wei L-H, Zhang C, Wang Y, Chen L, Lu Z, et al. ALKBH10B Is an RNA N6-Methyladenosine Demethylase Affecting Arabidopsis Floral Transition. Plant Cell. 2017;29(12):2995–3011. pmid:29180595
- 25. Xu C, Wang X, Liu K, Roundtree IA, Tempel W, Li Y, et al. Structural basis for selective binding of m6A RNA by the YTHDC1 YTH domain. Nat Chem Biol. 2014;10(11):927–9. pmid:25242552
- 26. Wang X, Lu Z, Gomez A, Hon GC, Yue Y, Han D, et al. N6-methyladenosine-dependent regulation of messenger RNA stability. Nature. 2014;505(7481):117–20. pmid:24284625
- 27. Wang X, Zhao BS, Roundtree IA, Lu Z, Han D, Ma H, et al. N(6)-methyladenosine Modulates Messenger RNA Translation Efficiency. Cell. 2015;161(6):1388–99. pmid:26046440
- 28. Edens BM, Vissers C, Su J, Arumugam S, Xu Z, Shi H, et al. FMRP Modulates Neural Differentiation through m6A-Dependent mRNA Nuclear Export. Cell Rep. 2019;28(4):845-854.e5. pmid:31340148
- 29. Roundtree IA, Luo G-Z, Zhang Z, Wang X, Zhou T, Cui Y, et al. YTHDC1 mediates nuclear export of N6-methyladenosine methylated mRNAs. Elife. 2017;6:e31311. pmid:28984244
- 30. Hsu PJ, Zhu Y, Ma H, Guo Y, Shi X, Liu Y, et al. Ythdc2 is an N6-methyladenosine binding protein that regulates mammalian spermatogenesis. Cell Res. 2017;27(9):1115–27. pmid:28809393
- 31. Scutenaire J, Deragon J-M, Jean V, Benhamed M, Raynaud C, Favory J-J, et al. The YTH Domain Protein ECT2 Is an m6A Reader Required for Normal Trichome Branching in Arabidopsis. Plant Cell. 2018;30(5):986–1005. pmid:29618631
- 32. Arribas-Hernández L, Bressendorff S, Hansen MH, Poulsen C, Erdmann S, Brodersen P. An m6A-YTH Module Controls Developmental Timing and Morphogenesis in Arabidopsis. Plant Cell. 2018;30(5):952–67. pmid:29643069
- 33. Hu J, Cai J, Xu T, Kang H. Epitranscriptomic mRNA modifications governing plant stress responses: underlying mechanism and potential application. Plant Biotechnol J. 2022;20(12):2245–57. pmid:36002976
- 34. Hou N, Li C, He J, Liu Y, Yu S, Malnoy M, et al. MdMTA-mediated m6 A modification enhances drought tolerance by promoting mRNA stability and translation efficiency of genes involved in lignin deposition and oxidative stress. New Phytol. 2022;234(4):1294–314. pmid:35246985
- 35. Hu J, Cai J, Park SJ, Lee K, Li Y, Chen Y, et al. N6 -Methyladenosine mRNA methylation is important for salt stress tolerance in Arabidopsis. Plant J. 2021;106(6):1759–75. pmid:33843075
- 36. Yang D, Xu H, Liu Y, Li M, Ali M, Xu X, et al. RNA N6-Methyladenosine Responds to Low-Temperature Stress in Tomato Anthers. Front Plant Sci. 2021;12:687826. pmid:34149789
- 37. Zhang T-Y, Wang Z-Q, Hu H-C, Chen Z-Q, Liu P, Gao S-Q, et al. Transcriptome-Wide N6-Methyladenosine (m6A) Profiling of Susceptible and Resistant Wheat Varieties Reveals the Involvement of Variety-Specific m6A Modification Involved in Virus-Host Interaction Pathways. Front Microbiol. 2021;12:656302. pmid:34122371
- 38. Dilawari R, Kaur N, Priyadarshi N, Prakash I, Patra A, Mehta S, et al. Soybean: A Key Player for Global Food Security. Soybean Improvement. Springer International Publishing; 2022. p. 1–46.
- 39. Liu P, Liu H, Zhao J, Yang T, Guo S, Chang L, et al. Genome-wide identification and functional analysis of mRNA m6A writers in soybean under abiotic stress. Front Plant Sci. 2024;15:1446591. pmid:39055358
- 40. Zhao J, Yang T, Liu P, Liu H, Zhang H, Guo S, et al. Genome-Wide Identification of the Soybean AlkB Homologue Gene Family and Functional Characterization of GmALKBH10Bs as RNA m6A Demethylases and Expression Patterns under Abiotic Stress. Plants (Basel). 2024;13(17):2491. pmid:39273973
- 41. Nishat ZS, Hasan MdS, Islam MdS, Hossain T, Ghosh A. Identification of epitranscriptomic methylation marker genes in Arabidopsis and their expression profiling in response to developmental, anatomical, and environmental modulations. Current Plant Biology. 2022;30:100247.
- 42. Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012;40(Database issue):D1178-86. pmid:22110026
- 43. Wilkins MR, Gasteiger E, Bairoch A, Sanchez JC, Williams KL, Appel RD, et al. Protein identification and analysis tools in the ExPASy server. Methods Mol Biol. 1999;112:531–52. pmid:10027275
- 44. Savojardo C, Martelli PL, Fariselli P, Profiti G, Casadio R. BUSCA: an integrative web server to predict subcellular localization of proteins. Nucleic Acids Res. 2018;46(W1):W459–66. pmid:29718411
- 45. Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y, et al. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol Plant. 2020;13(8):1194–202. pmid:32585190
- 46. Lee T-H, Tang H, Wang X, Paterson AH. PGDD: a database of gene and genome duplication in plants. Nucleic Acids Res. 2013;41(Database issue):D1152-8. pmid:23180799
- 47. Juretic N, Hoen DR, Huynh ML, Harrison PM, Bureau TE. The evolutionary fate of MULE-mediated duplications of host gene fragments in rice. Genome Res. 2005;15(9):1292–7. pmid:16140995
- 48. Islam MS, Mohtasim M, Islam T, Ghosh A. Aldehyde dehydrogenase superfamily in sorghum: genome-wide identification, evolution, and transcript profiling during development stages and stress conditions. BMC Plant Biol. 2022;22(1):316. pmid:35786175
- 49. Lu S, Wang J, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, et al. CDD/SPARCLE: the conserved domain database in 2020. Nucleic Acids Res. 2020;48(D1):D265–8. pmid:31777944
- 50. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7. pmid:15034147
- 51. Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol Biol Evol. 2021;38(7):3022–7. pmid:33892491
- 52. Lescot M, Déhais P, Thijs G, Marchal K, Moreau Y, Van de Peer Y, et al. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002;30(1):325–7. pmid:11752327
- 53. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504. pmid:14597658
- 54. Hogue CW, Groll M. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics. 2001;29:137–40.
- 55. Chin C-H, Chen S-H, Wu H-H, Ho C-W, Ko M-T, Lin C-Y. cytoHubba: identifying hub objects and sub-networks from complex interactome. BMC Syst Biol. 2014;8 Suppl 4(Suppl 4):S11. pmid:25521941
- 56. Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46(W1):W296–303. pmid:29788355
- 57. Meng EC, Goddard TD, Pettersen EF, Couch GS, Pearson ZJ, Morris JH, et al. UCSF ChimeraX: Tools for structure building and analysis. Protein Sci. 2023;32(11):e4792. pmid:37774136
- 58. Kozomara A, Birgaoanu M, Griffiths-Jones S. miRBase: from microRNA sequences to function. Nucleic Acids Res. 2019;47(D1):D155–62. pmid:30423142
- 59. Dai X, Zhuang Z, Zhao PX. psRNATarget: a plant small RNA target analysis server (2017 release). Nucleic Acids Res. 2018;46(W1):W49–54. pmid:29718424
- 60. DeMers LC, Redekar NR, Kachroo A, Tolin SA, Li S, Saghai Maroof MA. A transcriptional regulatory network of Rsv3-mediated extreme resistance against Soybean mosaic virus. PLoS One. 2020;15(4):e0231658. pmid:32315334
- 61. Adhikari S, Damodaran S, Subramanian S. Lateral Root and Nodule Transcriptomes of Soybean. Data. 2019;4(2):64.
- 62. Sinha R, Induri SP, Peláez-Vico MÁ, Tukuli A, Shostak B, Zandalinas SI, et al. The transcriptome of soybean reproductive tissues subjected to water deficit, heat stress, and a combination of water deficit and heat stress. Plant J. 2023;116(4):1064–80. pmid:37006191
- 63. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. pmid:25516281
- 64. Kapli P, Yang Z, Telford MJ. Phylogenetic tree building in the genomic age. Nat Rev Genet. 2020;21(7):428–44. pmid:32424311
- 65. Kocincová L, Jarešová M, Byška J, Parulek J, Hauser H, Kozlíková B. Comparative visualization of protein secondary structures. BMC Bioinformatics. 2017;18(Suppl 2):23. pmid:28251875
- 66. Su H, Meng L, Qu Z, Zhang W, Liu N, Cao P, et al. Genome-wide identification of the N6-methyladenosine regulatory genes reveals NtFIP37B increases drought resistance of tobacco (Nicotiana tabacum L.). BMC Plant Biol. 2024;24(1).
- 67. International Wheat Genome Sequencing Consortium (IWGSC). Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science. 2018;361(6403):eaar7191. pmid:30115783
- 68. Li F, Fan G, Lu C, Xiao G, Zou C, Kohel RJ, et al. Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution. Nature Biotechnology. 2015;33: 524–30.
- 69. Kelly LJ, Leitch AR, Clarkson JJ, Hunter RB, Knapp S, Chase MW. Intragenic recombination events and evidence for hybrid speciation in Nicotiana (Solanaceae). Mol Biol Evol. 2010;27(4):781–99. pmid:19897524
- 70. Clarindo WR, de Carvalho CR, Alves BMG. Mitotic evidence for the tetraploid nature of Glycine max provided by high quality karyograms. Plant Syst Evol. 2007;265(1–2):101–7.
- 71. Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, et al. The B73 maize genome: complexity, diversity, and dynamics. Science. 2009;326(5956):1112–5. pmid:19965430
- 72.
Shen H, Luo B, Wang Y, Li J, Hu Z, Xie Q. Genome-wide identification, classification and expression analysis of m 6 A gene family in Solanum lycopersicum. 2022.
- 73. Zhu C, Zhang S, Zhou C, Xie S, Chen G, Tian C, et al. Genome-Wide Investigation of N6-Methyladenosine Regulatory Genes and Their Roles in Tea (Camellia sinensis) Leaves During Withering Process. Front Plant Sci. 2021;12:702303. pmid:34211493
- 74. Hasan M, Nishat ZS, Hossain T, Ghosh A. Biochemistry and Biophysics Reports.
- 75. Sun X, Wu W, Yang Y, Wilson I, Shao F, Qiu D. Genome-Wide Identification of m6A Writers, Erasers and Readers in Poplar 84K. Genes (Basel). 2022;13(6):1018. pmid:35741780
- 76. Eisenberg D. The discovery of the alpha-helix and beta-sheet, the principal structural features of proteins. Proc Natl Acad Sci U S A. 2003;100(20):11207–10. pmid:12966187
- 77. Cheng P, Bao S, Li C, Tong J, Shen L, Yu H. RNA N6-methyladenosine modification promotes auxin biosynthesis required for male meiosis in rice. Dev Cell. 2022;57(2):246-259.e4. pmid:35026163
- 78. Zhou L, Tang R, Li X, Tian S, Li B, Qin G. N6-methyladenosine RNA modification regulates strawberry fruit ripening in an ABA-dependent manner. Genome Biol. 2021;22(1):168. pmid:34078442
- 79. Hou Y, Sun J, Wu B, Gao Y, Nie H, Nie Z, et al. CPSF30-L-mediated recognition of mRNA m6A modification controls alternative polyadenylation of nitrate signaling-related gene transcripts in Arabidopsis. Mol Plant. 2021;14(4):688–99. pmid:33515769