Figures
Abstract
Long noncoding RNAs (lncRNAs) are RNA fragments that generally do not code for a protein but are involved in epigenetic gene regulation. In this study, lncRNAs of Brassica rapa were classified into long intergenic noncoding RNAs, natural antisense RNAs, and intronic noncoding RNAs and their expression analyzed in relation to genome-wide 24-nt small interfering RNAs (siRNAs), DNA methylation, and histone H3 lysine 27 trimethylation marks (H3K27me3). More than 65% of the lncRNAs analyzed consisted of one exon, and more than 55% overlapped with inverted repeat regions (IRRs). Overlap of lncRNAs with IRRs or genomic regions encoding for 24-nt siRNAs resulted in increased DNA methylation levels when both were present. LncRNA did not overlap greatly with H3K27me3 marks, but the expression level of intronic noncoding RNAs that did coincide with H3K27me3 marks was higher than without H3K27me3 marks. The Brassica genus comprises important vegetables and oil seed crops grown across the world. B. rapa is a diploid (AA genome) thought to be one of the ancestral species of both B. juncea (AABB genome) and B. napus (AACC) through genome merging (allotetrapolyploidization). Complex genome restructuring and epigenetic alterations are thought to be involved in these allotetrapolyploidization events. Comparison of lncRNAs between B. rapa and B. nigra, B. oleracea, B. juncea, and B. napus showed the highest conservation with B. oleracea. This study presents a comprehensive analysis of the epigenome structure of B. rapa at multi-epigenetic levels (siRNAs, DNA methylation, H3K27me3, and lncRNAs) and identified a suite of candidate lncRNAs that may be epigenetically regulated in the Brassica genus.
Citation: Mehraj H, Shea DJ, Takahashi S, Miyaji N, Akter A, Seki M, et al. (2021) Genome-wide analysis of long noncoding RNAs, 24-nt siRNAs, DNA methylation and H3K27me3 marks in Brassica rapa. PLoS ONE 16(3): e0242530. https://doi.org/10.1371/journal.pone.0242530
Editor: Yong Pyo Lim, Chungnam National University, REPUBLIC OF KOREA
Received: November 2, 2020; Accepted: March 18, 2021; Published: March 31, 2021
Copyright: © 2021 Mehraj et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by an Open Partnership Joint Projects of JSPS Bilateral Joint Research Projects, Fund for the Promotion of Joint International Research (16KK0171) (JSPS), and Grant-in-Aid for Exploratory Research (20K21313) to R. Fujimoto.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The Brassica genus comprises vegetable and oil seed crops. The “Triangle of U” proposed the genomic relationship among six major species of the Brassica genus. Three allotetraploid species, each of which contains two complete diploid genomes derived from two different parental species, Brassica juncea L. (AABB genome, 2n = 4x = 36), B. napus L. (AACC, 2n = 4x = 38), and B. carinata L. (BBCC, 2n = 4x = 34) are derived from the natural hybridization of the diploid species, B. rapa L. (AA, 2n = 2x = 20), B. nigra L. (BB, 2n = 2x = 16), and B. oleracea L. (CC, 2n = 2x = 18) [1]. Some species in the Brassica genus show morphological divergence (termed morphotype). B. rapa includes leafy vegetables such as Chinese cabbage (var. pekinensis), pak choi (var. chinensis), and komatsuna (var. perviridis), root vegetables including turnip (var. rapa), and oilseed crops (var. oleifera) [2]. The first whole genome sequence determined in the genus Brassica was that of B. rapa [3]. Later, whole genome sequences of B. oleracea, B. nigra, B. napus, and B. juncea were determined [4–7].
Plant transcriptome analyses have revealed RNAs devoid of protein-coding potential, which are called noncoding RNAs (ncRNAs). Some ncRNAs contain exons, which potentially code for short proteins or peptides, however, experimental investigation is required to validate their functions. Two families of ncRNAs are known; long noncoding RNAs (lncRNAs) that are longer than 200 nucleotides (nt), and small RNAs (sRNAs) that are ~18–30 nt in length [8–12]. LncRNAs are classified by their position and orientation of transcription; long intergenic noncoding RNAs (lincRNAs), intronic noncoding RNAs (incRNAs) derived from introns, and natural antisense transcripts (NATs) transcribed from the complementary DNA strand of their associated genes [8–12]. Recent studies have shown that lncRNAs play crucial roles in various physiological processes such as vernalization [13–15], photoperiod-sensitive male sterility [16], red-light-mediated seedling photomorphogenesis [17], seed dormancy [18], and the transcriptional regulation of plant innate immunity [19].
Epigenetics has been defined as ‘‘the study of changes in genome expression that are mitotically and/or meiotically heritable and that do not entail a change in DNA sequence” [20]. DNA methylation and histone modification are well-known epigenetic modifications, and lncRNAs are considered to be involved in epigenetic regulation [21]. In plants, DNA methylation is established through RNA-directed DNA methylation (RdDM) [22, 23]. Plant-specific RNA POLYMERASE IV (Pol IV) and Pol V are involved in RdDM. Pol IV-transcribed ncRNAs are cleaved into 24-nucleotide small interfering RNAs (24-nt siRNAs) by DICER-LIKE 3 (DCL3), and 24-nt siRNAs are loaded into ARGONAUTE 4 (AGO4). Pol V-transcribed lncRNAs act as scaffold molecules of AGO-siRNA complexes, which recruit DOMAINS REARRANGED METHYLTRANSFERASE 2 (DRM2) for catalyzing de novo DNA methylation. In Arabidopsis thaliana two cold-induced lncRNAs (COLDAIR and COLDWRAP) regulate histone methylation by recruiting polycomb repressive complex 2 (PRC2), which catalyzes the tri-methylation of histone H3 lysine 27 (H3K27me3), to the chromatin region of FLOWERING LOCUS C (FLC) [14, 24–26].
Previous studies have identified lncRNAs associated with different lines, tissues or environmental changes in Brassica [27–33]. However, there are few reports of the association between lncRNAs and epigenetic modifications in B. rapa. In this study, we examined the DNA methylation levels and H3K27me3 levels in the region covering lncRNAs and found an association between lncRNAs and inverted repeat regions (IRRs) or 24-nt siRNAs, and association between genes overlapping with lncRNAs and different epigenetic marks (DNA methylation and H3K27me3).
Materials and methods
Plant materials and growth conditions
Six lines of B. rapa, inbred lines of RJKB-T24 (var. pekinensis) [34] and Yellow sarson (var. trilocularis), doubled haploid lines of BRA2209 (var. rapa), Homei (var. pekinensis), and Osome (var. perviridis), and a commercial F1 hybrid cultivar, ‘Harunosaiten’ (var. pekinensis) (Watanabe Seed Co., Ltd.) were used. The six lines of B. rapa were selected to compare between different varieties, and the three B. oleracea lines were included to compare between species. RJKB-24 is breeding material used to produce a commercial F1 hybrid cultivar and expresses the common agricultural traits desired for Chinese cabbage such as appropriate heading date and vernalization requirement [27, 35, 36]. Three B. oleracea cabbage (var. capitata) F1 hybrid cultivars, ‘Reiho’, ‘Matsunami’ (Ishii Seed Growers CO., LTD), and ‘Kinkei 201’ (Sakata Seed Co., Ltd.) were used.
Seeds were surface sterilized and grown on agar solidified Murashige and Skoog (MS) plates with 1% (w/v) sucrose under long day (LD) condition (16h light / 8h dark) at 21°C. Fourteen-day first and second leaves of B. rapa and 19-day first and second leaves of B. oleracea were harvested for isolation of genomic DNA or total RNA.
RJKB-T24 was used for RNA-sequencing (RNA-seq) for detection of lncRNAs. Briefly, after performing QC of the sequenced reads, putative mRNAs were identified by aligning the sequence reads to the B. rapa reference genome v1.5 using HISAT2 and then assembling transcripts with Stringtie. Assembled transcripts with a mapping code of ’u’, indicating they are intergenic but not part of the annotated reference genome, were then compared to the SwissProt database using blastx (e-value 1e-10; [37]). Transcripts with hits were classified as putative mRNAs, while transcripts with no hits were classified as putative lincRNAs. A more detailed description of this method can be found in Shea et al, 2019 [27].
DNA extraction and PCR
Genomic DNA was isolated by the Cetyl trimethyl ammonium bromide (CTAB) method [38]. The PCR reaction was performed using the following conditions; 1 cycle of 94°C for 3 min, 35 cycles of 94°C for 30 s, 55°C for 30 s, and 72°C for 1 min, and final extension at 72°C for 3 min. The PCR products were electrophoresed on 1.0% agarose gel. Primer sequences used in this study are shown in S1 Table.
RNA extraction and RT-PCR
Total RNA from the first and second leaves were isolated by SV Total RNA Isolation System (Promega Co., WI, USA). To analyze lncRNA expression, cDNA was synthesized from 500 ng total RNA using PrimeScript RT reagent Kit (Takara Bio., Shiga, JAPAN). The absence of genomic DNA contamination was confirmed by PCR using a control without reverse transcriptase. The PCR conditions were 94°C for 2 min followed by 35 cycles of 94°C for 30 s, 55°C for 30 s, and 68°C for 30 s. The primers used in this study are listed in S1 Table.
Detection of epigenetic states in lncRNA regions
To examine the epigenetic states (DNA methylation levels, H3K27me3, and 24-nt siRNA levels) of lncRNA coding regions in B. rapa, we used previous sequence data of whole genome bisulfite sequencing (WGBS) [39], chromatin immunoprecipitation sequencing (ChIP-seq) [40], and small RNA-sequencing (sRNA-seq) data [39], which were generated using samples from the same line, tissue, and developmental stages but harvested independently. Two biological replicates were used for all analyses.
The reads of WGBS were mapped to the B. rapa reference genome v.1.5 using Bowtie2 version 2.2.3 and Bismark v0.14.3, and data covering genomic regions encoding lncRNA in chromosomes A01 to A10 were extracted. In order to estimate the methylation levels of CG, CHG, and CHH (H is A, C, or T) contexts, the numbers of methylated and unmethylated reads were extracted for each cytosine position using bismark methylation extractor script with the paired-end parameter. The methylation level at each cytosine position was calculated by dividing the number of methylated cytosines (mC) reads by the total number of reads.
The reads of sRNA-seq were mapped to the B. rapa reference genome v.1.5 using Bowtie2 version 2.2.3. We classified the alignment reads by length, and the 24-nt aligned reads covering the lncRNA regions in chromosomes A01 to A10 were extracted.
The reads of ChIP-seq using anti-H3K27me3 (Millipore, 07–449) antibodies were mapped to the B. rapa reference genome v.1.5 using Bowtie2 version 2.2.3, and data covering genomic regions encoding for lncRNA in chromosomes A01 to A10 were extracted.
Comparison of putative mRNA and lncRNAs in B. rapa to other related species of Brassica
The putative mRNAs and lncRNAs were first compared to the reference genomes of B. nigra, B. oleracea, B. juncea, and B. napus by best-hit blastn (e-value 1e-10) to identify homologous regions in closely related Brassica species [6, 7, 41]. In order to parse the local High-scoring segment pair (HSP) alignments produced by blastn, genBlastA was used to produce a representative putative gene that is homologous to the query [42]. The parsed local alignments were then analyzed by a custom python script (available at http://www.github.com/danshea/lncRNA) to examine the overall alignment length and computed coverage of the aligned homologous region to the putative mRNA or lncRNA query transcript sequence. These results were then imported into R and plotted to assess overall relative coverage of the sequences among the Brassica species using Simple plot and ggbio in the Bioconductor package [43].
Gene ontology (GO) analysis
The genes covering incRNAs or NATs were used for GO analysis. The incRNAs and NATs having DNA methylation, siRNAs, or H3K27me3 were identified and their corresponding genes were also used for GO analysis using agriGO [44] following the methods described by Shimizu et al, 2014 [45]. Statistical tests for enrichment of functional terms used the hypergeometric test and false discovery rate (FDR) correction for multiple testing to a level of 1% FDR.
Results
Characterization of lncRNA in B. rapa
We identified 1,444 lincRNAs, 551 NATs, and 93 incRNAs using RNA-seq data of 14-day first and second leaves with and without four weeks of cold treatment [27]. In order to investigate the relationship between lncRNAs and epigenetic modifications or the species specificity of lncRNAs, we analyzed in more detail the RNA-seq data of the 14-day first and second leaves without cold treatment. There was no strong bias in the chromosomal distribution of expressed lncRNAs (S1 Fig). More than 65% of lncRNAs contained one exon (lincRNAs, 65.7%; NATs, 72.4%; incRNAs, 71.0%), whereas the proportion of mRNAs containing only one exon is 15.1% (Fig 1A). The mean transcript lengths of lincRNAs (725 nt) and incRNAs (779 nt) were shorter than that of NATs (1,271 nt) and mRNAs (1,305 nt) (Fig 1B). About 40% of lincRNAs were located within 2 kb of the genic regions, and about 10% of lincRNAs were located more than 20kb from the genic region (Fig 1C). In this study, we focused on lncRNAs mapped to the chromosomes A01 to A10 as previous WGBS, ChIP-seq, and sRNA-seq have omitted the placed scaffolds for their analyses [39, 40]. 763 of 1,173 (65.0%) lincRNAs, 291 of 529 (55.0%) NATs, and 66 of 92 (71.7%) incRNAs overlapped with IRRs such as transposable elements (TEs) detected by RepeatMasker, suggesting that IRRs are the source of more than half of lncRNAs in B. rapa (Fig 2).
(A) Number of exons in lncRNA. (B) Nucleotide length of lincRNA, NAT, incRNA, putative mRNA and mRNA. “a” and “b” represent significant differences by one-way ANOVA test (“*”, p<0.05; “***”, p<0.001). (C) The proportion lncRNA distances to the nearest gene.
“+” indicates with and “-” indicated without overlapping IRRs or genomic regions encoding siRNAs.
Relationship between lncRNAs and siRNA or DNA methylation
We have performed sRNA-seq using 14-day first and second leaves, which are identical developmental stages and tissues to previously analyzed samples [39], but independently harvested for this study. We identified the lncRNAs having perfect sequence identity to genomic regions encoding 24-nt siRNAs. 219 of 1,173 (18.7%) lincRNAs, 74 of 529 (14.0%) NATs, and 16 of 92 (17.4%) incRNAs in A01-A10 overlapped with unique-mapped genomic regions encoding 24-nt siRNAs, and more than 80% of the lncRNAs overlapping with genomic regions encoding 24-nt siRNAs were from regions that harbored IRRs (Fig 2). 24-nt siRNAs were mapped in a similar way to lncRNA and its 5’ and 3’ flanking regions; this mapping pattern is different from those of genic regions or IRRs (Fig 3).
The x-axis represents the target region (genic region, IRRs, putative mRNAs, or lncRNAs) and the flanking 5’ and 3’ regions. The y-axis represents the reads per million (RPM) value of 24-nt siRNA that overlaps with the corresponding target region.
We examined the whole genome DNA methylation state by WGBS using the same 14-day first and second leaves [39]. The average DNA methylation levels in regions covering lncRNAs were similar to those of the whole genome (S2 Fig). DNA methylation level in regions producing NATs was lower than those producing any of the three types of lncRNAs. Overlap of lncRNAs with IRRs or genomic regions encoding for 24-nt siRNAs was associated with increased DNA methylation levels and overlapping with both caused further increases in DNA methylation levels (Fig 4).
“+” indicates with and “-” indicates without overlapping genomic regions encoding for siRNAs or IRRs.
The DNA methylation pattern was similar in lincRNA and NAT coding regions, and DNA methylation levels in the 5’ and 3’ flanking regions were higher than in the body regions encoding lincRNAs and NATs (S3 Fig). The DNA methylation levels over the incRNA coding regions and the flanking regions did not change (S3 Fig). Overall, mapping of DNA methylation levels to lncRNA coding regions were different from those of genic regions or IRRs (S3 Fig).
We examined the influence on the lncRNA expression level when IRRs or siRNA clusters on the genome spanned the lncRNA regions. IRRs did not affect the expression levels of lncRNAs (Fig 5). The expression levels of lncRNAs covered with siRNA clusters were higher than those not covered by siRNA clusters (Fig 5).
*, p<0.05; **, p<0.01; ***, p<0.001 (Student t-test).
Relationship between lncRNAs and H3K27me3
We have examined the H3K27me3 distribution using the 14-day first and second leaf samples [40] and identified the genomic regions encoding for lncRNAs that have H3K27me3 marks. 127 of 1,173 (10.8%) lincRNAs, 83 of 529 (15.7%) NATs, and 15 of 92 (16.3%) incRNA coding genomic regions had H3K27me3 marks (S4 Fig). H3K27me3 was enriched in the transcribed region of lncRNAs, similar to the genic regions, but H3K27me3 levels in lncRNA coding genomic regions were lower than in regions coding for mRNAs (S5 Fig). The expression level of incRNAs was higher when the encoding regions had H3K27me3 than without H3K27me3 (Fig 5). In NATs and lincRNAs, there was no difference of expression level with and without H3K27me3 on their encoding regions (Fig 5).
Characterization of un-annotated genes
In 2,052 lncRNAs mapped to intergenic regions of the genome, 608 transcripts had hits (e-value < 1.0e-10) against the Swissport database using BLASTX, indicating they could be un-annotated genes. The expression levels of the putative mRNAs from these regions were similar to that of mRNA [27]. Putative mRNAs tended to have fewer exons compared with annotated mRNAs (Fig 1A). The mean transcript length of putative mRNA genes was similar to that of mRNAs (Fig 1B).
314 of 490 (64.1%) putative mRNAs in A01-A10 overlapped with IRRs (Fig 2) but the expression level of putative mRNAs overlapping with IRRs was similar to those not-overlapping with IRRs (Fig 5). Mapping of 24-nt siRNAs and DNA methylation levels to genomic regions encoding putative mRNAs were similar to that of genic regions (Fig 3, S3 Fig). The average expression level of putative mRNAs with overlapping 24-nt siRNAs was higher than that without any overlap (Fig 5). The pattern of the average of DNA methylation level in the genomic region encoding putative mRNAs and their flanking regions was similar to that to the average of the genic regions (S3 Fig). Overlapping IRRs or 24-nt siRNAs resulted in an increase in DNA methylation levels and overlapping with both causes further increase in DNA methylation levels (Fig 4).
53 of 490 (10.8%) genomic regions corresponding to putative mRNAs had H3K27me3. H3K27me3 was enriched in the genomic regions encoding for lncRNAs, especially around the transcription start site, and this was similar to the pattern in the genic region (S5 Fig). The expression levels of putative mRNAs with corresponding genomic H3K27me3 were lower than those without H3K27me3, but there was no significant difference (Fig 5).
GO analysis of incRNAs and NATs and their relationship with epigenetic states
The paired genes overlapping with incRNA and NATs were subjected to GO analysis to assess whether there is any relationship between the gene ontology and the different epigenetic marks (Fig 6). GO category “transcription and DNA-dependent” and “metabolic process” were overrepresented for the incRNA data. For NATs, GO category “oxidation reduction”, “transcription and DNA-dependent”, and “carbohydrate metabolic process” were overrepresented.
*, p<0.05; **, p<0.01; ***, p<0.001. Overrepresentation of the identified paired genes between incRNA or NATs to DNA methylation, siRNAs or H3K27me3 loci for each GO category is shown.
In relation to the paired genes that overlap with incRNA and DNA methylation, siRNA or H3K27me3 marks, DNA methylation was overrepresented for “transcription and DNA-dependent”. This indicates that the expression of these genes that overlap with incRNAs may be regulated through DNA methylation, and involved in transcription. Similarly, for NATs, H3K27me3 was overrepresented for “oxidation reduction” and “transcription and DNA-dependent”, genes overlapping with NATs and each epigenetic mark were overrepresented for “primary metabolic process” and “metabolic process”, whereas siRNA was absent for “carbohydrate metabolic process”. A detailed list of each GO analysis is presented in S2–S7 Tables.
Examination of the conservation of lncRNAs among the Brassica genus
Using the sequences of the 2,088 lncRNAs of B. rapa, a best-blast hit search against the B. nigra, B. oleracea, B. juncea, and B. napus reference genomes was conducted using GenBlastA (e-value = 1e-10) to examine the conservation of lncRNAs. B. rapa lncRNAs were most conserved in B. oleracea and moderate conservation was observed in B. nigra and B. napus. LncRNAs were less conserved in B. juncea, especially in the incRNAs (Fig 7).
We selected twelve lncRNAs that showed high sequence similarity with the B. oleracea reference genome. We tested whether these lncRNA coding genomic sequences were conserved in three commercial cultivars of cabbage (B. oleracea) by PCR using genomic DNA as template. PCR amplification of genomic regions corresponding to all twelve lncRNAs was confirmed in all three B. oleracea lines. Next, we tested by RT-PCR whether these putative lncRNAs in B. oleracea are expressed in the first and second leaves (S6 Fig). Six of twelve lncRNAs were expressed in all three cultivars. One lncRNA was expressed in two of three lines, and one lncRNA was expressed in one of three lines. The remaining four lncRNAs were not expressed or were slightly expressed in all three lines (Table 1). We also examined the variation within B. rapa species using six lines. Seven of the twelve lncRNAs were expressed in all six lines, and the remaining five lncRNA showed line-specificity (Table 1). The sequences of the twelve lncRNA are listed in (S8 Table).
Discussion
We analyzed 2,088 lncRNAs in leaves of B. rapa; their characteristics such as number of exons, length of transcripts (except for NATs), and lower expression levels were similar to those reported in other plant species [46–51]. The total number of lncRNAs seems to be similar among species [52, 53], but there tends to be low sequence conservation between different species [54]. Lower conservation of lncRNAs than mRNAs was observed between B. napus and its two ancestral species, B. rapa and B. oleracea [29, 48], even though the hybridization was relatively recent event, ~1,910–7, 180 years ago [55]. We also found a lower sequence conservation of B. rapa lncRNAs among other species of the genus Brassica (B. nigra, B. oleracea, B. juncea, and B. napus). In 12 selected highly conserved lncRNAs, we found conservation of the lncRNAs at both the sequence and transcriptional level between B. rapa and B. oleracea, but there was transcriptional variation within species agreeing with the IRR diversification of each species after the allotetraploidization of B. napus [56]. The analysis of sequence homology of lncRNAs in B. rapa with other members of the Brassica genus may deepen the understanding of the evolutionary dynamics of lncRNAs in the genus Brassica.
We found over 55% of each lncRNAs (65.0% lincRNAs, 55.0% NATs, and 71.7% incRNAs) overlapped with IRRs including TEs throughout the B. rapa genome. TEs are considered a source of siRNAs [57], and lincRNAs identified in A. thaliana, rice, and maize, were associated with 22.2%, 49.7%, and 51.5% of TEs, respectively [58]. In this study, we found 18.7% lincRNAs, 14.0% NATs, and 17.4% of incRNAs covered unique-mapped 24-nt siRNAs, and over 80% of lncRNAs covering the genomic regions encoding for 24-nt siRNAs overlapped with IRRs with perfect sequence match, suggesting that lncRNAs covering IRRs are also the likely source of siRNAs in B. rapa. For mRNA, the average expression levels of lncRNAs having 24-nt siRNAs was higher than that of lncRNAs not having 24-nt siRNAs [39]. LncRNAs and 24-nt siRNAs are known to be involved in the increase of DNA methylation, which agrees with reports of increased gene expression when DNA methylation overlaps with an exonic region [59]. The detailed mechanism is not clear, but our results also agrees that lncRNAs may be involved in this gene regulation mechanism.
The epigenetic functions of different lncRNAs have been identified in diverse organisms [21, 22], but there is limited information in B. rapa. LncRNAs show a close association with DNA methylation of the genomic region encoding the lncRNA [60], but studies have focused on the model plant species, A. thaliana [11, 61]. In this study, we found a similar level of DNA methylation in regions encoding lncRNA compared to previously described DNA methylation in the B. rapa genome [39]. Levels of DNA methylation can be positively regulated by siRNAs through the RdDM pathway [62], and Pol IV is important for biogenesis of 24-nt siRNAs [63]. Identification of long Pol IV-dependent transcripts is difficult because these transcripts can be rapidly processed to produce 24-nt siRNAs. There are several reports that detected Pol IV-dependent siRNA-precursor transcripts in A. thaliana of different lengths [63–65]. Most Pol IV-dependent transcript regions overlapped with Pol IV-dependent siRNA loci, and CHH methylation depends on the production of Pol IV-dependent transcripts [65]. However, if Pol IV-dependent transcripts are short as 30–40 nucleotides [63, 64], the lncRNAs that we identified are much longer and not likely to be Pol-IV dependent transcripts that act as siRNA-precursors.
Pol V transcripts accumulate at very low levels and have been difficult to identify. However, using RNA immunoprecipitation, Pol V-dependent lncRNAs were identified, and CHH methylation and 24-nt siRNA accumulation were shown to be restricted to Pol V transcribed regions [66]. In this study, about 13% of lncRNAs overlapped with genomic regions encoding 24-nt siRNAs, and the DNA methylation level of those regions was higher than the average. It has been reported that the median length of Pol V transcripts was 689 nt [66]. The length of the lncRNAs overlapping genomic regions encoding for 24-nt siRNAs identified in this study resembles the length of the Pol V transcript precursors.
The level of H3K27me3 plays a role in tissue specific gene expression [67]. LncRNAs are also considered to be involved in regulating histone modifications. Binding of lncRNAs to PRC2 has been observed in human/animal [68, 69], and COLDAIR and COLDWRAP have been shown to recruit the PRC2 complex to FLC during vernalization in A. thaliana [13, 14, 70, 71]. EMF2B is a component of the PRC2 complex and in rice, a mutant of emf2b was reported to lose H3K27me3 and derepress some of the lincRNAs, suggesting that expression of these lincRNAs is regulated by PRC2-mediated histone methylation of the region encoding the lncRNA [72]. We found that about 10–16% of lncRNAs overlapped with H3K27me3 regions, which is lower than the overlap with mRNAs. The pattern of H3K27me3 in lncRNA overlapping regions was similar to mRNAs; H3K27me3 accumulated in the body region of transcripts, especially around the transcription start site. However, there was no negative relationship between the presence of H3K27me3 and lncRNA expression levels, and incRNA coding regions having H3K27me3 marks showed higher expression levels than incRNA coding regions without H3K27me3 marks. H3K27me3 over lncRNA coding regions was also found in different tissues of maize, which was responsible for the regulation of tissue-specific lncRNAs expression [73]. Coincidentally, DNA methylation in intronic regions is known to influence splicing patterns [74], which may reflect epigenetic tissue specific gene regulation involving RdDM and H3K27me3. However, as our study focused on the analyses in a single tissue/developmental stage, we could not identify the lncRNAs that are transcriptionally silenced by H3K27me3, leading to the underestimation of lncRNAs with H3K27me3 marks. Further study will be required to examine the transcriptional regulation of lncRNAs by H3K27me3.
GO analysis of the genes that overlap with incRNA and NATs loci revealed that both were overrepresented in the “transcription and DNA-dependent” category. For incRNAs, all five genes were annotated as transcription factors/regulators but the function of these genes remains unknown without further analyses. However, for genes overlapping with NATs and DNA methylated loci, three of the six identified genes were annotated as beta-galactosidase that are known to be involved in pollen development and fertilization [75, 76]. Interestingly, annotated genes enriched for the “transcription and DNA-dependent” category in “NATs” contained WRKY27 and WRKY62 genes that are related to pathogen defense [77, 78]. Furthermore, a gene (MAF4) known to be involved in flowering regulation that is regulated by NATs in Arabidopsis [79] was also identified, indicating that this NAT may be regulated by DNA methylation and involved in flowering regulation in Brassica. However, further analyses such as deletion of the incRNA/NAT locus or manipulating their expression will be needed to understand the function of these lncRNAs in Brassica.
This study revealed that a small proportion of lncRNAs in B. rapa are conserved with other Brassica species. The majority of lncRNAs in B. rapa overlap with IRRs and there is some overlap with DNA methylation and 24-nt siRNAs, hinting at regulation of lncRNA expression through the RdDM pathway. Interestingly, some of the lncRNAs that overlapped with the genomic regions having H3K27me3 marks were more highly expressed, which was unexpected because of the known gene suppression activity of H3K27me3. This may indicate an unknown regulatory mechanism of lncRNAs by H3K27me3. However, the current study focuses on the genome-wide quantification of lncRNAs, 24-nt siRNAs, DNA methylation, and H3K27me3, and further quantitative analysis at the locus specific level may reveal a clearer relationship between these marks.
Epigenetic state of the genome (including DNA methylation and H3K27me3, expression of lncRNAs and 24-nt siRNAs) can dynamically change depending on the stage of development, biotic and abiotic stress conditions, and the expression of lncRNAs can also change depending on the tissue or environment and influence gene expression [80]. This study focused on the association of lncRNAs and epigenetic marks, but to understand the association of lncRNAs with agronomical traits, further exploration of lncRNAs under different tissues, environmental conditions, and cultivars (or varieties) will be needed.
Supporting information
S1 Fig. Chromosomal (A01-A10) distribution of lncRNAs and mRNAs.
Log2 FPKM of mRNAs represented in blue (positive values) and orange (negative values), and lncRNAs in green (positive values) and pink (negative values).
https://doi.org/10.1371/journal.pone.0242530.s001
(PPTX)
S2 Fig. DNA methylation level of each type of lncRNAs, putative mRNA, total, and IRRs.
“Total” represents the DNA methylation level of all regions.
https://doi.org/10.1371/journal.pone.0242530.s002
(PPTX)
S3 Fig. DNA methylation of the genic region, IRRs, or regions coding for putative mRNA and lncRNAs.
https://doi.org/10.1371/journal.pone.0242530.s003
(PPTX)
S4 Fig. Proportion of lncRNAs, putative mRNA, mRNA with (K27+) or without (K27-) H3K27me3 on encoding regions.
https://doi.org/10.1371/journal.pone.0242530.s004
(PPTX)
S5 Fig. H3K27me3 mapped to the genic region, IRRs, or encoding regions of putative mRNA or lncRNAs.
https://doi.org/10.1371/journal.pone.0242530.s005
(PPTX)
S6 Fig. RT-PCR of each identified lncRNAs in B. rapa and B. oleracea lines.
The number represents each line as follows: For B. rapa lines, 1 = RJKB-T24, 2 = Homei, 3 = Harunosaiten, 4 = BRA2209, 5 = Osome, 6 = Yellow Sarson. For B. oleracea lines, 7 = Reiho, 8 = Matsunami, 9 = Kinkei 201.
https://doi.org/10.1371/journal.pone.0242530.s006
(PPTX)
S1 Table. Sequence of primers used in this study.
https://doi.org/10.1371/journal.pone.0242530.s007
(XLSX)
S2 Table. GO function term overrepresented in genes overlapping with incRNA expressing loci (p<0.001).
https://doi.org/10.1371/journal.pone.0242530.s008
(XLSX)
S3 Table. GO function term overrepresented in genes overlapping with incRNA expressing loci and DNA methylation (p<0.001).
https://doi.org/10.1371/journal.pone.0242530.s009
(XLSX)
S4 Table. GO function term overrepresented in genes overlapping with NAT expressing loci (p<0.0001).
https://doi.org/10.1371/journal.pone.0242530.s010
(XLSX)
S5 Table. GO function term overrepresented in genes overlapping with NAT expressing loci and DNA methylation (p<0.0001).
https://doi.org/10.1371/journal.pone.0242530.s011
(XLSX)
S6 Table. GO function term overrepresented in genes overlapping with NAT expressing loci and siRNAs (p<0.0001).
https://doi.org/10.1371/journal.pone.0242530.s012
(XLSX)
S7 Table. GO function term overrepresented in genes overlapping with NAT expressing loci and H3K27me3 (p<0.0001).
https://doi.org/10.1371/journal.pone.0242530.s013
(XLSX)
S8 Table. Sequences of the identified 12 lncRNAs.
https://doi.org/10.1371/journal.pone.0242530.s014
(XLSX)
Acknowledgments
We are grateful to Ms. Tomoko Kusumi for her technical assistance throughout this project.
References
- 1. U N. Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilization. Jpn. J. Bot. 1935;7: 389–452.
- 2. Lv H, Miyaji N, Osabe K, Akter A, Mehraj H, Shea DJ, et al. The Importance of genetic and epigenetic research in the Brassica vegetables in the face of climate change. In: Kole C, editor. Genomic designing of climate-smart vegetable crops. Springer; 2020. pp. 161–255.
- 3. Wang X, Wang H, Wang J, Sun R, Wu J, Liu S, et al. The genome of the mesopolyploid crop species Brassica rapa. Nat Genet. 2011;43: 1035–1040. pmid:21873998
- 4. Liu S, Liu Y, Yang X, Tong C, Edwards D, Parkin IAP, et al. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes. Nat Commun. 2014;5: 3930. pmid:24852848
- 5. Parkin IAP, Koh C, Tang H, Robinson SJ, Kagale S, Clarke WE, et al. Transcriptome and methylome profiling reveals relics of genome dominance in the mesopolyploid Brassica oleracea. Genome Biol. 2014;15: R77. pmid:24916971
- 6. Yang J, Liu D, Wang X, Ji C, Cheng F, Liu B, et al. The genome sequence of allopolyploid Brassica juncea and analysis of differential homoeolog gene expression influencing selection. Nat Genet. 2016;48: 1225–1232. pmid:27595476
- 7. Chalhoub B, Denoeud F, Liu S, Parkin IAP, Tang H, Wang X, et al. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome. Science. 2014;345: 950–953. pmid:25146293
- 8. Karlik E, Ari S, Gozukirmizi N. LncRNAs: genetic and epigenetic effects in plants. Biotechnol Biotechnol Equip. 2019;33: 429–439.
- 9. Rai MI, Alam M, Lightfoot DA, Gurha P, Afzal AJ. Classification and experimental identification of plant long non-coding RNAs. Genomics. 2019;111: 997–1005. pmid:29679643
- 10. Mattick JS, Rinn JL. Discovery and annotation of long noncoding RNAs. Nat Struct Mol Biol. 2015;22: 5–7. pmid:25565026
- 11. Chekanova JA. Long non-coding RNAs and their functions in plants. Curr Opin Plant Biol. 2015;27: 207–216. pmid:26342908
- 12. Liu J, Wang H, Chua NH. Long noncoding RNA transcriptome of plants. Plant Biotechnol J. 2015;13: 319–328. pmid:25615265
- 13. Heo JB, Sung S. Vernalization-mediated epigenetic silencing by a long intronic noncoding RNA. Science 2011;331: 76–79. pmid:21127216
- 14. Kim DH, Sung S. Vernalization-triggered intragenic chromatin loop formation by long noncoding RNAs. Dev Cell. 2017;40: 302–312.e4. pmid:28132848
- 15. Swiezewski S, Liu F, Magusin A, Dean C. Cold-induced silencing by long antisense transcripts of an Arabidopsis Polycomb target. Nature. 2009;462: 799–802. pmid:20010688
- 16. Ding J, Lu Q, Ouyang Y, Mao H, Zhang P, Yao J, et al. A long noncoding RNA regulates photoperiod-sensitive male sterility, an essential component of hybrid rice. Proc Natl Acad Sci U S A. 2012;109: 2654–2659. pmid:22308482
- 17. Wang Y, Fan X, Lin F, He G, Terzaghi W, Zhu D, et al. Arabidopsis noncoding RNA mediates control of photomorphogenesis by red light. Proc Natl Acad Sci U S A. 2014;111: 10359–10364. pmid:24982146
- 18. Fedak H, Palusinska M, Krzyczmonik K, Brzezniak L, Yatusevich R, Pietras Z, et al. Control of seed dormancy in Arabidopsis by a cis-acting noncoding antisense transcript. Proc Natl Acad Sci U S A. 2016;113: E7846–E7855. pmid:27856735
- 19. Seo JS, Sun HX, Park BS, Huang CH, Yeh SD, Jung C, et al. ELF18-INDUCED LONG-NONCODING RNA associates with mediator to enhance expression of innate immune response genes in Arabidopsis. Plant Cell. 2017;29: 1024–1038. pmid:28400491
- 20. Wu CT, Morris JR. Genes, genetics, and epigenetics: A correspondence. Science. 2001;293: 1103–1105. pmid:11498582
- 21. Wang C, Wang L, Ding Y, Lu X, Zhang G, Yang J, et al. LncRNA structural characteristics in epigenetic regulation. Int J Mol Sci. 2017;18: 2659. pmid:29292750
- 22. Matzke MA, Kanno T, Matzke AJM. RNA-directed DNA methylation: The evolution of a complex epigenetic pathway in flowering plants. Annu Rev Plant Biol. 2015;66: 243–267. pmid:25494460
- 23. Fujimoto R, Sasaki T, Ishikawa R, Osabe K, Kawanabe T, Dennis ES. Molecular mechanisms of epigenetic variation in plants. Int J Mol Sci. 2012;13: 9900–9922. pmid:22949838
- 24. Liu ZW, Zhao N, Su YN, Chen SS, He XJ. Exogenously overexpressed intronic long noncoding RNAs activate host gene expression by affecting histone modification in Arabidopsis. Sci Rep. 2020;10: 3094. pmid:32080227
- 25. Kim DH, Xi Y, Sung S. Modular function of long noncoding RNA, COLDAIR, in the vernalization response. PLoS Genet. 2017;13: e1006939. pmid:28759577
- 26. Itabashi E, Osabe K, Fujimoto R, Kakizaki T. Epigenetic regulation of agronomical traits in Brassicaceae. Plant Cell Rep. 2018;27: 87–101. pmid:29058037
- 27. Shea DJ, Nishida N, Takada S, Itabashi E, Takahashi S, Akter A, et al. Long noncoding RNAs in Brassica rapa L. following vernalization. Sci Rep. 2019;9: 9302. pmid:31243302
- 28. Wang A, Hu J, Gao C, Chen G, Wang B, Lin C, et al. Genome-wide analysis of long non-coding RNAs unveils the regulatory roles in the heat tolerance of Chinese cabbage (Brassica rapa ssp.chinensis). Sci Rep. 2019;9: 5002. pmid:30899041
- 29. Zhang J, Wei L, Jiang J, Mason AS, Li H, Cui C, et al. Genome-wide identification, putative functionality and interactions between lncRNAs and miRNAs in Brassica species. Sci Rep. 2018;8: 4960. pmid:29563515
- 30. Huang L, Dong H, Zhou D, Li M, Liu Y, Zhang F, et al. Systematic identification of long non-coding RNAs during pollen development and fertilization in Brassica rapa. Plant J. 2018;96: 203–222. pmid:29975432
- 31. Song X, Liu G, Huang Z, Duan W, Tan H, Li Y, et al. Temperature expression patterns of genes and their coexpression with LncRNAs revealed by RNA-Seq in non-heading Chinese cabbage. BMC Genomics. 2016;17: 297. pmid:27103267
- 32. Liu X, Hao L, Li D, Zhu L, Hu S. Long non-coding RNAs and their biological roles in plants. Genomics Proteomics Bioinformatics. 2015;13: 137–147. pmid:25936895
- 33. Yu X, Yang J, Li X, Liu X, Sun C, Wu F, et al. Global analysis of cis-natural antisense transcripts and their heat-responsive nat-siRNAs in Brassica rapa. BMC Plant Biol. 2013;13: 208. pmid:24320882
- 34. Kawamura K, Kawanabe T, Shimizu M, Nagano AJ, Saeki N, Okazaki K, et al. Genetic distance of inbred lines of Chinese cabbage and its relationship to heterosis. Plant Gene. 2016;5: 1–7.
- 35. Miyaji N, Shimizu M, Miyazaki J, Osabe K, Sato M, Ebe Y, et al. Comparison of transcriptome profiles by Fusarium oxysporum inoculation between Fusarium yellows resistant and susceptible lines in Brassica rapa L. Plant Cell Rep. 2017;36: 1841–1854. pmid:28819684
- 36. Shea DJ, Shimizu M, Itabashi E, Miyaji N, Miyazaki J, Osabe K, et al. Genome re-sequencing, SNP analysis, and genetic mapping of the parental lines of commercial F1 hybrid cultivar of Chinese cabbage. Breed. Sci. 2018;68: 375–380. pmid:30100805
- 37. Kong L, Zhang Y, Ye ZQ, Lu XQ, Zhao SQ, Wei L, et al. CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic Acids Res. 2007;35(Web Server issue): W345–W349. pmid:17631615
- 38. Murray MG, Thompson WF. Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 1980;8: 4321–4326. pmid:7433111
- 39. Takahashi S, Osabe K, Fukushima N, Takuno S, Miyaji N, Shimizu M, et al. Genome-wide characterization of DNA methylation, small RNA expression, and histone H3 lysine nine di-methylation in Brassica rapa L. DNA Res. 2018;25: 511–520. pmid:29982343
- 40. Akter A, Takahashi S, Deng W, Shea DJ, Itabashi E, Shimizu M, et al. The histone modification H3 lysine 27 tri-methylation has conserved gene regulatory roles in the triplicated genome of Brassica rapa L. DNA Res. 2019;26: 433–443. pmid:31622476
- 41. Yu J, Zhao M, Wang X, Tong C, Huang S, Tehrim S, et al. Bolbase: A comprehensive genomics database for Brassica oleracea. BMC Genomics. 2013;14: 664. pmid:24079801
- 42. She R, Chu JSC, Wang K, Pei J, Chen N. genBlastA: Enabling BLAST to identify homologous gene sequences. Genome Res. 2009;19: 143–149. pmid:18838612
- 43. Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004;5: R80. pmid:15461798
- 44. Du Z, Zhou X, Ling Y, Zhang Z, Su Z. agriGO: a GO analysis toolkit for the agricultural community. Nucleic Acids Res. 2010;38: W64–W70 (2010). pmid:20435677
- 45. Shimizu M, Fujimoto R, Ying H, Pu ZJ, Ebe Y, Kawanabe T, et al. Identification of candidate genes for Fusarium yellows resistance in Chinese cabbage by differential expression analysis. Plant Mol. Biol. 2014;85: 247–257. pmid:24668026
- 46. Wang R, Zou J, Meng J, Wang J. Integrative analysis of genome-wide lncRNA and mRNA expression in newly synthesized Brassica hexaploids. Ecol Evol. 2018;8: 6034–6052. pmid:29988444
- 47. Wang H, Chung PJ, Liu J, Jang IC, Kean MJ, Xu J, et al. Genome-wide identification of long noncoding natural antisense transcripts and their responses to light in Arabidopsis. Genome Res. 2014;24: 444–453. pmid:24402519
- 48. Shen E, Zhu X, Hua S, Chen H, Ye C, Zhou L, et al. Genome-wide identification of oil biosynthesis-related long non-coding RNAs in allopolyploid Brassica napus. BMC Genomics. 2018;19: 745. pmid:30314449
- 49. Liu J, Jung C, Xu J, Wang H, Deng S, Bernad L, et al. Genome-wide analysis uncovers regulation of long intergenic noncoding RNAs in Arabidopsis. Plant Cell. 2012;24: 4333–4345. pmid:23136377
- 50. Joshi RK, Megha S, Basu U, Rahman MH, Kav NNV. Genome wide identification and functional prediction of long non-coding RNAs responsive to Sclerotinia sclerotiorum infection in Brassica napus. PLoS One. 2016;11: e0158784. pmid:27388760
- 51. Chen R, Li M, Zhang H, Duan L, Sun X, Jiang Q, et al. Continuous salt stress-induced long non-coding RNAs and DNA methylation patterns in soybean roots. BMC Genomics. 2019;20: 730. pmid:31606033
- 52. Kopp F, Mendell JT. Functional classification and experimental dissection of long noncoding RNAs. Cell. 2018;172: 393–407. pmid:29373828
- 53. Kung JTY, Colognori D, Lee JT. Long noncoding RNAs: Past, present, and future. Genetics. 2013;193: 651–669. pmid:23463798
- 54. Vandivier LE, Anderson SJ, Foley SW, Gregory BD. The conservation and function of RNA secondary structure in plants. Annu Rev Plant Biol. 2016;67: 463–488. pmid:26865341
- 55. Lu K, Wei L, Li X, Wang Y, Wu J, Liu M, et al. Whole-genome resequencing reveals Brassica napus origin and genetic loci involved in its improvement. Nat Commun. 2019;10: 1154. pmid:30858362
- 56. Sampath P, Murukarthick J, Izzah NK, Lee J, Choi H-I, Shirasawa K, et al. Genome-wide comparative analysis of 20 miniature inverted-repeat transposable element families in Brassica rapa and B. oleracea. PLoS One. 2014;9: e94499. pmid:24747717
- 57. Cho J. Transposon-derived non-coding RNAs and their function in plants. Front Plant Sci. 2018;9: 600. pmid:29774045
- 58. Wang D, Qu Z, Yang L, Zhang Q, Liu ZH, Do T, et al. Transposable elements (TEs) contribute to stress-related long intergenic noncoding RNAs in plants. Plant J. 2017;90: 133–146. pmid:28106309
- 59. Bewick AJ, Schmitz RJ. Gene body DNA methylation in plants. Curr Opin Plant Biol. 2017;36: 103–110. pmid:28258985
- 60. Di Ruscio A, Ebralidze AK, Benoukraf T, Amabile G, Goff LA, Terragni J, et al. DNMT1-interacting RNAs block gene-specific DNA methylation. Nature. 2013;503: 371–376. pmid:24107992
- 61. Au PCK, Dennis ES, Wang MB. Analysis of argonaute 4-associated long non-coding RNA in Arabidopsis thaliana sheds novel insights into gene regulation through RNA-directed DNA methylation. Genes (Basel). 2017;8: 198. pmid:28783101
- 62. Yan H, Bombarely A, Xu B, Frazier TP, Wang C, Chen P, et al. siRNAs regulate DNA methylation and interfere with gene and lncRNA expression in the heterozygous polyploid switchgrass. Biotechnol Biofuels. 2018;11: 208. pmid:30061930
- 63. Zhai J, Bischof S, Wang H, Feng S, Lee TF, Teng C, et al. A one precursor one siRNA model for Pol IV-dependent siRNA biogenesis. Cell. 2015;163: 445–455. pmid:26451488
- 64. Blevins T, Podicheti R, Mishra V, Marasco M, Wang J, Rusch D, et al. Identification of Pol IV and RDR2-dependent precursors of 24 nt siRNAs guiding de novo DNA methylation in Arabidopsis. Elife. 2015;4: e09591. pmid:26430765
- 65. Li S, Vandivier LE, Tu B, Gao L, Won SY, Li S, et al. Detection of Pol IV/RDR2-dependent transcripts at the genomic scale in Arabidopsis reveals features and regulation of siRNA biogenesis. Genome Res. 2015;25: 235–245. pmid:25414514
- 66. Böhmdorfer G, Sethuraman S, Jordan Rowley M, Krzyszton M, Hafiz Rothi M, Bouzit L, et al. Long non-coding RNA produced by RNA polymerase V determines boundaries of heterochromatin. Elife. 2016;5: e19092. pmid:27779094
- 67. Makarevitch I, Eichten SR, Briskine R, Waters AJ, Danilevskaya ON, Meeley RB, et al. Genomic distribution of maize facultative heterochromatin marked by trimethylation of H3K27. Plant Cell. 2013;25: 780–793. pmid:23463775
- 68. Tu S, Yuan GC, Shao Z. The PRC2-binding long non-coding RNAs in human and mouse genomes are associated with predictive sequence features. Sci Rep. 2017;7: 41669. pmid:28139710
- 69. Gaiti F, Degnan BM, Tanurdžić M. Long non-coding regulatory RNAs in sponges and insights into the origin of animal multicellularity. RNA Biol. 2018;15: 696–702. pmid:29616867
- 70. Tian Y, Zheng H, Zhang F, Wang S, Ji X, Xu C, et al. PRC2 recruitment and H3K27me3 deposition at FLC require FCA binding of COOLAIR. Sci Adv. 2019;5: eaau7246. pmid:31032401
- 71. Yu T, Zhu H. Long non-coding RNAs: Rising regulators of plant reproductive development. Agronomy. 2019;9: 53.
- 72. Johnson C, Conrad LJ, Patel R, Anderson S, Li C, Pereira A, et al. Reproductive long intergenic noncoding RNAs exhibit male gamete specificity and polycomb repressive complex 2-mediated repression. Plant Physiol. 2018;177: 1198–1217. pmid:29844229
- 73. Li L, Eichten SR, Shimizu R, Petsch K, Yeh CT, Wu W, et al. Genome-wide discovery and characterization of maize long non-coding RNAs. Genome Biol. 2014;15: R40. pmid:24576388
- 74. Regulski M, Lu Z, Kendall J, Donoghue MTA, Reinders J, Llaca V, et al. The maize methylome influences mRNA splice sites and reveals widespread paramutation-like switches guided by small RNA. Genome Res. 2013;23: 1651–1662. pmid:23739895
- 75. Jakobsen MK, Poulsen LR, Schulz A, Fleurat-Lessard P, Møller A, Husted S, et al. Pollen development and fertilization in Arabidopsis is dependent on the MALE GAMETOGENESIS IMPAIRED ANTHERS gene encoding a type V P-type ATPase. Genes Dev. 2005;19: 2757–2769. pmid:16291648
- 76. Wang Y, Zhang WZ, Song LF, Zou JJ, Su Z, Wu WH. Transcriptome analyses show changes in gene expression to accompany pollen germination and tube growth in Arabidopsis. Plant Physiol. 2008;148: 1201–1211. pmid:18775970
- 77. Mukhtar MS, Deslandes L, Auriac MC, Marco Y, Somssich IE. The Arabidopsis transcription factor WRKY27 influences wilt disease symptom development caused by Ralstonia solanacearum. Plant J. 2008;56: 935–947. pmid:18702671
- 78. Fukushima S, Mori M, Sugano S, Takatsuji H. Transcription Factor WRKY62 Plays a Role in Pathogen Defense and Hypoxia-Responsive Gene Expression in Rice. Plant Cell Physiol. 2016;57: 2541–2551. pmid:27837098
- 79. Zhao X, Li J, Lian B, Gu H, Li Y, Qi Y. Global identification of Arabidopsis lncRNAs reveals the regulation of MAF4 by a natural antisense RNA. Nat. Commun. 2018;9: 5056. pmid:30498193
- 80. Chen L, Zhu QH, Kaufmann K. Long non-coding RNAs in plants: emerging modulators of gene activity in development and stress responses. Planta. 2020;252: 92. pmid:33099688