Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Identification and analysis of differentially expressed long non-coding RNAs between multiparous and uniparous goat (Capra hircus) ovaries

  • Yinghui Ling ,

    Contributed equally to this work with: Yinghui Ling, Lina Xu, Long Zhu

    Roles Data curation

    Affiliations College of Animal Science and Technology, Anhui Agricultural University, Anhui Hefei, China, Local Animal Genetic Resources Conservation and Biobreeding Laboratory of Anhui Province, Anhui Hefei, China

  • Lina Xu ,

    Contributed equally to this work with: Yinghui Ling, Lina Xu, Long Zhu

    Roles Writing – review & editing

    Affiliation Institute of Plant Protection and Agro-Products Safety, Anhui Academy of Agricultural Sciences, Hefei, Anhui, China

  • Long Zhu ,

    Contributed equally to this work with: Yinghui Ling, Lina Xu, Long Zhu

    Roles Writing – original draft

    Affiliations College of Animal Science and Technology, Anhui Agricultural University, Anhui Hefei, China, Local Animal Genetic Resources Conservation and Biobreeding Laboratory of Anhui Province, Anhui Hefei, China

  • Menghua Sui,

    Roles Methodology

    Affiliations College of Animal Science and Technology, Anhui Agricultural University, Anhui Hefei, China, Local Animal Genetic Resources Conservation and Biobreeding Laboratory of Anhui Province, Anhui Hefei, China

  • Qi Zheng,

    Roles Investigation

    Affiliations College of Animal Science and Technology, Anhui Agricultural University, Anhui Hefei, China, Local Animal Genetic Resources Conservation and Biobreeding Laboratory of Anhui Province, Anhui Hefei, China

  • Wenyong Li,

    Roles Investigation

    Affiliation Key Laboratory of Embryo Development and Reproductive Regulation of Anhui Province, Fuyang Normal University, Fuyang, Anhui, China

  • Yong Liu,

    Roles Validation

    Affiliation Key Laboratory of Embryo Development and Reproductive Regulation of Anhui Province, Fuyang Normal University, Fuyang, Anhui, China

  • Fugui Fang ,

    Roles Validation

    zhangxiaorong01@163.com (ZXR); 332792872@qq.com (FFG)

    Affiliations College of Animal Science and Technology, Anhui Agricultural University, Anhui Hefei, China, Local Animal Genetic Resources Conservation and Biobreeding Laboratory of Anhui Province, Anhui Hefei, China

  • Xiaorong Zhang

    Roles Funding acquisition

    zhangxiaorong01@163.com (ZXR); 332792872@qq.com (FFG)

    Affiliations College of Animal Science and Technology, Anhui Agricultural University, Anhui Hefei, China, Local Animal Genetic Resources Conservation and Biobreeding Laboratory of Anhui Province, Anhui Hefei, China

Abstract

Long non-coding RNAs (lncRNAs) play important roles in almost all biological processes. However, there is little information on the effects of lncRNAs on ovulation and lambing rates. In the present study, we used high-throughput RNA sequencing to identify differentially expressed lncRNAs between the ovaries of multiparous (Mul) and uniparous (Uni) Anhui White goats. Among the 107,255,422 clean reads, 183,754 lncRNAs were significantly differentially expressed between the Uni and Mul. Among them, 455 lncRNAs were co-expressed between the two samples, whereas, 157,523 lncRNAs were uniquely expressed in the Uni, and 25,776 uniquely lncRNAs were expressed in the Mul. Through Cis role analysis, 24 lncRNAs were predicted to overlap with cis-regulatory elements, which involved in Progesterone-mediated oocyte maturation, Steroid biosynthesis, Oocyte meiosis, and gonadotropin-releasing hormone (GnRH) signaling pathway. These 4 pathways were related to ovulation, and the KEGG pathway analysis on target genes of the differentially expressed lncRNAs confirmed this results. In addition, 10 lncRNAs harbored precursors of 40 miRNAs, such as TCONS_00320849 related to a mature miRNA sequence, miR-365a, which was reported to be related to proliferation, were annotated in the precursor analysis of miRNAs. The present expand the understanding of lncRNA biology and contribute to the annotation of the goat genome. The study will provide a resource for lncRNA studies of ovulation and lambing.

Introduction

Long non-coding RNAs (lncRNAs) are non-coding RNA transcripts of more than 200 nucleotides in length. Initially, lncRNA was regarded as transcriptional “noise” that lacked a complete open reading frame and had no ability to encode proteins [1]. Human homologue 19 (H19), found in mice, was the first lncRNA which was discovered in mammals in 1998 [2], then many more lncRNAs were discovered in the following 20 years. Recent studies have shown that lncRNA participates in many important physiological processes, such as X chromosome inactivation [3], dosage compensation [4], and gene imprinting [5]. Additionally, lncRNAs have a significant impact on the diagnosis, treatment, and development of disease. For example, lncRNA plays an important role in epigenetics [6, 7], post-transcriptional regulation [8], maintenance of stem cell pluripotency, and regulation of gene expression during disease processes [9].

Although lncRNAs are widely found in animals, their mechanisms of action are unclear [10, 11]. Using high-throughput poly (A)-independent and strand-specific RNA-seq of rats, lncRNA expression was found to be more tissue specific than that of mRNA [12]. Knockdown of several lncRNAs in mature oocytes increases developmental rates in cattle and leads to larger blastocysts [13]. The sixth CCCTC binding-factor (CTCF) binding site (CCCTC) was identified in H19 DMR, however, had a significant methylation difference between the high- and low-fertility bulls [14]. Boulanger promised that Forkhead box L2 (Foxl2) loss of function dissociated from loss of lncRNA expression is sufficient to cause an XX female-to-male sex reversal in the goat model and, as in the mouse model, an agenesis of eyelids [15]. Ovarian hormones regulate Foxl2 expression and thereby expand the number of genes controlled by the hypothalamic–pituitary–gonad axis that ultimately dictates reproductive fitness [16]. Although lncRNA studies have been begun in several species, the research conducted into the effects of lncRNA on the reproductive rate was relatively limited. High-throughput sequencing can be used to identify the lncRNAs, and bioinformatics analysis can predict the function of lncRNAs, which would provide a basis for animal reproduction study and possibly improve the reproductive rate of goats.

Compared with other goats, Anhui White goat (AWG) is known for its higher fertility, precocious puberty, and higher leather quality. The kidding rate of AWG is 2.27–2.39 and the ewes can be in estrus all year round, which makes it to be an ideal model for the study of goat breeding traits [17]. To explore the function of lncRNAs in ovulation and lambing, we used Solexa sequencing to identify differentially expressed lncRNAs between the ovaries of multiparous and uniparous AWGs in the study, and predict the target genes of the differentially expressed lncRNAs. In addition, classified annotation, GO analysis and KEGG pathway were used to analyze the function of target genes. Through Cis role analysis, 24 lncRNAs were predicted to overlap with cis-regulatory elements, which involved in Progesterone-mediated oocyte maturation, Steroid biosynthesis, Oocyte meiosis, and gonadotropin-releasing hormone (GnRH) signaling pathway. These 4 pathways were related to ovulation, and the KEGG pathway analysis on target genes of the differentially expressed lncRNAs confirmed this results. These results will help us to further understand the role of lncRNAs in ovulation traits.

Materials and methods

Experimental animal preparation and library construction

The experimental goats in this study, Anhui White goats (AWG, an indigenous Chinese breed), were obtained from the College of Animal Science and Technology, Anhui Agricultural University, Hefei, China. Based on the long-term observation, laparoscopic and ultrasound detection, 6 target goats, 3 multiparous goats (Mul) and 3 uniparous goats (Uni), which were undergoing oestrus, were selected for their all ovaries. According to the at least 3-year records of lambing, the litter size of Mul and Uni were more than one and only one, respectively. Meanwhile, the physical condition and age, which is 3 to 4 years old, of the experimental samples were basically consistent. To further eliminate the influence of other factors, the unified management system of the field was adopted in the study farm feeding and stabling. The selected goats were killed and dissected, and both whole ovaries from each goat were collected immediately, snap-frozen in liquid nitrogen, and stored at –80°C until use. One whole ovary from each goat was used for RNA-Seq, meanwhile, the other one from the same goat was used for real-time PCR. All experimental procedures involving AWGs performed in the present study had been given prior approval by the ethics committee of Anhui Agricultural University under permit no. AHAU20101025.

The total RNA of the ovaries (12 samples) from the 6 goats was extracted using RNAiso Plus (Takara) following the manufacturer’s protocol. The quantity and quality of the total RNA were measured using the Agilent 2100 Bioanalyzer system. To minimize differences among the goats, an equal quantity (10 μg) of total RNA isolated from one ovary from the 3 individual multiparous and uniparous goats, respectively, were pooled for library preparation and Illumina sequence (The Beijing Genomics Institute). In addition, the total RNA from the other one of the ovaries from the 3 individual multiparous and uniparous goat were reverse transcribed, respectively, into cDNA using the AceQ qPCR SYBR Green Master Mix lncRNA RT-PCR Kit (Vazyme Biotech Co., Ltd.) for qRT-PCR verification.

Basic data statistics

Before further analysis, raw data filtering was needed to decrease data noise. "Raw reads" were defined as reads containing the adapter sequence, a high content of unknown bases (reads with more than 10% unknown bases), and low-quality reads (>50% low-quality bases in a read, with a low-quality base defined as a base whose sequencing quality was no more than 10). Then clean reads were mapped to the rRNA reference sequence using SOAP aligner/ SOAP2 short-read alignment software [18] to remove the remaining rRNA reads, and leave the reads that were used for transcriptome assembly and quantification. The statistical analysis of the alignment results is presented for each sample.

Assembly and novel lncRNA prediction

Reads were mapped to the genome and assembled using Cufflinks [19]. Faux-reads were then generated from reference transcripts to capture features in the reference sequence that, due to low coverage, might be missing from the sequencing data. We used a reference annotation-based assembly method to reconstruct the transcripts, while background noise was reduced by using the FPKM (fragments per kilobase of exon per million fragments mapped) value and coverage threshold. These reads were then merged with the sequenced reads for assembly. The set of transcripts generated in the last step was then compared with the reference transcripts to remove the transcripts that were approximately equivalent to the whole or a portion of a reference transcript. Through comparison with the reference, we were able to detect the novel transcripts and calculate the coding potential of these transcripts to identify the novel lncRNA [20]. After the assembly, we compared the assembled transcripts to the reference annotation by Cuffcompare, and categorized the transcripts to 12 classes (=, c, j, e, i, o, p, r, u, x, s and.). Among them, only five candidate categories of transcripts will be extracted, including 'i', 'j', 'o', 'u', and 'x', which may contain novel transcripts. The “u” category meant unknown intergenic transcript. The “x” category meant lncRNAs with exonic overlap with known transcripts, but on the opposite strand. The “i” category contained lncRNAs with transcripts falling entirely within a reference intron. The “j” category meant potentially novel isoforms (fragments) with at least one splice junction shared with the reference transcripts.

Classification and annotation of lncRNAs

The lncRNAs were also classified by their position compared with the reference gene, with different strategies used for function prediction. For antisense lncRNA, we performed free energy calculation to detect possible hybridization sites for lncRNA and mRNA, which might be RNA-RNA interactions. To further reveal potential antisense lncRNA–mRNA interactions, we searched for all antisense lncRNA-mRNA complementary base pair duplexes using RNAplex [21], a tool specially created to rapidly search for short interactions between two long RNAs. If such interactions are found upstream or downstream of a gene, we will search for lncRNAs located in unknown regions. Because lncRNA can be processed to yield small RNAs, small RNA precursors can be predicted. Recent genome-wide studies suggested that the function of a significant fraction of long non-coding transcripts may be to serve as precursors for microRNAs (miRNAs) that are usually generated via the sequential cleavage of long transcripts by the enzymes Drosha and Dicer [22, 23]. Thus, we aligned lncRNAs to miRBase [24] to detect potential pre-miRNAs by selecting those with a hit coverage greater than 90%.

The SVM-based software miRPara [25] was also used to predict probable miRNAs. In addition, Rfam divides non-coding RNAs into families based on evolution from a common ancestor. To better annotate novel lncRNAs from an evolutionary standpoint, we classified all predicted novel lncRNAs into different non-coding RNA families using INFERNAL, which categorizes non-coding RNAs and their conserved primary sequence and RNA secondary structure through the use of multiple sequence alignments, consensus secondary structure annotation, and covariance models [26]. These results will classify lncRNA families by their RNA structure and sequence similarities, helping us to reveal the potential functions of the lncRNAs.

Screening of differentially expressed lncRNAs and real-time PCR validation

Cuffdiff was used to calculate expression levels of lncRNAs in more than one condition and test them for significant differences on the basis of FPKM values. To screen differentially expressed genes from the Cuffdiff results, we followed the priority rule: STATUS = OK and P ≤ 0.05. The differentially expressed lncRNAs were screened using NOIseq [27] following the priority rule: filtering condition, log2 (ratio) ≥ 1; probability ≥ 0.8.

Real-time PCR (qPCR) was performed to validate the Solexa sequencing data. Eight differentially expressed lncRNAs between the two libraries were randomly selected for qPCR verification. The primers were designed based on the lncRNA sequences (S1 Table). After incubation at 37°C for 1 h and deactivation at 95°C for 5 min, we obtained the template for qPCR. The reaction solution of qPCR contained 2.0 μl cDNA, 10 μl AceQ qPCR SYBR Green Master Mix and ROX Reference Dye 1, 0.4 μl of each primer, and 6.8 μl ddH2O. GAPDH was used as the internal reference gene control. qPCR was performed using standard protocols on the StepOnePlus Real-Time PCR System (Thermo Fisher Scientific). The threshold cycle (CT) was obtained from each reaction, and the relative expression level of each lncRNA was evaluated using the 2-ΔΔCt method. Three biological replicates for each selected lncRNA were used. The expression levels of lncRNA in the ovaries of Mul and Uni were determined individually and the t-test was used to examine the significant of expression differences between the two samples using SAS v8.0 software.

Target genes prediction of differentially expressed lncRNAs and function analyses

To explore the functions of lncRNAs, we predicted the target genes of the differentially expressed lncRNAs through Cis role. Cis role is lncRNA action on neighboring target genes. The basic principles of Cis role of target gene is prediction the lncRNA function and the protein coding gene which near the lncRNA’s coordinates. We searched coding genes 100 kb upstream and downstream of lncRNA and then analyzed their function by functional enrichment analysis.

Gene Ontology (GO) and KEGG pathway analyses can help us to understand the biological functions of genes [28]. GO enrichment analysis of lncRNA target genes were implemented by the GOseq R package (http://www.bioconductor.org/packages/release/bioc/html/goseq.html), in which gene length bias was corrected. GO terms with corrected P-value less than 0.05 were considered significantly enriched by differential expressed genes. KEGG is database resource for understand high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information. We used KOBAS software (http://kobas.cbi.pku.edu.cn/download.php) to test the statistical enrichment of lncRNA target genes in KEGG pathways. Pathway with corrected FDR-value less than 0.05 were considered significantly enriched by differential expressed genes.

Results

Basic data statistics

In the base composition of the reads, on the X-axis, 1–90 bp represents “read 1” and 91–180 bp represents “read 2”. The A curve should overlap with the T curve, while the G curve should overlap with the C curve. In this case, the figure showed a balanced composition (S1 Fig). By comparing with the rRNA reference sequence through SOAP aligner/ SOAP2 short-read alignment software, 52,178,296 and 55,077,126 clean reads in the Uni and Mul libraries, respectively, were mapped to the genome. Only 0.02% and 0.03% of the total sequence reads were mapped to single-end in the two libraries, respectively. Additionally, only 0.01% of the total sequence reads were mapped to paired-end mapping reads in the two libraries (S2 Table). Otherwise, about 80% of the total mapped reads were mapped to the reference genome, indicating that the quality of the two libraries is good (S3 Table).

Assembly and novel lncRNA prediction

The clean reads were mapped to the genome using TopHat (Fig 1). There were 663,188 genes were identified in the gene sequencing coverage statistics of the Uni. Among them, transcript coverage on the genome of 63,140 genes was more than50%, which accounted for 9.24% of the identified genes. Meanwhile, 675,422 genes were identified in the Mul library, and 383,043 genes (accounting for 56.71%), whose transcript coverage on the genome was more than 50%. In total, 511,721 lncRNAs and 13,875 mRNAs were identified in the two libraries. Among them, 271,584 lncRNA and 13,224 mRNA were co-expressed in the two libraries, and 240,137 lncRNAs and 651 mRNAs were respectively expressed in the certain library (Table 1).

thumbnail
Fig 1. Transcript coverage stat of uniparous goats (Uni) and multiparou goats (Mul).

https://doi.org/10.1371/journal.pone.0183163.g001

thumbnail
Table 1. Expression statistic of different lncRNAs and mRNA.

https://doi.org/10.1371/journal.pone.0183163.t001

Cuffcompare categories were used to distinguish transcripts according to 12 class codes (S4 Table). Only five candidate categories of transcripts will be extracted, including “i”, “j”, “o”, “u” and “x” which may contain novel transcripts. The “u” category contained 525,553 transcripts, which meant unknown intergenic transcript lncRNAs (lincRNAs). The “x” category contained 4,053 transcripts, which meant lncRNAs with exonic overlap with known transcripts, but on the opposite strand. The “i” category contained lncRNAs with transcripts falling entirely within a reference intron. The “j” category meant potentially novel isoforms (fragments) with at least one splice junction shared with the reference transcripts, which contained 38,842 transcripts, and 102 transcripts in “o” category. Due to the complexity of the repeat region, the probability of transcript misassembly is also increased. Thus, we filtered transcripts classified with “r”, which could possibly be artificial fragments. As determined by their genomic locations with respect to nearby genes, the 568,550 lncRNAs included members of all four classes of lncRNAs, with most in the u class, which overlap unknown intergenic transcript lncRNAs. The results of the lncRNA triage were used for further analysis.

Classification and annotation of lncRNA

By examining the up/downstream lncRNAs of a gene, we found 27,228 lncRNAs had up/downstream gene. Of these, 13,069 lncRNAs were located in the antisense strand and 14,159 were located in the sense strand. Therefore, 49% of lncRNAs were downstream of the bracketing genes, and the others were upstream. These lncRNAs could overlap with cis-regulatory elements probably involved in transcriptional regulation. TCONS_00076241, TCONS_00087806, and TCONS_00076240 were upstream of XM_005686239.1 (BUB1) in the oocyte meiosis pathway. TCONS_00248477 was downstream of XM_005676195.1 (MAP3K19) in the GnRH signaling pathway. TCONS_00136407 and TCONS_00146968 were downstream of XM_005689083.1 (MOS) in the progesterone-mediated oocyte maturation pathway. TCONS_00136407 and TCONS_00146968 were downstream of XM_005689083.1 (MOS) in the oocyte meiosis pathway. MOS, BUB1, and MAP3K19 may regulate oocyte meiosis (Table 2).

We selected 65,536 lncRNAs which coverage were greater than 90% to examine the regulatory functions of the lncRNAs. We aligned lncRNAs to miRBase to determine if they could target known miRNAs: TCONS_00320849 had 16 known precursors, TCONS_00284825 had 10 known precursors, TCONS_00080255 had 7 known precursors, TCONS_00284070, TCONS_00304715, TCONS_00424188, TCONS_00533458, TCONS_00308106, TCONS_00087926, and TCONS_00205426 had 1 known precursor, respectively. Bta-mir-365-1, cfa-mir-493, and bta-mir-493 are precursors of TCONS_00284825, whereas oar-mir-493 is the precursor of TCONS_00320849, which the identities were 100%. We found that TCONS_00087926, TCONS_00304715, TCONS_00424188, and TCONS_00320849 had several novel precursors, but TCONS_00320849 was related to a mature miRNA sequence, miR-365a, which had the same mature miRNA in the miRBase database (Fig 2, S5 and S6 Tables).

thumbnail
Fig 2. The pre-miRNA and secondary structure of oar-mir-493, bta-mir-493 and bta-mir-365-1.

Note: A, known pre-miRNA and secondary structure pictures of oar-mir-493; B, known pre-miRNA and secondary structure pictures of bta-mir-493; C, known pre-miRNA and secondary structure pictures of bta-mir-365-1.

https://doi.org/10.1371/journal.pone.0183163.g002

To better annotate the novel lncRNA from an evolutionary standpoint, we classified all of the predicted novel lncRNAs into different non-coding RNA families using INFERNAL. Fifty families were obtained via the lncRNA family prediction. The C/G content was between 25% and 70%. Xist_exon1, TUG1_3, TUG1_4, SOX2OT_exon3, and SMAD5 may regulate reproduction and embryonic development [2931] (Table 3).

Differentially expressed lncRNAs and real-time PCR Validation

Compared to the Uni library, 26,187 up-regulated and 157,567 down-regulated lncRNAs were identified in the Mul library (Fig 3). A total of 183,754 lncRNAs were significantly differently expressed between the Mul and Uni with |log2Ratio|2 ≥ 1 and p-value ≤ 0.05. Among them, 455 lncRNAs, which included 188 known genes and 267 predicted transcripts, were co-expressed in the two samples. In the Mul, the highest expressed lncRNA from the co-expressed lnRNAs was XLOC-013414, which FPKM value was 5052.08, whereas, its FPKM in the Uni was 572.03. In the Uni, the highest expressed lncRNA from the co-expressed lnRNAs was 102188530, which FPKM value was 741.67, whereas, its FPKM in the Mul was 109.21. In addition, 157,523 lncRNAs were uniquely expressed in the Uni, and 25,776 uniquely lncRNAs were expressed in the Mul. The expression of uniquely expressed lncRNA was relatively low. Such as the expression of XLOC-505751, which was highest uniquely expressed in the Uni, was 60.29. And the expression of XLOC-263636, which was highest uniquely expressed in the Mul, was 70.38 (S7 Table).

thumbnail
Fig 3. Stat chart of up-down-regulated expressed genes.

https://doi.org/10.1371/journal.pone.0183163.g003

To ensure the accuracy and reliability of the RNA-seq results, eight differentially expressed lncRNAs were randomly selected for qPCR to confirm the expression changes. The results showed good consistency between the RNA-seq results and the qPCR results (Fig 4).

thumbnail
Fig 4. Real-time PCR results of randomly selected differentially expressed lncRNAs.

Note: X-axis represents selected 8 differential expressed transcripts in two libraries. Here, GAPDH was chosen as the reference gene. Relative expression value per selected transcripts between uniparous goats (Uni) and multiparous goats (Mul) samples was calculated (y-axis). Superscript letters indicate significant difference at the level of 0.05.

https://doi.org/10.1371/journal.pone.0183163.g004

Function analyses of target genes of differentially expressed lncRNAs

The target genes of the differentially expressed lncRNAs were predicted through Cis role. After searching the coding genes 100 kb upstream and downstream of lncRNA, we analyzed the function of the target genes through GO and KEGG enrichment analysis. In total, there were 32,627 genes were predicted as the target genes of differentially lncRNAs between the Uni and Mul (S8 and S9 Tables).

Based on the gene background, there were 439, 432, and 407 target genes clustered into 141, 198, and 868 GO terms in component, function and process ontology, respectively. The GO terms “Cell part”, “Binding” and “Cellular process” were the biggest terms, which included the most target genes, in the above ontologies, respectively. However, compared with the reference gene background, no GO term was significantly enriched (p-value ≤ 0.05). The “Phosphatidylinositol binding” and “Cyclic-nucleotide phosphodiesterase activity” in function ontology, and “Cholesterol metabolic process”, “Cell aging”, “Aging”, “Steroid metabolic process” in process ontology were the only GO terms with corrected p-value < 1 (S10 Table).

According to the KEGG pathway analysis, a total of 875 target genes, which showed 22,885 background genes, of the differentially expressed lncRNAs between the Uni and Mul, were annotated for 236 pathways. Among them, 12 pathways, “mTOR signaling pathway (ko04150)”, “Measles (ko05162)”, “Olfactory transduction (ko04740)”, “Regulation of autophagy (ko04140)”, “Insulin signaling pathway (ko04910)”, “RNA degradation (ko03018)”, “RNA polymerase (ko03020)”, “Nucleotide excision repair (ko03420)”, “Adipocytokine signaling pathway (ko04920)”, “Fatty acid biosynthesis (ko00061)” “Purine metabolism (ko00230)”, and “SNARE interactions in vesicular transport (ko04130)”, were significantly enriched (p-value ≤ 0.05). In addition, five pathways, “Progesterone-mediated oocyte maturation (ko04914) (p-value = 0.1484)”, “Steroid biosynthesis (ko00100) (p-value = 0.1511)”, “Oocyte meiosis (ko04114) (p-value = 0.1963)”, “GnRH signaling pathway (ko04912) (p-value = 0.5711)”, and “Steroid hormone biosynthesis (ko00140) (p-value = 0.2323)” were related to the animal reproduction [3234] (S11 Table).

Discussion

In the study, we used RNA-seq to study the lncRNAs expressed in the ovaries of multiparous (Mul) and uniparous (Uni) goat to explore the function of lncRNAs in ovulation and lambing. In total, 107,255,422 clean reads were obtained from the two libraries, and 52,178,296 and 55,077,126 clean reads in the Uni and Mul libraries, respectively, were mapped to the genome. However, a small part of the lncRNA sequence still could not be aligned to the target genome and located to a specific chromosome. Because the sequence was broken up into smaller pieces called segments, with the software inferring that the read spans a splice junction and estimating the location of the splice sites of the junction. On one hand, we believed that this shortcoming may be because the reference goat genome is still not perfect, with more pieces to be spliced. On the other hand, there may be post-transcriptional RNA editing, resulting in inconsistencies between the lncRNA sequence and the corresponding DNA sequence. Some lncRNAs highly expressed in the ovary may lack a classical poly (A) tail [35]. In the renal medulla, 28% of lncRNAs were found to lack a classical poly (A) tail [12]. Strand-specific sequences rather than 3’-directed cDNA sequences were used in this study. Strand-specific sequences help to determine the polarity of transcripts and improve the accuracy of the quantification of antisense transcripts. When lncRNA and mRNA were isolated from total RNA, the ribo-depleted method was used instead of the poly (A) enrichment method. Large-scale bioinformatic studies suggest that a significant fraction (>24%) of long non-coding transcripts present in cells may lack a classical poly (A) tail.

In this study, we searched for lncRNAs located in unknown regions if they could be found upstream or downstream of a gene to predict the function of the lncRNA. TCONS_00076241, TCONS_00087806, and TCONS_00076240 were upstream of XM_005686239.1 (BUB1). TCONS_00248477 was downstream of XM_005676195.1 (MAP3K19). TCONS_00136407 and TCONS_00146968 were downstream of XM_005689083.1 (MOS). Studies have shown that XM_005689083.1 and XM_005686239.1 play a role in meiosis I in the oocyte [36]. XM_005676195.1 may regulate GnRH in the GnRH signaling pathway. Bta-mir-365-1, cfa-mir-493 [37], and bta-mir-493 [38] are precursors of TCONS_00284825 and oar-mir-493 [39] is a precursor of TCONS_00320849 in pre-miRNA prediction. TCONS_00320849 has a mature miRNA sequence, miR-365a, with the same mature sequence in the miRBase database. miR-365 can regulate proliferation by mediating the expression of KRAS [40]. miR-365 regulates cell cycle progression and apoptosis in various types of tumors [41]. Thus TCONS_00136407, TCONS_00146968, and TCONS_00320849 may also play roles in meiosis in the oocyte.

To further detect the lncRNAs, which may participate in the ovulation and lambing of goat, we analyzed the differentially expressed lncRNAs between the Uni and Mul. A total of 183,754 lncRNAs were significantly differently expressed between the two libraries. Among them, 455 lncRNAs, which included 188 known genes and 267 predicted transcripts, were co-expressed between the two samples. In addition, 157,523 lncRNAs were uniquely expressed in the Uni, and 25,776 lncRNAs were uniquely expressed in the Mul. Accordingly, we analyzed the lncRNAs which have high expression levels and were significantly difference expressions between the Uni and Mul libearies. XLOC_013414 was differentially expressed lncRNA with the highest expression level in the Uni library, whereas 102188530 (COL1A1) had the highest expression in the Mul library. The function of XLOC_013414 is unknown but COL1A1 polymorphisms may individually play minor roles in osteoporosis and fracture [42]. Meanwhile, XLOC_500835, XLOC_075014, XLOC_523784 were significantly differentially expressed novel lncRNA between the Uni and the Mul. And their target genes were KIAA0368, CNGA3 and MID2, respectively. KIAA0368 as selectively up-regulated by contact allergene, may be potential markers of skin irritation and allergy [43]. Achromatopsia can be caused by mutations in the CNGA3 gene [44]. MID2 leads to misregulation of microtubule organization and the downstream disease pathology associated with X-linked intellectual disabilities [45]. In consideration of the specific and higher expression levels of XLOC_013414 and COL1A1, some further research of their roles in the ovary is warranted.

According to the KEGG pathway analysis, a total of 875 target genes, which showed 22,885 background genes, of the differentially expressed lncRNAs between the Uni and Mul were annotated to 236 pathways. Among them, 12 pathways were significantly enriched (p-value ≤ 0.05). In addition, five pathways, “Progesterone-mediated oocyte maturation”, “Steroid biosynthesis”, “Oocyte meiosis”, “GnRH signaling pathway”, and “Steroid hormone biosynthesis” were related to the animal reproduction. An increased FSH secretion level will lead to multiple follicular development and ovulation in the follicular phase [4648]. In addition, Neat1 knockout mice fail to become pregnant despite normal ovulation [49]. Ovarian hormones can regulate the expression of Foxl2 to expand the number of genes controlled by the hypothalamic–pituitary–gonadal axis, ultimately dictating reproductive fitness [16].

In summary, the differentially expressed profile of lncRNAs was successfully established in the Uni and Mul phase libraries. Through Cis role analysis, 24 lncRNAs were predicted to overlap with cis-regulatory elements, which involved in Progesterone-mediated oocyte maturation, Steroid biosynthesis, Oocyte meiosis, and gonadotropin-releasing hormone (GnRH) signaling pathway. Meanwhile, the KEGG pathway analysis on target genes of the differentially expressed lncRNAs confirmed this results. In addition, 10 lncRNAs harbored precursors of 40 miRNAs, which was reported to be related to proliferation, were annotated in the precursor analysis of miRNAs. The present expand the understanding of lncRNA biology and will provide a resource for lncRNA studies of ovulation and lambing.

Supporting information

S1 Fig. Base composition analysis of sample Uni and Mul.

https://doi.org/10.1371/journal.pone.0183163.s001

(JPG)

S1 Table. Summary of lncRNA primers sequences for the RT-PCR.

https://doi.org/10.1371/journal.pone.0183163.s002

(XLSX)

S2 Table. Statistics of clean data mapping to rRNA by soap.

https://doi.org/10.1371/journal.pone.0183163.s003

(XLSX)

S3 Table. Statistics of clean data without rRNA mapping to genome by tophat.

https://doi.org/10.1371/journal.pone.0183163.s004

(XLSX)

S4 Table. Cuffcompare category to distinguish transcripts by 12 symbols.

https://doi.org/10.1371/journal.pone.0183163.s005

(XLS)

S5 Table. Result of comparison with known pre-miRNA.

https://doi.org/10.1371/journal.pone.0183163.s006

(XLS)

S7 Table. The differentially expressed lncRNAs between the two libraries.

https://doi.org/10.1371/journal.pone.0183163.s008

(XLSX)

S8 Table. The protein-coding genes within 100 k downstream of an lncRNA.

https://doi.org/10.1371/journal.pone.0183163.s009

(XLSX)

S9 Table. The protein-coding genes within 100 k upstream of an lncRNA.

https://doi.org/10.1371/journal.pone.0183163.s010

(XLSX)

S10 Table. 10 GO analysis for the target genes of the differentially expressed lncRNAs.

https://doi.org/10.1371/journal.pone.0183163.s011

(XLSX)

S11 Table. KEGG pathways for the target genes of the differentially expressed lncRNAs.

https://doi.org/10.1371/journal.pone.0183163.s012

(XLS)

Acknowledgments

This research was supported by the National Natural Science Foundation of China (31372310, 31472096), the Natural Science Foundation of Anhui Province (1708085MC61), the Key Research Projects of Natural Science in Anhui Colleges and Universities (KJ2017A334) and the Agricultural Science and Technology Innovation Program of China (ASTIP-IAS13).

References

  1. 1. Wilusz JE, Sunwoo H, Spector DL. Long non-coding RNAs: functional surprises from the RNA world. Genes Dev. 2009;23:1494–1504. pmid:19571179
  2. 2. Pachnis V, Brannan CI, Tilghman SM. The structure and expression of a novel gene activated in early mouse embryogenesis. EMBO J. 1998;7:673–681.
  3. 3. Yue M, Charles Richard JL, Ogawa Y. Dynamic interplay and function of multiple noncoding genes governing X chromosome inactivation. Biochim Biophys Acta. 2016;1859(1):112–120. pmid:26260844
  4. 4. Lakhotia SC. Divergent actions of long noncoding RNAs on X-chromosome remodelling in mammals and Drosophila achieve the same end result: dosage compensation. J Genet. 2015;94(4): 575–584. pmid:26690511
  5. 5. Eswards CA, Ferguson-smith AC. Mechanisms regulation imprinted genes in clusters. Curr Opin Cell Boil. 2007;19(3):281–289.
  6. 6. Daniel JA, Pray-grant MG, Grant PA. Effector proteins for methylated histones: an expanding family. Cell Cycle. 2005;4:919–926. pmid:15970672
  7. 7. Shi Y, Lan F, Matson C, Mulligan P, Whetstine JR, Cole PA, et al. Histone demethylation mediated by the nuclear amine oxidase homolog LSD1. Cell. 2004;119:941–953. pmid:15620353
  8. 8. Abdelmohsen K, Panda A, Kang MJ, Xu J, Selimyan R, Yoon JH, et al. Senescence-associated lncRNAs: senescence-associated long non-coding RNAs. Aging Cell. 2013;12(5):890–900. pmid:23758631
  9. 9. Kretz M, Siprashvili Z, Chu C, Webster DE, Zehnder A, Qu K, et al. Control of somatic tissue differentiation by the long non-coding RNA TINCR. Nature. 2013;493:231–235. pmid:23201690
  10. 10. Sánchez Alvarado A, Yamanaka S. Rethinking differentiation: stem cells, regeneration, and plasticity. Cell. 2014;157:110–119. pmid:24679530
  11. 11. Bassett AR, Akhtar A, Barlow DP, Bird AP, Brockdorff N, Duboule D, et al. Considerations when investigating lncRNA function in vivo. Elife. 2014;3:e03058. pmid:25124674
  12. 12. Wang F, Li L, Xu H, Liu Y, Yang C, Cowley AW Jr, et al. Characteristics of Long Non-coding RNAs in the Brown Norway Rat and Alterations in the Dahl Salt-Sensitive Rat. Sci Rep. 2014;4:7146. pmid:25413633
  13. 13. Caballero J, Gilbert I, Fournier E, Gagné D, Scantland S, Macaulay A, et al. Exploring the function of long non-coding RNA in the development of bovine early embryos. Reprod Fertil Dev. 2014;27(1):40–52. pmid:25472043
  14. 14. Jena SC, Kumar S, Rajput S, Roy B, Verma A, Kumaresan A, et al. Differential methylation status of IGF2-H19 locus does not affect the fertility of crossbred bulls but some of the CTCF binding sites could be potentially important. Mol Reprod Dev. 2014;81(4):350–362. pmid:24436206
  15. 15. Boulanger L, Pannetier M, Gall L, Allais-Bonnet A, Elzaiat M, Le Bourhis D, et al. FOXL2 Is a Female Sex-Determining Gene in the Goat. Curr Biol. 2014;24(4):404–408. pmid:24485832
  16. 16. Herndon MK, Nilson JH. Maximal expression of foxl2 in pituitary gonadotropes requires ovarian hormones. PLoS One. 2015;10(5):e0126527. pmid:25955311
  17. 17. Ling Ying-Hui, Ren Chun-Huan, Guo Xiao-Fei, Xu Li-Na, Huang Ya-Feng, Luo Jian-Chuan, et al. Identification and characterization of microRNAs in the ovaries of multiple and uniparous goats (Capra hircus) during follicular phase. BMC Genomics. 2014;15:339. pmid:24886377
  18. 18. Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K, et al. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics. 2009;25(15):1966–1967. pmid:19497933
  19. 19. Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature biotechnology. 2010;28(5):511–515. pmid:20436464
  20. 20. Kong L, Zhang Y, Ye ZQ, Liu XQ, Zhao SQ, Wei L, et al. CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic acids research. 2007;35(suppl 2):345–349.
  21. 21. Carrieri C, Cimatti L, Biagioli M, Beugnet A, Zucchelli S, Fedele S, et al. Long non-coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat. Nature. 2012,491(7424):454–457. pmid:23064229
  22. 22. Cai X, Hagedorn CH, Cullen BR. Human microRNAs are processed from capped, polyadenylated transcripts that can also function as mRNAs. RNA. 2004;10(12):1957–66. pmid:15525708
  23. 23. Lee YS, Nakahara K, Pham JW, Kim K, He Z, Sontheimer EJ, et al. Distinct roles for Drosophila Dicer-1 and Dicer-2 in the siRNA/miRNA silencing pathways. Cell. 2004;117(1):69–81. pmid:15066283
  24. 24. Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ. miRBase: microRNA sequences, targets and gene nomenclature. Nucleic acids research. 2006;34 (Database issue):140–144.
  25. 25. Wu Y, Wei B, Liu H, Li T, Rayner S. MiRPara: a SVM-based software tool for prediction of most probable microRNA coding regions in genome scale sequences. BMC bioinformatics. 2011;12(1):107.
  26. 26. Nawrocki EP, Kolbe DL, Eddy SR. Infernal 1.0: inference of RNA alignments. Bioinformatics. 2009;25(10):1335–1337. pmid:19307242
  27. 27. Tarazona S, García-Alcalde F, Dopazo J, Ferrer A, Conesa A. Differential expression in RNA-seq: a matter of depth. Genome research. 2011;21(12):2213–2223. pmid:21903743
  28. 28. Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, et al. KEGG for linking genomes to life and the environment. Nucleic Acids Research. 2008;36(Database issue):480–484.
  29. 29. Gayen Srimonta, Maclary Emily, Hinten Michael, and Kalantry Sundeep. Sex-specific silencing of X-linked genes by Xist RNA [J]. Proc Natl Acad Sci U S A. 2016,113(3):E309–E318. pmid:26739568
  30. 30. Amaral PP, Neyt C, Wilkins SJ, Askarian-Amiri ME, Sunkin SM, et al. Complex architecture and regulated expression of the Sox2ot locus during vertebrate development [J]. RNA. 2009,15(11):2013–27. pmid:19767420
  31. 31. Hadjimichael Christiana, Nikolaou Christoforos, Papamatheakis Joseph, and Kretsovali Androniki. MicroRNAs for Fine-Tuning of Mouse Embryonic Stem Cell Fate Decision through Regulation of TGF-β Signaling [J]. Stem Cell Reports. 2016,6(3):292–301. pmid:26876669
  32. 32. Lutz LB, Cole LM, Gupta MK, Kwist KW, Auchus RJ, et al. Evidence that androgens are the primary steroids produced by Xenopus laevis ovaries and may signal through the classical androgen receptor to promote oocyte maturation. Proc Natl Acad Sci U S A. 2001;98(24):13728–33. pmid:11707587
  33. 33. Luan X, Liu D, Cao Z, Luo L, Liu M, et al. Transcriptome profiling identifies differentially expressed genes in Huoyan goose ovaries between the laying period and ceased period [J]. PLoS One. 2014;9(11):e113211. pmid:25419838
  34. 34. Richards JS, Pangas SA. The ovary: basic biology and clinical implications [J]. J Clin Invest, 2010;120(4):963–972. pmid:20364094
  35. 35. Carmichael Gordon, Yang Li, Chen Ling-Ling. A new class of intron-derived long noncoding RNAs. The FASEB Journal. 2012;26:203.
  36. 36. Bernhardt ML, Kim AM, O'Halloran TV, Woodruff TK. Zinc Requirement During Meiosis I–Meiosis II Transition in Mouse Oocytes Is Independent of the MOS-MAPK Pathway. Biol Reprod. 2011;84 (3):526–36. pmid:21076080
  37. 37. Zhou W, Zhang C, Jiang H, Zhang Z, Xie L, He X. MiR-493 suppresses the proliferation and invasion of gastric cancer cells by targeting RhoC. Iran J Basic Med Sci. 2015;18(10):1027–1033. pmid:26730339
  38. 38. Strozzi F, Mazza R, Malinverni R, Williams JL. Annotation of 390 bovine miRNA genes by sequence similarity with other species. Anim Genet. 2009;40(1): 125.
  39. 39. Caiment F, Charlier C, Hadfield T, Cockett N, Georges M, Baurain D. Assessing the effect of the CLPG mutation on the microRNA catalog of skeletal muscle using high-throughput sequencing. Genome Res. 2010;20(12):1651–1662. pmid:20944086
  40. 40. Tesfaye D, Worku D, Rings F, Phatsara C, Tholen E, Schellander K, et al. Identification and expression profiling of microRNAs during bovine oocyte maturation using heterologous approach. Mol Reprod Dev. 2009;76(7): 665–677. pmid:19170227
  41. 41. Akbari Moqadam F, Boer JM, Lange-Turenhout EA, Pieters R, den Boer ML. Altered expression of miR-24, miR-126 and miR-365 does not affect viability of childhood TCF3-rearranged leukemia cells. Leukemia. 2014;28(5):1008–1014. pmid:24153013
  42. 42. Xie P, Liu B, Zhang L, Chen R, Yang B, Dong J, et al. Association of COL1A1 polymorphisms with osteoporosis: a meta-analysis of clinical studies. Int J Clin Exp Med. 2015;8(9):14764–1481. pmid:26628959
  43. 43. Corsini E, Sheasgreen J, Marinovich M, Galli CL. Use of differential display-polymerase chain reaction to identify genes selectively modulated by chemical allergens in reconstituted human epidermis. Toxicol In Vitro. 2002;16(4):427–431. pmid:12110282
  44. 44. Moskowitz A, Hansen RM, Akula JD, Eklund SE, Fulton AB. Rod and rod-driven function in achromatopsia and blue cone monochromatism. Invest Ophthalmol Vis Sci. 2009;50:950–958. pmid:18824728
  45. 45. Gholkar AA, Senese S, Lo YC, Vides E, Contreras E, et al. The X-Linked-Intellectual-Disability-Associated Ubiquitin Ligase Mid2 Interacts with Astrin and Regulates Astrin Levels to Promote Cell Division. Cell Rep. 2016;14(2):180–188. pmid:26748699
  46. 46. Messinis IE, Templeton AA. The importance of follicle-stimulating hormone increase for folliculogenesis. Hum Reprod. 1990;5(2):153–156. pmid:2108988
  47. 47. van der Meer M, Hompes PG, Scheele F, Schoute E, Veersema S, Schoemaker J. Follicle stimulating hormone (FSH) dynamics of low dose step-up ovulation induction with FSH in patients with polycystic ovary syndrome. Hum Reprod. 1994;9(9):1612–1617. pmid:7836509
  48. 48. Ganguly A, Meur SK, Ganguly I. Changes in circulatory FSH of Barbari goats following treatment with high molecular weight inhibin isolated from buffalo follicular fluid. Res Vet Sci. 2013;95(2):374–380. pmid:23602073
  49. 49. Nakagawa Shinichi, Shimada Masayuki, Yanaka Kaori. The lncRNA Neat1 is required for corpus luteum formation and the establishment of pregnancy in a subpopulation of mice. The Company of Biologists Ltd|Development. 2014;141(23):4618–4627. pmid:25359727