Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Expression of microRNAs in Horse Plasma and Their Characteristic Nucleotide Composition

  • Seungwoo Lee,

    Affiliation BK21 Plus Program for Creative Veterinary Science Research, Research Institute for Veterinary Science and College of Veterinary Medicine, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea

  • Seungwoo Hwang,

    Affiliation Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, 34141, South Korea

  • Hee Jeong Yu,

    Affiliation BK21 Plus Program for Creative Veterinary Science Research, Research Institute for Veterinary Science and College of Veterinary Medicine, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea

  • Dayoung Oh,

    Affiliation BK21 Plus Program for Creative Veterinary Science Research, Research Institute for Veterinary Science and College of Veterinary Medicine, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea

  • Yu Jung Choi,

    Affiliation BK21 Plus Program for Creative Veterinary Science Research, Research Institute for Veterinary Science and College of Veterinary Medicine, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea

  • Myung-Chul Kim,

    Affiliation BK21 Plus Program for Creative Veterinary Science Research, Research Institute for Veterinary Science and College of Veterinary Medicine, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea

  • Yongbaek Kim,

    Affiliation BK21 Plus Program for Creative Veterinary Science Research, Research Institute for Veterinary Science and College of Veterinary Medicine, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea

  • Doug-Young Ryu

    dyryu@snu.ac.kr

    Affiliation BK21 Plus Program for Creative Veterinary Science Research, Research Institute for Veterinary Science and College of Veterinary Medicine, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, South Korea

Abstract

MicroRNAs (miRNAs) in blood plasma are stable under high levels of ribonuclease activity and could function in tissue-to-tissue communication, suggesting that they may have distinctive structural characteristics compared with non-circulating miRNAs. In this study, the expression of miRNAs in horse plasma and their characteristic nucleotide composition were examined and compared with non-plasma miRNAs. Highly expressed plasma miRNA species were not part of the abundant group of miRNAs in non-plasma tissues, except for the eca-let-7 family. eca-miR-486-5p, -92a, and -21 were among the most abundant plasma miRNAs, and their human orthologs also belong to the most abundant group of miRNAs in human plasma. Uracil and guanine were the most common nucleotides of both plasma and non-plasma miRNAs. Cytosine was the least common in plasma and non-plasma miRNAs, although levels were higher in plasma miRNAs. Plasma miRNAs also showed higher expression levels of miRNAs containing adenine and cytosine repeats, compared with non-plasma miRNAs. These observations indicate that miRNAs in the plasma have a unique nucleotide composition.

Introduction

MicroRNAs (miRNAs) represent a class of small (~22 nt) noncoding RNAs, which regulate gene expression by binding to specific mRNA targets and promoting their degradation and/or translational inhibition [1, 2]. miRNAs recognize many mRNAs with partial complementarity, mostly involving a “seed” region that encompasses residues 2–8 from the 5′-end, or 5′2–5′8.

The genome of the horse (Equus caballus) has 64 chromosomes (2n = 64) containing 2.7 Gb with 20,000 protein-coding genes [3]. To date, 690 mature horse miRNA species have been identified (http://www.mirbase.org/, accessed May 2015) in various tissues including the liver, colon, muscle, cartilage, subchondral bone, and ovarian follicle as well as cord blood-derived mesenchymal stromal cells [48].

Uracil and cytosine are typically over- and under-represented in mammalian miRNAs, respectively [913]. Analyses of position-dependent nucleotide composition have demonstrated that uracil predominates at both ends of mammalian miRNAs and at the 5′9 position [913], while depletions of guanine and cytosine nucleotides are observed at 5′- and 3′- ends of miRNAs, respectively [9, 13]. However, the guanine–cytosine content of the seed region is higher than those of other regions [14].

Many studies have detected miRNAs in a variety of extracellular biological fluids such as plasma or serum, which contain miRNAs mostly derived from blood and endothelial cells [1517]. Tissue-specific miRNAs for example, from the liver, are also represented, indicating that circulating miRNAs have various origins [1821]. Although the biological function and origin of plasma miRNAs remain to be determined, some plasma miRNA species could be involved in inter-cellular and inter-tissue signal communication [22]. Circulating miRNAs in the plasma are protected from degradation despite the presence of high ribonuclease activities [16, 23, 24].

We hypothesized that plasma miRNAs may have unique structural characteristics compared with intracellular miRNAs, such as nucleotide composition, to ensure stability and function in their distinctive environment. To investigate this, we examined the expression and nucleotide composition of plasma miRNAs in the horse, and compared them with those of other horse tissues.

Materials and Methods

Horses

All animal experiments were approved by the Institutional Animal Care and Use Committees of Seoul National University (Approval #SNU–131218–1) and performed in accordance with institutional guidelines on animal use in research and with the EU Directive 2010/63/EU for animal experiments. Three healthy geldings, 21–25 years of age, were fasted in individual stalls without exercise for 12 h. Blood samples were then collected from the jugular vein into conical tubes containing ethylenediaminetetraacetic acid (Sigma-Aldrich, St. Louis, MO). Immediately after collection, each blood sample was gently inverted several times and placed on ice. Plasma was separated by centrifuging at 1500 ×g for 20 min and stored at −80°C until further use.

Library preparation and sequencing

Total RNA was extracted from plasma using Trizol reagent (Invitrogen, Carlsbad, CA) according to the manufacturer’s instructions. Residual DNA was removed with DNase I (Takara Bio, Shiga, Japan) using the manufacturer’s guidelines. The RNA integrity number was used to assess integrity with the Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA). Total RNA with an RIN value greater than 7 was separated on a 15% denaturing polyacrylamide gel, and small RNAs ranging from 18 to 32 nucleotides were gel purified.

The small RNA library was prepared with the TruSeq Small RNA Sample Prep Kit (Illumina, San Diego, CA) according to the manufacturer’s instructions. Briefly, the adapters (5′ adapter, 5′-GUUCAGAGUUCUACAGUCCGACGAUC-3′ and 3′ adapter, 5′-UCGUAUGCCGUCUUCUGCUUGUidT-3′) were ligated to both ends of the purified small RNAs and a reverse-transcription reaction was performed to generate single-stranded cDNA using the primer 5′-CAAGCAGAAGACGGCATACGA-3′. cDNA was then PCR amplified using the primer pair forward primer, 5′-AATGATACGGCGACCACCGACAGGTTCAGAGTTCTACAGTCCGA-3′ and reverse primer, 5′-CAAGCAGAAGACGGCATACGA-3′. The amplified cDNA constructs were gel-purified and then sequenced on the HiSeq2000 (Illumina) according to the manufacturer's instructions, using the sequencing primer 5′-CGACAGGTTCAGAGTTCTACAGTCCGACGATC-3′.

Sequencing data processing

Small RNA sequences were analyzed using the CLC Genomics Workbench version 6.5.1 (CLC bio, Aarhus, Denmark). Briefly, small RNA sequences were filtered for quality and size, and reads of low quality and lengths less than 15 were discarded. Clean reads were mapped to precursor miRNA sequences from mirbase release 21 (http://www.mirbase.org/) and noncoding RNA sequences from Ensembl horse genome release 78 (http://www.ensembl.org/), allowing two mismatches. miRNA variants were determined based on the presence of up to five additional nucleotides on either the 5′ or 3′ end of the reads.

Data access

Raw sequence datasets in FASTQ format were deposited in the NCBI Sequence Reads Archive under accession numbers SRX170338, SRX170339, and SRX170340. MiRNA sequence data for non-plasma tissues are from Kim et al. [8]. The accession numbers for the colon tissues are SRX187171, SRX187172, SRX187173, and SRX187174; those for the muscle tissues are SRX187166, SRX187167, SRX187168, and SRX187169; those for the liver tissues are SRX187162, SRX187163, SRX187164, and SRX187165.

Normalization of miRNA expression levels

Read count data were normalized by the trimmed mean of M-values (TMM) method [25] available in the edgeR Bioconductor package [26]. The TMM normalization method can adjust for situations where different tissues express diverse miRNA repertoires such that some miRNA genes may be very highly expressed in one tissue, but not in another. The TMM-normalized expression values were thus obtained as count per million (CPM) unit. Subsequent to normalization, we filtered out miRNAs expressed at very low levels. Because the smallest group size in the plasma is three samples, we retained those miRNAs with at least one CPM in at least three samples. This low expression filtering retained 266 miRNAs out of a total of 428.

Hierarchical clustering

For unsupervised hierarchical clustering analysis, logarithms of CPM values of the 266 miRNAs were taken and inputted to the EMA Bioconductor package [27] using the average linkage method with a Pearson centered correlation as a similarity metric. To draw a heatmap, log CPM values of each miRNA were standardized across samples by z-transformation such that the row mean and variance became 0 and 1, respectively.

Identification of potential novel miRNAs

The reads that did not match any database were mapped to the horse reference genome (EquCab2) using SOAP version 1.11 [28], with no mismatches allowed. The reads with perfect matches were subjected to Mireap version 0.2 (http://sourceforge.net/projects/mireap/) to predict novel miRNAs. The precursors of predicted novel miRNAs were further subjected to Mipred [29] to filter out pseudo-miRNAs as previously described [30].

Target prediction

The 5′ and 3′ untranslated regions of horse genes were extracted from the Ensembl horse genome release 78 (http://www.ensembl.org/). A search for miRNA target genes was performed using Target-align [31] with the following parameters [3234]: (1) no more than four mismatches between the small RNA and its target (G–U nucleotides count as 0.5 mismatches); (2) no more than two adjacent mismatches in the miRNA/target duplex; (3) no mismatches in positions 10 and 11 of the miRNA/target duplex; and (4) no more than 2.5 mismatches in positions 1 and 12 of the miRNA/target duplex (5′ of miRNA). Functional annotation of the target genes was performed using the PANTHER classification system version 9.0 (http://www.pantherdb.org/) [35].

Nucleotide composition

miRNA sequences were imported from the CLC Genomics Workbench into Excel 2013 (Microsoft, Redmond, WA), and were arranged such that each spreadsheet cell contained a nucleotide for a specific miRNA position. The COUNTIF and PRODUCT commands as well as the Pivot Table feature were used to determine the nucleotide compositions. The number of 22 nt-miRNAs containing at least one MNR (four nucleotides or longer) for each nucleotide was estimated using a Perl script (S1 Material).

Statistical analyses

Statistical analyses were performed using SPSS 15.0 (SPSS, Chicago, IL). The comparison of miRNA expression levels was evaluated using one-way analysis of variance (ANOVA) with Tukey’s post hoc test. The comparison of nucleotide compositions was performed using one-way ANOVA with Tukey’s post hoc test and the Student's t-test. Data were considered significantly different when P was < 0.05.

Results

MiRNA species enriched in blood plasma

Next generation sequencing-based small RNA profiles were generated for three horse plasma libraries (Table 1), with 21.8–25.6 million sequencing reads obtained from each library. After removing low-quality reads from the raw sequences, we obtained 21.7–25.4 million clean reads, of which 10.3–15.8 million (45.3–72.7%) were previously known miRNA species.

thumbnail
Table 1. Sequencing read statistics for small RNA libraries of horse plasma.

https://doi.org/10.1371/journal.pone.0146374.t001

Sequencing identified 366 unique miRNA species in the three libraries, of which 305 were commonly found (Fig 1A). The sequences were 20–27 nt in length, and those 22 nt long were the most frequent (Fig 1B). The genes for the 366 miRNA species are located within all of the horse chromosomes, with the exception of the smallest chromosome 31 (Fig 1C and 1D). Among these, chromosomes 11, 24, and X contained the highest number of genes for plasma miRNAs. Chromosomes 7, 16, and 23 contributed most to the plasma miRNA content, as assessed by the sum of the CPM levels for each miRNA species.

thumbnail
Fig 1. Sequencing of miRNAs in three horse plasma libraries.

(A) Venn diagram representing the numbers of miRNA species identified in the libraries. (B) Length distribution of miRNAs. The x-axis represents sequence lengths, and the y-axis the occurrence frequency of miRNA species for each given length. (C, D) Chromosomal location of miRNA genes. The x-axes represent chromosome number, and the y-axes the numbers of miRNA species (C) and reads (D) on the respective chromosome. Black and gray bars represent sense and antisense strands, respectively. For (C) and (D), data are shown as mean ± S.D. (n = 3).

https://doi.org/10.1371/journal.pone.0146374.g001

Ten plasma miRNA species exhibiting the highest expression levels in the libraries are listed with their characteristics in Table 2. These accounted for 68.6% of total plasma miRNA reads, and the three most highly expressed miRNA species, eca-let-7f, -7a, and eca-miR-191a, accounted for 40.6%. Those reads that had not been annotated were used to identify potentially novel miRNA species in silico. S1 Table lists the five potentially novel miRNA species exhibiting the highest expression levels.

thumbnail
Table 2. Ten most abundant miRNA species in horse plasma.

Location represents the chromosome on which the miRNA genes are located, start and end sites, and either sense [+] or antisense strand [−].

https://doi.org/10.1371/journal.pone.0146374.t002

Clustering of miRNA expression

As shown in the dendrogram at the top of Fig 2, samples were clustered according to tissue origins. Plasma samples constituted a distinct cluster, separated from the other three tissue clusters by a large inter-cluster branch length [8]. The plasma cluster also showed the smallest intra-cluster branch length reflecting the homogeneous expression profiles of the plasma samples. Two large miRNA clusters were observed in the gene dendrogram: the first consisted of miRNAs with consistently high plasma expression in plasma, and showed a simple dendrogram structure. The second cluster consisted of miRNAs with lower plasma expression, and showed a more complicated dendrogram structure with several subclusters, reflecting the heterogeneous expression profiles between the subclusters. A total of 87 plasma miRNAs exhibited more than 2-fold higher expression levels than each of the three non-plasma tissues, and 30 exhibited more than 10-fold higher expression levels; 81 showed more than 2-fold lower expression levels and 50 more than 10-fold lower expression levels. MiRNA species that showed higher and lower levels of expression (CPM) in the plasma compared with the liver, colon, and muscle tissues are listed in S2 and S3 Tables, respectively.

thumbnail
Fig 2. Unsupervised hierarchical clustering of horse miRNA expression profiles.

Values in the color key represent z-transformed expression values, such that highest expression values of each row correspond to bright red, and the lowest to bright green. Plasma samples constitute a distinct cluster, separated from the other three tissue clusters.

https://doi.org/10.1371/journal.pone.0146374.g002

Expression of blood plasma-enriched miRNA species in non-plasma tissues

The expression of the 10 miRNA species exhibiting the highest plasma expression (Table 2) was next examined in the colon, muscle, and liver (Fig 3). Seven miRNA species, eca-miR-21, -24, -92a, -191a, -223, -486-5p, and -103, were found to have significantly higher expression levels in the plasma than the three other tissues (P < 0.05). The expression levels for eca-let-7f and -7g did not differ significantly among the four tissue types, whereas eca-let-7a expression was significantly lower in the plasma (P < 0.05). This suggests that the 10 most abundant plasma miRNA species did not belong to the abundant miRNA group in non-plasma tissues, except for the eca-let-7 family species (S4 Table).

thumbnail
Fig 3. Differential expression.

Tissue distribution was examined for 10 miRNA species that were most highly expressed in horse plasma. Different alphabetical letters (a, b, and c) above the bars represent significant differences between the tissues (n = 3–4, one-way ANOVA with Tukey’s post hoc test, P < 0.05).

https://doi.org/10.1371/journal.pone.0146374.g003

MiRNA target prediction and gene ontology (GO) analyses

To examine the functional importance of plasma miRNAs, we derived potentially affected targets using a prediction program. A total of 22 potential target mRNAs were identified for 10 miRNAs having the highest expression in the plasma (Table 2). WW domain containing adaptor with coiled-coil (WAC) mRNA was shown to be a putative common target of three of the miRNA species with nearly identical sequences: eca-let-7a, -7f, and -7g. No potential target was identified for eca-miR-486-5p, -223, or -103.

Among the potential targets, 10 mRNAs were subcategorized into 17 GO classifications including nine biological processes (S1A Fig), six molecular functions (S1B Fig), and two cellular components (data not shown). Specifically, “metabolic process” (seven targets, 23.3%) and “cellular process” (seven targets, 23.3%) were highly represented in the biological process of classification; together with “catalytic activity” (six targets, 31.6%) and “binding” (six targets, 31.6%) in the molecular function classification; and “cell part” (one target, 50%) and “membrane” (one target, 50%) in the cellular component classification.

Overall nucleotide composition of miRNAs

We examined the overall nucleotide composition of miRNA in the four tissues, focusing particularly on cytosine frequencies. We conducted miRNA expression-unweighted (Fig 4A) and -weighted (Fig 4B) analyses, with the latter assessing the miRNA nucleotide composition by taking into account their expression levels in the tissues. Thus, miRNA species having higher expression levels would affect the nucleotide compositions to a greater extent in expression-weighted analysis.

thumbnail
Fig 4. Overall nucleotide composition of miRNAs in different tissues, as assessed by miRNA expression-unweighted (A) and -weighted criteria (B).

Different Roman numerals (I, II, III, and IV) above the bars represent significant differences between occurrence frequencies of nucleotides within the tissue, and different alphabetical letters (a, b, and c) represent the significant difference between occurrence frequencies of a nucleotide among the different tissues (n = 3–4, one-way ANOVA with Tukey’s post hoc test, P < 0.05). * indicates that the occurrence frequency of a nucleotide is smaller in the expression-unweighted criterion than -weighted counterpart within the tissue, or vice versa (Student’s t-test, P < 0.05).

https://doi.org/10.1371/journal.pone.0146374.g004

In all four tissue miRNAs, cytosine was the nucleotide with the lowest frequency under both expression-unweighted and -weighted criteria (P < 0.05). Cytosine frequencies in non-plasma tissue miRNAs ranged from 51.2 to 65.7% and from 2.73 to 17.6% of the means of the frequencies of the other three nucleotides under expression-unweighted and -weighted criteria, respectively. Those equivalents in plasma miRNAs were much higher, at 86.5% and 56.8%, respectively.

The cytosine frequency was significantly higher in plasma miRNAs than in those of non-plasma tissues, at more than 1.25- and 2.88-fold under expression-unweighted and -weighted criteria, respectively (P < 0.05). Despite their high cytosine content, plasma miRNAs had a guanine–cytosine content slightly higher or similar to those of other tissue miRNAs, ranging from 1.05- to 1.11-fold and from 1.04- to 1.49-fold that of plasma miRNAs in the expression-unweighted and -weighted analyses, respectively. This may reflect the low guanine content of plasma miRNAs among the tissue miRNAs. The guanine content of plasma miRNAs was 86.1–92.6% of those of non-plasma tissues in expression-unweighted analysis (P < 0.05). In expression-weighted analysis, the guanine content of plasma miRNA was 73.9% and 83.1% of those of colon and liver miRNAs, respectively (P < 0.05), and not significantly different from that of muscle miRNAs.

In non-plasma tissue miRNAs, cytosine frequencies under expression-weighted analyses were lower by at least 3.17-fold than those under expression-unweighted analyses (P < 0.05). However, the difference in cytosine frequencies of plasma miRNAs was not significant between the two criteria.

Position-dependent nucleotide composition of miRNAs

Position-dependent nucleotide compositions were analyzed for 18 nucleotide positions of miRNAs, including the first to ninth nucleotide positions at both 5′- and 3′-ends (Fig 5). This aimed to assess nucleotide positions responsible for the low cytosine content in tissue miRNAs and the relatively high cytosine content in plasma miRNAs (Fig 4).

thumbnail
Fig 5. Position-dependent nucleotide composition for 5′- and 3′-ends of miRNAs.

The composition was assessed in the expression-unweighted (A–D) and -weighted criteria (E–H), in plasma (A and E), colon (B and F), muscle (C and G), and liver (D and H). The x-axes represent nucleotide positions of miRNA sequences, and the y-axes the occurrence frequencies of nucleotides (n = 3–4).

https://doi.org/10.1371/journal.pone.0146374.g005

In expression-unweighted analysis, plasma miRNAs had the highest cytosine frequencies at 14 of the 18 nucleotide positions compared with the three other tissue types (P < 0.05; Fig 5A). The four exceptions were observed at 5′3, 5′5, 5′7, and 3′2. The cytosine frequency ranged from 15.6 to 28.5% at the 18 positions of plasma miRNA, with equivalents of 13.2–25.1%, 9.73–19.1%, and 9.44–19.4% for colon, muscle, and liver tissue miRNAs, respectively. The lowest and highest cytosine frequencies in both plasma and colon miRNAs were observed at 3′1 and 5′3, respectively, while these positions were 5′1 and 5′7 for muscle miRNAs, respectively, and 3′1 and 5′3 for the second lowest and highest cytosine frequencies in muscle miRNAs, respectively. In the liver miRNAs, 3′1 and 5′4 had the lowest and highest cytosine frequencies, respectively, while 5′3 had the second highest frequency.

In expression-weighted analysis, plasma miRNAs exhibited the highest cytosine frequencies at 16 of the 18 nucleotide positions compared with other tissues (P < 0.05; Fig 5E). The exceptions were observed at positions 5′5 and 3′1. In plasma miRNAs, cytosine occurred at frequencies of 6.2–26.4% in the 18 positions, with 5′7 and 5′4 having the lowest and highest frequencies, respectively. In the miRNAs of colon, muscle, and liver tissues, the lowest cytosine frequencies were found at 5′7, 5′1, and 5′2, where the frequencies were 2.2%, 0.34%, and 0.93%, respectively. The highest cytosine frequencies were found at 3′2, 5′3, and 3′2, where the frequencies were 9.61%, 1.50%, and 2.79%, respectively.

Variations of position-dependent cytosine frequencies were higher in the expression-weighted analysis than in the expression-unweighted analysis. The coefficients of variation (CVs) for cytosine frequencies among the 18 positions were 15.9–17.6% among the tissue miRNAs in the expression-unweighted analysis, compared with 30.1–37.7% in the expression-weighted analysis.

The other three nucleotides exhibited much higher variations in position-dependent nucleotide frequencies than cytosine. The CV ranges for adenine, uracil, and guanine frequencies at the 18 positions of the four tissue miRNAs were 21.1–27.1%, 31.2–43.6%, and 24.7–35.0% in the expression-unweighted analyses, respectively. The equivalents for the expression-weighted criterion were 73.5–122%, 69.8–108%, and 65.6–131%, respectively.

The position-dependent guanine–cytosine content was significantly higher at seven positions in the plasma miRNAs than those of other tissues in the expression-unweighted analysis: 5′1, 3′1, 3′2, and 3′4–3′7 (P < 0.05). The content differences between those of plasma and other tissue miRNAs were 1.54-fold or less. In the expression-weighted analysis, positions 5′1 and 5′6 had at least 2.5- and 3.0-fold higher guanine–cytosine contents in the plasma miRNAs than in other tissue miRNAs, respectively (P < 0.05). However, plasma miRNAs had at least 27.0% and 13.8% lower guanine–cytosine contents at positions 5′2 and 3′8 than the other tissue miRNAs, respectively (P < 0.05).

Occurrence of mononucleotide repeat (MNR) motifs in miRNAs

We next examined the occurrence of miRNAs carrying MNRs of 4 repeats or longer. In expression-unweighted analysis, the frequencies of miRNAs containing MNR(s) ranged from 3.77–4.46%, 6.48–7.84%, 5.16–5.69%, and 5.12–6.26% for adenine, uracil, guanine, and cytosine, respectively (Fig 6A). Among them, adenine repeat-containing miRNAs occurred least frequently (P < 0.05). Uracil repeat-containing miRNAs exhibited the highest frequencies in muscle and liver tissue miRNAs (P < 0.05).

thumbnail
Fig 6. Occurrence frequencies of MNR-containing miRNAs (four repeats or longer) in tissue miRNAs, as assessed in expression-unweighted (A) and -weighted criteria (B).

Different Roman numerals (I, II, and III) above the bars represent significant differences between frequencies of MNR-containing miRNAs within miRNAs of a tissue, and different alphabetical letters (a and b) represent the significant difference between frequencies of MNR-containing miRNAs for a nucleotide among different tissue miRNAs (n = 3–4, one-way ANOVA with Tukey’s post hoc test, P < 0.05). * indicates that the frequency of MNR-containing miRNAs for a nucleotide is smaller in the expression-unweighted criterion than the -weighted counterpart within the tissue miRNAs, or vice versa (Student’s t-test, P < 0.05).

https://doi.org/10.1371/journal.pone.0146374.g006

In the expression-weighted analysis, adenine and cytosine repeat-containing miRNAs occurred with frequencies of 10.5% and 10.3% in plasma miRNAs, respectively (Fig 6B). The adenine repeat frequency was 2.70-fold higher than that of the expression-unweighted analysis (P < 0.05), while there was no significant difference from the equivalent expression-unweighted analysis for the cytosine repeat frequency. The frequencies for guanine and uracil repeat-containing miRNAs in plasma miRNAs and for any type of MNR-containing miRNAs in non-plasma miRNAs were 2.35% or less under the expression-weighted criterion. These were at least 58.6% lower than the equivalents in the expression-unweight criterion (P < 0.05).

Discussion

The genes for the 366 unique plasma miRNA species identified in the present study are located on all but one of the horse chromosomes (chromosome 31; Fig 1). Interestingly, of the 690 miRNA genes currently identified, none is located on chromosome 31 (http://www.mirbase.org), suggesting that chromosome 31 contributes little or not at all to miRNA expression.

Clustering analysis showed that the plasma miRNA expression pattern could be readily distinguished from that of other tissues (Fig 2). The potential targets of plasma miRNAs are involved in catalytic, binding, and receptor pathways, as well as metabolic and cellular processes (S1 Fig). Among the 10 most abundant plasma miRNAs of the horse (Table 2), the human orthologs of eca-miR-486-5p, -92a, and -21 also belong to the 10 most abundant miRNAs in human plasma [20], which indicates their importance in mammalian plasma.

Consistent with previous studies in humans and animals [913], uracil and guanine were the most frequent nucleotides in tissue miRNAs in the expression-unweighted criterion (Fig 4). This was also observed in the expression-weighted criterion except for the muscle, in which adenine was the predominant nucleotide. The eca-miR-1 species with its high adenine content (nine of 22 nucleotides) accounted for 86.4% of the total miRNA reads in the muscle (S4 Table), and may be responsible for the high adenine content in the expression-weighted analysis. Uracil was the most frequent nucleotide identified at both ends of miRNAs where guanine and cytosine were under-represented (Fig 5); this has previously been reported [9, 13], suggesting that similarities exist between the nucleotide composition of horse miRNAs and those of other animals.

Cytosine was the least frequent nucleotide in the miRNAs of all tissues analyzed, but its under-representation was greater in non-plasma tissues than the plasma (Fig 4). Indeed, analysis of the position-dependent nucleotide composition found that most nucleotide positions in plasma miRNAs had higher cytosine contents than those of other tissues (Fig 5). These findings suggest that cytosine is non-preferentially included in many horse tissue miRNAs, but is less under-represented in plasma miRNAs in a generally position-independent manner. However, the overall guanine–cytosine content of plasma miRNAs, which may augment the strength of base-pairing to mRNA targets [14], was slightly higher than or similar to other tissue miRNAs because the high cytosine content tended to be compensated for by low guanine content (Fig 4). In contrast to the plasma miRNAs, cytosine frequencies were lower in the expression-weighted analysis than the -unweighted analysis in non-plasma tissues, suggesting that miRNA species with high expression levels tend to have lower cytosine contents.

Despite differences in the position-dependent cytosine content among miRNAs of the four tissues (Fig 5), the patterns of position-dependent cytosine frequencies were comparable among the tissue miRNAs in the expression-unweighted criterion, possibly reflecting common miRNA structures. The CVs for cytosine frequencies were similar among the four tissue miRNAs, ranging from 15.9 to 17.6%. Additionally, the lowest and highest cytosine contents were observed at nucleotide positions 3′1 and 5′3/5′4, respectively, in the four tissue miRNAs under the expression-unweighted criterion.

A small number of miRNA species accounts for the majority of the total read counts in many tissues [22, 3638], indicating that rigorous selection must occur to use only specific miRNAs. This trend was also observed in horse tissues, but to a lower extent in the plasma compared with other tissues (Table 2 and S4 Table). The high CVs for position-dependent nucleotide frequencies in the expression-weighted criterion compared with the -unweighted criterion might reflect the tissue-dependent over-representation of a small number of miRNA species.

The patterns for MNR-containing miRNA occurrence frequencies were generally similar among the four tissue miRNAs in the expression-unweighted criterion (Fig 6A), possibly reflecting the common structure of miRNAs. A 22-nt ssRNA could have 422 possible sequence combinations, of which about 5.57% could have MNR(s) of at least four repeats for each nucleotide. The present study showed that the occurrence frequencies for guanine and cytosine repeat-containing miRNAs were similar to the predicted level and that adenine and uracil repeat-containing miRNAs were under- and over-represented, respectively. In the expression-weighted criterion, however, the MNR occurrence frequencies were much lower than the predicted level, except for adenine and cytosine repeat-containing miRNAs in the plasma. These findings suggest that MNR-containing miRNAs tend to be expressed in a non-preferential manner. The strikingly high expression levels of adenine and cytosine repeat-containing miRNAs in the plasma suggest a preferred expression of miRNAs containing these repeats.

Conclusion

In view of their location, plasma miRNAs may have distinctive structural characteristics and functional roles. In support of this hypothesis, plasma miRNAs exhibited a unique nucleotide composition with respect to cytosine content and that of adenine and cytosine repeats. Because plasma is unlikely to produce miRNAs, the origin (release) and fate (stability) of plasma miRNA species are responsible for determining the nucleotide composition. It can be hypothesized that structural characteristics are required for miRNA biological functions, but further studies are necessary to examine the sources, functions, and trafficking of miRNAs circulating in the blood plasma.

Supporting Information

S1 Fig. Gene ontology of the predicted target genes for the 10 most highly expressed miRNA species in horse plasma.

Categorization of miRNA-target genes was performed according to biological processes (A) and molecular functions (B).

https://doi.org/10.1371/journal.pone.0146374.s001

(PDF)

S1 Material. A Perl script used to estimate the number of 22-nt miRNAs containing at least one MNR (four nucleotides or longer) for each nucleotide.

https://doi.org/10.1371/journal.pone.0146374.s002

(PL)

S1 Table. Five most abundant putative novel miRNA species in horse plasma.

Location represents the chromosome on which the miRNA genes are located, start and end sites, and either sense [+] or antisense strand [−].

https://doi.org/10.1371/journal.pone.0146374.s003

(PDF)

S2 Table. MiRNA species that showed higher levels of expression (CPM) in the plasma compared with the liver, colon, and muscle tissues.

https://doi.org/10.1371/journal.pone.0146374.s004

(PDF)

S3 Table. MiRNA species that showed lower levels of expression (CPM) in the plasma compared with the liver, colon, and muscle tissues.

https://doi.org/10.1371/journal.pone.0146374.s005

(PDF)

S4 Table. Ten most abundant miRNA species in the horse colon, muscle, and liver.

https://doi.org/10.1371/journal.pone.0146374.s006

(PDF)

Acknowledgments

This study was supported by the Bio-industry Technology Development Program (Grant no. 111159–4) of iPET (Korea Institute of Planning and Evaluation for Technology in Food, Agriculture, Forestry and Fisheries), Ministry of Agriculture, Food and Rural Affairs, South Korea. We appreciate the assistance from the KOBIC Research Support Program.

Author Contributions

Conceived and designed the experiments: DYR SL. Performed the experiments: SL HJY DO YJC MCK YK DYR. Analyzed the data: SH SL DYR. Contributed reagents/materials/analysis tools: SH. Wrote the paper: DYR SL.

References

  1. 1. Baek D, Villén J, Shin C, Camargo FD, Gygi SP, Bartel DP. The impact of microRNAs on protein output. Nature. 2008;455: 64–71. pmid:18668037
  2. 2. Naeem H, Küffner R, Csaba G, Zimmer R. miRSel: automated extraction of associations between microRNAs and genes from the biomedical literature. BMC Bioinformatics. 2010;11: 135. pmid:20233441
  3. 3. Wade CM, Giulotto E, Sigurdsson S, Zoli M, Gnerre S, Imsland F, et al. Genome sequence, comparative analysis, and population genetics of the domestic horse. Science. 2009;326: 865–867. pmid:19892987
  4. 4. Barrey E, Bonnamy B, Barrey EJ, Mata X, Chaffaux S, Guerin G. Muscular microRNA expressions in healthy and myopathic horses suffering from polysaccharide storage myopathy or recurrent exertional rhabdomyolysis. Equine Vet J. 2010;38: 303–310.
  5. 5. Donadeu FX, Schauer SN. Differential miRNA expression between equine ovulatory and anovulatory follicles. Domest Anim Endocrinol. 2013;45: 122–125. pmid:23932580
  6. 6. da Silveira JC, Veeramachaneni DNR, Winger Q a., Carnevale EM, Bouma GJ. Cell-secreted vesicles in equine ovarian follicular fluid contain miRNAs and proteins: a possible new form of cell communication within the ovarian follicle. Biol Reprod. 2012;86: 71. pmid:22116803
  7. 7. Buechli ME, Lamarre J, Koch TG. MicroRNA-140 expression during chondrogenic differentiation of equine cord blood-derived mesenchymal stromal cells. Stem Cells Dev. 2013;22: 1288–1296. pmid:23157248
  8. 8. Kim MC, Lee SW, Ryu DY, Cui FJ, Bhak J, Kim Y. Identification and characterization of microRNAs in normal equine tissues by next generation sequencing. PLoS One. 2014;9: e93662. pmid:24695583
  9. 9. Zhang B, Stellwag EJ, Pan X. Large-scale genome analysis reveals unique features of microRNAs. Gene. 2009;443: 100–109. pmid:19422892
  10. 10. Gu YQ, Gong G, Xu ZL, Wang LY, Fang ML, Zhou H, et al. miRNA profiling reveals a potential role of milk stasis in breast carcinogenesis. Int J Mol Med. 2014;33: 1243–1249. pmid:24584717
  11. 11. Yuan L, Zhang X, Li L, Jiang H, Chen J. High-throughput sequencing of microRNA transcriptome and expression assay in the sturgeon, Acipenser schrenckii. PLoS One. 2014;9: e115251. pmid:25506840
  12. 12. Sun J, Li M, Li Z, Xue J, Lan X, Zhang C, et al. Identification and profiling of conserved and novel microRNAs from Chinese Qinchuan bovine longissimus thoracis. BMC Genomics. 2013;14: 42. pmid:23332031
  13. 13. Wang B. Base composition characteristics of mammalian miRNAs. J Nucleic Acids. 2013;2013: 951570. pmid:23710337
  14. 14. Carmel I, Shomron N, Heifetz Y. Does base-pairing strength play a role in microRNA repression? RNA. 2012;18: 1947–1956. pmid:23019592
  15. 15. Chim SSC, Shing TKF, Hung ECW, Leung TY, Lau TK, Chiu RWK, et al. Detection and characterization of placental microRNAs in maternal plasma. Clin Chem. 2008;54: 482–490. pmid:18218722
  16. 16. Mitchell PS, Parkin RK, Kroh EM, Fritz BR, Wyman SK, Pogosova-Agadjanyan EL, et al. Circulating microRNAs as stable blood-based markers for cancer detection. Proc Natl Acad Sci U S A. 2008;105: 10513–10518. pmid:18663219
  17. 17. Turchinovich A, Weiz L, Burwinkel B. Extracellular miRNAs: the mystery of their origin and function. Trends Biochem Sci. 2012;37: 460–465. pmid:22944280
  18. 18. Duttagupta R, Jiang R, Gollub J, Getts RC, Jones KW. Impact of cellular miRNAs on circulating miRNA biomarker signatures. PLoS One. 2011;6: e20769. pmid:21698099
  19. 19. Pritchard CC, Kroh E, Wood B, Arroyo JD, Dougherty KJ, Miyaji MM, et al. Blood cell origin of circulating microRNAs: a cautionary note for cancer biomarker studies. Cancer Prev Res. 2012;5: 492–497.
  20. 20. Williams Z, Ben-Dov IZ, Elias R, Mihailovic A, Brown M, Rosenwaks Z, et al. Comprehensive profiling of circulating microRNA via small RNA sequencing of cDNA libraries reveals biomarker potential and limitations. Proc Natl Acad Sci U S A. 2013;110: 4255–4260. pmid:23440203
  21. 21. Laterza OF, Lim L, Garrett-Engele PW, Vlasakova K, Muniappa N, Tanaka WK, et al. Plasma microRNAs as sensitive and specific biomarkers of tissue injury. Clin Chem. 2009;55: 1977–1983. pmid:19745058
  22. 22. Lukiw WJ, Andreeva T V., Grigorenko AP, Rogaev EI. Studying microRNA function and dysfunction in Alzheimer’s disease. Front Genet. 2013;3: 327. pmid:23390425
  23. 23. Arroyo JD, Chevillet JR, Kroh EM, Ruf IK, Pritchard CC, Gibson DF, et al. Argonaute2 complexes carry a population of circulating microRNAs independent of vesicles in human plasma. Proc Natl Acad Sci U S A. 2011;108: 5003–5008. pmid:21383194
  24. 24. Vickers KC, Remaley AT. Lipid-based carriers of microRNAs and intercellular communication. Curr Opin Lipidol. 2012;23: 91–97. pmid:22418571
  25. 25. Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010;11: R25. pmid:20196867
  26. 26. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26: 139–140. pmid:19910308
  27. 27. Servant N, Gravier E, Gestraud P, Laurent C, Paccard C, Biton A, et al. EMA—A R package for Easy Microarray data analysis. BMC Res Notes. 2010;3: 277. pmid:21047405
  28. 28. Li R, Li Y, Kristiansen K, Wang J. SOAP: short oligonucleotide alignment program. Bioinformatics. 2008;24: 713–714. pmid:18227114
  29. 29. Jiang P, Wu H, Wang W, Ma W, Sun X, Lu Z. MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features. Nucleic Acids Res. 2007;35: W339–W344. pmid:17553836
  30. 30. Chen X, Li Q, Wang J, Guo X, Jiang X, Ren Z, et al. Identification and characterization of novel amphioxus microRNAs by Solexa sequencing. Genome Biol. 2009;10: R78. pmid:19615057
  31. 31. Xie F, Zhang B. Target-align: a tool for plant microRNA target identification. Bioinformatics. 2010;26: 3002–3003. pmid:20934992
  32. 32. Sun J, Zhou Y, Cai H, Lan X, Lei C, Zhao X, et al. Discovery of novel and differentially expressed microRNAs between fetal and adult backfat in cattle. PLoS One. 2014;9: e90244. pmid:24587298
  33. 33. Allen E, Xie Z, Gustafson AM, Carrington JC. microRNA-directed phasing during trans-acting siRNA biogenesis in plants. Cell. 2005;121: 207–221. pmid:15851028
  34. 34. Schwab R, Palatnik JF, Riester M, Schommer C, Schmid M, Weigel D. Specific effects of microRNAs on the plant transcriptome. Dev Cell. 2005;8: 517–527. pmid:15809034
  35. 35. Mi H, Muruganujan A, Casagrande JT, Thomas PD. Large-scale gene function analysis with the PANTHER classification system. Nat Protoc. 2013;8: 1551–1566. pmid:23868073
  36. 36. Gao F-B. Context-dependent functions of specific microRNAs in neuronal development. Neural Dev. 2010;5: 25. pmid:20920300
  37. 37. Coulouarn C, Factor VM, Andersen JB, Durkin ME, Thorgeirsson SS. Loss of miR-122 expression in liver cancer correlates with suppression of the hepatic phenotype and gain of metastatic properties. Oncogene. 2009;28: 3526–3536. pmid:19617899
  38. 38. Van Rooij E. The art of microRNA research. Circ Res. 2011;108: 219–234. pmid:21252150