MicroRNAs (miRNAs) are a class of small RNA molecules that regulate gene expression by inhibiting the protein translation or targeting the mRNA cleavage. They play many important roles in living organism cells; however, the knowledge on miRNAs functions has become more extensive upon their identification in biological fluids and recent reports on plant-origin miRNAs abundance in human plasma and serum. Considering these findings, we performed a rigorous bioinformatics analysis of publicly available, raw data from high-throughput sequencing studies on miRNAs composition in human and porcine breast milk exosomes to identify the fraction of food-derived miRNAs. Several processing and filtering steps were applied to increase the accuracy, and to avoid false positives. Through aforementioned analysis, 35 and 17 miRNA species, belonging to 25 and 11 MIR families, were identified, respectively. In the human samples the highest abundance levels yielded the ath-miR166a, pab-miR951, ptc-miR472a and bdi-miR168, while in the porcine breast milk exosomes, the zma-miR168a, zma-miR156a and ath-miR166a have been identified in the largest amounts. The consensus prediction and annotation of potential human targets for select plant miRNAs suggest that the aforementioned molecules may interact with mRNAs coding several transcription factors, protein receptors, transporters and immune-related proteins, thus potentially influencing human organism. Taken together, the presented analysis shows proof of abundant plant miRNAs in mammal breast milk exosomes, pointing at the same time to the new possibilities arising from this discovery.
Citation: Lukasik A, Zielenkiewicz P (2014) In Silico Identification of Plant miRNAs in Mammalian Breast Milk Exosomes – A Small Step Forward? PLoS ONE 9(6): e99963. doi:10.1371/journal.pone.0099963
Editor: Vinod Scaria, CSIR Institute of Genomics and Integrative Biology, India
Received: February 27, 2014; Accepted: May 20, 2014; Published: June 16, 2014
Copyright: © 2014 Lukasik, Zielenkiewicz. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the Institute of Biochemistry and Biophysics, Polish Academy of Sciences (IBB PAS). The IBB PAS had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
MicroRNAs (miRNAs) are a class of short (18–24 nt) regulatory RNAs that are widely evolutionary conserved among many species , . These single-stranded, non-coding molecules mediate post-transcriptional gene regulation by promoting cleavage or inhibiting translation of the target mRNA , . As a mature sequence form, miRNAs are generated in a multi-step process, which begins in nucleus from miRNA gene transcription into long primary transcript with many stem-loop units (pri-miRNA). The pri-miRNA is further processed into the hairpin precursor (pre-miRNA) and cleaved to generate the miRNA:miRNA* duplex with two nucleotide overhangs at the 3′ ends. In plants, these 2-nucleotide 3′-overhangs are then methylated by Hua Enhancer 1 (HEN1) methyltransferase , while in animals they remain unmethylated. In most cases, one of the duplex strands (*-strand) is degraded in the last stage of miRNA maturation process. Whereas, the second strand is loaded on the RISC (RNA-Induced Silencing Complex) multi-complex and binds to the specific mRNA transcript . Throughout this hybridization, miRNAs negatively regulate expression of target genes, which control cell development, apoptosis, proliferation, differentiation and function in living organisms , . Plant miRNAs not only play a role in organ development but also regulate nutrient homeostasis, environmental stress responses and phase changes , . In humans, several reports have associated an expression profile of specific miRNAs with certain pathological stages, tumorigenesis or patient’s response to treatment. Thus, in medicine, miRNAs have become new diagnostic and prognostic biomarkers , , and have been incorporated in a few therapies for treating several human disorders , .
Growing interest in miRNAs and advancing experimental, and computational analytical approaches have contributed to a significant increase in information on miRNAs over the past few years. Using high-throughput sequencing methods, such as the Roche 454 Life Sciences System, Illumina Genome Analyzer and Applied Biosystems SOLiD system, along with many bioinformatics approaches, it is currently possible to identify large fraction of miRNAs, determine their expression level, predict precursor sequences, target genes and many other characteristics , . The recent detection of miRNAs in body fluids (e.g., serum, urine, saliva, blood and milk) indicates that these molecules may play even greater role as gene expression regulators than initially anticipated , , . The Gu et al. and Zhou et al. studies on miRNAs composition in porcine and human breast milk exosomes, respectively, demonstrated that resistant to harsh conditions, immune-related miRNAs are present and enriched in the examined membranous vesicles. Therefore, the authors suggest that breast milk exosomal miRNA molecules may be transferred to an infant’s body via the digestive tract and affect immune system development , . Even more intriguing was a recent report on cross-kingdom regulation by plant miRNA, wherein the study by Zhang et al. provided evidence not only that exogenous, food-derived miRNAs are abundant in human serum but also that they can negatively regulate expression of specific genes in mammals. For example, MIR168a inhibits expression of the low-density lipoprotein receptor adapter protein 1 (LDLRAP1) in liver and thereby disrupts LDL plasma homeostasis . The plant-origin miRNAs were also identified by the Wang’s et al. group, which showed that aforementioned molecules compose a significant sRNAs fraction in human plasma .
Considering recent assumptions and evidence that endogenous and exogenous miRNAs might be sufficiently stable to pass through the gastrointestinal (GI) tract and enter circulation without losing functionality, we decided to take a step forward and determine whether plant miRNAs, especially those identified in serum, can be packed into mammalian breast milk exosomes. For this reason, we performed a rigorous bioinformatics analysis of publicly available raw data from small RNA high-throughput sequencing studies on porcine and human breast milk exosomes. In the 12 datasets, we identified 17 and 35 plant miRNA species (from 11 and 25 MIR families) in the porcine and human samples, respectively. Additionally, to determine whether these plant miRNAs may influence the human organism, we predicted and annotated the target mRNAs that may potentially interact with selected miRNA molecules.
Small RNA Tags Analysis and Identification of Plant miRNAs in Breast Milk Exosomes
The raw data collected from 8 porcine and 4 human small RNA libraries included over 179.90 and 86.37 million reads, respectively. After removing low-quality tags and contaminants and clustering the reads, the analyzed datasets from H. sapiens and S. scrofa included 1,057,293/1,228,454/683,354/1,388,526 and 1,192,136/1,347,110/496,284/663,436/284,128/862,361/347,690/1,077,666 unique sequences, respectively.
The most important step in this study (the full workflow is shown in Figure 1) was the identification of sequences with significant homology to plant miRNAs. For this purpose, a BlastN search against 10,597 miRNAs (from 127 plant species) and rigorous filtration of the obtained results were performed. In the 8 S. scrofa and 4 H. sapiens samples, the initial number of potential plant miRNAs (with unique sequences) was 149/101/140/23/15/70/98/115 and 4,291/11,664/139/214, respectively. To discard sequences of human and pig origin, the selected reads were verified in two steps. In the first, 26,846 H. sapiens and 3,834 S. scrofa ncRNA sequences were downloaded and supplemented with 155,394 and 64,365 mRNA sequences as well as 1,350 and 769 repeat-associated RNA sequences, respectively. The putative plant miRNA tags were then matched to these generated RNA datasets, and significantly similar sequences were eliminated. In the second verification step, the reduced collections of potential exogenous miRNA sequences from S. scrofa and H. sapiens were mapped to the pig and human genomes, respectively. In this procedure, common RNA editing modifications were taken into account. As a result of these processes, several reads were discarded, which yielded 120/86/49/0/0/56/60/69 and 751/2,435/0/5 total tags, respectively. In the final step of the described analysis, a search against specific human microbiome sequences did not yield any matching reads. Therefore, the remaining tag collections were assumed to be plant-origin miRNAs, of which only molecules represented by five or more reads were considered. The length distribution of the sequences classified as not originating from H. sapiens, S. scrofa or the human microbiome showed that most of these reads (in the datasets from both species) were 21 or 22 nucleotides long (Fig. 2A,B).
Figure 1. The reads collected from the 4 H. sapiens and 8 S. scrofa datasets were each individually cleaned and matched to known plant miRNAs to select all putative food-derived molecules. The matched tags were then subjected to a few filtering steps, which eliminated all human and pig ncRNAs, repeat-associated RNAs, exon fragments and sequences successfully mapped to the respective reference genomes. The remaining reads were additionally examined to find and discard tags that with high probability represent specific microbiome sequences. In the second part of the analysis, human target prediction and annotation were carried out for selected plant miRNAs. These steps are described in detail in the Materials and Methods section. Blue hexagons represent the data used and generated in the subsequent processing/filtering steps (green rectangles) of the analysis.
Figure 2. Summary of the sequence length distribution generated from (A) the S. scrofa and (B) the H. sapiens tags that remained after all processing and verification steps of the performed bioinformatics analysis. Most of the generated reads were 21–24 nucleotides long.
From the bioinformatics analysis herein, 17 plant miRNA species that belong to 11 MIR families were identified in six porcine breast milk exosomes samples. The abundance levels of identified miRNAs were rather low, as well as the variations in plant miRNAs profiles across six samples. The matrix of calculated pairwise Pearson’s correlations is presented in Table 1 (average r = 0.230). Among the identified miRNA families, the MIR167, MIR319 and MIR444 composed the most members. The miRNA species with the highest abundance level (mean value from six samples) were as follows: zma-miR168a, zma-miR156a, ath-miR166a, ath-miR319b and ptc-miR319d.
In contrast to the porcine data analysis, plant miRNA species were identified in only two out of four human breast milk small RNA libraries; however, the Pearson’s correlation between these two samples indicated relatively small differences in the identified plant miRNA profiles (r = 0.675; P-value = 3.14e−09). Examination of the H. sapiens datasets detected 35 miRNAs from 25 MIR families, including one miRNA*, aly-miR157d*. The human and porcine plant miRNA collections show certain similarities. For example, the molecules with the highest abundance levels in the H. sapiens samples include ath-miR166a, pab-miR951, ptc-miR472a, bdi-miR168, aly-miR167d, osa-miR444b.2 and zma-miR156a, and the most numerous families were MIR166 and MIR167. The full lists of plant miRNA species/MIR families identified in porcine and human breast milk exosomes can be found in Tables S1–S8 in File S1 and Tables S9–S12 in File S2, respectively.
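The pairwise comparison of plant miRNA profiles reported above can be illustrated with a minimal sketch of Pearson's correlation computed on read counts. The function names and the example counts are hypothetical, and treating a miRNA that is absent from one sample as a zero count is an assumption; the text does not state how such cases were handled.

```python
import math


def pearson_r(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


def profile_correlation(counts_a, counts_b):
    """Correlate two samples given as {miRNA name: read count} dictionaries.

    Assumption: a miRNA missing from one sample contributes a count of 0.
    """
    names = sorted(set(counts_a) | set(counts_b))
    x = [counts_a.get(name, 0) for name in names]
    y = [counts_b.get(name, 0) for name in names]
    return pearson_r(x, y)


if __name__ == "__main__":
    # Hypothetical read counts, for illustration only.
    sample_1 = {"ath-miR166a": 40, "zma-miR156a": 12, "bdi-miR168": 9}
    sample_2 = {"ath-miR166a": 55, "zma-miR156a": 20, "pab-miR951": 7}
    print(round(profile_correlation(sample_1, sample_2), 3))
```

Applying `profile_correlation` to every pair of samples would give a correlation matrix of the kind summarized in Table 1.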
Prediction and Annotation of Putative Human Target Genes for Plant miRNAs
MiRNA functions in living organisms are associated with binding to a target mRNA, whereupon this mRNA is cleaved or its translation into protein is inhibited. Thus, selecting and annotating potential miRNA targets are the first steps in defining their roles in the cell. To gain initial insight into the probable influence of plant miRNAs on the human organism and to determine whether they may regulate important biological processes, putative targets were predicted for five selected miRNAs with the highest abundance levels. The results generated by intersecting the miRanda, RNAhybrid and PITA predictions suggested that 1,282 unique human mRNAs are potential targets for these plant miRNAs. These mRNA molecules include 369 mRNAs for ath-miR319b, 364 for ctr-miR167, 120 for ath-miR166a, 264 for zma-miR156a and 165 for osa-miR444b.2. The putative targets were further sorted, and the best 20–25 molecules with the highest alignment scores and lowest minimum free energy (MFE) of the structure were selected. Annotation with the Blast2GO and DAVID v6.7 software, with additional KEGG pathway mapping, showed that the predicted targets included several mRNAs of proteins associated with immune system function (e.g., ZEB1, IKAROS2, IL1-RAcPL and IL1RL1), molecules required for mediating hormone responses (e.g., NcoA-1 and MC4R), transcription factors and additional receptors relevant to an organism’s health (e.g., low-density lipoprotein receptor (LDLR), histamine receptor H2 (HRH2) and poliovirus receptor-related protein 4 (PVRL4)). Generally, the proposed human targets code for proteins involved in important biological processes, such as gene expression, steroid biosynthesis, transport, immune responses and starch, purine, and sphingolipid metabolism. The best predicted targets for the selected plant miRNAs, with their GO annotations and the KEGG pathway mapping results, are presented in Tables S13 and S14 in File S3.
Recent advances in experimental and computational analytical approaches have resulted in an explosion of information on miRNA molecules, which play many important roles in a wide range of organisms. Such methods have facilitated the identification of miRNAs profiles in human body fluids (e.g., saliva, blood, urine and milk), which served as biomarkers for detecting and monitoring various pathological conditions in certain circumstances , , . Thus far, most studies have investigated host-origin small RNA molecules; therefore, discovering miRNAs from exogenous species in human serum was surprising and intriguing . Zhang et al. identified food-derived, plant miRNAs, which were sufficiently stable to pass through human GI tract and enter the circulation. Moreover, in vitro and in vivo studies demonstrated that one of these molecules regulate expression of specific gene; thus, influencing certain molecular process in human organism . The aforementioned study was not the only research that reported plant small RNAs in H. sapiens samples. Wang et al. also identified in human plasma a wide range of small RNAs from many different organisms, including food-derived miRNAs .
As for host-origin miRNAs, recent studies indicate that immune-related, endogenous miRNAs are enriched in breast milk exosomes. These molecules are resistant to relatively harsh conditions and thus, are assumed to influence infant immune system development , . Considering aforementioned reports, the following question arises: can food-derived miRNAs pass through GI tract, enter circulation and pack into breast milk exosomes? To answer this question, we performed a rigorous bioinformatics analysis using publicly available, high-throughput sRNA sequencing data from mammalian breast milk exosomes. We processed 12 datasets (4 from H. sapiens and 8 from S. scrofa) and stringently verified the obtained results to avoid sequencing, and searching errors (Fig. 1). Herein, we eliminated the low-quality and endogenous-origin reads, as well as sequences highly likely to be part of the human microbiomes genomes. The tags that remained after these processing steps had length distributions typical for mature plant miRNA sequences; the most abundant reads had 21 and 22 nucleotides (Fig. 2A,B). This can be an additional indication that the aforementioned tags represent small, exogenous-origin RNAs –.
We identified 17 plant miRNAs from 11 MIR families in the porcine breast milk exosomes. In turn, in human breast milk exosomes, we detected 35 exogenous miRNAs belonging to 25 plant MIR families (Tables S1–S8 in File S1 and Tables S9–S12 in File S2). The plant miRNAs profile compositions in breast milk exosomes from both species are consistent with Zhang’s et al. study on food-derived miRNAs in human serum . MiRNAs identified at high levels in blood, such as MIR166a, MIR168a, MIR167d and MIR156a, were also highly abundant in mammalian breast milk exosomes. This observation suggests that plant miRNAs, which are sufficiently stable in serum, may be further transferred to milk exosomes. Moreover, the aforementioned miRNAs are evolutionarily conserved in diverse plant species (including those being an integral part of our daily diet) and typically appear at high levels in their various organs –. Thus, we can assume that even if a large fraction of the food-derived miRNAs was degraded along their pathway to mammal serum, the quantity of identified miRNAs was still sufficient to access the mammary glands and pack into the milk exosomes.
The accuracy of the carefully performed bioinformatics analysis reduced the initial potential plant miRNAs data from 149/101/140/23/15/70/98/115 and 4,291/11,664/139/214 to 120/86/49/0/0/56/60/69 and 751/2,435/0/5 total tags in porcine and human breast milk exosomes, respectively. The rather low abundance of these exogenous-origin molecules can be explained by the fact that data analyzed herein were derived from studies whose main goal was to identify mammalian miRNAs. The protocols and solutions employed in samples preparation, as well as sequencing procedure from aforementioned studies, met the known standards, and are commonly/successfully used in this type of experiments –. What is more important, they were constructed so as to enhance precision and efficiency in identifying animal miRNAs. It has been reported that the 2′-O-methyl modification of plant miRNAs 3′ ends can decrease the RNA ligase efficiency to ligate the adaptor oligonucleotides . Therefore, sub-optimal parameters may reduce the number of plant miRNAs in the sequencing data. The results presented by Zhang et al. raised a debate among scientists; critics suggest that plant miRNAs in animal small RNA samples arise from cross-contamination, sequencing error and bias , . However, it is unlikely that all plant miRNAs identified in animal datasets generated by different techniques and research groups may originate during sequencing or from contamination. In our study, the plant miRNAs have been identified in most, but not all, human and porcine breast milk exosomes samples; while, the correlation calculated between subsets, where these plant molecules were present, was moderate and low, respectively (Table 1). This highlights the high variation in plant miRNAs profiles from breast milk exosomes across different healthy individuals and may be an additional argument supporting the concept of food-derived plant miRNAs abundance rather than samples cross-contamination. 
Clearly, further analyses and experiments are necessary to resolve this issue and to answer a more important question: do food-derived miRNA molecules influence the human organism?
Endogenous miRNAs play many important roles in the cells of living organisms; however, exogenous-origin miRNAs may also regulate expression of specific genes and influence relevant biological processes. Therefore, the second part of our study considers potential target prediction for five identified plant miRNAs with the highest abundance levels: ath-miR319b, ctr-miR167, ath-miR166a, zma-miR156a and osa-miR444b.2. The targets were proposed by three independent methods (miRanda, PITA and RNAhybrid), which use different features for miRNA target prediction. The obtained results were intersected; that is, a target was considered only if it was selected by all three algorithms. Combining methods in this way is a common practice to increase the precision and accuracy of predictions. The proposed targets were further sorted by the highest alignment scores and lowest MFE of the structures, and the top 20–25 hits were selected. Among these predicted molecules, several mRNAs coding for immune-related proteins were found, such as the transcription factor zinc finger E-box-binding homeobox 1 (ZEB1). The ZEB1 and ZEB2 mRNAs are known targets of human miR200 family members. By inhibiting production of the ZEB1 and ZEB2 proteins, miR200 induces the mesenchymal-to-epithelial transition in cancer cell lines and reduces their aggressiveness. Additional molecules that participate in the organism’s immune responses include the receptor accessory protein of IL-18 (IL-18RAcPL) and interleukin-1 receptor-like 1 (IL1RL1). IL1RL1 mediates the biological effects of IL-33, while IL-18RAcP is required for high-affinity binding of interleukin 18, which participates in IFN-gamma production by Th1 cells. Both receptors were shown to be crucial for specifically induced inflammation, the pathogenesis of particular autoimmune disorders and several other immune responses; therefore, they are thought to be promising therapeutic targets.
Apart from immune-related proteins, another essential target molecule was proposed: histamine H2 receptor (HRH2). The HRH2 polypeptide stimulates gastric acid secretion and regulates intestinal secretion as well as gastrointestinal motility. Currently, several known therapies use histamine H2 receptor antagonists to treat peptic ulcers (even relapses) and to affect gastric acid secretion. Treatments based on HRH2 antagonists are also commonly used in gastro-esophageal reflux disease in infants and children. Among the putative plant miRNA targets, one further protein is also of therapeutic importance: nectin-4, alternatively referred to as poliovirus receptor-related protein 4 (PVRL4). This member of the immunoglobulin family has gathered special attention after novel studies showing that the measles virus, which contributes to over 120,000 child deaths each year, uses the nectin-4 protein as a receptor to infect and spread through airway epithelial cells. Complete, detailed information about these molecules and the potential biological impact of inhibiting their gene expression by particular plant miRNAs can be found in Table 2. To ensure that the proposed targets are important and to determine whether they participate in the same biological processes, the predicted molecules were mapped onto metabolic pathways from the KEGG database. This part of the analysis showed that several putative targets may be involved in starch, purine, sphingolipid, drug and amino acid metabolism or in more general processes, e.g., transport, immune responses and transcription regulation. Thus, the identified plant-derived miRNAs may represent novel molecular modulators of these human biological pathways.
Our study shows that plant miRNA molecules are abundant in human and porcine breast milk exosomes. The analysis herein using publicly available, high-throughput sequencing data revealed that the food-derived small RNA composition primarily includes conserved plant miRNAs species and is similar to the composition identified by Zhang et al. in human serum. What is more, the aforementioned molecules may regulate potential human targets genes important in several biological pathways. Although additional experimental evidence is necessary, our analysis may shed new light on exogenous-origin miRNAs, their stability under harsh conditions and potential roles in living organisms. Moreover, our data support a closer look at plant-derived molecules and their properties. Conclusively, the analysis herein shows that the miRNA “world” is broader than previously thought and suggests a novel field that awaits exploration.
Materials and Methods
The 12 raw small RNA sequencing datasets from porcine and human breast milk exosome studies were collected from NCBI Gene Expression Omnibus database records (GEO, http://www.ncbi.nlm.nih.gov/geo/), available under the accession numbers GSE36590 and GSE32253, respectively. These datasets comprised 8 small RNA subsets from milk exosomes of 3 female pigs (from six lactigenous stages: 0, 3, 7, 14, 21 and 28 days after birth) and 4 small RNA subsets from milk exosomes of 4 healthy women (60 days after birth). Each small RNA subset was generated by single-end sequencing in 36 bp reads using the Illumina Genome Analyzer II according to the manufacturer’s instructions. The sample preparation protocol was similar in both studies and included methods and solutions commonly used in similar experiments. Briefly, the protocol comprised:
- breast milk samples collection and storing at −80°C until analyzed,
- centrifugations and further filtering to eliminate fat globules, cells and cellular debris,
- exosomes isolation by the ExoQuick precipitation procedure (System Biosciences Inc., USA),
- total RNA extraction using the TRIzol-LS (Invitrogen, USA) according to manufacturer’s instructions,
- small RNAs analysis with the Agilent Bioanalyzer 2100 and the RNA 6000 Nano LabChip Kit (Agilent, USA),
- small RNAs fraction isolation by the polyacrylamide gel electrophoresis (PAGE),
- Illumina adaptors ligation and conversion to cDNA by the RT-PCR.
Bioinformatics Analysis of Small RNA Tags
Each sRNA dataset was analyzed individually to clean the data, remove unnecessary tags and identify plant miRNA sequences. The full workflow for this analysis is shown in Figure 1.
In the first step of the raw data processing, the adaptor sequences were removed from each read. Then, all low-quality tags were eliminated from the datasets, exactly the sequences with: any N bases, more than 4 bases whose quality score was lower than 10 and more than 6 bases whose quality score was lower than 13. The reads shorter than 17 nucleotides, with a poly A tail, with 5′ primer contaminants and missing insert tags or a 3′ primer were also excluded. The remaining reads were combined into one kind and counted. Next, the sequences that show significant similarity to plant miRNAs were selected. The plant miRNA sequences were downloaded from the Plant MicroRNA Database (PMRD, release of June 2012, http://bioinformatics.cau.edu.cn/PMRD/)  and the BlastN method was used to find tags without any gaps and mismatches in the alignment, and with sequence coverage that differed by no more than one nucleotide. The E-value threshold was set at 0.01. The reads selected at this step were assumed as plant miRNAs sequences and two additional verification stages of the analysis were performed to confirm their exogenous origin. First, each annotated sequence that was highly likely a human or pig small non-coding RNA was filtered from the potential plant miRNAs reads. Herein, the Homo sapiens and Sus scrofa tRNAs, rRNAs, snoRNAs, and snRNAs (available at the Rfam 11.0 release and ENSEMBL 71.0 release) as well as pre-miRNAs and miRNAs sequences (obtained from the miRBase 20.0 release)  were collected. The similarity between the potential plants-origin reads and the aforementioned ncRNAs was investigated, individually for each specie, using the BlastN method with the E-value threshold of 0.01; gaps and mismatches were allowed. The tags homologous to the pig and human ncRNAs, respectively, were discarded from the study. A similar search was performed to eliminate repeat-associated sequences and exon fragments. The H. sapiens and S. 
scrofa mRNAs and repeat-associated RNAs were downloaded from the NCBI database (April 2013, http://www.ncbi.nlm.nih.gov) and Repbase (17.11 release), respectively. Additionally, the human coding sequences (CDS) were obtained from the NCBI CCDS Database (release 11.0, http://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi). After this procedure, the remaining tags were verified in a second step, wherein the reads were mapped to the human and pig genomes, respectively. The Bowtie 0.12.8 software (http://bowtie-bio.sourceforge.net), with one mismatch allowed, was used to map the reads (the potential plant miRNAs) to the S. scrofa (Sscrofa9.53, ftp://ftp.sanger.ac.uk/pub/S_scrofa/assemblies/Ensembl_Sscrofa9/) and H. sapiens (hg19) genomes, respectively. By removing tags that mapped perfectly or near perfectly to these genomes, the sets of putative plant miRNA species were reduced. Considering the RNA editing process, which has been observed in mammals, the potential plant miRNA sequences were once again mapped to the relevant genomes; in this step, however, two mismatches were allowed, including one that represents a common A-to-I edit (e.g., A>G and T>C). If a sequence was successfully mapped to the reference genome, it was eliminated from the plant miRNA dataset. To exclude the possibility that the selected small RNA sequences originated from bacteria, fungi or Archaea, which have been reported as abundant in human plasma and the gastrointestinal tract, the remaining reads were compared to sequences from the human blood, oral, GI tract and skin microbiomes. The reference genomes were downloaded from the Human Microbiome Project website (HMP, http://www.hmpdacc.org/HMRGD/) and aligned to the tags using the BlastN method. The E-value threshold was set at 0.01; gaps and mismatches were not allowed. All reads that met the given criteria were discarded from the study. 
Finally, the remaining tags that were observed at least 5 times were considered credible plant miRNA sequences. To verify the reliability of the analyzed data and highlight variations in plant miRNA profiles across healthy individuals, Pearson’s correlations were calculated separately for the human and pig samples.
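The read-level criteria described in this section (no N bases, at most 4 bases with quality below 10, at most 6 bases with quality below 13, a minimum length of 17 nucleotides, collapsing identical reads and keeping tags seen at least 5 times) can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, quality scores are assumed to be already decoded into integers, and adaptor trimming, poly-A and primer-contaminant checks, as well as the BlastN and Bowtie filtering stages, are not reproduced.

```python
from collections import Counter

MIN_LENGTH = 17   # reads shorter than 17 nt are excluded
MIN_COPIES = 5    # a tag must be observed at least 5 times


def passes_quality(seq, quals):
    """Return True if a read satisfies the quality criteria stated in the text.

    Assumption: `quals` is a list of integer quality scores, one per base.
    """
    if len(seq) != len(quals):
        raise ValueError("sequence and quality lengths differ")
    if len(seq) < MIN_LENGTH:
        return False
    if "N" in seq.upper():
        return False
    below_10 = sum(1 for q in quals if q < 10)
    below_13 = sum(1 for q in quals if q < 13)
    # Reject reads with more than 4 bases below 10 or more than 6 below 13.
    return below_10 <= 4 and below_13 <= 6


def collapse_reads(reads):
    """Collapse identical passing reads into unique tags with copy counts."""
    return Counter(seq for seq, quals in reads if passes_quality(seq, quals))


def credible_tags(tag_counts, min_copies=MIN_COPIES):
    """Keep only tags observed at least `min_copies` times."""
    return {seq: n for seq, n in tag_counts.items() if n >= min_copies}


if __name__ == "__main__":
    # Hypothetical reads, for illustration only.
    good = ("TCGGACCAGGCTTCATTCCCC", [30] * 21)
    rare = ("TGACAGAAGAGAGTGAGCAC", [30] * 20)
    tags = collapse_reads([good] * 6 + [rare] * 2)
    print(credible_tags(tags))
```

In the workflow described above, the copy-number threshold is applied to the tags that remain after all origin filters, so `credible_tags` would be called on that final collection rather than directly on the collapsed raw reads.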
Potential Human Targets Predictions for the Plant miRNAs
Since a recent report indicates that a particular plant miRNA functions as a regulator of mammalian LDLRAP1 protein expression, it was of interest to identify human mRNAs that may interact with selected food-derived miRNAs identified here, which are potentially transferred from milk to the infant’s body via the GI tract. For this reason, putative H. sapiens target predictions were performed using three different computational methods: miRanda (http://www.microrna.org/microrna/getDownloads.do), RNAhybrid 2.1 (http://bibiserv.techfak.uni-bielefeld.de/rnahybrid/dl_pre-page.html) and PITA (http://genie.weizmann.ac.il/pubs/mir07/mir07_exe.html). The miRanda search procedure examines sequence complementarity, interspecies conservation and thermodynamic stability of the miRNA:mRNA duplex, while RNAhybrid 2.1 is a tool for finding the minimum free energy (MFE) of hybridization between a target and a short RNA (e.g., miRNA). The PITA algorithm is based on a miRNA:target interaction model that calculates the difference between the free energy of the miRNA:mRNA bound state and the unbound state (ΔΔG). The described methods have been used successfully in many human miRNA studies and, together, they cover most of the known characteristics of miRNA:target interaction, namely seed complementarity, interspecies conservation, free energy, target-site accessibility and target-site abundance. For each program, specific rules and restrictions were set. The prediction parameters of the miRanda method were as follows: (1) G:U base pairing was permitted but scored lower (score +2) than canonical base pairs (score +5), (2) alignments with gaps and non-canonical base pairs in the “seed” region (nucleotides 2–8 from the 5′ end of the molecule) were discarded, and (3) alignments with scores over 130 and an MFE of the structure less than −17 kcal/mol were selected. 
The RNAhybrid 2.1 selected human mRNAs where the following applied: (1) the hybridization MFE was equal to or below −17 kcal/mol, (2) the maximum bulge loop size was 2 nucleotides, and (3) the maximum internal loop size was 2 nucleotides. The PITA targets search considered only sequences where the 7-8-mer “seed” region did not include mismatches and G:U base pairs, while their calculated ΔΔG scores were below −10. The H. sapiens 3′ UTR, 5′ UTR and CDS sequences, that served as potential plant miRNAs targets, were downloaded from the UCSC Bioinformatics Site (April 2013, http://genome.ucsc.edu/index.html) and NCBI CCDS Database, respectively. Using the intersection of all three methods a consensus list of putative human targets was generated and further sorted by the highest alignment score and lowest MFE of the structure. The top best 20–25 hits were collected. To designate potential processes involving the predicted mRNA sequences and to suggest a probable influence of food-derived plant miRNAs on human organism, the selected targets were annotated using the Blast2GO (http://www.blast2go.com/b2ghome)  and DAVID v6.7 tools (http://david.abcc.ncifcrf.gov/home.jsp) . In the analysis by Blast2GO software, the GO terms were obtained based on the BlastX search against the “nr” NCBI database with the E-value threshold of 1e−6. The KEGG (Kyoto Encyclopedia of Genes and Genomes)  database was also searched; the E-value threshold was sat at 1e−10. The best hits from each annotation were collected.
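The consensus step (intersecting the three prediction sets, then ranking) can be illustrated with plain set operations. This is a sketch under stated assumptions: the function name and input layout are hypothetical, and taking the alignment score and MFE used for ranking from the miRanda output is an assumption, because the text does not say which program supplied the sort keys.

```python
def consensus_targets(miranda, rnahybrid, pita, top_n=25):
    """Return miRNA:target pairs predicted by all three methods, best first.

    miranda   -- dict mapping (mirna, target) to (score, mfe); assumed sort keys
    rnahybrid -- set of (mirna, target) pairs that passed the RNAhybrid rules
    pita      -- set of (mirna, target) pairs that passed the PITA rules
    top_n     -- number of hits to keep (the text reports the top 20-25)
    """
    shared = set(miranda) & set(rnahybrid) & set(pita)
    # Highest score first; ties broken by lowest (most negative) MFE,
    # then by pair name so the order is deterministic.
    ranked = sorted(shared, key=lambda pair: (-miranda[pair][0], miranda[pair][1], pair))
    return ranked[:top_n]
```

A pair predicted by only one or two of the programs never reaches the ranked list, which is what makes the consensus stricter than any single method.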
Contains the following files: Table S1. The plant miRNAs identified in sample 1 of porcine breast milk exosomes collected 0 days after birth. Table S2. The plant miRNAs identified in sample 2 of porcine breast milk exosomes collected 0 days after birth. Table S3. The plant miRNAs identified in sample 3 of porcine breast milk exosomes collected 0 days after birth. Table S4. The plant miRNAs identified in sample of porcine breast milk exosomes collected 14 days after birth. Table S5. The plant miRNAs identified in sample of porcine breast milk exosomes collected 21 days after birth. Table S6. The plant miRNAs identified in sample of porcine breast milk exosomes collected 28 days after birth. Table S7. The summary list of plant-derived miRNAs identified in six samples of porcine breast milk exosomes (collected 0, 14, 21 and 28 days after birth) together with their calculated average reads counts. Table S8. Plant MIR families identified in six samples of porcine breast milk exosomes collected 0, 14, 21 and 28 days after birth.
Contains the following files: Table S9. The plant miRNAs identified in sample 1 of human breast milk exosomes. Table S10. The plant miRNAs identified in sample 2 of human breast milk exosomes. Table S11. The summary list of plant miRNAs identified in two samples of human breast milk exosomes together with their calculated average reads counts. Table S12. Plant MIR families identified in two samples of human breast milk exosomes.
Contains the following files: Table S13. List of the H. sapiens best potential targets predicted for five select plant miRNAs. Table S14. List of the potential KEGG processing pathways, in which the five select plant miRNAs may participate in human organism.
Conceived and designed the experiments: AL PZ. Analyzed the data: AL. Wrote the paper: AL PZ.
- 1. Liu N, Okamura K, Tyler DM, Phillips MD, Chung WJ, et al. (2008) The evolution and functional diversification of animal microRNA genes. Cell Res 18: 985–996. doi: 10.1038/cr.2008.278
- 2. Jones-Rhoades MW (2012) Conservation and divergence in plant microRNAs. Plant Mol Biol 80: 3–16. doi: 10.1007/s11103-011-9829-2
- 3. Ying SY, Chang DC, Lin SL (2008) The microRNA (miRNA): overview of the RNA genes that modulate gene function. Mol Biotechnol 38: 257–268. doi: 10.1007/s12033-007-9013-8
- 4. Hu W, Coller J (2012) What comes first: translational repression or mRNA degradation? The deepening mystery of microRNA function. Cell Res 22: 1322–1324. doi: 10.1038/cr.2012.80
- 5. Yang Z, Ebright YW, Yu B, Chen X (2006) HEN1 recognizes 21–24 nt small RNA duplexes and deposits a methyl group onto the 2′ OH of the 3′ terminal nucleotide. Nucleic Acids Res 34: 667–675. doi: 10.1093/nar/gkj474
- 6. Kawamata T, Tomari Y (2010) Making RISC. Trends Biochem Sci 35: 368–376. doi: 10.1016/j.tibs.2010.03.009
- 7. Zhang B, Wang Q, Pan X (2007) MicroRNAs and their regulatory roles in animals and plants. J Cell Physiol 210: 279–289. doi: 10.1002/jcp.20869
- 8. Wienholds E, Plasterk RH (2005) MicroRNA function in animal development. FEBS Lett 579: 5911–5922. doi: 10.1016/j.febslet.2005.07.070
- 9. Dugas DV, Bartel B (2004) MicroRNA regulation of gene expression in plants. Curr Opin Plant Biol 7: 512–520. doi: 10.1016/j.pbi.2004.07.011
- 10. Kruszka K, Pieczynski M, Windels D, Bielewicz D, Jarmolowski A, et al. (2012) Role of microRNAs and other sRNAs of plants in their changing environments. J Plant Physiol 169: 1664–1672. doi: 10.1016/j.jplph.2012.03.009
- 11. Esteller M (2011) Non-coding RNAs in human disease. Nat Rev Genet 12: 861–874. doi: 10.1038/nrg3074
- 12. De Guire V, Robitaille R, Tetreault N, Guerin R, Menard C, et al. (2013) Circulating miRNAs as sensitive and specific biomarkers for the diagnosis and monitoring of human diseases: promises and challenges. Clin Biochem 46: 846–860. doi: 10.1016/j.clinbiochem.2013.03.015
- 13. Broderick JA, Zamore PD (2011) MicroRNA therapeutics. Gene Ther 18: 1104–1110. doi: 10.1038/gt.2011.50
- 14. Mack GS (2007) MicroRNA gets down to business. Nat Biotechnol 25: 631–638. doi: 10.1038/nbt0607-631
- 15. Motameny S, Wolters S, Nürnberg P, Schumacher B (2010) Next Generation Sequencing of miRNAs – Strategies, Resources and Methods. Genes 1: 70–84. doi: 10.3390/genes1010070
- 16. Liu B, Li J, Cairns MJ (2012) Identifying miRNAs, targets and functions. Brief Bioinform.
- 17. Cortez MA, Bueso-Ramos C, Ferdin J, Lopez-Berestein G, Sood AK, et al. (2011) MicroRNAs in body fluids–the mix of hormones and biomarkers. Nat Rev Clin Oncol 8: 467–477. doi: 10.1038/nrclinonc.2011.76
- 18. Weber JA, Baxter DH, Zhang S, Huang DY, Huang KH, et al. (2010) The microRNA spectrum in 12 body fluids. Clin Chem 56: 1733–1741. doi: 10.1373/clinchem.2010.147405
- 19. Gu Y, Li M, Wang T, Liang Y, Zhong Z, et al. (2012) Lactation-related microRNA expression profiles of porcine breast milk exosomes. PLoS One 7: e43691. doi: 10.1371/journal.pone.0043691
- 20. Zhou Q, Li M, Wang X, Li Q, Wang T, et al. (2012) Immune-related microRNAs are abundant in breast milk exosomes. Int J Biol Sci 8: 118–123. doi: 10.7150/ijbs.8.118
- 21. Zhang L, Hou D, Chen X, Li D, Zhu L, et al. (2012) Exogenous plant MIR168a specifically targets mammalian LDLRAP1: evidence of cross-kingdom regulation by microRNA. Cell Res 22: 107–126. doi: 10.1038/cr.2011.158
- 22. Wang K, Li H, Yuan Y, Etheridge A, Zhou Y, et al. (2012) The complex exogenous RNA spectra in human plasma: an interface with human gut biota? PLoS One 7: e51009. doi: 10.1371/journal.pone.0051009
- 23. Xie F, Frazier TP, Zhang B (2011) Identification, characterization and expression analysis of MicroRNAs and their targets in the potato (Solanum tuberosum). Gene 473: 8–22. doi: 10.1016/j.gene.2010.09.007
- 24. Sun LM, Ai XY, Li WY, Guo WW, Deng XX, et al. (2012) Identification and comparative profiling of miRNAs in an early flowering mutant of trifoliate orange and its wild type by genome-wide deep sequencing. PLoS One 7: e43760. doi: 10.1371/journal.pone.0043760
- 25. Lukasik A, Pietrykowska H, Paczek L, Szweykowska-Kulinska Z, Zielenkiewicz P, et al. (2013) High-throughput sequencing identification of novel and conserved miRNAs in the Brassica oleracea leaves. BMC Genomics 14: 801. doi: 10.1186/1471-2164-14-801
- 26. Fahlgren N, Howell MD, Kasschau KD, Chapman EJ, Sullivan CM, et al. (2007) High-throughput sequencing of Arabidopsis microRNAs: evidence for frequent birth and death of MIRNA genes. PLoS One 2: e219. doi: 10.1371/journal.pone.0000219
- 27. Sunkar R, Jagadeeswaran G (2008) In silico identification of conserved microRNAs in large number of diverse plant species. BMC Plant Biol 8: 37. doi: 10.1186/1471-2229-8-37
- 28. Taylor DD, Zacharias W, Gercel-Taylor C (2011) Exosome isolation for proteomic analyses and RNA profiling. Methods Mol Biol 728: 235–246. doi: 10.1007/978-1-61779-068-3_15
- 29. Quackenbush JF, Cassidy PB, Pfeffer LM, Boucher KM, Hawkes JE, et al. (2014) Isolation of circulating microRNAs from microvesicles found in human plasma. Methods Mol Biol 1102: 641–653. doi: 10.1007/978-1-62703-727-3_34
- 30. Rekker K, Saare M, Roost AM, Kubo AL, Zarovni N, et al. (2014) Comparison of serum exosome isolation methods for microRNA profiling. Clin Biochem 47: 135–138. doi: 10.1016/j.clinbiochem.2013.10.020
- 31. Matsumoto S, Sakata Y, Suna S, Nakatani D, Usami M, et al. (2013) Circulating p53-responsive microRNAs are predictive indicators of heart failure after acute myocardial infarction. Circ Res 113: 322–326. doi: 10.1161/circresaha.113.301209
- 32. Roberts TC, Coenen-Stass AM, Betts CA, Wood MJ (2014) Detection and quantification of extracellular microRNAs in murine biofluids. Biol Proced Online 16: 5. doi: 10.1186/1480-9222-16-5
- 33. Munafo DB, Robb GB (2010) Optimization of enzymatic reaction conditions for generating representative pools of cDNA from small RNA. RNA 16: 2537–2552. doi: 10.1261/rna.2242610
- 34. Dickinson B, Zhang Y, Petrick JS, Heck G, Ivashuta S, et al. (2013) Lack of detectable oral bioavailability of plant microRNAs after feeding in mice. Nat Biotechnol 31: 965–967. doi: 10.1038/nbt.2737
- 35. Zhang Y, Wiggins BE, Lawrence C, Petrick J, Ivashuta S, et al. (2012) Analysis of plant-derived miRNAs in animal small RNA datasets. BMC Genomics 13: 381. doi: 10.1186/1471-2164-13-381
- 36. Sethupathy P, Megraw M, Hatzigeorgiou AG (2006) A guide through present computational approaches for the identification of mammalian microRNA targets. Nat Methods 3: 881–886. doi: 10.1038/nmeth954
- 37. Zhang Y, Verbeek FJ (2010) Comparison and integration of target prediction algorithms for microRNA studies. J Integr Bioinform 7.
- 38. Reczko M, Maragkakis M, Alexiou P, Papadopoulos GL, Hatzigeorgiou AG (2011) Accurate microRNA Target Prediction Using Detailed Binding Site Accessibility and Machine Learning on Proteomics Data. Front Genet 2: 103. doi: 10.3389/fgene.2011.00103
- 39. Alexiou P, Maragkakis M, Papadopoulos GL, Reczko M, Hatzigeorgiou AG (2009) Lost in translation: an assessment and perspective for computational microRNA target identification. Bioinformatics 25: 3049–3055. doi: 10.1093/bioinformatics/btp565
- 40. Williams TM, Moolten D, Burlein J, Romano J, Bhaerman R, et al. (1991) Identification of a zinc finger protein that inhibits IL-2 gene expression. Science 254: 1791–1794. doi: 10.1126/science.1840704
- 41. Park SM, Gaur AB, Lengyel E, Peter ME (2008) The miR-200 family determines the epithelial phenotype of cancer cells by targeting the E-cadherin repressors ZEB1 and ZEB2. Genes Dev 22: 894–907. doi: 10.1101/gad.1640608
- 42. Oboki K, Ohno T, Kajiwara N, Saito H, Nakae S (2010) IL-33 and IL-33 receptors in host defense and diseases. Allergol Int 59: 143–160. doi: 10.2332/allergolint.10-rai-0186
- 43. Chackerian AA, Oldham ER, Murphy EE, Schmitz J, Pflanz S, et al. (2007) IL-1 receptor accessory protein and ST2 comprise the IL-33 receptor complex. J Immunol 179: 2551–2555. doi: 10.4049/jimmunol.179.4.2551
- 44. Debets R, Timans JC, Churakowa T, Zurawski S, de Waal Malefyt R, et al. (2000) IL-18 receptors, their role in ligand binding and function: anti-IL-1RAcPL antibody, a potent antagonist of IL-18. J Immunol 165: 4950–4956. doi: 10.4049/jimmunol.165.9.4950
- 45. Del Valle J, Gantz I (1997) Novel insights into histamine H2 receptor biology. Am J Physiol 273: G987–996.
- 46. Pattichis K, Louca LL (1995) Histamine, histamine H2-receptor antagonists, gastric acid secretion and ulcers: an overview. Drug Metabol Drug Interact 12: 1–36. doi: 10.1515/dmdi.1995.12.1.1
- 47. Indrio F, Riezzo G, Raimondi F, Cavallo L, Francavilla R (2009) Regurgitation in healthy and non healthy infants. Ital J Pediatr 35: 39. doi: 10.1186/1824-7288-35-39
- 48. Vandenplas Y, Hegar B (2000) Diagnosis and treatment of gastro-oesophageal reflux disease in infants and children. J Gastroenterol Hepatol 15: 593–603. doi: 10.1046/j.1440-1746.2000.02169.x
- 49. Muhlebach MD, Mateo M, Sinn PL, Prufer S, Uhlig KM, et al. (2011) Adherens junction protein nectin-4 is the epithelial receptor for measles virus. Nature 480: 530–533. doi: 10.1038/nature10639
- 50. Reymond N, Fabre S, Lecocq E, Adelaide J, Dubreuil P, et al. (2001) Nectin4/PRR4, a new afadin-associated member of the nectin family that trans-interacts with nectin1/PRR1 through V domain interaction. J Biol Chem 276: 43205–43215. doi: 10.1074/jbc.m103810200
- 51. Noyce RS, Richardson CD (2012) Nectin 4 is the epithelial cell receptor for measles virus. Trends Microbiol 20: 429–439. doi: 10.1016/j.tim.2012.05.006
- 52. Zhang Z, Yu J, Li D, Liu F, Zhou X, et al. (2010) PMRD: plant microRNA database. Nucleic Acids Res 38: D806–813. doi: 10.1093/nar/gkp818
- 53. Kozomara A, Griffiths-Jones S (2014) miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res 42: D68–73. doi: 10.1093/nar/gkt1181
- 54. Pruitt KD, Harrow J, Harte RA, Wallin C, Diekhans M, et al. (2009) The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes. Genome Res 19: 1316–1323. doi: 10.1101/gr.080531.108
- 55. Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10: R25. doi: 10.1186/gb-2009-10-3-r25
- 56. Hogg M, Paro S, Keegan LP, O’Connell MA (2011) RNA editing by mammalian ADARs. Adv Genet 73: 87–120. doi: 10.1016/b978-0-12-380860-8.00003-3
- 57. Ramaswami G, Lin W, Piskol R, Tan MH, Davis C, et al. (2012) Accurate identification of human Alu and non-Alu RNA editing sites. Nat Methods 9: 579–581. doi: 10.1038/nmeth.1982
- 58. Ramaswami G, Zhang R, Piskol R, Keegan LP, Deng P, et al. (2013) Identifying RNA editing sites using RNA sequencing data alone. Nat Methods 10: 128–132. doi: 10.1038/nmeth.2330
- 59. Zhu B, Wang X, Li L (2010) Human gut microbiome: the second genome of human body. Protein Cell 1: 718–725. doi: 10.1007/s13238-010-0093-z
- 60. Peterson J, Garges S, Giovanni M, McInnes P, Wang L, et al. (2009) The NIH Human Microbiome Project. Genome Res 19: 2317–2323. doi: 10.1101/gr.096651.109
- 61. Enright AJ, John B, Gaul U, Tuschl T, Sander C, et al. (2003) MicroRNA targets in Drosophila. Genome Biol 5: R1. doi: 10.1186/gb-2003-5-1-r1
- 62. Rehmsmeier M, Steffen P, Hochsmann M, Giegerich R (2004) Fast and effective prediction of microRNA/target duplexes. RNA 10: 1507–1517. doi: 10.1261/rna.5248604
- 63. Kertesz M, Iovino N, Unnerstall U, Gaul U, Segal E (2007) The role of site accessibility in microRNA target recognition. Nat Genet 39: 1278–1284. doi: 10.1038/ng2135
- 64. Augustin R, Endres K, Reinhardt S, Kuhn PH, Lichtenthaler SF, et al. (2012) Computational identification and experimental validation of microRNAs binding to the Alzheimer-related gene ADAM10. BMC Med Genet 13: 35. doi: 10.1186/1471-2350-13-35
- 65. Korkmaz G, Tekirdag KA, Ozturk DG, Kosar A, Sezerman OU, et al. (2013) MIR376A is a regulator of starvation-induced autophagy. PLoS One 8: e82556. doi: 10.1371/journal.pone.0082556
- 66. Yan X, Liang H, Deng T, Zhu K, Zhang S, et al. (2013) The identification of novel targets of miR-16 and characterization of their biological functions in cancer cells. Mol Cancer 12: 92. doi: 10.1186/1476-4598-12-92
- 67. Ma YJ, Yang J, Fan XL, Zhao HB, Hu W, et al. (2012) Cellular microRNA let-7c inhibits M1 protein expression of the H1N1 influenza A virus in infected human lung epithelial cells. J Cell Mol Med 16: 2539–2546. doi: 10.1111/j.1582-4934.2012.01572.x
- 68. Luzi E, Marini F, Giusti F, Galli G, Cavalli L, et al. (2012) The negative feedback-loop between the oncomir Mir-24-1 and menin modulates the Men1 tumorigenesis by mimicking the “Knudson’s second hit”. PLoS One 7: e39767. doi: 10.1371/journal.pone.0039767
- 69. Pandey P, Qin S, Ho J, Zhou J, Kreidberg JA (2011) Systems biology approach to identify transcriptome reprogramming and candidate microRNA targets during the progression of polycystic kidney disease. BMC Syst Biol 5: 56. doi: 10.1186/1752-0509-5-56
- 70. Peterson SM, Thompson JA, Ufkin ML, Sathyanarayana P, Liaw L, et al. (2014) Common features of microRNA target prediction tools. Front Genet 5: 23. doi: 10.3389/fgene.2014.00023
- 71. Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, et al. (2005) Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21: 3674–3676. doi: 10.1093/bioinformatics/bti610
- 72. Huang da W, Sherman BT, Lempicki RA (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4: 44–57. doi: 10.1038/nprot.2008.211
- 73. Kanehisa M, Goto S (2000) KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28: 27–30. doi: 10.1093/nar/28.1.27
- 74. Cheung H, Chen NJ, Cao Z, Ono N, Ohashi PS, et al. (2005) Accessory protein-like is essential for IL-18-mediated signaling. J Immunol 174: 5351–5357. doi: 10.4049/jimmunol.174.9.5351
- 75. Saeki K, Yokoyama J, Wake K (1975) Inhibition of granulation tissue growth by histamine. J Pharmacol Exp Ther 193: 910–917.