Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Transcriptional profiling of two different physiological states of the yak mammary gland using RNA sequencing

  • Fan Jiangfeng ,

    Roles Data curation, Funding acquisition, Investigation, Visualization, Writing – original draft, Writing – review & editing

    fanjf@gsau.edu.cn

    Affiliation College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, Gansu, P. R. China

  • Luo Yuzhu,

    Roles Conceptualization

    Affiliation College of Animal Science and Technology, Gansu Agricultural University, Lanzhou, Gansu, P. R. China

  • Yu Sijiu,

    Roles Project administration

    Affiliation College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, Gansu, P. R. China

  • Cui Yan,

    Roles Methodology

    Affiliation College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, Gansu, P. R. China

  • Xu Gengquan,

    Roles Investigation

    Affiliation College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, Gansu, P. R. China

  • Wang Libin,

    Roles Software

    Affiliation Technology and Research Center of Gansu Province for Embryonic Engineering of Bovine and Sheep & Goat, Lanzhou, Gansu, P. R. China

  • Pan Yangyang,

    Roles Investigation

    Affiliation Technology and Research Center of Gansu Province for Embryonic Engineering of Bovine and Sheep & Goat, Lanzhou, Gansu, P. R. China

  • He Honghong

    Roles Data curation, Software

    Affiliation College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, Gansu, P. R. China

Abstract

Yak milk is superior to common cow milk in nutrients including protein, fat and calories. However, the milk yield of the yak is very much lower compared with other dairy bovines. To understand the molecular mechanisms of lactogenesis, lactation and mammary gland development, mammary tissue samples were taken from five yaks during a dry period (DP, n = 3) and lactation period (LP, n = 2). Two types of cDNA sequence libraries that reflected the different physiological states of the mammary gland were constructed using RNA sequencing technology. After removing reads containing adapters, reads containing poly-N and low-quality reads from the raw data, 45,423,478 to 53,274,976 clean reads were obtained from these libraries. A total of 74.72% to 80.65% of the high-quality sequence reads were uniquely aligned to the BosGru v2.0 yak reference genome. Using the DESeq R package, 360 differentially expressed genes were detected between the two groups when the adjusted P value (padj < 0.05) was used as the cutoff value; this included 192 upregulated and 168 downregulated genes in the yak mammary gland tissue of the DP compared to the LP. A gene ontology analysis revealed that the most enriched GO terms were protein binding, multi-organism process, immune system and others. KEGG pathway analysis indicated that the differentially expressed genes were mostly enriched in Hippo signaling, insulin signaling, steroid biosynthesis and others. The analysis of the up- and downregulated genes provides important insights into the molecular events involved in lactogenesis, lactation and mammary gland development and will guide further research to enhance milk yield and optimize the constituents of yak milk.

Introduction

The yak (Bos grunniens) is a specific type of livestock that is mainly distributed in the Qinghai-Tibetan Plateau and adjacent regions. It occupies a very important and irreplaceable ecological and economical position in local regions. It has also been regarded as an iconic symbol of Tibet and high altitudes. At present, there are approximately 14 million yaks in the world, of which more than 90% are in China [1]. The yak is an abundant source of milk, meat and other necessities for Tibetans. Yak milk is also the raw material of local residents’ traditional food, such as milk tea, butter, cheese and others. Yak milk is superior to common cow milk in nutrients including protein, fat and calories. However, the yak has a much lower milk yield compared with other dairy bovines due to conditions of extreme harshness, including extreme cold and low air pressure and oxygen content with high solar radiation at high altitudes in the Tibetan Plateau. The milk yield of a yak is usually 200~300 kg, the lactation duration is just 150~200 d, and the daily milk yield is commonly 1.5~2.0 kg [24]. The very low milk yield has confused researchers for a long time. A reasonable approach to improving the milk yield of the yak would be to investigate the key genes and understand the regulatory mechanism of lactation.

High-throughput mRNA sequencing (RNA-seq) offers the ability to survey the genes and transcripts that are expressed in a biological sample under a specific physiological or pathological condition. It also provides a far more precise measurement of the levels at which genes and transcripts are expressed compared to other methods [4,5]. A large number of investigations have revealed the transcriptional profiles of different organs and tissues of various livestock using RNA-seq technology in recent years [4]. In the yak, RNA-seq was used to construct a tissue-specific cDNA library and survey the important regulatory genes related to reproduction [6], coat color [7] and others. Regarding the physiological process of lactation, many studies have been conducted on the mammary gland or milk of bovine [2,814] and other animals [1517]. In addition, Wang et al. [9] identified 517 annotated differentially expressed genes in mammary tissues from three yaks at 1 and 30 d after parturition using Affymetrix Bovine Genome Arrays. However, beyond that, limited research has been published on the detailed characterization of global genes expressed in the yak mammary gland. In the present study, we used RNA sequencing (RNA seq) technology to examine the genes expressed in the mammary gland of the yak at a dry period (DP) and a lactation period (LP). These data may provide candidate genes with a high probability of having functional roles in regulating the lactation process in the yak mammary gland and providing a theoretical basis for the improvement of the milk yield of the yak.

Materials and methods

Animals

All procedures involving animals were approved by the Animal Care and Use Committee of Gansu Agricultural University where the experiment was conducted. Five female yaks at 6~8 years-of-age were selected and raised in Xiahe County (Gansu Province, China) by a local herdsman. These yaks were raised under a similar environment with natural grazing and had free access to water with no supplementary feeding. The yaks were divided into two groups. Three of them (DP1, DP2, DP3) were in the dry period (DP group: calved and ablactated in the winter of the last year with no pregnancy and no milking at the time of sample collection) and two of them (LP1, LP2) were in the lactation period (LP group: calved in the spring of the current year and were suckling the calf at the time of sample collection). The selected female yaks were slaughtered in June 2016, and the mammary gland tissues were gathered and immersed into liquid nitrogen within 20 min after death.

Total RNA isolation and quality evaluation

Total RNA was isolated from each tissue sample using Trizol reagents (Cat. No. 15596018, Invitrogen, Carlsbad, CA) according to the manufacturer’s instructions. The integrity of the isolated RNA was assessed using the RNA Nano 6000 Assay Kit of the Bioanalyzer 2100 system (Agilent Technologies, CA,USA), and the concentration of RNA was measured using the Qubit® RNA Assay Kit (Cat. No. Q32852, Invitrogen, USA) in a Qubit® 2.0 Fluorometer (Life Technologies, USA).

Library preparation and RNA sequencing

In the present study, 3 μg of RNA per sample and the NEBNext® Ultra™ RNA Library Prep Kit for Illumina® (NEB, USA) were used to generate the sequence libraries according to the manufacturer’s recommendations. In brief, after purification from total RNA using poly-T oligo-attached magnetic beads, mRNA was split into fragments using divalent cations under an elevated temperature in NEBNext First Strand Synthesis Reaction Buffer(5X). Then, first strand cDNA was synthesized using M-MuLV Reverse Transcriptase(RNase H-)and random hexamer primers. Subsequently, second strand cDNA was synthesized using DNA Polymerase I and RNase H. The remaining overhangs were converted into blunt ends via exonuclease/polymerase activities. After adenylation of the 3’ ends of DNA fragments, NEBNext Adaptors with a hairpin loop structure were ligated for further hybridization. The library fragments were purified with the AMPure XP system (Beckman Coulter, Beverly, USA) to preferentially select cDNA fragments of 150~200 bp in length. Then, 3 μl of USER Enzyme (NEB, USA) was used with size-selected adaptor-ligated cDNA at 37°C for 15 min followed by 5 min at 95°C before PCR. Then, PCR was performed with Universal PCR primers, Index (X) Primer and Phusion High-Fidelity DNA polymerase. Finally, after PCR products were purified (AMPure XP system) and library quality was assessed on the Agilent Bioanalyzer 2100 system, the library preparations were sequenced on an Illumina Hiseq platform at Beijing Novogene Bioinformatics Technology Co., Ltd. (Beijing, China).

Read mapping

Reference genome and gene model annotation files of the yak (BosGru v2.0, https://www.ncbi.nlm.nih.gov/assembly/GCF_000298355.1/) were downloaded from the genome website directly. An index of the reference genome was built using Bowtie v2.2.3, and paired-end clean reads were aligned to the reference genome using TopHat v2.0.12. TopHat was selected as the mapping tool because it can generate a database of splice junctions based on the gene model annotation file and thus provide a better mapping result than other non-splice mapping tools.

Quantification of gene expression levels

HTSeq v0.6.1 was used to count the reads numbers mapped to each gene. Then, the expected number of Fragments Per Kilobase of transcript sequence per Millions base pairs sequenced (FPKM) of each gene was calculated based on the length of the gene and reads count mapped to this gene. Considers the effect of sequencing depth and gene length for the reads count at the same time, FPKM is currently the most commonly used method for estimating gene expression levels [18].

Differential expression analysis

A differential expression analysis between the DP and LP was performed using the DESeq R package (1.18.0). DESeq provides statistical methods for determining differential expression in digital gene expression data using a model based on the negative binomial distribution. The resulting P values were adjusted using the Benjamini and Hochberg’s approach for controlling the false discovery rate. Genes with an adjusted P value <0.05 found by DESeq were assigned as differentially expressed.

Annotations of differentially expressed genes

To extract biological meanings from differentially expressed genes, a gene enrichment analysis was performed with Gene Ontology (GO) [19] and Kyoto Encyclopedia of Genes and Genomes databases (KEGG, http://www.genome.jp/kegg) [20,21].

Results

Summary of the sequence reads

In the present study, two types of cDNA sequence libraries were constructed using mammary tissues of the dry period (DP) and lactation period (LP) from the yak. A total of 47,006,966 to 54,251,010 raw reads were generated for each sample. After removing reads containing adapters, reads containing poly-N and low-quality reads from raw data, 45,423,478 to 53,274,976 clean reads were obtained. At the same time, the Q20, Q30 and GC content of the clean data were calculated. The results indicated that the base composition of the raw sequence data was satisfactory and the ratio of the low-quality base sequence reads (<20) was very low (Table 1). All the downstream analyses were based on the clean reads with high quality.

thumbnail
Table 1. The data and qualities of RNA sequencing for five samples.

https://doi.org/10.1371/journal.pone.0201628.t001

Using TopHat v2.0.12, 76.85% to 84.56% of the total reads were aligned to the BosGru v2.0 (https://www.ncbi.nlm.nih.gov/assembly/GCF_000298355.1/) yak reference genome and 74.72% to 80.65% of the reads were uniquely mapped to the yak genome for each sample. The statistics results indicated that the percentage of reads mapped to the yak reference genome exon was over 52.5% in all these samples (Table 2).

thumbnail
Table 2. The summary of reads mapped to the yak reference genome (BosGru v2.0).

https://doi.org/10.1371/journal.pone.0201628.t002

An additional analysis using rMATS (http://rnaseq-mats.sourceforge.net/index.html) showed that there were two major alternative splice (AS) patterns in the yak transcriptome data, which included skipped exons (SEs) and mutually exclusive exons (MXEs). A total of 10,924 SE events were found in all these samples using Junction Counts plus read on target detection, in which nine SE events were different (six events were upregulated, and 3 events were downregulated in the yak mammary tissues of the DP compared with that of the LP). The statistics of the alternative splice (AS) types and amount is showed in Table 3.

thumbnail
Table 3. The statistics of alternative splice (AS) types and amount.

https://doi.org/10.1371/journal.pone.0201628.t003

Quantification of gene expression level

In the present study, HTSeq v0.6.1 was used to quantify the expression level of mapped genes in each sample, and the FPKM (expected number of Fragments Per Kilobase of transcript sequence per million base pairs sequenced) value of each gene was calculated based on the length of the gene and reads count mapped to this gene. In our analysis, the number of expressed genes (FPKM values > 1.0) ranged from 12,402 (49.85%) to 16,068 (64.59%) in all five samples. Among these genes, 793 (3.19%) to 2632 (10.58%) genes were highly expressed with FPKM values > 60.0. The statistics of gene distribution at different expression levels is shown in Table 4. Additional analyses indicated that 13,443 genes were expressed in both the DP and LP; furthermore, 2,607 and 233 genes were specifically expressed in the DP and LP, respectively (mean FPKM value > 1.0). We also show a Venn diagram (Fig 1), density distribution (Fig 2A) and violin diagram (Fig 2B) of FPKM to visually demonstrate the expression level of all the genes in the yak mammary gland tissues of the dry period (DP) compared to those of the lactation period (LP).

thumbnail
Fig 1. A venn diagram of the amounts of the expressed genes in two groups.

https://doi.org/10.1371/journal.pone.0201628.g001

thumbnail
Fig 2.

The Density Distribution (A) and Violin (B) Diagram of the FPKM in Two Groups.

https://doi.org/10.1371/journal.pone.0201628.g002

thumbnail
Table 4. The statistics of genes at different expression levels.

https://doi.org/10.1371/journal.pone.0201628.t004

The correlation of transcript expression between samples is the most important indicator of the reliability of the experimental results and rationality of the sampling. In the present study, the square of the Pearson correlation coefficient (r2) was calculated based on the normalized FPKM values to reflect the correlation of transcript expression between samples. The results indicated that r2 ranged from 0.892 to 0.969 between the different samples of each group. All of the correlation values reached 0.80, which means that the selection of every sample in the same group was satisfactory and the results were reliable. In addition, the r2 values between the samples of the DP group were as high as 0.92 (an ideal r2 value for the best sample-selected condition). The transcript expression of different biological replicates at each stage (dry period and lactation period) is shown in the scatter plot (Fig 3).

Identification of differentially expressed genes between the DP and LP

To better survey the regulatory mechanism of lactogenesis, milk secretion and mammary gland development, it is important to identify the differentially expressed genes of mammary gland tissue between the dry period (DP) and lactation period (LP). The DESeq R package (1.18.0) was used to perform a differential expression analysis. The results indicated that 360 differentially expressed genes were detected between the DP and LP when the adjusted P value (padj < 0.05) was used as a cutoff value. Of these, 192 genes were upregulated while 168 genes were downregulated in the yak mammary gland tissue of the dry period compared to that of the lactation period. The distribution of the differentially expressed genes is shown in a volcano plot (Fig 4).

thumbnail
Fig 4. A volcano plot of differentially expressed genes between two groups.

https://doi.org/10.1371/journal.pone.0201628.g004

In addition, a systematic cluster analysis of the differentially expressed genes was performed to gain further insight into the similarities at the transcriptome scale among the yak mammary tissues obtained from the dry period and lactation period. The results indicated that the individual samples belonging to the same group were clustered together; furthermore, the patterns based on the relative expression levels of differentially expressed genes had obvious differences between the two groups (Fig 5). These results implied that the samples used in the present study were selected appropriately and the grouping of these samples was reasonable.

thumbnail
Fig 5. Clustering of the differentially expressed genes in the yak mammary gland during the two periods.

Columns indicate individual samples, rows represent each differentially expressed gene, and the color scale represents the relative expression level of the differentially expressed genes.

https://doi.org/10.1371/journal.pone.0201628.g005

GO enrichment analysis of differentially expressed genes

A Gene Ontology (GO) enrichment analysis of 360 differentially expressed genes was implemented by the GOseq R package. The differentially expressed genes were categorized into three major functional types: biological process, molecular function, and cellular component. The top GO terms of enriched genes were protein binding, multi-organism process, immune system, and immune response, in which 115, 27, 11, and 11 genes were distributed, respectively. We show the most enriched 30 GO terms in a histogram (Fig 6). The number of upregulated and downregulated genes of every GO term in the DP compared with the LP is also reflected in the histogram.

thumbnail
Fig 6. The most enriched 30 GO terms (DP vs LP).

Biological Process: 1. Multi-organism process; 2. Immune response; 3. Immune system process; 4. Microtubule-based process; 5. Transition metal ion transport; 6. Regulation of cytoskeleton organization; 7. Regulation of organelle organization; 8. Ferrous iron transport; 9. Regulation of microtubule-based process; 10. Regulation of microtubule cytoskeleton organization; 11. Binding of sperm to the zona pellucida; 12. Cell recognition; 13. Cell-cell recognition; 14. Sperm-egg recognition; 15. Polysaccharide transport; 16. Polysaccharide localization; 17. Plant-type cell wall organization; 18. Plant-type cell wall organization or biogenesis.Cellular Component: 19. Anchored component of membrane. Molecular Function: 20. Protein binding; 21. Transition metal ion transmembrane transporter activity; 22. Cytokine activity; 23. Ferrous iron transmembrane transporter activity; 24. Iron ion transmembrane transporter activity; 25. Chemokine activity; 26. Chemokine receptor binding; 27. Kinase binding; 28. Polysaccharide transmembrane transporter activity; 29. Plastoquinol—plastocyanin reductase activity; 30. Oxidoreductase activity, acting on diphenols and related substances as donors, with copper protein as the acceptor.

https://doi.org/10.1371/journal.pone.0201628.g006

KEGG enrichment analysis of differentially expressed genes

KOBAS software was used to test the statistical enrichment of the differential expression of genes in KEGG pathways. In the yak mammary gland tissue of the DP compared with that of the LP, 192 upregulated genes were involved in 106 KEGG pathways; furthermore, 168 downregulated genes were involved in 135 KEGG pathways. The most enriched 20 KEGG pathways of up- and downregulated genes are shown in a scatter diagram (Fig 7). The diagram indicates that the upregulated genes were mostly enriched in the Hippo signaling, basal cell carcinoma, glutamatergic synapse, cytokine-cytokine receptor interaction, cell adhesion molecules (CAMs), vitamin digestion and absorption, axon guidance, chemokine signaling pathway, signaling pathways regulating pluripotency of stem cells, DNA replication, and graft-versus-host disease KEGG pathways; furthermore, the downregulated genes were mostly enriched in the insulin signaling, circadian entrainment, protein processing in the endoplasmic reticulum, platelet activation, salivary secretion, pantothenate and CoA biosynthesis, steroid biosynthesis, protein export, inflammatory mediator regulation of TRP channels, long-term depression, cGMP-PKG signaling pathway, PPAR signaling, p53 signaling and other KEGG pathways. In addition, although not significant, we identified two downregulated genes enriched in the mTOR signaling pathway (P Value = 0.14).

thumbnail
Fig 7.

The KEGG Pathway Enrichment of up- (A) and Downregulated Genes (B) in DP Compared with the LP.

https://doi.org/10.1371/journal.pone.0201628.g007

Discussion

The mammary gland is a dynamic organ that undergoes structural, physiological and functional changes during the female reproductive cycle. It can be divided into different periods according to the physiological processes of the mammary gland. In the dry period, a large-scale degenerating change occurs and lactation activity gradually ceases. The physiological processes within the mammary gland at this period include the following: a degenerating change in the cellular architecture occurs during the first days of active involution, possibly caused by the high intra-alveolar pressure. Then, extensive histological changes affect the synthesis of the milk constituents. During the steady state of involution, there is a loss of integrity of the mammary epithelium, and various substances enter the mammary lumen from the blood [22]. In addition, the dry period is essential for the mammary gland to regenerate after the previous lactation. In addition, it is also necessary to render an appropriate milk yield during the subsequent period [23]. On the other hand, the mammary gland is characterized by extensive mammogenesis and lactogenesis in the lactation period [9]. Therefore, it is obvious that these two periods provide perfect models for the comparative analysis of mammary gland development and lactation. In the present study, five yaks, of which three were in the dry period and two were in the lactation period, were selected as the object of research to investigate differentially expressed genes. The selection and grouping of these animals was reasonable and meaningful. We also constructed two types of cDNA sequence libraries using yak mammary tissues of the dry period (DP) and lactation period (LP) for the first time. In view of the differentially expressed genes between the two periods and the subsequent correlation and clustering analyses, the results of the animal grouping are reliable and accurate.

The most important aspect of mammary gland research must involve investigating lactation-related gene expression. Gene expression profiling of mammary tissue is a powerful tool that has been used to catalog and characterize the role of genes expressed during lactation. In the past few years, the transcriptome has been widely analyzed using microarrays [2428]. In recent years, however, next-generation sequencing (NGS) technology has been used to explore differentially expressed genes in the mammary gland tissue or milk under different physiological and pathological conditions [5,17,2931]. This research has revealed that an abundance of genes plays a very complex and important role in the regulation of lactation and mammary gland tissue health. Cui et al. [29] generated a bovine transcriptome from the mammary glands of four lactating Holstein cows with extremely high and low phenotypic values of milk protein and fat percentage using RNA-seq techniques. They obtained 48,967,376~75,572,578 uniquely mapped reads that covered 82.25% of the annotated transcripts. Among them, 31 differentially expressed genes (p < 0.05, false discovery rate q < 0.05) between the high and low groups of cows were revealed. Gene ontology and pathway analyses demonstrated that the 31 differentially expressed genes were enriched in specific biological processes with regard to protein metabolism, fat metabolism, and mammary gland development (p < 0.05). An integrated analysis of differential gene expression indicated that TRIB3, SAA (SAA1, SAA3, and M-SAA3.2), VEGFA, PTHLH, and RPL23A were the most promising candidate genes affecting milk protein and fat percentage. However, there has been no similar report about transcriptome research of the mammary gland in yaks using RNA-seq techniques. The present study has described the comprehensive mammary gland transcriptome of the yak during the dry period (DP) and lactation period (LP) for the first time. cDNA sequence libraries of the dry period (DP) and lactation period (LP) were constructed. A total of 45,423,478 to 53,274,976 clean reads were obtained and 76.85% to 84.56% of the total reads were aligned to the BosGru_v2.0 (https://www.ncbi.nlm.nih.gov/assembly/GCF_000298355.1/) yak reference genome. In addition, 74.72% to 80.65% of the reads were uniquely mapped to the yak genome for each sample. Compared with reports for the non-model organisms yak [32], sheep [33], pigs [34] and bovine [35], in which 61.4%~75.0% of reads mapped to the reference genome, our mapping percentage was almost comparable or better, which ensures a reliable downstream analysis. This study provides valuable materials for elucidating the molecular mechanisms of lactation regulation in the yak.

During the different stages of lactation, there are distinct changes of mammary tissue development and function. The dry period (DP) and lactation period (LP) are two entirely different stages of mammary gland development during the lactation cycle. To better survey the molecular mechanism of lactation, it is important to identify the differentially expressed genes (DE genes) between the two stages. In the yak mammary gland, 360 DE genes were detected between the two groups when the adjusted P value (padj < 0.05) was used as a cutoff value. Our gene ontology analyses indicated that the DE genes were annotated in different functional types. In the yak mammary gland tissue of the dry period compared to the lactation period, some downregulated genes were most enriched in the biological process that was related to the biosynthesis of milk components, such as the GO terms of polysaccharide transport (two genes were downregulated while 0 were upregulated) and polysaccharide localization (two genes were downregulated while 0 were upregulated). In addition, some downregulated genes were enriched in the biological process that was related to cell proliferation, such as the GO terms of regulation of cytoskeleton organization (four genes were downregulated while two were upregulated) and regulation of organelle organization (four genes were downregulated while two were upregulated). These results suggest that there is an adaptation in the mammary gland at the transcriptional level in the dry period and many genes that are involved in lactogenesis, lactation and mammary gland development are inactivated.

Furthermore, KEGG pathway analysis of the DE genes provided significant insight into the potential biological pathways activated or inhibited in response to lactogenesis and lactation in the yak mammary gland. In the present study, the strongly impacted KEGG pathways were ‘Hippo signaling pathway’, ‘Cytokine-cytokine receptor interaction’, ‘Cell adhesion molecules (CAMs)’, ‘Vitamin digestion and absorption’, ‘Insulin signaling pathway’, ‘Protein processing in endoplasmic reticulum’, ‘Steroid biosynthesis’, ‘Protein export’, ‘PPAR signaling pathway’, ‘p53 signaling pathway’, ‘GnRH signaling pathway’, ‘Oxytocin signaling pathway’, and ‘Estrogen signaling pathway’. These data may provide candidate genes with a high probability of having functional roles in regulating the change between the dry period and lactation period in the yak mammary gland.

Although the Hippo pathway has complex characteristics and a cross-linking dialogue with other signaling pathways, it has been acknowledged to play an important role in the regulation of organ size, cell cycle, apoptosis and maintenance of homeostasis. In response to high cell densities, activated LATS1/2 phosphorylates the transcriptional coactivators YAP and TAZ, promoting their cytoplasmic localization, leading to cell apoptosis and restricting organ size overgrowth. When the Hippo pathway is inactivated at low cell density, YAP/TAZ translocates into the nucleus to bind to the transcription enhancer factor (TEAD/TEF) family of transcriptional factors to promote cell growth and proliferation [36]. Recent findings suggested that the Hippo pathway regulates female reproductive system development [3739] and tumorigenesis [40,41]. Therefore, more and more researchers pay attention to Hippo pathway research. Aside from research related to breast cancer [4244], the physiological role of Hippo signaling in mammary gland development remains poorly understood. Chen et al. [45] used mouse genetics to interrogate Hippo signaling in the mammary gland in vivo. The results indicated that the Hippo pathway is genetically dispensable in virgin mammary glands but specifically required during pregnancy when mammary epithelia undergo rapid growth and terminal differentiation. In the present study, the results of KEGG pathway analysis indicate that the most enriched pathway term is Hippo signaling. A total of seven DE genes that belong to this pathway were upregulated in the yak mammary gland of the dry period compared to the lactation period. This implies that the development of the yak mammary gland is probably inhibited through the Hippo pathway during the dry period in the yak.

As an important pathway for DE genes, the insulin signaling pathway was enriched by downregulated genes in the yak mammary gland tissue of the dry period compared with the lactation period. Insulin is known to be an important regulator of milk secretion in the lactating mammary gland. Neville et al. [46] examined the role of insulin signaling in mammary development in the mouse. They found that the insulin receptor (IR) plays an important role both in alveolar development and differentiation of the mouse mammary gland. Their experiments conducted on mammary epithelial cells (MECs) isolated from pregnant mice have also verified that insulin can stimulate lumen formation, mammary cell size, acinar size, acinar casein content, and the formation of lipid droplets. Using an array analysis, they found significant downregulation of differentiation-specific genes and upregulation of cell cycle and extracellular matrix genes. Therefore, they believe that insulin stimulates differentiation and may inhibit cell proliferation in the mammary gland of the mid-pregnant mouse. Recently, Cohick [47] summarized the available data that address a role for insulin in the secretory differentiation of the mammary gland. Abundant data expanded our understanding of the role for insulin in functional differentiation that includes both transcriptional and post-transcriptional regulation of multiple genes involved in the development of the mammary gland and lactation process. In the present study, insulin signaling is also the most enriched pathway term of the downregulated DE genes in the dry period yak mammary gland compared with the lactation period. There are six genes involved in this pathway. These genes are related to fatty acid biosynthesis, gluconeogenesis and protein synthesis and proliferation/differentiation processes, such as FBP1 (fructose-bisphosphatase 1, Novel02167, log2 fold change = -5.3093), CALM (calmodulin, ENSP00000403491-D4, log2 fold change = -1.6542, ENSP00000379210-D7, log2 fold change = -1.7085), RPS6 (ribosomal protein S6, ENSBTAP00000045373-D8, log2 fold change = -1.7901), EIF4EBP1 (eukaryotic translation initiation factor 4E binding protein 1, ENSBTAP00000039615-D1, log2 fold change = -2.1687), and PTPRF (protein tyrosine phosphatase receptor type F, ENSBTAP00000024046-D1, log2 fold change = -1.6001). These data may provide candidate genes with a high probability of having functional roles in regulating lactogenesis and mammary gland development in the yak.

In the present study, steroid biosynthesis and the mTOR signaling pathway were also found to be enriched by downregulated genes in the yak mammary gland of the dry period compared with the lactation period. The important role of steroids for mammary gland growth and development has been commonly accepted by many researchers [48]. Recently, Shu et al. [8] found that the pathway of steroid biosynthesis was upregulated from late pregnancy to peak lactation in the porcine mammary gland.

In addition, a master role for mTOR in the regulation of protein synthesis, particularly translation, in all tissues of mammals has been well defined [49]. Several lines of evidence support a pivotal role for mTOR in milk protein synthesis in mice and cows [5053]. As described previously, the networks of factors involved in the regulation of milk protein synthesis in the bovine include the mTOR signaling pathway [53]. Similar data that highlighted a role for mTOR in the regulation of milk protein synthesis were also obtained both in rodents [54] and ruminants [55]. Recently, a phenomenon of mTOR pathway activation was reported in the transition of the porcine mammary gland [8]. Our results are consistent with this previous research. Therefore, we believe that the inhibition of the synthesis of milk constituents such as fatty acids and milk protein is probably related to the downregulation of steroid biosynthesis and the mTOR signaling pathway in the yak mammary gland during the dry period compared with the lactation period.

In summary, lactogenesis, milk secretion and mammary gland development are very complex processes. A large number of genes are involved in the regulation of these processes. Investigations of the differential expression of genes between different physiological stages of the mammary gland only provide a way to identify candidate genes with a high probability of having functional roles in these processes. Further analyses of these gene functions are necessary for us to have a complete understanding of the molecular mechanism of lactation regulation.

Conclusions

This study investigated the character of the mammary gland transcriptome in the yak using RNA-seq. Two types of cDNA sequence libraries were constructed using the yak mammary tissues of a dry period (DP) and a lactation period (LP). Then, a total of 360 differentially expressed genes were detected between the two groups when the adjusted P value (padj < 0.05) was used as a cutoff value. Although the functions of many of these DE genes are not yet thoroughly elucidated in the yak mammary gland, our GO and KEGG analysis of the up- and downregulated genes provides important insights into the molecular events involved in lactogenesis, lactation and mammary gland development and will guide further research to enhance milk yield and optimize the constituents of yak milk.

References

  1. 1. Qiu Q, Zhang G, Ma T, Qian W, Wang J, Ye Z, et al. The yak genome and adaptation to life at high altitude. Nat Genet. 2012;8: 946–949.
  2. 2. Zhou M, Jiang S, Xie R, Luo X, Yang P. Study on the milk yield pattern of Maiwa yak. Animal Husbandry and Feed Science. 2014;6: 261–263.
  3. 3. He S, Ma Y, Wang J, Li Q, Yang X, Tang S, et al. Milk fat chemical composition of yak breeds in China. J Food Compost Anal. 2011;24: 223–230.
  4. 4. Cui GX, Yuan F, Degen AA, Liu SM, Zhou JW, Shang ZH, et al. Composition of the milk of yaks raised at different altitudes on the Qinghai–Tibetan Plateau. Int Dairy J. 2016;59: 29–35.
  5. 5. Medrano JF, Rincon G, Islas-Trejo A. Comparative analysis of bovine milk and mammary gland transcriptome using RNA-Seq. Trejo. 2010;852.
  6. 6. Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009;10: 57. pmid:19015660
  7. 7. Guangxin E, An T, Luo X. Identification of long noncoding RNA and mRNA in yak coat color using RNAseq. Res J Biotechnol. 2017;12: 1–3.
  8. 8. Shu DP, Chen BL, Hong J, Liu PP, Hou DX, Huang X, et al. Global transcriptional profiling in porcine mammary glands from late pregnancy to peak lactation. OMICS. 2012;16: 123–137. pmid:22401658
  9. 9. Wang Y, Zhu J, Cai H, Yang Y, Jiang M. Crucial genes at the onset of lactation revealed by transcriptome screening of Domestic Yak mammary gland. Can J Anim Sci. 2016;97: 448–455.
  10. 10. Cánovas A, Rincon G, Islastrejo A, Wickramasinghe S, Medrano JF. SNP discovery in the bovine milk transcriptome using RNA-Seq technology. Mamm Genome. 2010;21: 592–598. pmid:21057797
  11. 11. Cong L, Cai W, Zhou C, Yin H, Zhang Z, Loor JJ, et al. RNA-Seq reveals 10 novel promising candidate genes affecting milk protein concentration in the Chinese Holstein population. Sci Rep. 2016;6: 26813. pmid:27254118
  12. 12. Mccabe M, Waters S, Morris D, Kenny D, Lynn D, Creevey C. RNA-seq analysis of differential gene expression in liver from lactating dairy cows divergent in negative energy balance. BMC Genomics. 2012;13: 193. pmid:22607119
  13. 13. Cã nA, Rincã N G, Bevilacqua C, Islas-Trejo A, Brenaut P, Hovey RC, et al. Comparison of five different RNA sources to examine the lactating bovine mammary gland transcriptome using RNA-Sequencing. Sci Rep. 2015;4: 5297–5297.
  14. 14. Seo M, Lee HJ, Kim K, Caetano-Anolles K, Jin YJ, Park S, et al. Characterizing milk production related genes in holstein using RNA-seq. Asian-Australas J Anim Sci. 2016;29: 343–351. pmid:26950864
  15. 15. Crisà A, Ferrè F, Chillemi G, Moioli B. RNA-Sequencing for profiling goat milk transcriptome in colostrum and mature milk. BMC Vet Res. 2016;12: 264. pmid:27884183
  16. 16. Zhang C, Zhao Y, Wang Y, Wu H, Fang X, Chen H. Deep RNA sequencing reveals that microRNAs play a key role in lactation in rats. J Nutr. 2014;144: 1142–1149. pmid:24899157
  17. 17. Suárezvega A, Gutiérrezgil B, Klopp C, Tosserklopp G, Arranz JJ. Comprehensive RNA-Seq profiling to evaluate lactating sheep mammary gland transcriptome. Sci Data. 2016;3: 160051. pmid:27377755
  18. 18. Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, Van Baren MJ, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28: 511–515. pmid:20436464
  19. 19. Harris MA, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2004;32: D258–261. pmid:14681407
  20. 20. Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004;32: D277. pmid:14681412
  21. 21. Kanehisa M, Goto S, Hattori M, Aokikinoshita KF, Itoh M, Kawashima S, et al. From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006;34: D354. pmid:16381885
  22. 22. Mt S, Hoedemaker M, Krömker V. [Physiological processes in the mammary gland tissue of dairy cows during the dry period]. Berl Munch Tierarztl Wochenschr. 2015;128: 76–83. pmid:25876288
  23. 23. Bernierdodier P, Girard CL, Talbot BG, Lacasse P. Effect of dry period management on mammary gland function and its endocrine regulation in dairy cows. J Dairy Sci. 2011;94: 4922–4936. pmid:21943744
  24. 24. Lu SL, Guo A, Becker K, Siegelmann H, Sebastiani P, Macbeth K, et al. Microarray analysis of global gene expression in the mammary gland following estrogen and progesterone treatment of ovariectomized mice. Cancer Epidemiol Biomarkers Prev. 2003;12: 1295S–1296S.
  25. 25. Hou X, Li Q, Huang T. Microarray analysis of gene expression profiles in the bovine mammary gland during lactation. Sci China Life Sci. 2010;53: 248–256. pmid:20596835
  26. 26. Li RW, Meyer MJ, Van Tassell CP, Sonstegard TS, Connor EE, Van Amburgh ME, et al. Identification of estrogen-responsive genes in the parenchyma and fat pad of the bovine mammary gland by microarray analysis. Physiol Genomics. 2006;27: 42–53. pmid:16788005
  27. 27. Broccolo F, Maran V, Oggioni M, Matteoli B, Greppi G, Ceccherini-Nelli L, et al. Development and validation of a dedicated microarray for the evaluation of bovine mammary gland health status and milk quality. Mol Biotechnol. 2013;54: 818–828. pmid:23224854
  28. 28. Kelsey JA, Nudda A, Corato A, Mosley EE, Mosley S, Williams BK, et al. Assessing changes in mammary gland gene expression using a cDNA microarray in the dairy cow following administration of bovine somatotropin. J Dairy Sci. 2005;83: 287–287.
  29. 29. Cui X, Hou Y, Yang S, Xie Y, Zhang S, Zhang Y, et al. Transcriptional profiling of mammary gland in Holstein cows with extremely different milk protein and fat percentage using RNA sequencing. BMC Genomics. 2014;15: 226. pmid:24655368
  30. 30. Paten AM, Duncan EJ, Pain SJ, Peterson SW, Kenyon PR, Blair HT, et al. Functional development of the adult ovine mammary gland—insights from gene expression profiling. BMC Genomics. 2015;16: 748. pmid:26437771
  31. 31. Chopra-Dewasthaly R, Korb M, Brunthaler R, Ertl R. Comprehensive RNA-Seq Profiling to Evaluate the Sheep Mammary Gland Transcriptome in Response to Experimental Mycoplasma agalactiae Infection. PLoS One. 2017;12: e0170015. pmid:28081235
  32. 32. Lan D, Xiong X, Wei Y, Xu T, Zhong J, Zhi X, et al. RNA-Seq analysis of yak ovary: improving yak gene structure information and mining reproduction-related genes. Sci China Life Sci. 2014;57: 925–935. pmid:24907937
  33. 33. Jäger M, Ott CE, Grünhagen J, Hecht J, Schell H, Mundlos S, et al. Composite transcriptome assembly of RNA-seq data in a sheep model for delayed bone healing. BMC Genomics. 2011;12: 158. pmid:21435219
  34. 34. Yang M, Ouyang J, Qiao R, Li P, Li W, Ren J, et al. A global view of porcine transcriptome in three tissues from a full-sib pair with extreme phenotypes in growth and fat deposition by paired-end RNA sequencing. BMC Genomics. 2011;12: 448. pmid:21906321
  35. 35. Wickramasinghe S, Rincon G, Islastrejo A, Medrano JF. Transcriptional profiling of bovine milk using RNA sequencing. BMC Genomics. 2012;13: 45. pmid:22276848
  36. 36. Zhao B, Tumaneng K, Guan K-L. The Hippo pathway in organ size control, tissue regeneration and stem cell self-renewal. Nat Cell Biol. 2011;13: 877–883. pmid:21808241
  37. 37. Zheng T, Wang J, Jiang H, Liu L. Hippo signaling in oval cells and hepatocarcinogenesis. Cancer Lett. 2011;302: 91–99. pmid:21247686
  38. 38. Polesello C, Tapon N. Salvador-warts-hippo signaling promotes Drosophila posterior follicle cell maturation downstream of notch. Curr Biol. 2007;17: 1864–1870. pmid:17964162
  39. 39. Sun S, Zhao S, Wang Z. Genes of Hippo signaling network act unconventionally in the control of germline proliferation in Drosophila. Dev Dyn. 2008;237: 270–275. pmid:18095349
  40. 40. Zhao B, Li LQ, Guan KL. The Hippo-YAP pathway in organ size control and tumorigenesis: an updated version. Genes Dev. 2010;24: 862–874. pmid:20439427
  41. 41. Hall CA, Wang R, Miao J, Oliva E, Shen X, Wheeler T, et al. Hippo pathway effector yap is an ovarian cancer oncogene. Cancer Res. 2010;70: 8517. pmid:20947521
  42. 42. Bartucci M, Dattilo R, Moriconi C, Pagliuca A, Mottolese M, Federici G, et al. TAZ is required for metastatic activity and chemoresistance of breast cancer stem cells. Oncogene. 2015;34: 681–690. pmid:24531710
  43. 43. Wang X, Su L, Ou Q. Yes-associated protein promotes tumour development in luminal epithelial derived breast cancer. Eur J Cancer. 2012;48: 1227–1234. pmid:22056638
  44. 44. Yuan M, Tomlinson V, Lara R, Holliday D, Chelala C, Harada T, et al. Yes-associated protein (YAP) functions as a tumor suppressor in breast. Cell Death Differ. 2008;15: 1752–1759. pmid:18617895
  45. 45. Chen Q, Zhang N, Gray RS, Li H, Ewald AJ, Zahnow CA, et al. A temporal requirement for Hippo signaling in mammary gland differentiation, growth, and tumorigenesis. Genes Dev. 2014;28: 432–437. pmid:24589775
  46. 46. Neville MC, Webb P, Ramanathan P, Mannino MP, Pecorini C, Monks J, et al. The insulin receptor plays an important role in secretory differentiation in the mammary gland. Am J Physiol Endocrinol Metab. 2013;305: E1103–1114. pmid:23982156
  47. 47. Cohick WS. Physiology and endocrinology sympusium: effects of insulin on mammary gland differentiation during pregnancy and lactation. J Anim Sci. 2016;94: 1812–1820. pmid:27285678
  48. 48. Choudhary R, Pathank D, Choudhary S, Verma R. Expression of steriod receptors in goat mammary glands. In: Dairy Industry Conference. 2016.
  49. 49. Wang X, Proud CG. The mTOR pathway in the control of protein synthesis. Physiology. 2006;21: 362–369. pmid:16990457
  50. 50. Rius AG, Appuhamy JA, Cyriac J, Kirovski D, Becvar O, Escobar J, et al. Regulation of protein synthesis in mammary glands of lactating dairy cows by starch and amino acids. J Dairy Sci. 2010;93: 3114–3127. pmid:20630229
  51. 51. Appuhamy JA, Knoebel NA, Nayananjalie WA, Escobar J, Hanigan MD. Isoleucine and leucine independently regulate mTOR signaling and protein synthesis in MAC-T cells and bovine mammary tissue slices. J Nutr. 2012;142: 484–491. pmid:22298573
  52. 52. Burgos SA, Cant JP. IGF-1 stimulates protein synthesis by enhanced signaling through mTORC1 in bovine mammary epithelial cells. Domest Anim Endocrinol. 2010;38: 211–221. pmid:20163929
  53. 53. Massimo B, Loor JJ. Gene networks driving bovine mammary protein synthesis during the lactation cycle. Bioinform Biol Insights. 2011;5: 83. pmid:21698073
  54. 54. Prizant RL, Barash I. Negative effects of the amino acids Lys, His, and Thr on S6K1 phosphorylation in mammary epithelial cells. J Cell Biochem. 2008;105: 1038–1047. pmid:18767117
  55. 55. Toerien CA, Trout DR, Cant JP. Nutritional stimulation of milk protein yield of cows is associated with changes in phosphorylation of mammary eukaryotic initiation factor 2 and ribosomal s6 kinase 1. J Nutr. 2010;140: 285–292. pmid:20032484