Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

In-Depth Duodenal Transcriptome Survey in Chickens with Divergent Feed Efficiency Using RNA-Seq

  • Guoqiang Yi ,

    Contributed equally to this work with: Guoqiang Yi, Jingwei Yuan, Huijuan Bi

    Affiliation National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China

  • Jingwei Yuan ,

    Contributed equally to this work with: Guoqiang Yi, Jingwei Yuan, Huijuan Bi

    Affiliation National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China

  • Huijuan Bi ,

    Contributed equally to this work with: Guoqiang Yi, Jingwei Yuan, Huijuan Bi

    Affiliation National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China

  • Wei Yan,

    Affiliation National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China

  • Ning Yang,

    Affiliation National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China

  • Lujiang Qu

    Affiliation National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China

In-Depth Duodenal Transcriptome Survey in Chickens with Divergent Feed Efficiency Using RNA-Seq

  • Guoqiang Yi, 
  • Jingwei Yuan, 
  • Huijuan Bi, 
  • Wei Yan, 
  • Ning Yang, 
  • Lujiang Qu


Since the feed cost is a major determinant of profitability in poultry industry, how to improve feed efficiency through genetic selection is an intriguing subject for breeders and producers. As a more suitable indicator assessing feed efficiency, residual feed intake (RFI) is defined as the difference between observed and expected feed intake based on maintenance and growth. However, the genetic mechanisms responsible for RFI in chickens are still less well appreciated. In this study, we investigated the duodenal transcriptome architecture of extreme RFI phenotypes in the six brown-egg dwarf hens (three per group) using RNA sequencing technology. Among all mapped reads, an average of 75.62% fell into annotated exons, 5.50% were located in introns, and the remaining 18.88% were assigned to intergenic regions. In total, we identified 41 promising candidate genes by differential expression analysis between the low and high RFI groups. Furthermore, qRT-PCR assays were designed for 10 randomly chosen genes, and nine (90.00%) were successfully validated. Functional annotation analyses revealed that these significant genes belong to several specific biological functions related to digestibility, metabolism and biosynthesis processes as well as energy homeostasis. We also predicted 253 intergenic coding transcripts, and these transcripts were mainly involved in fundamental biological regulation and metabolism processes. Our findings provided a pioneering exploration of biological basis underlying divergent RFI using RNA-Seq, which pinpoints promising candidate genes of functional relevance, is helpful to guide future breeding strategies to optimize feed efficiency and assists in improving the current gene annotation in chickens.


Chicken meat and egg products continue to be an important source of nutrition for most people around the world. In the past decades, many yield-related traits in chickens have been greatly improved to meet the ever-increasing global demand [1, 2]. Currently, certain traits such as daily weight gain, total egg number and age at first egg have come close to their selection limits in nature due to long-term artificial selection. Meanwhile, feed prices would likely contribute to a substantial increase although feed has accounted for more than 60% of the total production cost [3, 4]. The increasing cost with no further increase in production kept pressure for us to investigate how to improve feed efficiency. In this sense, breeding more efficient chickens would mean big savings and enhance the profitability for breeders and producers.

Two major assessment criteria for feed efficiency are feed conversion ratio (FCR) and residual feed intake (RFI), respectively. FCR is defined as the amount of feed consumed per unit of egg weight for layers, and is not a desirable measurement for several statistical and biological reasons [46]. Thus, an alternative concept RFI was proposed and calculated as the difference between observed feed intake and the expected feed requirement based on maintenance and growth [5, 7, 8]. RFI may be a more suitable strategy evaluating feed efficiency due to its phenotypic independence in relation to growth and production traits used in its estimation [3]. It should be noted that RFI shows moderate to high heritability, indicating that genetic improvement could be accelerated by exploring associated genes and markers to be used in molecular breeding. Furthermore, several previous studies demonstrated that selection for low RFI (superior feed efficiency) may lead to lower the production cost and environmental nitrogen pollution in chickens and other livestock [911]. Therefore, pursuing the potential functional genes and genetic markers underlying RFI is an intriguing issue.

Currently, several previous studies have unveiled some candidate quantitative trait loci (QTL) involved in RFI through association and linkage analyses [1215], but these genetic evidence is still not enough. Furthermore, all these work started from the genome-scale perspective. Considering that divergent RFI performances should result from different expression levels of related genes, so monitoring the transcriptome changes in chickens with extreme RFI would offer a new opportunity to decipher its underlying mechanisms. In recent years, RNA sequencing (RNA-Seq) technology has emerged as a powerful and revolutionary approach to quantify gene expression levels and survey detailed transcriptome profiling at unprecedented resolution and sensitivity [16, 17]. Compared with microarray platform, RNA-Seq has several clear advantages such as a wider dynamic range of expression levels, higher accuracy and reproducibility, lower background noise and ability to detect novel transcripts [16, 18, 19]. Moreover, RNA-Seq method has attracted considerable interest and received great success concerning many economic traits in livestock [2024]. Hence, applying RNA-Seq to dig out involved functional genes would serve as a great complement to traditional genomic methods.

In order to identify causal genes modulating RFI performance and get a closer insight in transcriptome architecture in chickens, we conducted a global transcriptome profiling including differential expression analysis, novel transcript prediction and functional annotation based on high-quality RNA-Seq data from duodenal epithelial tissues. Our findings will allow a better understanding of the underlying mechanisms implicated in RFI, contribute to breeding more efficient chickens by genetic improvement and help to optimize the current chicken gene model.

Materials and Methods

Ethics statement

The whole protocols and procedures involving animals were performed in accordance with the Guidelines for the Care and Use of Experimental Animals established by the Ministry of Agriculture of China (Beijing, China). All animal work was approved by the Animal Welfare Committee of China Agricultural University (permit number: SYXK 2007–0023). Before tissue sampling, birds were humanely sacrificed by cervical dislocation. All efforts were made to minimize their suffering.

Sample selection and tissue harvest

A pure line of brown-egg dwarf layers (DW), maintained and selected mainly for egg production for over 10 years in the Poultry Genetic Resource and Breeding Experimental Unit of China Agricultural University [25], was used in this study. At 28 wk of age, a total of 252 hens were randomly selected and transferred to individual cages with intelligent system for recording individual feed intake (FI) and egg mass production (EM). These hens were kept under the 16L:8D light regimen and raised in the same environment with feed and water ad libitum. The FI and EM were measured at two independent stages in which the first one was from 32 to 44 wk of age and the second was from 57 to 60 wk of age. The individual body weight (BW) was surveyed at the start and end of each stage to calculate the mean BW (MBW), metabolic BW (MBW0.75) and daily BW gain (BWG). The residual feed intake (RFI) index was estimated with the model as follows: where RFI = residual feed intake, FI = daily feed intake, MBW0.75 = metabolic body weight, BWG = daily body weight gain, EMD = daily egg mass (adjusted for abnormal eggs), b0 = the intercept, and b1, b2, b3 = partial regression coefficients. The RFI estimates for each stage were calculated with the linear model fit function (lm) implemented in R.

We preferred those samples with extreme RFI phenotypes in a consistent pattern at two experimental stages, considering that the desired birds should show stable performances in both the early and late periods. The average RFI rank in two stages was used to prioritize samples because the mean was subject to outliers or extreme values. At the end of the whole experimental period (61 wk of age), we selected six samples consisting of two groups (three biological replicates per group) to represent two distinct RFI performances. In particular, besides that FI and RFI were significantly lower in the low RFI group, almost all phenotypes in both groups were similar. Table 1 details the measurements of RFI and its component traits at the two stages. The heritability estimates of RFI in this population are close to 0.30, though the larger sample size should be required to increase the reliability. For RNA isolation, duodenal epithelial tissues as a major part of the digestive system were harvested immediately from postmortem samples, frozen in liquid nitrogen and then stored at -80°C until further processing.

Table 1. Descriptive statistics of feed efficiency and relevant traits.

RNA extraction, library preparation and sequencing

Total RNA was isolated using TRIzol reagent (Invitrogen, USA) after grinding the frozen duodenal sample into fine powder under liquid nitrogen environment. The quality and quantity of RNA were monitored by 1% agarose gels, NanoPhotometer spectrophotometer (Implen, CA, USA) and Qubit 2.0 Flurometer (Life Technologies, CA, USA). For the eligible samples, the RNA integrity number (RIN, a score from 0 to 10) was accessed using Agilent Bioanalyzer 2100 system (Agilent Technologies, CA, USA). Only RNA samples with RIN larger than seven were used for cDNA library construction. For each sample, library with about 200 bp insert size was prepared with TruSeq RNA Sample Prep Kit v2 (Illumina, San Diego, CA, USA), and then was subjected to 2 × 100 bp paired-end (PE100) sequencing on a HiSeq 2000 instrument (Illumina). All six samples were sequenced on one lane. The raw sequence data from this article is publicly available in the NCBI Short Reads Archive (SRA) with accession number SRP055561 (BioProject number: PRJNA276492). The experiment accessions for the six chickens are SRX892810-SRX892815.

Differential expression analysis

For ensuring high-quality data, we removed low-quality reads and reads containing adapter contamination or at least 10 Ns from raw data (FASTQ format) using in-house Perl scripts. Prior to downstream analyses, the overall quality of clean data was further examined using FastQC v0.11.2 ( The Galgal4 reference assembly (FASTA format) and annotated gene model (GTF format) were downloaded from Ensembl database ( For each library, we estimated the actual insert size distribution after indexing the reference genome using the Bowtie2 v2.2.3 with default parameters [26]. After that, the mean read insert size and corresponding standard deviation (SD) as well as 10 maximum multiple hits (—max-multihits = 10) were used for TopHat2 v2.0.12, to improve the accuracy of reads mapping and expression analysis [27, 28]. All other parameters were set to the default values. The distribution of mapped reads over exons, introns and intergenic regions was determined using the BEDTools suite [29].

Based on resulting alignment and Ensembl annotation files, gene-level read counts were enumerated using HTSeq v0.6.1 Python tool with the default “union” mode [30]. To enhance the statistical power for identifying differentially expressed genes (DEGs), we removed those genes with weak expression levels using the HTSFilter package [31]. The DESeq2 package [32] was employed to distinguish DEGs between the low and high RFI groups. DESeq2 first used empirical Bayes shrinkage method to estimate dispersions and fold changes by modeling read counts as following a Negative Binominal distribution. And then the Wald test P-value was inferred to evaluate the statistical significance. The derived P-values were adjusted for multiple testing using Benjamini-Hochberg method [33], in order to control the false discovery rate (FDR) due to numerous tested genes in a typical RNA-Seq dataset. Finally, the DEGs were declared at a significant level of |log2 (fold change)| > 0.585, raw P-value < 0.01 and FDR < 0.05. After that, we downloaded the latest chicken QTL database (, Release 26) [34], and compared putative DEGs with the reported QTLs associated with feed efficiency traits (FI, FCR and RFI). It should be note that the three traits should share similar genetic basis due to the strong genetic correlations among them.

Quantitative RT-PCR confirmations

To confirm our differential expression results, we conducted quantitative reverse transcription PCR (qRT-PCR) assays for 10 randomly selected DEGs in the same RNA samples used for RNA-Seq. The total RNA was used for first-strand cDNA synthesis using EasyScript cDNA Synthesis Super Mix kit (TransGen Biotech, Beijing, China) according to standard procedures. The full cDNA sequence for each gene was downloaded from NCBI database, and corresponding primers were designed using Primer5.0 software. Prior to qRT-PCR validation, we accessed the primer quality using an 8-point standard curve in triplicate to ensure the similar amplification efficiencies between target and control primers. All qRT-PCR reactions were conducted in triplicate on the ABI Prism 7500 sequence detection system (Applied Biosystems group) using SYBR green chemistry. The thermal cycle conditions were as follows: 1 cycle of pre-incubation at 50°C for 2 min and 95°C for 10 min, 40 cycles of amplification (95°C for 10 s and 60°C for 1 min). Relative gene expressions of DEGs were calculated using the 2-ΔCt method, with the housekeeping gene GAPDH serving as internal control. To compare with the sequencing-based results, we converted the mean 2-ΔCt value for each group to fold change by dividing it by the mean value for the control. For evaluating the concordance between predicted and observed expression levels of DEGs, regression analysis was conducted with the linear model fit function (lm) implemented in R.

Prediction and characterization of intergenic transcripts

To investigate and classify the novel transcript patterns, we performed a global transcriptome profiling. The aligned reads were assembled into transcripts based on reference-guided assembly strategy implemented in Cufflinks suite v2.2.1 [35, 36]. The resulting individual annotation file was compared with the Ensembl annotation model using the Cuffcompare option to capture both native and novel transcripts. For analyzing those unknown intergenic transcripts (“u”-labeled transcripts), Cuffmerge was first used to merge the assemblies from all samples. And then the merged transcript assembly was regard as input for Cuffdiff2 to estimate transcript abundances. Due to the potential presence of the assembly artifacts, unspliced pre-mRNAs and possible DNA contamination, we only kept transcripts with total length between 200 bp and 20 kb and harboring at least two exons. In addition, all transcripts with fragments per kilobase of exon per million mapped reads (FPKM) > 1 and its 95% confidence interval lower boundary > 0 were included to eliminate lowly expressed transcripts which were generally considered to be transcriptional noise. Finally, only transcripts located at least 500 bp away from any known genes remained considering that these sequences might be extended exons of known genes. The sequences of eligible transcripts were subsequently extracted by gffread and fed to Coding Potential Calculator (CPC) to predict their coding potential [37]. To ensure high level of reliability, the transcript with CPC score > 1 was classified as protein-coding candidate, and < -1 was non-coding.

Functional enrichment and annotation analyses

To gain insight into the biological functions of DEGs, the enriched Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were determined using GOSeq R package designed to correct for gene length bias [38]. The functional group with adjusted P-value < 0.05 and at least two DEGs in the background terms was considered significantly overrepresented. In addition, we performed functional annotation for these putative coding transcripts using Blast2GO v2.8.0 tool based on similarity searches and existing annotation associations [39]. These sequences were first blasted against the NCBI non-redundant database using BLASTX option with an E-value threshold of 0.001 and a maximum of 20 hits. The output in XML format was mapped to GO database and assigned to different functional categories. Subsequently, InterProScan annotation, ANNEX modification and GO-Slim reduction were conducted to refine the functional annotations. All steps were carried out at the default settings recommended by Blast2GO.


Overall assessment for mapping statistics

The RNA-Seq of six duodenal epithelial samples yielded around 513.8 million of raw 100-bp paired-end reads. After quality filtering, each sample remained approximately 8.1 gigabases (Gb) high-quality sequence data, ranging from 7.1 to 9.5 Gb. Using TopHat2 aligner, more than 86.06% of clean reads per sample were mapped back to the Galgal4 assembly. Almost 94.08–95.88% reads were aligned in a unique manner, while 4.12–5.92% as multiple-mapped reads. The detailed information of data quality and mapping statistics is presented in Table 2. Among all mapped reads, the vast majority of which (73.79–78.20%) fell into annotated exons, 16.27–20.59% was within the large intergenic territory, and only 4.85–5.98% was located in introns (Fig 1).

Fig 1. The percentage of reads mapped to exonic, intronic and intergenic regions.

Table 2. Summary statistics for sequence quality and alignment information of six samples.

Differential expression profiling

As a preliminary, we used HTSeq to determine the number of aligned reads per gene across all samples. According to the defined counting criterion, 75.13–77.79% of mapped reads were successfully matched to known gene model, and the remaining 22.21–24.87% were classified as “ambiguous” (reads which assigned to multiple genomic features) or “no feature” (reads which could not be assigned to any genomic feature). For enhancing the statistical power, weakly expressed genes were first filtered out according to derived Jaccard similarity index from HTSFilter package. Finally, those genes with normalized expression levels less than proposed threshold 6.473 in all six samples were removed, resulting in a total of 13,235 (77.36%) genes to be fed to DESeq2 for subsequent differential analysis.

In total, we detected 41 significant differentially expressed genes (DEGs) in response to divergent RFI based on aforementioned cutoffs. Of these putative genes, 21 were down-regulated in the low RFI group and the other 20 were up-regulated in the same group (Table 3, Fig 2). Moreover, we found that only five DEGs are located in seven previously reported QTL regions associated with feed efficiency traits (Table 3). To validate the accuracy of our predictions, 10 DEGs were randomly selected for qRT-PCR assays using the same RNA samples used for RNA-Seq. Primer sequences and validation results are listed in S1 Table. The comparative results of the fold changes predicted by RNA-Seq and qRT-PCR were displayed in Fig 3. For 10 chosen DEGs, nine showed the concordant expression patterns between RNA-Seq and qRT-PCR results (Fig 3A). After excluding the only one gene with opposite expression level, the computational and experimental fold changes in our study also showed a strong positive correlation with R2 = 0.9449 (Fig 3B).

Fig 2. Volcano plot reporting P values against fold changes.

The Volcano plot indicates-log10 (P-value) for genome-wide genes (Y-axis) plotted against their respective log2 (fold change) (X-axis). The red and blue dots represent significantly up- and down-regulated genes between the low and high residual feed intake groups respectively.

Fig 3. Illustrating of qRT-PCR confirmation results for 10 selected differentially expressed genes.

(A) X-axis represents 10 selected genes for qRT-PCR assays and Y-axis represents the log2 (fold change) derived from RNA-Seq and qRT-PCR. (B) Regression analysis of the log2 (fold change) values between RNA-Seq and qRT-PCR.

Table 3. Detailed information of differentially expressed genes responsible for divergent RFI.

Functional annotation of differential expressed genes

To investigate the associated functional categories of the 41 most significant genes, enriched GO terms and pathways were determined by the GOSeq package. It should be note that no one GO term or pathway remained statistically significant after Benjamini-Hochberg correction, likely due to incomplete gene annotation information in chickens. Therefore, we kept categories with an unadjusted threshold of P-values < 0.05 and at least two DEGs in the background terms to assess the potential functions. Finally, we identified 17 plausible GO terms which are mainly involved in organic acid biosynthetic process, carboxylic acid biosynthetic process, small molecule biosynthetic process, carboxylic acid metabolic process, single-organism biosynthetic process and lipid metabolic process (S2 Table). The KEGG pathway analysis revealed six overrepresented pathways, including steroid biosynthesis, p53 signaling pathway, glycerophospholipid metabolism, VEGF signaling pathway, phosphatidylinositol signaling system and metabolic pathways (S3 Table).

Landscape of intergenic transcripts

Considering that a high percentage (18.88%) of total mapped reads were assigned to intergenic regions, identifying and characterizing these unknown transcripts would be beneficial to improve current gene model. A total of 36,513–39,527 transcripts per sample were assembled from Cufflinks software, of which 43.09–46.72% were predicted to have a complete match with the annotated intron chain, 25.27–27.68% were potentially novel isoforms of known genes and 15.67–17.21% may involve the novel intergenic transcripts. A summary about transcripts classified into different classes is shown in Table 4. To survey the architecture of intergenic-expressed regions, all six samples were merged using Cuffmerge command, resulting in a total of 9,796 non-redundant and novel intergenic transcripts. After strict quality assurance procedures, a total of 472 qualified transcripts were included into the downstream analyses. According to the putative CPC scores of analyzed transcripts, 38 were predicted as transcripts with no coding ability and 253 were classified as protein-coding transcripts. It should be note that a majority of coding transcripts (156 out of 253, 61.66%) were located in those unknown contigs while 38.34% were assigned to anchored chromosomes.

Table 4. Summary of transcripts assembled (TA) with Cufflinks in each sample.

Potential functional roles for coding transcripts

The list of all 253 coding transcripts arising from intergenic regions was analyzed using Blast2GO tools to provide insight into their potential biological functions. Out of these transcripts, 163 could be assigned at least one GO term, generating a total of 1,375 GO classifications (S4 Table). All these transcripts were grouped into 30 GO functional categories at level 2, which were distributed under the three main categories of biological process (BP, 16), molecular function (MF, 8), and cellular components (CC, 6) (Fig 4). Within the BP category, cellular process (17.08%) was the most dominant group, followed by metabolic process (14.59%) and single-organism process (12.99%). Two sub-categories of binding (42.02%) and catalytic activity (32.45%) were enriched in MF group. Regarding CC category, there were three highly represented clusters including cell (34.47%), organelle (29.35%) and macromolecular complex (19.11%) compared to other three sub-categories.

Fig 4. Histogram presentation of gene ontology (GO) term for putative coding transcripts.

The GO terms were classified into different categories at level 2.


Recently, the increasing feed costs urge us to breed more efficient chickens through genetic improvement for profit maximization. Despite that several QTLs associated with RFI as a measure of feed efficiency have been identified, further refined exploration at the gene level is still required. To elucidate the genetic architecture underlying RFI, we provided a pioneering and comprehensive transcriptome profiling based on six chickens with extreme RFI performances. Our findings not only unearth many promising candidate genes implicated in RFI, but also gain new insight into their biological effects on feed efficiency. Evaluation of genetic merit based on functional genes would accelerate the genetic improvement of efficient chickens in the foreseeable future. In addition, assessing the global transcriptome landscape and annotating novel intergenic transcripts would assist in discovering new gene structures and improve current gene models.

The current RNA-Seq work provided greater sequence depth and obtained higher proportions of mapped reads than several previous chicken transcriptome studies ranging from 64.00 to 85.00% [4043]. The high-quality sequences and superior mapping rates enabled the accuracy and reliability of further differential expression analysis. Despite that we enhanced the detection power of DEGs recommended by a previous study [28], the number (n = 41) of DEGs was still not high. The value is very close to a recent chicken RNA-Seq result (n = 40) [24], but is lower than another RNA-Seq experiment in chickens (n = 164) [40]. Firstly, the experimental population is a pure line of brown-egg dwarf layers with lower genetic variation at a global level. The similar genetic basis between two divergent conditions may cause concordant expression signals for most genes, and reduce the number of DEGs occurring at random [24]. Moreover, the number of DEGs was also greatly influenced by different detection algorithms and biological replicates [4446]. Compared with QuasiSeq and DESeq used in the two aforementioned papers, DESeq2 (successor of DESeq) method provides greater inferential power in a typical RNA-Seq experiment with small replicate numbers [32, 45, 47].

Currently, the chicken QTL database deposited only 37 QTL regions associated with feed efficiency traits [34], and most of these QTLs suffered from wide confidence intervals covering dozens of genes or variants. This study is the first report for identification of functional determinants involved in RFI at the gene level by RNA-Seq in chickens. However, of particular interest is the poor concordance between DEGs and reported QTLs, which is in agreement with a previous study in chicken [48]. This outcome suggested that feed efficiency traits may be controlled by diverse QTLs or genes in different breeds, and pursuing the genetic evidence of feed efficiency by multiple methods and different populations is extremely essential.

To confirm the putative results from RNA-Seq, we randomly selected a subset of DEGs for qRT-PCR assays. Overall, there was excellent agreement and high concordance between the computational and experimental results, which was similar to some previous results in animals [23, 24, 40] and revealed good detection sensitivity and accuracy. After functional enrichment analyses, most GO terms and KEGG pathways were mainly involved in small molecule biosynthetic and metabolism processes. The results were also in accordance with several previous studies in cattle and pigs [6, 49, 50], and indicated that all identified DEGs may play important roles in controlling RFI through affecting digestive and metabolic processes [51]. It should be noted that negative genetic correlations between digestive efficiency and three feed efficiency traits (RFI, FI and FCR) were found [48], suggesting that the stronger digestive and metabolic abilities could lead to greater nutrient availability and compensate the lower feed intake in the more efficient chickens.

Generally, the difference in RFI performance between individual chickens attributes to five major biological processes including feed intake, digestibility and associated energy costs, metabolism and stress, physical activity and thermoregulation [51, 52], meaning that putative genes involved in these processes could be regarded as promising candidates associated with RFI. Considering that it is too redundant to discuss all genes and several genes do not have clear function in chickens, we only select five representative genes with potential functional evidence in feed efficiency.

As the most significant gene archived in the NCBI database, angiotensin I converting enzyme (peptidyl-dipeptidase A) 1 (ACE) has been reported to be a key element of the renin-angiotensin system (RAS) which can influence body energy homeostasis, fat accumulation and glucose tolerance [53, 54]. Particularly, ACE gene plays an important role in converting the inactive decapeptide angiotensin I (AngI) into the bioactive octapeptide angiotensin II (Ang II). Some previous results have demonstrated that infusion of Ang II could lead to reduced feed intake and body weight in rats [5557]. In agreement with these studies, low RFI chickens consumed an average of 25 g less feed than their counterparts ranked as high RFI in the present work. In addition, another study revealed that homozygous ACE knockout mice had higher energy expenditure related to increased fatty acid metabolism in the liver compared with wild-type mice [58]. This result meant that less energy was used for growth and production in the same feed intake, which would result in higher RFI. Therefore, we speculated that the increased expression of ACE gene in the low RFI group may optimize the feed efficiency by reducing feed intake and/or energy expenditure.

Some biological pathways like lipid metabolism and cholesterol biosynthesis were identified to be associated with RFI [50, 59]. A previous study suggested that the gene encoding the radical S-adenosyl methionine domain containing 2 (RSAD2) could serve as a modulator of lipid content and affect the lipid to protein ratio in the liver [60]. The high expression level of RSAD2 was always found in the tissue with the lower fat deposition. Additionally, some results supported that several body fat traits together with serum leptin concentration were positively related to RFI performance [51, 61]. The up-regulation of RSAD2 in the low RFI group may lead to decreased feed intake, high energy utilization and few energy costs by modulating fatty acid and leptin metabolism. Furthermore, another two significantly differential genes, cytosolic calcium-dependent phospholipase A2, group IVA (PLA2G4A) and fatty acid hydroxylase domain containing 2 (FAXDC2), were suggested to be implicated in lipid metabolism, steroid biosynthesis and metabolic pathways [62, 63]. The expression alterations of the two genes may cause the difference in the digestive and metabolic abilities between the low and high RFI groups.

Oxidative stress response is also an important factor influencing RFI, because the procedure may be an energy-demanding process. Two previous studies indicated that high RFI individuals were susceptible to stress [64, 65]. As a member of the sestrin family, sestrin 3 gene (SESN3) is involved in the maintenance of physiological concentrations of reactive oxygen species, and participates in the oxidative stress pathway [66, 67]. Lower respond to environmental stressors may need fewer energy costs and show better feeding behavior, resulting higher feed efficiency. Overall, RFI performance is a complex physiological process and variation in RFI may represent numerous intrinsic factors. Although we have identified 41 promising candidate genes, further investigation by increasing sample size and integrating different algorithms is critical to elucidate the biological mechanisms behind RFI.

It should be noted that an average of 18.88% matched reads were mapped to intergenic areas, suggesting that the current gene annotation in the chicken genome still needs to be further improved to determine the structures and functions of novel genes [68]. During transcript assembly and coding potential prediction, we employed stringent quality management to exclude likely false positives, resulting in fewer transcripts compared with a previous study [42]. In fact, the 38 putative non-coding transcripts could be regarded as long intergenic non-coding RNA (lincRNA) based on our quality control procedure. The fewer lincRNAs may be due to the fact that our RNA-Seq libraries are based on poly(A)+ mRNAs selection protocol. In this sense, only the lincRNAs with poly(A) tails could be identified while a number of transcripts are known to lack a classical poly(A) tail [69]. Hence, to detect and characterize all lncRNAs in detail, the specific library preparation procedure with rRNA depletion to enrich for non-rRNAs must be required [70]. Most protein-coding transcripts were located in unknown genomic contigs, suggesting that these genomic sequences may contain more novel genes and need further annotation [68, 71]. The Blast2GO results demonstrated that a majority of coding transcripts were responsible for fundamental biological regulation and metabolism processes.


In summary, we conducted a comprehensive differential expression analysis and characterized global trancriptome architectures based on high-quality RNA-Seq data, and subsequently performed functional annotation for these putative associated genes and protein-coding transcripts. We identified a total of 41 differentially expressed genes associated with RFI. These promising genes play a critical role in digestibility, metabolism, stress response and energy homeostasis, hence resulting in divergent RFI performances. Among 10 randomly chosen genes, nine were successfully validated. We also discovered 253 intergenic coding transcripts, which may be from some unannotated genes. Our findings lay the foundation for comprehensive understanding of RFI, are beneficial to direct future breeding schemes improving feed efficiency and assist in optimizing the current gene models.

Supporting Information

S1 Table. Primers information and validation results of the 10 chosen differentially expressed genes by qRT-PCR analysis.


S2 Table. GO enrichment analysis of 41 differentially expressed genes associated with residual feed intake.


S3 Table. Summary of the KEGG analysis of 41 differentially expressed genes associated with residual feed intake.


S4 Table. Detailed Blast2GO information for putative coding transcripts.



We are grateful for animal breeding support from the team of the National Engineering Laboratory.

Author Contributions

Conceived and designed the experiments: LQ NY. Performed the experiments: HB. Analyzed the data: GY. Contributed reagents/materials/analysis tools: JY WY GY. Wrote the paper: GY LQ NY.


  1. 1. Liu W, Li D, Liu J, Chen S, Qu L, Zheng J, et al. A genome-wide SNP scan reveals novel Loci for egg production and quality traits in white leghorn and brown-egg dwarf layers. PLoS One. 2011; 6: e28600. pmid:22174844
  2. 2. Rekaya R, Sapp RL, Wing T, Aggrey SE. Genetic evaluation for growth, body composition, feed efficiency, and leg soundness. Poult Sci. 2013; 92: 923–929. pmid:23472015
  3. 3. Willems OW, Miller SP, Wood BJ. Assessment of residual body weight gain and residual intake and body weight gain as feed efficiency traits in the turkey (Meleagris gallopavo). Genet Sel Evol. 2013; 45: 26. pmid:23865507
  4. 4. Aggrey SE, Karnuah AB, Sebastian B, Anthony NB. Genetic properties of feed efficiency parameters in meat-type chickens. Genet Sel Evol. 2010; 42: 25. pmid:20584334
  5. 5. Aggrey SE, Rekaya R. Dissection of Koch's residual feed intake: implications for selection. Poult Sci. 2013; 92: 2600–2605. pmid:24046405
  6. 6. Do DN, Ostersen T, Strathe AB, Mark T, Jensen J, Kadarmideen HN. Genome-wide association and systems genetic analyses of residual feed intake, daily feed consumption, backfat and weight gain in pigs. BMC Genet. 2014; 15: 27. pmid:24533460
  7. 7. Koch RM, Swiger LA, Chambers D, Gregory KE. Efficiency of feed use in beef cattle. J Anim Sci. 1963; 22: 486–494.
  8. 8. Luiting P, Urff EM. Optimization of a model to estimate residual feed consumption in the laying hen. Livest Prod Sci. 1991; 27: 321–338.
  9. 9. Zhang W, Aggrey SE. Genetic variation in feed utilization efficiency of meat-type chickens. World's Poult Sci J. 2003; 59: 328–339.
  10. 10. de Verdal H, Narcy A, Bastianelli D, Chapuis H, Meme N, Urvoix S, et al. Improving the efficiency of feed utilization in poultry by selection. 2. Genetic parameters of excretion traits and correlations with anatomy of the gastro-intestinal tract and digestive efficiency. BMC Genet. 2011; 12: 71. pmid:21846409
  11. 11. Saintilan R, Merour I, Brossard L, Tribout T, Dourmad JY, Sellier P, et al. Genetics of residual feed intake in growing pigs: Relationships with production traits, and nitrogen and phosphorus excretion traits. J Anim Sci. 2013; 91: 2542–2554. pmid:23482579
  12. 12. De Koning DJ, Windsor D, Hocking PM, Burt DW, Law A, Haley CS, et al. Quantitative trait locus detection in commercial broiler lines using candidate regions. J Anim Sci. 2003; 81: 1158–1165. pmid:12772842
  13. 13. De Koning DJ, Haley CS, Windsor D, Hocking PM, Griffin H, Morris A, et al. Segregation of QTL for production traits in commercial meat-type chickens. Genet Res. 2004; 83: 211–220. pmid:15462414
  14. 14. Parsanejad R, Praslickova D, Zadworny D, Kuhnlein U. Ornithine decarboxylase: haplotype structure and trait associations in White Leghorn chickens. Poult Sci. 2004; 83: 1518–1523. pmid:15384901
  15. 15. Wolc A, Arango J, Jankowski T, Settar P, Fulton JE, O'Sullivan NP, et al. Pedigree and genomic analyses of feed consumption and residual feed intake in laying hens. Poult Sci. 2013; 92: 2270–2275. pmid:23960108
  16. 16. Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009; 10: 57–63. pmid:19015660
  17. 17. Ozsolak F, Milos PM. RNA sequencing: advances, challenges and opportunities. Nat Rev Genet. 2011; 12: 87–98. pmid:21191423
  18. 18. Nookaew I, Papini M, Pornputtapong N, Scalcinati G, Fagerberg L, Uhlen M, et al. A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae. Nucleic Acids Res. 2012; 40: 10084–10097. pmid:22965124
  19. 19. Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008; 18: 1509–1517. pmid:18550803
  20. 20. Kang X, Liu G, Liu Y, Xu Q, Zhang M, Fang M. Transcriptome profile at different physiological stages reveals potential mode for curly fleece in Chinese tan sheep. PLoS One. 2013; 8: e71763. pmid:23990983
  21. 21. Park KD, Park J, Ko J, Kim BC, Kim HS, Ahn K, et al. Whole transcriptome analyses of six thoroughbred horses before and after exercise using RNA-Seq. BMC Genomics. 2012; 13: 473. pmid:22971240
  22. 22. Corominas J, Ramayo-Caldas Y, Puig-Oliveras A, Estelle J, Castello A, Alves E, et al. Analysis of porcine adipose tissue transcriptome reveals differences in de novo fatty acid synthesis in pigs with divergent muscle fatty acid composition. BMC Genomics. 2013; 14: 843. pmid:24289474
  23. 23. Cui X, Hou Y, Yang S, Xie Y, Zhang S, Zhang Y, et al. Transcriptional profiling of mammary gland in Holstein cows with extremely different milk protein and fat percentage using RNA sequencing. BMC Genomics. 2014; 15: 226. pmid:24655368
  24. 24. Coble DJ, Fleming D, Persia ME, Ashwell CM, Rothschild MF, Schmidt CJ, et al. RNA-seq analysis of broiler liver transcriptome reveals novel responses to high ambient temperature. BMC Genomics. 2014; 15: 1084. pmid:25494716
  25. 25. Yi GQ, Liu WB, Li JY, Zheng JX, Qu LJ, Xu GY, et al. Genetic analysis for dynamic changes of egg weight in 2 chicken lines. Poult Sci. 2014; 93: 2963–2969. pmid:25306454
  26. 26. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9: 357–359. pmid:22388286
  27. 27. Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013; 14: R36. pmid:23618408
  28. 28. Odawara J, Harada A, Yoshimi T, Maehara K, Tachibana T, Okada S, et al. The classification of mRNA expression levels by the phosphorylation state of RNAPII CTD based on a combined genome-wide approach. BMC Genomics. 2011; 12: 516. pmid:22011111
  29. 29. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010; 26: 841–842. pmid:20110278
  30. 30. Anders S, Pyl PT, Huber W. HTSeq-a Python framework to work with high-throughput sequencing data. Bioinformatics. 2014.
  31. 31. Rau A, Gallopin M, Celeux G, Jaffrezic F. Data-based filtering for replicated high-throughput transcriptome sequencing experiments. Bioinformatics. 2013; 29: 2146–2152. pmid:23821648
  32. 32. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15: 550. pmid:25516281
  33. 33. Benjamin Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J R Stat Soc Ser B. 1995; 57: 289–300.
  34. 34. Hu ZL, Park CA, Wu XL, Reecy JM. Animal QTLdb: an improved database tool for livestock animal QTL/association data dissemination in the post-genome era. Nucleic Acids Res. 2013; 41: D871–879. pmid:23180796
  35. 35. Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010; 28: 511–515. pmid:20436464
  36. 36. Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 2012; 7: 562–578. pmid:22383036
  37. 37. Kong L, Zhang Y, Ye ZQ, Liu XQ, Zhao SQ, Wei L, et al. CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic Acids Res. 2007; 35: W345–349. pmid:17631615
  38. 38. Young MD, Wakefield MJ, Smyth GK, Oshlack A. Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol. 2010; 11: R14. pmid:20132535
  39. 39. Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005; 21: 3674–3676. pmid:16081474
  40. 40. Wang X, Yang L, Wang H, Shao F, Yu J, Jiang H, et al. Growth hormone-regulated mRNAs and miRNAs in chicken hepatocytes. PLoS One. 2014; 9: e112896. pmid:25386791
  41. 41. Schurch NJ, Cole C, Sherstnev A, Song J, Duc C, Storey KG, et al. Improved annotation of 3' untranslated regions and complex loci by combination of strand-specific direct RNA sequencing, RNA-Seq and ESTs. PLoS One. 2014; 9: e94270. pmid:24722185
  42. 42. Li T, Wang S, Wu R, Zhou X, Zhu D, Zhang Y. Identification of long non-protein coding RNAs in chicken skeletal muscle using next generation sequencing. Genomics. 2012; 99: 292–298. pmid:22374175
  43. 43. Thomas S, Underwood JG, Tseng E, Holloway AK. Long-read sequencing of chicken transcripts and identification of new transcript isoforms. PLoS One. 2014; 9: e94650. pmid:24736250
  44. 44. Liu Y, Zhou J, White KP. RNA-seq differential expression studies: more sequence or more replication? Bioinformatics. 2014; 30: 301–304. pmid:24319002
  45. 45. Seyednasrollah F, Laiho A, Elo LL. Comparison of software packages for detecting differential expression in RNA-seq studies. Brief Bioinform. 2013.
  46. 46. Rapaport F, Khanin R, Liang Y, Pirun M, Krek A, Zumbo P, et al. Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data. Genome Biol. 2013; 14: R95. pmid:24020486
  47. 47. Burden CJ, Qureshi SE, Wilson SR. Error estimates for the analysis of differential expression from RNA-seq count data. PeerJ. 2014; 2: e576. pmid:25337456
  48. 48. de Verdal H, Narcy A, Bastianelli D, Chapuis H, Meme N, Urvoix S, et al. Improving the efficiency of feed utilization in poultry by selection. 1. Genetic parameters of anatomy of the gastro-intestinal tract and digestive efficiency. BMC Genet. 2011; 12: 59. pmid:21733156
  49. 49. Chen Y, Gondro C, Quinn K, Herd RM, Parnell PF, Vanselow B. Global gene expression profiling reveals genes expressed differentially in cattle with high and low residual feed intake. Anim Genet. 2011; 42: 475–490. pmid:21906099
  50. 50. Karisa B, Moore S, Plastow G. Analysis of biological networks and biological pathways associated with residual feed intake in beef cattle. Anim Sci J. 2014; 85: 374–387. pmid:24373146
  51. 51. Herd RM, Arthur PF. Physiological basis for residual feed intake. J Anim Sci. 2009; 87: E64–71. pmid:19028857
  52. 52. Luiting P, Schrama JW, van der Hel W, Urff EM. Metabolic differences between White Leghorns selected for high and low residual food consumption. Br Poult Sci. 1991; 32: 763–782. pmid:1933447
  53. 53. Boustany CM, Bharadwaj K, Daugherty A, Brown DR, Randall DC, Cassis LA. Activation of the systemic and adipose renin-angiotensin system in rats with diet-induced obesity and hypertension. Am J Physiol Regul Integr Comp Physiol. 2004; 287: R943–949. pmid:15191907
  54. 54. Savary K, Michaud A, Favier J, Larger E, Corvol P, Gasc JM. Role of the renin-angiotensin system in primitive erythropoiesis in the chick embryo. Blood. 2005; 105: 103–110. pmid:15367438
  55. 55. Cassis LA, Marshall DE, Fettinger MJ, Rosenbluth B, Lodder RA. Mechanisms contributing to angiotensin II regulation of body weight. Am J Physiol. 1998; 274: E867–876. pmid:9612245
  56. 56. Brink M, Price SR, Chrast J, Bailey JL, Anwar A, Mitch WE, et al. Angiotensin II induces skeletal muscle wasting through enhanced protein degradation and down-regulates autocrine insulin-like growth factor I. Endocrinology. 2001; 142: 1489–1496. pmid:11250929
  57. 57. Weisinger RS, Blair-West JR, Burns P, Denton DA, Tarjan E. Role of brain angiotensin in thirst and sodium appetite of rats. Peptides. 1997; 18: 977–984. pmid:9357055
  58. 58. Jayasooriya AP, Mathai ML, Walker LL, Begg DP, Denton DA, Cameron-Smith D, et al. Mice lacking angiotensin-converting enzyme have increased energy expenditure, with reduced fat mass and improved glucose clearance. Proc Natl Acad Sci U S A. 2008; 105: 6531–6536. pmid:18443281
  59. 59. Nafikov RA, Beitz DC. Carbohydrate and lipid metabolism in farm animals. J Nutr. 2007; 137: 702–705. pmid:17311965
  60. 60. Dogan A, Lasch P, Neuschl C, Millrose MK, Alberts R, Schughart K, et al. ATR-FTIR spectroscopy reveals genomic loci regulating the tissue response in high fat diet fed BXD recombinant inbred mouse strains. BMC Genomics. 2013; 14: 386. pmid:23758785
  61. 61. Hoque MA, Katoh K, Suzuki K. Genetic associations of residual feed intake with serum insulin-like growth factor-I and leptin concentrations, meat quality, and carcass cross sectional fat area ratios in Duroc pigs. J Anim Sci. 2009; 87: 3069–3075. pmid:19465494
  62. 62. Hamill RM, Aslan O, Mullen AM, O'Doherty JV, McBryan J, Morris DG, et al. Transcriptome analysis of porcine M. semimembranosus divergent in intramuscular fat as a consequence of dietary protein restriction. BMC Genomics. 2013; 14: 453. pmid:23829541
  63. 63. Grzincic EM, Yang JA, Drnevich J, Falagan-Lotsch P, Murphy CJ. Global transcriptomic analysis of model human cell lines exposed to surface-modified gold nanoparticles: the effect of surface chemistry. Nanoscale. 2015; 7: 1349–1362. pmid:25491924
  64. 64. Iqbal M, Pumford NR, Tang ZX, Lassiter K, Wing T, Cooper M, et al. Low feed efficient broilers within a single genetic line exhibit higher oxidative stress and protein expression in breast muscle with lower mitochondrial complex activity. Poult Sci. 2004; 83: 474–484. pmid:15049502
  65. 65. Van Eerden E, Van Den Brand H, Parmentier HK, De Jong MC, Kemp B. Phenotypic selection for residual feed intake and its effect on humoral immune responses in growing layer hens. Poult Sci. 2004; 83: 1602–1609. pmid:15384913
  66. 66. Hagenbuchner J, Kuznetsov A, Hermann M, Hausott B, Obexer P, Ausserlechner MJ. FOXO3-induced reactive oxygen species are regulated by BCL2L11 (Bim) and SESN3. J Cell Sci. 2012; 125: 1191–1203. pmid:22349704
  67. 67. Hussong M, Borno ST, Kerick M, Wunderlich A, Franz A, Sultmann H, et al. The bromodomain protein BRD4 regulates the KEAP1/NRF2-dependent oxidative stress response. Cell Death Dis. 2014; 5: e1195. pmid:24763052
  68. 68. International Chicken Genome Sequencing Consortium. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004; 432: 695–716. pmid:15592404
  69. 69. Yang L, Duff MO, Graveley BR, Carmichael GG, Chen LL. Genomewide characterization of non-polyadenylated RNAs. Genome Biol. 2011; 12: R16. pmid:21324177
  70. 70. Sultan M, Amstislavskiy V, Risch T, Schuette M, Dokel S, Ralser M, et al. Influence of RNA extraction methods and library selection schemes on RNA-seq data. BMC Genomics. 2014; 15: 675. pmid:25113896
  71. 71. Groenen MA, Wahlberg P, Foglio M, Cheng HH, Megens HJ, Crooijmans RP, et al. A high-density SNP-based linkage map of the chicken genome reveals sequence features correlated with recombination rate. Genome Res. 2009; 19: 510–519. pmid:19088305