Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Systems biology approach identifies functional modules and regulatory hubs related to secondary metabolites accumulation after transition from autotrophic to heterotrophic growth condition in microalgae

  • Bahman Panahi ,

    Roles Conceptualization, Data curation, Formal analysis, Writing – original draft, Writing – review & editing

    b.panahi@abrii.ac.ir, panahibahman@ymail.com

    Affiliation Department of Genomics, Branch for Northwest & West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran

  • Mohammad Farhadian,

    Roles Data curation, Writing – review & editing

    Affiliation Department of Animal Science, Faculty of Agriculture, University of Tabriz, Tabriz, Iran

  • Mohammad Amin Hejazi

    Roles Writing – review & editing

    Affiliation Department of Food Biotechnology, Branch for Northwest & West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran

Abstract

Heterotrophic growth mode is among the most promising strategies put forth to overcome the low biomass and secondary metabolites productivity challenge. To shedding light on the underlying molecular mechanisms, transcriptome meta-analysis was integrated with weighted gene co-expression network analysis (WGCNA), connectivity analysis, functional enrichment, and hubs identification. Meta-analysis and Functional enrichment analysis demonstrated that most of the biological processes are up-regulated at heterotrophic growth condition, which leads to change of genetic architectures and phenotypic outcomes. WGNCA analysis of meta-genes also resulted four significant functional modules across logarithmic (LG), transition (TR), and production peak (PR) phases. The expression pattern and connectivity characteristics of the brown module as a non-preserved module vary across LG, TR, and PR phases. Functional analysis identified Carotenoid biosynthesis, Fatty acid metabolism and Methane metabolism as enriched pathways in the non-preserved module. Our integrated approach was applied here, identified some hubs, such as a serine hydroxymethyltransferase (SHMT1), which is the best candidate for development of metabolites accumulating strains in microalgae. Current study provided a new insight into underlying metabolite accumulation mechanisms and opens new avenue for the future applied studies in the microalgae field.

Introduction

Microalgae have become attractive sources for metabolites, such as lipids, enzymes, polymers, toxins, and pigments production. These components can be utilized as adjuvant drugs, dietary supplements, seafood baits, cosmetics, tertiary wastewater treatment, and “green energy” [1]. Productions of above-mentioned metabolites have been optimized using the cultivation of microalgae on diverse media [2,3]. Microalgae are largely photosynthetic and cultivated in illuminated environments. However, biomass productivity and titers in photoautotrophic cultivation range from 0.055 to 0.061 g/ L/day at the laboratory scale. It is much lower for the industrial scale, and cannot meet demands of the global market [4].

To overcome the poor biomass accumulation, open ponds that mimic natural environments of the microalgae are alternative option for the most commercial microalgae [5]. These cultivation systems present relatively lower construction and operating costs and large ones can be constructed on nonagricultural lands [6]. Nevertheless, these cultivation systems have several disadvantages such as expensiveness in the harvesting step, continuous need to clean water; unfeasibility for the secondary metabolite production; and susceptibility to environmental conditions [6]. To control environmental parameters during cultivation, closed photo-bioreactors have been designed [7]. However, inefficient light dispersion and fouls biofilm development in long-term cultures, and high initial investment make them uneconomical for low-cost end-products [8].

Heterotrophic cultures in closed bioreactors are among the most promising strategies put forth to overcome this challenge. Heterotrophy is defined as the use of organic carbon such as acetate and glucose instead of CO2 as substrate and energy sources [9]. The cost-effectiveness, relative simplicity of operations and daily maintenance are main advantages of the heterotrophic cultures. It has been shown that the heterotrophic cultivation improves the production yield significantly and provides a feasible approach for metabolites production at industrial level [10].

Differential accumulation patterns of lipids and carotenoids after the transition from photoautotrophic to heterotrophic cultures have been documented in the large body of research [4]. However, little is known about underlying metabolite accumulations mechanisms at different culture conditions in microalgae.

With the development of high-throughput transcriptome sequencing technologies, the genes involved in the metabolite biosynthesis in microalgae have been identified [11].

However, previous studies in this field have been typically focused on identifying differentially expressed genes and did not consider the degree of interconnection between genes, where genes with similar expression patterns may be functionally important [12]. Importance of the network-based approaches to elucidate transcriptional circuits of metabolic processes [13] and discovery of key regulators under different environmental conditions [14] have been highlighted. Co-expression analysis is based on the ‘guilt-by-association’ paradigm, which stipulates two genes displaying correlated expression patterns across different conditions [15]. To the best of our knowledge, co-expression analysis has not been yet applied in microalga.

In this study, we integrated meta-analysis with weighted gene co-expression network to identify functionally enriched pathways and network-centric genes associated with secondary metabolites production after transition to the heterotrophic growth condition in Chlorella microalgae.

Materials and methods

Data collection

European Nucleotide Archive (ENA) database was used as a source of RNA-seq data collection. Datasets with biological samples for both autotrophic- and heterotrophic growth conditions were collected. The datasets belonged to Auxenochlorella protothecoides. The first dataset (PRJNA289168) contains 3 samples of A. protothecoides grown in photoautotrophic condition (in Bristol's salts plus 0.1% (w/v) proteose peptone) and 9 samples of heterotrophic cultures (Bristol's salts plus 12.5% (v/v) Rainbow papaya juice). The photoperiod was 24 h: 0 h (light: dark).The second data set, PRJNA484804, contains 3 samples of photoautotrophic cultures. For photoautotrophic cultivation, cells were maintained in a photoautotrophic medium containing basal medium (0.7 g KH2PO4, 0.3 g K2HPO4, 0.3 g MgSO4 7H2O, 0.3 mg FeSO4_7H2O, 0.01 mg vitamin B1, and 1 mL A5 trace mineral solution) adding 5 g/L glycine under illumination of 60 μmol photons·m−2 s−1. Moreover, this dataset contains 9 samples of heterotrophic cultures. For heterotrophic cultivation, cells were maintained in a heterotrophic medium containing basal medium adding 30 g/L glucose and 0.5 g/L glycine under dark condition [4].

In both data sets, samples were harvested with three biological replicate during the logarithmic growth phase in autotrophic growth mode. In heterotrophic growth condition, samples were harvested at logarithmic phase, transitional point between growth and metabolite biosynthesis phase hereafter we called transitional phase, and during peak of metabolite biosynthesis, production peak.

Data preprocessing

Quality control of the raw reads was done using FastQC v0.11.5 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Then, low quality (below 30) reads were removed with Trimmomatic software v0.32 [16] using the following parameters: TRAILING:3, SLIDINGWINDOW:4:20, MINLEN: 40. Finally, reads were aligned to the A. protothecoides version ASM73321v1 reference genome (accessed from https://www.ncbi.nlm.nih.gov/genome) with TopHat2 version 2.0.12 [17] using the default parameter.

Differential gene expression analysis and meta-analysis

Expression counts of the A. protothecoides gene annotation [18] were quantified for each sample using Bioconductor RSubread package version 1.6.5 [19]. Differential gene expression was analyzed using the Bioconductor DESeq2 package version 1.10.1 [20]. Comparisons were performed using Wald’s test to determine the log2-fold change. Then, False discovery rate (FDR) [21] correction was used to account for multiple testing (p.adjust value cutoff of<0.05). Three comparisons (Auto vs. Logarithmic phase, Auto vs. Transition phase, and Auto vs. production phase at heterotrophic growth condition) were performed to identify DEGs.

To reduce the batch effects between the two datasets, the empirical Bayes algorithm was applied [22]. Then, common genes between two datasets were selected based on the transcriptome meta-analysis according to the prescribed in previous studies [11]. Finally, identified meta-genes subjected to additional analysis (Fig 1).

thumbnail
Fig 1. Flow chart of applied systems biology approach in current study.

https://doi.org/10.1371/journal.pone.0225677.g001

Weighted Gene Co-Expression Analysis (WGCNA)

Scale-free weighted co-expression networks of meta-genes were constructed using the WGCNA algorithm implemented in R WGCNA package [23]. Briefly, similarity co-expression matrix was calculated with Pearson's correlation cor (i,j) for meta-genes. Then, adjacencies between meta-genes were calculated using soft threshold power beta function [24] using the following formula: where aij represents the adjacencies between DEGs as a connection strengths index.

In the power of beta = 12, linear regression model fitting index R2 was higher than 0.8. Therefore, we selected this parameter as a satisfying scale-free topology criterion [12]. Finally, the adjacency was transformed into a topological overlap matrix (TOM) and corresponding dissimilarity matrix (1−TOM). Module identification was carried out with the dynamic tree cut method by hierarchically clustering genes using the dissimilarity matrix as the distance measure with a deep split value=2 and a minimum size cutoff= 60. The dissimilarity of eigengenes (MEs) higher than 0.75, indicating the similar expression profiles of containing genes, were selected as a cut-off to merge initial modules. Module visualization and further analysis were performed with Cytoscape software.

To evaluate the module preservation after the transition to heterotrophic condition, module preservation function implemented in the WGCNA R package was applied. As prescribed in previous studies, Zsummary < 5 or medianRank > 8 was used as criteria for consideration of a module as a non-preserved module[25].

Functional enrichment analysis

Pathway enrichment of initial sets of meta genes and non-preserved modules was performed using Kyoto encyclopedia of genes and genome (KEGG) database [26]. P-value < 0.05 was set as the cut-off criterion. Moreover, to identify potential transcription factors (TFs), transcriptional regulator (TRs), and protein kinases (PKs), amino acid sequences of meta genes were used to BLASTX search against the iTAK database version 1.6 [27] with a cut-off of E ≤ 10−5.

Identification and validation of hub genes

Hub genes were defined based on intra-module connectivity in non-preserved modules. Then, to validate and evaluate hub genes efficiency on different stages of discrimination, leave-one-out cross validation (LOOCV) was implemented based on expression values of hub genes [28].

Results and discussion

Transcriptomic changes after the transition to heterotrophic at different phases

Optimization of cultivation systems is one of the promising approaches to overcome the large-scale metabolite production challenges in microalgae. Dual-culture systems comprising heterotrophic growth followed by photoautotrophic cultivation are proposed as an effective issue for secondary metabolites production in some microalgae [29]. It has been proposed that multiple genes are involved in metabolites accumulation after the transition to heterotrophic growth modes, which may be interacted with each other to activate some signaling cascades [18]. Therefore, understanding the secondary metabolites accumulation underlying molecular mechanisms is important issues for optimization of this growth system to industrial levels production of secondary metabolites. Most of the previously studies only focus on screening of differentially expressed genes [4]. Nevertheless, it has been established that the integration of transcriptome meta-analysis with weighted co-expression network analysis is effective approach to shed light on complexity of biological processes. Using this approach, a set of responsible pathways and hub genes for secondary metabolites accumulation was determined that enabled us to propose some candidate genes for development of engineered microalgae strains. In this regards, 24 samples of RNA sequence data of A. protothecoides were retrieved from ENA database. The Meta genes abundance of A. protothecoides at three developmental stages (LG: logarithmic, TR: transition, and PR: production) during the transition from autotrophic to heterotrophic cultures was analyzed using the deep transcriptome RNA-sequencing. Venn diagram was used to show the meta analysis results at three developmental stages after the transition to heterotrophic condition (Fig 2). Transcriptomic Changes at three developmental stages viz, LG, TR and PR were analyzed using the integrated systems biology approach (Fig 1).

thumbnail
Fig 2.

Identified meta genes in two data sets at logarithmic (A), transition (B) and production peak (C) phases after the transition from autotrophic to heterotrophic growth mode.

https://doi.org/10.1371/journal.pone.0225677.g002

As shown in Fig 2, at LG phase, 2962 meta genes between two datasets was identified which 1380 genes of them were down-regulated and 1582 of them were up-regulated after transition from auto- to heterotrophic condition (Fig 2A). It is whilst; the number of meta genes during the transition from auto to heterotrophic condition at theTR phase was increased to 3472 with 1666 down-regulated and 1806 up-regulated meta gene (Fig 2B). With the progress to the PR phase, meta DEGs between two data sets were 3577 which 1713 and 1864 of them is down and up regulated, respectively (Fig 2C, S1 Table). This result demonstrates that with the progress to developmental phases after the transition to heterotrophic growth condition, most of the biological processes is up-regulated, which lead to the change of genetic architectures and phenotypic outcomes such as secondary metabolites accumulation [4].

Pathway enrichment of identified meta genes

The functional analysis of meta genes shows that some common and specific pathways are enriched at different phases (S2 Table). Results indicate that metabolic pathways are the main enriched pathways at developmental phases (S3 Table). In particular, a large number of genes were assigned to the metabolic pathways and biosynthesis of secondary metabolites. Although these pathways are found in all three phases, however, the changes in the number of up-regulated and down-regulated genes observed, suggesting the significant rearrangement in microalgae metabolism at different phases. Carbon metabolism is another enriched pathway. This observation confirms that of Martinez et al., [30], who showed that the metabolic pathways of carbon assimilation, size of the cells, volume densities of storage materials are changed after the transition to heterotrophic condition. Fatty acid biosynthesis pathway was found as another enriched pathway. In line with our finding, it has been reported that the increased amount of sugars in heterotrophic growth culture lead to accumulation of fatty acid [9].

Modulation of cellular energy state is another adaptation mechanism at heterotrophic cultures. Oxidative phosphorylation lead to produce ATP and NADH for maintenance and biosynthesis under the dark condition and carbon skeleton for biosynthesis under any growth condition [31]. Comparison of up-regulated genes involved in oxidative phosphorylation indicates that meta genes which encode the core subunits of mitochondrial complex I putatively, up-regulated at all phases. Nevertheless, some others which encode conserved non-core subunits of mitochondrial complex I, up-regulated specifically at PR phase. An example of non-core subunits which specifically up-regulated at the PR phase is ACP1 (encodes acyl carrier protein) a matrix-resident protein that has a key role in conformational regulation of mitochondrial complex I under different physiological condition [32]. It has been demonstrated that ACP homologs are key components in the production of polyketides and nonribosomal peptides [33], highlighting the roles of ACP in secondary metabolites accumulation at PR phases.

As expected, most of the photosynthesis-related meta genes such as PSAD-2 encoding the Photosystem I reaction center subunit II, PSBO1 encoding the Oxygen-evolving enhancer protein 1 of photosystem II, and PSAN encoding the Photosystem I reaction center subunit N are down-regulated at the LG, TR and PR phases. Down regulation of other photosynthesis-related genes at heterotrophic growth conditions has been previously demonstrated [4].

Identification of the transcription factors (TFs), transcriptional regulators (TRs) and protein kinases (PKs)

TFs are master transcriptional regulators in different physiological conditions [34,35]. As shown in Fig 3 and S3 Table, SBP, C3H and MYB_related, GARP-G2-linke, C2C2-GATA, EP2/ERF-AP2, and bZIP are the top large classes of TFs, which are contributed in transcriptional regulation transcriptome circuits at LG, TR, and PR phases. Most of these transcription factors families, especially SBP and C3H families were reported to be up-regulated at secondary metabolites accumulation circumstances [36]. Therefore, engineering these TFs would be a potential approach to develop new strain with improved secondary metabolites accumulation ability. There is also evidence that SBP is implicated in redox clean up [37]. Moreover, many TFs related to photosynthetic carbon fixation such as MYB-related TFs is up-regulated with the same pattern across developmental phases. It has been demonstrated that Myb-related transcription factor is involved in phytochrome signal transduction pathway leading to up-regulation of lchb (light-harvesting chlorophyll b) gene in Arabidopsis [38,39]

thumbnail
Fig 3. Distribution of TFs, TRs, and PKs families identified in meta genes.

The number TF families (A, B and C), TRs families (D, E, and F), PKs families (G, H, and I) in meta genes at LG, TR and PR phases, respectively.

https://doi.org/10.1371/journal.pone.0225677.g003

Transcriptional regulators (TRs) are at the interface between sensing and responding to environmental conditions [40]. Results indicated that GNAT (GCN5-related-N-acetyltransferase), SNF2 (sucrose non-fermenting 2), and SET families are top classes of TRs at three phases (S3 Table). Interestingly, some TRs such as PHD shows different expression pattern at LG phase in comparison with TR and PR phases. It is apparent from data in Fig 3 that all of the TRs belong to PHD family are up-regulated at PR and TR phases. Prior study has indicated that SBP, bHLH, GNAT are involved in regulation of nitrogen assimilation, transportation and incorporation of ammonium into carbon skeletons via the glutamine synthetase/glutamate synthase cycle [41]. Moreover, a regulatory impact of SET families on major lipid droplet protein (MLDP1), which is contributed to TAG regulation, has been approved previously [41].

PKs play important roles in signaling networks including the perception of biotic agents, light quality and quantity, phytohormones, and various environmental conditions [42]. Fifty-seven (29 up-regulated and 28 down–regulated), sixty-two (43 up-regulated and 19 down-regulated), and fifty (43 up-regulated and 17 down-regulated) PKs were identified among the meta genes at LG, TR, and PR phases, respectively (S3 Table). Result illustrated that PKs mostly are up-regulated with the progress of PR phases at the heterotrophic conditions. Moreover, TKL (Tousled protein kinase), CMGC, and CAMK families are among the main contributed PKs at heterotrophic conditions (Fig 3). The genes encoding TKL was dominantly up-regulated under the heterotrophic condition. As evidenced by genetic analyses of Arabidopsis mutant, TKL group play important role in the regulation of ethylene signaling [43] and accumulation of secondary metabolites [44] at heterotrophic condition.

Identification of functional modules related to developmental phases

The details of the signed weighted gene co-expression network construction are prescribed in [45]. By using the steps described in Materials and Methods section, three separate networks of highly correlated meta genes at LG, TR and PR phases were constructed. Shortly, weighted adjacency matrices were created using the soft threshold power. In this study soft threshold power =9 was selected. Then, transformed dissimilarity matrices was applied for average linkage hierarchical clustering with method implemented in flashClust R package [46]. The dendrograms obtained from the preliminary analysis are shown in Fig 4. The structures of the dendrograms are changed at different phases after transition to the heterotrophic growth mode. To further inspection of the network pattern changes at different developmental phases, modules of dendrogram were defined using the dynamic branch cutting method, which is displayed by different colors (Fig 5). Using the mentioned parameter setting, the branch-cutting method resulted 15 co-expressed modules with an average size 235, 12 of which were significant. After generating co-expressed modules, we compared the results with the corresponding set of modules in different developmental phases. As shown, four modules were identified in the LG network. The yellow and turquoise modules as the smallest and largest modules contained 212 and 810 genes, respectively. In the TR co-expression network, seven modules were assigned (Fig 5) with a size range from 72 (black) to 1,422 (turquoise) genes. At the PR phase, four modules were identified with a size range from 223 genes (yellow modules) to 1053 genes (turquoise module). Preservation analysis showed that three modules viz, blue, turquoise, and yellow has persevered between three phases. However, the brown module showed non-preservation between different phases (Table 1). Since the connectivity and expression patterns of the co-expressed genes at non-preserved modules are altered [25], they may be related to the over-production of secondary metabolites at the PR phase. Because Grey and Gold modules contain the genes that not assigned to any modules, these modules were excluded from the analysis. Corresponding genes for each defined modules were presented in S4 Table.

thumbnail
Fig 4.

Hierarchical cluster constructed with WGCNA at LG (A), TR (B), and PR (C) phases. Each vertical line (leaf) represents the corresponding genes.

https://doi.org/10.1371/journal.pone.0225677.g004

thumbnail
Fig 5.

Visual representation of the changes in the module structure between LG (A), TR (B), and PR (C) phases. Modules are illustrated with different colors.

https://doi.org/10.1371/journal.pone.0225677.g005

thumbnail
Table 1. Details of conservation analysis of defined modules at different developmental phases with permutation=200.

https://doi.org/10.1371/journal.pone.0225677.t001

Functional annotation of non-preserved module

To unravel the potential mechanisms responsible for the accumulation of secondary metabolites at heterotrophic growth condition we focused on the non-preserved module. As presented, the brown module was determined as a non-preserved module in three phases. In the non-preserved module, expression patterns and connectivity characteristics of the genes vary across three phases (Fig 6). Functionally enrichment analysis identified “Biosynthesis of secondary metabolites” as a functionally enriched KEGG pathway in the brown module. Other significantly enriched pathways in brown module were “Carotenoid biosynthesis”, “Fatty acid metabolism”, “Methane metabolism”, “Ascorbate and aldarate metabolism”, “Peroxisome proliferator-activated receptors (PPARs) signaling pathway” and “Adipocytokine signaling pathway” (Table 2). Prior studies have also highlighted the variation of the carotenoid biosynthesis, fatty acid metabolism and Pentose phosphate pathway underlying genes in the autotrophic-heterotrophic cultivation transition culture mode [4,47,48]. However, hub genes identification and their connectivity have not been surveyed. There are several genes shared by several pathways such as ACS2 (Acetyl CoA synthetase) (eight pathways), LACS7 (Long-chain acyl-CoA synthetase 7) (four pathways), SHMT1 (Serine hydroxymethyltransferase) (four pathways). This also validates the relevance of the selected module for the detection of important players in metabolite accumulation [12]. Interestingly, Protein- Protein interaction network (PPI) analysis and connectivity measurement using a soft connectivity algorithm confirms that these genes are highly connected in the PR phase generated from the brown module, while there is low or no such connection were found in the corresponding module at LG and TR phases. The potential of nodes with high intra-module connectivity as key and determinant genes in the different biological processes has been approved previously [25]. Moreover, it has been suggested that important nodes in large networks are often not among the whole-network hubs and selection of the hubs in sub-networks (modules) is more efficient than a whole-network. Based on these findings, we hypothesized that the nodes with high intra-module connectivity may play important roles in secondary metabolites over-production at the PR phase after the transition to heterotrophic conditions.

thumbnail
Fig 6.

PPI networks of co-expressed meta genes in brown module at LG (A), TR (B), and PR (C) phases. Changes in intra-module connectivity are highlighted at different phases.

https://doi.org/10.1371/journal.pone.0225677.g006

thumbnail
Table 2. Functional enrichment of non-preserved module based on KEGG database.

https://doi.org/10.1371/journal.pone.0225677.t002

ACS2 encoding a chloroplastic acetyl-CoA synthetase catalyzes the conversion of acetate to acetyl-CoA. Experimental evidence has proved that ACS is a key enzyme in the biosynthesis of acyl glycerides [49]. Moreover, ACS allows consumption of the acetate and increase the carbon flux towards the synthesis of fatty acids to enhance biosynthesis and accumulation of lipid at fermentative growth condition [50]. High connectivity of this gene in the PR network and increased expression levels in the PR phase, may be highlighted the importance of ACS in lipids accumulation at heterotrophic conditions.

Another interesting hub gene that is part of the tightly connected cluster in the brown module is SHMT1, which encodes the Serine hydroxymethyltransferase. SHMT1 simultaneously catalyzes the reversible conversions of l-serine to glycine and tetrahydrofolate to 5,10-methylene tetrahydrofolate. Contribution of tetrahydrofolate in cellular one-carbon (C1) pathways and ROS generation at stress condition has been reported previously [51]. Moreover, prior study have proved that SHMT1 switch nitrogen and carbon metabolism to secondary metabolite biosynthesis [11], validating the results of current study. An earlier study also showed that the overexpression of the SHMT1 increases biomass production [52].

There are other interesting hub genes (top 30 hub genes) in the brown module, which are presented in S5 Table. We found that these genes are related to ribosome biogenesis, fatty acid metabolism, carotenoid biosynthesis, sulfur relay system.

Ribosome biogenesis is a central process in the growing cells. A recent study has indicated the translational regulation of ribosome biosynthesis in response to carbon depletion [53]. Among different ribosome biogenesis involved genes, EIF3I was demined as a key gene for metabolite accumulation, based on the connectivity analysis. In agreement with our findings, the core functionality of eIF3 has been confirmed in a previous study [54]. At heterotrophic growth condition, the inorganic carbon is sufficiently supplied and marked increasing of ribosome biogenesis is needed to the biosynthesis of proteins and metabolites [30].

Further analysis of connectivity in the none-preserved module at the PR phase reveals another gene, LACS7, a part of peroxisome proliferator-activated receptors (PPAR) and Adipocytokine signaling pathways [55], as another important gene. Newly synthesized free fatty acids (FAs) need to activate to acyl-CoA form by LACS to involves in the glycerolipid metabolic pathways, indicating the critical roles of LACS in FA metabolism [56]. In Arabidopsis, a family of nine genes encoding LACS protein has been identified to play their roles in various aspects of lipid metabolism at different subcellular localizations. Among these LACS isoforms, peroxisomal LACS6 and LACS7 are involved in fatty acid β-oxidation [57]. The results of our analysis indicated that LACS7 is up-regulated during the LG and TR phases. However; marked down-regulation is shown at the PR phase. Consequently, inhibition of β-oxidation would prevent the loss of TAG during the PR phase. These findings clearly show that the lipids accumulations are dominantly related to decreased lipid catabolism at the PR phase. In agree with our findings, it has been reported that inactivation of the LACS6 and LACS7 lipid breakdown [58]

Another interesting gene that shows a large difference between connectivity in PR versus LG and TR brown module is LCE (Lycopene epsilon cyclase) which is involved in the carotenoids biosynthesis. Transcriptional variation of LCE during the lutein and beta-carotene accumulation circumstances has been reported [11] [4]; however, the changes in the connectivity pattern have not shown up yet, highlighting the efficiency of our integrative approach.

Some identified hub genes such as F751_5997, F751_3508, and F751_4289 were not annotated, whereas they should be considered as potential candidates for future studies. As above-mentioned, the efficiency of hub genes in different phase’s discrimination was assessed using the LOOCV method. Identified hub genes have discriminated different phases with 90.48% accuracy, validating the identified hub genes (S1 Fig).

Conclusion

In summary, our results show that the integration of transcriptome meta-analysis with WGCNA, along with connectivity analysis and functional enrichment can be used to identify modules associated with secondary metabolite accumulation in microalgae. Identified modules are used for exploratory analysis of contributed pathways in metabolite accumulation, as exemplified by the identified pathways such as “Methane metabolism”, “Ascorbate and aldarate metabolism” and “ PPARs”. Moreover, the integrated approach was applied here, proposed some candidate target genes, such as SHMT1, for the development of metabolites accumulating strains in microalgae. However, future efforts should be made to further investigation of the modules and candidate genes associated with secondary metabolites accumulation.

Supporting information

S1 Fig. Validation of hub genes based on LOOCV method.

https://doi.org/10.1371/journal.pone.0225677.s001

(TIF)

S1 Table. Identified meta genes in three phase of transition from auto- to heterotrophic condition.

https://doi.org/10.1371/journal.pone.0225677.s002

(XLSX)

S2 Table. Significant enriched pathways in different phases after transition to heterotrophic growth condition.

https://doi.org/10.1371/journal.pone.0225677.s003

(XLSX)

S3 Table. Identified TFs, PKs and TRs across LG, TR and PR phases.

https://doi.org/10.1371/journal.pone.0225677.s004

(XLSX)

S4 Table. Genes of each corresponding modules across LG, TR and PR phases.

https://doi.org/10.1371/journal.pone.0225677.s005

(XLSX)

S5 Table. Hub genes and connectivity in brown module at PR phase.

https://doi.org/10.1371/journal.pone.0225677.s006

(XLSX)

Acknowledgments

We would like to thank Dr Asgar Panahi for reading of the manuscript.

References

  1. 1. Xiong W, Li X, Xiang J, Wu Q (2008) High-density fermentation of microalga Chlorella protothecoides in bioreactor for microbio-diesel production. Appl Microbiol Biotechnol 78: 29–36. pmid:18064453
  2. 2. Pulz O, Gross W (2004) Valuable products from biotechnology of microalgae. Applied Microbiology and Biotechnology 65: 635–648. pmid:15300417
  3. 3. Harun R, Singh M, Forde GM, Danquah MK (2010) Bioprocess engineering of microalgae to produce a variety of consumer products. Renewable and Sustainable Energy Reviews 14: 1037–1047.
  4. 4. Xiao Y, He X, Ma Q, Lu Y, Bai F, et al. (2018) Photosynthetic Accumulation of Lutein in Auxenochlorella protothecoides after Heterotrophic Growth. Mar Drugs 16.
  5. 5. Cho DH, Choi JW, Kang Z, Kim BH, Oh HM, et al. (2017) Microalgal diversity fosters stable biomass productivity in open ponds treating wastewater. Sci Rep 7: 1979. pmid:28512332
  6. 6. Cecchin M, Benfatto S, Griggio F, Mori A, Cazzaniga S, et al. (2018) Molecular basis of autotrophic vs mixotrophic growth in Chlorella sorokiniana. 8: 6465.
  7. 7. Zijffers JW, Janssen M, Tramper J, Wijffels RH (2008) Design process of an area-efficient photobioreactor. Mar Biotechnol (NY) 10: 404–415.
  8. 8. Pulz O (2001) Photobioreactors: production systems for phototrophic microorganisms. Applied Microbiology and Biotechnology 57: 287–293. pmid:11759675
  9. 9. Gladue RM, Maxey JE (1994) Microalgal feeds for aquaculture. Journal of Applied Phycology 6: 131–141.
  10. 10. Akubude VC, Nwaigwe KN, Dintwa E (2019) Production of biodiesel from microalgae via nanocatalyzed transesterification process: A review. Materials Science for Energy Technologies 2: 216–225.
  11. 11. Panahi B, Frahadian M, Dums JT, Hejazi MA (2019) Integration of Cross Species RNA-seq Meta-Analysis and Machine-Learning Models Identifies the Most Important Salt Stress–Responsive Pathways in Microalga Dunaliella. Frontiers in Genetics 10.
  12. 12. Riquelme Medina I, Lubovac-Pilav Z (2016) Gene Co-Expression Network Analysis for Identifying Modules and Functionally Enriched Pathways in Type 1 Diabetes. PLoS One 11: e0156006. pmid:27257970
  13. 13. Wei H, Persson S, Mehta T, Srinivasasainagendra V, Chen L, et al. (2006) Transcriptional coordination of the metabolic network in Arabidopsis. Plant physiology 142: 762–774. pmid:16920875
  14. 14. Panahi B, Mohammadi SA, Ruzicka K, Abbasi Holaso H, Zare Mehrjerdi M (2019) Genome-wide identification and co-expression network analysis of nuclear factor-Y in barley revealed potential functions in salt stress. Physiology and Molecular Biology of Plants 25: 485–495. pmid:30956430
  15. 15. Saito K, Hirai MY, Yonekura-Sakakibara K (2008) Decoding genes with coexpression networks and metabolomics - 'majority report by precogs'. Trends Plant Sci 13: 36–43. pmid:18160330
  16. 16. Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30: 2114–2120. pmid:24695404
  17. 17. Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, et al. (2013) TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biology 14: R36. pmid:23618408
  18. 18. Gao C, Wang Y, Shen Y, Yan D, He X, et al. (2014) Oil accumulation mechanisms of the oleaginous microalga Chlorella protothecoides revealed through its genome, transcriptomes, and proteomes. BMC Genomics 15: 582. pmid:25012212
  19. 19. Liao Y, Smyth GK, Shi W (2013) The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Res 41: e108. pmid:23558742
  20. 20. Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15: 550. pmid:25516281
  21. 21. Benjamini Y, Hochberg Y (1995) Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological) 57: 289–300.
  22. 22. Johnson WE, Li C, Rabinovic A (2006) Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8: 118–127. pmid:16632515
  23. 23. Langfelder P, Horvath S (2008) WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9: 559. pmid:19114008
  24. 24. Botía JA, Vandrovcova J, Forabosco P, Guelfi S, D’Sa K, et al. (2017) An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks. BMC Systems Biology 11: 47. pmid:28403906
  25. 25. Bakhtiarizadeh MR, Hosseinpour B, Shahhoseini M, Korte A, Gifani P (2018) Weighted Gene Co-expression Network Analysis of Endometriosis and Identification of Functional Modules Associated With Its Main Hallmarks. Front Genet 9: 453. pmid:30369943
  26. 26. Kanehisa M, Goto S, Sato Y, Kawashima M, Furumichi M, et al. (2014) Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res 42: D199–205. pmid:24214961
  27. 27. Zheng Y, Jiao C, Sun H, Rosli HG, Pombo MA, et al. (2016) iTAK: A Program for Genome-wide Prediction and Classification of Plant Transcription Factors, Transcriptional Regulators, and Protein Kinases. Mol Plant 9: 1667–1670. pmid:27717919
  28. 28. Tahmasebi A, Ashrafi-Dehkordi E, Shahriari AG, Mazloomi SM, Ebrahimie E (2019) Integrative meta-analysis of transcriptomic responses to abiotic stress in cotton. Progress in Biophysics and Molecular Biology 146: 112–122. pmid:30802474
  29. 29. Ogbonna JC, Tomiyamal S, Tanaka H (1998) Heterotrophic cultivation of Euglena gracilis Z for efficient production of α-tocopherol. Journal of Applied Phycology 10: 67–74.
  30. 30. MARTÍNEZ F, ASCASO C, ORÚS MI (1991) Morphometric and Stereologic Analysis of Chlorella vulgaris Under Heterotrophic Growth Conditions. Annals of Botany 67: 239–245.
  31. 31. Perez-Garcia O, Bashan Y (2015) Microalgal Heterotrophic and Mixotrophic Culturing for Bio-refining: From Metabolic Routes to Techno-economics. In: Prokop A, Bajpai RK, Zappi ME, editors. Algal Biorefineries: Volume 2: Products and Refinery Design. Cham: Springer International Publishing. pp. 61–131.
  32. 32. Subrahmanian N, Remacle C, Hamel PP (2016) Plant mitochondrial Complex I composition and assembly: A review. Biochimica et Biophysica Acta (BBA) - Bioenergetics 1857: 1001–1014. pmid:26801215
  33. 33. Byers DM, Gong H (2007) Acyl carrier protein: structure–function relationships in a conserved multifunctional protein family. Biochemistry and Cell Biology 85: 649–662. pmid:18059524
  34. 34. Panahi B, Mohammadi SA, Ebrahimie E (2014) Identification of miRNAs and their potential targets in halophyte plant Thellungiella halophila. BioTechnologia 94: 285–290.
  35. 35. Panahi B, Mohammadi SA, Khaksefidi RE, Fallah Mehrabadi J, Ebrahimie E (2015) Genome-wide analysis of alternative splicing events in Hordeum vulgare: Highlighting retention of intron-based splicing and its possible function through network analysis. FEBS Letters 589: 3564–3575. pmid:26454178
  36. 36. Li L, Zhang G, Wang Q (2016) De novo transcriptomic analysis of Chlorella sorokiniana reveals differential genes expression in photosynthetic carbon fixation and lipid production. BMC Microbiol 16: 223. pmid:27669744
  37. 37. Zinta G, Khan A, AbdElgawad H, Verma V, Srivastava AK (2016) Unveiling the Redox Control of Plant Reproductive Development during Abiotic Stress. Front Plant Sci 7: 700. pmid:27379102
  38. 38. Wang ZY, Kenigsbuch D, Sun L, Harel E, Ong MS, et al. (1997) A Myb-related transcription factor is involved in the phytochrome regulation of an Arabidopsis Lhcb gene. Plant Cell 9: 491–507. pmid:9144958
  39. 39. Xu YH, Liu R, Yan L, Liu ZQ, Jiang SC, et al. (2012) Light-harvesting chlorophyll a/b-binding proteins are required for stomatal response to abscisic acid in Arabidopsis. J Exp Bot 63: 1095–1106. pmid:22143917
  40. 40. López García de Lomana A, Schäuble S, Valenzuela J, Imam S, Carter W, et al. (2015) Transcriptional program for nitrogen starvation-induced lipid accumulation in Chlamydomonas reinhardtii. Biotechnol Biofuels 8: 207. pmid:26633994
  41. 41. Gargouri M, Park JJ, Holguin FO, Kim MJ, Wang H, et al. (2015) Identification of regulatory network hubs that control lipid metabolism in Chlamydomonas reinhardtii. J Exp Bot 66: 4551–4566. pmid:26022256
  42. 42. Sun J, Li L, Wang P, Zhang S, Wu J (2017) Genome-wide characterization, evolution, and expression analysis of the leucine-rich repeat receptor-like protein kinase (LRR-RLK) gene family in Rosaceae genomes. BMC Genomics 18: 763. pmid:29017442
  43. 43. Kieber JJ, Rothenberg M, Roman G, Feldmann KA, Ecker JR (1993) CTR1, a negative regulator of the ethylene response pathway in Arabidopsis, encodes a member of the raf family of protein kinases. Cell 72: 427–441. pmid:8431946
  44. 44. Ke S-W, Chen G-H, Chen C-T, Tzen JTC, Yang C-Y (2018) Ethylene signaling modulates contents of catechin and ability of antioxidant in Camellia sinensis. Botanical Studies 59: 11. pmid:29616373
  45. 45. Zhang B, Horvath S (2005) A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol 4: Article17.
  46. 46. Müllner D (2013) fastcluster: Fast hierarchical, agglomerative clustering routines for R and Python. Journal of Statistical Software 53: 1–18.
  47. 47. Gao C, Wang Y, Shen Y, Yan D, He X, et al. (2014) Oil accumulation mechanisms of the oleaginous microalga Chlorella protothecoides revealed through its genome, transcriptomes, and proteomes. BMC Genomics 15: 582. pmid:25012212
  48. 48. Li T, Yang F, Xu J, Wu H, Mo J, et al. (2020) Evaluating differences in growth, photosynthetic efficiency, and transcriptome of Asterarcys sp. SCS-1881 under autotrophic, mixotrophic, and heterotrophic culturing conditions. Algal Research 45: 101753.
  49. 49. Rengel R, Smith RT, Haslam RP, Sayanova O, Vila M, et al. (2018) Overexpression of acetyl-CoA synthetase (ACS) enhances the biosynthesis of neutral lipids and starch in the green microalga Chlamydomonas reinhardtii. Algal Research 31: 183–193.
  50. 50. Yan J, Cheng R, Lin X, You S, Li K, et al. (2013) Overexpression of acetyl-CoA synthetase increased the biomass and fatty acid proportion in microalga Schizochytrium. Appl Microbiol Biotechnol 97: 1933–1939. pmid:23070649
  51. 51. Moreno JI, Martín R, Castresana C (2005) Arabidopsis SHMT1, a serine hydroxymethyltransferase that functions in the photorespiratory pathway influences resistance to biotic and abiotic stress. The Plant Journal 41: 451–463. pmid:15659103
  52. 52. Zhang J, Li M, Bryan AC, Yoo CG, Rottmann W, et al. (2019) Overexpression of a serine hydroxymethyltransferase increases biomass production and reduces recalcitrance in the bioenergy crop Populus. Sustainable energy & fuels 3: 195–207.
  53. 53. Karlsen J, Asplund-Samuelsson J, Thomas Q, Jahn M, Hudson EP (2018) Ribosome Profiling of Synechocystis Reveals Altered Ribosome Allocation at Carbon Starvation. mSystems 3: e00126–00118.
  54. 54. Masutani M, Sonenberg N, Yokoyama S, Imataka H (2007) Reconstitution reveals the functional core of mammalian eIF3. The EMBO journal 26: 3373–3383. pmid:17581632
  55. 55. Choi CI (2019) Astaxanthin as a Peroxisome Proliferator-Activated Receptor (PPAR) Modulator: Its Therapeutic Implications. Mar Drugs 17.
  56. 56. Yang Z, Ji H, Liu D (2016) Oil biosynthesis in underground oil-rich storage vegetative tissue: comparison of Cyperus esculentus tuber with oil seeds and fruits. Plant and Cell Physiology: pcw165.
  57. 57. Fulda M, Shockey J, Werber M, Wolter FP, Heinz E (2002) Two long‐chain acyl‐CoA synthetases from Arabidopsis thaliana involved in peroxisomal fatty acid β‐oxidation. The Plant Journal 32: 93–103. pmid:12366803
  58. 58. Radakovits R, Jinkerson RE, Darzins A, Posewitz MC (2010) Genetic engineering of algae for enhanced biofuel production. Eukaryot Cell 9: 486–501. pmid:20139239