Algae in the order Trentepohliales have a broad geographic distribution and are generally characterized by the presence of abundant β-carotene. The many monographs published to date have mainly focused on their morphology, taxonomy, phylogeny, distribution and reproduction; molecular studies of this order are still rare. High-throughput RNA sequencing (RNA-Seq) technology provides a powerful and efficient method for transcript analysis and gene discovery in Trentepohlia jolithus.
Illumina HiSeq 2000 sequencing generated 55,007,830 Illumina PE raw reads, which were assembled into 41,328 assembled unigenes. Based on NR annotation, 53.28% of the unigenes (22,018) could be assigned to gene ontology classes with 54 subcategories and 161,451 functional terms. A total of 26,217 (63.44%) assembled unigenes were mapped to 128 KEGG pathways. Furthermore, a set of 5,798 SSRs in 5,206 unigenes and 131,478 putative SNPs were identified. Moreover, the fact that all of the C4 photosynthesis genes exist in T. jolithus suggests a complex carbon acquisition and fixation system. Similarities and differences between T. jolithus and other algae in carotenoid biosynthesis are also described in depth.
This is the first broad transcriptome survey for T. jolithus, increasing the amount of molecular data available for the class Ulvophyceae. As well as providing resources for functional genomics studies, the functional genes and putative pathways identified here will contribute to a better understanding of carbon fixation and fatty acid and carotenoid biosynthesis in T. jolithus.
Citation: Li Q, Liu J, Zhang L, Liu Q (2014) De Novo Transcriptome Analysis of an Aerial Microalga Trentepohlia jolithus: Pathway Description and Gene Discovery for Carbon Fixation and Carotenoid Biosynthesis. PLoS ONE 9(9): e108488. https://doi.org/10.1371/journal.pone.0108488
Editor: Cynthia Gibas, University of North Carolina at Charlotte, United States of America
Received: January 9, 2014; Accepted: August 30, 2014; Published: September 25, 2014
Copyright: © 2014 Li et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was financially supported by the Chinese National Basic Research Program of China (973 Program, No. 2011CB2009) and the China National Nature Science Foundation (No. 31300330). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The order Trentepohliales (Chlorophyta: Ulvophyceae) is one of the most widespread groups of organisms, and represents an ideal model taxon to investigate evolutionary patterns . Species of this order are widely distributed over an extensive range of habitats in areas with humid and rainy climates including tropical and temperate regions –. Many of these species have been reported as lichen photobionts and to produce carbohydrates for fungi –. As the only family within the Trentepohliales, with six genera, the Trentepohliaceae is generally characterized by the presence of abundance β-carotene and hematochrome, which color the algal thallus yellow, orange, or red . It is also characterized by the absence of pyrenoids in the chloroplast, a flagellar apparatus with unique ultrastructure, transverse cell walls with plasmodesmata, and differentiated reproductive cells , . In recent years, numerous studies have been carried out on many aspects of the biology of Trentepohlia. Many of these monographs are concerned with morphology, taxonomy, phylogeny, distribution and reproduction , –. Diversity, life history and ecology were examined in detail by Rindi et al. . Others have investigated physiology, photosynthesis and ultrastructure , , –. The classification and phylogeny of the Trentepohliales at the genus level have been determined, using morphological characteristics and molecular data , , respectively.
Trentepohlia Martius 1817 was first described by Printz in 1939, and afterwards detailed studies on Trentepohlia were carried out over a very long period of time , . These aeroterrestrial phototrophic microorganisms typically grow epiphytically and epilithically on natural surfaces such as tree bark, soil and rock . In addition, they are also found in urban areas on anthropogenic surfaces such as bricks, concrete, cement and other artificial surfaces . These rock-inhabiting algae are a kind of lithophytic alga that lives on, or within, rock substrates, expanding to a few millimeters underneath the rock surface . The nutrient supply of aeroterrestrial algae, however, has not been so well established . Lange et al. proposed that green algal lichens are capable of photosynthesizing in the presence of water vapor . Wright et al., 2001, showed that the atmosphere carries high concentrations of nitrate, ammonium and phosphate . It is a remarkable fact that water, as rain or snow, can be characterized as a fertilizer supplying essential nutrients to aeroterrestrial algae . Aeroterrestrial algal have developed, inter alia, thick cell walls, various compounds such as multi-functional sugar alcohols, and extracellular mucoid layers to ensure water availability . In addition, the biofilm can undergo large changes in volume during dry and wet periods, or during freezing and thawing . Extracellular polymeric substances (EPS), which form a coating around Trentepohlia, aid in the attachment of the alga to surfaces . EPS also substantially increases the water holding capacity of the algal biofilm and significantly reduces water loss in algal cells exposed to periodic drying , .
Several morphological characteristics have been clearly characterized by Hoffmann  and Rindi et al. . Liu et al. collected Trentepohlia in the Yajiageng valley, Mt. Gongga, and in 2012 named this alga as a new variety: Trentepohlia jolithus var. yajiagengensis var. nov. . This strain has a remarkably high content of carotenoids, which helps the algal cells resist strong ultraviolet radiation at high altitudes. This variety is also rich in oils, which gives it high resistance to cold dry winters , . The phylogenetic affinities of Trentepohlia and the other members of the Trentepohliales have long been unclear . Perhaps as a result of global warming, epiphytic species appear to be increasing. Aptroot et al. had shown that all lichens containing Trentepohlia occurring in the Noordhollands Duinreservaat have increased in abundance, and in 2007 suggested that global warming might affect Trentepohlia directly rather than the fungal components . The ecological distribution and physiological properties of a new variety of Trentepohlia aurea, sampled from the west plateau klint of the Sichuan Basin, China, has now been investigated in our laboratory.
High-throughput RNA-sequencing (RNA-Seq) is a recently developed technology that provides new strategies for analyzing the functional complexity of a transcriptome , . The application of such high-throughput approaches could greatly accelerate the research progress on a new species by providing an improved understanding of essential components in a particular cellular process, the regulation of networks involved in biological processes, metabolic pathways, and responses to stimuli and so on. There is a growing number of oleaginous microalgae for which de novo transcriptomes have been assembled and annotated –. However, to date, only a few studies have been carried out in other kinds of microalgae –. The limited amount of molecular data available for most of the Ulvophyceae is a significant hurdle to the completion of genetic analyses and biological studies. In the present study, using Illumina sequencing and bioinformatic analysis, we have examined the T. jolithus transcriptome to gain more insights into its metabolic pathways. The principal objective of this study was to annotate the functional genes and identify potential metabolic pathways, such as photosynthetic carbon fixation, and fatty acid and carotenoid biosynthesis. The results provide new insights into the regulation of photosynthesis and suggest potential strategies for optimizing conditions favoring growth and carotenogenesis.
Materials and Methods
No specific permits were required for the described field studies, which took place in locations with public right-of-way. The field studies did not involve endangered or protected species.
Specimen collection and RNA extraction
Specimens of Trentepohlia jolithus were randomly collected from reddish stones in Yanzigou, the east slope of Mt. Gongga, Luding, Sichuan province (China; 29°38′N, 102°7′E) on Dec 1, 2012 (Figure 1 A). According to the weather station at 3000 m, temperature, air humidity and light intensity exhibit diurnal and seasonal fluctuations. The mean temperature we measured during daytime varied from 12°C to–4°C at 3100 meters above sea level. This area has a very mild climate with an average relative humidity above 90%, which is ideal for Trentepohlia growth . The light intensity we measured during daytime ranged from 30 to 2500 µmol photons m−2 s−1 depending on the degree of cloud cover. When we were sampling at noon, the temperature, humidity and light intensity were 7°C, 51% and 2200 µmol photons m−2 s−1, respectively. These epilithic algae colonize the exposed rock surface and extend for a few tens of kilometers along the valley. The filamentous green algae form large, bright red to deep red biofilms on the rocks (Figure 1 A, B). Samples of the wild algae were carefully scraped from the stones with sharp knives under the existing environmental conditions. Several pieces of T. jolithus were selected based on their bright color and apparent cleanness. To maximize information on variability, T. jolithus samples from different heights were chosen for the experiment and sealed in plastic collection bags. The materials were kept in a car refrigerator and sent to our laboratory in Qingdao by air.
(A) Red-Stone-Valley in winter. (B) Reddish stones along rivers. (C) Microscopic structure of dried T. jolithus. (D) Microscopic structure of rehydrated T. jolithus after a few drops of water was added to the dried material.
In the laboratory, T. jolithus cells were harvested after washing and filtration to remove visible contaminants. First, the samples were rehydrated, a process that takes just a few seconds (Figs 1 B and 1 C). Mortars and a 70-µm sieve cloth were used to break up the biofilm gently during the washing process, using sterilized water. Then, a piece of algal material obtained by filtration through the sieve cloth at room temperature, was resuspended in sterile deionized water, and transferred into a new mortar. After washing several times with the sterilized water, any remaining lichen material and other contaminants were removed and clean unialgal samples were selected. Even so, there is a possibility that we sampled a community and that this transcriptome sample was a metatranscriptome greatly enriched for T. jolithus. The identity of the organism was further confirmed from these samples by careful examination of the morphology under a microscope and by 18S rDNA sequencing. The bulk of the clean selected samples was dried with hygroscopic filter paper and flash frozen immediately in liquid nitrogen for subsequent RNA extraction.
Preparation of total RNA
RNA from an 80 mg subsample was extracted and purified using an Plant RNA Kit following the manufacturer’s instructions. The RNase-free DNase I digestion protocol was performed to remove residual genomic DNA. Quantity and purity of the extracted RNA was determined by a Nanodrop ND-100 spectrophotometer (LabTech, Holliston, MA, USA). RNA integrity was confirmed via an Agilent 2100 bioanalyzer (Agilent; Palo Alto, CA, USA), which gave an RNA integrity number (RIN) of 7.9.
RNA-Seq Library Preparation and Sequencing
Four µg of the total RNA sample were used to isolate poly-A containing mRNA molecules, using poly-T oligo-attached magnetic beads (Illumina). Then, the poly-A RNA was purified and fragmented into smaller pieces (200–700 nt) using divalent cations at 94°C for exactly 5 minutes. This protocol uses two rounds of enrichment for poly-A mRNA followed by thermal mRNA fragmentation. Aliquots of purified mRNA were used for construction of the cDNA libraries using the mRNA-Seq Kit supplied by Illumina. During the cDNA synthesis process, the cleaved RNA fragments were primed with random hexamers and then reverse-transcribed into first strand cDNA using reverse transcriptase and random primers. The cDNA was next converted into double stranded DNA using the reagents supplied in the Illumina TruSeq RNA sample preparation kit, according to the manufacturer’s protocol, and the resulting dsDNA was used for library preparation. After adenylating the 3′ ends and ligating adapters to the ends of the dscDNA, we selectively enriched for those DNA fragments that had adapter molecules on both ends, and amplified the amount of DNA in the library using PCR. During the QC (Quality Control) steps, an Agilent 2100 Bioanalyzer and an ABI StepOnePlus Real-Time PCR System were used for quality assessment and quantification of the sample library. Finally, the library was sequenced using an Illumina HiSeq 2000, at Huada Genomics Institute Co. Ltd, China. After the sequencing was completed, the image data output was transformed by base calling into sequence data; this output was called “raw data” or “raw reads” and stored in fastq format. Raw sequence data has been submitted to the NCBI Sequence Read Archive (SRA) with the accession number SRP033549.
Raw Data Analysis and De Novo Transcriptome Assembly
Before assembly, the raw reads were filtered to obtain high-quality clean reads. Dirty raw reads with adaptor sequences, with unidentified nucleotides in excess of 5%, as well as low quality reads having more than 20% low quality base identification (base quality <10) were discarded. At this point, 90 bp paired-end reads were extracted from the raw data. De novo assembly of the clean reads was performed using Trinity software (release-20121005) as described for de novo transcriptome assembly without reference genome . The sequences resulting from the trinity assembly were called unigenes.
Unigene annotation and classification
For unigene functional annotation, BLAST alignments (E-value <10−5) were carried out using unigene sequences to query databases. The queried databases included the NCBI non-redundant protein database (NR), Kyoto Encyclopedia of Genes and Genomes (KEGG), Swiss-Prot, and Clusters of Orthologous Groups (COG). When a unigene did not align with anything in the above databases, software named ESTScan  was introduced to decide its sequence direction. Gene ontology (GO) assignments were applied using the Blast2go program for functional annotation . WEGO software  was used to do GO functional classification. Pathway assignments were determined following the KEGG pathway database using BlastX with an E-value threshold of 1.0E-5. Simple sequence repeat (SSR) detection was done with MIcroSAtellite (MISA) software using unigenes as reference. For SNP analysis, SNPs were detected using SoapSNP (http://soap.genomics.org.cn/soapsnp.html) essentially as described by Li et al. .
Pigment measurements and microscopic observation
Chlorophylls and carotenoids were extracted by 80% acetone and analyzed with a UV-120 system (Shimadzu, Japan) as described by Şükran et al. . The microscopic structure of the friable dried material was photographed using an inverted biological microscope (BM-37XB). When a few drops of water were added, the materials exhibited a strong water absorbency, which was observed by microscope, and photographed after a few seconds, in the same way.
Results and Discussion
Illumina sequencing and read assembly
In total, 55,007,830 Illumina PE raw reads were obtained. After removing reads with adapters and unknown or low quality bases, approximately 52.39 million clean reads with an average length of 90 bp were obtained (97.95% Q20 bases and 39.88% GC content). The complete assembly yielded 292,122 contigs with a mean length of 250 bp. The raw assembly contigs were clustered into a unigene dataset, which resulted in 92,414 unigenes that ranged from 200 bp to 9198 bp with a mean length of 619 bp and an N50 length of 986 bp (Table 1). The length distribution of the contigs and unigenes are illustrated in Figure 2.
Functional annotation and classification
For assignments of predicted genes, 41,328 assembled unigenes were obtained after annotations with the NR, NT, Swissprot, KEGG, COG and GO databases, using the BLAST algorithm, specifying E values of less than 10−5. The results indicated that 37,869 (91.63%) and 9,674 (23.41%) non-redundant unigenes showed identity with sequences in NCBI NR and NT databases, respectively. Of the 23,274 annotated unigenes, 56.32% had similarity to proteins in the Swiss-Prot database (Table 2). E-value and similarity distributions of the top hits in the NR database analysis revealed that 18.36% (6,950) and 5.54% (2,100) showed significant homology (E-value <1E-45) or high similarity (greater than 80%), respectively (Figure 3 A, B). About 43.66% of annotated unigenes could be assigned with a best score to the sequences from the top seven species, i.e., Coccomyxa subellipsoidea C-169 (9.48%), Volvox carteri f. nagariensis (8.56%), Physcomitrella patens subsp. patens (6.59%), Hordeum vulgare subsp. vulgare (5.03%), Chlorella variabilis (4.83%), Chlamydomonas reinhardtii (4.73%) and Selaginella moellendorffii (4.44%) (Figure 3 C).
(A) E-value distribution of BLAST hits for each unigene with a cutoff E-value of 1.0E-5. (B) Similarity distribution of the top BLAST hit for each unigene. (C) Species distribution of the top BLAST hit for each unigene in the NR database.
Functional annotation and pathway assignment
Based on NR annotation, 53.28% of unigenes (22,018) were assigned to gene ontology classes with 54 subcategories and 161,451 functional terms (Figure 4, Table S1). Biological process (68,837; 42.64%) and cellular component (68,184; 42.23%) comprised the majority of the functional terms. Additionally, 24,430 unigenes (15.13%) were classified into molecular function categories. Within the biological process category, cellular process (19.74%) was the most dominant group, followed by metabolic process (18.71%) and response to stimulus (8.96%). In the category of cellular component, cell and cell part were the most highly represented groups with the same percentage of 23.33%. Regarding molecular function, most of the corresponding genes were assigned to catalytic activity (44.03%) and binding (39.55%).
The results are summarized in three main categories: biological process, cellular component and molecular function. In total, 22,018 unigenes with BLAST matches to known proteins were assigned to this gene ontology.
Clusters of Orthologous Groups of proteins (COGs) were delineated by comparing protein sequences encoded in complete genomes, representing major phylogenetic lineages. The COG database was used to define the orthologous functions of unigenes. In total, 22,921 (55.46%) of the non-redundant unigenes (Table 2) were subdivided into 25 COG classifications. The top categories among the COG terms were ‘General function prediction only’ (5,652; 14.38%) and ‘Translation, ribosomal structure and biogenesis’ (5,180; 13.18%). Whereas ‘Extracellular structures’ (37; 0.09%) and ‘Nuclear structure’ (22; 0.06%) were poorly characterized (Figure 5, Table S2).
In total, 22,921 of the 41,328 sequences with an NR hit were grouped into 25 COG classifications.
The Kyoto Encyclopedia of Genes and Genomes (KEGG) database  was used to identify the biological pathways in T. jolithus. A total of 26,217 (63.44%) of the assembled unigenes were mapped to 128 KEGG pathways (Table S3). Highly represented pathways included metabolic pathways (6357 members), biosynthesis of secondary metabolites (3058 members), ribosome (2571 members) and spliceosome (1839 members). Additionally, terpenoid backbone biosynthesis (82 members), flavonoid biosynthesis (79 members), and fatty acid biosynthesis (106 members) were identified as potential pathways that need follow-up studies for confirmation.
EST-SSR Discovery, Distribution and Frequencies
Simple Sequence Repeats (SSRs), also known as microsatellites, have been developed as SSR markers for genomic mapping, DNA fingerprinting, and marker-assisted selection in many species , . They are tandemly repeated sequences comprised of 1–6 base pairs of DNA, with a conserved flanking sequence . In our study, all of the 92,414 unigenes generated were used to mine potential microsatellites using MISA software. 6,187 SSR primer pairs were designed from these loci and a total of 5,798 SSRs were identified in 5,206 unigenes. Of all the SSR containing unigenes, 527 sequences contained more than one SSR and 256 SSRs were present in compound form. Frequencies for each array type according to repeat motifs are illustrated in Figure 6, the most abundant being dimers (43.83%), followed by trimers (33.94%), monomers (12.90%), quadramers (4.19%), hexamers (3.64%) and pentamers (1.50%). SSRs with six tandem repeats (22.47%) were the most common, followed by five tandem repeats (21.70%), seven tandem repeats (15.04%), and nine tandem repeats (7.49%). A summary of the number of repeat units is available in Table S4. These SSR results provide useful new molecular markers for any future genetic linkage analyses in T. jolithus.
Single-nucleotide polymorphisms (SNP) were identified as heterozygous sites on the transcripts, using SOAPsnp software . A total of 131,478 putative SNPs were predicted with default parameter values as shown in Table 3 and Table S5. Within the detected SNPs, transitions (66.42%) were much more common than transversions (33.58%). The number of A-G transitions (43,348) was similar to that for C-T (43,979). Among the transversions, A-T (47.91%) dominated, followed by A-C (20.03%), G-T (19.36%) and C-G (12.69%). Considering their abundance and variety, it would be possible to use these SNP-based markers to generate dense genetic maps and, in the future, to perform marker assisted selection (MAS) as described by Barbazuk et al. . This SNP database also offers rich information on the diversity within the species and will be used to study remaining uncertainties about T. jolithus strain distribution. It was worth cautioning at this point that there is a possibility that multiple organisms from the original community contributed to the sequences and thus the genomic diversity reported here will need further confirmation that it is truly derived from the T. jolithus genome.
Pigment content and microscopic structures of T. jolithus
The chlorophyll level in the tested T. jolithus varied from 0.63%–0.70% with an average of 0.67% of dry weight (DW). Carotenoids accounted for 2.2% of DW, which was 3.30 times the level of chlorophyll (Table 4). These high levels of carotenoids would effectively protect the cells from photodamage in the high altitude valleys, where the alga is found. An investigation of the “Red-Stone-Valley” habitat showed that the alga formed an extensive red covering on exposed rocks in winter (Figure 1 A). It was typically distributed on rocks that are located away from the river. The alga is not found on rocks located in or near the river and are periodically submerged. This observation is consistent with the fact that aeroterrestrial phototrophic microorganisms typically form conspicuous biofilms at the interface between any type of solid substratum and the atmosphere (Figure 1 B). Under the microscope, erect filaments were clearly observed in the dried materials (Figure 1 C) of the alga. After absorbing water rapidly, shiny red, individual cells were evident in the filaments, in contrast to the more amorphous appearance of filaments in the dried materials (Figure 1 D).
18S small subunit ribosomal DNA gene
In addition to observing the microscopic structures of dried and rehydrated T. jolithus (Figure 1 C and D), we sequenced the 18S rDNA to avoid taxonomic confusion. PCR amplification and sequencing of Trentepohlia jolithus 18S rDNA were performed using primers designed for both algae and plants. The sequence was deposited with GenBank under the Accession no. KM112092. BLAST analysis indicated that the sequence was homologous to previously identified Trentepohlia. According to the results, we aligned the 18S rDNA sequence with previous reported Trentepohlia jolithus var. yajiagengensis var. nov (JN542473), which was also collected from Sichuan province in China . We found just two differences among 1612 bases and showed 99.88% sequence similarity (Figure S1).
Analysis of genes and pathways
Although there are over 40,000 species of microalgae identified to date, the genomes of fewer than a dozen of these have been sequenced (http://genome.jgi-psf.org). Overall, biological pathways in microalgae are far from fully documented. Because there is extensive genomic sequence data for model organisms such as Chlamydomonas reinhardtii, these organisms are best for carrying out research at the transcriptomics and proteomics level to study responses to variables such as physiological stress at the molecular level –. For less known organisms, transcriptome sequencing and comparison with known organisms can be an efficient approach for obtaining a great deal of functional genomics information, thereby gaining information about additional organisms and reducing the reliance on model organisms . In this connection, a few molecular and biochemical studies have been carried out in the Trentepohliaceae , , but there is still a very limited understanding of the metabolic pathways, and of mechanisms controlling growth. In the present study, we obtained a large number of cDNA fragments that were highly enriched in genes for carbon fixation and metabolic pathways representative of the order Trentepohliales. Among these, we focused on key genes involved in carbon fixation, and in fatty acid and carotenoid biosynthesis. The principal components of these metabolic pathways are described below.
From many studies on primary photosynthetic carbon metabolism, it was believed that the operation of the Calvin–Benson cycle (C3 pathway) was predominant in algae . The initial carbon fixation catalyzed by Rubisco produces two three-carbon molecules of 3-phosphoglyceric acid (3-PGA) through the carboxylation of the five-carbon ribulose-1,5-biphosphate (RuBP) . In addition, microalgae increase the level of CO2 by accumulating HCO3− to overcome Rubisco’s surprisingly poor affinity for CO2 and prevent the diffusion of CO2 out of the cell, while allowing the entry of other nutrients . However, recent metabolic labeling and genome sequencing data suggest that they may also perform C4 photosynthesis. In our study, both C3 and C4 photosynthesis genes were found by transcriptome sequencing. Together these results suggest that C4 photosynthesis might be more wide-spread than previously thought.
In our study, using BLAST against the KEGG database, most of the genes for the key enzymes related to the C3 (139 unigenes) and C4 (157 unigenes) pathways of carbon fixation were actively transcribed (Figure 7, Table S6). Eleven transcripts (including 9 separate unigenes) of ribulose 1,5-bisphosphate carboxylase/oxygenase (Rubisco), which is the central carboxylation enzyme for CO2 fixation during photosynthesis, were obtained from T. jolithus. We also identified most of the genes involved in the Calvin-Benson cycle, such as fructose-bisphosphate aldolase (EC 22.214.171.124), phosphoglycerate kinase (EC 126.96.36.199), triose-phosphate isomerase (EC 188.8.131.52) and so on. These results provide unequivocal molecular evidence for the existence of a C3-pathway in T. jolithus.
The numbers within the small boxes are enzyme codes. The boxes with a red border are enzymes identified in this study. The boxes with a black border are enzymes not identified in this study.
It was interesting that enzymes required for C4 photosynthesis were also identified (Figure 7, Table S6). Among these genes, pyruvate orthophosphate dikinase (PPDK) is an important C4 enzyme that is required for the regeneration of phosphoenolpyruvate (PEP) in mesophyll cells and is regulated at the transcriptional level . Phosphoenolpyruvate carboxykinase (PEPCK) is another key enzyme and used to catalyze the release of CO2 from oxaloacetate (OAA) to produce PEP in the bundle sheath cell cytosol . In addition to this, PPDK and PEPCK are also present in C3/CAM species , . Parsley et al. had proposed that in cotyledons PPDK may be important in supplying PEP for gluconeogenesis, and in ageing leaves it allows remobilization of nitrogen to supply reproductive tissue among C3 plants . Its activity is light/dark modulated by reversible phosphorylation in both C3 and C4 plants . PEPCK also supplies CO2 for fixation by Rubisco during the light period in some CAM species . In the present study, by transcriptome sequencing and gene annotation, 10 PPDK and 16 PEPCK transcripts were identified. The function of these two genes warrants further investigation. Transcriptome information about C4-related enzyme variations among various algae is considerable. In the Bacillariophyta, Phaeodactylum tricornutum  and Thalassiosira pseudonana  have been shown to have developed a C4-like photosynthesis pathway in addition to the carbon-concentrating mechanism (CCM). In the Rhodophyta, almost all of the key enzymes in the C4 carbon-fixation pathway have been detected; however, PEPC in Pyropia (Porphyra) haitanensis sporophytes and pyruvate phosphate dikinase (EC 184.108.40.206) in Pyropia (Porphyra) yezoensis were not detected among the ESTs or transcriptome respectively. In the Chlorophyta, Ulva linza and Ulva prolifera have been shown to contain the C4 pathway . Myrmecia incisa Reisigl H4301, a coccoid green microalgal species belonging to the Trebouxiophyceae , Ostreococcus tauri, the smallest free-living eukaryote  and Micromonas sp., a marine picoeukaryote  also possibly possesses a C4-like photosynthesis pathway. Despite all this, a short-term metabolic 14C labeling experiment carried out on Thalassiosira weissflogii and Thalassiosira pseudonana by Roberts et al. suggested that intermediate compounds during the carbon assimilation of photosynthetic pathways were diverse . Hence, our transcriptome data and literature on the related organisms – suggest that T. jolithus might possess a complex carbon acquisition and fixation system without loss of CCM. This effective mechanism could help T. jolithus cope with high light intensities, low temperatures and arid conditions , . It also emphasizes the requirement for metabolic and functional genetic analyses before accepting the presence of C4-metabolic enzymes as evidence for C4 photosynthesis.
Fatty acid biosynthesis
Fatty acids, which are the building blocks for the formation of various types of lipids, are a potential biofuel feedstock . The basic pathway for fatty acid biosynthesis in microalgae is found primarily in the chloroplast, and is generally believed to be analogous to the pathway found in higher plants . Based on the functional annotation of the transcriptome, we have successfully identified the genes encoding key enzymes involved in the biosynthesis and catabolism of fatty acids in T. jolithus (Table S8). Fatty acid biosynthesis of this species starts with the conversion of acetyl CoA to malonyl CoA, catalyzed by acetyl CoA carboxylase (ACCase, EC: 220.127.116.11), which is consistent with Dunaliella tertiolecta and Eustigmatos cf. polyphem. Then, malonyl-CoA, the central carbon donor for fatty acid synthesis, is transferred to malonyl-acyl carrier protein (ACP) catalyzed by long-chain acyl-CoA synthetase (FabD, EC: 18.104.22.168). The number of transcripts involved in all elongation reactions of this pathway were 4, 66, 1 and 2 for 3-Ketoacyl ACP synthase II (FabF, EC 22.214.171.124), 3-oxoacyl-ACP reductase (FabG, EC 126.96.36.199), 3-Hydroxy acyl-CoA dehydratase (FabZ, EC 188.8.131.52) and enoyl-ACP reductase I (FabI, EC 184.108.40.206 220.127.116.11), respectively. Hexadecenoic and Octadecanoic acid were finally synthesized by fatty acyl-ACP thioesterase A (FATA, EC 18.104.22.168 3.1.2.−) and oleoyl-ACP hydrolase (OAT, EC 22.214.171.124) catalysis. These C16 and C18 trienoic fatty acids could be used as the precursors for the synthesis of cellular membranes, long-chain polyunsaturated fatty acids (LC-PUFAs) and storage neutral lipids (mainly TAGs) . The PUFAs may enhance the fluidity of the phospholipid membrane, which makes it possible for T. jolithus to withstand chilling or cold stress in the alpine environment .
Carotenoids play an important role as light-harvesting pigments and protect the photosynthetic apparatus from photooxidative damage under excess light conditions . In 2011, Takaichi summarized 28 different structures of carotenoids from the algal species he studied . Previously, carotenogenesis pathways and their enzymes in oxygenic phototrophs had been investigated in cyanobacteria  and land plants . Microalgae had common pathways with land plants and also additional microalgal-specific pathways and carotenoids . Many carotenogenesis enzymes and genes such as CrtB, CrtP, CrtL-b, CrtR-b, ZEP, VDE, and CrtW were reported in the Chlorophyceae, including Chlorella, Chlamydomonas, Dunaliella and Haematococcus . Nevertheless, only a little was learned about the carotenogenesis pathways of algae except for some proposed chemical structures . The kinds and amounts of various carotenoids in Trentepohlia have been studied. High light intensity has been shown to be an important factor for increasing carotenoid levels in Trentepohlia aurea and Trentepohlia odorata by Abe et al.  and Tan et al. , respectively. Mukherjee et al. also found that carotenoid content increased several fold seasonally, possibly because the reduced local temperatures (∼25°C) and cloudless clear skies in winter provide ideal conditions favoring growth and carotenogenesis in two tropical species, T. aurea and T. cucullata . In this study, the high levels of carotenoids convincingly demonstrate the existence of a carotenoid biosynthesis pathway in T. jolithus. Sequence information and pathway analysis will facilitate molecular functional studies and pave the path for a better understanding of carotenogenesis pathways in this aerial microalga.
Based on the functional annotation of the transcriptome, we have successfully identified 99 unigenes encoding key enzymes involved in carotenogenesis in T. jolithus (Figure 8, Table S7). The mevalonate (MVA) pathway beginning with Acetyl-CoA in plant cytoplasm, and the MEP/DOXP pathway, beginning with pyruvate in the plastids of algae, are two known independent pathways for IPP synthesis . There were no identified unique sequences encoding mevalonate kinase (MVK, EC 126.96.36.199), which is required for the former pathway. MVK, phosphomevalonate kinase (PMVK, EC 188.8.131.52) and mevalonate pyrophosphate decarboxylase (MPD, EC 184.108.40.206) were also absent in Myrmecia incisa Reisigl H4301, a coccoid green microalgal species in the Trebouxiophyceae . The result of annotated unigenes was consistent with the lack of an MVA pathway in the Chlorophyceae . In contrast, all the genes encoding enzymes involved in the MEP/DOXP pathway have been identified. After one IPP was added to farnesyl pyrophosphate by two transcripts of geranylgeranyl pyrophosphate synthase, type III (GGPS, EC 220.127.116.11), geranylgeranyl pyrophosphate (GGPP) was produced and acted as the substrate for phytoene synthase (PSY, EC 18.104.22.168), encoded by another unigene, which is the same as that in E. cf. polyphem . Following the formation of phytoene (C40), four steps were needed in the conversion from phytoene to lycopene by the sequential catalysis of phytoene desaturase (PDS, EC 22.214.171.124), ζ-carotene isomerase (ZISO, EC 126.96.36.199), ζ-carotene desaturase (ZDS, EC 188.8.131.52) and carotenoid isomerase (CrtISO, EC 184.108.40.206) . In the current study on T. jolithus, the number of transcripts in the transcriptome library coding for the enzymes involved in these four steps was one for PSY, two for PDS, one for ZISO and four for CrtISO. Unlike E. cf. polyphem and M. incisa, ZISO involved a single isomerization instead of two desaturation steps worked by ZDS. Therefore, we could properly deduce that ZISO is a unique gene present in Trentepohlia. Subsequently, lycopene is an important intermediate in the biosynthesis and could be cyclized into either β-carotene through γ-carotene, or α-carotene through δ-carotene. In this study, one transcript coding for lycopene β-cyclase (CrtL-b, EC 220.127.116.11) and another for lycopene ε-cyclase (CrtL-e, EC 18.104.22.168) sufficiently confirmed the presence of β-carotene and α-carotene, which is similar to M. incisa  and Prochlorococcus marinus MED4 . In addition, CruA, which belongs to a new family of functional lycopene cyclases, has been found in this study and cyclizes lycopene into β-carotene through γ-carotene, the same as CrtL-b. By comparison, T. jolithus is the first Chlorophyte species that has been shown to use CruA, which suggests it could be used as a chemotaxonomic marker. Next, lutein biosynthesis in T. jolithus starts with the conversion of α-carotene to lutein by LUT5, CrtR-b and LUT1. On the other hand, β-carotene is hydroxylated by β-carotene 3-hydroxylase (CrtZ, EC: 22.214.171.124) and LUT5 to form zeaxanthin through β-cryptoxanthin. The transformation between zeaxanthin and violaxanthin is carried out by zeaxanthin epoxidase (ZEP, EC: 126.96.36.199) and violaxanthin deepoxidase (VDE, EC: 188.8.131.52) respectively, under low and high light conditions with antheraxanthin as the intermediate. All the genes encoding enzymes including 9-cis-epoxycarotenoid dioxygenase (NCED, EC 184.108.40.206), abscisic-aldehyde oxidase (AAO3, EC 220.127.116.11) and xanthoxin dehydrogenase (ABA2, EC 18.104.22.1688) involved in the ABA biosynthesis were detected, suggesting that ABA biosynthesis pathways in this alga are the same as E. cf. polyphem . Carotene β-ketolase (CrtO) was not detected in our study and if it is truly missing from the genome, its lack would hinder or preclude the synthesis of astaxanthin by this species.
Abbreviations of chemical compounds are as follows: GAP, D-glyceraldehyde-3-phosphate; PYR, pyruvate; DXP, 1-Deoxy-D-xylulose 5-phosphate; MEP, 2-C-Methyl-D-erythritol 4-phosphate; CDP-ME, 4-(Cytidine 5′-diphospho)-2-C-methyl-D-erythritol; CDP-ME2P, 2-Phospho-4-(Cytidine 5′-diphospho)-2-C-methyl-D-erythritol; MECP, 2C-methyl-D-erythritol-2,4-cyclodiphosphate; HMBPP, 1-Hydroxy-2-methyl-2-butenyl 4-diphosphate; DMAPP, Dimethylallyl-PP; IPP, Isopentenyl-PP; GPP, Geranyl-PP; FPP, (E,E)-Farnesyl-PP; GGPP, Geranylgeranyl-PP; PPFPP, prephytoene diphosphate; PE, phytoene; PF, 15,9′-dicis-Phytofluene; TζCar, 9,15,9′-tricis-ζ-Carotene; DζCar, 9,9′-Di-cis-ζ-Carotene; LYC, Lycopene; β-CRY, β-Cryptoxanthin; ZEA, Zeaxanthin; ANT, Antheraxanthin; VIO, Violaxanthin; 9-cis-VIO, 9-cis-Violaxanthin; XAN, Xanthoxin; ABAL, Abscisic aldehyde; ABA, Abscisate; α-CRY, α-Cryptoxanthin; ZEI, Zeinoxanthin; LUT, Lutein; α-Car, α-carotene; β-Car, β-carotene; γ-Car, γ-carotene; δ-Car, δ-carotene; ε-Car, ε-carotene. Abbreviations of enzymes are as follows: DXS, 1-deoxy-D-xylulose-5-phosphate synthase; DXR, 1-deoxy-D-xylulose-5-phosphate reductoisomerase; ISPD, 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; CMK, 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; MCS, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase; HDS, (E)-4-hydroxy-3-methylbut-2-enyl-diphosphate synthase; HDR, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; IDI, isopentenyl-diphosphate delta-isomerase; FDPS, farnesyl diphosphate synthase; GGPS, geranylgeranyl diphosphate synthase, type III; PSY, phytoene synthase; PDS, 15-cis-phytoene desaturase; ZISO, zeta-carotene isomerase; CrtISO, prolycopene isomerase; CrtR-b, lycopene beta-cyclase; CruA, lycopene cyclase CruA; CrtL-b, lycopene beta-cyclase; CrtL-e, lycopene epsilon-cyclase; LUT5, cytochrome P450, family 97, subfamily A (beta-ring hydroxylase); CrtZ, beta-carotene 3-hydroxylase; LUT1, carotene epsilon-monooxygenase; ZEP, zeaxanthin epoxidase; VDE, violaxanthin de-epoxidase; NCED, 9-cis-epoxycarotenoid dioxygenase; ABA2, xanthoxin dehydrogenase; AAO3, abscisic-aldehyde oxidase.
In this study, which presents the first transcriptomic study within the Trentepohliales, 41,328 assembled unigenes was obtained. The functional annotation and classification of these unigenes has given us a better understanding of the T. jolithus genome. After analyzing typical KEGG pathways, we discovered similarities and differences between T. jolithus and other algae. Our results help us explain why this aerial microalgae can survive and spread in such a cold, high altitude habitat. These results also pave the way for more detailed investigations of the mechanisms underlying the growth and metabolism of members of the Trentepohliaceae. The adaption mechanisms of T. jolithus to desiccation and a cold environment need further investigation. Molecular genetic manipulation of this organism might be an effective way to enhance the properties of this microalga to make it suitable for commercial development.
Alignment of 18S ribosomal DNA sequence of T. jolithus (KM112092) with Trentepohlia jolithus var. yajiagengensis var. nov (JN542473). Identical bases were shaded in light grey, and different bases were shaded in black.
List of T. jolithus unigenes in each GO category.
List of T. jolithus unigenes in each COG category.
128 KEGG pathways with pathway ID and KO information.
Summary of the number of repeat units identified from the T. jolithus unigene dataset.
Single-nucleotide polymorphisms (SNP) detected using SOAPsnp after unigene sequence assembly.
Enzymes encoded in the T. jolithus transcriptome involved in carbon fixation in photosynthetic organisms.
Enzymes involved in carotenoid biosynthesis of T. jolithus transcriptome.
We thank Yi Yuan for his exceptional help with the collection of Trentepohlia jolithus samples. We thank Dr. John van der Meer (Pan-American Marine Biotechnology Association) for his assistance with proofreading.
Conceived and designed the experiments: QQL JGL LTZ. Performed the experiments: QQL JGL LTZ QL. Analyzed the data: QQL LTZ. Contributed reagents/materials/analysis tools: QQL JGL. Wrote the paper: QQL.
- 1. Rindi F, Lam DW, López-Bautista JM (2009) Phylogenetic relationships and species circumscription in Trentepohlia and Printzina (Trentepohliales, Chlorophyta). Molecular Phylogenetics and Evolution 52: 329–339.
- 2. Thompson RH, Wujek DE (1997) Trentepohliales: Cephaleuros, Phycopeltis, and Stomatochroon: morphology, taxonomy and ecology. Science Publishers, Enfield, New Hampshire.
- 3. López-Bautista JM, Waters DA, Chapman RL (2002) The Trentepohliales revisited. Constancea 83. Available: http://ucjepsberkeleyedu/constancea/83/lopezetal/trentepohlialeshtml.
- 4. Rindi F, López-Bautista JM (2007) New and interesting records of Trentepohlia (Trentepohliales, Chlorophyta) from French Guiana, including the description of two new species. Phycologia 46: 698–708.
- 5. Gasulla F, Herrero J, Esteban-Carrasco A, Ros-Barceló A, Barreno E, et al. (2012) Photosynthesis in Lichen: Light Reactions and Protective Mechanisms, Advances in Photosynthesis - Fundamental Aspects, Dr Mohammad Najafpour (Ed.), ISBN: 978-953-307-928-8, InTech.
- 6. Chapman RL, Henk MC (1983) Ultrastructure of Cephaleuros virescens (Chroolepidaceae, Chlorophyta). IV. Absolute cnfiguration analysis of the cruciate flagellar apparatus and multilayered structure in the pre-release and post-release gametes. American Journal of Botany 70: 1340–1355.
- 7. Nakano T, Ihda T (1996) The identity of photobionts from the lichen Pyrenula japonica. Lichenologist 28: 437–442.
- 8. Printz H (1939) Vorarbeiten zu einer monographie der Trentepohliaceen. Nytt Magasin for Naturvidenskapene 80: 137–210.
- 9. López-Bautista JM, Chapman RL (2003) Phylogenetic affinities of the Trentepohliales inferred from small-subunit rDNA. International Journal of Systematic and Evolutionary Microbiology 53: 2099–2106.
- 10. Rindi F, Sherwood AR, Guiry MD (2005) Taxonomy and distribution of Trentepohlia and Printzina (Trentepohliales, Chlorophyta) in the Hawaiian Islands. Phycologia 44: 270–284.
- 11. López-Bautista JM, Rindi F, Guiry MD (2006) Molecular systematics of the subaerial green algal order Trentepohliales: an assessment based on morphological and molecular data. Int J Syst Evol Microbiol 56: 1709–1715.
- 12. Kapraun DF (2007) Nuclear DNA content estimates in green algal lineages: chlorophyta and streptophyta. Annals of Botany 99: 677–701.
- 13. Gupta S, Agrawal SC (2004) Vegetative survival and reproduction under submerged and air-exposed conditions and vegetative survival as affected by salts, pesticides, and metals in aerial green alga Trentepohlia aurea. Folia Microbiol (Praha) 49: 37–40.
- 14. Rindi F, Guiry MD (2002) Diversity, life history, and ecology of Trentepohlia and Printzina (Trentepohliales, Chlorophyta) in urban habitats in Western Ireland. Journal of Phycology 38: 39–54.
- 15. Häubner N, Schumann R, Karsten U (2006) Aeroterrestrial microalgae growing in biofilms on facades-response to temperature and water stress. Microbial Ecology 51: 285–293.
- 16. Abe K, Takahashi E, Hirano M (2007) Development of laboratory-scale photobioreactor for water purification by use of a biofilter composed of the aerial microalga Trentepohlia aurea (Chlorophyta). Journal of Applied Phycology 20: 283–288.
- 17. Gray DW, Lewis LA, Cardon ZG (2007) Photosynthetic recovery following desiccation of desert green algae (Chlorophyta) and their aquatic relatives. Plant, Cell & Environment 30: 1240–1255.
- 18. Abe K, Imamaki A, Hirano M (2002) Removal of nitrate, nitrite, ammonium and phosphate ions from water by the aerial microalga Trentepohlia aurea. Journal of Applied Phycology 14: 129–134.
- 19. Chapman RL (1984) An assessment of the current state of our knowledge of the Trentepohliaceae. In: Systematics of the green algae (Ed. by D.E.G. Irvine & D.M. John), 233–250. Academic Press, London.
- 20. Ettl H, Gärtner G (1995) Syllabus der Boden. Luft-und Flechtenalgen–721 pp, Gustav Fischer, Jena.
- 21. Gaylarde CC, Morton LG (1999) Deteriogenic biofilms on buildings and their control: a review. Biofouling 14: 59–74.
- 22. Hoffmann L (1989) Algae of Terrestrial Habitats. Botanical Review 55: 77–105.
- 23. Karsten U, Schumann R, Mostaert A (2007) Aeroterrestrial algae growing on man-made surfaces. Algae and cyanobacteria in extreme environments: Springer. PP. 583–597.
- 24. Lange O, Bilger W, Rimke S, Schreiber U (1989) Chlorophyll fluorescence of lichens containing green and blue-green algae during hydration by water vapor uptake and by addition of liquid water. Botanica Acta 102: 306–313.
- 25. Wright R, Alewell C, Cullen J, Evans C, Marchetto A, et al. (2001) Trends in nitrogen deposition and leaching in acid-sensitive streams in Europe. Hydrology and Earth System Sciences Discussions 5: 299–310.
- 26. Fletcher RL, Callow ME (1992) The settlement, attachment and establishment of marine algal spores. British Phycological Journal 27: 303–329.
- 27. Patel S, Majumder A, Goyal A (2012) Potentials of exopolysaccharides from lactic Acid bacteria. Indian J Microbiol 52: 3–12.
- 28. Blanc G, Agarkova I, Grimwood J, Kuo A, Brueggeman A, et al. (2012) The genome of the polar eukaryotic microalga Coccomyxa subellipsoidea reveals traits of cold adaptation. Genome Biol 13: R39.29.
- 29. Liu G, Zhang Q, Zhu H, Hu Z (2012) Massive Trentepohlia-bloom in a glacier valley of Mt. Gongga, China, and a new variety of Trentepohlia (Chlorophyta). PLoS One 7: e37725.
- 30. Gildemeister E (1916) The Volatile Oils Vol 2. John Wiley and Sons: New York.
- 31. Aptroot A, van Herk CM (2007) Further evidence of the effects of global warming on lichens, particularly those with Trentepohlia phycobionts. Environmental Pollution 146: 293–298.
- 32. Wang Z, Gerstein M, Snyder M (2009) RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10: 57–63.
- 33. Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y (2008) RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res 18: 1509–1517.
- 34. Zheng MG, Tian JH, Yang GP, Zheng L, Chen GG, et al. (2013) Transcriptome sequencing, annotation and expression analysis of Nannochloropsis sp at different growth phases. Gene 523: 117–121.
- 35. Stadhouders R, Kolovos P, Brouwer R, Zuin J, van den Heuvel A, et al. (2013) Multiplexed chromosome conformation capture sequencing for rapid genome-scale high-resolution detection of long-range chromatin interactions. Nature Protocols 8: 509–524.
- 36. Wan LL, Han J, Sang M, Li AF, Wu H, et al. (2012) De Novo Transcriptomic Analysis of an Oleaginous Microalga: Pathway Description and Gene Discovery for Production of Next-Generation Biofuels. Plos One 7.
- 37. Rismani-Yazdi H, Haznedaroglu BZ, Hsin C, Peccia J (2012) Transcriptomic analysis of the oleaginous microalga Neochloris oleoabundans reveals metabolic insights into triacylglyceride accumulation. Biotechnology for Biofuels 5.
- 38. Rismani-Yazdi H, Haznedaroglu BZ, Bibby K, Peccia J (2011) Transcriptome sequencing and annotation of the microalgae Dunaliella tertiolecta: Pathway description and gene discovery for production of next-generation biofuels. BMC Genomics 12.
- 39. Guarnieri MT, Nag A, Smolinski SL, Darzins A, Seibert M, et al. (2011) Examination of Triacylglycerol Biosynthetic Pathways via De Novo Transcriptomic and Proteomic Analyses in an Unsequenced Microalga. Plos One 6.
- 40. Bochenek M, Etherington GJ, Koprivova A, Mugford ST, Bell TG, et al. (2013) Transcriptome analysis of the sulfate deficiency response in the marine microalga Emiliania huxleyi. New Phytologist 199: 650–662.
- 41. Yang S, Guarnieri MT, Smolinski S, Ghirardi M, Pienkos PT (2013) De novo transcriptomic analysis of hydrogen production in the green alga Chlamydomonas moewusii through RNA-Seq. Biotechnol Biofuels 6: 118.
- 42. von Dassow P, Ogata H, Probert I, Wincker P, Da Silva C, et al. (2009) Transcriptome analysis of functional differentiation between haploid and diploid cells of Emiliania huxleyi, a globally significant photosynthetic calcifying cell. Genome Biology 10.
- 43. Lu X, Cheng G (2009) Climate change effects on soil carbon dynamics and greenhouse gas emissions in Abies fabri forest of subalpine, southwest China. Soil Biology and Biochemistry 41: 1015–1021.44.
- 44. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature Biotechnology 29: 644–U130.
- 45. Iseli C, Jongeneel CV, Bucher P (1999) ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proc Int Conf Intell Syst Mol Biol: 138–148.
- 46. Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, et al. (2005) Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21: 3674–3676.
- 47. Ye J, Fang L, Zheng H, Zhang Y, Chen J, et al. (2006) WEGO: a web tool for plotting GO annotations. Nucleic Acids Res 34: W293–297.
- 48. Li R, Li Y, Fang X, Yang H, Wang J, et al. (2009) SNP detection for massively parallel whole-genome resequencing. Genome Res 19: 1124–1132.
- 49. Şükran DTG, Rıdvan S (1998) Spectrophotometric determination of chlorophyll - A, B and total carotenoid contents of some algae species using different solvents. Tr J of Botany 22: 13–17.
- 50. Kanehisa M, Goto S (2000) KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28: 27–30.
- 51. Powell W, Machray GC, Provan J (1996) Polymorphism revealed by simple sequence repeats. Trends in Plant Science 1: 215–222.
- 52. Varshney RK, Graner A, Sorrells ME (2005) Genic microsatellite markers in plants: features and applications. Trends Biotechnol 23: 48–55.
- 53. Chambers GK, MacAvoy ES (2000) Microsatellites: consensus and controversy. Comp Biochem Physiol B Biochem Mol Biol 126: 455–476.
- 54. Barbazuk WB, Emrich SJ, Chen HD, Li L, Schnable PS (2007) SNP discovery via 454 transcriptome sequencing. Plant J 51: 910–918.
- 55. Lv H, Qu G, Qi X, Lu L, Tian C, et al. (2013) Transcriptome analysis of Chlamydomonas reinhardtii during the process of lipid accumulation. Genomics 101: 229–237.
- 56. Gonzalez-Ballester D, Casero D, Cokus S, Pellegrini M, Merchant SS, et al. (2010) RNA-seq analysis of sulfur-deprived Chlamydomonas cells reveals aspects of acclimation critical for cell survival. Plant Cell 22: 2058–2084.
- 57. Miller R, Wu G, Deshpande RR, Vieler A, Gartner K, et al. (2010) Changes in transcript abundance in Chlamydomonas reinhardtii following nitrogen deprivation predict diversion of metabolism. Plant Physiol 154: 1737–1752.
- 58. Parchman TL, Geist KS, Grahnen JA, Benkman CW, Buerkle CA (2010) Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics 11: 180.
- 59. Tsuji Y, Suzuki I, Shiraiwa Y (2009) Photosynthetic carbon assimilation in the coccolithophorid Emiliania huxleyi (Haptophyta): Evidence for the predominant operation of the C3 cycle and the contribution of β-carboxylases to the active anaplerotic reaction. Plant Cell Physiol 50: 318–329.
- 60. Wang L, Peterson RB, Brutnell TP (2011) Regulatory mechanisms underlying C4 photosynthesis. New Phytol 190: 9–20.
- 61. Moroney JV, Somanchi A (1999) How do algae concentrate CO2 to increase the efficiency of photosynthetic carbon fixation? Plant Physiology 119: 9–16.
- 62. Hibberd JM, Covshoff S (2010) The regulation of gene expression required for C4 photosynthesis. Annu Rev Plant Biol 61: 181–207.
- 63. Parsley K, Hibberd JM (2006) The Arabidopsis PPDK gene is transcribed from two promoters to produce differentially expressed transcripts responsible for cytosolic and plastidic proteins. Plant Molecular Biology 62: 339–349.
- 64. Aragon C, Pascual P, Gonzalez J, Escalona M, Carvalho L, et al. (2013) The physiology of ex vitro pineapple (Ananas comosus L. Merr. var MD-2) as CAM or C3 is regulated by the environmental conditions: proteomic and transcriptomic profiles. Plant Cell Reports 32: 1807–1818.
- 65. Chastain CJ, Fries JP, Vogel JA, Randklev CL, Vossen AP, et al. (2002) Pyruvate, orthophosphate dikinase in leaves and chloroplasts of C-3 plants undergoes light−/dark-induced reversible phosphorylation. Plant Physiology 128: 1368–1378.
- 66. Weise SE, van Wijk KJ, Sharkey TD (2011) The role of transitory starch in C-3, CAM, and C-4 metabolism and opportunities for engineering leaf starch accumulation. Journal of Experimental Botany 62: 3109–3118.
- 67. Kroth PG, Chiovitti A, Gruber A, Martin-Jezequel V, Mock T, et al. (2008) A model for carbohydrate metabolism in the diatom Phaeodactylum tricornutum deduced from comparative whole genome analysis. Plos One 3.
- 68. Armbrust EV, Berges JA, Bowler C, Green BR, Martinez D, et al. (2004) The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism. Science 306: 79–86.
- 69. Xu JF, Fan X, Zhang XW, Xu D, Mou SL, et al. (2012) Evidence of coexistence of C3 and C4 photosynthetic pathways in a Green-Tide-Forming Alga, Ulva prolifera. Plos One 7.
- 70. Ouyang LL, Chen SH, Li Y, Zhou ZG (2013) Transcriptome analysis reveals unique C4-like photosynthesis and oil body formation in an arachidonic acid-rich microalga Myrmecia incisa Reisigl H4301. BMC Genomics 14.
- 71. Derelle E, Ferraz C, Rombauts S, Rouze P, Worden AZ, et al. (2006) Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features. Proceedings of the National Academy of Sciences of the United States of America 103: 11647–11652.
- 72. Worden AZ, Lee JH, Mock T, Rouze P, Simmons MP, et al. (2009) Green evolution and dynamic adaptations revealed by genomes of the marine picoeukaryotes Micromonas. Science 324: 268–272.
- 73. Roberts K, Granum E, Leegood RC, Raven JA (2007) C3 and C4 pathways of photosynthetic carbon assimilation in marine diatoms are under genetic, not environmental, control. Plant Physiol 145: 230–235.
- 74. Zhang XW, Ye NH, Liang CW, Mou SL, Fan X, et al. (2012) De novo sequencing and analysis of the Ulva linza transcriptome to discover putative mechanisms associated with its successful colonization of coastal ecosystems. BMC Genomics 13.
- 75. Winck FV, Paez Melo DO, Gonzalez Barrios AF (2013) Carbon acquisition and accumulation in microalgae Chlamydomonas: insights from “omics” approaches. J Proteomics.
- 76. Thompson GA Jr (1996) Lipids and membrane function in green algae. Biochim Biophys Acta 1302: 17–45.
- 77. Murata N, Sato N, Takahashi N, Hamazaki Y (1982) Composition and positional distribution of fatty acid in phospholipids from leaves of chilling sensitive and chilling-resistant plants. Plant Cell Physiol 23(6): 1071–1079.
- 78. Bartley GE, Scolnik PA (1995) Plant carotenoids: pigments for photoprotection, visual attraction, and human health. Plant Cell 7: 1027–1038.
- 79. Takaichi S (2011) Carotenoids in algae: distributions, biosyntheses and functions. Mar Drugs 9: 1101–1118.
- 80. Takaichi S, Mochimaru M (2007) Carotenoids and carotenogenesis in cyanobacteria: unique ketocarotenoids and carotenoid glycosides. Cell Mol Life Sci 64: 2607–2619.
- 81. Britton G (1998) Overview of carotenoid biosynthesis. In carotenoids: biosynthesis and metabolism; Britton, G, Liaaen-Jensen, S, Pfander, H, Eds; Birkhä user: Basel, Switzerland 3: 13–147.
- 82. Abe K, Mihara H, Hirano M (1998) Characteristics of growth and carotenoid accumulation of the aerial microalga Trentepohlia aurea in liquid culture. Journal of Marine Biotechnology 6: 53–58.
- 83. Tan CK, Lee YK, Ho KK (1993) Effect of light-intensity and ammonium-N on carotenogenesis of Trentepohlia odorata and Dunaliella bardawil. Journal of Applied Phycology 5: 547–549.
- 84. Mukherjee R, Borah SP, Goswami BC (2010) Biochemical characterization of carotenoids in two species of Trentepohlia (Trentepohliales, Chlorophyta). Journal of Applied Phycology 22: 569–571.
- 85. Lichtenthaler HK (1999) The 1-Deoxy-D-Xylulose-5-Phosphate pathway of Isoprenoid biosynthesis in plants. Annu Rev Plant Physiol Plant Mol Biol 50: 47–65.
- 86. Stickforth P, Steiger S, Hess WR, Sandmann G (2003) A novel type of lycopene epsilon-cyclase in the marine cyanobacterium Prochlorococcus marinus MED4. Arch Microbiol 179: 409–415.