Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Transcriptomics and Comparative Analysis of Three Antarctic Notothenioid Fishes

  • Seung Chul Shin,

    Affiliation Korea Polar Research Institute, Yeonsu-gu, Incheon, South Korea

  • Su Jin Kim,

    Affiliation College of Life Sciences and Biotechnology, Korea University, Seongbuk-gu, Seoul, South Korea

  • Jong Kyu Lee,

    Affiliation Korea Polar Research Institute, Yeonsu-gu, Incheon, South Korea

  • Do Hwan Ahn,

    Affiliations Korea Polar Research Institute, Yeonsu-gu, Incheon, South Korea, University of Science & Technology, Yuseong-gu, Daejeon, South Korea

  • Min Gyu Kim,

    Affiliation Korea Polar Research Institute, Yeonsu-gu, Incheon, South Korea

  • Hyoungseok Lee,

    Affiliation Korea Polar Research Institute, Yeonsu-gu, Incheon, South Korea

  • Jungeun Lee,

    Affiliation Korea Polar Research Institute, Yeonsu-gu, Incheon, South Korea

  • Bum-Keun Kim,

    Affiliation Korea Food Research Institute, Bundang-gu, Sungnam, South Korea

  • Hyun Park

    Affiliations Korea Polar Research Institute, Yeonsu-gu, Incheon, South Korea, University of Science & Technology, Yuseong-gu, Daejeon, South Korea

Transcriptomics and Comparative Analysis of Three Antarctic Notothenioid Fishes

  • Seung Chul Shin, 
  • Su Jin Kim, 
  • Jong Kyu Lee, 
  • Do Hwan Ahn, 
  • Min Gyu Kim, 
  • Hyoungseok Lee, 
  • Jungeun Lee, 
  • Bum-Keun Kim, 
  • Hyun Park


For the past 10 to 13 million years, Antarctic notothenioid fish have undergone extraordinary periods of evolution and have adapted to a cold and highly oxygenated Antarctic marine environment. While these species are considered an attractive model with which to study physiology and evolutionary adaptation, they are poorly characterized at the molecular level, and sequence information is lacking. The transcriptomes of the Antarctic fishes Notothenia coriiceps, Chaenocephalus aceratus, and Pleuragramma antarcticum were obtained by 454 FLX Titanium sequencing of a normalized cDNA library. More than 1,900,000 reads were assembled in a total of 71,539 contigs. Overall, 40% of the contigs were annotated based on similarity to known protein or nucleotide sequences, and more than 50% of the predicted transcripts were validated as full-length or putative full-length cDNAs. These three Antarctic fishes shared 663 genes expressed in the brain and 1,557 genes expressed in the liver. In addition, these cold-adapted fish expressed more Ub-conjugated proteins compared to temperate fish; Ub-conjugated proteins are involved in maintaining proteins in their native state in the cold and thermally stable Antarctic environments. Our transcriptome analysis of Antarctic notothenioid fish provides an archive for future studies in molecular mechanisms of fundamental genetic questions, and can be used in evolution studies comparing other fish.


Antarctic fish have undergone extraordinary evolutionary episodes since the onset of widespread glaciation in Antarctica approximately 34 million years ago, when the Southern Ocean cooled to the freezing point of seawater (−1.9°C) [1]. The Antarctic fish fauna are dominated by the perciform suborder Notothenioidei, which represents 77% of the species diversity and 91% of the biomass. There are currently 322 recognized species of Antarctic fishes, and a total of 132 notothenioid species are known [2]. Notothenioids have survived in the subzero waters of the continental shelf and may have experienced a unique type of adaptive radiation known as species flock [3], [4]. Notothenioid fishes possess a wide range of unique adaptations to the extreme Antarctic environment, such as antifreeze glycoproteins, loss of heat shock response [5], and lack of hemoglobin [6], [7]. The Antarctic notothenioid antifreeze glycopeptides are derived from a related pancreatic trypsinogen-like protease [8], [9], and they represent key evolutionary adaptations to life in subzero ice-laden water. Previous research on Antarctic notothenioid fishes has shown that these cold-adapted species lack a common cellular defense mechanism called the heat shock response, which involves the highly conserved and coordinated induction of a family of heat shock proteins in response to elevated temperatures [5], [10], [11]. Other important phenotypic features are the loss of erythrocytes and hemoglobin and the variable patterns of myoglobin expression in muscle tissues of white-blooded channichthyids [12], [13]. The notothenioids have undergone resistant and compensatory adaptations to the extreme Antarctic marine environment as well as regressive evolutionary changes. Thus, they are considered an attractive model species for evolutionary and physiological studies [4], [14].

Chen et al. [11] reported expressed sequence tag (EST) sequencing from Antarctic notothenioid Dissostichus mawsoni tissues and compared them to tissues of temperate/tropical teleost fish. They identified 177 notothenioid protein families that were overexpressed, which suggests that these protein families are upregulated by low temperatures. Further analysis of these upregulated genes indicated substantial expansion by gene duplication of 118 gene families involved in metabolic processes such as protein biosynthesis, folding and degradation, and lipid metabolism. This suggests that gene duplication may function as an adaptive strategy for organisms under freezing conditions. Detrich et al. [15] determined the genomic sizes of 11 notothenioid species including perches, notothes, dragonfish, and icefish, which have variable genome sizes ranging from 0.90 to 1.83 pg, and found that the evolution of phylogenetically derived notothenioid families was accompanied by genome expansion. The icefish (channichthyids), which are considered the most phylogenetically derived group within the notothenioids, have the largest genomes. Evolution in chronic cold and stable temperature conditions have resulted in these species lacking any erythrocytes or hemoglobin genes [6], [16], [17], [18] and in variable patterns of myoglobin expression in muscle tissues in cold, well-oxygenated seawater [12], [13].

Figure 1. Contig distribution of three notothenioid fish transcriptome sequences.

Table 1. Statistics for pyrosequencing of the three notothenioid species.

Although ESTs represent only a subset of the entire eukaryotic genome, their sequencing is helpful for investigating the transcriptome rather than the genome of an organism. It also allows one to focus on the genome sections with high levels of functional information, avoiding introns and intragenic regions that can complicate data analysis [19]. Next-generation sequencing technologies, such as 454 pyrosequencing, offer novel and rapid approaches for genome-wide characterization and profiling of mRNAs, small RNAs, transcription factor regions, chromatin structure, DNA methylation patterns, and metagenomics [20]. Pyrosequencing of ESTs provides an efficient way to generate sequence data for non-model organisms in the form of transcriptome sequencing, and can be used to characterize gene expression and identify novel genes [21], [22], [23]. The availability of complete genome sequences and large sets of ESTs from several fish species have stimulated the development of efficient and informative techniques for large-scale and genome-wide analysis of gene expression and comparative genomics.

Figure 2. Species distribution of three notothenioid fishes based on BLAST hits from nr sequence database.

Table 2. Summary of full-length cDNA in the three notothenioids species.

We herein describe the transcriptomes of three Antarctic notothenioid fishes, Notothenia coriiceps, Chaenocephalus aceratus, and Pleuragramma antarcticum, by 454 FLX Titanium sequencing. These three notothenioid fishes are characterized by distinct biological and ecological traits, although, like other members of this suborder, all of them lack a gas bladder. N. coriiceps (family Nototheniidae) retained an ancestral notothenioid benthic habitus. Instead, P. antarcticum (family Nototheniidae) and C. aceratus (family Channicthyidae) were subjected to trophic evolution towards a pelagic lifestyle, involving an important suite of adaptations [24]. Among them C. aceratus is a benthopelagic species while P. antarcticum is a true pelagic species living all stages of its development within the water column [25]. We herein report the generation of more than 71,539 contigs from these three Antarctic fish species. Forty percent of the contigs (28,724 BLAST hits of the 71,539 contigs) could be annotated based on similarity to known protein or nucleotide sequences. Our work represents an ongoing genome project studying N. coriiceps (NCBI Genome Project ID #66471), and this initial study identified EST sequences expressed in two tissues (liver and brain) of N. coriiceps and in two tissues of C. aceratus and P. antarcticum for comparative analyses. This represents the first report of publicly available pyrosequencing data for Antarctic fish and provides an important comparative resource for studies of physiology and evolutionary adaptation in fish biology.

Table 3. Summary of microsatellite marker identification in the three notothenioid species.

Materials and Methods

Ethics Statement

This study including sample collection and experimental research conducted on these animals was according to the law on activities and environmental protection to Antarctic approved by the Minister of Foreign Affairs and Trade of the Republic of Korea.

Sample collection

N. coriiceps (length 35 cm), C. aceratus (length 32 cm), and P. antarcticum (length 13 cm) were collected in the Antarctic Peninsula (62°14'S, 58°47'W) from December 2009 to January 2010. Benthic nearshore specimens of N. coriiceps and C. aceratus were obtained using the hook-and-line method from depths of 20 to 30 m. Cryopelagic specimens of P. antarcticum were caught in traps. After capture, these fish were maintained in flow-through aquaria at ambient seawater temperatures (−1.5°C) for 48 h before sacrifice. Brain and liver tissues of each specimen were dissected, immediately frozen in liquid nitrogen, and stored at −80°C until use.

Table 4. Functional annotation of proteins encoded in the transcriptomes of the three notothenioid fish based on gene ontology (GO).

cDNA preparation and sequencing

Total RNA was isolated by homogenization of each sample in a TRIzol (Invitrogen, Carlsbad, CA)/chloroform mixture, followed by processing using an RNeasy mini kit (Qiagen, Chatsworth, CA) for DNAse treatment and cleaning. RNA quality and quantity were analyzed using an Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA) and Nanodrop ND1000 (NanoDrop Technologies, Wilmington, DE), respectively. First- and second-strand cDNA were synthesized from 200 ng mRNA using a SuperScript Double-Stranded cDNA Synthesis Kit (Invitrogen) with 100 mM random hexamer primers (Macrogen, Seoul, Korea). Double-stranded cDNA was purified with a QIAquick MinElute PCR purification column (Qiagen). The cDNA library was normalized according to the protocol described in the Trimmer Direct Kit (Evrogen, Moscow, Russia). Briefly, 300 ng cDNA was denatured at 95°C for 5 min and allowed to renature at 68°C for 5 h in the hybridization buffer included with the kit (50 mM HEPES, pH 7.5, and 0.5 M NaCl). After incubation, the reaction mixture was treated with 1 ml 4-fold-diluted duplex-specific nuclease. Then normalized cDNA was amplified using PCR Advantage II polymerase (Clontech, Palo Alto, CA). After library construction, the samples were quantified using a Qubit fluorometer (Invitrogen), and average fragment sizes were determined by analyzing 1 ml samples on a bioanalyzer (Agilent) using a DNA 7500 chip. Approximately 10 mg cDNA from each of the six samples was used for sequencing on a GS-FLX Titanium platform (454 Life Sciences, Branford, CT) at the DNA Link Inc. facility (Seoul, Korea) according to the manufacturer's protocol.

Figure 3. Gene ontology (GO) enrichment analysis of five fish liver transcriptomes: three notothenioid fish, zebra fish, and medaka.

Figure 4. Comparison of shared and unique genes identified in four notothenioid fishes.

Numbers in parentheses represent the total number of enzymes in metabolic pathway analysis.

Bioinformatic analysis

The raw 454 sequence files in sff format were processed and assembled using Newbler. The resulting contigs were subjected to a BLASTx search against the non-redundant protein database (nr) with an e-value threshold of 10−3 and HSP length cutoff of 33. The gene ontology (GO) terms were assigned to each unique gene based on the GO terms annotated to the corresponding homologs in the UniProt database. GO mapping and annotation were performed with an annotation cutoff of 10−10. Enrichment analysis was performed using Fisher's exact test. All analyses were performed using the BLAST2GO program [26]. Identification of metabolic genes was accomplished by MetaFishNet computation [27]. All cDNA sequences of D. mawsoni were retrieved from the National Center for Biotechnology Information (NCBI). Putative full-length cDNAs were identified by comparing full-length genes and start signals in the UniProt and nr databases to those of ORF prediction using the software Full-Lengther [28] with a cutoff e-value of 1E−5. Once the start codon (ATG) and poly(A) tail had been identified, the sequence was considered a full-length cDNA. The unique sequences of each teleost fish tissue were used to search for microsatellite markers using msatcommander ( with a repeat threshold of eight dinucleotide repeats or five tri-, tetra-, penta-, and hexanucleotide repeats. The unique genes and homologous genes of the three Antarctic notothenioids were identified using BLASTX against the NCBI Refseq protein and Ensembl databases (Tetraodon nigroviridis, zebrafish Danio rerio, and Atlantic salmon Salmo salar) with an e-value threshold of 10−10.

Figure 5. Conservation of three notothenioid fish genes with other species.

Number of notothenioid fish homologous genes annotated in GO analysis.

Results and Discussion

Sequence assembly

Each liver and brain cDNA library of C. aceratus and P. antarcticum and the brain cDNA library of N. coriiceps were subjected to a one-quarter plate run, and the liver cDNA library of N. coriiceps was subjected to a one-plate run with the 454 GS-FLX Titanium platform. After removing low-quality regions, adaptors, and all possible contaminants, we obtained a total of 1,918,483 high-quality reads containing 584,174,779 bases from all of six libraries with average read length 318 bases, and sequencing depth of each library were 10∼15X (Table 1). A de novo assembly was performed for each of the six samples independently. The cleaned read data were entered into Newbler for assembly; the size-selected reads were assembled into 7,815 and 9,414 contigs from the liver and brain of C. aceratus, 10,271 and 9,671 contigs from the liver and brain of P. antarcticum, and 24,836 and 9,532 contigs from the liver and brain of N. coriiceps, respectively. The contigs ranged in size from 152 to 4,012 bp with an average size of 604 bp in the liver, and from 239 to 2,951 bp with an average size of 502 bp in the brain of C. aceratus. In P. antarcticum, they ranged from 102 to 7,900 bp with an average size of 701 bp in the liver, and from 100 to 2,520 bp with an average size of 488 bp in the brain. In N. coriiceps, they ranged from 371 to 6,171 bp with an average size of 966 bp in the liver, and from 238 to 2,457 bp with an average size of 582 bp in the brain. In total, 523 contigs were greater than 3 kb in length, and 2,179 contigs were composed of more than 300 reads, with the largest contig being 12,625 bp composed of 1,108 sequences, which this contig was annotated to titin. The size distribution of the reads is shown in Figure 1. All high-quality reads have been deposited in the NCBI and can be accessed in the Short Read Archive (SRA) under the accession number SRP007644. Table 1 presents a summary of the sequencing and assembly results, and all transcriptome information of the three fishes is accessible at Although the singletons potentially contained useful sequences with low levels of expression, they included short reads, and a small number of redundant sequences and singleton reads were excluded from further analysis. The genome sizes of two of these notothenioids (N. coriiceps, C = 1.13 pg and C. aceratus, C = 1.73 pg) were recently described [15], but the percentages of the transcribed genomes remain unknown. Thus, it is difficult to predict the depth of coverage of the Antarctic fish transcriptome by our de novo assembled sequences.

A total of 28,724 contigs had a significant BLASTx hit at a cutoff value of <1E−10 in the nr protein database: 6,689 of 17,229 contigs (38.8%) from C. aceratus, 8,982 of 19,942 contigs (45.0%) from P. antarcticum, and 13,053 of 34,368 contigs (38.0%) from N. coriiceps, respectively. To obtain an overall view of the transcriptome, these commonly expressed sequences in each tissue of the three fish (with an associated database match) represented a varied mix of functional groups (Table S1). However, in terms of sequence completeness, an estimate of the fraction of full-length sequences in the transcriptome was obtained. A sequence was considered full-length when it included the complete 5′ and 3′ sequences of the mRNA. We used the software Full-Lengther [28], and 36% to 52% of predicted transcripts were validated as full-length or putative full-length in each tissue of the three fish species (Table 2). Among these, 3,170 (41%) from the liver and 2,484 (26%) from the brain of C. aceratus, 10,188 (41%) from the liver and 2,209 (23%) from the brain of N. coriiceps, and 4,757 (46%) from the liver and 3,012 (31%) from the brain of P. antarcticum had significant BLAST matches. As expected, the majority of the sequences (81.2%) showed matches with teleost fish, with eukaryotes accounting for 89.0% of positive hits. Among the fish, the pufferfish Tetraodon nigroviridis showed the highest percentage of hits (24.6%), and the zebrafish Danio rerio represented approximately 13.6% of all hits (Fig. 2).

A total of 3,207 microsatellites were identified from 71,539 unique sequences from six libraries, including di-, tri-, tetra-, penta-, and hexanucleotide repeats (Table 3). Previous observations were reported that 454 pyrosequencing in transcriptomic studies were shown to be an excellent method for large scale prediction of molecular markers for future genetic linkage in non-model organisms [29], [30]. Therefore, given that these microsatellite predicted from transcriptomic sequences, they are likely linked to protein-coding genes, might have substantial physiological implications.

Gene ontology

The transcripts of the six libraries were assigned GO terms based on BLAST matches (Table 4). GO assignments were divided into molecular function, biological process, and cellular components. Predicted proteins assigned to biological process were mainly associated with cellular processes (20%–23%), metabolic processes (14%–19%), and biological regulation processes (11%–12%). Those assigned to molecular function were mainly linked to the binding of ATP, zinc ions, and protein (47%–50%); catalytic activities of enzymes (25%–32%); and transporter activity (5%–8%). Finally, those assigned to cellular components included intracellular locations (42%–43%), organelles (29%–31%), and macromolecular complexes (14%–17%).

GO enrichment analysis was performed on the three notothenioid fish, and these were compared to the transcriptome database of zebrafish and medaka (Oryzias latipes) (NCBI library IC, 14,410 for liver and 1,522 for brain of zebrafish; 17,414 for liver and 8,625 for brain of medaka) because tissue-specific transcriptomes of these two fishes are well known in public databases (Fig. 3). Five GO terms were significantly overexpressed in the liver of Antarctic fish relative to the temperature/tropical fish: ribonucleotide binding, protein modification by small protein conjugation or removal, purine nucleotide binding in the biological process category, and ligase activity in the molecular function category. Eight terms were underrepresented in the liver, including the small ribosomal subunit and cytosolic parts in the cellular component category; macromolecular complex subunit organization, viral reproductive process, viral infectious cycle, pancreas development, and reproduction in the biological process category; and reproductive process in the molecular function category. Of the overrepresented molecular function terms, ligase activity terms were primarily composed of ubiquitin (Ub)-conjugated protein (75 of 339 genes in N. coriiceps, 27 of 120 genes in C. aceratus, and 45 of 203 genes in P. antarcticum). The Ub-proteasome pathway is a cytosolic protein-degradation pathway of misfolded or damaged proteins that takes place two distinct and successive steps. The first step involves tagging of the misfolded or damaged protein by multiple Ub molecules and degradation of the tagged protein by the 26S proteasome complex [31], [32], [33]. Transcriptomic analysis of another Antarctic notothenioid fish, D. mawsoni, also revealed high levels of Ub-conjugated proteins compared to temperate/tropical teleosts [11]. Antarctic fish may have unusually high levels of misfolded or damaged proteins because low temperatures may affect the rate of protein folding [34]. Previous studies have shown that Antarctic notothenioid fish lack a common cellular defense mechanism, such as the heat shock response [5], [10], [35]. These cold-adapted fish require an alternative cellular protein homeostasis mechanism to ensure proper cell functioning. These findings suggest that increased levels of Ub-conjugated proteins in Antarctic fish may be involved in maintaining proteins in their native state in the cold and thermally stable Antarctic environments.

Comparative analysis among four notothenioid species

A total of 28,724 genes were identified from the three notothenioid species based on a BLAST search. Previously, Chen et al. characterized ESTs of D. mawsoni [11]. Therefore, we compared expressed transcriptomes of the liver and brain among these four notothenioid species to cross matching using tBLASTn. A total of 331 genes expressed in the liver and a total of 191 genes expressed in the brain were shared among the four species (e-value, 1E−10) (Fig. 4). In the three fishes focused on in this research, a total of 663 genes expressed in the brain and a total of 1,557 genes in the liver were shared (e-value, 1E−10). The summary of shared genes and identified genes among all species is shown in Table S2 and Table S3.

Li et al. [27] reported the construction of a genome-wide fish metabolic network model to identify and compare the metabolic pathway. They categorized 115 metabolic pathways from 5 fish genomes (D. rerio, O. latipes, Takifugu rubripes, T. nigroviridis, and Gymnopilus aculeatus) to create a list of all fish metabolic genes via gene ontology. And they identified the corresponding enzymes using either orthologous relationships to human genes or similarity to consensus enzyme sequences from this metabolic gene list. We analyzed all cDNA sequences from the four notothenioid fishes and 88 metabolic pathways were assigned; that is, no enzymes in 27 of these metabolic pathways were found mainly lipid related pathway, such as glycosphingolipid biosynthesis, mono-unsaturated fatty acid betaoxidation, omega-3 fatty acid metabolism, and sphingolipid metabolism, compared with the fish metabolic genes to other temperate/tropical teleosts (Table S4). In contrast, we have noticed that the enzyme in electron transport chain were more mapping than that in temperate/tropical fishes, that suggests greater demands for these functions in the cold Antarctic environment.

To assess the evolutionary conservation of the genes, the number of genes with homologs in Tetraodon, zebrafish, and Atlantic salmon (Salmo salar), which were the primary BLAST results, were compared (Fig. 5). A total of 3,605 genes (402 in N. coriiceps, 2296 in P. antarcticum and 907 in C. aceratus. 12.6% of the total number of unique notothenioid fish genes) were found. Among these genes, 1,134 (8.7%) from N. coriiceps, 563(6.3%) from P. antarcticum and 567 (8.5%) from C. aceratus were commonly found in all three species (Tetraodon, zebrafish, and Atlantic salmon).

The 15 known species of icefish and white-blooded fish, including C. aceratus, all lack the hemoglobin gene [1], [36]. This phenotype preceded the evolutionary radiation of the icefish. Di Prisco et al. [6] showed that the C. aceratus genome has transcriptionally inactive truncated variants of α1-globin-related DNA and lacks β-globin genes. They found that the C. aceratus transcriptome contained only cytoglobin for oxygen transport and/or oxygen-binding machinery (Table S5). Cytoglobin is one of four types of globin (hemoglobin, myoglobin, neuroglobin, and cytoglobin), which differ in structure, tissue distribution, and likely function, but mainly serve to transport oxygen in the circulatory system [37]. To determine the molecular phylogenetic position of C. aceratus cytoglobin, a phylogenetic tree was constructed using the neighbor-joining method from a distance matrix, calculated with MEGA4 [38]. Cytoglobin was grouped with the fish cytoglobin cluster (Figure S1). There have been no previous reports of cytoglobin sequences from other icefish species. The cytoglobin of C. aceratus showed the highest level of identity (72%) to that of O. latipes based on amino acid similarity (Figure S2). The mechanism of the compensatory physiological and circulatory adaptations that resulted in replacement of the lost hemoglobin and myoglobin functions remains unknown. Recently, Cheng et al. [39] hypothesized that neuroglobin may play a role in oxygen transport because this gene is widely found in icefish despite the fact that this fish has generally lost hemoglobin and myoglobin. The observation that at least one icefish have retained the cytoglobin gene is intriguing, and the function of the cytoglobin gene should be further explored to address the evolutionary development and alternative physiology of losing globin genes.


We generated and assembled the transcriptomes of three Antarctic notothenioid fish species. We generated more than 71,539 contigs, identified more than 28,724 unique genes expressed in the brain and liver of the three Antarctic fish, and identified more than 3,200 gene-associated microsatellites. The Antarctic fish transcriptome, the analyzed by high-throughput 454 sequencing, can be functionally characterized for a wide range of molecules encoded in the transcriptomes of members of the notothenioid. Comparative sequencing of the three notothenioid fish transcriptomes also provided information on the variation in evolution and speciation of species that live at permanently cold temperatures. We are currently performing whole-genome sequencing of N. coriiceps. Comparison between genome and transcriptome sequences will allow for a better understanding of gene structure and organization in molecular mechanisms of fundamental genetic questions and furthermore provide a comprehensive view into evolution studies to environmental challenges during climate changes.

Supporting Information

Figure S1.

Phylogenetic analysis of the icefish ( Chaenocephalus aceratus ) cytoglobin compared to other species.


Figure S2.

Alignment of the amino acid sequence of cytoglobin with other known fish cytoglobins.


Table S1.

Top 30 commonly expressed sequences with associated BLAST matches in the three notothenioid fishes.


Table S2.

Number of shared genes and number of enzymes (parentheses) involved in the metabolic pathway of each species.


Table S3.

Analysis of species-specific statistics of metabolic pathway.


Table S4.

List of enzymes in metabolic pathways identified in four notothenioid fishes.


Table S5.

Putatively identified globin genes in the three notothenioid fishes.



The authors wish to acknowledge Dr. Jeong-hoon Kim (Korea Polar Research Institute) for help with sample collection.

Author Contributions

Conceived and designed the experiments: SS HP. Performed the experiments: SK JKL DA MK JL. Analyzed the data: SS HL BK HP. Contributed reagents/materials/analysis tools: SK BK. Wrote the paper: SS HP.


  1. 1. Eastman JT (1993) Antarctic fish biology: evolution in a unique environment. San Diego: Academic. 7–12 p.
  2. 2. Eastman JT (2005) The nature of the diversity of Antarctic fishes. Polar Biol 28: 93–107.
  3. 3. Eastman JT, Clarke A (1998) A comparison of adaptive radiations of Antarctic fish with those of Non Antarctic fish. In: di Prisco G, Pisano E, Clarke A, editors. Fishes of Antarctica. A biological overview, Springer Milano-Heidelberg-New York, 3–26.
  4. 4. Eastman JT (2000) Antarctic notothenioid fishes as subjects for research in evolutionary biology. Antarct Sci 12: 276–287.
  5. 5. Hofmann GE, Buckley BA, Airaksinen S, Keen JE, Somero GN (2000) Heat-shock protein expression is absent in the Antarctic fish Trematomus bernacchii (family Nototheniidae). J Exp Biol 203: 2331.
  6. 6. di Prisco G, Cocca E, Parker SK, Detrich HWI (2002) Tracking the evolutionary loss of hemoglobin expression by the white-blooded Antarctic icefishes. Gene 295: 185–191.
  7. 7. Ruud JT (1954) Vertebrates without erythrocytes and blood pigment. Nature 173: 848–850.
  8. 8. Chen L, DeVries AL, Cheng CHC (1997) Evolution of antifreeze glycoprotein gene from a trypsinogen gene in Antarctic notothenioid fish. Proc Natl Acad Sci U S A 94: 3811.
  9. 9. Cheng CHC, Chen L (1999) Evolution of an antifreeze glycoprotein. Nature 401: 443–444.
  10. 10. Place SP, Hofmann GE (2005) Constitutive expression of a stress-inducible heat shock protein gene, hsp70, in phylogenetically distant Antarctic fish. Polar Biol 28: 261–267.
  11. 11. Chen Z, Cheng CH, Zhang J, Cao L, Chen L, et al. (2008) Transcriptomic and genomic evolution under constant cold in Antarctic notothenioid fish. Proc Natl Acad Sci U S A 105: 12944–12949.
  12. 12. Sidell BD, Vayda ME, Small DJ, Moylan TJ, Londraville RL, et al. (1997) Variable expression of myoglobin among the hemoglobinless Antarctic icefishes. Proc Natl Acad Sci U S A 94: 3420.
  13. 13. Moylan TJ, Sidell BD (2000) Concentrations of myoglobin and myoglobin mRNA in heart ventricles from Antarctic fishes. J Exp Biol 203: 1277.
  14. 14. Alber C (2009) Biology's next top model? Nature 458: 9.
  15. 15. Detrich HW III, Stuart A, Schoenborn M, Parker SK, Methe BA, et al. (2010) Genome enablement of the notothenioidei: genome size estimates from 11 species and BAC libraries from 2 representative taxa. J Exp Zool B Mol Dev Evol 314: 369–381.
  16. 16. Cocca E, Ratnayake-Lecamwasam M, Parker SK, Camardella L, Ciaramella M, et al. (1995) Genomic remnants of alpha-globin genes in the hemoglobinless antarctic icefishes. Proc Natl Acad Sci U S A 92: 1817.
  17. 17. Zhao Y, Ratnayake-Lecamwasam M, Parker SK, Cocca E, Camardella L, et al. (1998) The major adult α-globin gene of Antarctic teleosts and its remnants in the hemoglobinless icefishes. J Biol Chem 273: 14745.
  18. 18. Detrich HW III (2000) Recent Evolution of the Hemoglobinless Condition of the Antarctic Icefishes. In: di Prisco G, Giardina B, Weber RE, editors. Hemoglobin Function in Vertebrates: Molecular Adaptation in Extreme and Temperate Environments. Springer-Verlag, Milano-Heidelberg-New York, 39–49.
  19. 19. Parkinson J, Blaxter M (2009) Expressed sequence tags: an overview. Methods Mol Biol 533: 1–12.
  20. 20. Ansorge WJ (2009) Next-generation DNA sequencing techniques. N Biotechnol 25: 195–203.
  21. 21. Morozova O, Hirst M, Marra MA (2009) Applications of new sequencing technologies for transcriptome analysis. Annu Rev Genomics Hum Genet 10: 135–151.
  22. 22. Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford DL, et al. (2008) Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol Ecol 17: 1636–1647.
  23. 23. Collins LJ, Biggs PJ, Voelckel C, Joly S, editors. (2008) An approach to transcriptome analysis of non-model organisms using short-read sequences. London: Imperial College Press. 3–14.
  24. 24. Albertson RC, Yan YL, Titus T, Pisano E, Vacchi M, et al. (2010) Molecular pedomorphism underlies craniofacial skeletal evolution in Antarctic notothenioid fishes. BMC evolutionary biology 10: 4.
  25. 25. Vacchi M, DeVries AL, Evans CW, Bottaro M, Ghigliotti L, et al.. (2012) A nursery area for the Antarctic silverfish Pleuragramma antarcticum at Terra Nova Bay (Ross Sea): first estimate of distribution and abundance of eggs and larvae under the seasonal sea-ice. Polar Biology: DOI: 10.1007/s00300-012-1199-y.
  26. 26. Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, et al. (2005) Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21: 3674–3676.
  27. 27. Li S, Pozhitkov A, Ryan RA, Manning CS, Brown-Peterson N, et al. (2010) Constructing a fish metabolic network model. Genome Biology 11: R115.
  28. 28. Full-Lengther website. Available:
  29. 29. Bai X, Rivera-Vega L, Mamidala P, Bonello P, Herms DA, et al. (2011) Transcriptomic signatures of ash (Fraxinus spp.) phloem. PloS one 6: e16368.
  30. 30. Novaes E, Drost D, Farmerie W, Pappas G, Grattapaglia D, et al. (2008) High-throughput gene and SNP discovery in Eucalyptus grandis, an uncharacterized genome. BMC genomics 9: 312.
  31. 31. Glickman MH, Ciechanover A (2002) The ubiquitin-proteasome proteolytic pathway: destruction for the sake of construction. Physiol Rev 82: 373–428.
  32. 32. Goldberg AL (2003) Protein degradation and protection against misfolded or damaged proteins. Nature 426: 895–899.
  33. 33. Wickner S, Maurizi MR, Gottesman S (1999) Posttranslational quality control: folding, refolding, and degrading proteins. Science 286: 1888–1893.
  34. 34. Jaenicke R (1990) Protein structure and function at low temperatures. Philos Trans R Soc Lond B Biol Sci 326: 535–551.
  35. 35. Place SP, Zippay ML, Hofmann GE (2004) Constitutive roles for inducible genes: evidence for the alteration in expression of the inducible hsp70 gene in Antarctic notothenioid fishes. Am J Physiol Regul Integr Comp Physiol 287: R429–436.
  36. 36. Barber D, Mills Westermann J, White M (1981) The blood cells of the Antarctic icefish Chaenocephalus aceratus Lonnberg: light and electron microscopic observations. J Fish Biol 19: 11–28.
  37. 37. Pesce A, Bolognesi M, Bocedi A, Ascenzi P, Dewilde S, et al. (2002) Neuroglobin and cytoglobin. Fresh blood for the vertebrate globin family. EMBO Rep 3: 1146–1151.
  38. 38. Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 24: 1596–1599.
  39. 39. Cheng CHC, di Prisco G, Verde C (2009) The “icefish paradox.” Which is the task of neuroglobin in Antarctic hemoglobin less icefish? IUBMB Life 61: 184–188.