Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Transcriptomic profile of early zebrafish PGCs by single cell sequencing

  • Xiaoyuan Zhang ,

    Contributed equally to this work with: Xiaoyuan Zhang, Xintian Li

    Roles Data curation, Formal analysis, Writing – original draft, Writing – review & editing

    Affiliations State Key Laboratory of Cell Biology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, Shanghai Key Laboratory of Molecular Andrology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China

  • Xintian Li ,

    Contributed equally to this work with: Xiaoyuan Zhang, Xintian Li

    Roles Methodology

    Affiliations State Key Laboratory of Cell Biology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, Shanghai Key Laboratory of Molecular Andrology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China

  • Ronghong Li,

    Roles Formal analysis

    Affiliations State Key Laboratory of Cell Biology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, Shanghai Key Laboratory of Molecular Andrology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China

  • Yunbin Zhang,

    Roles Conceptualization

    Affiliations State Key Laboratory of Cell Biology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, Shanghai Key Laboratory of Molecular Andrology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China

  • Yiping Li,

    Roles Conceptualization

    Affiliations State Key Laboratory of Cell Biology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, Shanghai Key Laboratory of Molecular Andrology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China

  • Shifeng Li

    Roles Conceptualization, Methodology, Writing – original draft

    sfli01@sibcb.ac.cn.

    Affiliations State Key Laboratory of Cell Biology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, Shanghai Key Laboratory of Molecular Andrology, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China

Abstract

Single cell RNA-seq is a powerful and sensitive way to capture the genome-wide gene expression. Here, single cell RNA-seq was utilized to study the transcriptomic profile of early zebrafish PGCs (primordial germ cells) at three different developmental stages. The three stages were 6, 11 and 24 hpf (hours post fertilization). For each developmental stage, three zebrafish PGCs from one embryo were collected, and 9 samples in total were used in this experiment. Single cell RNA-seq results showed that 5099–7376 genes were detected among the 9 samples, and the number of expressed genes decreased as development progressed. Based on the gene expression pattern, samples from 6 and 11 hpf clustered closely, while samples from 24 hpf were more dispersed. By WGCNA (weighted gene co-expression network analysis), the two biggest modules that had inverse gene expression patterns were found to be related to PGC formation or migration. Functional enrichment analysis for these two modules showed that PGCs mainly conducted migration and cell division in early development (6/11 hpf) and translation activity became active in late development (24 hpf). Differentially expressed gene analyses showed that more genes were downregulated than upregulated between two adjacent stages, and genes related to PGC formation or migration reported by previous studies decreased significantly from 11 to 24 hpf. Our results provide base knowledge about zebrafish PGC development at the single cell level and can be further studied by other researchers interested in biological development.

Introduction

As early as 1930s, the zebrafish (Danio rerio) has been used for embryological research, and in 1980s, zebrafish was used as a genetically tractable organism [1, 2]. In the 1990s, thousands of zebrafish mutants related to early embryo development were identified [2]. A comparison of the reference genomes of human and zebrafish shows that about 70% of human genes have orthologs in zebrafish [1], so zebrafish has been widely used to study human genetic diseases, such as pediatric disease, cancer, immunological disease, inflammatory bowel disease and so on [25]. In addition, thanks to transparent embryos, small size, external fertilization and easily obtainable fertilized eggs, zebrafish becomes an excellent model organism for studying vertebrate development [2, 3, 6, 7].

PGCs are the earliest germline cells which will differentiate into male or female gametes that can give rise to progeny. Preformation and induction are two general ways for the specification of PGCs, and the specification of zebrafish PGCs belongs to preformation [810]. For the zebrafish, some maternal RNAs and proteins which are called germ plasm are unequally divided during later cleavage. As a result, cells having germ plasm will develop into PGCs and cells not having germ plasm will develop into somatic cells [8]. In 1997, Yoon et al. and Olsen et al. discovered the first zebrafish PGC marker vasa independently [11, 12], and more markers have since been found, such as nanos3, g1m, dnd1, tdrd7a and gra [1317]. In 2005, Blaser et al. successfully constructed a transgenic zebrafish whose PGCs could fluoresce green, and this technique made the separation of PGCs from embryos easy and was soon adopted by lots of researchers [18].

Transcriptome analysis is a useful way to detect dynamic changes of gene expression, and it can provide crucial clues to help understand the processes of embryogenesis and development [2, 19]. RNA-seq has become a state-of-the-art tool for transcriptomic research [20]. It has many advantages over previous techniques, such as RNA in situ hybridization, quantitative RT-PCR and microarrays. RNA-seq can determine the expression of thousands of genes accurately in just one experiment and can detect new transcripts effectively [21]. Moreover, RNA-seq can be used for rare materials with the newly emerged single-cell RNA-seq technique, which has already been used for the study of human PGCs [22, 23]. For the zebrafish, RNA-seq has been used for studying zebrafish development on the embryo level [6, 7, 24, 25]. However, little is known about the comprehensive transcriptome dynamics in zebrafish PGCs during various early developmental stages.

In this study, single cell sequencing was used to explore the transcriptomes of zebrafish PGCs at three developmental stages: 6 hpf (shield stage), 11 hpf (3-somite stage) and 24 hpf (prim-5 stage). The number of expressed genes decreased as development progressed, and the gene expression pattern was more alike in the 6 and 11 hpf stages (both were in the process of migration) than in the 6/11 and 24 hpf stages. Both gene network and differentially expressed gene analyses showed that PGCs conducted cell division to increase the cell number during migration and translation became active after reaching the destination of the gonad. In addition, genes related to PGC formation or migration, such as cxcr4b, dnd1 and tdrd7a, maintained high expression at 6/11 hpf while their expression decreased significantly from 11 to 24 hpf.

Results and discussion

Confirmation of zebrafish PGCs

In this study, the embryos of transgenic zebrafish carrying a kop-egfp-nanos3-3ʹUTR inserted fragment in the genome were used to isolate PGCs [18, 26]. Based on green fluorescence, single PGC was picked out manually using a capillary tube for downstream experiments (see Methods for details). Three PGCs were picked out from one embryo at each developmental stage (6, 11 and 24 hpf), so 9 PGCs were collected in total from 3 stages (Table 1).

thumbnail
Table 1. General information for the high throughput sequencing.

https://doi.org/10.1371/journal.pone.0220364.t001

Under a fluorescent microscope, cells emitting green fluorescence appeared in four locations of the embryo at 6 hpf, formed two relatively loose clusters at 11 hpf and two relatively close clusters in the gonad at 24 hpf (S1 Fig). This result is consistent with previous research about the migration route of zebrafish PGCs [27, 28], and implied that cells emitting green fluorescence might be PGCs. Vasa protein is a typical germline cell marker in zebrafish that has been used to mark PGCs [2931]. To validate that cells emitting green fluorescence were PGCs, immunostaining with Vasa antibody was used. Images of whole-mount immunofluorescence with Vasa antibody at 11 and 24 hpf showed that cells emitting green fluorescence and cells emitting red fluorescence (stained by Vasa antibody) colocalized perfectly (Fig 1A). Unfortunately, we also did a whole-mount immunofluorescence experiment for the zebrafish embryo at 6 hpf but failed because the red fluorescence was too faint to be detected at this time. In addition, digestive cells from zebrafish gonads at 24 hpf were stained by Vasa antibody and DAPI. Three different fields of a glass slide under the microscope all showed colocalization of cells emitting green fluorescence and red fluorescence (Fig 1B). Above all, we were confident that cells picked out by green fluorescence were PGCs without question.

thumbnail
Fig 1. Confirmation of the zebrafish PGCs by immunostaining with Vasa antibody.

(A) Whole mount immunostaining with Vasa antibody. Cells containing Vasa proteins can give out red fluorescence (second column). (B) Digested cells from zebrafish gonad at 24 hpf stained by Vasa antibody (second column) and DAPI (third column). Field 1, Field 2 and Field 3 mean three different fields of the glass slide under the microscope.

https://doi.org/10.1371/journal.pone.0220364.g001

Dynamic expression of the PGC transcriptome

To obtain the genome-wide gene expression profile during zebrafish PGC development, nine sequencing libraries were constructed for the Illumina platform with three biological repeats for each development stage. In total, over 35 M paired reads were obtained with about 3–5 M paired reads per individual library (Table 1). A majority of the paired reads were maintained after quality control, and more than 90% of the qualified reads could be mapped to the zebrafish genome (danRer10). The mapped reads were then assigned to specific genes based on the 32,266 annotated genes (danRer10 genome, Ensemble release 89), and gene expression was quantified by calculating the FPKM (Fragments Per Kilobase of exon model per Million mapped fragments) values. To avoid false positives and make the data more effective, genes that had FPKM ≥ 1 at least in three samples were taken into account, and genes not meeting this standard were discarded. Using this standard, 9329 genes remained (about 29% to the total annotated genes) and their expressions were used for the downstream calculation (S1 Table; Fig 2A). Single-cell qPCR was utilized to validate the expression values for 20 selected genes (S2 Fig), and the results were highly consistent with the sequencing data (Fig 3).

thumbnail
Fig 2. Dynamic changes of the number of genes detected in PGCs.

(A) Venn graph for the shared genes at the three developmental stages. (B) Changes of number of genes detected at the three developmental stages.

https://doi.org/10.1371/journal.pone.0220364.g002

thumbnail
Fig 3. Validation of sequencing data by single-cell qPCR.

The relative FPKM values and the relative expression levels of single-cell qPCR are shown. For each gene, the relative FPKM values are calculated as the ratios of the normalized FPKM values to the maximal normalized FPKM value of that gene, where the normalized FPKM values were the mean ratios of the original FPKM values to that of the actb1 gene. The red line indicate the relative FPKM values, the green line indicate the the relative expression levels obtained by single-cell qPCR. actb1 is used as an endogenous control in single-cell qPCR.

https://doi.org/10.1371/journal.pone.0220364.g003

The number of genes shared by three developmental stages was 7228, which accounted for more than 77% of the total detected genes, and genes presented only in one stage were less than 70 (Fig 2A). One possible explanation was that these cells were the same kind of cell. Genes detected only in the group of 6 and 11 hpf (1,103) were more than in the two other groups (11 and 24 hpf: 351; 6 and 24 hpf: 516, Fig 2A). Zebrafish PGCs start migration at 4.5 hpf and reach the final destination at 24 hpf [18, 27, 32, 33]. In the stage of 6 and 11 hpf, PGCs are in the processing of migration. At 24 hpf, the migration activity is very weak. So PGCs in the stages of 6 and 11 hpf shared more genes.

With development proceeding, the number of genes detected decreased. PGCs at 11 hpf had 557 fewer genes than at 6 hpf, and PGCs at 24 hpf had 817 fewer genes than at 11 hpf (Fig 2B). This was different from Yang’s result in which the number of detected genes for zebrafish embryos increased steadily during development [6]. Increasingly different kinds of tissues appear during zebrafish embryo development, and the zygotic genome has to transcribe more genes to maintain the function for so many tissues. As a result, genes expressed in the embryo increase with development. In this study, the cells in the three developmental stages are the same kind of cell, and their function is relatively unitary (to form gametes). Maternal transcripts in PGCs degrade with development [34], and gene expression in PGCs is repressed by Dnd1 so as to prevent PGCs from differentiating into somatic cells and maintain germline stem cells [35, 36]. In this study, the least FPKM for Dnd1 was 671, and this value was high enough to suppress the expression of somatic genes. As a result, genes detected in PGCs decreased during development.

Comparison of transcriptomes at different PGC developmental stages

To detect similarities in PGCs at different developmental stages, gene expression was compared by Pearson correlation analysis (Fig 4A). The Pearson correlation coefficient (PCC) between any two samples varied from 0.33 to 0.60 with the lowest between 24_1 and 6_2/11_1 and the highest between two samples (6_1 and 6_2) at 6 hpf. PCC between any two samples from the same developmental stage was bigger than 0.50 with one exception between 11_1 and 11_3 (PCC = 0.49). PCC between samples at 6 and 11 hpf was 0.47–0.54, and at 6/11 hpf and 24 hpf was 0.33–0.48. These results showed that individual PGCs were different from each other even at the same developmental stage, and that gene expression pattern of PGCs was more alike at 6 and 11 hpf than at 6/11 and 24 hpf.

thumbnail
Fig 4. Comparison of gene expression for PGCs at different developmental stages.

(A) Pearson correlation analysis between PGCs. The x-axis and y-axis mean gene expression calculated by log2(FPKM+1). (B) Principal component analysis for nine PGCs. The first two principal components are displayed on the graph.

https://doi.org/10.1371/journal.pone.0220364.g004

Principal component analysis (PCA) was utilized to explore gene expression patterns among PGCs at different developmental stages (Fig 4B). It showed that three samples at 6 hpf and two samples at 11 hpf clustered together closely, while samples at 24 hpf were relatively disperse. This result was further demonstrated by dendrogram analysis (S3 Fig).

In our project, single cell technique was used which required more stringent experimental steps, and minute difference between one cell to another could be captured even though they belonged to the same cell type and were at the same developmental stage. In addition, PGCs at the same developmental stage may really have high variation. For example, Otten et al. found that zebrafish PGCs had highly variable mtDNA copy numbers [37]. As a result, PCC was very low between any two samples even though they were at the same developmental stage (Fig 4A).

Nanog, Pou5f3 and SoxB1 (containing Sox2, Sox3, Sox19a, and Sox19b) are three important transcription factors which can control the expression of thousands of genes in the ealy embryo, and these factors can be used as markers for pluripotent embryonic stem (ES) cell identity [3843]. In this study, the expression of Sox2 was too low to be detected, and expression of Sox19b/Sox3/Pou5f3/Nanog decreased with the progression of development (S4 Fig). Especially at 24 hpf, the expression of Pou5f3/Nanog was very low, just slightly above zero. The expression of Sox19a reached a maximum at 11 hpf, but the FPKM was just about 20 at this stage. Based on these results, we concluded that at the stage of 6/11 hpf, PGCs had relatively high pluripotency, and all PGCs were in the same developmental states. At the stage of 24 hpf, PGCs had relatively low pluripotency and might have lost synchrony, and some PGCs started to differentiate into downstream germ cells. As a result, dots were more scattered at 24 than at 6/11 hpf in the PCA graph (Fig 4B).

Gene-network analysis reveals two modules that are important for PGC development

Genes having the same expression pattern may contribute to specific biological processes [44]. Here, WGCNA was utilized to investigate gene expression patterns systematically and to find modules of interest [45, 46]. In this study, 103 modules were detected (S2 Table; Fig 5A; a grey module meant that these genes could not be grouped into any specific module). Based on the Pearson correlation between module eigengenes and sample traits (here the trait was hpf), two modules (turquoise and blue) with the biggest absolute PCCs (S5A Fig, turquoise: PCC = -0.91, p = 6e-4; blue: PCC = 0.98, p = 2e-6) were chosen. These two modules were also the two largest modules (turquoise: 829 genes; blue: 524 genes) of all modules (S2 Table).

thumbnail
Fig 5. WGCNA for detecting modules related to PGC development.

(A) Hierarchical cluster tree for modules identified by WGCNA. The heatmap underneath means gene expression with red for higher and white for lower. The blue module is marked by blue frame and the turquoise module is marked by turquoise frame. (B) Heatmap for gene expresion calculated by log2(FPKM+1) and scaled by the row in blue module. (C) Heatmap for gene expresion calculated by log2(FPKM+1) and scaled by the row in turquoise module. (D) Functional enrichment analysis for genes in the blue module(blue bar) and the turquoise module(turquoise bar).

https://doi.org/10.1371/journal.pone.0220364.g005

For the turquoise module, gene expression was higher in 6/11 hpf than in 24 hpf (Fig 5C). Module eigengene (ME) values for samples in 6/11 hpf were greater than 0.1, except for one sample in 11 hpf (11_3) whose ME value was nearly zero, while ME values for the three repeats in 24 hpf were less than -0.2 (S5C Fig). Some genes that are already known to play an important role in PGC formation or migration belonged to this module, such as cxcr4b, dnd1, tdrd7a, ca15b and ddx4 (S2 Table). Functional enrichment analysis for genes in the turquoise module showed that chromosome organization and cell division process had been enriched, and that germ cell migration and transcription had also been enriched (Fig 5D; S3 Table). These results indicated that genes in the turquoise module were related to PGC proliferation and migration during embryo development.

For the blue module, gene expression pattern was the opposite of that in the turquoise module. Genes in 24 hpf had higher FPKM values than in 6/11 hpf (Fig 5B), and ME values for samples in 24 hpf were greater than 0.4 while those for samples in 6/11 hpf were less than -0.1 (S5B Fig). Functional enrichment analysis for the blue module showed that translation and peptide biosynthetic process were enriched (Fig 5D; S3 Table).

Based on the gene expression pattern and functional enrichment analysis in the turquoise and blue modules, we found that the main task for PGCs were cell division and migration from 6 to 11 hpf, while translation become the main task from 24 hpf and later.

Differentially expressed genes during PGC development

Dynamic changes of differentially expressed genes and the biological processes driven by these genes are useful for studying development. For each adjacent pair of developmental stages, more genes were downregulated. In the pair of 6 and 11 hpf, 301 genes were upregulated and 555 genes were downregulated (Fig 6A; S4 Table). Upregulated genes were enriched in ribonucleoprotein complex biogenesis, cellular component biogenesis and replication fork processing. Downregulated genes were enriched in response to inorganic substance, protein modification process and response to metal ion (Fig 6B; S4 Table). In addition, genes related to PGC formation or migration, such as cxcr4b, dnd1 and tdrd7a, were not differentially expressed (S6A Fig), and their expressions remained high during this period (S7 Fig).

thumbnail
Fig 6.

Expression of differentially expressed genes and their functional enrichment analysis between 6/11 hpf (A,B) and 11/24 hpf (C,D). Gene expression in the heatmap was calculated by log2(FPKM+1) and scaled by the row.

https://doi.org/10.1371/journal.pone.0220364.g006

In the pair of 11 and 24 hpf, 433 genes were upregulated and 872 genes were downregulated (Fig 6C; S5 Table). Upregulated genes were enriched in translation, peptide biosynthetic process and oxidation-reduction process (Fig 6D; S5 Table), and this corresponded to the blue module in WGCNA (Fig 5D). Downregulated genes were enriched in cell division, transcription and germ cell migration (Fig 6D; S5 Table), and this corresponded to the turquoise module in WGCNA (Fig 5D). Scatter plots and bar graphs showed that genes related to PGC formation or migration, such as cxcr4b, dnd1 and tdrd7a, decreased significantly from 11 to 24 hpf (S6B Fig and S7 Fig).

These results again indicated that PGCs conducted cell division to increase their cell number and migration to reach the gonads during early development. After reaching the gonad, the activities of cell division and migration weakened gradually and translation became more active. Knaut et al. showed that the number of PGCs per embryo increases from 15 at 50 percent epiboly to 30 at 24 hpf [30]. Tzung et al. showed that PGC number fluctuates little during the first week of development [47]. Our data are consistent with these previous results, increasing confidence in its validity. Transcription activity weakened from 11 to 24 hpf (Fig 6D), and this was in accordance with the results mentioned above in which the number of genes expressed decreased significantly during this period (Fig 2B). Translation was active from 11 to 24 hpf, and we propose that PGCs were preparing for differentiation during this time because their transcriptomic profiles were more scattered at 24 hpf than at 6/11 hpf (Fig 4B), and gonocytes appeare at 10 dpf (days post fertilization) [48, 49].

Conclusions

In this study, zebrafish PGCs were separated and sequenced to explore their dynamic transcriptomes during early development. 5099–7376 genes were expressed from 6 to 24 hpf, and the number of genes expressed decreased in the course of development. Gene expression patterns were different between 6/11 hpf and 24 hpf when PGCs were in the process of migration and had finished migration, respectively. Individual differences in PGCs were greater at 24 hpf than at 6/11 hpf, which meant that synchrony was lost at 24 hpf. Functional enrichment analysis showed that PGCs conducted cell division to increase the cell number during migration, and that translation became more active to prepare for differentiation after reaching the final destination of the gonad. Genes related to PGC formation or migration had high expression at 6 and 11 hpf, and their expressions decreased significantly from 11 to 24 hpf. These results help us to understand zebrafish PGC development, and we will focus future studies on zebrafish germ cell development in a wider time range.

Methods

Zebrafish strains and breeding

Transgenic AB strain zebrafish carrying the kop-egfp-nanos3-3ʹUTR inserted fragment were bought from China Zebrafish Resource Center (CZRC), Wuhan, and bred in the zebrafish platform of Shanghai Institute of Biochemistry and Cell Biology (SIBSB), CAS, under standard breeding conditions [50]. Embryo stages were determined by the description of Kimmel et al [51]. All animal experiments were approved by the Institutional Animal Care and Use Committee of the Shanghai Institute of Biochemistry and Cell Biology, CAS, and all the methods were carried out in accordance with the approved guidelines.

Isolating single PGC

For zebrafish at 6 and 11 hpf, just one embryo was used respectively. The embryo was dechorionated using forceps, and then single cell suspension was prepared by pipetting the embryo with 20 μL tips for 20 times in a 0.1% BSA solution (dissolved in phosphate-buffered saline, PBS).

For zebrafish at 24hpf, one embryo was collected. The embryo was first anaesthetized by tricaine (3-amino benzoic acid ethyl ester also called ethyl 3-aminobenzoate, Sigma, Cat: A-5040). Then the embryo was dechorionated using forceps, and the embryo head, tail and yolk were cut off with the tip of a 1 mL syringe. The remaining trunk was transferred to a new dish and minced with a syringe tip. Then the minced tissues were pipetted up and down for more than 20 times with 20 μL tips to prepare a single cell suspension.

Single cell suspensions were transferred to slides, and PGCs emitting green fluorescence were picked out using a capillary of 50 μm diameter under an Olympus IX51 microscope. Each isolated PGC was placed in a 200 μL tube with 4.6 μL lysis butter and stored at -80°C. Three PGCs from one embryo were isolated at each stage.

Whole-mount immunofluorescence

Zebrafish embryos were treated with pronase (Roche, Cat: 10165921001) 1 mg/mL for 10 min at 28°C to lyse the embryo chorion. The dechorionated embryos were fixed with 4% paraformaldehyde/PBS for 1 h at room temperature. Then embryos were fixed with methanol overnight at -20°C and hydrated with gradient methanol (75% 5 min; 50% 5 min; 25% 5 min; PBS 5 min). After that, embryos were washed with PBST (0.1% Triton X-100 in PBS) for 5 min and subsequently blocked with 3% BSA for 30 min. Then the embryos were incubated with anti-Vasa antibody (GeneTex, Cat: GTX128306, 1:1000 dilution) overnight at 4°C. On the next day, embryos were washed with PBST 3 times for 30 min each. Thereafter, embryos were incubated with the secondary antibody (1:500, goat anti-rabbit, Alexa546, Thermo Fisher Scientific, Cat: a11010) for 1 h at room temperature. Finally, the embryos were washed with PBST 3 times for 30 min each and observed by a Zeiss AXIO Zoom V16.

Immunocytofluorescence

Single cell suspensions of transgenic zebrafish embryo at 24 hpf were prepared as mentioned above. Small circles were drawn on a slide with a Dako pen. One drop of PBS was dropped on the circles and 5 μL of a single cell suspension was transferred into the PBS. After settling for 10 min, cells were fixed with 4% PFA for 30 min at room temperature. Then cells were incubated with 0.5% Triton X-100 in PBS for 15 min, then the cells were blocked with 3% BSA for 30 min and incubated with anti-Vasa antibody (GeneTex, Cat: GTX128306, 1:1000 dilution) overnight at 4°C. Then the cells were washed with PBST 3 times for 5 min each and incubated with the secondary antibody (1:500, goat anti rabbit, Alexa546). Finally, cells were observed by an Olympus IX51 microscope.

Single-cell qPCR

The single-cell qPCR followed the protocol published before [52]. Briefly, PGC was collected into lysis buffer for the first cDNA synthesis, and a modified Smart2-seq protocol was applied to amplify the cDNA. Then cDNAs which passed quality control were used for qPCR. qPCR was done with NovoStart 480 SYBR qPCR SuperMix Plus (novoprotein) on a LightCycler 480 Instrument (Roche). For the genes at specific developmental stage, five PGCs from one embryo were used and two technical repeats were examined for each PGC. Primers for the 20 selected genes were listed in S6 Table. actb1 is used as an endogenous control.

RNA-seq library construction and sequencing

The collected PGCs were subjected to Geo-seq [52] for transcriptomic profiling. Briefly, cells were amplified with a modified Smart-seq2 protocol and a library was constructed with Nextera XT kits (Illumina). Sequencing was performed in a 125-bp paired-end format on the Illumina HiSeq 2500. Sequences and processed RNA-seq data files were deposited in the NCBI Gene Expression Omnibus (GEO) database under accession number GEO: GSE122208.

Read mapping and quantification of gene expression

Raw reads were pre-processed by Trimmomatic version 0.36 [53] and mapped to danRer10 using HISAT2 version 2.0.5 [54]. Gene expression quantification was performed by StringTie version 1.3.3 [54] with the annotation file (Danio_rerio.GRCz10.89.gtf) downloaded from Ensembl. Genes that had FPKM ≥ 1 in at least three samples were taken into account, and genes not meeting this standard were discarded. The Python script prepDE.py (http://ccb.jhu.edu/software/stringtie/index.shtml?t=manual#deseq) was used to calculate the gene read counts for analyzing differentially expressed genes by edgeR.

Statistical analysis

WGCNA (weighted gene co-expression network analysis) was performed using the WGCNA package [46] in R software [55] with the choice of one-step automatic network construction and module detection. GO (gene ontology) for gene sets of interest was calculated by DAVID 6.8 online [56]. Differentially expressed genes were determined by edgeR [57]. Genes with fold change (FC) ≥ 2 and p ≤ 0.05 were treated as differentially expressed genes.

Supporting information

S1 Fig. Images for the zebrafish embryos in this study.

White arrows pointing to cells emitting green fluorescence means PGCs. Images of the up panel are taken under bright light, and images of the down panel are taken under fluorescent light.

https://doi.org/10.1371/journal.pone.0220364.s001

(TIF)

S2 Fig. Results of single-cell qPCR.

actb1 is used as an endogenous control, and error bar stands for SE (Standard Error) in 5 PGCs.

https://doi.org/10.1371/journal.pone.0220364.s002

(TIF)

S3 Fig. Dendrogram analysis for PGCs at different developmental stages.

https://doi.org/10.1371/journal.pone.0220364.s003

(TIF)

S4 Fig. Gene expression of SoxB1, Pou5f3 and Nanog.

The y-axis means FPKM values. Sox19a, Sox19b and Sox3 belong to SoxB1

https://doi.org/10.1371/journal.pone.0220364.s004

(TIF)

S5 Fig. Module eigengene (ME) values for modules identified by WGCNA.

(A) Pearson correlation between module engengenes and sample traits (here the trait is hpf). The blue module is marked by blue arrow and the turquoise module is marked by turquoise arrow. (B) ME values for blue module. (C) ME values for turquoise module.

https://doi.org/10.1371/journal.pone.0220364.s005

(TIF)

S6 Fig.

Scatter plots for differentially expressed genes in 6/11 hpf (A) and 11/24 hpf (B). Red dots mean genes upregulated and green dots mean genes downregulated. Genes with fold change (FC) ≥ 2 and p ≤ 0.05 are treated as differentially expressed genes.

https://doi.org/10.1371/journal.pone.0220364.s006

(TIF)

S7 Fig. Expression of genes related to PGC formation or migration.

The y-axis means FPKM values.

https://doi.org/10.1371/journal.pone.0220364.s007

(TIF)

S1 Table. Gene expression of zebrafish PGCs.

These genes have FPKM ≥ 1 in three samples at least, expression values are FPKM values.

https://doi.org/10.1371/journal.pone.0220364.s008

(XLSX)

S2 Table. Modules detected and gene list in turquoise and blue modules.

https://doi.org/10.1371/journal.pone.0220364.s009

(XLSX)

S3 Table. Functional enrichment analysis by DAVID for gene sets in turquoise and blue modules.

https://doi.org/10.1371/journal.pone.0220364.s010

(XLSX)

S4 Table. Differentially expressed genes in the pair of 6hpf and 11hpf and functional enrichment analysis by DAVID.

https://doi.org/10.1371/journal.pone.0220364.s011

(XLSX)

S5 Table. Differentially expressed genes in the pair of 11 hpf and 24 hpf and functional enrichment analysis by DAVID.

https://doi.org/10.1371/journal.pone.0220364.s012

(XLSX)

S6 Table. Primers for the 20 selected genes.

actb1 is used as an endogenous control.

https://doi.org/10.1371/journal.pone.0220364.s013

(XLSX)

Acknowledgments

The authors thank Prof. Ligang Wu at SIBCB (Shanghai Institute of Biochemistry and Cell Biology), CAS for suggestions and discussions about this research, Prof. Naihe Jing and Prof. Guangdun Peng at SIBCB for library construction, all members of Yiping Li group at SIBCB for zebrafish breeding and advice on experimental design.

References

  1. 1. Howe K, Clark MD, Torroja CF, Torrance J, Berthelot C, Muffato M, et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature. 2013;496(7446):498–503. pmid:23594743
  2. 2. Lieschke GJ, Currie PD. Animal models of human disease: zebrafish swim into view. Nat Rev Genet. 2007;8(5):353–67. pmid:17440532
  3. 3. B.Veldman M, Lin S. Zebrafish as a developmental model organism for pediatric research. Pediatr Res. 2008;64(5):470–6. pmid:18679162
  4. 4. Feitsma H, Cuppen E. Zebrafish as a cancer model. Mol Cancer Res. 2008;6(5):685–94. pmid:18505914
  5. 5. Zhao S, Xia J, Wu X, Zhang L, Wang P, Wang H, et al. Deficiency in class III PI3-kinase confers postnatal lethality with IBD-like features in zebrafish. Nat Commun. 2018;9(1):2639. pmid:29980668
  6. 6. Yang H, Zhou Y, Gu J, Xie S, Xu Y, Zhu G, et al. Deep mRNA sequencing analysis to capture the transcriptome landscape of zebrafish embryos and larvae. PLoS One. 2013;8(5):e64058. pmid:23700457
  7. 7. Vesterlund L, Jiao H, Unneberg P, Hovatta O, Kere J. The zebrafish transcriptome during early development. BMC Dev Biol. 2011;11(30):1–18. pmid:21609443
  8. 8. Strome S, Updike D. Specifying and protecting germ cell fate. Nat Rev Mol Cell Biol. 2015;16(7):406–16. pmid:26122616
  9. 9. Hartung O, Forbes MM, Marlow FL. Zebrafish vasa is required for germ-cell differentiation and maintenance. Mol Reprod Dev. 2014;81(10):946–61. pmid:25257909
  10. 10. Zhang J, Liu W, Jin Y, Jia P, Jia K, Yi M. MiR-202-5p is a novel germ plasm-specific microRNA in zebrafish. Sci Rep. 2017;7(1):1–7.
  11. 11. Yoon C, Kawakami K, Hopkins N. Zebrafish vasa homologue RNA is localized to the cleavage planes of 2- and 4-cell-stage embryos and is expressed in the primordial germ cells. Development. 1997;124(16):3157–66. pmid:9272956
  12. 12. Olsen LC, Aasland R, Fjose A. A vasa-like gene in zebrafish identifies putative primordial germ cells. Mech Dev. 1997;66(1):95–105.
  13. 13. Koprunner M, Thisse C, Thisse B, Raz E. A zebrafish nanos-related gene is essential for the development of primordial germ cells. Gene Dev. 2001;15(21):2877–85. pmid:11691838
  14. 14. Muller K, Thisse C, Thisse B, Raz E. Expression of a linker histone-like gene in the primordial germ cells in zebrafish. Mech Dev. 2002;117(1):253–7.
  15. 15. Wibrand K, Olsen LC. Linker histone H1M transcripts mark the developing germ line in zebrafish. Mech Dev. 2002;117(1–2):249–52. pmid:12204265
  16. 16. Weidinger G, Stebler J, Slanchev K, Dumstrei K, Wise C, Lovell-Badge R, et al. dead end, a Novel Vertebrate Germ Plasm Component, Is Required for Zebrafish Primordial Germ Cell Migration and Survival. Curr Biol. 2003;13(16):1429–34. pmid:12932328
  17. 17. Strasser MJ, Mackenzie NC, Dumstrei K, Nakkrasae LI, Stebler J, Raz E. Control over the morphology and segregation of Zebrafish germ cell granules during embryonic development. BMC Dev Biol. 2008;8:1–16.
  18. 18. Blaser H, Eisenbeiss S, Neumann M, Reichman-Fried M, Thisse B, Thisse C, et al. Transition from non-motile behaviour to directed migration during early PGC development in zebrafish. J Cell Sci. 2005;118(17):4027–38. pmid:16129886
  19. 19. Ko MSH. Embryogenomics developmental biology meets genomics. Trends Biotechnol. 2001;19(12):511–8. pmid:11711195
  20. 20. Wang Z, Gerstein M, Snyder M. RNA-Seq a revolutionary tool for transcriptomics. Nat Rev Genet. 2009;10(1):57–63. pmid:19015660
  21. 21. Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008;18(9):1509–17. pmid:18550803
  22. 22. Tang F, Barbacioru C, Wang Y, Nordman E, Lee C, Xu N, et al. mRNA-Seq whole-transcriptome analysis of a single cell. Nat Methods. 2009;6(5):377–82. pmid:19349980
  23. 23. Guo F, Yan L, Guo H, Li L, Hu B, Zhao Y, et al. The Transcriptome and DNA Methylome Landscapes of Human Primordial Germ Cells. Cell. 2015;161(6):1437–52. pmid:26046443
  24. 24. Aanes H, Winata CL, Lin CH, Chen JP, Srinivasan KG, Lee SG, et al. Zebrafish mRNA sequencing deciphers novelties in transcriptome dynamics during maternal to zygotic transition. Genome Res. 2011;21(8):1328–38. pmid:21555364
  25. 25. White RJ, Collins JE, Sealy IM, Wali N, Dooley CM, Digby Z, et al. A high-resolution mRNA expression time course of embryonic development in zebrafish. eLife 2017;6:1–32.
  26. 26. Saito T, Fujimoto T, Maegawa S, Inoue K, Tanaka M, Arai K, et al. Visualization of primordial germ cells in vivo using GFP-nos1 3'UTR mRNA. Int J Dev Biol. 2006;50(8):691–9. pmid:17051479
  27. 27. Weidinger G, Wolke U, Köprunner M, Klinger M, Raz E. Identification of tissues and patterning events required for distinct steps in early migration of zebrafish primordial germ cells. Development. 1999;126(23):5295–307. pmid:10556055
  28. 28. Raz E. Primordial germ-cell development: the zebrafish perspective. Nat Rev Genet. 2003;4(9):690–700. pmid:12951570
  29. 29. Knaut H, Werz C, Geisler R, Consortium TTS, Nusslein-Volhard C. A zebrafish homologue of the chemokine receptor Cxcr4 is a germ-cell guidance receptor. Nature. 2003;421(6920):279–82 pmid:12508118
  30. 30. Knaut H, Pelegri F, Bohmann K, Schwarz H, Nüsslein-Volhard C. Zebrafish vasa RNA but Not Its Protein Is a Component of the Germ Plasm and Segregates Asymmetrically before Germline Specification. J Cell Biol. 2000;149(4):875–88. pmid:10811828
  31. 31. Braat AK, Water Svd, Goos H, Bogerd J, Zivkovic D. Vasa protein expression and localization in the zebrafish. Mech Dev. 2000;95:271–4. pmid:10906476
  32. 32. Weidinger G, Wolke U, Koprunner M, Thisse C, Thisse B, Raz E. Regulation of zebrafish primordial germ cell migration by attraction towards an intermediate target. Development. 2002;129(1):25–36. pmid:11782398
  33. 33. Boldajipour B, Mahabaleshwar H, Kardash E, Reichman-Fried M, Blaser H, Minina S, et al. Control of chemokine-guided cell migration by ligand sequestration. Cell. 2008;132(3):463–73. pmid:18267076
  34. 34. Walser CB, Lipshitz HD. Transcript clearance during the maternal-to-zygotic transition. Curr Opin Genet Dev. 2011;21(4):431–43. pmid:21497081
  35. 35. Gross-Thebing T, Yigit S, Pfeiffer J, Reichman-Fried M, Bandemer J, Ruckert C, et al. The Vertebrate Protein Dead End Maintains Primordial Germ Cell Fate by Inhibiting Somatic Differentiation. Dev Cell. 2017;43(6):704–15. pmid:29257950
  36. 36. Yamaji M, Jishage M, Meyer C, Suryawanshi H, Der E, Yamaji M, et al. DND1 maintains germline stem cells via recruitment of the CCR4-NOT complex to target mRNAs. Nature. 2017;543(7646):568–72. pmid:28297718
  37. 37. Otten AB, Theunissen TE, Derhaag JG, Lambrichs EH, Boesten IB, Winandy M, et al. Differences in Strength and Timing of the mtDNA Bottleneck between Zebrafish Germline and Non-germline Cells. Cell Rep. 2016;16(3):622–30. pmid:27373161
  38. 38. Mitsui K, Tokuzawa Y, Itoh H, Segawa K, Murakami M, Takahashi K, et al. The Homeoprotein Nanog Is Required for Maintenance of Pluripotency in Mouse Epiblast and ES Cells. Cell. 2003;113(5):631–42. pmid:12787504
  39. 39. Chambers I, Tomlinson SR. The transcriptional foundation of pluripotency. Development. 2009;136(14):2311–22. pmid:19542351
  40. 40. Onichtchouk D. Pou5f1/oct4 in pluripotency control: insights from zebrafish. Genesis. 2012;50(2):75–85. pmid:21913309
  41. 41. Lee MT, Bonneau AR, Takacs CM, Bazzini AA, DiVito KR, Fleming ES, et al. Nanog, Pou5f1 and SoxB1 activate zygotic gene expression during the maternal-to-zygotic transition. Nature. 2013;503(7476):360–4. pmid:24056933
  42. 42. Leichsenring M, Maes J, Mossner R, Driever W, Onichtchouk D. Pou5f1 Transcription Factor Controls Zygotic Gene Activation In Vertebrates. Science. 2013;341(6149):1005–9. pmid:23950494
  43. 43. Wang H, Liu Y, Ye D, Li J, Liu J, Deng F. Knockdown of zebrafish Nanog increases primordial germ cells during early embryonic development. Dev Growth Differ. 2016;58(4):355–66. pmid:27125179
  44. 44. Kudoh T, Tsang M, Hukriede NA, Chen X, Dedekian M, Clarke CJ, et al. A Gene Expression Screen in Zebrafish Embryogenesis. Genome Res. 2002;11(12):1979–87. pmid:11731487
  45. 45. Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005;4(1):1–44. pmid:16646834
  46. 46. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:1–13.
  47. 47. Tzung KW, Goto R, Saju JM, Sreenivasan R, Saito T, Arai K, et al. Early depletion of primordial germ cells in zebrafish promotes testis formation. Stem Cell Rep. 2015;4(1):61–73. pmid:25434820
  48. 48. Tong SK, Hsu HJ, Chung BC. Zebrafish monosex population reveals female dominance in sex determination and earliest events of gonad differentiation. Dev Biol. 2010;344(2):849–56. pmid:20553901
  49. 49. Hsu CW, Pan YJ, Wang YW, Tong SK, Chung BC. Changes in the morphology and gene expression of developing zebrafish gonads. Gen Comp Endocrinol. 2018;265:154–9. pmid:29409886
  50. 50. Westerfield M. The zebrafish book. A guide for the laboratory use of zebrafish (Danio rerio). 4th ed. Univ of Oregon Press, Eugene. 2000.
  51. 51. Kimmel CB, Ballard WW, Kimmel SR, Ullmann B, Schilling TF. Stages of Embryonic Development of the Zebrafish. Dev Dynam. 1995;203:253–310. pmid:8589427
  52. 52. Chen J, Suo S, Tam PP, Han JJ, Peng G, Jing N. Spatial transcriptomic analysis of cryosectioned tissue samples with Geo-seq. Nat Protoc. 2017;12(3):566–80. pmid:28207000
  53. 53. Bolger AM, Lohse M, Usadel B. Trimmomatic A flexible trimmer for Illumina Sequence Data. Bioinformatics. 2014;30(15):2114–20. pmid:24695404
  54. 54. Pertea M, Kim D, Pertea GM, Leek JT, Salzberg SL. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat Protoc. 2016;11(9):1650–67. pmid:27560171
  55. 55. R CT. R: A language and environment for statistical computing. 2017.
  56. 56. Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57. pmid:19131956
  57. 57. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40. pmid:19910308