Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Multiomics profiling of zebrafish embryonic cell line PAC2 across growth phases to assess its relevance for toxicological studies

  • Mihai-Ovidiu Degeratu,

    Roles Conceptualization, Formal analysis, Investigation, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Eawag – Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland, Department of Environmental Systems Science, ETH Zürich, Zürich, Switzerland

  • Jessica Bertoli,

    Roles Formal analysis, Investigation, Validation, Writing – review & editing

    Affiliations Eawag – Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland, Department of Environmental Systems Science, ETH Zürich, Zürich, Switzerland

  • Nikolai Huwa,

    Roles Investigation, Resources, Writing – review & editing

    Affiliation Eawag – Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland

  • David Lopez Rodriguez,

    Roles Formal analysis, Visualization, Writing – review & editing

    Affiliations Eawag – Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland, Department of Biomedical Sciences, University of Lausanne, Lausanne, Switzerland, Institute of Earth Surface Dynamics, University of Lausanne, Canton de Vaud, Switzerland

  • Marion Revel,

    Roles Writing – review & editing

    Affiliation Eawag – Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland

  • René Schönenberger,

    Roles Investigation, Methodology, Writing – review & editing

    Affiliation Eawag – Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland

  • Colette vom Berg,

    Roles Conceptualization, Funding acquisition, Supervision, Writing – review & editing

    Affiliation Eawag – Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland

  • Kristin Schirmer,

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing

    Affiliations Eawag – Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland, Department of Environmental Systems Science, ETH Zürich, Zürich, Switzerland

  • Ksenia J. Groh

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    ksenia.groh@eawag.ch

    Affiliation Eawag – Swiss Federal Institute of Aquatic Science and Technology, Dübendorf, Switzerland

Abstract

Permanent fish cell lines offer promising alternatives to traditional animal models for environmental risk assessment of chemicals. However, to facilitate broader uptake into toxicity testing practice, a better understanding of functional capacities and expression of toxicologically relevant molecular targets is needed. Here, we present an extensive molecular profiling of the zebrafish embryonic cell line PAC2, combining global proteomics across cell population growth phases (over 7300 protein groups) with matched transcriptomics at exponential and stationary phases (over 14500 transcripts). Proteome coverage was sufficiently deep to reveal functional insights consistent with those derived from transcriptomics data, despite differences in the total number of measured genes. Major gene expression shifts detected upon transition from exponential to stationary cell population growth phase indicated reduction in DNA replication, translation, metabolism, and cell cycle regulation, along with increased stress and immune system responses, and extracellular matrix remodeling. Functional annotation revealed expression of core cellular processes along with a number of toxicologically relevant pathways, including xenobiotic metabolism, stress signaling, and nuclear receptors responsive to important chemical classes, such as steroids (e.g., estrogens, glucocorticoids) and chemicals known to disrupt lipid metabolism, such as through interaction with peroxisome proliferation activating receptors. These findings reinforce the potential of PAC2 cells to offer a versatile in vitro model for studying fish cell biology and omics-enhanced exploration of chemical toxicity mechanisms, aided by the well-developed molecular annotation in zebrafish. Moreover, the analysis approaches developed in this work offer a blueprint for molecular baseline characterization of other fish cell lines. This work thus strengthens the mechanistic foundation supporting the use of fish cell lines as alternative models in aquatic toxicity testing and risk assessment.

Introduction

Chemical toxicity testing traditionally relies on in vivo tests using animal models. Fish are widely used as test animals for environmental risk assessment [1] due to their ecological relevance and sensitivity to environmental contaminants known to accumulate in freshwater systems [2]. However, whole-animal fish tests are both resource-intensive and ethically contentious. As millions of fish are used annually to test chemicals and water samples [35], these tests are increasingly scrutinized in light of the 3Rs (replacement, reduction and refinement) principles [6], similar to other animal-based test methods.

In the last two decades, the toxicology community has witnessed an ongoing shift toward increased reliance on alternative (non-animal) test approaches (also known as new approach methodologies or NAMs) that aim to reduce or eliminate animal use [7]. This new paradigm relies on the mechanistic understanding of toxicity, postulating that chemical-induced molecular and cellular changes precede higher-level effects and hence could be used to predict whole-organism toxicity outcomes [8,9]. Among ecotoxicological NAMs, permanent fish cell lines offer a variety of practical approaches to explore molecular and cellular toxicity mechanisms [3].

A wide variety of fish cell lines, derived from different tissues and species, are available [10,11] and have been increasingly used for aquatic toxicity testing [12]. Among these, cell lines derived from rainbow trout have been used most often [1317], including as a model system for the ISO Standard [18] and the OECD Test Guideline (TG) 249 [19], which both use the gill-derived RTgill-W1 cell line for prediction of acute fish toxicity. Standardization of in vitro tests helps ensure their reproducibility, which is an essential requirement for regulatory uptake. The fish invitrome framework for animal-free toxicity assessment further builds on the OECD TG 249 by introducing a suite of additional modules which address other endpoints of regulatory relevance, e.g., for bioaccumulation or chemical effects on growth [5]. These modules can include additional cell lines derived from other organs or species, along with endpoint-specific test methods and computational models. In this context, cell lines from zebrafish (Danio rerio) could offer a unique advantage for molecular investigations thanks to extensive genomic and bioinformatic resources already available for this species [2022]. This in turn could support the development of more regulatory-relevant, mechanistically anchored omics-enhanced assays, e.g., for prediction of chronic toxicity outcomes [5,23]. One example of a permanent zebrafish cell line that has already been used in several toxicology-relevant studies is the PAC2, derived in 1994 from 24 h post-fertilization embryos [24,25]. These studies included investigations of heat shock responses [26], genotoxicity [27], oxidative stress [28], and mercapturic acid pathway involved in phase II biotransformation of electrophiles [21].

One limitation currently hindering a broader applicability of the fish invitrome approach stems from insufficient knowledge about the molecular and functional repertoire of fish cell lines. Understanding toxicity-relevant properties and functional capacities of different fish cells lines, i.e., which genes and pathways are expressed and regulated under various conditions, could help to further improve their utility for toxicity tests relying on omics-enhanced assessment of chemical safety [29]. The growth of cell populations in in vitro culture undergoes two main phases: the exponential phase, characterized by active proliferation; and the stationary phase, marked by a plateau in cell proliferation, often coinciding with the formation of a confluent cell monolayer [30]. Both phases have been purposefully used for toxicity assessment. Acute toxicity effects are typically assessed with cells in confluent monolayers, corresponding to stationary phase [31], while effects on cell population growth, predictive of chemical impacts on fish growth in vivo, are tested in exponentially proliferating cells [32]. The transition from exponential to stationary phase is accompanied by massive gene expression shifts [33], which could also influence the expression of pathways relevant to stress responses and toxicity. Such growth phase-specific gene expression changes have not yet been examined in detail in fish cell models.

Among the various molecular profiling methods, transcriptomics has been used most widely due to the maturity of technology and deep coverage. However, mRNA levels often do not reliably reflect protein abundance, especially under dynamic conditions. Several biological mechanisms could cause mRNA-protein discrepancies, including those occurring post-transcriptionally, such as mRNA degradation and translational repression; as well as post-translationally, such as protein ubiquitylation and degradation [3436]. Therefore, multiomics profiling, combining proteomics and transcriptomics, could enable a more comprehensive understanding of gene expression changes across growth phases [3739]. No prior study has yet conducted such an integrated multiomics analysis across growth phases in zebrafish PAC2 cells.

In this study, we conducted an in-depth molecular characterization of the permanent zebrafish cell line PAC2 [24,25], sampled in different phases of cell population growth and extensively profiled on the protein level with label-free mass spectrometry-based global proteomics. We further complemented these data by transcriptomics analysis with RNAseq, performed at two matching time points in order to gain a more comprehensive view of gene expression profiles. This integrated approach allowed us to (i) reveal mRNA and protein expression changes across different phases of PAC2 cell population growth; (ii) assess the degree of correlation between transcriptomic and proteomic profiles measured in matched samples and compare the insights delivered by the two techniques; (iii) explore the functional capacity of the PAC2 cells, particularly with regard to toxicologically relevant genes and pathways expressed by these cells. This information is expected to increase the value of the PAC2 cell line and facilitate its broader use in toxicological testing, as well as support its integration into the fish invitrome framework [5], especially for omics-enhanced assessments of chemical safety.

Results

Growth of PAC2 cells in culture and sampling for omics analyses

PAC2 cell population growth was characterized in 96-well plates over 3 weeks (Figs 1 and S1). In the exponential growth phase, cell population doubling time (estimated between days 3–13) was approximately 3.3 days. Although the cell numbers measured by Hoechst nuclear staining appeared to increase continuously throughout the whole observation period, metabolic activity started plateauing around day 11 post-seeding (Fig 1). Membrane integrity measurements also indicated that exponential growth could be slowing down around day 11–13 (S1 Fig). Moreover, metabolic activity-based curves resembled the shape of a curve constructed based on the direct cell counts data generated previously in 12-well plates with otherwise similar culturing conditions [40]. Therefore, also considering that the Hoechst staining of the nucleus does not distinguish between live and dead cells, we relied on metabolic activity measurements for selecting the timepoints to collect samples for omics analyses. Selected timepoints were day 3, 8 and 14 post-seeding, assuming that they correspond, respectively, to the early exponential, mid-exponential, and stationary phases of cell population growth (Fig 1). These samples comprised proteomics dataset 1 (PD1). On days 8 and 14 of the same experiment, we also collected matched samples for transcriptomics analysis, from which we derived a transcriptomics dataset (TD). This allowed to increase the number of profiled genes and enabled a direct comparison between molecular insights provided by proteomics and transcriptomics data.

thumbnail
Fig 1. Growth curve of the PAC2 cell line based on metabolic activity measurements and nuclear staining counts.

PAC2 cells were seeded and cultured in 96-well plates. Metabolic activity based on alamarBlue™ fluorescence (arbitrary units, blue circles line, left y-axis) and cell counts based on Hoechst 33342 nuclear staining (black triangles line, right y-axis) were recorded at multiple timepoints (days) throughout the culturing duration. Data are shown as mean ± SD of technical replicates (n = 3-12) for each biological replicate (n = 3). Trend lines were added using LOESS smoothing. Red boxes and black labels above them indicate the days on which the samples for proteomics dataset 1 (PD1) and transcriptomics dataset (TD) were collected. Representative phase contrast microscopy images included in the top left and bottom right corners show the appearance of PAC2 cells grown in T75 flasks at approximately 60% and 95% confluency, respectively, corresponding to exponential and stationary phases. Images were captured with 100x magnification, scale bar = 100µm.

https://doi.org/10.1371/journal.pone.0346109.g001

In addition to PD1, we included in this study a second set of PAC2 samples, collected in the frame of a targeted phosphoproteomics study published previously [40], and used them to derive proteomics dataset 2 (PD2). These samples provided an even more resolved representation of different cell population growth phases: specifically, days 4, 7, 11, 18 and 28, assumed to correspond, respectively, to early, mid- and late exponential, and early and late stationary phases. This allowed enhancing the depth and reliability of proteomics-derived insights into molecular changes occurring across cell population growth phases. Further, while PD1 and PD2 samples differed in several parameters of experimental design (summarized in Table 1), they were analyzed by the same global proteomics data acquisition method in the frame of the current study. This additionally allowed us to address important technical questions related to the comparability of proteomics sample preparation procedures and robustness of mass spectrometry-based global proteomics analyses.

thumbnail
Table 1. Overview of experimental design parameters for omics datasets analyzed in this study.

https://doi.org/10.1371/journal.pone.0346109.t001

Proteomics profiling of PAC2 cells across cell population growth phases

Global proteomics analysis showed a high consistency of precursor, peptide, protein and protein group identification numbers across technical and biological replicates within both datasets, with %CV values of 4.6%, 5.5%, 2.8% and 2.0%, respectively (S1 Table). Across both the PD1 (S2 Table) and PD2 (S3 Table) combined, up to 11796 proteins could be identified, corresponding to 7432 protein groups (protein groups comprise either uniquely identified proteins or sets of indistinguishable proteins inferred from the identified peptides). Among these, 112 were contaminants and consequently removed from downstream analysis. Over 95% (7165) of the identified 7320 Danio rerio protein groups were shared between the two datasets (Fig 2, top Venn diagram). Similarly, high overlaps of the identified protein groups were observed between the different sampling days within each dataset (Fig 2, bottom Venn diagrams). The abundance of measured proteins spanned over 6 orders of magnitude, with log10 median protein group quantity ranging from <1 to >7.

thumbnail
Fig 2. Numbers and overlap of protein groups identified within and between proteomics datasets 1 (PD1) and 2 (PD2).

In total, 7320 protein groups were identified in the PAC2 cell line in the two datasets combined, the majority of which were shared between the two datasets (see top Venn diagram) and between the different sampling days within the same dataset (see bottom Venn diagrams for PD1 (left) and PD2 (right)). 7195 PD1 protein groups detected in day 8 and day 14 samples were used for multiomics comparison with matched transcriptomics samples.

https://doi.org/10.1371/journal.pone.0346109.g002

In the next step, identification data was prepared for quantitative analysis. For this, protein groups with <70% detection frequency were removed to ensure statistical power, and protein groups with detection frequency ≥70% were imputed if they did not appear in all replicates (see S2 Fig for histograms of measured and imputed counts in each replicate). Following this procedure, the number of protein groups included in PD1 and PD2 was reduced to 6751 (S2 Table) and 6927 (S3 Table), respectively, and the combined list of PD1 and PD2 together was reduced to 6803 protein groups. Within these lists, less than 10% of protein groups were single hits (i.e., identified by only one peptide). For example, in the shared list of 6803 protein groups, 445 protein groups were single-peptide identifications. The reduced lists were used to perform quantitative analyses such as the principal component analysis (PCA) and differential protein expression analysis presented below.

PCA showed a clear separation between all conditions analyzed (Fig 3A). Samples belonging to the two datasets (PD1 and PD2) were separated in principal component 1 (PC1), while PC2 primarily showed separation between growth phases (sampling days) within each dataset. A similar trend, i.e., a progression over time from early exponential (days 3–4) to (late) stationary phase (days 14–28), was observed in both PD1 and PD2 (Fig 3A). Days 3 and 8 from PD1 clustered closely together, similarly to days 4 and 7 from PD2. These samples, corresponding to early/mid-exponential growth phase, were strongly separated from samples corresponding to the stationary growth phase, i.e., day 14 of PD1 and days 18 and 28 of PD2, the latter two closely clustered together too. Interestingly, day 11 of PD2 was located between the exponential- and stationary-phase related samples of this dataset, indicating that it may reflect a transition period between the two distinct growth phases.

thumbnail
Fig 3. Comparison of protein expression trends in proteomics datasets 1 and 2 (PD1 and PD2).

Panel (A) shows principal component analysis (PCA) performed on 6803 PAC2 protein groups shared between PD1 (orange) and PD2 (grey). Convex hulls show the outer boundaries formed by biological replicates within each group. Panel (B) shows correlation and density plot for the same list of 6803 protein groups. For each dataset (PD1 and PD2), abundance values of each protein group were averaged across biological replicates (n = 4 for PD1 and n = 3 for PD2) from all conditions (i.e., sampling days). The diagonal line indicates equality (x = y); points above the line have higher values in PD2, and those below have higher values in PD1. Color gradient indicates the density of the data points (blue – most dense; red and yellow – intermediate; green – least dense).

https://doi.org/10.1371/journal.pone.0346109.g003

The separation of PD1 and PD2 samples could be due to differences in absolute abundance values measured for each protein group in the two datasets, as Pearson correlation coefficient between the two datasets was lower than 1 (r = 0.92, Fig 3B). These differences appeared to be unevenly distributed within the highest- and lowest-abundance analytes in each dataset. The former and the latter showed, respectively, slightly higher and slightly lower abundance values in PD1 compared to PD2 (see S3 Fig for a detailed analysis of top-200 high- and low-abundant protein groups). This uneven distribution of absolute abundance differences could affect the degree of variance within the two datasets, which was likely reflected in the PCA. The differences in the absolute abundance values could be caused by the differences in the peptide amount loaded on the column in chromatographic runs performed for PD1 and PD2 samples (see Table 1), which is reflected by the progressive decrease in Pearson coefficient from 0.964 to 0.827 for comparisons of abundance values between day 3 of PD1 (1 µg loaded) through samples with 2 µg loading (PD1 day 8 and 14) to samples with 4 µg loading (days 4, 7, 11, 18 and 28 of PD2), shown in S4 Fig. For respective comparisons of 2 and 4 µg samples, Pearson coefficients also decreased from 0.939 to 0.841 for day 8 samples, but for day 14 samples, they remained similar or even slightly increased (from 0.86 to 0.886). Of note, correlation between samples was expected to be less than perfect, because they represent different growth phases, accompanied by differential expression of a significant proportion of protein groups (see below). Hence, the observed correlation coefficients can still be considered high overall. Moreover, despite these differences in absolute values, the relative differences between the samples corresponding to exponential and stationary cell population growth phases within each dataset appeared to be retained, as shown by similar over-time PCA trends discussed above (Figs 3A and S5). Thus, PCA suggested that PD1 and PD2 data are comparable but cannot be directly combined for quantitative analysis aimed at identifying differentially expressed protein groups (DEPs). We therefore proceeded to perform differential expression analysis for PD1 and PD2 separately, followed by comparing the obtained results.

We identified 4450 DEPs in PD1 (corresponding to 62% of quantified proteome, S2 Table) and 3531 DEPs in PD2 (49% of quantified proteome, S3 Table), presented in Fig 4 as hierarchically clustered heatmaps. A total of 2520 DEPs were shared between the two datasets PD1 and PD2, and they turned out to represent the core of the two main co-expression groups progressing from early to late cell population growth phases: downregulated (1247 DEPs) and upregulated (1273 DEPs), see S6 Fig. To perform functional annotation and enrichment analysis, shared DEPs were converted to official gene symbols, resulting in 1229 genes from 1247 downregulated DEPs and 1270 genes from 1273 upregulated DEPs. Selected findings from this analysis are shown in the top panels (proteomics) of Fig 5A and 5B, corresponding to the downregulation and upregulation clusters, respectively; the complete outcome of DAVID Bioinformatics gene ontology (GO) analysis can be viewed in S4 Table. Enriched downregulated pathways were related to three main areas: nucleic acid metabolic process and DNA replication; ribosome biogenesis and protein folding; cell cycle process and its regulation. Enriched upregulated pathways were related to extracellular interactions; nutrient metabolism; and cytoprotective responses.

thumbnail
Fig 4. Hierarchical clustering and overlap of the differentially expressed protein groups (DEPs) identified in the proteomics dataset 1 (PD1) and 2 (PD2).

Heatmap columns correspond to the biological replicates within each dataset (n = 4 for PD1 and n = 3 for PD2) and each cell population growth phase (represented by sampling days 3, 8, 14 in PD1 and 4, 7, 11, 18, 28 in PD2), indicated at the bottom of each heatmap. DEPs were identified by ANOVA followed by permutation-based calculation (250 randomizations) of false discovery rate (FDR), significance threshold FDR < 0.05. Color gradient represents the DEPs z-score normalized expression values. The Venn diagram illustrates the overlap of hierarchically clustered DEPs across all conditions (days) within each dataset.

https://doi.org/10.1371/journal.pone.0346109.g004

thumbnail
Fig 5. Functional annotation and enrichment analysis of differentially regulated gene clusters identified with proteomics, transcriptomics, and multiomics data.

Panels (A) and (B) show results for downregulated and upregulated co-expression clusters derived from proteomics analysis (top two graphs, analysis performed on 1247 downregulated and 1273 upregulated differentially expressed protein groups (DEPs) shared between proteomics datasets 1 and 2); transcriptomics analysis (middle two graphs, analysis performed on 2306 downregulated and 2143 upregulated differentially expressed genes (DEGs)); and multiomics analysis (bottom two graphs, analysis performed on 690 downregulated and 446 upregulated genes found in both DEPs and DEGs lists from the matched multiomics dataset). Annotation was performed with KEGG pathways and gene ontology (GO) terms for biological process (BP), cellular component (CC), and molecular function (MF). Resulting pathways and terms were analyzed for enrichment against background proteome and/or transcriptome, and clustered based on gene similarity in DAVID Bioinformatics. Shown clusters include top 10 and several additionally selected (indicated by *) statistically significant terms/pathways of interest. The complete outcome of these analyses can be viewed in the S4, S7 and S8 Tables, corresponding to proteomics, transcriptomics, and multiomics data, respectively. Cluster score represents the geometric mean (-log p-value) of all GO terms and KEGG pathways included within a cluster. Cluster names and gene numbers were taken from the top significant pathway or term (sorted on y-axis based on false discovery rate (FDR), shown on x-axis).

https://doi.org/10.1371/journal.pone.0346109.g005

Transcriptomics profiling of PAC2 cells in exponential and stationary phases

Transcriptomics analysis was performed for matched samples corresponding to day 8 and day 14 within PD1 (see S5 Table), representing exponential and stationary cell population growth phases, respectively. RNAseq analysis generated counts for 30628 gene IDs in total. Following data filtering for very low-expressed genes, this number was reduced to 17570 genes and finally to 14574 transcripts that remained after filtering with a threshold of transcripts per million (TPM) >1 (S6 Table). Over 90% (13409) of identified transcripts were shared between the two sampled conditions (Fig 6A). These conditions, however, showed a clear separation in the PCA (Fig 6B), indicating the occurrence of major expression changes, similar to those observed in the proteomics data (see Figs 3A and S5).

thumbnail
Fig 6. Characterization of the transcriptomics dataset (TD).

(A) Numbers and overlap of 14574 gene transcripts reliably measured in the PAC2 cells sampled at two different cell culture growth phases (i.e., day 8 and day 14). (B) Principal component analysis (PCA) of the 14574 gene transcripts showed a clear separation between the two different growth phases. Convex hulls (purple areas) show the outer boundaries formed by biological replicates (n = 4) within each group. Note that replicate B3 in day 14 condition was lost during sample preparation.

https://doi.org/10.1371/journal.pone.0346109.g006

Differential expression analysis identified 2306 downregulated and 2143 upregulated differentially expressed genes (DEGs) in the TD (see S6 Table). For the TD cluster of downregulated genes, enriched terms showed a high overlap with all main areas revealed in the respective proteomics cluster, but there were also several new terms appearing among the top-enriched, including organelle organization, mitochondrion and generation of precursor metabolites and energy (compare top and middle panel in Fig 5A and see S4 and S7 Tables). For upregulated genes, enriched terms were again similar to proteomics data and related, e.g., to extracellular interactions and cytoprotective responses such as response to chemical stimulus and cytochrome P450 drug metabolism (compare top and middle panel in Fig 5B and see S4 and S7 Tables). In contrast, terms related to axon projection development appeared only in upregulated DEPs (Fig 5B, top panel), while upregulated DEGs showed enrichment of skeletal system development terms and negative regulation of immune system process (Fig 5B, middle panel).

Proteomics and transcriptomics provide comparable information about biological changes in PAC2 cells

To explore the overlap between proteomics- and transcriptomics-derived information about molecular and cellular changes occurring upon transition from exponential to stationary cell population growth phase in PAC2 cells, we carried out a differential gene expression analysis of day 8 vs day 14 condition in PD1 and TD and compared the identified DEPs and DEGs. For PD1, this analysis identified 1668 downregulated and 1571 upregulated DEPs (Fig 7), which had 98% and 95% overlap with the respective DEP clusters described in Fig 4. The terms enriched in these two clusters were also similar to those highlighted in the complete proteomics data (shown in Fig 5, top panel).

thumbnail
Fig 7. Hierarchical clustering and overlap of differentially expressed protein groups (DEPs) and differentially expressed genes (DEGs) in the matched multiomics dataset.

Heatmap columns correspond to the biological replicates (n = 4) within each dataset, and each of the two analyzed cell population growth phases (i.e., days 8 and 14), indicated at the bottom of each heatmap; note that replicate B3 in day 14 condition was lost during sample preparation. DEPs were identified by unpaired Student’s t-test followed by permutation-based calculation (250 randomizations) of false discovery rate (FDR), significance threshold FDR < 0.05. DEGs were identified by quasi-likelihood F-test with Benjamini-Hochberg correction, significance threshold for adjusted p-value (FDR)<0.05. Color gradient represents the DEPs’ and DEGs’ z-score normalized expression values. The Venn diagram illustrates the overlap of hierarchically clustered DEPs and DEGs across all conditions (days) within each dataset. Overlaps between DEPs and DEGs were identified by comparing respective lists converted to official gene symbols.

https://doi.org/10.1371/journal.pone.0346109.g007

To compare these DEPs with the DEGs identified in the TD (described in the preceding section), both lists were converted from gene identifiers originally used in each dataset (i.e., UniProt accessions in PDs and Ensembl gene IDs in the TD) into official gene symbols. This resulted in slightly different output numbers than were input (compare numbers shown in the middle area of Fig 7), which could be due to some IDs being less comprehensively annotated, leading to many-to-one or ambiguous mapping. Moreover, a few genes appeared as shared between down- and upregulation clusters within proteomics (4 genes) and transcriptomics (2 genes) lists. This could be due to the ambiguities introduced during the conversion from respective identifiers to gene symbols, but it could also imply the existence of (unresolved) isoforms of the same gene/protein that behave differently.

690 genes were significantly downregulated (FDR < 0.05) on both the protein and transcript levels, representing approximately 40% of downregulated DEPs and about one third (approximately 34%) of downregulated DEGs (Fig 7). Overlap between proteomics and transcriptomics data was even lower within the upregulated cluster, with 446 shared genes representing approximately 28% and 24% of all upregulated DEPs and DEGs, respectively (Fig 7). However, despite this low overlap and the relatively low numbers of shared genes, the terms enriched among the shared DEPs and DEGs in the downregulation cluster (see Fig 5A, bottom panel, and S8 Table for complete analysis output) were found to cover the same biological aspects that were identified as downregulated based on the individual analyses of proteomics (Fig 5A, top panel) and transcriptomics (Fig 5A, middle panel) data, i.e., related to DNA replication, translation, metabolism and cell cycle regulation. Similarly, enriched terms in the shared upregulated DEPs and DEGs included terms related to extracellular interactions and cytoprotective responses (Fig 5B, bottom panel), also identified in the individual proteomics (Fig 5B, top panel) and transcriptomics (Fig 5B, middle panel) analysis. Interestingly, the terms enriched in the shared proteomics/transcriptomics list also included both the axon guidance and immune system process terms – the two areas found to be uniquely enriched in the individual proteomics and transcriptomics lists, respectively (compare bottom, top and middle panels in Fig 5B), when analysed separately. The complete output of functional annotation and enrichment analysis for shared DEPs and DEGs can be viewed in S8 Table.

Top-changing genes (identified as showing the log₂ difference > |1.5| and p-value <0.01, see Fig 8) included 60 DEPs and 136 DEGs downregulated and 51 DEPs and 343 DEGs upregulated (see S9 Table for complete lists). Among these, 5 downregulated and 16 upregulated genes were shared between DEP and DEG lists (Fig 8 and Table 2).

thumbnail
Table 2. Shared genes found on both the protein and transcript lists of top-regulated exponential-to-stationary genes.

https://doi.org/10.1371/journal.pone.0346109.t002

thumbnail
Fig 8. Overlap between the top regulated differentially expressed protein groups (DEPs) and transcripts (DEGs) in PAC2 cells from day 8 to day 14.

The lists of 3239 DEPs from proteomics dataset 1 (PD1, data from days 8 and 14) and 4449 DEGs from the transcriptomics dataset (TD) were tested using unpaired Student`s t-test and plotted in volcano plots shown in left and right panel, respectively. Identifiers with increased expression (log₂ difference >1.5 and p < 0.01) are shown in red, corresponding to 51 DEPs (A) and 343 DEGs (B). Identifiers with decreased expression (log₂ difference < –1.5 and p < 0.01) are shown in blue, corresponding to 60 DEPs (A) and 136 DEGs (B). Grey dots represent the remaining DEPs and DEGs not meeting these thresholds. Overlaps between DEPs and DEGs were identified by comparing respective lists converted to official gene symbols.

https://doi.org/10.1371/journal.pone.0346109.g008

Multiomics integration model of gene expression changes between exponential and stationary phases

An unsupervised multiomics model, built to evaluate the shared variance between the proteomics and transcriptomics datasets, showed a clear separation between day 8 and 14 (Fig 9A). This trend was similar to that observed in proteomics (Fig 3A) and transcriptomics (Fig 6B) data individually. In the model, each biological replicate exhibited relatively small variance differences between the proteomics and the transcriptomics dataset (Fig 9A). Contribution to the total explained variance per component was nearly equal for transcriptomics and proteomics (Fig 9B).

thumbnail
Fig 9. Multiomics Partial Least Square (PLS) model showing separation between analyzed conditions corresponding to exponential and stationary cell population growth phases.

Panel (A) shows multiomics model built for data from days 8 and 14 from proteomics dataset 1 (PD1) and the corresponding matched samples from the transcriptomics dataset (TD). Arrows represent the variance difference between PD1 and TD for each biological replicate (n = 4 for day 8 and n = 3 for day 14), with shorter arrows indicating stronger agreement in variance. Panel (B) shows contributions of proteomics and transcriptomics data to the total explained variance per principal component (PC).

https://doi.org/10.1371/journal.pone.0346109.g009

Functional annotation analysis of all proteins and transcripts detected in PAC2 cells

Following the above-described analysis of DEPs and DEGs that accompany the transition from exponential to stationary growth phase in PAC2 cells, we wanted to explore which types of biological processes and functions, especially the toxicologically relevant ones, are being expressed in PAC2 cells overall. For this, we carried out a functional annotation analysis of all proteins and transcripts detected in PAC2 cells. To enable comparisons, original identifiers were converted to official gene symbol identifiers. As already observed previously, the output of this conversion did not yield the exact number of identifiers that were input. That is, 7320 protein groups (PD1 and PD2 combined) resulted in 7465 genes, while 14574 transcripts resulted in 12172 genes. 6644 genes were shared between the complete proteomics (combined PD1 and PD2) and transcriptomics (TD) identifications, corresponding to 89% and 55% of PDs and TD. Thus, around 11% (821) of protein groups detected by global proteomics did not have a corresponding transcript match in the RNAseq data. Functional annotation analysis of these genes in DAVID GO identified 6 clusters with significantly enriched terms related to extracellular region and immune response; cell adhesion; ligand-gated channel activity and neurotransmitter receptor activity; DNA-binding transcription factor activity; supramolecular fiber; and calcium signaling (S10 Table). Conversely, over half of the identified transcripts were also detected on the protein level, while around 45% were identified on the transcript level only, which is to be expected in multiomics studies since transcriptomics typically shows higher detection sensitivity than proteomics.

Functional annotation analysis of all proteins and transcripts detected in PAC2 cells (S11 Table) showed that, despite the almost 2-times difference between the overall gene counts present in the proteomics and transcriptomics datasets, similar functional categories were identified by both data types (compare Fig 10A and 10B). Additional information about each functional category is provided in Table 3, and a complete overview of Cytoscape categories can be seen in S11 Table.

thumbnail
Table 3. Overview of the main functional categories containing enriched terms and pathways represented in the proteome and transcriptome of PAC2 cells. For each category, the number of terms, the top statistically significant term (marked by *) and the most mapped term with its corresponding number of genes are shown.

https://doi.org/10.1371/journal.pone.0346109.t003

thumbnail
Fig 10. Overview of the main biological processes represented by the proteins and transcripts measured in zebrafish PAC2 cells.

Panel (A) shows representative gene ontology (GO) biological process terms identified within the 7465 genes corresponding to the 7320 protein groups of the PAC2 proteome. Panel (B) shows the same for the 12172 genes corresponding to the 14574 transcripts of the PAC2 transcriptome. Nodes represent GO biological process terms, and edges connecting them are based on semantic similarity scores calculated in Revigo. Only terms with similarity above the set threshold (0.7) are connected. Unconnected nodes were retained because they were considered informative, despite low similarity to other terms. Colors used to denote different functional categories explained in the shared legend underneath panels (A) and (B) were manually assigned based on proximity and nomenclature (shared common words).

https://doi.org/10.1371/journal.pone.0346109.g010

PAC2 cell type and relevance of PAC2 cells for toxicological studies, explored by multiomics analysis

Manual checks of selected gene families of interest were carried out to gain information on the PAC2 cell type as well as to explore the relevance of PAC2 cells for toxicological studies. First, we looked at the expression of selected markers of fibroblastic or epithelial cell types, specifically the fibroblastic markers: vim, vimentin; cdh2, cadherin 2, type 1, N-cadherin (neuronal); col1a1b, collagen, type I; fn1a, fibronectin; and epithelial markers: cdh1, cadherin 1, type 1, E-cadherin (epithelial); prr36b, mucin-5 AC; si:ch73-61d6.3, occludin; sdc2, syndecan; tjp2b, zona occludens 2. All markers except vimentin were found to be expressed in PAC2 cells at both the protein and transcript level (S12 Table and S7 Fig).

Next, we looked at the expression of selected toxicologically relevant gene families and pathways, also summarized in S12 Table. PAC2 cells were found to express molecular components related to cytoprotective mechanisms (e.g., detoxification and response to heat shock response, oxidative stress, endoplasmic reticulum stress), which are often involved in responses to toxic chemicals [41,42]. With regard to xenobiotic metabolism, PAC2 cells express several members of gene families related to active transport (ATP-binding casette (ABC) transporters, solute carriers (SLC)) and biotransformation, including phase I: cytochrome P450 (CYP); phase II: glutathione S-transferases (GST); uridine 5-diphospho-glucuronosyltransferases (UGT), sulfotransferases (SULT); and N-acetyltransferases (NAT). Further, PAC2 cells also express several toxicologically relevant signaling pathways, such as Mitogen-Activated Protein Kinase / c-Jun N-terminal Kinase (MAPK/JNK), AKT Serine/Threonine Kinase (AKT) and mechanistic Target of Rapamycin kinase (mTOR), as well as transcription factors and effector genes involved in response to hypoxia and metals, heat shock response, unfolded protein response and autophagy regulation. Lastly, PAC2 cells express several nuclear receptors, including those that govern the response to important chemical groups, such as estrogens, glucocorticoids, aryl hydrocarbons, and peroxisome proliferator substances. Expression was most often observed on transcript level only, but numerous genes were detected on both the transcript and protein level, and several more were detected on protein level only (S12 Table).

Discussion

We carried out a comprehensive profiling of molecular changes occurring in PAC2 cells during progression through the cell population growth phases. Extensive data obtained on the protein level using two distinct sets of samples (PD1, PD2) allowed to evaluate the robustness of global proteomics analysis. Proteomics analysis was complemented with transcriptomics data generated at two time points in exponential and stationary growth phases (TD), which were directly paired with corresponding samples from PD1. The inclusion of transcriptomics permitted comparison of the overall molecular insights provided by the two omics data streams with regard to gene expression changes occurring between growth phases and supported the in-depth evaluation of the overall repertoire of genes and pathways expressed in the PAC2 cells.

Data independent acquisition (DIA)-based global proteomics enables fast analysis and deep coverage

The total number of PAC2 protein groups measured in our study was 7320, with over 95% overlap observed across all technical and biological replicates. This comparatively deep coverage was achieved by separating the complete peptide sample (i.e., without prior fractionation) with approximately 2h nanoLC gradient on in-house made 30 cm-long column, coupled to MS/MS in data independent acquisition (DIA) mode [43], followed by directDIA (library-free) analysis [44] to identify and quantify peptides and protein groups using SpectronautTM software [45,46]. The speed and depth of proteome analysis achieved here constitute a significant improvement compared to previously published studies on proteomes of cell lines from zebrafish and other fish species, which typically reported 2–5 times lower coverage for unfractionated samples [4749], while a more comparable coverage has been achieved by including extensive fractionation steps, e.g., in a study with early zebrafish embryos [50]. The improvement reported here is largely based on the implementation of DIA mode, which offers superior sensitivity and more stable label-free quantification results compared to the data dependent acquisition (DDA) mode that was more commonly used in earlier studies [43,51,52]. The fast and sensitive global proteomics analysis workflow presented in this study paves the way for a broader application of global proteomics in ecotoxicological studies, especially those that require analyzing large sample numbers to obtain time- and concentration-resolved molecular profiles. This is often necessary to gain an improved understanding of toxicity mechanisms for unknown or insufficiently characterized substances. While further improvements in speed and sensitivity can be achieved with transition to the next generation of high-resolution mass spectrometers, such as Thermo Orbitrap Astral [53], the presented solution is already capable of obtaining highly informative proteomics profiles (see below) using more regular high resolution mass spectrometry instrumentation.

DIA proteomics enables robust profiling of proteome changes across PAC2 population growth phases

Analysis of protein expression changes across PAC2 cell population growth phases relied on two different experiments: PD1, performed with four biological replicates in the current study and partially matched with transcriptomics analysis; and PD2, performed with three biological replicates in the frame of a previously published targeted phosphoproteomics study [40]. While both experiments had the same overall goal of collecting PAC2 cell samples at different phases of cell population growth, they had several differences in cell culturing conditions, sample collection days, proteomics sample preparation techniques and the total peptide amount loaded on-column for LC-MS/MS analysis (Table 1). Regarding sample preparation, while the PD1 samples were subjected to the regular S-TrapTM assisted tryptic digest followed by desalting [54], PD2 samples were additionally subjected to Fe3+-NTA phosphopeptide enrichment procedure and represented the non-bound fractions collected during its performance [40,55]. Despite these differences, we observed approximately 98% overlap of protein groups (7165) identified within each dataset, while peptides and precursors showed approximately 95% overlap. Comparable overlaps were also observed across the different cell population growth phases (Fig 2). These observations underscore the high consistency and robustness of global proteomics analysis applied in our study. This also confirms that the two different sample preparation techniques delivered largely comparable peptide populations enabling the identification of the same proteins. Thus, passing the non-phosphorylated peptides through the Fe3+-NTA enrichment procedure and recollecting them as the non-bound fraction does not seem to lead to substantial losses that would have precluded identification of corresponding proteins.

Despite the high overlap in total identifications, PCA revealed significant separation between the two datasets, particularly in principal component 1 (PC1, Fig 3A). This separation could be due to dataset-specific differences in the absolute expression values, observed especially for the proteins located at the extremes of the dynamic abundance range, i.e., either very high- or very-low abundant proteins (see Figs 3B and S3). These deviations could be caused either by variations in peptide recovery introduced during the sample preparation procedure or from different storage regimes or because of the differences in the absolute peptide amounts injected on-column during LC-MS/MS runs (Table 1). Although storage conditions can theoretically influence peptide integrity, the specific combinations of storage temperature and duration used in this study are well within the established ranges known to preserve peptide stability. Regarding the peptide loading concentrations, the coefficient of correlation between different samples showed a tendency to decrease from 0.964 (for 1 µg vs 2 µg loading) to 0.899 (for 1 µg vs. 4 µg loading) in otherwise comparable sample pairs (S4 Fig). However, the correlation between samples cannot reach 100% because each sample is from a different growth phase where some differential expression is naturally present. Therefore, coefficients close to or above 0.9 can still be seen as high, and hence the variations caused by different loading amounts can be considered rather minor, which is also underscored by comparable peptide and protein identifications obtained across all samples (CV 2–5.5%, S1 Table).

Lastly, it is important to emphasize that, despite these differences in the absolute abundance values, the two datasets PD1 and PD2 revealed highly similar trends in the temporal protein expression and changes occurring across cell population growth cycle, demonstrated by both the PCA (Fig 3A) and the comparative analysis of DEPs identified within each dataset, with high overlap between the two datasets observed in the down- and upregulation clusters (Figs 4 and 5). This again demonstrates that DIA-based global proteomics analysis performed in this study is sufficiently robust against minor differences in cell culture parameters such as plate format, differing sample collection timepoints, or even differences in sample preparation parameters. Moreover, our analysis shows that the non-bound fraction from the Fe-NTA phosphopeptide enrichment procedure can be used to obtain reliable global proteomics data. This approach can be applied for studies interested in simultaneous characterization of proteome and phosphoproteome, as it would allow for these two omics streams to be obtained from fully paired samples (essentially, one sample), thus increasing the power for matched multiomics analysis.

Major gene expression shifts occur upon PAC2 transition from exponential to stationary growth phase

The transition from exponential to stationary cell population growth phase is known to represent a coordinated cellular reprogramming event characterized by major modulation of core metabolic and regulatory networks, reported in both prokaryotes [56] and eukaryotes [57,58]. In PAC2 as well, differential expression analysis revealed massive shifts in both the protein (Fig 4) and transcript (Fig 7) expression, indicating comprehensive remodeling of molecular architecture of PAC2 cells across cell population growth phases. The exact timing of exponential-to-stationary transition can be explored based on the characteristics that the cells exhibit over time in culture. The PAC2 growth curves defined either by direct cell count [40] or by Hoechst nuclear staining-based count or by metabolic activity (this study, Fig 1), suggested that the switch between these phases is likely to occur between day 11 and day 14 post-seeding. By this time, cells would be approaching confluence, which is likely associated with a decline in nutrient availability from the culture medium and an increase in cell-to-cell contact interactions. This in turn could trigger a starvation response as well as specific differentiation reactions, which could also accompany the transition from active proliferation to stationary maintenance mode.

These observations are corroborated by the observed changes in the molecular profiles of PAC2 cells across cell population growth phases, revealed in the heatmaps for identified DEPs (Figs 4 and 7) and DEGs (Fig 7). While coordinated gene expression differences between days 8 (exponential) and 14 (stationary) are clearly visible on both the transcript and protein levels (Fig 7), a more resolved view can be obtained from the proteomics data shown in Fig 4 for PD1 and PD2. Here, protein expression patterns at days 3 and 8 (PD1) or 4 and 7 (PD2), corresponding to exponential growth phase, showed clear differences to the patterns observed at day 14 (PD1) and days 18 and 28 (PD2), corresponding to stationary phase. In contrast, day 11, the timepoint present only in PD2, showed an intermediary pattern that significantly differed from both the former and the latter group (but was closer to the former). This suggests that day 11 likely corresponds to a transitionary state between the exponential and stationary phases. Thus, gene expression profiles measured on this day are likely reflective of a switch that the proteome undergoes when progressing from exponential to stationary phase. These observations could have implications for in vitro chemical toxicity assessment in prolonged-exposure scenarios, such as tests aiming to assess chemical effects on proliferation and cell population growth. The drastic gene expression changes that accompany the progression from exponential to stationary growth phase could be a confounding factor hindering the interpretability of chemical-specific effects observed in such test setups, especially considering that the exact timing of the transition state within a growth curve could be influenced by multiple experimental parameters. These could include cell line type, seeding density, frequency of culture medium exchange, and the type of cell culture vessel, as well as chemical-specific factors. To minimize the influence of this confounding factor, it could be suggested that molecular responses used for toxicological assessments are recorded either at multiple timepoints, or at timepoints sufficiently distant from the (presumed) transition point likely to occur within multi-day test designs.

The overlap between DEPs and DEGs was relatively modest (~30–40%). This was not unexpected and in line with previous reports from similar studies [59,60]. These studies collectively demonstrate that reliability of omics-informed toxicological predictions is typically improved by inclusion of multiple omics data streams, as they tend to complement each other. Furthermore, employing study designs that enable sample collection at multiple timepoints can provide additional valuable information on both the specificity and the temporal dynamics of physiological or toxicological responses under study.

Functional annotation analysis of DEPs and DEGs identified between the exponential and stationary growth phases delivered largely similar conclusions with regard to the accompanying changes in the internal molecular machinery of PAC2 cells. That is, both omics types revealed a similar set of affected functions (Fig 5, top and middle panel), which were also enriched in the core group of shared DEPs and DEGs (Fig 5, bottom panel). Genes that were downregulated upon transition from exponential to stationary phase covered cellular processes related to DNA replication and nucleic acid metabolic process; mitotic cell cycle process and its regulation; and biogenesis and assembly of ribosome and protein folding chaperones. This suggests a reduction in cell division/proliferation as well as stalled production of cellular structures and organelles, accompanied by reduced translation. The same mechanisms are highlighted by the top-5 genes identified as downregulated with p < 0.01 on both the protein and transcript level (Fig 8 and Table 2). For example, arhgef39 encodes a RhoGEF protein that regulates cytoskeletal dynamics, cell migration, and proliferation. While previous studies mostly looked at the role of this gene in cancer progression, its downregulation in fish cells may indicate reduced cytoskeletal remodeling, consistent with contact inhibition and decreased proliferation in stationary phase [61,62]. Another downregulated gene, phlda2, is a modulator of growth and apoptosis, known in mammals as an imprinted gene involved in growth regulation [63,64]. The osbpl3b gene belongs to the family of lipid transfer proteins highly conserved between zebrafish and humans [65]. Given the likely role in intracellular lipid trafficking, its downregulation may reflect a decreasing need for membrane turnover, in line with reduced cell growth and proliferation. Similarly, downregulation of the iron uptake receptor tfr1b suggests diminishing demand for iron, possibly due to lowered biosynthetic activity typical of quiescent cells [66]. In line with this, downregulation of chchd10 known to be involved in mitochondrial cristae maintenance and regulation of lipolysis, suggests a reduction in energy production, which could reflect a reduced energy demand associated with a metabolic shift toward maintenance rather than growth [6769].

Processes enriched in the upregulation clusters indicated that cells in the stationary phase prioritized functions related to cellular response to exogenous stimulus and cytoprotective mechanisms, such as detoxification (metabolism of xenobiotics by cytochrome P450) and increased lysosomal activity (autophagy), as well as organelle stabilization and adaptation to the extracellular environment, including extracellular matrix remodeling and changes in cellular adhesion. In addition, stationary-phase cells showed a massive upregulation of immune system-related responses, which will be discussed in more detail in the next section. The transition to the stationary phase can thus be seen as a cellular response to the changing environment, characterized by increased stress due to higher cell density, ensuing nutrient limitation and increased concentration of waste products of metabolism. Thus, robust responses observed at this stage may indicate the overall cellular capacity to mount efficient stress response.

Same as for the top-downregulated genes, most of the top-16 upregulated genes are related to the processes highlighted by the overall enrichment analysis in this cluster, except for the igfbp6b whose upregulation alone points to growth suppression – a process otherwise more prominent in the downregulation cluster. Upregulation of stress protection mechanisms is reflected by increased expression of hmox1a that plays a role in protection against oxidative damage, which indicates that stationary-phase cells could be experiencing oxidative stress resulting from buildup of metabolic waste [41,70]. Increased expression of rcn3 involved in maintenance of endoplasmic reticulum function and protein folding could confer protection against endoplasmic reticulum stress [71], while upregulation of sacs2, which is a chaperone involved in mitochondrial function and cytoskeletal organization, suggests the need to confer protection to mitochondria in order to maintain a basal level of energy production [72]. Upregulation of dap1b known as negative regulatory of autophagy, may seem counterintuitive [73,74]. However, since the activity of this protein is chiefly regulated through phosphorylation, changes in its abundance may have a less significant effect or even indicate a compensatory response [75,76]. Similarly, upregulation of sypl1 known to be involved in vesicle trafficking could be indicative of increased protein secretion and changes in secretory phenotype in growth-arrested cells, but could also be a compensatory response, since this gene is also known to promote cell survival in tumors by inhibiting apoptosis [77]. Lastly, upregulation of numerous collagens (col2a1b and col6a2 among the top-16 and several others present among the upregulated DEPs) along with lumican (lum), known to be involved in organization of collagen fibers, and sparc (osteonectin), known to modulate cell-matrix interactions, point to structural reinforcement and massive remodeling of extracellular matrix [7881]. Most of the cellular machinery processes found to change between the exponential and stationary growth phases in PAC2 are similar to those reported by other studies looking at similar growth transitions in cultured cells [57,58].

Tissue-specific genes significantly changing upon transition from exponential to stationary phase

Apart from the more general cellular machinery responses described in the previous section, PAC2 cells in the stationary phase also showed several responses which could be viewed as more tissue-specific. This included enrichment of exocrine system development terms in the downregulation cluster and of terms related to neuronal processes and immune-system related responses in the upregulation cluster.

Specifically, upregulated DEPs showed enrichment for biological processes “axon guidance” and “regulation of neuron projection development”. These processes were not significantly enriched in upregulated DEGs but appeared again in the multiomics list of shared upregulated DEPs and DEGs (Fig 5). Notably, both these processes were also enriched in the full proteome, while another related term, “regulation of axonogenesis”, was highlighted in the full transcriptome annotation (Fig 10). These observations raised the question of whether this cell line starts exhibiting neuronal characteristics. Such specialization is theoretically possible, since PAC2 are derived from 24h-old zebrafish embryos and thus could retain some specific differentiation capacity [25,82]. However, many fish cell lines are generally known to have stem cell potential, so this observation may not be a specific feature of embryonic cells only. For example, strong expression of a stem cell marker lgr5 has been observed in permanent cell lines derived from rainbow trout gut [83]. Nonetheless, although our present evidence does not allow for concluding that PAC2 cells could transform into neuronal cells over time, future research could investigate their differentiation potential in this regard.

Enrichment of the terms “negative regulation of immune system process” and “immune system process” was observed in the upregulated DEGs and in the shared upregulated DEPs/DEGs list, respectively. While no significant enrichment was found among upregulated DEPs, several immune system-related genes were found among the top-16 genes upregulated on both the transcript and protein levels, as well as in the DEG list overall. For example, dhx58 is a pattern recognition receptor involved in antiviral innate immunity [84], while stat2 is a central activator of antiviral-like gene expression cascades, but is also known to play a role in regulating inflammation and promoting cell survival [8588]. Ifi44g, ifit16 and isg15 represent downstream targets of interferon signaling, with the two former ones being interferon-induced proteins which have been linked to viral defense and inflammatory pathways and the latter one (isg15) being ubiquitin-like modifier which conjugates to proteins in response to interferon signaling [89]. The expression of various immune system components in fish cell lines and their ability to mount active responses to immune challenges has been demonstrated previously, for example, in rainbow trout gut cells (RTgutGC) treated with lipopolysaccharide [14] or in zebrafish embryonic line Z428 challenged by Singapore Grouper Iridovirus [47]. The upregulation of these genes in the stationary phase of unchallenged PAC2 cells could point to the induction of interferon-like signaling as part of an antiviral-like response pathway, which could represent a generalized defense mechanism that can be triggered by low-level stress even in sterile cell culture condition.

Functional characteristics of PAC2 cells revealed by multiomics profiling

To evaluate PAC2 functional capacities, we next explored the obtained proteomics and transcriptomics profiles in their entirety. Our PAC2 datasets contain almost twice as many transcripts measured as protein groups (Fig 7). This was expected since RNAseq technology is known to provide deeper coverage than mass spectrometry-based global proteomics. However, as our analyses revealed, the level of proteomics coverage attained in the present study may already be sufficient for many research applications. Indeed, as shown in Fig 10 and Table 3, the major functional categories represented by identified genes were shared between both datasets, and the number of associated terms was also comparable, except in the categories “Regulatory processes” and “Response to external stimuli”, where transcriptomics data achieved a 2-times higher term representation (Table 3). However, higher representation of transcripts of genes with regulatory function is in fact to be expected, because many regulatory components are typically expressed at low levels and often transiently, which makes their detection on protein level even less likely. Thus, both the global proteomics and transcriptomics data obtained in our study were able to reveal highly similar biological processes despite the difference in total number of genes detected by each omics technology.

Interestingly, proteomics analysis also identified approximately 800 proteins which did not have a corresponding transcript detected in the matched samples, which showed enrichment of terms related, e.g., to extracellular space and activity of membrane receptors and ligand-gated channels. Many of these proteins appear to be membrane proteins, detected thanks to the SDS-based cell lysis procedure employed in our sample preparation protocol, which is known to efficiently solubilize many membrane proteins [54]. The absence of corresponding transcripts’ detection could be due to the existence of alternative splicing isoforms and difficulties with detecting polyadenylated regions of some mRNAs [90] or to generally low transcript expression in conjunction with higher protein stability, as for example some membrane proteins with slower turnover rates could be retained even when corresponding mRNA expression has already ceased [91].

Another interesting observation was that a large group of terms highlighted in both transcriptomics and proteomics datasets was related to developmental processes and growth, such as “embryo development”, “chordate embryonic development” as well as development of “circulatory system”, “hemopoiesis”, “myeloid cell differentiation”, “connective tissue” and “cranial skeletal system”. In addition, “locomotory behavior” and “digestive tract development” were highlighted in the proteomics dataset only. This evidence reveals that some of the molecular characteristics of the PAC2 cell line could be retained from the early zebrafish embryonic tissue it was originally derived from [11,24].

Expression of cell morphology-related molecular markers in PAC2 cells

Molecular profiling can also help shed more light on the question of morphological classification of the PAC2 cell line. Originally, it was classified as “fibroblast-like” [24]. However, the visual appearance of the PAC2 cells currently cultured in our laboratory may question this classification, as the cells exhibit rather an epithelial-like morphology (Fig 1) and express a number of proteins representative of both morphologies, as we highlighted in a short conference report published last year based on preliminary analysis of PD1 [92]. The analysis of PD1/PD2 proteomics along with the transcriptomics data obtained in the current study has now confirmed the expression of both types of markers in PAC2 on both the protein and transcript level (S12 Table). Namely, fibroblast-like markers expressed in PAC2 cells include fibronectin, N-cadherin and collagen type I, while expressed epithelial-like markers include E-cadherin, occludin, zona occludens 2, mucin-5 AC, and syndecan (S7 Fig). Importantly, vimentin, a well-known marker of fibroblast morphology, was not detected on the protein level, while on the transcript level it was detected only in the unfiltered counts dataset and hence its expression is negligible. Comprehensive multiomics evidence obtained in our study thus reinforces the possible need to reconsider the morphological classification of the PAC2 cell line that we voiced previously [92].

Expression of toxicologically relevant gene groups in PAC2 cells

Zebrafish PAC2 cells were found to express a number of toxicologically relevant gene families and pathways (S12 Table), underscoring the relevance of this cell model for use in toxicological studies. The biotransformation capacity of cell models is frequently a matter of concern, as the extent of in vitro cultured cells’ ability to biotransform xenobiotics is often unclear. In PAC2 cells, for example, expression of cyp1a was neither detected in our study nor could be detected or induced (as checked on mRNA level) by others, in line with their observation that PAC2 cells are not able to biotransform polycyclic aromatic hydrocarbons such as BaP [27]. However, we found that PAC2 cells do express aryl hydrocarbon receptor 2 (ahr2) and aryl hydrocarbon nuclear translocator 2 (arnt2), detected on both the mRNA and protein level, along with a low mRNA level of PXR, as well as cyp1b1, a number of cyp2 isoforms and several other cyp genes (S12 Table), also reported by [27]. This suggests that the ability to biotransform other hydrocarbons or compounds such as estradiol could still be retained in the PAC2 cells. However, this would need to be confirmed experimentally in future studies. Cyp2 family genes expressed in PAC2 cells (cyp2aa1, cyp2aa11, cyp2aa6, cyp2ae1 and cyp2v2) could exhibit analogous activity to mammalian CYP2 enzymes, which are known to process diverse xenobiotics including drugs, pesticides and industrial chemicals. Further, cyp3a65 is a xenobiotic-inducible zebrafish homolog of CYP3A [93,94]. In mammals, CYP3A is a major drug-processing enzyme known to biotransform pharmaceuticals and dietary xenobiotics [42,95]. In addition, expression of cyp4t8 could confer the ability to process lipophilic xenobiotics, as in mammals some CYP4 isoforms, especially CYP4B1, are known to biotransform some industrial chemicals, in addition to their primary internal function in fatty acid oxidation [96,97]. With regard to phase II biotransformation, PAC2 cells were already demonstrated to express a wide repertoire of GST enzymes [21], many of which are relevant for handling xenobiotics [42,98]. Multiomics evidence obtained in the current study also demonstrates the expression of UGTs, SULTs, and NATs (S12 Table). Members of the ugt1 and ugt5 families are broadly active toward phenolic and steroid substrates and in zebrafish are known to glucuronidate bisphenol A, alkylphenol, diclofenac, and steroids [99]. For members of ugt2 family, substrate specificity is less defined, and they could be involved in internal metabolic functions instead. Sult1 and sult3 family members can catalyze sulfation of phenols and amine [100]. Lastly, NAT enzymes can be involved in detoxification of hydrazines and aromatic amines, such as beta-naphthylamine [101]. PAC2 cells were also found to express a wide variety of ABC transporters, of which many support cellular defense against chemical exposure (e.g., abcc1, abcc3, abcc4, abcc5, abcb6, abcb7, abcg2c) [102106]. Further, PAC2 cells express an enormous variety of SLCs, some of which are known to respond to changes in cellular homeostasis, such as slc30a5 [107], and slc36a4 [42], as well as genes involved in xenobiotic transport, including family members of slc16, slc22, slc47 [108111].

Next to biotransformation capacity, the ability of cultured cells to mount specific molecular signaling pathways in response to chemical exposure is also an important determinant of its suitability for certain toxicological investigations. In this regard, it is interesting to note that PAC2 cells might likely detect and respond to estrogenic compounds through estrogen receptors esr2a and esr2b. These cells also express both the glucocorticoid receptor (nr3c1, detected on both the mRNA and protein level) and mineralocorticoid receptor (nr3c2, detected on mRNA level only), which could make them a useful model for studies on effects of synthetic corticosteroids. Further, PAC2 cells might also have capacity to respond to lipid-regulating and peroxisome proliferator compounds, such as fibrates and industrial chemicals like phthalates, due to them expressing several peroxisome-proliferating receptors (pparab, pparda, ppargc1a, ppargc1b), as well as liver X receptor alpha (LXRalpha, nr1h3) and farnesoid X receptor (FXR, nr1h4), known to be involved in cholesterol transport, lipid and glucose metabolism, and inflammation. The presence of PPARs, LXRalpha and FXR suggests that PAC2 cells could offer a useful model for studying metabolic disruption, especially chemical effects on lipid metabolism [112116]. We also observed, albeit at low levels, the mRNA expression of several genes from the Rev-Erb family (nr1d1, nr1d2a, nr1d4b), which function as circadian modulators of metabolism [117]. Additionally, we detected the expression of several clock-controlled genes, e.g., from smad and tgf families (S12 Table), thus corroborating an earlier study in zebrafish larvae [118]. Hence, it might be interesting to continue exploring whether PAC2 cells exhibit differential sensitivity to chemicals dependent on the daytime, as this could establish them as a model for studying interaction of metabolic rhythms with responses to xenobiotics. Lastly, PAC2 were observed to express a number of other nuclear receptors, some of which are known to act as modulators of retinoic acid signaling (e.g., nr2 family genes [119]), or co-repressors of other NRs and could thus exert indirect influence on biotransformation of or response to xenobiotics in PAC2 cells.

Overall, our analysis shows that PAC2 cells express a broad molecular repertoire that includes major components of xenobiotic biotransformation across all biotransformation phases, along with a versatile suite of toxicologically relevant pathways and numerous transcriptional regulators capable of sensing a wide range of chemical stimuli and mounting transcriptional and functional responses to them. PAC2 cells could thus offer a versatile and biologically informative in vitro system for chemical screening and studying molecular mechanisms of toxicity. It could thus support the expansion of the fish invitrome framework toward covering a wider range of chemical toxicity mechanisms and effects. However, it should be pointed out that for a significant proportion of the genes discussed, expression was detected only on the mRNA level (S12 Table). Hence, further confirmation of the protein-level expression, along with functional validation, would be needed to confirm the presence and activity of the corresponding enzymes, receptors, and response pathways, which should be addressed in the future studies.

Conclusions

This study presents a comprehensive molecular profiling of the zebrafish PAC2 cell line, combining an extensive global proteomics analysis across cell population growth phases complemented with a matched transcriptomics dataset at two timepoints corresponding to exponential and stationary phases. Proteomics analysis of two independently generated sample series delivered comparable results despite significant differences in sample preparation protocols, which confirmed the high robustness of the employed DIA-based global proteomics workflow. Furthermore, despite the differences in the total number of gene identifications, both omics approaches highlighted similar biological categories and processes, indicating that the proteome coverage obtained was sufficient to reveal meaningful functional insights. This study thus paves the way for a broader application of state-of-the-art global proteomics techniques for molecular characterization of fish cell models.

Multiomics analysis revealed that PAC2 cells express a broad array of toxicologically relevant genes and pathways, including, e.g., xenobiotic biotransformation enzymes, stress response pathways, and a suite of chemical-responsive nuclear receptors. These findings reinforce the suitability of PAC2 cells as a versatile and well-characterized in vitro model for in-depth studies of fish cell biology, which can utilize different gene networks expressed at proliferative and stationary cell population growth phases, as well as omics-enhanced exploration of chemical toxicity mechanisms, further leveraging the availability of well-developed molecular annotation for this species. Beyond toxicology, molecular characterization data generated in this study can be further explored to evaluate PAC2 model usefulness for other disciplines, including fish virology, immunology, and aquaculture.

Moreover, the analysis pipelines and approaches developed in this work will provide a blueprint for molecular baseline characterization of other fish cell lines, particularly for a suite of cell lines derived from different organs of rainbow trout, foreseen to constitute a vital component of the fish invitrome for animal-free toxicity prediction. This work thus strengthens the mechanistic knowledge base supporting the use of fish cell lines in aquatic toxicity testing and risk assessment.

Materials and methods

PAC2 cell culture and growth curve characterization

The permanent zebrafish embryonic cell line PAC2 (Cellosaurus accession number: CVCL_5853) was routinely cultured at 26°C ± 1°C in T75 flasks (Thermo Fisher Scientific) with 10 mL of Lebovitz`s cell culture medium (L-15; PAN-Biotech), supplemented with 5% fetal calf serum (FCS; Eurobio Scientific). Sub-culturing was performed after reaching full confluence (95–100%) and involved washing with Versene (Gibco™), detachment with trypsin (0.25% in Dulbecco’s phosphate buffered saline; PAN-Biotech), cell counting by Casy cell counter (OLS OMNI Life Science, Bremen, Germany) and seeding at 1–2 million into a new T75 flask in fresh culture medium. Cells were routinely observed using a microscope (Nikon Eclipse TS2).

Growth curves in 96-well plates – experimental design overview

To characterize cell population growth phases, PAC2 cells were seeded in the 96-well plates (Greiner Bio-One CELLSTAR 96 Well, transparent, F-bottom, with lid) at a density of 3500 cells in 200 µL/well of the L-15 medium supplemented with 5% FBS; empty wells were loaded with 200 µL culture media. The plates were sealed with transparent sterile adhesive sheets (Thermo Scientific), the lid was sealed with Parafilm® and the cells were cultured at 26°C ± 1°C. Seven days after seeding, the old culture medium was removed by inverting the plate and gently tapping it onto a clean tissue paper. Fresh medium was then added at 200 µL/well. Cell viability and cell counting data were acquired at multiple timepoints (days) post-seeding from different wells of the same 96-well plate. In most experiments, 12 wells (rows B to G in columns 8 and 10) were seeded with cells: the top six were assigned for cell viability and the bottom six for cell counting. For the cell viability assays, six adjacent wells without cells served as blanks. The cell viability assays were performed using the fluorescent dyes alamarBlue™ (Invitrogen, indicator of metabolic activity) and 5-carboxyfluorescein diacetate, acetoxymethyl ester (CFDA-AM, Invitrogen, indicator of membrane integrity), according to previously established protocols [19,40,120]. After dye addition and 30 min incubation in the dark at 26°C ± 1°C, fluorescence intensity was measured on a Tecan Infinite M200 plate reader with the following parameters: bottom well reading; 530nm excitation and 595nm emission for alamarBlue™; 493nm excitation and 541nm emission for CFDA-AM. Cell count data was acquired on the Biotek Cytation 5 (Agilent) using the nuclear stain Hoechst 33342 (Invitrogen), according to the previously described procedures [121], using standardized settings for the DAPI channel (excitation 377 nm, emission 477 nm) and Gen5 software (version 3.10.06). All measurement data were exported to Excel for further analysis. Fluorescent intensity readings from cell-containing wells were corrected by subtracting the average fluorescent intensity readings of the cell-free wells (blanks). Growth curves were plotted in RStudio (version 4.4.2) with ggplot2’s geom_smooth (span = 1), using the LOESS (locally estimated scatterplot smoothing) function.

Sample collection for omics analyses

Based on the growth curve information, collection timepoints were set at days 3, 8, and 14 post-seeding, corresponding to the early exponential, mid-exponential, and stationary cell culture growth phases. To ensure sufficient amounts of protein and RNA material, cell culturing for omics analyses was performed in 6-well plates seeded at a density of 90000 cells in 4mL/well culture medium. Relative to the well surface area, this cell density was slightly lower than the one used for cell seeding in the growth curve experiments in the 96-well plates described above. This adjustment was made to account for differences in cell distribution patterns observed between the two plate formats, which, in the case of the 6-well plate format, caused the cells to accumulate more in the center of the well. The final seeding density was determined empirically through a series of scaling experiments comparing different cell seeding densities in 6-well plates. Plates were sealed and incubated in the dark at 26°C ± 1°C. Culture medium was exchanged after 7 days post-seeding. On days 8 and 14, corresponding to exponential and stationary growth phases, two sets of samples, one for global proteomics and one for transcriptomics, were collected from the same plate (top row was used to collect cells for the transcriptomics samples and bottom row for the proteomics samples), thus ensuring direct pairing of multiomics samples. In addition, day 3 samples were collected for proteomics only. These experiments were performed in four biological replicates with different cell passage numbers (P85-88), and experiments started on different days. Thus-collected samples underwent proteomics and transcriptomics analyses as described below, generating proteomics dataset 1 (PD1) and transcriptomics dataset (TD). In addition, a second set of samples collected at days 4, 7, 11, 18 and 28 post-seeding was included for global proteomics analysis in our study, generating proteomics dataset 2 (PD2). These samples, representing days 4, 7, 11, 18, and 28 post-seeding, corresponding to early, mid- and late exponential, and early/mid and late stationary phases, respectively, originated from another PAC2 cell culture growth experiment reported in [40], performed in three biological replicates. In contrast to PD1 samples which were trypsin-digested and desalted (see below) prior to mass spectrometry analysis, PD2 samples comprised a non-bound fraction collected during Fe-NTA phosphopeptide enrichment procedure, described in [40]. Further differences between the two proteomics sample sets are described in Table 1 above.

Global proteomics analysis: Sample collection and preparation for MS analysis

For proteomics sample collection, cells were washed three times with ice cold 1x PBS (without Ca2+ and Mg2+) before adding lysis buffer containing 5% sodium dodecyl sulfate (SDS) and 5% triethylammonium bicarbonate buffer (TEAB) in nanopure water. Cellular lysis was facilitated using scrapers with swiveling 13 mm blades. Lysed cells were pooled from 3–6 wells (depending on the time point / amount of biomass per well) in a 1.5 mL Protein LoBind® Eppendorf tube, which was then heated for 10 min at 90°C with a ThermoMixer (Eppendorf), cooled to room temperature and stored at −70°C until protein extraction. Both PD1 and PD2 samples underwent tryptic digestion following the same procedures based on S-trap™ (Protifi) Micro high recovery protocol [54]. Briefly, frozen samples were brought to room temperature and sonicated for 5 min in a bath sonicator to facilitate recovery of wall-adsorbed proteins. After this, the lysate was passed 6–10 times through a syringe needle (Sterican, 21G 0.80 x 40 mm) to shear the DNA. Protein concentration in the lysates was determined using the Pierce™ BCA Protein Assay Kits (Thermo Scientific™) with absorbance measured on Tecan (Infinite M200) plate reader at 562 nm wavelength. Tryptic digest for PD1 samples started with 50 µg (day 3) or 100 μg protein (day 8 and 14) and for PD2 with 200 µg for all samples. Disulphide bond reduction and alkylation were successively performed for 30 min in the dark each, using, respectively, 5 mM tris(2-carboxyethyl)phosphine–hydrochloride (TCEP) added at volume ratio 1:50 and 25 mM of the alkylator 2-iodoacetamide (IAA) at volume ratio 1:20. Next, the lysate mixture was acidified with formic acid (FA; 100%, final pH < 1) and thoroughly mixed by repeated pipetting. Binding/wash buffer (10% TEAB of pH adjusted to pH 7.55 with FA and 90% methanol) was then added to each sample at volume ratio 6:1 and mixed by pipetting before transferring the sample mixture to the S-Trap micro column placed in a 2 mL receiver tube. The S-Traps were centrifuged at 4000 g for 30 s to trap proteins, the flow-through was reloaded onto the trap, and the procedure was repeated twice. After this, 200 µl of binding/wash buffer was added, centrifuged at 4,000 g for 30 s, and the flow-through was discarded. This procedure was repeated two more times. Finally, the S-Trap column was centrifuged (1 min, 4000g) to fully remove the binding/wash buffer and then transferred to a new 1.5 mL Eppendorf tube. To each S-Trap, 120 µl of the digestion buffer containing elution buffer 1 (50mM TEAB in nanopure water), trypsin (1:100 w/w; Roche), and 0.5% 0.5M CaCl2, was added, spun through, re-loaded and incubated for 12–14 h at 37°C. The next day, elution buffer 1 was added and centrifuged for 1 min at 4000 g. This step was repeated with elution buffer 2 (0.2% FA in nanopure water) and elution buffer 3 (50% acetonitrile in nanopure water). All eluates of the same sample were collected in one tube, centrifuged, and dried in the vacuum concentrator (Eppendorf). After this step, the handling of PD1 and PD2 samples started to differ. PD1 samples were resuspended in buffer A (0.1% FA in nanopure water), desalted on the tC18 cartridges (Sep-Pak, Vac 1cc (50 mg), #WAT054960), eluted with buffer B (0.1% FA and 80% acetonitrile in nanopure water), centrifuged, dried, resuspended in buffer A, filtered with 0.45 µm centrifugal filter (Amicon Ultrafree MC), transferred into glass AS-vials with a 0.1 ml insert and stored at 4°C until MS measurement. PD2 samples underwent the Fe3+-NTA phosphopeptide enrichment procedure described in [40] and comprise the non-bound fractions containing non-phosphorylated peptides which were collected during the washing steps in this procedure, pooled, dried and then treated similarly to PD1 samples after final drying step.

Global proteomics analysis: Data acquisition and processing

Peptides (1–4 µg, see Table 1) were separated on a nanoLC system (Dionex Ultimate 300 UHPLC, Thermo Scientific) using an in-house made C18 reverse-phase analytical column (30 cm x 75 µm ID) with self-pulled tip, packed with ReproSil-Pur, 120 Å pore size, C18-AQ, 3 µm bead size (Dr. Maisch GmbH, Germany), and coupled with a Nanospray Flex Series ion source NGTM to the Orbitrap FusionTM LumosTM TribridTM (Thermo Scientific) mass spectrometer operated in data-independent acquisition mode (DIA) over 138 min gradient at a flow rate of 300 nL/min, including 92 min linear gradient from 2% to 35% eluent B (0.2% FA, 2% nanopure water, 97.8% acetonitrile). Source parameters were 2400 V for positive ion spray and 275°C ion transfer tube temperature. MS was set for expected LC peak width of 60 s with a default charge state of 2. Full scan set at 120000 resolution, scan range of 350–1150 m/z, normalized AGC target 200% and maximum injection time 100 ms; DIA was performed using 41 variable-width isolation windows. MSn experiment was set with HCD at normalized collision energy of 28%, 30000 resolution, scan range of 200–1800 m/z, AGC target 400%, maximum injection time 54 ms. Most samples were run with two technical replicates, except PD1-day 3 samples (three technical replicates) and the third biological replicate of PD2 (one technical replicate). Raw LC-MS/MS data were converted with HTRMS converter (version 18.2) and processed in Spectronaut® 18.0 Pulsar (version 18.2.230802.50606, Biognosys AG, Schlieren, Switzerland) [45]. Proteins were inferred based on peptide fragment ions identified using directDIA, a method embedded in Spectronaut that uses a FASTA sequence database instead of a spectral library. Search FASTAs included reference proteome of zebrafish (Danio rerio, UP000000437) downloaded from UniProt on 19.9.2023 (46695 entries) and NIHMS1873978_supplement_contaminant (381 entries) [122]. Trypsin was selected as a digestion enzyme with 1 miscleavage allowed, carbamidomethylation was set as fixed modification, and methionine oxidation and N-terminal acetylation as variable modifications. FDR was set to 0.01 and quantification was performed on MS2 level. Identified proteins were combined into protein groups where peptide information did not allow their unambiguous distinction from each other, and final quantification was performed on the level of protein group using only protein group-specific peptides. Normalization was performed locally using sparse profiles and imputation was disabled. Following the initial processing in Spectronaut, data was exported and further analyzed using Excel, RStudio (version 4.4.2) and Perseus (version 2.0.11.0) [123]. The mass spectrometry proteomics data has been deposited to the ProteomeXchange Consortium via the PRIDE [124] partner repository with the dataset identifier PXD066565.

Global proteomics: Downstream data analysis

For each biological replicate (n = 4 for PD1 and n = 3 for PD2), technical replicates (if any) were averaged and results (i.e., one value per protein group, per biological replicate) were used for protein identification counts, where each protein group required at least one measured value in one of the biological replicates for the respective condition to be counted as having identified that protein group. No batch correction was needed since all replicates within each independent dataset (PD1 and PD2) were processed and measured together. For downstream quantitative analysis, proteins which had missing values in 30% or more of the dataset’s replicates were removed. For remaining proteins, missing values were imputed based on the Perseus default settings, where available values are first log2-transformed and then the missing entries are randomly assigned a value drawn from a set of low-intensity values mimicking the parametric distribution of the real values, per sample (see S2 Fig for the distribution of real and imputed data in each biological replicate). Data centering was then performed by subtracting the median of the sample from every value within the corresponding sample. Next, a principal component analysis (PCA) was performed using the prcomp function, with the first two components visualized using ggplot2. Samples were grouped by day, with convex hulls (chull) to illustrate group boundaries and biological replicates labeled using ggrepel. Variance explained by each component was indicated on the axes. Differential expression analysis was conducted in Perseus separately for each dataset. PD1 and PD2 were analyzed by a one-way analysis of variance (ANOVA), while unpaired Student’s t-test was used on a subset of PD1 (days 8 and 14) matched with TD. In both tests, significance was assessed by permutation-based FDR approach, with a threshold of FDR < 0.05 and 250 randomizations. No grouping was preserved during permutation. Following statistical testing, differentially expressed protein groups (DEPs) were z-score normalized by subtracting the mean and dividing by the standard deviation of protein group intensities across conditions. Hierarchical clustering was then carried out using Euclidean distance and complete linkage. Heatmaps of the clustered DEPs were generated in RStudio (version 4.4.2) using the pheatmap package (version 1.0.12) for visualization. To identify the top regulated DEPs, a volcano plot was generated using visualization tools from Perseus.

Transcriptomics analysis: Sample collection and library preparation

To collect the samples for transcriptomics analysis (biological replicates n = 4), cells were washed with ice cold 1x PBS before adding RNeasy Plus lysis buffer (QIAGEN, Ref. 1053393). Cellular lysis was facilitated using scrapers with swiveling 13 mm blades. Lysed cells were harvested from multiple wells in 1.5 mL DNase/RNase-free Eppendorf Collection Tubes (QIAGEN) placed on ice, vortexed at maximal speed for 1 min and immediately stored at –70°C until further processing. RNA extraction was performed according to manufacturer’s instructions using the RNeasy Plus Mini Kit (QIAGEN, Ref. 172034266), including a gDNA removal step. RNA quality was analyzed with the RNA ScreenTape Assay for TapeStation 4200 System (Agilent), following the assay instructions and using the electronic ladder. For all samples, the measured RNA Integrity Number (RIN) was equal to 10. After this quality control, the samples were transported in dry ice to the Genomic Diversity Centre (GDC), ETH Zürich, which provides access to managed instruments and expertise in genetic techniques and genomic data. Here, samples were treated with DNase using the DNase I Amplification Grade Kit (Invitrogen, Ref. 18068015) to eliminate any residual gDNA, using 1 µg of RNA for each sample and following the kit instructions. After DNAse treatment, three random samples were tested again with the TapeStation to ensure the quality of RNA was not changed. All tested samples showed RINs equaling 10.

To prepare the mRNA library, DNAse-treated RNA samples were first quantified using the Qubit RNA Broad Range (BR) Assay Kit (Invitrogen, Ref. Q10211) following kit instruction. Measurements were performed in a 384-well black plate (Faust, Ref. 4000246) using a plate reader Spark M10 (Tecan). mRNA library was prepared with starting amount of 0.5 µg for each sample, using the TruSeq Stranded mRNA Library Preparation Kit from Illumina (96 Samples, Ref. 2002055), which includes poly-A mRNA enrichment through oligo(dT) magnetic beads. For increased demultiplexing accuracy and reduced sample misassignment during sequencing, IDT for Illumina – TruSeq RNA Unique Dual (UD) Indexes v2 (96 Indexes, 96 Samples, Ref. 20040871) were used. The whole procedure was performed following manufacturer’s instructions but using half-volumes of all reagents. This modification of the protocol was previously tested in an internal trial to ensure similar performance in the library preparation procedure. Following the preparation, the concentration of each mRNA library was measured using a Qubit dsDNA BR Assay Kit (Invitrogen, Ref. Q32853) following the kit instructions. One sample (day 14, biological replicate 3) appeared to be lost during library preparation and was therefore not included in downstream analyses. Although the reduction in sample size in one group could slightly decrease statistical power, we do not expect the loss of this replicate to bear a significant impact on the results as the remaining replicates showed highly consistent expression profiles. The remaining mRNA libraries were normalized and pooled following the specifications required by the sequencing platform.

After pooling, residual adapter dimers were removed by an additional mRNA library purification step using AMPure XP beads (self-prepared by the GDC) using a bead-to-sample volume ratio of 0.95:1 and following the mRNA library purification steps described in the TruSeq Stranded mRNA Library Preparation protocol. The samples were then air-dried for 15 min, removed from the magnetic stand, and gently resuspended in 70 µL of nuclease-free ultrapure water. After 2 min incubation, samples were centrifuged at 280 rcf for 1 min and placed back into the magnetic rack. Once the liquid became clear, the supernatant containing the cleaned mRNA library was collected into a new tube. The final quality check of the pooled, cleaned library was performed using the D1000 ScreenTape Assay for TapeStation Systems (Agilent).

Transcriptomics analysis: Sequencing and data processing

The pooled mRNA library was sequenced by the Functional Genomics Center of University of Zurich and ETH Zurich (FGCZ). Before the main sequencing, mRNA quality check was performed with iSeq (Illumina). This additional step of short, low-output sequencing provided information about the optimal loading concentration, library quality, and index balance. The index balance for under-represented samples detected by the iSeq analysis was adjusted by respective spiking into the final mRNA library, followed by evaluation of sample concentration and quality of sample using the ScreenTape assay. The final sequencing was performed on an Illumina NovaSeq X system using a 300-cycle 10B flow cell and yielded an average of approximately 20 million paired-end reads (2 × 150 bp) per sample. Raw data (sample reads provided by the sequencing facility (FGCZ)) were pre-processed by the GDC, including the pseudo-alignment which was performed using Kallisto (version v0.50.1) [125] and the reference transcriptome from Ensembl (Danio rerio, GRCz11/GCA_000002035.4). Transcripts remapping was on average above 83%, with most samples yielding between 12–33 million aligned reads. The fragment length ranged between 168–209 bp. The RNAseq transcriptomics data has been deposited to the Gene Expression Omnibus [126] with the dataset identifier GSE303229.

Transcriptomics analysis: Downstream data analysis

Transcript-level abundance estimates from Kallisto (in HDF5 format) were imported into RStudio (version 4.4.2) and summarized to the gene level counts based on the reference transcriptome. Raw counts per million (CPM) values were extracted and normalized using transcript length and effective library size, thus accounting for the sequencing depth and gene length bias. No batch correction was needed since all replicates were processed and measured together. The normalized dataset was log-transformed. Differentially expressed genes (DEGs) analysis was performed using the edgeR package (version 4.6.2) [127]. A design matrix accounting for the two experimental conditions of interest (exponential vs. stationary growth phases) was generated. Genes with low expression were filtered using the filterByExpr function. The filtering step removed approximately 13000 genes, going from 30628 to 17570 genes. The glmQLFit function, which applies a negative binomial model and robust estimation to reduce the impact of outliers, was used to model the variation in gene expression across samples. Differential expressions were tested applying a quasi-likelihood F-test (glmQLFTest function) and corrected by FDR < 0.05 obtained using the Benjamini-Hochberg (BH) adjustment. Following statistical testing, DEGs were z-score normalized and hierarchically clustered using Euclidean distance and complete linkage. Heatmaps of clustered DEGs were generated in RStudio using the pheatmap package for visualization. To visualize the data, transcripts were filtered according to the transcript per million (TPM) value, where each transcript`s geometric mean was calculated across samples and within each cell population growth phase. Normalized data with TPM > 1 was selected for PCA, performed using the prcomp function with default parameters. The first two components were visualized using ggplot2 (v3.5.2). Samples were grouped by day, with convex hulls (chull) to illustrate group boundaries. Variance explained by each component was indicated on the axes. To identify the top regulated DEGs, a volcano plot was generated using visualization tools from Perseus.

Multiomics data integration and functional annotation analysis

Identifiers (UniProt accessions for PDs and Ensembl IDs for TD) were converted to official gene symbols using DAVID Bioinformatics (version: 2021 update) [128,129], which were then used for overlap analysis (Venn diagrams) and functional annotation analysis. Transcripts used in analysis were filtered according to the TPM value, where each transcript`s geometric mean was calculated across samples and within growth phase, and transcripts with TPM < 1 were excluded from the overlap analysis. Venn diagrams were analyzed using the InteractiVenn webpage [130] and re-drawn using BioRender.com. Functional annotation and enrichment analysis for Gene Ontology (GO) biological process (BP; GOTERM_BP_FAT), molecular function (MF; GOTERM_BP_FAT), cellular component (CC; GOTERM_MF_FAT) and KEGG pathway (KEGG) terms were performed using DAVID Bioinformatics and respective gene symbol lists. DAVID Bioinformatics platform was used because it provides a functional annotation clustering tool which allows grouping the related GO terms and KEGG pathways based on shared annotated genes, thereby offering a broader overview of biologically meaningful functions. When analyzing the complete list of identified protein groups and transcripts, no background was applied. When analyzing DEPs and DEGs, the full PAC2 proteome and transcriptome, respectively, were used as background references. Statistically significant GO BP terms (FDR < 0.05) identified in the previous steps were clustered based on semantic similarity (default settings, specifically medium/0.7 resulting list and SimRel) using the Danio rerio species background in Revigo (version 1.8.1) webpage [131]. The process included removal of obsolete terms. To represent the functions of the entire proteome and transcriptome, the Interactive Graph results from Revigo were exported in an XGMML file format and transferred to Cytoscape (version 3.10.3; [132]). In Cytoscape, terms (represented by circles) were color-coded according to pre-established cellular functional categories based on the proximity and nomenclature of the circles (i.e., common words used throughout circles found in close distance from each other). To represent the functions of up- and down-regulated DEPs and DEGs, resulting GO terms and KEGG pathways were grouped according to gene similarity using DAVID Bioinformatics in-build clustering algorithm. Terms and pathways of interest were selected and represented using horizontal lollipop charts inspired by the design of the visualization tools used in interactive web-based platform, ShinyGO 0.81 [133].

Generation of multiomics data model

A multiomics model was built with RStudio (version 4.5.1) for the matched PD1 and TD samples (day 8 and day 14) using normalized proteomics and transcriptomics data, created as described in the corresponding sections above. A horizontal (shared sample model) multiblock data integration partial least-square (PLS) model was implemented using the block.pls function from the mixomics R library (v.3.21) [134]. PLS maximizes the covariance between datasets (blocks) and generates latent variables that capture their shared variations. This approach was selected over alternatives such as MOFA [135] or DIABLO [136] because it allows exploratory integration focused on shared variance between omics layers without the need to define classes, thus making it well suited for our paired analysis design. Canonical method and near-zero variance filtering were used as arguments. PLS plot capturing variance per biological replicate and per dataset was generated using the plotArrow function.

AI statement

Several RStudio scripts used for figure preparation were refined with the assistance of ChatGPT (OpenAI).

Supporting information

S1 Fig. Growth curve of the PAC2 cell line based on membrane integrity measurements.

PAC2 cells were seeded in 96-well plates and cultured for 3 weeks. Cell culture medium was exchanged once on day 7 post-seeding. Membrane integrity based on 5-caroxyfluorescein diacetate, acetoxymethyl ester (CFDA-AM) fluorescence was recorded in three biological replicates at multiple timepoints (days) throughout the culturing duration. Data are shown as mean ± SD of technical replicates (n = 3–12) for each biological replicate. Trend line was added using LOESS smoothing (geom_smooth, span = 1) to visualize overall changes over time; no further statistical modeling was applied.

https://doi.org/10.1371/journal.pone.0346109.s001

(TIF)

S2 Fig. Histograms of the biological replicates from proteomics datasets.

Histograms from proteomics dataset 1 (PD1) and 2 (PD2) are shown in panels A and B, respectively. Protein group abundance values were log2-transformed and filtered to retain only those with values in at least 70% of biological replicates. Remaining missing values were imputed (shown in the red bins, located left of 0 on x-axis) and centred to 0 (by subtracting the median per sample). Biological replicates are labelled as B1 to B4 for PD1 and B1 to B3 for PD2.

https://doi.org/10.1371/journal.pone.0346109.s002

(TIF)

S3 Fig. Absolute abundance differences of protein groups shared between proteomics datasets 1 and 2 (PD1 and PD2).

(A) Correlation and density plot of 6803 protein groups (subset with >70% detection) shared between PD1 and PD2. For each dataset, abundance values of each protein group were averaged across biological replicates from all conditions (Days). The diagonal line indicates equality (x = y); points above the line have higher values in PD2, and those below have higher values in PD1. Protein groups are color-coded according to abundance: green represents the 200 most abundant and blue the 200 least abundant, based on the average values across all biological replicates and conditions from both datasets. (B–C) Profile plots of the top 200 most abundant (B) and least abundant (C) protein groups shown in (A), with box plots illustrating the distribution of abundance values across conditions (Days) for each dataset (orange for PD1; black for PD2).

https://doi.org/10.1371/journal.pone.0346109.s003

(TIF)

S4 Fig. Impact of peptide loading concentration in LC-MS/MS runs on correlation between average protein group abundancies in proteomics datasets 1 and 2 (PD1 and PD2).

Correlation plots show protein group abundances (averaged across biological replicates) between conditions: day 3 (1 µg on-column loading per sample), day 8 and day 14 (2 µg on-column loading per sample) from PD1; and days 4, 7, 11, 18, and 28 (4 µg on-column loading per sample) from PD2. Pearson correlation coefficients are shown in each plot.

https://doi.org/10.1371/journal.pone.0346109.s004

(TIF)

S5 Fig. Principal component analysis (PCA) of individual proteomics datasets.

PCA plots are shown for proteomics dataset 1 (PD1; orange; panel A) and proteomics dataset 2 (PD2; grey; panel B). PCA was performed on filtered, log₂-transformed, and zero-centred data, which comprised 6751 protein groups for PD1 and 6927 protein groups for PD2. Convex hulls (orange and grey areas for PD1 and PD2, respectively) show the outer boundaries formed by replicates within each group.

https://doi.org/10.1371/journal.pone.0346109.s005

(TIF)

S6 Fig. Co-expression plots of the z-score normalized differentially expressed protein groups (DEPs) identified as significantly down- or upregulated in proteomics datasets 1 (PD1) and 2 (PD2).

In each plot, all lines represent the total number of differentially expressed protein groups (DEPs) in the corresponding case, while coloured lines represent the DEPs shared between PD1 and PD2. Panels (A) and (B) show downregulation clusters in PD1 (2317 DEPs) and PD2 (1622 DEPs), respectively, with 1247 shared DEPs (green lines). Panels (C) and (D) show upregulation clusters in PD1 (2133 DEPs) and PD2 (1909 DEPs), respectively, with 1273 shared DEPs (pink lines). Abbreviations: B, biological replicate.

https://doi.org/10.1371/journal.pone.0346109.s006

(TIF)

S7 Fig. Expression levels of selected markers of fibroblastic and epithelial cell types.

Markers were selected to represent epithelial-like morphologies (panels A, C, and E) and fibroblast-like morphologies (panels B, D, and F). Expression data are shown for selected markers using proteomics dataset 1 (PD1; panels A and B), proteomics dataset 2 (PD2; panels C and D), and the transcriptomics dataset (TD; panels E and F). For PD1 and PD2, protein group abundance values were used; for TD, transcript levels were represented as TPM values. All data were log₂-transformed and visualized in RStudio (version 4.4.2). Epithelial markers full name, including gene symbol and UniProt accession: cadherin 1, type 1, E-cadherin (epithelial) (cdh1, Q90Z37); mucin-5 AC (prr36b, A0A8N7UTE4); occludin (si:ch73-61d6.3, A0A8M1PUX9); syndecan (sdc2, Q66I66); zona occludens 2 (tjp2b, Q6DHS7). Fibroblastic markers full name, including gene symbol and UniProt accession: vimentin (vim, A0A8M2BL67); collagen, type I (col1a1b, A0A8M2B227); fibronectin (fn1a, B0S602); cadherin 2, type 1, N-cadherin (neuronal) (cdh2, Q90275).

https://doi.org/10.1371/journal.pone.0346109.s007

(TIF)

S1 Table. Technical overview of Proteomics Datasets (PDs).

https://doi.org/10.1371/journal.pone.0346109.s008

(XLSX)

S2 Table. Supporting data for Proteomics Dataset 1 (PD1).

https://doi.org/10.1371/journal.pone.0346109.s009

(XLSX)

S3 Table. Supporting data for Proteomics Dataset 2 (PD2).

https://doi.org/10.1371/journal.pone.0346109.s010

(XLSX)

S4 Table. Supporting data for shared Proteomics Datasets 1 and 2 (PD1PD2) lists of differentially expressed protein groups (DEPs).

https://doi.org/10.1371/journal.pone.0346109.s011

(XLSX)

S5 Table. Supporting data for Proteomics Dataset 1 (PD1), subset with day 8 and day 14.

https://doi.org/10.1371/journal.pone.0346109.s012

(XLSX)

S6 Table. Supporting data for Transcriptomics Dataset (TD).

https://doi.org/10.1371/journal.pone.0346109.s013

(XLSX)

S7 Table. Supporting Data for Transcriptomics Dataset (TD) lists of differentially expressed genes (DEGs).

https://doi.org/10.1371/journal.pone.0346109.s014

(XLSX)

S8 Table. Supporting data for shared Proteomics Dataset 1 (PD1) subset day8-day14 and Transcriptomics Dataset (TD) lists of differentially expressed molecules.

https://doi.org/10.1371/journal.pone.0346109.s015

(XLSX)

S9 Table. Supporting Data for Volcano plots for Proteomics Dataset 1 (PD1) subset of Day 8 and day 14 and Transcriptomics Databaset (TD).

https://doi.org/10.1371/journal.pone.0346109.s016

(XLSX)

S10 Table. Supporting data for genes identified on the protein level without correspondence to transcript level.

https://doi.org/10.1371/journal.pone.0346109.s017

(XLSX)

S11 Table. Supporting data for functional annotation analysis of all PAC2 genes expressed on the proteome and transriptome levels.

https://doi.org/10.1371/journal.pone.0346109.s018

(XLSX)

S12 Table. Overview of expression in PAC2 cells of genes or proteins belonging to selected families and/or pathways related to toxicologically relevant processes and functions.

https://doi.org/10.1371/journal.pone.0346109.s019

(DOCX)

Acknowledgments

Transcriptomics data presented in this paper were produced and analyzed in collaboration with the Genetic Diversity Centre (GDC), ETH Zurich.

References

  1. 1. Schuijt LM, Peng F-J, van den Berg SJP, Dingemans MML, Van den Brink PJ. (Eco)toxicological tests for assessing impacts of chemical stress to aquatic ecosystems: Facts, challenges, and future. Sci Total Environ. 2021;795:148776. pmid:34328937
  2. 2. Organisation for Economic Co-operation and Development. Fish Toxicity Testing Framework. 177. Paris: Organisation for Economic Co-operation and Development; 2014.
  3. 3. Lillicrap A, Belanger S, Burden N, Pasquier DD, Embry MR, Halder M, et al. Alternative approaches to vertebrate ecotoxicity tests in the 21st century: A review of developments over the last 2 decades and current status. Environ Toxicol Chem. 2016;35(11):2637–46. pmid:27779828
  4. 4. Burden N, Benstead R, Benyon K, Clook M, Green C, Handley J, et al. Key opportunities to replace, reduce, and refine regulatory fish acute toxicity tests. Environmental Toxicology and Chemistry. 2020;39(10):2076–89.
  5. 5. Revel M, Groh K, Bertoli J, Degeratu M-O, Fischer M, Fischer S, et al. Development of the fish invitrome for animal-free environmental risk assessment of chemicals. Environ Toxicol Chem. 2025;44(9):2648–58. pmid:39880375
  6. 6. Russell WMS, Burch RL. The principles of humane experimental technique: Methuen; 1959.
  7. 7. Knight DJ, Deluyker H, Chaudhry Q, Vidal J-M, de Boer A. A call for action on the development and implementation of new methodologies for safety assessment of chemical-based products in the EU - A short communication. Regul Toxicol Pharmacol. 2021;119:104837. pmid:33249099
  8. 8. Krewski D, Acosta D Jr, Andersen M, Anderson H, Bailar JC 3rd, Boekelheide K, et al. Toxicity testing in the 21st century: a vision and a strategy. J Toxicol Environ Health B Crit Rev. 2010;13(2–4):51–138. pmid:20574894
  9. 9. Ankley GT, Bennett RS, Erickson RJ, Hoff DJ, Hornung MW, Johnson RD, et al. Adverse outcome pathways: a conceptual framework to support ecotoxicology research and risk assessment. Environ Toxicol Chem. 2010;29(3):730–41. pmid:20821501
  10. 10. Bols NC, Pham PH, Dayeh VR, Lee LEJ. Invitromatics, invitrome, and invitroomics: introduction of three new terms for in vitro biology and illustration of their use with the cell lines from rainbow trout. In Vitro Cell Dev Biol Anim. 2017;53(5):383–405. pmid:28374170
  11. 11. Arana ÁJ, González-Llera L, Barreiro-Iglesias A, Sánchez L. Emerging Frontiers in Zebrafish Embryonic and Adult-Derived Cell Lines. Int J Mol Sci. 2025;26(9):4351. pmid:40362588
  12. 12. Goswami M, Yashwanth BS, Trudeau V, Lakra WS. Role and relevance of fish cell lines in advanced in vitro research. Mol Biol Rep. 2022;49(3):2393–411. pmid:35013860
  13. 13. Fischer M, Belanger SE, Berckmans P, Bernhard MJ, Bláha L, Coman Schmid DE, et al. Repeatability and Reproducibility of the RTgill-W1 Cell Line Assay for Predicting Fish Acute Toxicity. Toxicol Sci. 2019;169(2):353–64. pmid:30825313
  14. 14. Schug H, Yue Y, Krese R, Fischer S, Kortner TM, Schirmer K. Time- and concentration-dependent expression of immune and barrier genes in the RTgutGC fish intestinal model following immune stimulation. Fish Shellfish Immunol. 2019;88:308–17. pmid:30844464
  15. 15. Minghetti M, Drieschner C, Bramaz N, Schug H, Schirmer K. A fish intestinal epithelial barrier model established from the rainbow trout (Oncorhynchus mykiss) cell line, RTgutGC. Cell Biol Toxicol. 2017;33(6):539–55. pmid:28251411
  16. 16. Emberley-Korkmaz S, Mittal K, Temlock N, Head J, Basu N. Cytotoxicity of 19 pesticides in rainbow trout gill, liver, and intestinal cell lines. Environ Toxicol Chem. 2025;44(9):2443–54. pmid:38116990
  17. 17. Scott J, Grewe R, Minghetti M. Fish Embryo Acute Toxicity Testing and the RTgill-W1 Cell Line as In Vitro Models for Whole-Effluent Toxicity (WET) Testing: An In Vitro/In Vivo Comparison of Chemicals Relevant for WET Testing. Environ Toxicol Chem. 2022;41(11):2721–31. pmid:35942926
  18. 18. International Organization for Standardization. ISO 21115:2019 Water quality - Determination of acute toxicity of water samples and chemicals to a fish gill cell line (RTgill-W1). Geneva, Switzerland; 2019. Report No.: ISO 21115;2019.
  19. 19. Organisation for Economic Co-operation and Development. Test No. 249: Fish Cell Line Acute Toxicity-The RTgill-W1 Cell Line Assay. OECD Publishing; 2021.
  20. 20. Lin S, Gaiano N, Culp P, Burns JC, Friedmann T, Yee JK, et al. Integration and germ-line transmission of a pseudotyped retroviral vector in zebrafish. Science. 1994;265(5172):666–9. pmid:8036514
  21. 21. Tierbach A, Groh KJ, Schoenenberger R, Schirmer K, Suter MJ-F. Characterization of the Mercapturic Acid Pathway, an Important Phase II Biotransformation Route, in a Zebrafish Embryo Cell Line. Chem Res Toxicol. 2020;33(11):2863–71. pmid:32990429
  22. 22. Lai KP, Gong Z, Tse WKF. Zebrafish as the toxicant screening model: Transgenic and omics approaches. Aquat Toxicol. 2021;234:105813. pmid:33812311
  23. 23. Schmeisser S, Miccoli A, Von Bergen M, Berggren E, Braeuning A, Busch W. New approach methodologies in human regulatory toxicology–not if, but how and when!. Environment International. 2023;178:108082.
  24. 24. Culp PA. Random DNA integrations as an approach to insertional mutagenesis in the zebrafish (Brachydanio rerio). Massachusetts Institute of Technology; 1994.
  25. 25. Senghaas N, Köster RW. Culturing and transfecting zebrafish PAC2 fibroblast cells. Cold Spring Harb Protoc. 2009;2009(6):pdb.prot5235. pmid:20147193
  26. 26. Keller JM, Escara-Wilke JF, Keller ET. Heat stress-induced heat shock protein 70 expression is dependent on ERK activation in zebrafish (Danio rerio) cells. Comp Biochem Physiol A Mol Integr Physiol. 2008;150(3):307–14. pmid:18467140
  27. 27. Šrut M, Bourdineaud J-P, Štambuk A, Klobučar GIV. Genomic and gene expression responses to genotoxic stress in PAC2 zebrafish embryonic cell line. J Appl Toxicol. 2015;35(11):1381–9. pmid:25612249
  28. 28. Lungu-Mitea S, Oskarsson A, Lundqvist J. Development of an oxidative stress in vitro assay in zebrafish (Danio rerio) cell lines. Sci Rep. 2018;8(1):12380. pmid:30120374
  29. 29. Li Y, Zhang Z, Jiang S, Xu F, Tulum L, Li K, et al. Using transcriptomics, proteomics and phosphoproteomics as new approach methodology (NAM) to define biological responses for chemical safety assessment. Chemosphere. 2023;313:137359. pmid:36427571
  30. 30. Bhuyan BK, Fraser TJ, Day KJ. Cell proliferation kinetics and drug sensitivity of exponential and stationary populations of cultured L1210 cells. Cancer Res. 1977;37(4):1057–63. pmid:139202
  31. 31. Tanneberger K, Knöbel M, Busser FJM, Sinnige TL, Hermens JLM, Schirmer K. Predicting fish acute toxicity using a fish gill cell line-based toxicity assay. Environ Sci Technol. 2013;47(2):1110–9. pmid:23227966
  32. 32. Stadnicka-Michalak J, Schirmer K, Ashauer R. Toxicology across scales: Cell population growth in vitro predicts reduced fish growth. Sci Adv. 2015;1(7):e1500302. pmid:26601229
  33. 33. Sabatier P, Lechner M, Guzmán UH, Beusch CM, Zeng X, Wang L, et al. Global analysis of protein turnover dynamics in single cells. Cell. 2025;188(9):2433-2450.e21. pmid:40168994
  34. 34. Buccitelli C, Selbach M. mRNAs, proteins and the emerging principles of gene expression control. Nat Rev Genet. 2020;21(10):630–44. pmid:32709985
  35. 35. Maier T, Güell M, Serrano L. Correlation of mRNA and protein in complex biological samples. FEBS Lett. 2009;583(24):3966–73. pmid:19850042
  36. 36. Salovska B, Zhu H, Gandhi T, Frank M, Li W, Rosenberger G, et al. Isoform-resolved correlation analysis between mRNA abundance regulation and protein level degradation. Mol Syst Biol. 2020;16(3):e9170. pmid:32175694
  37. 37. Groh KJ, Suter MJ-F. Stressor-induced proteome alterations in zebrafish: a meta-analysis of response patterns. Aquat Toxicol. 2015;159:1–12. pmid:25498419
  38. 38. Canzler S, Schor J, Busch W, Schubert K, Rolle-Kampczyk UE, Seitz H, et al. Prospects and challenges of multi-omics data integration in toxicology. Arch Toxicol. 2020;94(2):371–88. pmid:32034435
  39. 39. Shi C, Cheng L, Yu Y, Chen S, Dai Y, Yang J, et al. Multi-omics integration analysis: Tools and applications in environmental toxicology. Environ Pollut. 2024;360:124675. pmid:39103035
  40. 40. Huwa N, Schönenberger R, Groh KJ. Development and application of a targeted phosphoproteomics method for analysing the mTOR pathway dynamics in zebrafish PAC2 cell line. J Proteomics. 2025;319:105469. pmid:40466985
  41. 41. Simmons SO, Fan C-Y, Ramabhadran R. Cellular stress response pathway system as a sentinel ensemble in toxicological screening. Toxicol Sci. 2009;111(2):202–25. pmid:19567883
  42. 42. Eide M, Zhang X, Karlsen OA, Goldstone JV, Stegeman J, Jonassen I, et al. The chemical defensome of five model teleost fish. Sci Rep. 2021;11(1):10546. pmid:34006915
  43. 43. Ludwig C, Gillet L, Rosenberger G, Amon S, Collins BC, Aebersold R. Data-independent acquisition-based SWATH-MS for quantitative proteomics: a tutorial. Mol Syst Biol. 2018;14(8):e8126. pmid:30104418
  44. 44. Pino LK, Just SC, MacCoss MJ, Searle BC. Acquiring and Analyzing Data Independent Acquisition Proteomics Experiments without Spectrum Libraries. Mol Cell Proteomics. 2020;19(7):1088–103. pmid:32312845
  45. 45. Bruderer R, Bernhardt OM, Gandhi T, Reiter L. High-precision iRT prediction in the targeted analysis of data-independent acquisition and its impact on identification and quantitation. Proteomics. 2016;16(15–16):2246–56. pmid:27213465
  46. 46. Bruderer R, Bernhardt OM, Gandhi T, Xuan Y, Sondermann J, Schmidt M, et al. Optimization of Experimental Parameters in Data-Independent Mass Spectrometry Significantly Increases Depth and Reproducibility of Results. Mol Cell Proteomics. 2017;16(12):2296–309. pmid:29070702
  47. 47. Wang F, Zhu Y, Hew CL. Quantitative study of proteomic alterations in a Zebrafish (danio rerio) cell line infected with the Singapore Grouper Iridovirus (SGIV). Virus Res. 2015;199:62–7. pmid:25662022
  48. 48. Menanteau-Ledouble S, Nöbauer K, Razzazi-Fazeli E, El-Matbouli M. Effects of Yersinia ruckeri invasion on the proteome of the Chinook salmon cell line CHSE-214. Sci Rep. 2020;10(1):11840. pmid:32678312
  49. 49. Yan J, Long Y, Zhou T, Ren J, Li Q, Song G, et al. Dynamic Phosphoproteome Profiling of Zebrafish Embryonic Fibroblasts during Cold Acclimation. Proteomics. 2020;20(2):e1900257. pmid:31826332
  50. 50. Alli Shaik A, Wee S, Li RHX, Li Z, Carney TJ, Mathavan S, et al. Functional mapping of the zebrafish early embryo proteome and transcriptome. J Proteome Res. 2014;13(12):5536–50. pmid:25230361
  51. 51. Krasny L, Huang PH. Data-independent acquisition mass spectrometry (DIA-MS) for proteomic applications in oncology. Mol Omics. 2021;17(1):29–42. pmid:33034323
  52. 52. Sajic T, Liu Y, Aebersold R. Using data-independent, high-resolution mass spectrometry in protein biomarker research: perspectives and clinical applications. Proteomics Clin Appl. 2015;9(3–4):307–21. pmid:25504613
  53. 53. Stewart HI, Grinfeld D, Giannakopulos A, Petzoldt J, Shanley T, Garland M, et al. Parallelized Acquisition of Orbitrap and Astral Analyzers Enables High-Throughput Quantitative Analysis. Anal Chem. 2023;95(42):15656–64. pmid:37815927
  54. 54. HaileMariam M, Eguez RV, Singh H, Bekele S, Ameni G, Pieper R, et al. S-Trap, an Ultrafast Sample-Preparation Approach for Shotgun Proteomics. J Proteome Res. 2018;17(9):2917–24. pmid:30114372
  55. 55. Tsai C-F, Hsu C-C, Hung J-N, Wang Y-T, Choong W-K, Zeng M-Y, et al. Sequential phosphoproteomic enrichment through complementary metal-directed immobilized metal ion affinity chromatography. Anal Chem. 2014;86(1):685–93. pmid:24313913
  56. 56. Lim HG, Gao Y, Rychel K, Lamoureux C, Lou XA, Palsson BO. Revealing systematic changes in the transcriptome during the transition from exponential growth to stationary phase. mSystems. 2025;10(1):e0131524. pmid:39714213
  57. 57. Carlage T, Kshirsagar R, Zang L, Janakiraman V, Hincapie M, Lyubarskaya Y, et al. Analysis of dynamic changes in the proteome of a Bcl-XL overexpressing Chinese hamster ovary cell culture during exponential and stationary phases. Biotechnol Prog. 2012;28(3):814–23. pmid:22556165
  58. 58. Ridder Md, van den Brandeler W, Altiner M, Daran-Lapujade P, Pabst M. Proteome Dynamics During Transition From Exponential to Stationary Phase Under Aerobic and Anaerobic Conditions in Yeast. Mol Cell Proteomics. 2023;22(6):100552. pmid:37076048
  59. 59. Williams EG, Wu Y, Jha P, Dubuis S, Blattmann P, Argmann CA, et al. Systems proteomics of liver mitochondria function. Science. 2016;352(6291):aad0189. pmid:27284200
  60. 60. Li Y, Xu K, Zhou A, Xu Z, Wu J, Peng X, et al. Integrative Transcriptomics and Proteomics Analysis Reveals THRSP’s Role in Lipid Metabolism. Genes (Basel). 2024;15(12):1562. pmid:39766829
  61. 61. Wang H, Li M, Tao X, Qian Y, Chen L, Tao G. ARHGEF39 promotes gastric cancer cell proliferation and migration via Akt signaling pathway. Mol Cell Biochem. 2018;440(1–2):33–42. pmid:28871449
  62. 62. Anijs M, Devanna P, Vernes SC. ARHGEF39, a Gene Implicated in Developmental Language Disorder, Activates RHOA and Is Involved in Cell De-Adhesion and Neural Progenitor Cell Proliferation. Front Mol Neurosci. 2022;15:941494. pmid:35959104
  63. 63. Moore GE, Ishida M, Demetriou C, Al-Olabi L, Leon LJ, Thomas AC, et al. The role and interaction of imprinted genes in human fetal growth. Philos Trans R Soc Lond B Biol Sci. 2015;370(1663):20140074. pmid:25602077
  64. 64. Caniçais C, Vasconcelos S, Ramalho C, Marques CJ, Dória S. Deregulation of imprinted genes expression and epigenetic regulators in placental tissue from intrauterine growth restriction. J Assist Reprod Genet. 2021;38(4):791–801. pmid:33389447
  65. 65. Zhou Y, Wohlfahrt G, Paavola J, Olkkonen VM. A vertebrate model for the study of lipid binding/transfer protein function: conservation of OSBP-related proteins between zebrafish and human. Biochem Biophys Res Commun. 2014;446(3):675–80. pmid:24326072
  66. 66. Obeagu EI, Maibouge Tanko MS. Iron metabolism in breast cancer: mechanisms and therapeutic implications: a narrative review. Ann Med Surg (Lond). 2025;87(6):3403–9. pmid:40486542
  67. 67. Ding M, Ma Y-J, Du R-Q, Zhou W-Y, Dou X, Yang Q-Q, et al. CHCHD10 Modulates Thermogenesis of Adipocytes by Regulating Lipolysis. Diabetes. 2022;71(9):1862–79. pmid:35709007
  68. 68. Wu X, Zhang Z, Li J, Zong J, Yuan L, Shu L, et al. Chchd10: A Novel Metabolic Sensor Modulating Adipose Tissue Homeostasis. Adv Sci (Weinh). 2025;12(15):e2408763. pmid:39985288
  69. 69. Harjuhaahto S, Jokela M, Rajendran J, Rokka M, Hu B, Kvist J, et al. Dose-dependent CHCHD10 dysregulation dictates motor neuron disease severity and alters creatine metabolism. Acta Neuropathol Commun. 2025;13(1):111. pmid:40400037
  70. 70. Holowiecki A, O’Shields B, Jenny MJ. Spatiotemporal expression and transcriptional regulation of heme oxygenase and biliverdin reductase genes in zebrafish (Danio rerio) suggest novel roles during early developmental periods of heightened oxidative stress. Comp Biochem Physiol C Toxicol Pharmacol. 2017;191:138–51. pmid:27760386
  71. 71. Jin J, Shi X, Li Y, Zhang Q, Guo Y, Li C, et al. Reticulocalbin 3 Deficiency in Alveolar Epithelium Exacerbated Bleomycin-induced Pulmonary Fibrosis. Am J Respir Cell Mol Biol. 2018;59(3):320–33. pmid:29676583
  72. 72. Perna L, Salsbury G, Dushti M, Smith CJ, Morales V, Bianchi K, et al. Altered Cellular Metabolism Is a Consequence of Loss of the Ataxia-Linked Protein Sacsin. Int J Mol Sci. 2024;25(24):13242. pmid:39769008
  73. 73. Kroemer G, Mariño G, Levine B. Autophagy and the integrated stress response. Mol Cell. 2010;40(2):280–93. pmid:20965422
  74. 74. Yahiro K, Tsutsuki H, Ogura K, Nagasawa S, Moss J, Noda M. DAP1, a negative regulator of autophagy, controls SubAB-mediated apoptosis and autophagy. Infect Immun. 2014;82(11):4899–908. pmid:25183729
  75. 75. Koren I, Reem E, Kimchi A. DAP1, a novel substrate of mTOR, negatively regulates autophagy. Curr Biol. 2010;20(12):1093–8. pmid:20537536
  76. 76. Wazir U, Sanders AJ, Wazir A, Baig RM, Jiang WG, Ster IC, et al. Effect of the knockdown of death-associated protein 1 expression on cell adhesion, growth and migration in breast cancer cells. Oncol Rep. 2015;33(3):1450–8. pmid:25530065
  77. 77. Song Y, Sun X, Duan F, He C, Wu J, Huang X, et al. SYPL1 Inhibits Apoptosis in Pancreatic Ductal Adenocarcinoma via Suppression of ROS-Induced ERK Activation. Front Oncol. 2020;10:1482. pmid:33042794
  78. 78. Menon A, Pettinari L, Martinelli C, Colombo G, Portinaro N, Dalle-Donne I, et al. New insights in extracellular matrix remodeling and collagen turnover related pathways in cultured human tenocytes after ciprofloxacin administration. Muscles Ligaments Tendons J. 2013;3(3):122–31. pmid:24367771
  79. 79. Mohammadzadeh N, Lunde IG, Andenæs K, Strand ME, Aronsen JM, Skrbic B, et al. The extracellular matrix proteoglycan lumican improves survival and counteracts cardiac dilatation and failure in mice subjected to pressure overload. Sci Rep. 2019;9(1):9206. pmid:31235849
  80. 80. Mohammadzadeh N, Melleby AO, Palmero S, Sjaastad I, Chakravarti S, Engebretsen KVT, et al. Moderate Loss of the Extracellular Matrix Proteoglycan Lumican Attenuates Cardiac Fibrosis in Mice Subjected to Pressure Overload. Cardiology. 2020;145(3):187–98. pmid:31968347
  81. 81. Schüler SC, Kirkpatrick JM, Schmidt M, Santinha D, Koch P, Di Sanzo S, et al. Extensive remodeling of the extracellular matrix during aging contributes to age-dependent impairments of muscle stem cell functionality. Cell Rep. 2021;35(10):109223. pmid:34107247
  82. 82. Stanganello E, Hagemann AIH, Mattes B, Sinner C, Meyen D, Weber S, et al. Filopodia-based Wnt transport during vertebrate tissue patterning. Nat Commun. 2015;6:5846. pmid:25556612
  83. 83. Pasquariello R, Verdile N, Pavlovic R, Panseri S, Schirmer K, Brevini TAL, et al. New Stable Cell Lines Derived from the Proximal and Distal Intestine of Rainbow Trout (Oncorhynchus mykiss) Retain Several Properties Observed In Vivo. Cells. 2021;10(6):1555. pmid:34205481
  84. 84. Li S, Cao L, Zhang Z, Kuang M, Chen L, Zhao Y, et al. Cytosolic and nuclear recognition of virus and viral evasion. Mol Biomed. 2021;2(1):30. pmid:35006471
  85. 85. Howat TJ, Barreca C, O’Hare P, Gog JR, Grenfell BT. Modelling dynamics of the type I interferon response to in vitro viral infection. J R Soc Interface. 2006;3(10):699–709. pmid:16971338
  86. 86. Gopal R, Lee B, McHugh KJ, Rich HE, Ramanan K, Mandalapu S, et al. STAT2 Signaling Regulates Macrophage Phenotype During Influenza and Bacterial Super-Infection. Front Immunol. 2018;9:2151. pmid:30337919
  87. 87. Heath H, Britton G, Kudo H, Renney G, Ward M, Hutchins R, et al. Stat2 loss disrupts damage signalling and is protective in acute pancreatitis. J Pathol. 2020;252(1):41–52. pmid:32506441
  88. 88. Duodu P, Sosa G, Canar J, Chhugani O, Gamero AM. Exposing the Two Contrasting Faces of STAT2 in Inflammation. J Interferon Cytokine Res. 2022;42(9):467–81. pmid:35877097
  89. 89. Espada CE, da Rocha EL, Ricciardi-Jorge T, Dos Santos AA, Soares ZG, Malaquias G, et al. ISG15/USP18/STAT2 is a molecular hub regulating IFN I-mediated control of Dengue and Zika virus replication. Front Immunol. 2024;15:1331731. pmid:38384473
  90. 90. Tian Q, Zou Q, Jia L. Benchmarking of methods that identify alternative polyadenylation events in single-/multiple-polyadenylation site genes. NAR Genom Bioinform. 2025;7(2):lqaf056. pmid:40371010
  91. 91. Ross AB, Langer JD, Jovanovic M. Proteome Turnover in the Spotlight: Approaches, Applications, and Perspectives. Mol Cell Proteomics. 2021;20:100016. pmid:33556866
  92. 92. Degeratu MO, Schönenberger R, Huwa N, Groh K. Exploring zebrafish embryonic cell line PAC2 by proteomics profiling: highlights of analytical sciences in Switzerland. Chimia. 2024;78(7–8):558.
  93. 93. Tseng H-P, Hseu T-H, Buhler DR, Wang W-D, Hu C-H. Constitutive and xenobiotics-induced expression of a novel CYP3A gene from zebrafish larva. Toxicol Appl Pharmacol. 2005;205(3):247–58. pmid:15922010
  94. 94. Chang C-T, Chung H-Y, Su H-T, Tseng H-P, Tzou W-S, Hu C-H. Regulation of zebrafish CYP3A65 transcription by AHR2. Toxicol Appl Pharmacol. 2013;270(2):174–84. pmid:23624173
  95. 95. Klyushova LS, Perepechaeva ML, Grishanova AY. The Role of CYP3A in Health and Disease. Biomedicines. 2022;10(11):2686. pmid:36359206
  96. 96. Baer BR, Rettie AE. CYP4B1: an enigmatic P450 at the interface between xenobiotic and endobiotic metabolism. Drug Metab Rev. 2006;38(3):451–76. pmid:16877261
  97. 97. Röder A, Hüsken S, Hutter MC, Rettie AE, Hanenberg H, Wiek C. Spotlight on CYP4B1. International Journal of Molecular Sciences. 2023;24(3):2038.
  98. 98. Alnasser SM. The role of glutathione S-transferases in human disease pathogenesis and their current inhibitors. Genes Dis. 2024;12(4):101482. pmid:40290119
  99. 99. Wang Y, Huang H, Wu Q. Characterization of the zebrafish Ugt repertoire reveals a new class of drug-metabolizing UDP glucuronosyltransferases. Mol Pharmacol. 2014;86(1):62–75. pmid:24728488
  100. 100. Mohammed YI, Kurogi K, Al Shaban A, Xu Z, Liu MY, Williams FE. Identification and characterization of zebrafish sult1 st9, sult3 st4, and sult3 st5. Aquatic Toxicology. 2012;112:11–8.
  101. 101. Boukouvala S, Kontomina E, Olbasalis I, Patriarcheas D, Tzimotoudis D, Arvaniti K, et al. Insights into the genomic and functional divergence of NAT gene family to serve microbial secondary metabolism. Sci Rep. 2024;14(1):14905. pmid:38942826
  102. 102. Han B, Zhang J-T. Multidrug resistance in cancer chemotherapy and xenobiotic protection mediated by the half ATP-binding cassette transporter ABCG2. Curr Med Chem Anticancer Agents. 2004;4(1):31–42. pmid:14754410
  103. 103. Shi P, Tang L, Yin F, Guo H, Liu J. Expression, Function, and Regulation of ABCG2 on the Intestinal Epithelial Barrier Permeability. Curr Drug Metab. 2025;26(1):30–8. pmid:40231523
  104. 104. Xiao L, Lu Z, Fang H, Zhou Y, Che W, Zhang W, et al. Explorations of novel MDR-related hub genes and the potential roles TRIM9 played in drug-resistant hepatocellular carcinoma. Int J Biol Macromol. 2025;290:138949. pmid:39706432
  105. 105. Fischer S, Loncar J, Zaja R, Schnell S, Schirmer K, Smital T. Constitutive mRNA expression and protein activity levels of nine ABC efflux transporters in seven permanent cell lines derived from different tissues of rainbow trout (Oncorhynchus mykiss). Aquatic Toxicology. 2011;101(2):438–46.
  106. 106. Ferreira M, Costa J, Reis-Henriques MA. ABC transporters in fish species: a review. Frontiers in Physiology. 2014;5:266.
  107. 107. Kaeser-Pebernard S, Vionnet C, Mari M, Sankar DS, Hu Z, Roubaty C, et al. mTORC1 controls Golgi architecture and vesicle secretion by phosphorylation of SCYL1. Nat Commun. 2022;13(1):4685. pmid:35948564
  108. 108. Pelis RM, Wright SH. SLC22, SLC44, and SLC47 transporters--organic anion and cation transporters: molecular and cellular properties. Curr Top Membr. 2014;73:233–61. pmid:24745985
  109. 109. Felmlee MA, Jones RS, Rodriguez-Cruz V, Follman KE, Morris ME. Monocarboxylate Transporters (SLC16): Function, Regulation, and Role in Health and Disease. Pharmacol Rev. 2020;72(2):466–85. pmid:32144120
  110. 110. Yee SW, Macdonald CB, Mitrovic D, Zhou X, Koleske ML, Yang J, et al. The full spectrum of SLC22 OCT1 mutations illuminates the bridge between drug transporter biophysics and pharmacogenomics. Mol Cell. 2024;84(10):1932-1947.e10. pmid:38703769
  111. 111. Lončar J, Popović M, Krznar P, Zaja R, Smital T. The first characterization of multidrug and toxin extrusion (MATE/SLC47) proteins in zebrafish (Danio rerio). Sci Rep. 2016;6:28937. pmid:27357367
  112. 112. Suvorov A, Salemme V, McGaunn J, Poluyanoff A, Teffera M, Amir S. Unbiased approach for the identification of molecular mechanisms sensitive to chemical exposures. Chemosphere. 2021;262:128362. pmid:33182146
  113. 113. Sun Y, Sun K, Ling H, Xia Q. Farnesoid X receptor‑driven metabolic plasticity: Bridging physiological adaptation and malignant transformation in lipid handling (Review). Int J Mol Med. 2025;56(1):110. pmid:40376981
  114. 114. Den Broeder MJ, Kopylova VA, Kamminga LM, Legler J. Zebrafish as a Model to Study the Role of Peroxisome Proliferating-Activated Receptors in Adipogenesis and Obesity. PPAR Res. 2015;2015:358029. pmid:26697060
  115. 115. Schaaf MJM. Nuclear receptor research in zebrafish. J Mol Endocrinol. 2017;59(1):R65–76. pmid:28438785
  116. 116. van den Boom R, Vergauwen L, Knapen D. Effects of Metabolic Disruption on Lipid Metabolism and Yolk Retention in Zebrafish Embryos. Environ Toxicol Chem. 2024;43(8):1880–93. pmid:38860666
  117. 117. Adlanmerini M, Lazar MA. The REV-ERB Nuclear Receptors: Timekeepers for the Core Clock Period and Metabolism. Endocrinology. 2023;164(6):bqad069. pmid:37149727
  118. 118. Sloin HE, Ruggiero G, Rubinstein A, Smadja Storz S, Foulkes NS, Gothilf Y. Interactions between the circadian clock and TGF-β signaling pathway in zebrafish. PLoS One. 2018;13(6):e0199777. pmid:29940038
  119. 119. Lewandowski M, Busch R, Marschner JA, Merk D. Comparative Evaluation and Profiling of Chemical Tools for the Nuclear Hormone Receptor Family 2. ACS Pharmacol Transl Sci. 2025;8(3):854–70. pmid:40046426
  120. 120. Schirmer K, Chan AG, Greenberg BM, Dixon DG, Bols NC. Methodology for demonstrating and measuring the photocytotoxicity of fluoranthene to fish cells in culture. Toxicol In Vitro. 1997;11(1–2):107–19. pmid:20654301
  121. 121. Jozef B, Zhang ZR, Kaltenbach H-M, Schirmer K. A systematic approach towards long-term, serum-free cultivation of fish cells with the RTgill-W1 cell line as example. NAM Journal. 2025;1:100008.
  122. 122. Frankenfield AM, Ni J, Ahmed M, Hao L. Protein Contaminants Matter: Building Universal Protein Contaminant Libraries for DDA and DIA Proteomics. J Proteome Res. 2022;21(9):2104–13. pmid:35793413
  123. 123. Tyanova S, Temu T, Sinitcyn P, Carlson A, Hein MY, Geiger T, et al. The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat Methods. 2016;13(9):731–40. pmid:27348712
  124. 124. Perez-Riverol Y, Bandla C, Kundu DJ, Kamatchinathan S, Bai J, Hewapathirana S. The PRIDE database at 20 years: 2025 update. Nucleic Acids Research. 2025;53(D1):D543–53.
  125. 125. Bray NL, Pimentel H, Melsted P, Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016;34(5):525–7. pmid:27043002
  126. 126. Edgar R, Domrachev M, Lash AE. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002;30(1):207–10. pmid:11752295
  127. 127. Chen Y, Chen L, Lun ATL, Baldoni PL, Smyth GK. edgeR v4: powerful differential analysis of sequencing data with expanded functionality and improved support for small counts and larger datasets. Nucleic Acids Res. 2025;53(2):gkaf018. pmid:39844453
  128. 128. Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57. pmid:19131956
  129. 129. Sherman BT, Hao M, Qiu J, Jiao X, Baseler MW, Lane HC, et al. DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic acids research. 2022;50(W1):W216–W21.
  130. 130. Heberle H, Meirelles GV, da Silva FR, Telles GP, Minghim R. InteractiVenn: a web-based tool for the analysis of sets through Venn diagrams. BMC Bioinformatics. 2015;16(1):169. pmid:25994840
  131. 131. Supek F, Bošnjak M, Škunca N, Šmuc T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS One. 2011;6(7):e21800. pmid:21789182
  132. 132. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504. pmid:14597658
  133. 133. Ge SX, Jung D, Yao R. ShinyGO: a graphical gene-set enrichment tool for animals and plants. Bioinformatics. 2020;36(8):2628–9. pmid:31882993
  134. 134. Rohart F, Gautier B, Singh A, Lê Cao K-A. mixOmics: An R package for ’omics feature selection and multiple data integration. PLoS Comput Biol. 2017;13(11):e1005752. pmid:29099853
  135. 135. Argelaguet R, Velten B, Arnol D, Dietrich S, Zenz T, Marioni JC, et al. Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets. Mol Syst Biol. 2018;14(6):e8124. pmid:29925568
  136. 136. Singh A, Shannon CP, Gautier B, Rohart F, Vacher M, Tebbutt SJ, et al. DIABLO: an integrative approach for identifying key molecular drivers from multi-omics assays. Bioinformatics. 2019;35(17):3055–62. pmid:30657866