Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Best reference genes for unbiased normalized transcript expression in normal and dystrophic human cell models of myogenesis

  • Raffaella Quarta ,

    Contributed equally to this work with: Raffaella Quarta, Brigida Boccanegra

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft

    Affiliation Department of Pharmacy - Drug Sciences, University of Bari “Aldo Moro”, Bari, Italy

  • Brigida Boccanegra ,

    Contributed equally to this work with: Raffaella Quarta, Brigida Boccanegra

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Validation, Visualization, Writing – original draft

    Affiliation Department of Pharmacy - Drug Sciences, University of Bari “Aldo Moro”, Bari, Italy

  • Enrica Cristiano,

    Roles Data curation, Formal analysis

    Affiliation Department of Pharmacy - Drug Sciences, University of Bari “Aldo Moro”, Bari, Italy

  • Alberto Ladisa,

    Roles Data curation, Formal analysis

    Affiliation Department of Pharmacy - Drug Sciences, University of Bari “Aldo Moro”, Bari, Italy

  • Elena Conte,

    Roles Data curation, Formal analysis

    Affiliation Department of Pharmacy - Drug Sciences, University of Bari “Aldo Moro”, Bari, Italy

  • Jessica Ohana,

    Roles Resources

    Affiliation Institut de Myologie, Centre de Recherche en Myologie, Sorbonne Université, Paris, France

  • Vincent Mouly,

    Roles Resources

    Affiliation Institut de Myologie, Centre de Recherche en Myologie, Sorbonne Université, Paris, France

  • Annamaria De Luca,

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing – original draft

    Affiliation Department of Pharmacy - Drug Sciences, University of Bari “Aldo Moro”, Bari, Italy

  • John Hildyard ,

    Roles Conceptualization, Data curation, Formal analysis, Software, Supervision, Validation, Visualization, Writing – original draft

    jhildyard@rvc.ac.uk (JH); ornella.cappellari@uniba.it (OC);

    ‡ These authors share senior co-authorship

    Affiliation Department of Clinical Science and Services, Royal Veterinary College, London, United Kingdom

  • Ornella Cappellari

    Roles Conceptualization, Formal analysis, Funding acquisition, Project administration, Supervision, Writing – original draft

    jhildyard@rvc.ac.uk (JH); ornella.cappellari@uniba.it (OC);

    ‡ These authors share senior co-authorship

    Affiliation Department of Pharmacy - Drug Sciences, University of Bari “Aldo Moro”, Bari, Italy

Abstract

Patient-derived cell models of dystrophic myogenesis and differentiation are valuable preclinical tools for early and mutation-based assessment of candidate therapeutic approaches. Quantitative measurement of gene expression within such models plays a key role in these studies, but normalisation of RT-qPCR data requires a panel of validated stably expressed reference genes. This study aims to identify stable reference genes for RT-qPCR assays in three human derived muscle immortalized cell lines: one healthy WT (from a 16-year-old donor), and two dystrophic lines, DMD1 (from a 11-year-old patient carrying a stop codon mutation on exon 59) and DMD2, from a 14-year-old patient carrying an exon 48–50 deletion. We screened a pool of 14 candidate genes (ACTB, HPRT1, RPL13A, RPS18, GAPDH, ALAS1, UBC, YWHAZ, IPO8, PSMC4, HSP90AB1, NONO, CSNK2A2, AP3D1), investigating stability of expression from proliferation through to 11 days of myogenic differentiation. Data were analysed using four complementary approaches (Bestkeeper, geNorm, Normfinder and DeltaCt) to determine the most appropriate references both within and between cell lines. Our study shows that RPS18, UBC, YWHAZ scored highly across all comparisons, and we therefore suggest that these three genes represent an appropriate reference panel for these human myogenic cell lines, regardless of genotype or differentiation stage.

1. Introduction

In vitro cellular models of skeletal muscle are widely used in preclinical studies of neuromuscular diseases (such as Duchenne Muscular Dystrophy, DMD) to better elucidate pathological mechanisms and as an early platform for drug testing. DMD is a X-linked inherited recessive, muscle-wasting disorder characterized by muscle degeneration, progressive disability and premature death [1]. The disease is caused by heterogeneous mutations in the gene that encodes for the dystrophin protein (DMD) which has a pivotal function in myofiber stability due to its central role in the dystrophin associated glycoprotein complex (DGC), a network of proteins that ensure a link between cytoskeleton and extracellular matrix. In the absence of dystrophin, the DGC cannot assemble, leaving myofibres vulnerable to contraction-induced injury. Repeated cycles of degeneration and regeneration lead to a complex pathogenetic cascade, characterized by chronic inflammation, oxidative stress and substitution of muscle tissue with fibrotic scarring [2]. DMD presently has no cure, but there are continued efforts within the scientific community to find therapies that correct the primary defect and restore dystrophin (such as exon-skipping), or that pharmacologically ameliorate aspects of the complex cascade initiated by dystrophin absence (such as corticosteroids to combat persistent inflammation). Animal models are widely used to obtain robust preclinical data, however the drive to refine, reduce and replace animal experiments (the 3Rs) has increasing favoured cell-based models (2D and 3D) for preclinical studies. Patient-derived cell culture models, in particular, offer a predictive and reliable tool to better support preclinical research, and isolated myoblasts can be immortalized to serve as an ongoing renewable resource. Evaluation of therapeutic efficacy (especially dystrophin restoration) typically requires myogenic differentiation of these proliferating myoblasts to form post-mitotic multinucleated myotubes, a process involving major changes in morphology and cellular activity, controlled by an elaborate network of regulatory mechanisms [3,4].

RT-qPCR is an essential quantitative method for measuring pathological or therapeutic changes in gene expression in these cell culture models, however analysis of data first requires proper normalisation to internal “reference genes” or “housekeeping genes”: genes with expression patterns that are stable regardless of experimental conditions, and which are ideally present at levels comparable to any genes of interest [5,6]. Identifying appropriate reference genes within highly dynamic scenarios such as myogenic differentiation and concomitant pathological background is, however, challenging. Evidence is mounting that other popular reference genes, such as GAPDH and ACTB, are unsuitable candidates in several cellular models [7,8]. These studies have detected GAPDH involvement in many biological activities including membrane fusion [9], vesicular transport [10] and apoptosis pathways [1113]. These multifunctional properties of GAPDH have been demonstrated to result in its changeable pattern of expression depending on different conditions [14]. Moreover, beta-actin (ACTB) has traditionally been regarded as an endogenous housekeeping gene and has been widely used as a reference gene/protein in quantifying expression levels. However, ACTB is closely associated with a variety of cancers, and accumulating evidence indicates that ACTB is de-regulated in liver, melanoma, renal, colorectal, gastric. In addition, studies on muscle cell cultures have demonstrated that ACTB expression progressively declines as differentiation proceeds throughout myogenesis. Such fluctuations may hinder its reliability as a housekeeping gene when performing RT-PCR studies on skeletal muscle derived cell cultures during differentiation. While some genes perform well in multiple scenarios, it is increasingly becoming clear that no truly universal reference genes exist, and that appropriate references should be determined empirically for every comparative situation. Furthermore, gene expression is inherently dynamic, and mRNA can be synthesised or degraded over rapid timescales: even with ostensibly stable genes, reliance on a single reference is unwise, and panels of two or three references are advisable to buffer against rare stochastic changes in expression (ideally these genes should be selected from different functional categories, i.e., not co-ordinately regulated) [15]. In this frame, we previously conducted studies on muscle cell cultures and on muscle samples derived from commonly used dystrophic animal models for preclinical DMD research (e.g., BL10-mdx and D2-mdx mouse models, DE50-MD dogs) in order to identify the most reliable housekeeping genes for RT-PCR studies [6].

In light of previous findings, this study aimed to establish suitable reference genes for three human-derived immortalized muscle cell lines: one healthy WT (from a 16-year-old donor), and two dystrophic lines, DMD1 (from a 11-year-old patient carrying a stop codon mutation on exon 59) and DMD2, from a 14-year-old patient carrying an exon 48–50 deletion. We screened a panel of 14 candidate reference genes, measuring expression in each cell line at different stages of differentiation and in the proliferative state. This set of genes (ACTB, HPRT1, RPL13A, RPS18, GAPDH, ALAS1, UBC, YWHAZ, IPO8, PSMC4, HSP90AB1, NONO, CSNK2A2, AP3D1) was selected from GeneCards database following the criteria of level expression (high) in human muscle tissue.

Expression data were analysed using four different algorithms: geNorm [16], Bestkeeper [17], DeltaCt [18] and Normfinder [19]. Each method is capable of determining optimal references a priori, but each uses different, but complementary criteria: reliably strong reference candidates should accordingly score highly in all four approaches.

2. Materials and methods

2.1. Cell culture: maintenance and sample collection

We used three paediatric patient- and muscle-derived satellite cell lines granted from the biobank MyoLine, previously immortalized and validated by the Mouly lab [20,21]: WT (AB1190), DMD1 (stop exon 59; AB1023) and DMD2 (deletion 48–50; AB1098). Cells were periodically checked for mycoplasma contaminations using MycoStrip®myocoplasma detection kit (InvivoGen cat# rep-mys-50). Briefly, cells were quickly thawed from separate vials and resuspended in fresh growth medium (passage number 8–9, Myoline biobank, France). Once cells reached approximately 70% of confluency, they were subcultured into separate flasks. Cells were cultured in Greiner CELLSTAR® T-75 flask (GN658195) and maintained in a proliferative state (myoblast) at 37°C by using skeletal muscle cell growth medium (S1 Table in S1 File). For ‘Myoblast stage’ samples, approx. 2.5 × 105 cells of each line were collected and used for RNA extraction.

To start myogenic differentiation, cells (2.5 × 105) were seeded in petri dishes with low serum differentiation medium (S1 Table in S1 File). Cells were incubated at 37°C throughout the differentiation process. Samples were harvested for RNA isolation at two time points, corresponding to different stages of the myogenic cascade: 6 and 11 days of differentiation. Differentiation medium was partially replaced (50:50 v/v) with fresh differentiation medium every 48 hours until each endpoint. Our entire dataset consists of three biological replicates from three separate culturing sessions of each sample (N = 3 per cell line).

2.2. Immunofluorescence staining

To characterise the state of differentiation of our cells lines we performed immunofluorescence assay to detect myosin heavy chain, as a myogenic marker. 2.5 × 105 cells/well were cultured in a 6-well plate (CytoOne® - Starlab) for 6 and 11 days in differentiation medium. At each timepoint, cells were fixed in 4% paraformaldehyde in PBS for 15 min at room temperature and then washed 3 times with PBS and blocked in Blocking Buffer (2% BSA + 1% Triton X-100) for 45 min. Anti-myosin primary antibody (MF-20-S; 1:5; Developmental Studies Hybridoma Bank) was diluted in Blocking Buffer (w/Triton) and incubated at RT for 1h. After three washes in PBS, cells were incubated with Alexa FluorTM 594-conjugated secondary antibodies (1:500; Thermo Fisher Scientific) for 1 h at RT. Nuclei were stained with Pure Blu Hoechst nuclear staining dye (1:100; Bio-Rad). Mounting Medium was added before imaging. Images were acquired using a Leica THUNDER Imager (11525687) with a 20x objective.

2.3. RNA extraction, cDNA synthesis and real time PCR

Total RNA was isolated using Total RNA Purification Plus Micro Kit (Norgen Biotek Corp. NR48500) following manufacturer’s instructions. RNA purification kit includes also a genomic DNA removal step and to confirm its purity, RNA was evaluated via nanodrop spectroscopy to confirm that 260/280 ratios were >2.0, and 260/230 ratios >1.8. Samples that did not met these parameters were considered not acceptable and therefore excluded. cDNA synthesis (800 ng of RNA) was performed using the iScript™ gDNA Clear cDNA Synthesis Kit (Bio-Rad) according to the manufacturer’s protocols. Following DNAse heat-inactivation, samples were reverse transcribed using oligodT and random priming. Quantitative Real Time PCR was performed via the CFX Duet Real-Time PCR system (Bio-Rad, Hercules, CA, USA) by using Prime PCR assays SYBR® green. Each reaction was performed in technical triplicate in Hard-shell® 96 well PCR plates using 8 ng of diluted cDNA (assuming 1:1 synthesis) per well for a total volume of 10 µl. Volumes and concentrations are listed in S2 Table in S1 File. The real-time PCR cycling protocol is described in S3 Table in S1 File. [22,23] All reference gene primers were purchased from Bio-Rad (Unique Assay IDs are shown in Table 1). Quantification cycle (Cq) values were determined via regression analysis of the amplification traces, and per-sample replicate Cq values were used to calculate sample Cq (arithmetic mean). Subsequent analysis used either Cq values (Bestkeeper, deltaCt) or linearised relative quantity (RQ) values (geNorm, Normfinder, expression analysis). Where multiple reference genes were used for normalisation, RQ values for each gene were combined (geometric mean) on a per-sample basis into normalisation factor (NF) values.

thumbnail
Table 1. List of candidate genes with their Unique assay IDs from Bio-Rad.

https://doi.org/10.1371/journal.pone.0344973.t001

2.4. Reference gene analyses

Calculation of RQ values was performed as standard: efficiency(lowest Cq – sample Cq), where efficiency is the increase in total amplicon per cycle (thus here, 2.0, corresponding to 100% efficiency). All Cq values for a given gene are accordingly converted to a linear scale (0–1). Candidate reference gene data was assessed using four different approaches (geNorm, Bestkeeper, deltaCt and Normfinder). These identify suitable references independently, but each uses different methods to determine ranking. As we have shown previously [2426], use of multiple complementary approaches strengthens analysis: genes consistently scored highly are likely to be strong candidates, while genes that score highly under some approaches but not others can provide insights of biological relevance. Furthermore, provided that a suitably large sample set is employed, data can be assessed both as a whole, and as subsets: here we determine reference genes appropriate for both healthy and dystrophic cell cultures (all samples), but also, we investigate whether different combinations of references emerge if data are assessed when segregated by genotype (healthy vs dystrophic cell line) or differentiation stage.

GeNorm [16] uses an iterative pairwise approach: pairwise variation values are determined for all combinations of candidate genes, and then ranked accordingly. The gene with the highest total pairwise variation (M) is then discarded, and the process repeated until only a single pair of genes remain: the “best pair”. Finally, all genes are ranked from ‘worst’ to ‘best pair’ by their M values, where low M values indicate high stability. This method assumes that unrelated genes exhibiting highly correlated expression values are likely to reflect cDNA content, however it should be noted that pairs of co-ordinately regulated genes (such as ribosomal protein genes) will also tend to score highly under geNorm. The geNorm algorithm uses data linearized to relative quantities (RQ).

Bestkeeper [17] uses raw Cq data to generate a hypothetical expression profile, the ‘bestkeeper’: representing the mean behaviour of all genes in the candidate panel. Individual candidates are then assessed for pairwise correlation with this and ranked accordingly, by Pearson correlation “r”, where high values indicate high correlation. The Bestkeeper approach assumes that the consensus expression profile of multiple genes increasingly reflects overall cDNA levels, and individual genes that closely match this profile are thus suitable references.

The DeltaCt [18] approach takes per-sample Cq differences between genes (dCt = Cq gene1 – Cq gene2) and then calculates the standard deviation (SD) of these values for the entire sample set: a pair of genes that vary in a consistent manner will thus produce low SD values, regardless of differences in relative abundance. Each gene is compared with all other genes in the panel, and then ranked by their mean SD value, with low overall values indicating high stability. The assumption is that genes that exhibit no strong differential expression (i.e., good candidate references) will exhibit the least overall divergence from other genes in the candidate pool.

Finally, Normfinder [19] empirically assesses expression variance across the sample set for each gene individually, and thus does not involve any pairwise comparative elements. Normfinder can assess a dataset as a whole, but can also be conducted with data assigned to specific groups (e.g., “healthy” or “dystrophic”). This grouped analysis allows intergroup variation to be identified and treated accordingly: genes exhibiting high overall stability but also a modest but consistent group-specific bias will consequently be scored less highly than genes with greater, but group-agnostic, variability. This latter approach also identifies a “best pair”: unlike the best pair identified by geNorm, this pair of genes can include two genes that vary in a consistently opposed fashion, such that their mean behaviour represents a more stable reference than any other gene alone.

To generate an aggregate ranking, we calculated the geometric mean of the four algorithm scores on a per-gene basis. As bestkeeper ranks from low to high values, while all other methods rank from high to low values, all bestkeeper data was inverted (1-BK score) prior to use.

3. Results

3.1. Myotube characterization

First, we characterized the state of maturation of all three cell lines at day 6 and day 11 of differentiation via immunolabelling for myosin heavy chain (as a marker of terminal differentiation). Staining with the pan-myosin antibody MF20 revealed an intense signal in healthy myotubes, indicative of well-developed sarcomeric structures and efficient myogenic differentiation. In contrast, both dystrophic cell lines exhibited markedly lower MF20 labelling than healthy cells, with DMD1 showing even lower staining than DMD2 (at day 11), suggesting impaired expression or organization of contractile proteins. Morphological differences could also be detected in brightfield images (Fig 1).

thumbnail
Fig 1. Human derived myoblasts and myotubes.

Cell lines at different stages of differentiation: myoblast, day 6 and day 11 of differentiation. The bright-field images were taken with Eclipse Ti2 Nikon microscope at 10X magnification. Immunofluorescence staining showing heavy chain of myosin. Images show myosin at day 6 and day 11 of differentiation in all three cell lines. Scale bar: 100μm.

https://doi.org/10.1371/journal.pone.0344973.g001

3.2. Raw Cq values

As shown in Fig 2, typical Cq values of the genes in our candidate panel spanned a range from ~15–27, with high abundance transcripts (GAPDH, ACTB, RPS18) exhibiting the lowest Cq values as expected. No marked differences associated with genotype/age were observed in raw Cq values in any candidate gene.

thumbnail
Fig 2. Box and whisker plots for raw Cq values across the candidate gene panel.

Individual (per sample) Cq values are indicated: solid blue circles: WT cells; solid red circles: DMD1; hollow red circles: DMD2.

https://doi.org/10.1371/journal.pone.0344973.g002

3.3. GeNorm

The geNorm algorithm ranks genes by pairwise variation scores (M), identifying the pair of genes that show the most closely correlated patten of expression (the best pair). As shown in Fig 3A, assessment of the entire dataset revealed UBC and YWHAZ as the best pair, with GAPDH and ACTB also ranking highly, while HPRT1 and HSP90 were least stable. No genes exhibited M values below 0.5 (the commonly used threshold for suitability), suggesting that substantial variation existed across the sample set, either between cell lines, differentiation stages, or both. Supporting this, M values were lower when analysis was restricted by cell line (Fig 3B). Here, WT and DMD1 rankings were similar to the overall dataset, while DMD2 was markedly different, with AP3D1 and IPO8 forming the best pair (this latter gene ranked near last in WT and DMD1 cells). Following assessment of differentiation-stage-specific subsets (Fig 3C) several genes fell below the 0.5 threshold, indicating myogenesis represented the strongest contributor to variation in gene expression. Gene rankings in myoblasts and in well-differentiated cells (day 11) were broadly comparable to those in the overall dataset (with YWHAZ and GAPDH scoring highly, while HPRT1 and CSNK2 scored poorly), but rankings from cells at day 6 of differentiation (Fig 3C) showed less agreement, with AP3D1 and IPO8 being the best pair (similar to DMD2 alone). Our data thus suggested a pronounced divergence in gene stability profile at this specific timepoint.

thumbnail
Fig 3. GeNorm analysis.

Stability values (M) of individual genes ranked from least stable (left) to most stable (right) calculated via geNorm. (3A) Rankings are shown for the entire dataset. (3B) Rankings of individual cell lines: WT alone (upper left panel), all DMD (upper right panel), DMD1 and DMD2 only (bottom panel). (3C) Rankings of differentiation stage specific subsets (myoblast stage, day 6 and day 11).

https://doi.org/10.1371/journal.pone.0344973.g003

GeNorm also determines whether additional reference genes (beyond the best pair) reduce variation (the common target being 0.2 or below). In agreement with the high M values noted above, the best pair alone was insufficient in most cases (myoblast- and day 11-specific subsets being exceptions). Use of 3 or 4 reference genes however markedly reduced variation.

3.4. Bestkeeper

The Bestkeeper algorithm compares individual genes with the consensus average of all candidate genes (the ‘bestkeeper’), ranking by Pearson correlation (thus high values indicate stronger reference candidates). As shown in Fig 4A, for our entire sample set YWHAZ, UBC and ACTB again scored highly, along with RPS18. At the other end of the scale, ALAS1, AP3D1, HPRT1 and IPO8 showed particularly poor correlations. Assessment of disease or cell-line specific datasets revealed prominent differences, however (Fig 4B), in dystrophic samples (either collectively or individually) RPS18, ACTB and UBC all ranked highly, while ALAS1, IPO8 and HPRT1 scored exceptionally poorly, but in WT samples ACTB had a markedly lower correlation coefficient, while HPRT1 was unexpectedly ranked highest. Assessment by differentiation stage (Fig 4C) suggested increasing stability with myogenic progression: several genes (such as ALAS1), exhibited negative correlation coefficients in myoblasts and at day 6, but at day 11 fully half the candidate panel pool correlated highly (r values >0.75).

thumbnail
Fig 4. Bestkeeper analysis.

Individual genes from the candidate panel ranked by Pearson correlation coefficient (r) against the “bestkeeper” (the mean expression behaviour of all genes in the candidate panel). High correlations indicate stronger reference candidates. (4A) Data is shown for the entire dataset. (4B) Genotype specific rankings. (4C) Differentiation stage specific rankings.

https://doi.org/10.1371/journal.pone.0344973.g004

3.5. DeltaCt

DeltaCt assesses the extent to which each individual reference candidate varies from all other genes (on a per-sample basis), and thus genes with low mean dCt standard deviations are considered more appropriate references. As shown in Fig 5A, the highest scoring candidates for our complete sample set were RPS18, UBC, YWHAZ and GAPDH, and these genes also tended to score highly in disease- or cell-line specific datasets (Fig 5B). Rankings in WT cells identified PSMC4 and YWHAZ as the best references, however these genes were of only intermediate ranking in both DMD cell lines, where UBC was instead ranked most stable. As with Bestkeeper (see above), expression stability increased with myogenic maturity (Fig 5C): dCt values ranged from 1.3–2.0 at day 6, yet almost all were <1.0 at day 11, again indicating differentiation was the greatest contributor to expression variation. Overall, UBC, RPS18 and YWHAZ continued to rank highly.

thumbnail
Fig 5. DeltaCt analysis.

Individual genes ranked by summed dCt standard deviations: high values (left) indicate poor candidate references, while low values (right) indicate stronger candidates. (5A) Data is shown for the entire dataset. (5B) Genotype-specific rankings. (5C) Differentiation stage specific rankings.

https://doi.org/10.1371/journal.pone.0344973.g005

3.6. Normfinder

Normfinder determines empirical stability across the dataset on a gene-by-gene basis (rather than via pairwise comparisons) but can also assess data using user-assigned groups (genotype, age, etc.) to identify genes exhibiting subtle group-specific expression patterns (‘grouped analysis’). Here, low stability values correspond to highly stable genes. We first used ungrouped analysis: for our complete dataset (Fig 6A) RPS18, UBC and GAPDH were the strongest candidates, while HSP90, HPRT1 and IPO8 were weakest. Cell-line and disease-specific subsets suggested slightly different rankings (Fig 6B) but UBC and RPS18 still consistently ranked highly, and the former gene was also a strong candidate during differentiation (Fig 6C), where in agreement with other assessment methods, overall expression stability increased (lower scores) with myogenic maturation. We then assessed our complete dataset using grouped analysis (Fig 6D), taking the entire dataset but assigning data to groups either by “disease” (two groups; healthy, dystrophic), “cell-line” (three groups; WT, DMD1 and DMD2) or by “differentiation” (three groups; myoblast, day 6, day 11). Stability values were markedly higher (more variable) when grouped by differentiation stage, supporting myogenesis as the major source of dataset expression variability. Nevertheless, RPS18 and particularly UBC performed well in all cases (the latter being consistently the highest ranking, and forming one half of the ‘best pair’ in all comparisons). Similarly, HPRT1, HSP90 and IPO8 scored poorly in all comparisons.

thumbnail
Fig 6. Normfinder analysis, ungrouped: individual gene candidates ranked by Normfinder stability value from least stable to most (left to right).

(6A) Data is shown for the entire dataset. (6B) Genotype specific rankings. (6C) Differentiation stage subsets rankings. (6D) Rankings of complete dataset using grouped analysis, grouped by “disease”, “cell line” and “differentiation stage”.

https://doi.org/10.1371/journal.pone.0344973.g006

3.7. Combined analysis

To aggregate the outputs of these four algorithms into an overall ranking, we generated the geometric mean of the scores (see methods) for each analysis (Fig 7A). RPS18, UBC YWHAZ, ACTB and GAPDH all scored highly within the full dataset, while HPRT1, HSP90 and IPO8 scored poorly. Final rankings across subset analyses were less consistent, with ACTB in particular being highly variable during myogenesis (ranked highly in dystrophic samples, myoblasts and day 11 differentiated myotubes, but second-last in day 6 myotubes) and across genotypes (high ranking in DMD cells -particularly DMD1- but not in WT). These data suggest that despite ostensibly strong performance in our full dataset, ACTB is not a suitable reference for myogenic scenarios.

thumbnail
Fig 7. Combined rankings.

Aggregate geNorm, deltaCt, Bestkeeper and Normfinder scores of all candidate genes in the entire dataset (7A), ranked from least to most stable (left to right). (7B) Combined scores for the entire dataset showing genotype specific rankings. (7C) Differentiation stage specific rankings.

https://doi.org/10.1371/journal.pone.0344973.g007

Overall, UBC and (to a lesser extent) RPS18, YWHAZ and GAPDH typically emerged as strong candidates under all comparative scenarios, suggesting that these four might be a suitable reference panel.

3.8. Validation

GeNorm analysis suggested that 3–4 reference candidates were necessary for effective normalisation. Given their high rankings (both overall and within specific conditional subsets), we generated normalisation factors (NF) from UBC, RPS18 and YWHAZ, with or without further addition of GAPDH. Comparison of the resultant 4-gene and 3-gene NFs revealed minimal differences (i.e., inclusion of GAPDH was unnecessary) (S1 Fig in S1 File).

To validate this 3-gene NF, we then used a within-dataset approach as described previously: essentially, using high scoring genes to normalise consistently poor-scoring genes. As HPRT1, IPO8 and HSP90 were near-uniformly ranked last in our analysis, these were accordingly selected for normalisation to our NF. Normalisation of HPRT1 revealed distinct expression patterns across cell lines (Fig 8). In WT cells, expression was high in myoblasts but decreased during differentiation. DMD2 cells showed a similar but less pronounced pattern. In contrast, DMD1 cells exhibited lower expression in myoblasts that increased progressively throughout differentiation. IPO8 expression decreased between the myoblast stage and day 11 of differentiation across all cell lines, with more variable levels observed at day 6. HSP90 expression increased throughout differentiation in all cell lines. Notably, basal expression at the myoblast stage was approximately 10-fold lower in DMD2 compared to WT cells.

thumbnail
Fig 8. Normalization of HPRT1, IPO8, HSP90, DMD and MYOG expression using 3-gene NF.

Expression of HPRT1, IPO8 and HSP90 (poor ranking candidates) across cell lines throughout differentiation using a normalisation factor derived from RPS18, UBC and YWHAZ. Two-way ANOVA with post-hoc multiple comparisons (Tukey’s). For HSP90, significant effect of differentiation (P < 0.0001) and significant effect of cell-line (P = 0.0024), for IPO8, significant effect of differentiation (P < 0.0001). For HPRT1, no significant overall effects of either differentiation or cell-line, but a significant interaction between the two (P = 0.0025). For MYOG, significant effect of differentiation (P < 0.0001) and significant effect of cell-line (P < 0.0001).

https://doi.org/10.1371/journal.pone.0344973.g008

We also normalized MYOG and DMD expression using our NF. MYOG expression increased markedly as differentiation proceeded across all cell lines. DMD expression showed steady increases in the WT cell line, while both DMD lines exhibited more variable patterns.

Collectively, these data indicate that use of UBC, RPS18 and YWHAZ allows normalisation of gene expression in our healthy and dystrophic cell lines, during proliferation and throughout differentiation.

4. Discussion

Well-ordered and coordinated myogenic differentiation is essential, both for muscle development during embryogenesis and for regeneration/repair of adult skeletal muscle [27]. Muscle is a complex and heterogeneous tissue with a plethora of genetic pathways regulating the myogenic program during differentiation [28]. Study of gene expression during differentiation requires normalization, but selecting suitable reference genes within such transcriptionally dynamic scenarios is challenging. In this study, we used four complementary reference gene assessment methods (GeNorm, Bestkeeper, DeltaCt and Normfinder) to assess a large candidate gene panel in proliferating and differentiating, patient-derived, cell-culture models of healthy and dystrophic muscle. By using the consensus of all four methods, across both the entire dataset, and cell-line-, disease- or differentiation-state-specific subsets, we identified strong reference candidates suited to comparative scenarios within these models. We show that RPS18, UBC and YWHAZ consistently ranked highly within the full dataset, while HPRT1, HSP90 and IPO8 similarly scored poorly. These genes have pedigree across the broader literature: RPS18 was also found to be stable in over 20 developmental stages in skeletal muscle of Landrace pigs [29]. UBC has been identified as a reliable reference gene across various tissue types [30,31]. However, this gene has also been reported to be overexpressed in muscle tissues under certain conditions, such as glucocorticoid treatment [32]. UBC was highly stable under the conditions studied here, but exposure to glucocorticosteroids was not assessed: given these drugs represent the gold standard therapy for DMD, further investigation is warranted. YWHAZ was also found by others to be a suitable reference gene in different murine muscle tissue types as well as in a C2C12 myoblast cell line model [33].

A notable finding of our study is that expression was most variable when samples were assessed by differentiation status, rather than disease or cell-line. Variability at day 11 was comparatively low regardless of algorithm, indicating that mature myotubes are largely transcriptionally stable, however rankings (and overall stability values) were markedly less consistent at day 6 (with Bestkeeper even reporting negative Pearson correlations for several genes at this stage). Our data thus suggests that progression through myogenesis is the primary driver of variability, contributing substantially more to expression variation than patient of origin or dystrophic phenotype. This is perhaps understandable: myogenic differentiation is a highly complex process, orchestrated by distinct, hierarchical gene regulatory networks acting at specific temporal and spatial stages of development [4,34], including myogenic regulatory factors (MRFs), upstream regulators such as the Pax family and Six homeodomain transcription factors, as well as microRNAs [3436]. While this complex scenario has well-defined start- and end-points (myoblasts and myofibres, respectively), it is not surprising that greater variability is observed between samples at intermediate myogenic stages, both because extensive transcriptional changes occur throughout the myogenic program, and because individual cell lines might not mature at equivalent rates, despite equivalent differentiation times. This was demonstrated clearly by ACTB, which was near-uniformly poorly stable at day 6, while typically ranking highly in proliferating myoblasts or differentiated myotubes. Previous studies have shown that ACTB and GAPDH exhibit the highest variability across prenatal and postnatal stages of porcine skeletal muscle development [29], and our results here extend this to human-derived samples, suggesting that these genes are not suitable references for RT-qPCR experiments assessing expression in differentiating skeletal muscle.

Expression of HPRT1 (another commonly used reference gene) was also revealed to be variable following normalization to our 3-gene NF, both between cell lines and throughout myogenesis. This was especially clear for DMD1, where the expression of this gene differed markedly from the other lines (indicating patient-related variation). This finding contrasts with other studies in which this gene was considered a suitable reference for RT-PCR experiments in murine models [33] and rat tissues [37], and again supports the notion that no gene is universally appropriate across all genotypes and tissues.

In conclusion, we show that RPS18, UBC and YWHAZ are appropriate references for normalizing gene expression across the immortalised healthy and dystrophic humans cell lines studied here (both proliferating and during differentiation). We note, however, that this does not imply they are universally applicable to human myogenic cultures, and we stress that studies using other cell lines, dystrophic genotypes or differentiation contexts would require additional validation. Conducting comprehensive housekeeping gene studies, as here and by others [6,26,38] prior to RT-qPCR analysis, is essential to identify the most suitable references for the specific scenarios under consideration (which can vary with sample type and pharmacological treatment).

References

  1. 1. Mercuri E, Bönnemann CG, Muntoni F. Muscular dystrophies. Lancet Lond Engl. 2019;394(10213):2025–38.
  2. 2. Aartsma-Rus A, Van Deutekom JCT, Fokkema IF, Van Ommen G-JB, Den Dunnen JT. Entries in the Leiden Duchenne muscular dystrophy mutation database: an overview of mutation types and paradoxical cases that confirm the reading-frame rule. Muscle Nerve. 2006;34(2):135–44. pmid:16770791
  3. 3. Kang J-S, Krauss RS. Muscle stem cells in developmental and regenerative myogenesis. Curr Opin Clin Nutr Metab Care. 2010;13(3):243–8. pmid:20098319
  4. 4. Hernández-Hernández JM, García-González EG, Brun CE, Rudnicki MA. The myogenic regulatory factors, determinants of muscle development, cell identity and regeneration. Semin Cell Dev Biol. 2017;72:10–8. pmid:29127045
  5. 5. Rho H-W, Lee B-C, Choi E-S, Choi I-J, Lee Y-S, Goh S-H. Identification of valid reference genes for gene expression studies of human stomach cancer by reverse transcription-qPCR. BMC Cancer. 2010;10:240. pmid:20507635
  6. 6. Boccanegra B, Lenti R, Mantuano P, Conte E, Tulimiero L, Piercy RJ, et al. Determination of qPCR reference genes suitable for normalizing gene expression in a novel model of Duchenne muscular dystrophy, the D2-mdx mouse. PLoS One. 2024;19(11):e0310714. pmid:39535998
  7. 7. Glare EM, Divjak M, Bailey MJ, Walters EH. β-Actin and GAPDH housekeeping gene expression in asthmatic airways is variable and not suitable for normalising mRNA levels. 2023
  8. 8. Zhao L-M, Zheng Z-X, Zhao X, Shi J, Bi J-J, Pei W, et al. Optimization of reference genes for normalization of the quantitative polymerase chain reaction in tissue samples of gastric cancer. Asian Pac J Cancer Prev. 2014;15(14):5815–8. pmid:25081706
  9. 9. Glaser PE, Gross RW. Rapid plasmenylethanolamine-selective fusion of membrane bilayers catalyzed by an isoform of glyceraldehyde-3-phosphate dehydrogenase: discrimination between glycolytic and fusogenic roles of individual isoforms. Biochemistry. 1995;34(38):12193–203. pmid:7547960
  10. 10. Tisdale EJ. Glyceraldehyde-3-phosphate dehydrogenase is required for vesicular transport in the early secretory pathway. J Biol Chem. 2001;276(4):2480–6. pmid:11035021
  11. 11. Sawa A, Khan AA, Hester LD, Snyder SH. Glyceraldehyde-3-phosphate dehydrogenase: nuclear translocation participates in neuronal and nonneuronal cell death. Proc Natl Acad Sci U S A. 1997;94(21):11669–74. pmid:9326668
  12. 12. Sunaga K, Takahashi H, Chuang DM, Ishitani R. Glyceraldehyde-3-phosphate dehydrogenase is over-expressed during apoptotic death of neuronal cultures and is recognized by a monoclonal antibody against amyloid plaques from Alzheimer’s brain. Neurosci Lett. 1995;200(2):133–6. pmid:8614562
  13. 13. Hara MR, Agrawal N, Kim SF, Cascio MB, Fujimuro M, Ozeki Y, et al. S-nitrosylated GAPDH initiates apoptotic cell death by nuclear translocation following Siah1 binding. Nat Cell Biol. 2005;7(7):665–74. pmid:15951807
  14. 14. Moe TK, Ziliang J, Barathi A, Beuerman RW. Differential expression of glyceraldehyde-3-phosphate dehydrogenase (GAPDH), beta actin and hypoxanthine phosphoribosyltransferase (HPRT) in postnatal rabbit sclera. Curr Eye Res. 2001;23(1):44–50. pmid:11821985
  15. 15. Bustin SA, Benes V, Garson JA, Hellemans J, Huggett J, Kubista M, et al. The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin Chem. 2009;55(4):611–22. pmid:19246619
  16. 16. Vandesompele J, Preter KD, Roy NV, Paepe AD. Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol. 2002;3(7):RESEARCH0034.
  17. 17. Pfaffl MW, Tichopad A, Prgomet C, Neuvians TP. Determination of stable housekeeping genes, differentially regulated target genes and sample integrity: BestKeeper--Excel-based tool using pair-wise correlations. Biotechnol Lett. 2004;26(6):509–15. pmid:15127793
  18. 18. Silver N, Best S, Jiang J, Thein SL. Selection of housekeeping genes for gene expression studies in human reticulocytes using real-time PCR. BMC Mol Biol. 2006;7:33. pmid:17026756
  19. 19. Andersen CL, Jensen JL, Ørntoft TF. Normalization of real-time quantitative reverse transcription-PCR data: a model-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets. Cancer Res. 2004;64(15):5245–50. pmid:15289330
  20. 20. Thorley M, Duguez S, Mazza EMC, Valsoni S, Bigot A, Mamchaoui K, et al. Skeletal muscle characteristics are preserved in hTERT/cdk4 human myogenic cell lines. Skelet Muscle. 2016;6(1):43. pmid:27931240
  21. 21. Mamchaoui K, Trollet C, Bigot A, Negroni E, Chaouch S, Wolff A, et al. Immortalized pathological human myoblasts: towards a universal tool for the study of neuromuscular disorders. Skelet Muscle. 2011;1:34. pmid:22040608
  22. 22. Boccanegra B, Mantuano P, Conte E, Cerchiara AG, Tulimiero L, Quarta R, et al. LKB1 signaling is altered in skeletal muscle of a Duchenne muscular dystrophy mouse model. Dis Model Mech. 2023;16(7):dmm049930. pmid:37427454
  23. 23. Boccanegra B, Cappellari O, Mantuano P, Trisciuzzi D, Mele A, Tulimiero L, et al. Growth hormone secretagogues modulate inflammation and fibrosis in mdx mouse model of Duchenne muscular dystrophy. Front Immunol. 2023;14:1119888. pmid:37122711
  24. 24. Hildyard JCW, Wells DJ. Identification and validation of quantitative PCR reference genes suitable for normalizing expression in normal and dystrophic cell culture models of myogenesis. PLoS Curr. 2014;6:ecurrents.md.faafdde4bea8df4aa7d06cd5553119a6. pmid:24634799
  25. 25. Hildyard JCW, Finch AM, Wells DJ. Identification of qPCR reference genes suitable for normalizing gene expression in the mdx mouse model of Duchenne muscular dystrophy. PLoS One. 2019;14(1):e0211384. pmid:30699165
  26. 26. Hildyard JCW, Wells DJ, Piercy RJ. Identification of qPCR reference genes suitable for normalising gene expression in the developing mouse embryo. Wellcome Open Res. 2022;6:197. pmid:35509373
  27. 27. Cossu G, Biressi S. Satellite cells, myoblasts and other occasional myogenic progenitors: possible origin, phenotypic features and role in muscle regeneration. Semin Cell Dev Biol. 2005;16(4–5):623–31. pmid:16118057
  28. 28. Bentzinger CF, Wang YX, Rudnicki MA. Building muscle: molecular regulation of myogenesis. Cold Spring Harb Perspect Biol. 2012;4(2):a008342. pmid:22300977
  29. 29. Niu G, Yang Y, Zhang Y, Hua C, Wang Z, Tang Z, et al. Identifying suitable reference genes for gene expression analysis in developing skeletal muscle in pigs. PeerJ. 2016;4:e2428. pmid:27994956
  30. 30. Chua SL, See Too WC, Khoo BY, Few LL. UBC and YWHAZ as suitable reference genes for accurate normalisation of gene expression using MCF7, HCT116 and HepG2 cell lines. Cytotechnology. 2011;63(6):645–54. pmid:21850463
  31. 31. Mosley Y-YC, HogenEsch H. Selection of a suitable reference gene for quantitative gene expression in mouse lymph nodes after vaccination. BMC Res Notes. 2017;10(1):689. pmid:29208024
  32. 32. Marinovic AC, Zheng B, Mitch WE, Price SR. Ubiquitin (UbC) expression in muscle cells is increased by glucocorticoids through a mechanism involving Sp1 and MEK1. J Biol Chem. 2002;277(19):16673–81. pmid:11872750
  33. 33. Ma J, Chen J, Gan M, Chen L, Zhao Y, Niu L, et al. Comparison of reference gene expression stability in mouse skeletal muscle via five algorithms. PeerJ. 2022;10:e14221. pmid:36275473
  34. 34. Buckingham M, Rigby PWJ. Gene regulatory networks and transcriptional mechanisms that control myogenesis. Dev Cell. 2014;28(3):225–38. pmid:24525185
  35. 35. Gagan J, Dey BK, Dutta A. MicroRNAs regulate and provide robustness to the myogenic transcriptional network. Curr Opin Pharmacol. 2012;12(3):383–8.
  36. 36. Liu N, Williams AH, Maxeiner JM, Bezprozvannaya S, Shelton JM, Richardson JA, et al. microRNA-206 promotes skeletal muscle regeneration and delays progression of Duchenne muscular dystrophy in mice. J Clin Invest. 2012;122(6):2054–65. pmid:22546853
  37. 37. Wang X, Zhao H, Ni J, Pan J, Hua H, Wang Y. Identification of suitable reference genes for gene expression studies in rat skeletal muscle following sciatic nerve crush injury. Mol Med Rep. 2019;19(5):4377–87. pmid:30942461
  38. 38. Hildyard JCW, Taylor-Brown F, Massey C, Wells DJ, Piercy RJ. Determination of qPCR Reference Genes Suitable for Normalizing Gene Expression in a Canine Model of Duchenne Muscular Dystrophy. J Neuromuscul Dis. 2018;5(2):177–91. pmid:29614692