Skip to main content
Advertisement

< Back to Article

Fig 1.

Ability to identify differentially dispersed genes.

The performances of Levene’s test, MDSeq, DiPhiSeq, GAMLSS and DiffDist for differential dispersion detection in gene expression data, as measured by the area under the ROC curve (AUC), were assessed using 10 replicates of simulated datasets composed of highly and lowly differentially expressed genes between two sample populations of equal size.

More »

Fig 1 Expand

Fig 2.

Ability to detect differential dispersion for lowly differentially expressed genes.

False discovery rate (FDR) and true positive rate (TPR) of Levene’s test, MDSeq, DiPhiSeq, GAMLSS and DiffDist for differential dispersion detection in simulated datasets composed of lowly differentially expressed genes between two sample populations of equal size. The performances were assessed using 10 replicates of simulated datasets per parameter setting.

More »

Fig 2 Expand

Fig 3.

Differentially dispersed genes correctly identified by the evaluated methods among lowly differentially expressed genes.

(A) Intersections of sets of differentially dispersed (DD) genes correctly identified by Levene’s test, MDSeq, DiPhiSeq, GAMLSS and DiffDist. (B): Correctness of dispersion log2-fold change sign of DD genes correctly identified by the different methods. (C) Real mean and dispersion log2-fold changes and estimated dispersion log2-fold changes of DD genes correctly identified by GAMLSS and DiffDist. (D) Correctness of dispersion log2-fold change signs according to Levene’s test, MDSeq and DiPhiSeq for DD genes correctly identified by GAMLSS and DiffDist with incorrect dispersion log2-fold change sign. Simulated datasets are composed of lowly differentially expressed genes with a mean fold change of expression between 1 and 1.5 between two populations of 50 samples. Values indicated at the middle of the bars are percentages of the corresponding categories of genes over the entire sets of analyzed genes. All results relate to 10 replicates of simulated datasets, e.g. the counts and percentages are averaged over all the replicates.

More »

Fig 3 Expand

Table 1.

Numbers of adjacent normal and tumor samples for the analyzed TCGA datasets.

More »

Table 1 Expand

Fig 4.

Differentially dispersed genes among non-differentially expressed genes for each TCGA dataset.

(A) Number of differentially expressed (DE) genes separated between those upregulated in tumors (DE+) and those downregulated in tumors (DE-) detected by MDSeq per TCGA dataset. (B) Number of differentially dispersed (DD) genes among non-DE genes separated between those overdispersed in tumors (DD+) and those underdispersed in tumors (DD-), as detected by Levene’s test, MDSeq, DiPhiSeq, GAMLSS and DiffDist, per TCGA dataset.

More »

Fig 4 Expand

Fig 5.

Overdispersed genes in tumors identified by the evaluated methods among non-differentially expressed genes.

Intersections of sets of overdispersed genes in tumors identified by Levene’s test, MDSeq, DiPhiSeq, GAMLSS and DiffDist among non-differentially expressed genes for (A) the kidney renal clear cell carcinoma dataset (TCGA-KIRC) and (B) the kidney renal papillary cell carcinoma dataset (TCGA-KIRP). Non-differentially expressed genes were identified by MDSeq.

More »

Fig 5 Expand

Fig 6.

Enriched GO terms among overdispersed genes in tumors identified by the evaluated methods.

Top 40 representative enriched Gene Ontology (GO) terms among overdispersed genes in tumors (DD+) among non-differentially expressed (non-DE) genes, ordered first by the number of datasets for which they are enriched (decreasing order) and second by the mean p-values of enrichment across all datasets (increasing order). Non-DE genes were identified using MDSeq, and DD+ genes were identified among non-differentially expressed genes by at least one of the evaluated methods, i.e. Levene’s test, MDSeq, DiPhiSeq, GAMLSS and DiffDist.

More »

Fig 6 Expand