Skip to main content
Advertisement

< Back to Article

Exploring the single-cell RNA-seq analysis landscape with the scRNA-tools database

Fig 2

Phases of a typical unsupervised scRNA-seq analysis process.

In Phase 1 (data acquisition) raw sequencing reads are converted into a gene by cell expression matrix. For many protocols this requires the alignment of genes to a reference genome and the assignment and de-duplication of Unique Molecular Identifiers (UMIs). The data is then cleaned (Phase 2) to remove low-quality cells and uninformative genes, resulting in a high-quality dataset for further analysis. The data can also be normalised and missing values imputed during this phase. Phase 3 assigns cells, either in a discrete manner to known (classification) or unknown (clustering) groups or to a position on a continuous trajectory. Interesting genes (eg. differentially expressed, markers, specific patterns of expression) are then identified to explain these groups or trajectories (Phase 4).

Fig 2

doi: https://doi.org/10.1371/journal.pcbi.1006245.g002