Single Cell Clustering

What is single-cell clustering?

Single-cell clustering is a computational technique that groups individual cells into clusters based on their gene expression profiles, enabling the identification of cell types and states from scRNA-seq data.

Key Steps

Data Preprocessing: Remove noise, correct artefacts, and normalise expression values.
Dimensionality Reduction: Use PCA, t-SNE, or UMAP to reduce dimensionality.
Clustering: Apply K-means, hierarchical, density-based (DBSCAN), or graph-based methods (Leiden) for grouping cells.
Visualisation: Use UMAP plots to explore relationships between clusters.

Applications

Cell type identification — crucial for understanding tissue composition.
Disease subtyping — reveal disease-specific cell subpopulations for conditions such as cancer.
Developmental biology — track cell differentiation and developmental processes.
Immunology — identify immune cell populations, discover rare types.
Drug discovery — identify target cells and uncover resistance mechanisms.
Rare cell identification — detect and characterise rare populations.

scRNA-seq vs. Bulk RNA-seq

scRNA-seq vs Bulk RNA-seq

Image source: 10x Genomics

Property	scRNA-seq	Bulk RNA-seq
Resolution	Individual cells.	Average across all cells.
Heterogeneity	Detects cellular diversity.	Masks heterogeneity.
Applications	Complex tissues, rare cells.	Homogeneous samples.
Data complexity	High-dimensional, requires advanced methods.	Lower-dimensional, simpler to analyse.
Cost	Higher (single-cell isolation).	Lower (pooled RNA).
Sensitivity	Detects rare transcripts.	May miss rare transcripts.

In summary, scRNA-seq provides high-resolution insights into cellular heterogeneity, making it the foundation for methods like GARAGE that aim to model and augment this diversity.