TY - UNPB
T1 - Wiring together large single-cell RNA-seq sample collections
AU - Barkas, Nikolas
AU - Petukhov, Viktor
AU - Nikolaeva, Daria
AU - Lozinsky, Yaroslav
AU - Demharter, Samuel
AU - Khodosevich, Konstantin
AU - Kharchenko, Peter V
PY - 2018
Y1 - 2018
N2 - Single-cell RNA-seq methods are being increasingly applied in complex study designs, which involve measurements of many samples, commonly spanning multiple individuals, conditions, or tissue compartments. Joint analysis of such extensive, and often heterogeneous, sample collections requires a way of identifying and tracking recurrent cell subpopulations across the entire collection. Here we describe a flexible approach, called Conos (Clustering On Network Of Samples), that relies on multiple plausible inter-sample mappings to construct a global graph connecting all measured cells. The graph can then be used to propagate information between samples and to identify cell communities that show consistent grouping across broad subsets of the collected samples. Conos results enable investigators to balance between resolution and breadth of the detected subpopulations. In this way, it is possible to focus on the fine-grained clusters appearing within more similar subsets of samples, or analyze coarser clusters spanning broader sets of samples in the collection. Such multi-resolution joint clustering provides an important basis for downstream analysis and interpretation of sizable multi-sample single-cell studies and atlas-scale collections.
AB - Single-cell RNA-seq methods are being increasingly applied in complex study designs, which involve measurements of many samples, commonly spanning multiple individuals, conditions, or tissue compartments. Joint analysis of such extensive, and often heterogeneous, sample collections requires a way of identifying and tracking recurrent cell subpopulations across the entire collection. Here we describe a flexible approach, called Conos (Clustering On Network Of Samples), that relies on multiple plausible inter-sample mappings to construct a global graph connecting all measured cells. The graph can then be used to propagate information between samples and to identify cell communities that show consistent grouping across broad subsets of the collected samples. Conos results enable investigators to balance between resolution and breadth of the detected subpopulations. In this way, it is possible to focus on the fine-grained clusters appearing within more similar subsets of samples, or analyze coarser clusters spanning broader sets of samples in the collection. Such multi-resolution joint clustering provides an important basis for downstream analysis and interpretation of sizable multi-sample single-cell studies and atlas-scale collections.
U2 - 10.1101/460246
DO - 10.1101/460246
M3 - Working paper
BT - Wiring together large single-cell RNA-seq sample collections
PB - Cold Spring Harbor Laboratory
CY - Biorxiv
ER -