Unknown

Dataset Information

0

Souporcell: robust clustering of single-cell RNA-seq data by genotype without reference genotypes.


ABSTRACT: Methods to deconvolve single-cell RNA-sequencing (scRNA-seq) data are necessary for samples containing a mixture of genotypes, whether they are natural or experimentally combined. Multiplexing across donors is a popular experimental design that can avoid batch effects, reduce costs and improve doublet detection. By using variants detected in scRNA-seq reads, it is possible to assign cells to their donor of origin and identify cross-genotype doublets that may have highly similar transcriptional profiles, precluding detection by transcriptional profile. More subtle cross-genotype variant contamination can be used to estimate the amount of ambient RNA. Ambient RNA is caused by cell lysis before droplet partitioning and is an important confounder of scRNA-seq analysis. Here we develop souporcell, a method to cluster cells using the genetic variants detected within the scRNA-seq reads. We show that it achieves high accuracy on genotype clustering, doublet detection and ambient RNA estimation, as demonstrated across a range of challenging scenarios.

SUBMITTER: Heaton H 

PROVIDER: S-EPMC7617080 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Souporcell: robust clustering of single-cell RNA-seq data by genotype without reference genotypes.

Heaton Haynes H   Talman Arthur M AM   Knights Andrew A   Imaz Maria M   Gaffney Daniel J DJ   Durbin Richard R   Hemberg Martin M   Lawniczak Mara K N MKN  

Nature methods 20200504 6


Methods to deconvolve single-cell RNA-sequencing (scRNA-seq) data are necessary for samples containing a mixture of genotypes, whether they are natural or experimentally combined. Multiplexing across donors is a popular experimental design that can avoid batch effects, reduce costs and improve doublet detection. By using variants detected in scRNA-seq reads, it is possible to assign cells to their donor of origin and identify cross-genotype doublets that may have highly similar transcriptional p  ...[more]

Similar Datasets

| S-EPMC6909514 | biostudies-literature
| S-EPMC12116994 | biostudies-literature
| S-EPMC5410170 | biostudies-literature
| S-EPMC6937257 | biostudies-literature
| S-EPMC6477982 | biostudies-literature
| S-EPMC5473255 | biostudies-literature
| S-EPMC8742847 | biostudies-literature
| S-EPMC10547911 | biostudies-literature