Unknown

Dataset Information

0

Artificial-cell-type aware cell-type classification in CITE-seq.


ABSTRACT:

Motivation

Cellular Indexing of Transcriptomes and Epitopes by sequencing (CITE-seq), couples the measurement of surface marker proteins with simultaneous sequencing of mRNA at single cell level, which brings accurate cell surface phenotyping to single-cell transcriptomics. Unfortunately, multiplets in CITE-seq datasets create artificial cell types (ACT) and complicate the automation of cell surface phenotyping.

Results

We propose CITE-sort, an artificial-cell-type aware surface marker clustering method for CITE-seq. CITE-sort is aware of and is robust to multiplet-induced ACT. We benchmarked CITE-sort with real and simulated CITE-seq datasets and compared CITE-sort against canonical clustering methods. We show that CITE-sort produces the best clustering performance across the board. CITE-sort not only accurately identifies real biological cell types (BCT) but also consistently and reliably separates multiplet-induced artificial-cell-type droplet clusters from real BCT droplet clusters. In addition, CITE-sort organizes its clustering process with a binary tree, which facilitates easy interpretation and verification of its clustering result and simplifies cell-type annotation with domain knowledge in CITE-seq.

Availability and implementation

http://github.com/QiuyuLian/CITE-sort.

Supplementary information

Supplementary data is available at Bioinformatics online.

SUBMITTER: Lian Q 

PROVIDER: S-EPMC7355304 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Artificial-cell-type aware cell-type classification in CITE-seq.

Lian Qiuyu Q   Xin Hongyi H   Ma Jianzhu J   Konnikova Liza L   Chen Wei W   Gu Jin J   Chen Kong K  

Bioinformatics (Oxford, England) 20200701 Suppl_1


<h4>Motivation</h4>Cellular Indexing of Transcriptomes and Epitopes by sequencing (CITE-seq), couples the measurement of surface marker proteins with simultaneous sequencing of mRNA at single cell level, which brings accurate cell surface phenotyping to single-cell transcriptomics. Unfortunately, multiplets in CITE-seq datasets create artificial cell types (ACT) and complicate the automation of cell surface phenotyping.<h4>Results</h4>We propose CITE-sort, an artificial-cell-type aware surface m  ...[more]

Similar Datasets

| S-EPMC8697413 | biostudies-literature
| S-EPMC10350048 | biostudies-literature
| S-EPMC11840950 | biostudies-literature
| S-EPMC9219143 | biostudies-literature
| S-EPMC11561699 | biostudies-literature
| S-EPMC11261982 | biostudies-literature
| S-EPMC6929456 | biostudies-literature
| S-EPMC8335627 | biostudies-literature
| S-EPMC10048047 | biostudies-literature
| S-EPMC8163004 | biostudies-literature