Unknown

Dataset Information

0

ScMAGIC: accurately annotating single cells using two rounds of reference-based classification.


ABSTRACT: Here, we introduce scMAGIC (Single Cell annotation using MArker Genes Identification and two rounds of reference-based Classification [RBC]), a novel method that uses well-annotated single-cell RNA sequencing (scRNA-seq) data as the reference to assist in the classification of query scRNA-seq data. A key innovation in scMAGIC is the introduction of a second-round RBC in which those query cells whose cell identities are confidently validated in the first round are used as a new reference to again classify query cells, therefore eliminating the batch effects between the reference and the query data. scMAGIC significantly outperforms 13 competing RBC methods with their optimal parameter settings across 86 benchmark tests, especially when the cell types in the query dataset are not completely covered by the reference dataset and when there exist significant batch effects between the reference and the query datasets. Moreover, when no reference dataset is available, scMAGIC can annotate query cells with reasonably high accuracy by using an atlas dataset as the reference.

SUBMITTER: Zhang Y 

PROVIDER: S-EPMC9071478 | biostudies-literature | 2022 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

scMAGIC: accurately annotating single cells using two rounds of reference-based classification.

Zhang Yu Y   Zhang Feng F   Wang Zekun Z   Wu Siyi S   Tian Weidong W  

Nucleic acids research 20220501 8


Here, we introduce scMAGIC (Single Cell annotation using MArker Genes Identification and two rounds of reference-based Classification [RBC]), a novel method that uses well-annotated single-cell RNA sequencing (scRNA-seq) data as the reference to assist in the classification of query scRNA-seq data. A key innovation in scMAGIC is the introduction of a second-round RBC in which those query cells whose cell identities are confidently validated in the first round are used as a new reference to again  ...[more]

Similar Datasets

| S-EPMC7306901 | biostudies-literature
| S-EPMC2577219 | biostudies-literature
2022-09-25 | GSE189778 | GEO
| S-EPMC6249247 | biostudies-literature
| S-EPMC5235923 | biostudies-literature
| S-EPMC9801963 | biostudies-literature
2018-01-14 | GSE90496 | GEO
| S-EPMC9187757 | biostudies-literature
| S-EPMC3753567 | biostudies-literature
| S-EPMC9474308 | biostudies-literature