Unknown

Dataset Information

0

Learning single-cell chromatin accessibility profiles using meta-analytic marker genes.


ABSTRACT:

Motivation

Single-cell assay for transposase accessible chromatin using sequencing (scATAC-seq) is a valuable resource to learn cis-regulatory elements such as cell-type specific enhancers and transcription factor binding sites. However, cell-type identification of scATAC-seq data is known to be challenging due to the heterogeneity derived from different protocols and the high dropout rate.

Results

In this study, we perform a systematic comparison of seven scATAC-seq datasets of mouse brain to benchmark the efficacy of neuronal cell-type annotation from gene sets. We find that redundant marker genes give a dramatic improvement for a sparse scATAC-seq annotation across the data collected from different studies. Interestingly, simple aggregation of such marker genes achieves performance comparable or higher than that of machine-learning classifiers, suggesting its potential for downstream applications. Based on our results, we reannotated all scATAC-seq data for detailed cell types using robust marker genes. Their meta scATAC-seq profiles are publicly available at https://gillisweb.cshl.edu/Meta_scATAC. Furthermore, we trained a deep neural network to predict chromatin accessibility from only DNA sequence and identified key motifs enriched for each neuronal subtype. Those predicted profiles are visualized together in our database as a valuable resource to explore cell-type specific epigenetic regulation in a sequence-dependent and -independent manner.

SUBMITTER: Kawaguchi RK 

PROVIDER: S-EPMC9851328 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Learning single-cell chromatin accessibility profiles using meta-analytic marker genes.

Kawaguchi Risa Karakida RK   Tang Ziqi Z   Fischer Stephan S   Rajesh Chandana C   Tripathy Rohit R   Koo Peter K PK   Gillis Jesse J  

Briefings in bioinformatics 20230101 1


<h4>Motivation</h4>Single-cell assay for transposase accessible chromatin using sequencing (scATAC-seq) is a valuable resource to learn cis-regulatory elements such as cell-type specific enhancers and transcription factor binding sites. However, cell-type identification of scATAC-seq data is known to be challenging due to the heterogeneity derived from different protocols and the high dropout rate.<h4>Results</h4>In this study, we perform a systematic comparison of seven scATAC-seq datasets of m  ...[more]

Similar Datasets

| S-EPMC9492688 | biostudies-literature
| S-EPMC7676699 | biostudies-literature
| S-EPMC9619378 | biostudies-literature
| S-EPMC10014730 | biostudies-literature
| S-EPMC10549956 | biostudies-literature
| S-EPMC10069466 | biostudies-literature
| S-EPMC10541611 | biostudies-literature
| S-EPMC7904839 | biostudies-literature
| S-EPMC8168584 | biostudies-literature
| S-EPMC8552484 | biostudies-literature