Unknown

Dataset Information

0

Quantification, Dynamic Visualization, and Validation of Bias in ATAC-Seq Data with ataqv.


ABSTRACT: The assay for transposase-accessible chromatin using sequencing (ATAC-seq) has become the preferred method for mapping chromatin accessibility due to its time and input material efficiency. However, it can be difficult to evaluate data quality and identify sources of technical bias across samples. Here, we present ataqv, a computational toolkit for efficiently measuring, visualizing, and comparing quality control (QC) results across samples and experiments. We use ataqv to analyze 2,009 public ATAC-seq datasets; their QC metrics display a 10-fold range. Tn5 dosage experiments and statistical modeling show that technical variation in the ratio of Tn5 transposase to nuclei and sequencing flowcell density induces systematic bias in ATAC-seq data by changing the enrichment of reads across functional genomic annotations including promoters, enhancers, and transcription-factor-bound regions, with the notable exception of CTCF. ataqv can be integrated into existing computational pipelines and is freely available at https://github.com/ParkerLab/ataqv/.

SUBMITTER: Orchard P 

PROVIDER: S-EPMC8245295 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Quantification, Dynamic Visualization, and Validation of Bias in ATAC-Seq Data with ataqv.

Orchard Peter P   Kyono Yasuhiro Y   Hensley John J   Kitzman Jacob O JO   Parker Stephen C J SCJ  

Cell systems 20200301 3


The assay for transposase-accessible chromatin using sequencing (ATAC-seq) has become the preferred method for mapping chromatin accessibility due to its time and input material efficiency. However, it can be difficult to evaluate data quality and identify sources of technical bias across samples. Here, we present ataqv, a computational toolkit for efficiently measuring, visualizing, and comparing quality control (QC) results across samples and experiments. We use ataqv to analyze 2,009 public A  ...[more]

Similar Datasets

2020-03-23 | GSE130450 | GEO
| PRJNA540283 | ENA
| S-EPMC10236359 | biostudies-literature
| S-EPMC6056626 | biostudies-literature
| S-EPMC11039982 | biostudies-literature
| S-EPMC7666144 | biostudies-literature
| S-EPMC8756194 | biostudies-literature
| S-EPMC6099720 | biostudies-literature
| S-EPMC6385462 | biostudies-literature
| S-EPMC8215916 | biostudies-literature