Genomics

Dataset Information

0

Data Exploration, Quality Control and Statistical Analysis of ChIP-exo/nexus Experiments


ABSTRACT: ChIP-exo/nexus experiments present modifications on the commonly used ChIP-seq protocol for high resolution mapping of transcription factor binding sites. Although many aspects of the ChIP-exo data analysis are similar to those of ChIP-seq, these high throughput experiments pose a number of unique quality control and analysis challenges. We develop a statistical quality control pipeline and accompanying R package, ChIPexoQual, to enable exploration and analysis of ChIP-exo and related experiments. ChIPexoQual evaluates a number of key issues including strand imbalance, library complexity, and signal enrichment of data. Assessment of these features are facilitated through diagnostic plots and summary statistics calculated over regions of the genome with varying levels of coverage. We evaluated our QC pipeline with both large collections of public ChIP-exo/nexus data and multiple, new ChIP-exo datasets from E. coli. ChIPexoQual analysis of these datasets resulted in guidelines for using these QC metrics across a wide range of sequencing depths and provided further insights for modelling ChIP-exo data. Finally, although ChIP-exo experiments have been compared to ChIP-seq experiments with single-end (SE) sequencing, we provide, for the first time, comparisons with paired-end (PE) ChIP-seq experiments. We illustrate that, at fixed sequencing depths, ChIP-exo provides higher sensitivity, specificity, and spatial resolution than PE ChIP-seq and both significantly outperform their SE ChIP-seq counterpart.

ORGANISM(S): Escherichia coli

PROVIDER: GSE84830 | GEO | 2017/03/01

SECONDARY ACCESSION(S): PRJNA338159

REPOSITORIES: GEO

Similar Datasets

| PRJNA338159 | ENA
| EGAD00001004099 | EGA
2014-12-02 | GSE55306 | GEO
2019-07-30 | GSE135056 | GEO
2018-09-25 | GSE87657 | GEO
2015-05-20 | PXD001449 | Pride
2016-01-07 | E-GEOD-76618 | biostudies-arrayexpress
2017-06-09 | GSE97471 | GEO
2014-09-09 | E-GEOD-61204 | biostudies-arrayexpress
| PRJNA239257 | ENA