Genomics

Dataset Information

0

ChIP-AP - An Integrated ChIP-Seq Analysis Pipeline


ABSTRACT: ChIP-Seq is a technique used to analyse protein-DNA interactions. The protein-DNA complex is pulled down using a protein antibody, after which sequencing and analysis of the bound DNA fragments is performed. A key bioinformatics analysis step is “peak” calling - identifying regions of enrichment. Benchmarking studies have consistently shown that no optimal peak caller exists. Peak callers have distinct selectivity and specificity characteristics which are often not additive and seldom completely overlap in many scenarios. In the absence of a universal peak caller, we rationalized one ought to utilize multiple peak-callers to 1) gauge peak confidence as determined through detection by multiple algorithms, and 2) more thoroughly survey the protein-bound landscape by capturing peaks not detected by individual peak callers owing to algorithmic limitations and biases. We therefore developed an integrated ChIP-Seq Analysis Pipeline (ChIP AP) which performs all analysis steps from raw fastq files to final result, and utilizes four commonly used peak callers to more thoroughly and comprehensively analyse datasets. Results are integrated and presented in a single file enabling users to apply selectivity and sensitivity thresholds to select the consensus peak set, the union peak set, or any sub-set in-between to more confidently and comprehensively explore the protein bound landscape. (https://github.com/JSuryatenggara/ChIP-AP).

ORGANISM(S): Homo sapiens

PROVIDER: GSE172355 | GEO | 2022/01/03

REPOSITORIES: GEO

Similar Datasets

2015-01-07 | E-GEOD-57563 | biostudies-arrayexpress
2015-01-07 | GSE57563 | GEO
2013-07-17 | E-GEOD-48930 | biostudies-arrayexpress
2014-04-25 | E-GEOD-54332 | biostudies-arrayexpress
2023-03-29 | GSE212920 | GEO
2016-02-29 | E-GEOD-73372 | biostudies-arrayexpress
2022-03-31 | GSE199611 | GEO
2014-04-25 | GSE54333 | GEO
2014-04-25 | GSE54332 | GEO
2018-02-15 | GSE80791 | GEO