Genomics

Dataset Information

0

An effective approach for identification of in vivo protein–DNA binding sites from paired-end ChIP-Seq data


ABSTRACT: ChIP-Seq, which combines chromatin immunoprecipitation (ChIP) with high-throughput massively parallel sequencing, is increasingly being used for identification of protein–DNA interactions in-vivo in the genome. In general, current algorithms for ChIP-seq reads employ artificial estimation of the average length of DNA fragments for peak finding, leading to uncertain prediction of DNA-protein binding sites. Here, we present SIPeS (Site Identification from Paired-end Sequencing), a novel algorithm for precise identification of binding sites from short reads generated from paired-end Solexa ChIP-Seq technology. SIPeS uses a dynamic baseline directly via ‘piling up’ the corresponding fragments defined by the paired reads to efficiently find peaks corresponding to binding sites. The performance of SIPeS is demonstrated by analyzing the ChIP-Seq data of the Arabidopsis basic helix-loop-helix transcription factor ABORTED MICROSPORES (AMS). The robustness of SIPeS was demonstrated in higher sensitivity and spatial resolution in peak finding compared to three existing peak detection algorithms. Keywords: transcription factors (protein-DNA interactions)

ORGANISM(S): Arabidopsis thaliana

PROVIDER: GSE16940 | GEO | 2010/06/04

SECONDARY ACCESSION(S): PRJNA117657

REPOSITORIES: GEO

Similar Datasets

2010-06-04 | E-GEOD-16940 | biostudies-arrayexpress
2020-10-20 | GSE155666 | GEO
2022-01-03 | GSE172355 | GEO
2010-11-01 | E-GEOD-18481 | biostudies-arrayexpress
2015-09-08 | E-GEOD-66296 | biostudies-arrayexpress
2012-07-19 | E-GEOD-39495 | biostudies-arrayexpress
2010-11-01 | GSE18481 | GEO
2019-10-15 | E-MTAB-8375 | biostudies-arrayexpress
2022-11-22 | GSE211771 | GEO
2022-06-01 | E-MTAB-11510 | biostudies-arrayexpress