Unknown

Dataset Information

0

DROMPA: easy-to-handle peak calling and visualization software for the computational analysis and validation of ChIP-seq data.


ABSTRACT: Chromatin immunoprecipitation with high-throughput sequencing (ChIP-seq) can identify genomic regions that bind proteins involved in various chromosomal functions. Although the development of next-generation sequencers offers the technology needed to identify these protein-binding sites, the analysis can be computationally challenging because sequencing data sometimes consist of >100 million reads/sample. Herein, we describe a cost-effective and time-efficient protocol that is generally applicable to ChIP-seq analysis; this protocol uses a novel peak-calling program termed DROMPA to identify peaks and an additional program, parse2wig, to preprocess read-map files. This two-step procedure drastically reduces computational time and memory requirements compared with other programs. DROMPA enables the identification of protein localization sites in repetitive sequences and efficiently identifies both broad and sharp protein localization peaks. Specifically, DROMPA outputs a protein-binding profile map in pdf or png format, which can be easily manipulated by users who have a limited background in bioinformatics.

SUBMITTER: Nakato R 

PROVIDER: S-EPMC3738949 | biostudies-literature | 2013 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

DROMPA: easy-to-handle peak calling and visualization software for the computational analysis and validation of ChIP-seq data.

Nakato Ryuichiro R   Itoh Tahehiko T   Shirahige Katsuhiko K  

Genes to cells : devoted to molecular & cellular mechanisms 20130515 7


Chromatin immunoprecipitation with high-throughput sequencing (ChIP-seq) can identify genomic regions that bind proteins involved in various chromosomal functions. Although the development of next-generation sequencers offers the technology needed to identify these protein-binding sites, the analysis can be computationally challenging because sequencing data sometimes consist of >100 million reads/sample. Herein, we describe a cost-effective and time-efficient protocol that is generally applicab  ...[more]

Similar Datasets

| S-EPMC4061025 | biostudies-literature
| S-EPMC5429005 | biostudies-literature
| S-EPMC7885521 | biostudies-literature
| S-EPMC3672025 | biostudies-literature
| S-EPMC5175345 | biostudies-literature
| S-EPMC6396939 | biostudies-literature
| S-EPMC7808876 | biostudies-literature
| S-EPMC4053734 | biostudies-literature
| S-EPMC4364623 | biostudies-literature