Genomics

Dataset Information

0

Multiplexing of ChIP-seq samples for a model experimental condition has minimal impact on peak detection


ABSTRACT: ChIP-seq experiments are standard experimental procedure for interrogating epigenetic states and protein-DNA interactions. Sequencing experiments are often designed according to the trade-off between the need to obtain maximum sequencing coverage limited funds. Multiplexing samples is a common approach to minimize cost and maximize information yield. We therefore performed an extensive ChiP-seq multiplexing study to gain a better understanding of the effect of multiplexing on the resulting peak detection and genomic annotation and to provide solid guidelines for multiplexing ChIP-seq studies. For a well characterized antibody, our results indicate that multiplexing to ~20M reads (roughly 8 samples per sequencing lane) is sufficient to capture most of the biological signal. Multiplexing samples in sequencing experiments is a common approach to maximize information yield while minimizing cost. In most cases the number of samples that are multiplexed is determined by financial consideration or experimental convenience with limited understanding on the effects on the experimental results. Here we set to examine the impact of multiplexing ChIP-seq experiments on the ability to identify a specific epigenetic modification. We performed an analysis of peak detection to determine the effects of multiplexing. These include false discovery rates, size, position and statistical significance of peak detection and changes in gene annotation. We found that, for histone marker H3K4me3, one can multiplex up to 8 samples (7 IP + 1 input) at ~21 million reads each and still detect over 90% of all peaks found when using a full lane for sample. Furthermore, there are no variations introduced by indexing or lane batch effects and importantly there is no significant reduction in the number of genes with neighboring H3K4me3 peaks. We conclude that, for a well characterized antibody and therefore, model IP condition, multiplexing 8 samples per lane is sufficient to capture most of the biological signal.

ORGANISM(S): Homo sapiens

PROVIDER: GSE64504 | GEO | 2015/06/01

SECONDARY ACCESSION(S): PRJNA271137

REPOSITORIES: GEO

Similar Datasets

| PRJNA271137 | ENA
2014-12-29 | GSE56861 | GEO
2018-12-17 | E-MTAB-7524 | biostudies-arrayexpress
2024-03-24 | GSE246624 | GEO
2014-05-20 | GSE38629 | GEO
2021-01-30 | GSE165780 | GEO
2017-03-01 | GSE85219 | GEO
2020-07-29 | PXD018952 | Pride
2020-07-29 | PXD019764 | Pride
2015-01-07 | GSE57563 | GEO