Genomics

Dataset Information

0

A Random-Forest Based Algorithm for Prediction of Enhancers From Histone Modifications


ABSTRACT: Transcriptional enhancers play critical roles in regulation of gene expression, but their identification has remained a challenge. Recently, it was shown that enhancers in the mammalian genome are associated with characteristic histone modification patterns, which have been increasingly exploited for enhancer identification. However, only a limited number of histone modifications have previously been investigated for this purpose, leaving the questions answered whether there exist an optimal set of histone modifications that could improve the enhancer prediction. Here, we address this issue by exploring a rich dataset produced by the human Epigenome Roadmap Project. Specifically, we examined genome-wide profiles of 24 histone modifications in human embryonic stem cells and fibroblasts, and developed a Random-Forest based algorithm to integrate histone modification profiles for identification of enhancers.As a training set, we used histone modification profiles at genome-wide binding sites of p300 in the two cell types identified using ChIP-seq. We show that this algorithm not only leads to more accurate and precise prediction of enhancers than previous methods, but also helps identify an optimal set of three chromatin marks for enhancer prediction.

ORGANISM(S): Homo sapiens

PROVIDER: GSE37858 | GEO | 2012/05/10

SECONDARY ACCESSION(S): PRJNA165163

REPOSITORIES: GEO

Similar Datasets

2012-05-09 | E-GEOD-37858 | biostudies-arrayexpress
2019-09-30 | GSE120376 | GEO
| PRJNA165163 | ENA
2016-02-25 | E-GEOD-72886 | biostudies-arrayexpress
2012-11-19 | E-GEOD-32380 | biostudies-arrayexpress
2010-10-31 | E-GEOD-23907 | biostudies-arrayexpress
2016-02-25 | GSE72886 | GEO
2012-11-19 | GSE32380 | GEO
2023-06-17 | PXD043070 | Pride
2015-04-27 | E-GEOD-61349 | biostudies-arrayexpress