We need your help! If you've ever found our data helpful, please take our impact survey (15 min). Your replies will help keep the data flowing to the scientific community. Please Click here for Survey


Dataset Information


Towards a map of cis-regulatory sequences in the human genome.

ABSTRACT: Accumulating evidence indicates that transcription factor (TF) binding sites, or cis-regulatory elements (CREs), and their clusters termed cis-regulatory modules (CRMs) play a more important role than do gene-coding sequences in specifying complex traits in humans, including the susceptibility to common complex diseases. To fully characterize their roles in deriving the complex traits/diseases, it is necessary to annotate all CREs and CRMs encoded in the human genome. However, the current annotations of CREs and CRMs in the human genome are still very limited and mostly coarse-grained, as they often lack the detailed information of CREs in CRMs. Here, we integrated 620 TF ChIP-seq datasets produced by the ENCODE project for 168 TFs in 79 different cell/tissue types and predicted an unprecedentedly completely map of CREs in CRMs in the human genome at single nucleotide resolution. The map includes 305 912 CRMs containing a total of 1 178 913 CREs belonging to 736 unique TF binding motifs. The predicted CREs and CRMs tend to be subject to either purifying selection or positive selection, thus are likely to be functional. Based on the results, we also examined the status of available ChIP-seq datasets for predicting the entire regulatory genome of humans.


PROVIDER: S-EPMC6009671 | BioStudies | 2018-01-01


REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC4265420 | BioStudies
2012-01-01 | S-EPMC3525548 | BioStudies
2015-01-01 | S-EPMC4413241 | BioStudies
2011-01-01 | S-EPMC3065704 | BioStudies
2019-01-01 | S-EPMC6393237 | BioStudies
2018-01-01 | S-EPMC5752016 | BioStudies
2020-01-01 | S-EPMC7224214 | BioStudies
2017-01-01 | S-EPMC5741056 | BioStudies
2015-01-01 | S-EPMC4345404 | BioStudies
2012-01-01 | S-EPMC3511131 | BioStudies