Unknown

Dataset Information

0

CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation.


ABSTRACT: Identification of transcriptional regulatory regions and tracing their internal organization are important for understanding the eukaryotic cell machinery. Cis-regulatory modules (CRMs) of higher eukaryotes are believed to possess a regulatory 'grammar', or preferred arrangement of binding sites, that is crucial for proper regulation and thus tends to be evolutionarily conserved. Here, we present a method CORECLUST (COnservative REgulatory CLUster STructure) that predicts CRMs based on a set of positional weight matrices. Given regulatory regions of orthologous and/or co-regulated genes, CORECLUST constructs a CRM model by revealing the conserved rules that describe the relative location of binding sites. The constructed model may be consequently used for the genome-wide prediction of similar CRMs, and thus detection of co-regulated genes, and for the investigation of the regulatory grammar of the system. Compared with related methods, CORECLUST shows better performance at identification of CRMs conferring muscle-specific gene expression in vertebrates and early-developmental CRMs in Drosophila.

SUBMITTER: Nikulova AA 

PROVIDER: S-EPMC3384346 | biostudies-literature | 2012 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

CORECLUST: identification of the conserved CRM grammar together with prediction of gene regulation.

Nikulova Anna A AA   Favorov Alexander V AV   Sutormin Roman A RA   Makeev Vsevolod J VJ   Mironov Andrey A AA  

Nucleic acids research 20120315 12


Identification of transcriptional regulatory regions and tracing their internal organization are important for understanding the eukaryotic cell machinery. Cis-regulatory modules (CRMs) of higher eukaryotes are believed to possess a regulatory 'grammar', or preferred arrangement of binding sites, that is crucial for proper regulation and thus tends to be evolutionarily conserved. Here, we present a method CORECLUST (COnservative REgulatory CLUster STructure) that predicts CRMs based on a set of  ...[more]

Similar Datasets

| S-EPMC7334449 | biostudies-literature
| S-EPMC6203363 | biostudies-literature
| S-EPMC3752241 | biostudies-literature
| S-EPMC7969006 | biostudies-literature
| S-EPMC2572636 | biostudies-literature
| S-EPMC5728398 | biostudies-literature
| S-EPMC5291205 | biostudies-literature
| S-EPMC6602064 | biostudies-literature
| S-EPMC6602064 | biostudies-literature
| S-EPMC2920418 | biostudies-literature