Unknown

Dataset Information

0

Genome-wide analysis of core promoter elements from conserved human and mouse orthologous pairs.


ABSTRACT: The canonical core promoter elements consist of the TATA box, initiator (Inr), downstream core promoter element (DPE), TFIIB recognition element (BRE) and the newly-discovered motif 10 element (MTE). The motifs for these core promoter elements are highly degenerate, which tends to lead to a high false discovery rate when attempting to detect them in promoter sequences.In this study, we have performed the first analysis of these core promoter elements in orthologous mouse and human promoters with experimentally-supported transcription start sites. We have identified these various elements using a combination of positional weight matrices (PWMs) and the degree of conservation of orthologous mouse and human sequences--a procedure that significantly reduces the false positive rate of motif discovery. Our analysis of 9,010 orthologous mouse-human promoter pairs revealed two combinations of three-way synergistic effects, TATA-Inr-MTE and BRE-Inr-MTE. The former has previously been putatively identified in human, but the latter represents a novel synergistic relationship.Our results demonstrate that DNA sequence conservation can greatly improve the identification of functional core promoter elements in the human genome. The data also underscores the importance of synergistic occurrence of two or more core promoter elements. Furthermore, the sequence data and results presented here can help build better computational models for predicting the transcription start sites in the promoter regions, which remains one of the most challenging problems.

SUBMITTER: Jin VX 

PROVIDER: S-EPMC1475891 | biostudies-other | 2006 Mar

REPOSITORIES: biostudies-other

altmetric image

Publications

Genome-wide analysis of core promoter elements from conserved human and mouse orthologous pairs.

Jin Victor X VX   Singer Gregory A C GA   Agosto-Pérez Francisco J FJ   Liyanarachchi Sandya S   Davuluri Ramana V RV  

BMC bioinformatics 20060307


<h4>Background</h4>The canonical core promoter elements consist of the TATA box, initiator (Inr), downstream core promoter element (DPE), TFIIB recognition element (BRE) and the newly-discovered motif 10 element (MTE). The motifs for these core promoter elements are highly degenerate, which tends to lead to a high false discovery rate when attempting to detect them in promoter sequences.<h4>Results</h4>In this study, we have performed the first analysis of these core promoter elements in ortholo  ...[more]

Similar Datasets

| S-EPMC3812177 | biostudies-literature
| S-EPMC310816 | biostudies-literature
| S-EPMC4615752 | biostudies-literature
| S-EPMC3118216 | biostudies-literature
| S-EPMC3544860 | biostudies-literature
| S-EPMC193685 | biostudies-literature