Unknown

Dataset Information

0

CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes.


ABSTRACT: BACKGROUND: The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-representation of transcription factor binding sites in a set of similarly regulated promoters or to look for conservation in orthologous promoter alignments. RESULTS: We have developed a novel tool, "CORE_TF" (Conserved and Over-REpresented Transcription Factor binding sites) that identifies common transcription factor binding sites in promoters of co-regulated genes. To improve upon existing binding site predictions, the tool searches for position weight matrices from the TRANSFAC R database that are over-represented in an experimental set compared to a random set of promoters and identifies cross-species conservation of the predicted transcription factor binding sites. The algorithm has been evaluated with expression and chromatin-immunoprecipitation on microarray data. We also implement and demonstrate the importance of matching the random set of promoters to the experimental promoters by GC content, which is a unique feature of our tool. CONCLUSION: The program CORE_TF is accessible in a user friendly web interface at http://www.LGTC.nl/CORE_TF. It provides a table of over-represented transcription factor binding sites in the users input genes' promoters and a graphical view of evolutionary conserved transcription factor binding sites. In our test data sets it successfully predicts target transcription factors and their binding sites.

SUBMITTER: Hestand MS 

PROVIDER: S-EPMC2613159 | biostudies-literature | 2008

REPOSITORIES: biostudies-literature

altmetric image

Publications

CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes.

Hestand Matthew S MS   van Galen Michiel M   Villerius Michel P MP   van Ommen Gert-Jan B GJ   den Dunnen Johan T JT   't Hoen Peter A C PA  

BMC bioinformatics 20081126


<h4>Background</h4>The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-representation of transcription factor binding sites in a set of similarly regulated promoters or to look for conservation in orthologous promoter alignments.<h4>Results</h4>We have developed a  ...[more]

Similar Datasets

| S-EPMC5869121 | biostudies-literature
2013-06-13 | GSE40748 | GEO
| S-EPMC4041754 | biostudies-literature
2013-06-13 | E-GEOD-40748 | biostudies-arrayexpress
| S-EPMC5734273 | biostudies-literature
| S-EPMC1456992 | biostudies-literature