Dataset Information


Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices.

ABSTRACT: Accurate prediction of transcription factor binding sites (TFBSs) is a prerequisite for identifying cis-regulatory modules that underlie transcriptional regulatory circuits encoded in the genome. Here, we present a computational framework for detecting TFBSs, when multiple position weight matrices (PWMs) for a transcription factor are available. Grouping multiple PWMs of a transcription factor (TF) based on their sequence similarity improves the specificity of TFBS prediction, which was evaluated using multiple genome-wide ChIP-Seq data sets from 26 TFs. The Z-scores of the area under a receiver operating characteristic curve (AUC) values of 368 TFs were calculated and used to statistically identify co-occurring regulatory motifs in the TF bound ChIP loci. Motifs that are co-occurring along with the empirical bindings of E2F, JUN or MYC have been evaluated, in the basal or stimulated condition. Results prove our method can be useful to systematically identify the co-occurring motifs of the TF for the given conditions.


PROVIDER: S-EPMC3300004 | BioStudies | 2012-01-01T00:00:00Z

REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC3764009 | BioStudies
2018-01-01 | S-EPMC5902045 | BioStudies
2013-01-01 | S-EPMC3540023 | BioStudies
2017-01-01 | S-EPMC5529029 | BioStudies
2012-01-01 | S-EPMC3525548 | BioStudies
1000-01-01 | S-EPMC6037060 | BioStudies
2018-01-01 | S-EPMC6057437 | BioStudies
2014-01-01 | S-EPMC4057186 | BioStudies
1000-01-01 | S-EPMC3676070 | BioStudies
2020-01-01 | S-EPMC6954419 | BioStudies