Unknown

Dataset Information

0

TFBSshape: an expanded motif database for DNA shape features of transcription factor binding sites.


ABSTRACT: TFBSshape (https://tfbsshape.usc.edu) is a motif database for analyzing structural profiles of transcription factor binding sites (TFBSs). The main rationale for this database is to be able to derive mechanistic insights in protein-DNA readout modes from sequencing data without available structures. We extended the quantity and dimensionality of TFBSshape, from mostly in vitro to in vivo binding and from unmethylated to methylated DNA. This new release of TFBSshape improves its functionality and launches a responsive and user-friendly web interface for easy access to the data. The current expansion includes new entries from the most recent collections of transcription factors (TFs) from the JASPAR and UniPROBE databases, methylated TFBSs derived from in vitro high-throughput EpiSELEX-seq binding assays and in vivo methylated TFBSs from the MeDReaders database. TFBSshape content has increased to 2428 structural profiles for 1900 TFs from 39 different species. The structural profiles for each TFBS entry now include 13 shape features and minor groove electrostatic potential for standard DNA and four shape features for methylated DNA. We improved the flexibility and accuracy for the shape-based alignment of TFBSs and designed new tools to compare methylated and unmethylated structural profiles of TFs and methods to derive DNA shape-preserving nucleotide mutations in TFBSs.

SUBMITTER: Chiu TP 

PROVIDER: S-EPMC7145579 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

TFBSshape: an expanded motif database for DNA shape features of transcription factor binding sites.

Chiu Tsu-Pei TP   Xin Beibei B   Markarian Nicholas N   Wang Yingfei Y   Rohs Remo R  

Nucleic acids research 20200101 D1


TFBSshape (https://tfbsshape.usc.edu) is a motif database for analyzing structural profiles of transcription factor binding sites (TFBSs). The main rationale for this database is to be able to derive mechanistic insights in protein-DNA readout modes from sequencing data without available structures. We extended the quantity and dimensionality of TFBSshape, from mostly in vitro to in vivo binding and from unmethylated to methylated DNA. This new release of TFBSshape improves its functionality and  ...[more]

Similar Datasets

| S-EPMC5042832 | biostudies-literature
| S-EPMC3526315 | biostudies-other
| S-EPMC4618392 | biostudies-literature
| S-EPMC5728407 | biostudies-literature
| S-EPMC7607496 | biostudies-literature
| S-EPMC4686966 | biostudies-literature
| S-EPMC5291265 | biostudies-literature
| S-EPMC3105477 | biostudies-literature
| S-EPMC3965012 | biostudies-literature