Dataset Information

Computational identification of new structured cis-regulatory elements in the 3'-untranslated region of human protein coding genes.


ABSTRACT: Messenger ribonucleic acids (mRNAs) contain a large number of cis-regulatory RNA elements that function in many types of post-transcriptional regulation. These cis-regulatory elements are often characterized by conserved structures and/or sequences. Although some classes are well known, given the wide range of RNA-interacting proteins in eukaryotes, it is likely that many new classes of cis-regulatory elements are yet to be discovered. One approach to finding them is to use computational methods, which have the advantage of analysing genomic data, particularly comparative data, on a large scale. In this study, a set of structural discovery algorithms was applied, followed by support vector machine (SVM) classification. We trained a new classification model (CisRNA-SVM) on a set of known structured cis-regulatory elements from 3'-untranslated regions (UTRs). The model successfully distinguished these elements, as well as groups of cis-regulatory elements that it had not been trained on, from control genomic and shuffled sequences. The new method outperformed previous methods in classification of cis-regulatory RNA elements. This model was then used to predict new elements from cross-species conserved regions of human 3'-UTRs. Clustering of these elements identified new classes of potential cis-regulatory elements. The model, training and testing sets and novel human predictions are available at: http://mRNA.otago.ac.nz/CisRNA-SVM.
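
The abstract mentions shuffled sequences as one kind of negative control for the classifier. As an illustration only, the sketch below shows one simple way such controls could be generated: a mononucleotide shuffle that preserves length and base composition. The abstract does not state which shuffling procedure the authors used, so the function name `shuffled_controls`, its parameters, and the example sequence are all hypothetical.

```python
import random
from collections import Counter


def shuffled_controls(seq, n_controls=3, seed=0):
    """Return n_controls randomly shuffled copies of seq.

    Assumption: a plain mononucleotide shuffle. It keeps the length and
    base composition of the input but destroys its order, and with it
    any conserved structure. This is not necessarily the procedure
    used to build the CisRNA-SVM control sets.
    """
    rng = random.Random(seed)  # seeded so the controls are reproducible
    controls = []
    for _ in range(n_controls):
        bases = list(seq)
        rng.shuffle(bases)  # in-place random permutation of the bases
        controls.append("".join(bases))
    return controls


# Hypothetical RNA fragment, used only to exercise the function.
example_utr = "AUGGCUAGCUAGGCUUAACGGAUCCGAUUAGC"
controls = shuffled_controls(example_utr)

for control in controls:
    # Each control has the same base counts as the input sequence.
    print(control, Counter(control) == Counter(example_utr))
```

A mononucleotide shuffle is the simplest choice. Controls that also preserve dinucleotide frequencies are a common, stricter alternative for RNA structure work, because base-stacking energies depend on neighbouring bases.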

SUBMITTER: Chen XS 

PROVIDER: S-EPMC3467077 | biostudies-literature | 2012 Oct

REPOSITORIES: biostudies-literature

Publications

Computational identification of new structured cis-regulatory elements in the 3'-untranslated region of human protein coding genes.

Xiaowei Sylvia Chen, Chris M. Brown

Nucleic Acids Research, 20 July 2012, issue 18

Similar Datasets

| S-EPMC4615190 | biostudies-literature
| S-EPMC2612703 | biostudies-literature
| S-EPMC5885252 | biostudies-literature
| S-EPMC5115852 | biostudies-literature
| S-EPMC8443183 | biostudies-literature
| S-EPMC6562536 | biostudies-literature