Unknown

Dataset Information

0

IRBIS: a systematic search for conserved complementarity.


ABSTRACT: IRBIS is a computational pipeline for detecting conserved complementary regions in unaligned orthologous sequences. Unlike other methods, it follows the "first-fold-then-align" principle in which all possible combinations of complementary k-mers are searched for simultaneous conservation. The novel trimming procedure reduces the size of the search space and improves the performance to the point where large-scale analyses of intra- and intermolecular RNA-RNA interactions become possible. In this article, I provide a rigorous description of the method, benchmarking on simulated and real data, and a set of stringent predictions of intramolecular RNA structure in placental mammals, drosophilids, and nematodes. I discuss two particular cases of long-range RNA structures that are likely to have a causal effect on single- and multiple-exon skipping, one in the mammalian gene Dystonin and the other in the insect gene Ca-α1D. In Dystonin, one of the two complementary boxes contains a binding site of Rbfox protein similar to one recently described in Enah gene. I also report that snoRNAs and long noncoding RNAs (lncRNAs) have a high capacity of base-pairing to introns of protein-coding genes, suggesting possible involvement of these transcripts in splicing regulation. I also find that conserved sequences that occur equally likely on both strands of DNA (e.g., transcription factor binding sites) contribute strongly to the false-discovery rate and, therefore, would confound every such analysis. IRBIS is an open-source software that is available at http://genome.crg.es/~dmitri/irbis/.

SUBMITTER: Pervouchine DD 

PROVIDER: S-EPMC4174434 | biostudies-literature | 2014 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

IRBIS: a systematic search for conserved complementarity.

Pervouchine Dmitri D DD  

RNA (New York, N.Y.) 20140820 10


IRBIS is a computational pipeline for detecting conserved complementary regions in unaligned orthologous sequences. Unlike other methods, it follows the "first-fold-then-align" principle in which all possible combinations of complementary k-mers are searched for simultaneous conservation. The novel trimming procedure reduces the size of the search space and improves the performance to the point where large-scale analyses of intra- and intermolecular RNA-RNA interactions become possible. In this  ...[more]

Similar Datasets

| S-EPMC3292681 | biostudies-literature
| S-EPMC1199595 | biostudies-literature
| S-EPMC1395340 | biostudies-literature
| S-EPMC4848107 | biostudies-literature
| S-EPMC5079492 | biostudies-other
| S-EPMC9678781 | biostudies-literature
| S-EPMC6125683 | biostudies-literature
| S-EPMC122330 | biostudies-literature
| S-EPMC8162549 | biostudies-literature
| S-EPMC3763539 | biostudies-literature