Dataset Information

Stochastic sampling of the RNA structural alignment space.


ABSTRACT: A novel method is presented for predicting the common secondary structures and the alignment of two homologous RNA sequences by sampling the 'structural alignment' space, i.e. the joint space of their alignments and common secondary structures. The structural alignment space is sampled according to a pseudo-Boltzmann distribution based on a pseudo-free energy change that combines base pairing probabilities from a thermodynamic model with alignment probabilities from a hidden Markov model. By virtue of the implicit comparative analysis between the two sequences, the method offers an improvement over single sequence sampling of the Boltzmann ensemble. A cluster analysis shows that samples obtained by joint sampling of the structural alignment space cluster more closely than samples generated by the single sequence method. On average, the representative (centroid) structure of the most populated cluster obtained by joint sampling is more accurate than the one obtained by single sequence sampling, and the corresponding centroid alignment is more accurate than an alignment based on sequence alone. The 'best' centroid structure, i.e. the centroid closest to the known structure among all centroids, is on average more accurate than the structure predictions of other methods. Additionally, cluster analysis identifies, on average, a few clusters, whose centroids can be presented as alternative candidates. The source code for the proposed method can be downloaded at http://rna.urmc.rochester.edu.
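
The two ideas the abstract relies on, sampling in proportion to a Boltzmann-style weight and summarizing a sample by a centroid, can be illustrated with a toy Python sketch. This is not the authors' implementation: the candidate structures, the pseudo-free energy values, the function names, and the temperature are all hypothetical, and the centroid rule used here (keep base pairs found in more than half of the samples) is a common definition that the abstract itself does not spell out. The real method samples structural alignments recursively rather than from an enumerated list.

```python
import math
import random
from collections import Counter

# Assumption: RT in kcal/mol at 37 °C (gas constant times 310.15 K).
RT = 0.6163


def pseudo_boltzmann_probs(pseudo_energies, rt=RT):
    """Convert pseudo-free energy changes into normalized sampling probabilities."""
    lowest = min(pseudo_energies)  # shift by the minimum for numerical stability
    weights = [math.exp(-(e - lowest) / rt) for e in pseudo_energies]
    total = sum(weights)
    return [w / total for w in weights]


def centroid(structures):
    """Return the base pairs present in more than half of the sampled structures."""
    counts = Counter(pair for s in structures for pair in s)
    return {pair for pair, n in counts.items() if n > len(structures) / 2}


# Hypothetical candidates: each structure is a set of (i, j) base pairs.
candidates = [
    frozenset({(1, 20), (2, 19), (3, 18)}),
    frozenset({(1, 20), (2, 19), (5, 15)}),
    frozenset({(7, 12)}),
]
# Hypothetical pseudo-free energy changes (kcal/mol), one per candidate.
energies = [-5.0, -4.5, -2.0]

probs = pseudo_boltzmann_probs(energies)
rng = random.Random(0)  # fixed seed so the sketch is reproducible
sample = rng.choices(candidates, weights=probs, k=1000)
print(sorted(centroid(sample)))
```

Lower pseudo-free energy gives a larger weight, so the first candidate is drawn most often, and the centroid keeps only the pairs that the sample agrees on.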

SUBMITTER: Harmanci AO 

PROVIDER: S-EPMC2709569 | BioStudies | 2009-01-01T00:00:00Z

REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC3159474 | BioStudies
2002-01-01 | S-EPMC2373660 | BioStudies
2009-01-01 | S-EPMC2788221 | BioStudies
2006-01-01 | S-EPMC1687212 | BioStudies
2005-01-01 | S-EPMC1370799 | BioStudies
2010-01-01 | S-EPMC2962639 | BioStudies
1000-01-01 | S-EPMC1087833 | BioStudies
1000-01-01 | S-EPMC2867768 | BioStudies
2012-01-01 | S-EPMC3697813 | BioStudies
1000-01-01 | S-EPMC6311937 | BioStudies