Dataset Information

Mapping the RNA structural landscape of viral genomes.


ABSTRACT: Functional RNA structures are prevalent in viral genomes and have been shown to play roles in almost every aspect of their biology. However, the majority of viral RNA remains structurally uncharacterized. This is likely to remain true as the cost of sequencing decreases much faster than the cost of structural characterization. Because of this, there is a need for rapid, inexpensive methods to highlight regions of viral RNA that are ideal candidates for structure-function analyses. The ScanFold method was developed as a single-sequence alternative to traditional RNA structural motif discovery pipelines, which rely heavily on well-curated sequence alignments to identify conserved RNA structures. ScanFold focuses on identifying the most likely functional structures encoded within a single large RNA sequence, based on folding energies that are more stable than expected, while allowing predicted motifs to be tested for evidence of structural conservation later. Decoupling these processes can benefit researchers studying viruses that lack the phylogenetic depth needed to yield evidence of structural conservation. Here, we demonstrate that the most significant ScanFold-predicted structures correspond to higher base-pairing probabilities and to SHAPE reactivities, and that they accurately predict known functional structures within the ZIKV and HIV-1 genomes. Best practices and examples are also shown to aid users in applying ScanFold to their own systems of interest. ScanFold is available as a web server (https://mosslabtools.bb.iastate.edu/scanfold) or can be downloaded (https://github.com/moss-lab/ScanFold) and run locally.

SUBMITTER: Andrews RJ 

PROVIDER: S-EPMC7205576 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8127238 | biostudies-literature
| S-EPMC7668258 | biostudies-literature
| S-EPMC7560756 | biostudies-literature
| S-EPMC3708773 | biostudies-literature
2008-05-01 | E-GEOD-10008 | biostudies-arrayexpress
2008-05-01 | E-GEOD-10037 | biostudies-arrayexpress
2008-05-01 | GSE10037 | GEO
2008-05-01 | GSE10008 | GEO
| S-EPMC4573499 | biostudies-literature
| S-EPMC5123813 | biostudies-literature