Unknown

Dataset Information

0

Automated band annotation for RNA structure probing experiments with numerous capillary electrophoresis profiles.


ABSTRACT:

Motivation

Capillary electrophoresis (CE) is a powerful approach for structural analysis of nucleic acids, with recent high-throughput variants enabling three-dimensional RNA modeling and the discovery of new rules for RNA structure design. Among the steps composing CE analysis, the process of finding each band in an electrophoretic trace and mapping it to a position in the nucleic acid sequence has required significant manual inspection and remains the most time-consuming and error-prone step. The few available tools seeking to automate this band annotation have achieved limited accuracy and have not taken advantage of information across dozens of profiles routinely acquired in high-throughput measurements.

Results

We present a dynamic-programming-based approach to automate band annotation for high-throughput capillary electrophoresis. The approach is uniquely able to define and optimize a robust target function that takes into account multiple CE profiles (sequencing ladders, different chemical probes, different mutants) collected for the RNA. Over a large benchmark of multi-profile datasets for biological RNAs and designed RNAs from the EteRNA project, the method outperforms prior tools (QuSHAPE and FAST) significantly in terms of accuracy compared with gold-standard manual annotations. The amount of computation required is reasonable at a few seconds per dataset. We also introduce an 'E-score' metric to automatically assess the reliability of the band annotation and show it to be practically useful in flagging uncertainties in band annotation for further inspection.

Availability and implementation

The implementation of the proposed algorithm is included in the HiTRACE software, freely available as an online server and for download at http://hitrace.stanford.edu.

Contact

sryoon@snu.ac.kr or rhiju@stanford.edu

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Lee S 

PROVIDER: S-EPMC4560050 | biostudies-literature | 2015 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Automated band annotation for RNA structure probing experiments with numerous capillary electrophoresis profiles.

Lee Seungmyung S   Kim Hanjoo H   Tian Siqi S   Lee Taehoon T   Yoon Sungroh S   Das Rhiju R  

Bioinformatics (Oxford, England) 20150505 17


<h4>Motivation</h4>Capillary electrophoresis (CE) is a powerful approach for structural analysis of nucleic acids, with recent high-throughput variants enabling three-dimensional RNA modeling and the discovery of new rules for RNA structure design. Among the steps composing CE analysis, the process of finding each band in an electrophoretic trace and mapping it to a position in the nucleic acid sequence has required significant manual inspection and remains the most time-consuming and error-pron  ...[more]

Similar Datasets

| S-EPMC2504414 | biostudies-literature
| S-EPMC3696430 | biostudies-literature
| S-EPMC5238798 | biostudies-literature
| S-EPMC6933539 | biostudies-literature
| S-EPMC3864256 | biostudies-literature
| S-EPMC4810379 | biostudies-literature
| S-EPMC6690962 | biostudies-literature