Dataset Information

RiboNT: A Noise-Tolerant Predictor of Open Reading Frames from Ribosome-Protected Footprints.


ABSTRACT: Ribo-seq, also known as ribosome profiling, refers to the sequencing of ribosome-protected mRNA fragments (RPFs). This technique has greatly advanced our understanding of translation and facilitated the identification of novel open reading frames (ORFs) within untranslated regions or non-coding sequences, as well as the identification of non-canonical start codons. However, the widespread application of Ribo-seq has been hindered because obtaining periodic RPFs requires a highly optimized protocol, which may be difficult to achieve, particularly in non-model organisms. Furthermore, periodic RPFs are too short (28 nt) for accurate mapping to polyploid genomes, whereas longer RPFs are usually obtained at the cost of reduced periodicity. Here we present RiboNT, a noise-tolerant ORF predictor that can utilize RPFs with poor periodicity. It evaluates RPF periodicity and automatically weighs the support from RPFs and codon usage before combining their contributions to identify translated ORFs. The results demonstrate the utility of RiboNT for identifying both long and small ORFs using RPFs with either good or poor periodicity. We applied the pipeline to a dataset of RPFs with poor periodicity derived from membrane-bound polysomes of Arabidopsis thaliana seedlings and identified several small ORFs (sORFs) that are evolutionarily conserved in diverse plant species. RiboNT should greatly broaden the application of Ribo-seq by relaxing the requirement for RPF quality and allowing the use of longer RPFs, which is critical for organisms with complex genomes because these RPFs can be mapped more accurately to the positions from which they were derived.
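
ILLUSTRATION: The idea of weighing RPF support against codon usage according to periodicity can be sketched as follows. This is a minimal sketch under stated assumptions, not RiboNT's actual implementation: the function names, the linear rescaling of the dominant-frame fraction into a weight, and the weighted-average combination are all hypothetical choices made for illustration. The abstract does not specify the formulas RiboNT uses.

```python
from collections import Counter


def frame_fractions(psite_offsets):
    """Fraction of reads in frames 0, 1, 2.

    psite_offsets: P-site positions relative to the candidate ORF start
    (hypothetical input format, assumed for this sketch).
    """
    counts = Counter(offset % 3 for offset in psite_offsets)
    total = sum(counts.values())
    if total == 0:
        # No reads: treat as completely non-periodic.
        return [1 / 3, 1 / 3, 1 / 3]
    return [counts.get(frame, 0) / total for frame in range(3)]


def periodicity_weight(fractions):
    """Rescale the dominant-frame fraction to [0, 1].

    0 means reads are spread evenly over the three frames,
    1 means every read falls in a single frame.
    The linear rescaling is an assumption of this sketch.
    """
    raw = (max(fractions) - 1 / 3) / (2 / 3)
    return max(0.0, min(1.0, raw))


def combined_score(rpf_score, codon_score, weight):
    """Weighted average: trust RPF evidence more when periodicity is good."""
    return weight * rpf_score + (1 - weight) * codon_score


if __name__ == "__main__":
    offsets = [0, 3, 6, 9, 10, 12, 14, 15]  # made-up example positions
    w = periodicity_weight(frame_fractions(offsets))
    print(combined_score(rpf_score=0.9, codon_score=0.5, weight=w))
```

With poorly periodic reads the weight approaches 0 and the score in this sketch falls back on codon usage; with strongly periodic reads it approaches 1 and RPF support dominates.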

SUBMITTER: Song B 

PROVIDER: S-EPMC8307163 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6168376 | biostudies-literature
2022-05-20 | GSE131514 | GEO
| S-EPMC4940163 | biostudies-literature
2015-12-25 | GSE75290 | GEO
2020-03-14 | GSE131650 | GEO
2015-11-12 | E-GEOD-73136 | biostudies-arrayexpress
2021-04-28 | GSE154491 | GEO
2015-10-14 | GSE67305 | GEO
2019-07-03 | GSE125218 | GEO