Genomics

Dataset Information

Translation and Natural Selection of long non-canonical RNA micropeptides


ABSTRACT: Long noncoding RNAs (lncRNAs) are transcripts longer than 200 nucleotides but lacking canonical coding sequences. Apparently unable to produce peptides, lncRNA function seems to involve only RNA sequence and structure. Here, we exhaustively detect in vivo translation of small open reading frames (small ORFs) within lncRNAs using ribosome profiling during Drosophila melanogaster embryogenesis. We show that around 30% of lncRNAs contain small ORFs engaged by ribosomes, leading to regulated translation of 100 to 300 micropeptides. We identify lncRNA features that favour translation, such as cistronicity, Kozak sequences, and conservation. For the latter, we develop a bioinformatics pipeline to detect small ORF homologues, and we reveal evidence of natural selection favouring the conservation of micropeptide sequence and function across evolution. Our results expand the repertoire of lncRNA functions, and suggest that lncRNAs give rise to novel coding genes throughout evolution. Since most lncRNAs contain small ORFs with as yet unknown translation potential, we propose to rename them “long non-canonical RNAs”.

ORGANISM(S): Drosophila melanogaster

PROVIDER: GSE204739 | GEO | 2022/11/02

REPOSITORIES: GEO

Similar Datasets

2020-01-09 | PXD014553 | Pride
2014-04-04 | GSE53693 | GEO
2024-01-26 | PXD046452 | Pride
2021-07-16 | GSE166214 | GEO
2021-10-05 | PXD025267 | Pride
2021-01-20 | GSE165066 | GEO
2014-01-19 | E-GEOD-43520 | biostudies-arrayexpress
| PRJNA632889 | ENA
2022-11-14 | PXD034587 | Pride
2018-11-30 | GSE80554 | GEO