Proteomics

Dataset Information

0

Deeply mining a universe of peptides Encoded by Long Noncoding RNAs


ABSTRACT: Long non-coding RNAs (lncRNAs) are generally defined as RNA transcripts longer than 200 nucleotides that are not translated into proteins. Recently, many small open reading frames (smORFs) embedded in lncRNA scripts have been verified to be able to encode functional polypeptides (namely lncRNA-SEPs here). Although collaborative analysis by advanced genomics, bioinformatics and proteomics largely drives SEPs discovery, the poor predictability, diminutive size and low abundance still challenge systematic identification of SEPs from different biological samples. Here, we took advantage of the NONCODE database that deposited with the most complete collection and annotation of lncRNA transcripts from different species to build a database that to maximally collect all putative small ORFs from human and mouse lncRNA transcripts. Two effective and complementary polypeptides enrichment strategies (30 kDa MWCO filter and C8 SPE column) were also integrated to further improve the discovery of novel lncRNA-SEPs. These efforts led to the discovery of 362 lncRNA-SEPs from 8 human cell lines and 238 lncRNA-SEPs from 3 mouse cell lines and 8 mouse tissues. 18 out of these lncRNA-SEPs were verified experimentally by multiple technologies including in vitro expression, immunoblotting and parallel reaction monitoring-based mass spectrometry (PRM-MS) in 293T cells. Further bioinformatic analysis reveals that the physical and chemical properties of these novel lncRNA-SEPs, such as amino acid composition and codon usage, are varied from canonical proteins. Intriguingly, nearly 70% of the identified lncRNA-SEPs were found to be initiated with non-AUG start codons. Collectively, the efficient workflows presented in this study enables us identify 600 novel lncRNA-SPEs from multiple cell lines and tissues, which should represent the largest number of MS-detected lncRNA-encoding SEPs ever reported to date. These novel lncRNA-SEPs not only could provide new clues for the annotation of the noncoding elements in the genome, but also could serve as a valuable resource for the functional characterization of individual lncRNA-SEPs.

INSTRUMENT(S): Q Exactive

ORGANISM(S): Homo Sapiens (human) Mus Musculus (mouse)

TISSUE(S): Spleen, Heart, Testis, Brain, Liver, Lung, Cell Culture, Kidney

SUBMITTER: Qing Zhang  

LAB HEAD: Fuquan Yang

PROVIDER: PXD016981 | Pride | 2023-05-06

REPOSITORIES: Pride

altmetric image

Publications

LncRNA-encoded microproteins: A new form of cargo in cell culture-derived and circulating extracellular vesicles.

Cai Tanxi T   Zhang Qing Q   Wu Bowen B   Wang Jifeng J   Li Na N   Zhang Tingting T   Wang Zhipeng Z   Luo Jianjun J   Guo Xiaojing X   Ding Xiang X   Xie Zhensheng Z   Niu Lili L   Ning Weihai W   Fan Zhen Z   Chen Xiaowei X   Guo Xiangqian X   Chen Runsheng R   Zhang Hongwei H   Yang Fuquan F  

Journal of extracellular vesicles 20210712 9


Advancements in omics-based technologies over the past few years have led to the discovery of numerous biologically relevant peptides encoded by small open reading frames (smORFs) embedded in long noncoding RNA (lncRNA) transcripts (referred to as microproteins here) in a variety of species. However, the mechanisms and modes of action that underlie the roles of microproteins have yet to be fully characterized. Herein, we provide the first experimental evidence of abundant microproteins in extrac  ...[more]

Similar Datasets

2021-06-15 | PXD019486 | Pride
2012-11-14 | E-GEOD-34740 | biostudies-arrayexpress
2017-10-17 | PXD005643 | Pride
2023-06-14 | PXD040463 | Pride
2022-11-14 | PXD034587 | Pride
2022-11-23 | E-MTAB-8730 | biostudies-arrayexpress
2014-02-21 | E-GEOD-55191 | biostudies-arrayexpress
2022-01-17 | E-MTAB-11367 | biostudies-arrayexpress
2019-06-26 | MSV000084014 | MassIVE
2015-04-10 | E-GEOD-67106 | biostudies-arrayexpress