Proteomics

Dataset Information

0

A Hidden Human Proteome Encoded by the “Non-Coding” Genes


ABSTRACT: It has been a long debate whether the 98% “non-coding” fraction of human genome can encode functional proteins besides a “random noise” of translation. We used our established translatome sequencing (RNC-seq) to analyze human cells and found that up to 3330 long non-coding RNAs (lncRNAs) were bound to ribosomes and thus might be translated into proteins (with more than 50 amino acids). These new protein-coding genes distributed universally in all human chromosomes. We then used various experimental methods including mass spectrometry, immunoblotting, subcellular localization and phenotype assessments to verify the existence of such a hidden human proteome encoded by purported lncRNAs that can express functional proteins. These new proteins deviate from the canonical proteins in various physical and chemical properties, and emerged mostly in primates during evolution. In sum, we experimentally evidenced a hidden human functional proteome encoded by purported lncRNAs, suggesting that the human genome has to be systematically re-annotated.

INSTRUMENT(S): TripleTOF 5600

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Cell Culture

SUBMITTER: Yang Wang  

LAB HEAD: Qing-Yu He

PROVIDER: PXD005291 | Pride | 2022-02-25

REPOSITORIES: Pride

altmetric image

Publications


It has been a long debate whether the 98% 'non-coding' fraction of human genome can encode functional proteins besides short peptides. With full-length translating mRNA sequencing and ribosome profiling, we found that up to 3330 long non-coding RNAs (lncRNAs) were bound to ribosomes with active translation elongation. With shotgun proteomics, 308 lncRNA-encoded new proteins were detected. A total of 207 unique peptides of these new proteins were verified by multiple reaction monitoring (MRM) and  ...[more]

Similar Datasets

2019-08-29 | PXD044056 | iProX
2022-02-06 | GSE196032 | GEO
2023-03-18 | GSE196927 | GEO
| PRJNA803053 | ENA
2014-01-07 | E-GEOD-40364 | biostudies-arrayexpress
2014-01-07 | GSE40364 | GEO
2023-04-26 | PXD030066 | Pride
2015-05-15 | E-GEOD-54964 | biostudies-arrayexpress
2015-05-15 | E-GEOD-54966 | biostudies-arrayexpress
2016-11-04 | GSE84722 | GEO