Proteomics

Dataset Information

0

Identification and analysis of small proteins and short open reading frame encoded peptides in Hep3B cell


ABSTRACT: The small proteins and short open reading frames encoded peptides (SEPs) are of fundamental importance because of their essential roles in biological processes. However, the annotation or identification of them is challenging, in part owing to the limitation of the traditional genome annotation pipeline and their inherent characteristics of low abundance and low molecular weight. To discover and characterize SEPs in Hep3B cell line, we developed an optimized peptidomic assay by combining different peptide extraction and separation methods. The organic solvent precipitation method in peptidomic showed promotion in the enrichment of low molecular proteins or peptides, and the data clearly showed a beneficial effect from the reduction of sample complexity, resulting in high-quality MS/MS spectra. Furthermore, different strategies exhibited good complementarity in improving the total amount of small proteins and their sequence coverage. In total, 1192 proteins within less than 100 amino acids were identified, including 271 newly discovered SEPs that been annotated in the OpenProt database and 147 SEPs of them encoded from ncRNA or lincRNA. Results in this work provide robust evidence to date that the human proteome is more complicated than previously appreciated, and this will be a benefit to discoveries of proteins without function annotation. SIGNIFICANCE: In this work, methods were optimized to identify SEPs in Hep3B. The organic solvent precipitation presents promotion in enrichment of low molecular proteins or peptides, and the data clearly showed a beneficial effect from the reduction of sample complexity, resulting in high quality MS/MS spectra. Different strategies exhibited good complementarity in improving total amount of small proteins and their sequence coverage. In total, 1192 proteins within less than 100 amino acids were identified, including 271 newly discovered SEPs that been annotated in the OpenProt database and 147 SEPs of them encoded from ncRNA or lincRNA. Furthermore, 22 SEPs generated from the uORF may has potential effect in translation control, and 149 newly identified SEPs have known functional domains or cross-species conservation. Results in this work present robust evidence for the coding potential of the ignored region of human genomes and may provide additional insights into tumor biology.

ORGANISM(S): Homo Sapiens

SUBMITTER: Cuihong Wan  

PROVIDER: PXD025813 | iProX | Wed May 05 00:00:00 BST 2021

REPOSITORIES: iProX

altmetric image

Publications

Identification and analysis of small proteins and short open reading frame encoded peptides in Hep3B cell.

Wang Bing B   Hao Junhui J   Pan Ni N   Wang Zhiwei Z   Chen Yinxuan Y   Wan Cuihong C  

Journal of proteomics 20200903


The small proteins and short open reading frames encoded peptides (SEPs) are of fundamental importance because of their essential roles in biological processes. However, the annotation or identification of them is challenging, in part owing to the limitation of the traditional genome annotation pipeline and their inherent characteristics of low abundance and low molecular weight. To discover and characterize SEPs in Hep3B cell line, we developed an optimized peptidomic assay by combining differe  ...[more]

Similar Datasets

2021-06-15 | PXD019486 | Pride
2021-01-11 | GSE164239 | GEO
2012-11-14 | E-GEOD-34740 | biostudies-arrayexpress
2023-05-06 | PXD016981 | Pride
2020-03-18 | PXD016718 | Pride
2020-10-12 | PXD017416 | Pride
2019-06-26 | MSV000084014 | MassIVE
2021-10-07 | PXD025297 | Pride
2023-03-13 | PXD034931 | Pride
2021-04-08 | PXD025249 | iProX