Proteomics

Dataset Information

0

Peptide analysis in human cell lines via high-resolution MRM (PRM) for the annotation of non-canonical open reading frames


ABSTRACT: A major scientific drive is to characterize the protein-coding genome, which is a primary basis for studying human health. But the fundamental question remains: what has been missed in prior analyses? Over the past decade, the translation of non-canonical open reading frames (ncORFs) has been observed across human cell types and disease states, with major implications for proteomics, genomics, and clinical science. However, the impact of ncORFs has been limited by the absence of a generalized understanding of their contribution to the human proteome. Here, we report the collaborative efforts of stakeholders in proteomics, immunopeptidomics, ORF discovery, and gene annotation to produce a consensus landscape of protein-level evidence for ncORFs. We show that at least 25% of a set of 7,264 ncORFs give rise to translated gene products detected in a large-scale analysis encompassing 3.8 billion mass spectra from 95,520 experiments. With these data, we developed an annotation framework for ncORFs and codified public tools for researchers through GENCODE and PeptideAtlas. To probe the biological implications of ncORFs, we created a custom evolutionary analysis approach, termed ORF Relative Branch Length (ORBL), and found that a substantial fraction of ncORFs exhibit evolutionary constraint as open reading frames, which associates with the chance of observing ncORF-derived peptides. To classify such cases, we invoked the model of “peptideins” as a new protein annotation concept referring to ncORF-derived polypeptides that fall short of the definition of a conventional protein. We then employed knockout screens of putative peptideins and characterized one ncORF from the OLMALINC transcript as responsible for a pan-essential cellular phenotype. Overall, our work will provide a platform to advance ncORF-derived proteins in biomedical discovery and, beyond humans, diverse animals and plants where ncORFs are similarly observed.We demonstrated that ncORF peptides are detectable in human cell lysates using targeted mass spectrometry and heavy labeled synthetic peptides. Data were acquired with a ZenoTOF 8600 as well as an Orbitrap Astral mass spectrometer.

INSTRUMENT(S):

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Cell Culture

SUBMITTER: Ulrike Kusebauch  

LAB HEAD: Robert Moritz.

PROVIDER: PXD066599 | Pride | 2026-02-17

REPOSITORIES: Pride

altmetric image

Publications

Expanding the human proteome with microproteins and peptideins.

Deutsch Eric W EW   Kok Leron W LW   Mudge Jonathan M JM   Valls Cristian F CF   Jungreis Irwin I   Ruiz-Orera Jorge J   Sun Zhi Z   Kusebauch Ulrike U   Fierro-Monti Ivo I   Abelin Jennifer G JG   Alba M Mar MM   Aspden Julie L JL   Bandyopadhyay Sreejan S   Banerjee Kaushik K   Baranov Pavel V PV   Bazzini Ariel A AA   Bourassa Francis F   Bruford Elspeth A EA   Calviello Lorenzo L   Carr Steven A SA   Carvunis Anne-Ruxandra AR   Chothani Sonia S   Clauwaert Jim J   Dean Kellie K   Faridi Pouya P   Frankish Adam A   Goodale Amy A   Green Thomas T   Hubner Norbert N   Ingolia Nicholas T NT   Kellis Manolis M   Magrane Michele M   Martin Maria Jesus MJ   Martinez Thomas F TF   Menschaert Gerben G   Ohler Uwe U   Orchard Sandra S   Potter Alisa A   Rackham Owen J L OJL   Rees Matthew G MG   Root David E DE   Roth Jennifer A JA   Roucou Xavier X   Sialana Fernando J FJ   Slavoff Sarah A SA   Świrski Michał I MI   Tierney Jack A S JAS   Trifiro Félix-Antoine FA   Valen Eivind E   Vasylieva Valeriia V   Wacholder Aaron A   Wang Shengbo S   Wang Li L   Weissman Jonathan S JS   Wu Wei W   Xie Zhi Z   Choudhary Jyoti S JS   Bassani-Sternberg Michal M   Vizcaíno Juan Antonio JA   Ternette Nicola N   Brunet Marie A MA   Moritz Robert L RL   Prensner John R JR   van Heesch Sebastiaan S  

Nature 20260506 8119


A major scientific drive is to characterize the protein-coding genome, which is a primary basis for studying human health. But the fundamental question remains of what has been missed in previous analyses. Over the past decade, the translation of non-canonical open reading frames (ncORFs) has been observed across human cell types and disease states<sup>1-3</sup>, with major implications for biomedical science. However, a key gap in knowledge has been which ncORFs produce small microproteins or a  ...[more]

Similar Datasets

2010-06-20 | E-GEOD-15835 | biostudies-arrayexpress
2009-10-26 | E-GEOD-15831 | biostudies-arrayexpress
2010-06-20 | E-GEOD-15834 | biostudies-arrayexpress
2010-06-20 | E-GEOD-15833 | biostudies-arrayexpress
2014-07-02 | E-GEOD-53080 | biostudies-arrayexpress
2009-10-26 | E-GEOD-15832 | biostudies-arrayexpress
2014-07-02 | E-GEOD-53081 | biostudies-arrayexpress
2015-11-25 | E-GEOD-21498 | biostudies-arrayexpress
2013-05-22 | E-GEOD-47125 | biostudies-arrayexpress
2025-09-04 | PXD024809 | Pride