Proteomics

Dataset Information

0

InstaNovo-P: A de novo peptide sequencing model for phosphoproteomics


ABSTRACT: Phosphorylation, a crucial post-translational modification (PTM), plays a central role in cellular signaling and disease mechanisms. Mass spectrometry-based phosphoproteomics is widely used for system-wide characterization of phosphorylation events. However, traditional methods struggle with accurate phosphorylated site localization, complex search spaces, and detecting sequences outside the reference database. Advances in de novo peptide sequencing offer opportunities to address these limitations, but have yet to become integrated and adapted for phosphoproteomics datasets. Here, we present InstaNovo-P, a phosphorylation specific version of our transformer-based InstaNovo model, fine-tuned on extensive phosphoproteomics datasets. InstaNovo-P significantly surpasses existing methods in phosphorylated peptide detection and phosphorylated site localization accuracy across multiple datasets, including complex experimental scenarios. Our model robustly identifies peptides with single and multiple phosphorylated sites, effectively localizing phosphorylation events on serine, threonine, and tyrosine residues. We experimentally validate our model predictions by studying FGFR2 signaling, further demonstrating that InstaNovo-P uncovers phosphorylated sites previously missed by traditional database searches. These predictions align with critical biological processes, confirming the model’s capacity to yield valuable biological insights. InstaNovo-P adds value to phosphoproteomics experiments by effectively identifying biologically relevant phosphorylation events without prior information, providing a powerful analytical tool for the dissection of signaling pathways.

INSTRUMENT(S):

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Breast Epithelium, Cell Culture

DISEASE(S): Breast Cancer

SUBMITTER: Vahap Canbay  

LAB HEAD: Konstantinos Kalogeropoulos

PROVIDER: PXD074105 | Pride | 2026-05-20

REPOSITORIES: pride

Dataset's files

Source:
Action DRS
20260126_MH_ELBE_VC_uPAC50_68min_0000_DIA_DeNovo_WT_1.raw Raw
20260126_MH_ELBE_VC_uPAC50_68min_0000_DIA_DeNovo_WT_2.raw Raw
20260205_134604_20260126_WT_INP_V2.sne Other
checksum.txt Txt
Items per page:
1 - 4 of 4

Similar Datasets

2010-05-20 | E-GEOD-21917 | biostudies-arrayexpress
2026-03-17 | PXD062859 | Pride
2010-05-20 | GSE21917 | GEO
2026-05-22 | PXD063292 | panorama
2025-12-09 | MTBLS7390 | MetaboLights
2022-09-05 | PXD030352 | Pride
2021-12-25 | PXD025376 | Pride
2014-09-25 | E-GEOD-61713 | biostudies-arrayexpress
2025-05-06 | PXD052217 | Pride
2015-10-01 | GSE57017 | GEO