Proteomics

Dataset Information

0

Carafe enables high quality in silico spectral library generation for data-independent acquisition proteomics


ABSTRACT: Data-independent acquisition (DIA)-based mass spectrometry is becoming an increasingly popular mass spectrometry acquisition strategy for carrying out quantitative proteomics experiments. Most of the popular DIA search engines make use of in silico generated spectral libraries. However, the generation of high-quality spectral libraries for DIA data analysis remains a challenge, particularly because most such libraries are generated directly from data-dependent acquisition (DDA) data or are from in silico prediction using models trained on DDA data. In this study, we developed Carafe, a tool that generates high-quality experiment-specific in silico spectral libraries by training deep learning models directly on DIA data. We demonstrate the performance of Carafe on a wide range of DIA datasets, where we observe improved fragment ion intensity prediction and peptide detection relative to existing pretrained DDA models.

ORGANISM(S): Homo Sapiens Saccharomyces Cerevisiae

SUBMITTER: Chris Hsu  

PROVIDER: PXD056793 | panorama | Sun Nov 09 00:00:00 GMT 2025

REPOSITORIES: PanoramaPublic

altmetric image

Publications

Carafe enables high quality in silico spectral library generation for data-independent acquisition proteomics.

Wen Bo B   Hsu Chris C   Shteynberg David D   Zeng Wen-Feng WF   Riffle Michael M   Chang Alexis A   Mudge Miranda C MC   Nunn Brook L BL   MacLean Brendan X BX   Berg Matthew D MD   Villén Judit J   MacCoss Michael J MJ   Noble William S WS  

Nature communications 20251106 1


Data-independent acquisition (DIA)-based mass spectrometry is becoming an increasingly popular mass spectrometry acquisition strategy for carrying out quantitative proteomics experiments. Most of the popular DIA search engines make use of in silico generated spectral libraries. However, the generation of high-quality spectral libraries for DIA data analysis remains a challenge, particularly because most such libraries are generated directly from data-dependent acquisition (DDA) data or are from  ...[more]

Similar Datasets

2021-06-08 | PXD021937 | Pride
2024-05-06 | PXD044981 | Pride
2017-11-20 | PXD006934 | Pride
2021-04-13 | PXD022950 | Pride
2018-03-26 | MTBLS417 | MetaboLights
2019-11-08 | PXD012987 | Pride
2019-11-08 | PXD012988 | Pride
2019-11-08 | PXD012986 | Pride
2019-11-08 | PXD014956 | Pride
2019-12-02 | PXD014108 |