Proteomics

Dataset Information

0

Evaluation of the False Discovery Rate in Library-free Search by DIA-NN Using in vitro Human Proteome


ABSTRACT: In recent years, deep learning-based in silico spectral libraries have gained increasing attention. Several data-independent acquisition (DIA) software tools have integrated this feature, known as library-free search, making DIA analysis more accessible. However, controlling the false discovery rate (FDR) is challenging due to the vast amount of peptide information in in silico libraries. In this study, we introduce a stringent method to evaluate FDR control in DIA software. Recombinant proteins were synthesized from full-length human cDNA libraries, measured using liquid chromatography-mass spectrometry (LC-MS/MS), and analyzed with DIA software. The results were compared to known protein sequences to calculate FDR. We compared the identification performance of DIA-NN versions 1.8.1 and 1.9.2. Our results show that version 1.9.2 identified more peptides than version 1.8.1, though no significant difference was observed at the protein level. DIA-NN 1.9.2 uses a more conservative identification approach, significantly improving FDR control. Across 12 samples analyzed, the average FDR at the peptide level was 0.58% for version 1.8.1 and 0.43% for version 1.9.2, and at the protein level, 2.74% and 1.77%, respectively. Our dataset provides valuable insights for comparing FDR control across DIA software and aiding bioinformaticians in enhancing their tools.

ORGANISM(S): Cellular Organisms

SUBMITTER: Sumio Ohtsuki, Kumamoto University 

PROVIDER: PXD056519 | JPOST Repository | Wed Jul 23 00:00:00 BST 2025

REPOSITORIES: jPOST

Dataset's files

Source:
Action DRS
Mix1.wiff Wiff
Mix1.wiff.scan Wiff
Mix10.wiff Wiff
Mix10.wiff.scan Wiff
Mix11.wiff Wiff
Items per page:
1 - 5 of 20
altmetric image

Publications

Evaluation of the False Discovery Rate in Library-Free Search by DIA-NN Using <i>In Vitro</i> Human Proteome.

Gu Kongxin K   Kenko Masanaga M   Ogawa Koji K   Goshima Naoki N   Masuda Takeshi T   Ito Shingo S   Ohtsuki Sumio S  

Journal of proteome research 20250718 8


Recently, deep-learning-based <i>in silico</i> spectral libraries have gained increasing attention. Several data-independent acquisition (DIA) software tools have integrated this feature, known as a library-free search, thereby making DIA analysis more accessible. However, controlling the false discovery rate (FDR) is challenging owing to the vast amount of peptide information in <i>in silico</i> libraries. In this study, we introduced a stringent method to evaluate FDR control using DIA softwar  ...[more]

Similar Datasets

2024-05-06 | PXD044981 | Pride
2024-05-31 | PXD047793 | Pride
2019-11-06 | PXD014690 | Pride
2022-01-17 | PXD028901 | Pride
2025-04-10 | MSV000097602 | MassIVE
| MSV000095765 | MassIVE
2021-07-09 | PXD022589 | Pride
2021-07-09 | PXD022582 | Pride
2023-08-10 | PXD039759 | Pride
2024-11-11 | PXD057731 |