Proteomics

Dataset Information

0

An adaptive pipeline to maximize isobaric tagging data in large-scale MS-based proteomics


ABSTRACT: Isobaric tagging is a method of choice in Mass Spectrometry (MS)-based proteomics for comparing multiple conditions at a time. Despite its multiplexing capabilities, when multiple experiments are merged for comparison in large sample-size studies, some drawbacks appears, due to the presence of missing values, which result from the stochastic nature of the Data-Dependent Acquisition (DDA) mode. Another indirect cause of data incompleteness might derive from the proteomic-typical data processing workflow that first identifies proteins in individual experiments and then only quantifies those identified proteins, leaving a large number of unmatched spectra with quantitative information unexploited. Inspired by untargeted metabolomic and label-free proteomic workflows, we developed a quantification-driven bioinformatic pipeline (Quantify then Identify – QtI) that optimizes the processing of isobaric Tandem Mass Tag (TMT) data from large-scale studies. This pipeline includes innovative modules, such as the Peptide Match Rescue (PMR) and the Optimized Post-Translational Modification (OPTM) and outperforms a classical benchmark workflow in terms of quantification and identification rates, significantly reducing missing data while preserving unmatched features for quantitative comparison. The number of unexploited tandem mass spectra was reduced by 77% and 62% for two human cerebrospinal fluid (CSF) and plasma datasets, respectively.

INSTRUMENT(S): LTQ Orbitrap Elite

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Blood Plasma, Cerebrospinal Fluid

SUBMITTER: Charlotte Macron  

LAB HEAD: Loïc Dayon

PROVIDER: PXD008029 | Pride | 2018-05-01

REPOSITORIES: Pride

altmetric image

Publications


Isobaric tagging is the method of choice in mass-spectrometry-based proteomics for comparing several conditions at a time. Despite its multiplexing capabilities, some drawbacks appear when multiple experiments are merged for comparison in large sample-size studies due to the presence of missing values, which result from the stochastic nature of the data-dependent acquisition mode. Another indirect cause of data incompleteness might derive from the proteomic-typical data-processing workflow that  ...[more]

Similar Datasets

2016-05-27 | PXD002967 | Pride
2023-10-24 | PXD042546 | Pride
2015-07-27 | PXD002224 | Pride
2019-07-18 | PXD013210 | Pride
2018-05-01 | PXD005206 | Pride
| PRJNA153537 | ENA
2023-10-12 | PXD036822 | Pride
2021-05-07 | PXD022996 | Pride
2021-05-07 | PXD023012 | Pride
2018-09-24 | PXD009716 | Pride