Proteomics

Dataset Information

0

Evaluating False Transfer Rates from the Match-Between-Runs algorithm with a two-proteome model


ABSTRACT: Stochasticity between independent LC-MS/MS runs is a challenging problem in the field of proteomics resulting in significant missing values (i.e., abundance measurements) among observed peptides. To address this issue, several approaches have been developed including computational methods such as MaxQuant’s Match-Between-Runs (MBR) algorithm. Often dozens of runs are all considered at once by MBR, transferring identifications from any one run to any of the others. To evaluate the error associated with these transfer events, we created a two-sample/two-proteome approach. In this way, samples containing no yeast lysates were assessed for false identification transfers from samples containing yeast. Overall, transfers from any of 20 runs with yeast were assessed to any of 20 runs without yeast were measured. While MBR increased the total number of spectral identifications by ~40%, we also found that 44% of all identified yeast proteins had identifications transferred to at least one sample without yeast. However, of these only 2.7% remained in the final dataset after applying the MaxQuant LFQ algorithm. We conclude that false transfers by MBR are plentiful, but likely have only a small impact on the final dataset due to downstream removal.

INSTRUMENT(S): Orbitrap Fusion Lumos

ORGANISM(S): Homo Sapiens (human) Saccharomyces Cerevisiae (baker's Yeast)

SUBMITTER: Joao Paulo  

LAB HEAD: Steven P. Gygi

PROVIDER: PXD014415 | Pride | 2019-09-25

REPOSITORIES: Pride

altmetric image

Publications

Evaluating False Transfer Rates from the Match-between-Runs Algorithm with a Two-Proteome Model.

Lim Matthew Y MY   Paulo João A JA   Gygi Steven P SP  

Journal of proteome research 20191002 11


Stochasticity between independent LC-MS/MS runs is a challenging problem in the field of proteomics, resulting in significant missing values (i.e., abundance measurements) among observed peptides. To address this issue, several approaches have been developed including computational methods such as MaxQuant's match-between-runs (MBR) algorithm. Often dozens of runs are all considered at once by MBR, transferring identifications from any one run to any of the others. To evaluate the error associat  ...[more]

Similar Datasets

2005-06-21 | E-GEOD-2830 | biostudies-arrayexpress
2014-07-29 | E-GEOD-55392 | biostudies-arrayexpress
2018-03-13 | PXD007145 | Pride
2020-01-27 | PXD000617 | Pride
2021-08-06 | PXD023889 | Pride
2024-02-13 | PXD039576 | Pride
2021-07-22 | PXD000409 | Pride
2013-10-02 | GSE45947 | GEO
2013-10-02 | GSE45946 | GEO
2013-10-02 | GSE45944 | GEO