Proteomics

Dataset Information

Analytical guidelines for co-fractionation mass spectrometry obtained through global profiling of gold standard Saccharomyces cerevisiae protein complexes


ABSTRACT: Co-fractionation mass spectrometry (CF-MS) is a technique with potential to characterise endogenous and unmanipulated protein complexes on an unprecedented scale. However, this potential has been offset by a lack of guidelines for best-practice CF-MS data collection and analysis. To obtain such guidelines, this study exploits very high proteome coverage libraries of gold standard Saccharomyces cerevisiae complexes to thoroughly evaluate novel and published yeast CF-MS datasets. A new method for identifying gold standard complexes in CF-MS data, Reference Complex Profiling, and the Extending ‘Guilt-by-Association’ by Degree (EGAD) R package are used for these evaluations, which are reinforced with concurrent analyses of published human data. By evaluating data collection designs, which involve fractionation of cell lysates, it is found that near-maximum recall of complexes can be achieved with fewer samples than were used in published studies. Distributing sample collection across orthogonal fractionation methods, rather than concentrating it in a single high-resolution dataset, leads to particularly efficient recall. By evaluating 17 different similarity scoring metrics, which are central to CF-MS data analysis, it is found that two metrics rarely used in past CF-MS studies – Spearman and Kendall correlations – and the recently introduced Co-apex metric frequently maximise recall, while a popular metric – Euclidean distance – delivers poor recall. The common practice of integrating external genomic data into CF-MS data analysis is also evaluated, revealing that this practice may improve the precision and recall of known complexes but is generally unsuitable for predicting novel complexes in model organisms. When non-model organisms are studied using orthologous genomic data, it is found that particular subsets of fractionation profiles (e.g. the lowest abundance quartile) should be excluded to minimise false discovery. Together, these guidelines identify avenues for precise, sensitive and efficient CF-MS studies of known complexes, and effective predictions of novel complexes for orthogonal experimental validation.
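
As a rough illustration of the similarity scoring metrics named in the abstract, the sketch below scores pairs of fractionation profiles (one abundance value per fraction) with Spearman correlation, Kendall correlation, a co-apex score and Euclidean distance. It is not taken from the study's code: the function names and example profiles are invented, Kendall is computed as the simple tau-a variant, and co-apex is simplified here to a binary check of whether two proteins peak in the same fraction.

```python
import math

def ranks(values):
    """Average ranks (1-based); tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation; undefined (division by zero) for a constant profile."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank-transformed profiles."""
    return pearson(ranks(x), ranks(y))

def kendall_tau_a(x, y):
    """Kendall tau-a: (concordant - discordant) / total pairs; tied pairs add zero."""
    n = len(x)
    s = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[i] - x[j]) * (y[i] - y[j])
            s += (prod > 0) - (prod < 0)
    return s / (n * (n - 1) / 2)

def co_apex(x, y):
    """Simplified (assumed) co-apex: 1 if both profiles peak in the same fraction."""
    return 1 if x.index(max(x)) == y.index(max(y)) else 0

def euclidean(x, y):
    """Euclidean distance between two profiles (smaller means more similar)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

# Hypothetical profiles across eight fractions (illustrative values only).
protein_a = [0, 1, 5, 9, 4, 1, 0, 0]
protein_b = [0, 2, 6, 10, 3, 1, 0, 0]   # elutes with protein_a
protein_c = [0, 0, 0, 1, 2, 6, 9, 3]    # elutes later

for name, other in (("a vs b", protein_b), ("a vs c", protein_c)):
    print(name,
          round(spearman(protein_a, other), 3),
          round(kendall_tau_a(protein_a, other), 3),
          co_apex(protein_a, other),
          round(euclidean(protein_a, other), 3))
```

In this toy case the co-eluting pair scores higher on the correlation and co-apex metrics than the non-co-eluting pair. Real CF-MS pipelines typically apply such scores to every protein pair and then benchmark them against gold standard complexes, as the abstract describes.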

INSTRUMENT(S): Q Exactive

ORGANISM(S): Saccharomyces cerevisiae (baker's yeast)

SUBMITTER: Gene Hart-Smith  

LAB HEAD: Gene Hart-Smith

PROVIDER: PXD019513 | Pride | 2020-08-24

REPOSITORIES: Pride

Publications

Analytical Guidelines for Co-fractionation Mass Spectrometry Obtained through Global Profiling of Gold Standard Saccharomyces cerevisiae Protein Complexes.

Pang CNI, Ballouz S, Weissberger D, Thibaut LM, Hamey JJ, Gillis J, Wilkins MR, Hart-Smith G

Molecular & Cellular Proteomics : MCP, 2020-08-18, issue 11


Co-fractionation MS (CF-MS) is a technique with potential to characterize endogenous and unmanipulated protein complexes on an unprecedented scale. However this potential has been offset by a lack of guidelines for best-practice CF-MS data collection and analysis. To obtain such guidelines, this study thoroughly evaluates novel and published Saccharomyces cerevisiae CF-MS data sets using very high proteome coverage libraries of yeast gold standard complexes. A new method for identifying g ...

Similar Datasets

2024-03-21 | PXD044084 | Pride
2024-03-21 | PXD044083 | Pride
2021-11-26 | MODEL2111260002 | BioModels
2019-07-16 | PXD011182 | Pride
2022-08-03 | PXD027704 | Pride
2021-04-20 | PXD022048 | Pride
2022-01-14 | E-PROT-9 | ExpressionAtlas
2024-01-26 | PXD042664 | Pride
2024-03-11 | PXD035055 | Pride
2015-04-30 | E-GEOD-67819 | biostudies-arrayexpress