Proteomics

Dataset Information


Democratizing Data-Independent Acquisition Proteomics Analysis on Public Cloud Infrastructures via the Galaxy Framework


ABSTRACT: Data-independent acquisition (DIA) has become an important approach in global, mass spectrometric proteomic studies because it provides in-depth insights into the molecular variety of biological systems. However, DIA data analysis remains challenging due to the high complexity and the large data and sample sizes, which require specialized software and large computing infrastructures. Most available open-source DIA software necessitates basic programming skills and covers only a fraction of the analysis steps, often yielding a complex of multiple software tools, which severely limits usability and reproducibility. To overcome this hurdle, we have integrated a suite of DIA tools in the Galaxy framework for reproducible and version-controlled data processing. The DIA suite includes OpenSwath, PyProphet, diapysef and swath2stats. We have compiled functional Galaxy pipelines for DIA processing, which provide a web-based graphical user interface to these pre-installed and pre-configured tools for use on the freely accessible, powerful computational resources of the Galaxy framework. This approach also enables seamless sharing of workflows with full configuration, in addition to sharing raw data and results. We demonstrate the usability of the all-in-one DIA pipeline in Galaxy by analyzing a spike-in case study dataset. Additionally, extensive training material is provided to further increase access for the proteomics community. Here we provide the five representative E. coli:HEK ratio data-dependent acquisition measurements that were used to generate a spectral library (for more details see https://zenodo.org/record/4293493#.YPcm6-gzat8). Furthermore, we provide the twenty DIA measurements of the different E. coli:HEK ratio samples (for more details see https://zenodo.org/record/4307762#.YPcnKOgzat8).

INSTRUMENT(S): Q Exactive Plus

ORGANISM(S): Escherichia coli K-12 (ncbitaxon:83333), Homo sapiens (ncbitaxon:9606)

SUBMITTER: Oliver Schilling  

PROVIDER: MSV000087859 | MassIVE | Tue Jul 20 18:24:00 BST 2021

REPOSITORIES: MassIVE


Publications

Democratizing data-independent acquisition proteomics analysis on public cloud infrastructures via the Galaxy framework.

Fahrner M, Föll MC, Grüning BA, Bernt M, Röst H, Schilling O

GigaScience, 2022-02-01


Background: Data-independent acquisition (DIA) has become an important approach in global, mass spectrometric proteomic studies because it provides in-depth insights into the molecular variety of biological systems. However, DIA data analysis remains challenging owing to the high complexity and large data and sample size, which require specialized software and vast computing infrastructures. Most available open-source DIA software necessitates basic programming skills and covers only a fr...

Similar Datasets

2017-04-06 | PXD001240 | Pride
2023-07-03 | PXD040205 | Pride
2019-11-06 | PXD014690 | Pride
2024-05-06 | PXD044981 | Pride
2016-09-27 | PXD002952 | Pride
2021-06-09 | MSV000087597 | MassIVE
2023-07-23 | PXD044012 | iProX
2021-04-27 | ST001776 | MetabolomicsWorkbench
2017-04-27 | MSV000081024 | MassIVE
2002-08-21 | GSE80 | GEO