Dataset Information

Analysis of O-glycoproteomics data with MS-Decipher, MetaMorpheus, MSFragger-Glyco and Mascot

ABSTRACT: We presented a user-friendly proteomic database search platform, MS-Decipher, for the identification of peptides from MS data. Two scoring schemes, rank score and hyperscore, could be used for peptide spectra matching. FDR controlling strategies could be used after searching, and it was found that MS-Decipher performs well compared to traditional database searching software. In addition, a special search mode, O-search, is presented to search O-glycopeptides for the O-glycoproteomics analysis. Data and result files in several formats could be used in searching and validation. Dataset of tryptic peptides from serum were used to evaluate the performance of our software in analysis of O-glycoproteomics . The searches using the same parameters were also performed by other commonly used software, like MetaMorpheus, MSFragger-Glyco and Mascot, to compare the performance.

ORGANISM(S): Homo Sapiens (human)

SUBMITTER: Mingliang Ye

PROVIDER: PXD027004 | JPOST Repository | Thu Jan 13 00:00:00 GMT 2022

REPOSITORIES: jPOST

ACCESS DATA

Dataset's files

Source:

Items per page:

1 - 5 of 20

Similar Datasets

Project description:The standard platform for proteomics experiments today is mass spectrometry, particularly for samples derived from complex matrices. Recent increases in mass spectrometry sequencing speed, sensitivity and resolution now permit comprehensive coverage of even the most precious and limited samples, particularly when coupled with improvements in protein extraction techniques and chromatographic separation. However, the results obtained from laborious sample extraction and expensive instrumentation are often hindered by a sub optimal data processing pipelines. One critical data processing piece is peptide sequencing which is most commonly done through database search engines. In almost all MS/MS search engines users must limit their search space due to time constraints and q-value considerations. In nearly all experiments, the search is limited to a canonical database that typically does not reflect the individual genetic variations of the organism being studied. Searching for posttranslational modifications can exponentially increase the search space thus careful consideration must be used during the selection process. In addition, engines will nearly always assume the presence of only fully tryptic peptides. Despite these stringent parameters, proteomic data searches may take hours or even days to complete and opening even one of these criteria to more realistic biological settings will lead to detrimental increases in search time on expensive and custom data processing towers. Even on high performance servers, these search engines are computationally expensive, and most users decide to dial back their search parameters. We present Bolt, a new search engine that can search more than nine hundred thousand protein sequences (canonical, isoform, mutations, and contaminants) with 31 post translation modifications and N-terminal and C-terminal partial tryptic search in a matter of minutes on a standard configuration laptop. Along with increases in speed, Bolt provides an additional benefit of improvement in high confidence identifications, as demonstrated by manual validation of unique peptides identified by Bolt that were missed with parallel searching using standard engines. When in disagreement, 67% of peptides identified by Bolt may be manually validated by strong fragmentation patterns, compared to 14% of peptides uniquely identified by SEQUEST. Bolt represents, to the best of our knowledge, the first fully scalable, cloud based quantitative proteomic solution that can be operated within a user-friendly GUI interface.

			Action	DRS
	LLY_HF_OGP_A_1.mgf	Mgf
	LLY_HF_OGP_A_10.mgf	Mgf
	LLY_HF_OGP_A_15.mgf	Mgf
	LLY_HF_OGP_A_16.mgf	Mgf
	LLY_HF_OGP_A_17.mgf	Mgf

Dataset Information

Analysis of O-glycoproteomics data with MS-Decipher, MetaMorpheus, MSFragger-Glyco and Mascot

Dataset's files

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets