Unknown

Dataset Information

0

MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search.


ABSTRACT:

Motivation

Proteogenomics has proven its utility by integrating genomics and proteomics. Typical approaches use data from next-generation sequencing to infer proteins expressed. A sample-specific protein sequence database is often adopted to identify novel peptides from matched mass spectrometry-based proteomics; nevertheless, there is no software that can practically identify all possible forms of mutated peptides suggested by various genomic information sources.

Results

We propose MutCombinator, which enables us to practically identify mutated peptides from tandem mass spectra allowing combinatorial mutations during the database search. It uses an upgraded version of a variant graph, keeping track of frame information. The variant graph is indexed by nine nucleotides for fast access. Using MutCombinator, we could identify more mutated peptides than previous methods, because combinations of point mutations are considered and also because it can be practically applied together with a large mutation database such as COSMIC. Furthermore, MutCombinator supports in-frame search for coding regions and three-frame search for non-coding regions.

Availability and implementation

https://prix.hanyang.ac.kr/download/mutcombinator.jsp.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Choi S 

PROVIDER: S-EPMC7355298 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

MutCombinator: identification of mutated peptides allowing combinatorial mutations using nucleotide-based graph search.

Choi Seunghyuk S   Paek Eunok E  

Bioinformatics (Oxford, England) 20200701 Suppl_1


<h4>Motivation</h4>Proteogenomics has proven its utility by integrating genomics and proteomics. Typical approaches use data from next-generation sequencing to infer proteins expressed. A sample-specific protein sequence database is often adopted to identify novel peptides from matched mass spectrometry-based proteomics; nevertheless, there is no software that can practically identify all possible forms of mutated peptides suggested by various genomic information sources.<h4>Results</h4>We propo  ...[more]

Similar Datasets

| S-EPMC8187432 | biostudies-literature
| S-EPMC7426425 | biostudies-literature
| S-EPMC117354 | biostudies-literature
| S-EPMC5945037 | biostudies-literature
| S-EPMC4874008 | biostudies-literature
| S-EPMC1761437 | biostudies-literature
| S-EPMC8872422 | biostudies-literature
| S-EPMC2807629 | biostudies-literature
| S-EPMC2483259 | biostudies-literature
| S-EPMC4141472 | biostudies-literature