Unknown

Dataset Information

0

TagRecon: high-throughput mutation identification through sequence tagging.


ABSTRACT: Shotgun proteomics produces collections of tandem mass spectra that contain all the data needed to identify mutated peptides from clinical samples. Identifying these sequence variations, however, has not been feasible with conventional database search strategies, which require exact matches between observed and expected sequences. Searching for mutations as mass shifts on specified residues through database search can incur significant performance penalties and generate substantial false positive rates. Here we describe TagRecon, an algorithm that leverages inferred sequence tags to identify unanticipated mutations in clinical proteomic data sets. TagRecon identifies unmodified peptides as sensitively as the related MyriMatch database search engine. In both LTQ and Orbitrap data sets, TagRecon outperformed state of the art software in recognizing sequence mismatches from data sets with known variants. We developed guidelines for filtering putative mutations from clinical samples, and we applied them in an analysis of cancer cell lines and an examination of colon tissue. Mutations were found in up to 6% of identified peptides, and only a small fraction corresponded to dbSNP entries. The RKO cell line, which is DNA mismatch repair deficient, yielded more mutant peptides than the mismatch repair proficient SW480 line. Analysis of colon cancer tumor and adjacent tissue revealed hydroxyproline modifications associated with extracellular matrix degradation. These results demonstrate the value of using sequence tagging algorithms to fully interrogate clinical proteomic data sets.

SUBMITTER: Dasari S 

PROVIDER: S-EPMC2859315 | biostudies-literature | 2010 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

TagRecon: high-throughput mutation identification through sequence tagging.

Dasari Surendra S   Chambers Matthew C MC   Slebos Robbert J RJ   Zimmerman Lisa J LJ   Ham Amy-Joan L AJ   Tabb David L DL  

Journal of proteome research 20100401 4


Shotgun proteomics produces collections of tandem mass spectra that contain all the data needed to identify mutated peptides from clinical samples. Identifying these sequence variations, however, has not been feasible with conventional database search strategies, which require exact matches between observed and expected sequences. Searching for mutations as mass shifts on specified residues through database search can incur significant performance penalties and generate substantial false positiv  ...[more]

Similar Datasets

| S-EPMC3728768 | biostudies-literature
| S-EPMC3295828 | biostudies-literature
| S-EPMC3088570 | biostudies-literature
| S-EPMC5536957 | biostudies-literature
| S-EPMC7657533 | biostudies-literature
2017-02-10 | GSE76434 | GEO
| S-EPMC4922190 | biostudies-literature
| S-EPMC6018237 | biostudies-literature
2016-09-01 | GSE78064 | GEO