
Dataset Information


A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry.


ABSTRACT: Proteogenomics is an increasingly common method for species identification because it allows rapid and inexpensive interrogation of an unknown organism's proteome, even when the proteome is partially degraded. The proteomic method typically uses tandem mass spectrometry to survey all peptides detectable in a sample that frequently contains hundreds or thousands of proteins. Species identification is based on detection of a small number of species-specific peptides. Genetic analysis of proteins by mass spectrometry, however, is a developing field, and the bone proteome, typically consisting of only two proteins, pushes the limits of this technology. Nearly 20% of highly confident spectra from modern human bone samples identify non-human species when searched against a vertebrate database, as would be necessary with a fragment of unknown bone. These non-human peptides are often the result of current limitations in mass spectrometry or algorithm interpretation errors. Consequently, it is difficult to know if a "species-specific" peptide used to identify a sample is actually present in that sample. Here we evaluate the causes of peptide sequence errors and propose an unbiased, probabilistic approach to determine the likelihood that a species is correctly identified from bone without relying on species-specific peptides.

SUBMITTER: Yang H 

PROVIDER: S-EPMC8149876 | biostudies-literature | 2021 May

REPOSITORIES: biostudies-literature


Publications

A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry.

Heyi Yang, Erin R. Butler, Samantha A. Monier, Jennifer Teubl, David Fenyö, Beatrix Ueberheide, Donald Siegel

Scientific Reports, 25 May 2021, issue 1



Similar Datasets

| S-EPMC5570998 | biostudies-literature
| S-EPMC6971861 | biostudies-literature
| S-EPMC2657802 | biostudies-other
| S-EPMC3556083 | biostudies-literature
| S-EPMC4253302 | biostudies-literature
| PRJEB45336 | ENA
| S-EPMC5783497 | biostudies-literature
| S-EPMC5141281 | biostudies-literature
| S-EPMC9223489 | biostudies-literature