Proteomics

Dataset Information

0

Exome-based proteogenomics of HEK-293 human cell line: coding genomic variants identified at the level of shotgun proteome


ABSTRACT: Genomic and proteomic data were integrated into the proteogenomic workflow to identify coding genomic variants of Human Embryonic Kidney 293 (HEK-293) standard cell line at the proteome level. Shotgun proteome data published by Geiger et al (2012) and obtained in this work for HEK-293 were searched against the customized genomic databased generated using exome data published by Lin et al (2014). 54 unique variants out of ~1,200 coding variants annotated in the exome were found at the proteome level. 27 of them were validated by two search engines, X!Tandem and Andromeda. 16 (60%) of those validated variants were confidently identified in both own and published proteome datasets. Some of the variants found belonged to widely known genomic polymorphisms originated from the germline, while others are more likely to result from somatic mutations. Notably, the peptide subsets identified by only one, or the other search engine were enriched by the sequences with miscleavages. This can be due to the large presence of false-positive hits in these subsets that is especially true for the subset of variant peptides. High-resolution mass-spectra of HEK-293 cell line were deposited to ProteomeXchange repository, project accession PXD002613.

INSTRUMENT(S): Q Exactive

ORGANISM(S): Homo Sapiens (ncbitaxon:9606)

SUBMITTER: Sergei Moshkovskii  

PROVIDER: MSV000080712 | MassIVE | Tue Mar 28 23:16:00 BST 2017

SECONDARY ACCESSION(S): PXD002613

REPOSITORIES: MassIVE

altmetric image

Publications


Genomic and proteomic data were integrated into the proteogenomic workflow to identify coding genomic variants of Human Embryonic Kidney 293 (HEK-293) cell line at the proteome level. Shotgun proteome data published by Geiger et al. (2012), Chick et al. (2015), and obtained in this work for HEK-293 were searched against the customized genomic database generated using exome data published by Lin et al. (2014). Overall, 112 unique variants were identified at the proteome level out of ∼1200 coding  ...[more]

Similar Datasets

2016-07-22 | PXD002613 | Pride
2012-03-31 | GSE28400 | GEO
2012-03-30 | E-GEOD-28400 | biostudies-arrayexpress
2017-12-17 | GSE92599 | GEO
2024-05-13 | GSE226054 | GEO
2018-09-13 | GSE114437 | GEO
2024-02-15 | PXD046950 | Pride
2010-01-01 | E-MTAB-122 | biostudies-arrayexpress
2016-01-12 | GSE72631 | GEO
2023-05-09 | GSE206061 | GEO