Proteomics

Dataset Information


Generation of ENSEMBL-based proteogenomics databases boost the identification of novel peptides - Mouse dataset


ABSTRACT: This study presents pypgatk, a novel bioinformatics tool, and the pgdb workflow, which together create proteogenomics databases based on ENSEMBL resources. The tools generate protein sequences from novel protein-coding transcripts by performing a three-frame translation of pseudogenes, lncRNAs, and other non-canonical transcripts, such as those produced by alternative splicing events. They also include exonic out-of-frame translation from otherwise canonical protein-coding mRNAs. Moreover, the tools enable the generation of variant protein sequences from multiple sources of genomic variants, including COSMIC, cBioPortal, gnomAD, and mutations detected by sequencing patient samples. pypgatk and pgdb provide multiple functionalities for database handling, notably optimized target/decoy generation by the DecoyPyrat algorithm.
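
The three-frame translation mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the pypgatk implementation; the names `CODON_TABLE`, `translate_frame`, and `three_frame_translation` are hypothetical, and the sketch assumes the standard genetic code and a forward-strand nucleotide sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_frame(sequence, frame):
    """Translate one forward reading frame (offset 0, 1, or 2).

    Codons containing characters outside T/C/A/G are emitted as 'X'.
    A trailing partial codon is ignored.
    """
    seq = sequence.upper().replace("U", "T")
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frame_translation(sequence):
    """Return the translations of the three forward reading frames."""
    return [translate_frame(sequence, frame) for frame in range(3)]


if __name__ == "__main__":
    # Hypothetical toy transcript, for illustration only.
    for frame, protein in enumerate(three_frame_translation("ATGGCCTAA")):
        print(frame, protein)
```

In a database-building setting, each translated frame would typically be split at stop codons and the resulting fragments filtered by length before being written out; that step is omitted here to keep the sketch short.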

INSTRUMENT(S): LTQ, Q Exactive HF

ORGANISM(S): Mus musculus (mouse)

DISEASE(S): Melanoma

SUBMITTER: Yasset Perez-Riverol

LAB HEAD: Yasset Perez-Riverol

PROVIDER: PXD029362 | Pride | 2021-10-26

REPOSITORIES: Pride

Similar Datasets

2021-10-26 | PXD029360 | Pride
2022-10-31 | E-MTAB-12209 | biostudies-arrayexpress
2020-06-20 | E-MTAB-9206 | biostudies-arrayexpress
2022-10-31 | E-MTAB-12207 | biostudies-arrayexpress
2020-03-12 | E-MTAB-8245 | biostudies-arrayexpress
2023-12-07 | E-MTAB-13416 | biostudies-arrayexpress
2021-04-14 | ST001752 | MetabolomicsWorkbench
2021-12-30 | E-MTAB-10095 | biostudies-arrayexpress
2022-08-02 | PXD030285 | panorama
2021-11-01 | E-MTAB-8626 | biostudies-arrayexpress