Dataset Information


Integrated Transcriptomic-Proteomic Analysis Using a Proteogenomic Workflow Refines Rat Genome Annotation.

ABSTRACT: Proteogenomic re-annotation and mRNA splicing information can lead to the discovery of various protein forms in eukaryotic model organisms such as the rat. However, detecting novel proteoforms from mass spectrometry (MS) proteomics data remains a formidable challenge. We developed EuGenoSuite, an open-source, multi-algorithm proteomic search tool, and used it in our in-house integrated transcriptomic-proteomic pipeline to facilitate automated proteogenomic analysis. Using four proteogenomic pipelines (integrated transcriptomic-proteomic, Peppy, Enosi, and ProteoAnnotator) on publicly available RNA-sequencing and MS proteomics data, we discovered 363 novel peptides in rat brain microglia, representing novel proteoforms for 249 gene loci in the rat genome. These novel peptides aided in the discovery of novel exons, translation of annotated untranslated regions, pseudogenes, and splice variants for various loci, many of which have known disease associations, including neurological disorders such as schizophrenia and amyotrophic lateral sclerosis. Novel isoforms were also discovered for genes implicated in cardiovascular diseases and breast cancer, for which rats are considered model organisms. Our integrative multi-omics data analysis not only enables the discovery of new proteoforms but also generates an improved reference for human disease studies in the rat model.


PROVIDER: S-EPMC4762527 | BioStudies | 2016-01-01T00:00:00Z

REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC5379071 | BioStudies
1000-01-01 | S-EPMC3820949 | BioStudies
2014-01-01 | S-EPMC4249766 | BioStudies
2014-01-01 | S-EPMC4290786 | BioStudies
2013-01-01 | S-EPMC4117251 | BioStudies
1000-01-01 | S-EPMC4261978 | BioStudies
2019-01-01 | S-EPMC6768830 | BioStudies
2014-01-01 | S-EPMC4392723 | BioStudies
2019-01-01 | S-EPMC7331093 | BioStudies
2011-01-01 | S-EPMC3219674 | BioStudies