Dataset Information


EMpire: Deep isoform level integration of the transcriptome, translatome, and proteome

ABSTRACT: In this study, we make used of mRNA-seq and its ability to reliably quantify isoforms, integrating this data with ribosome profiling and LC-MS/MS, to assign ribosome footprints and peptides at the isoform level. We leverage the principle that most cell types, and even tissues, predominantly express a single principal isoform to set isoform-level mRNA-seq quantifications as priors to guide and improve allocation of footprints or peptides to isoforms. Through tightly integrated mRNAseq, ribosome footprinting and/or LC-MS/MS proteomics we demonstrate that a principal isoform can be identified in over 80% of gene products in homogenous HEK293 cell culture and over 70% of proteins detected in complex human brain tissue. Defining isoforms in experiments with matched RNA-seq and translatomic/proteomic data increases the functional relevance of such datasets and will further broaden our understanding of multi-level control of gene expression. In this PRIDE submission you will find the raw files for the HEK293 cell proteomics. Files for the human brain proteomics can be found at PXD005445. We have also uploaded a zip file that contains the input files for our HEK293 cell analysis, and the isoform level output files – there is a separate folder within the zip files for these. The data used to create the manuscript figures is in the Rdata file. Code for assigning peptides and footprints to isoforms can be found on Github here:

INSTRUMENT(S): LTQ Orbitrap Elite

ORGANISM(S): Homo sapiens  

TISSUE(S): Cell Culture

DISEASE(S): Not Available


LAB HEAD: TuKiet Tan Lam

PROVIDER: PXD008693 | Pride | 2018-08-21


altmetric image


Isoform-Level Interpretation of High-Throughput Proteomics Data Enabled by Deep Integration with RNA-seq.

Carlyle Becky C BC   Kitchen Robert R RR   Zhang Jing J   Wilson Rashaun S RS   Lam Tukiet T TT   Rozowsky Joel S JS   Williams Kenneth R KR   Sestan Nenad N   Gerstein Mark B MB   Nairn Angus C AC  

Journal of proteome research 20180906 10

Cellular control of gene expression is a complex process that is subject to multiple levels of regulation, but ultimately it is the protein produced that determines the biosynthetic state of the cell. One way that a cell can regulate the protein output from each gene is by expressing alternate isoforms with distinct amino acid sequences. These isoforms may exhibit differences in localization and binding interactions that can have profound functional implications. High-throughput liquid chromatog  ...[more]

Similar Datasets

2014-04-03 | E-GEOD-56436 | ArrayExpress
2015-03-17 | PXD001131 | Pride
2015-02-25 | E-MTAB-2466 | ArrayExpress
2013-11-01 | E-GEOD-46562 | ArrayExpress
2013-09-01 | E-GEOD-45326 | ArrayExpress
2019-03-28 | PXD012413 | Pride
2018-07-16 | PXD009056 | Pride
2017-09-27 | E-MTAB-6019 | ArrayExpress
2020-03-07 | E-MTAB-8825 | ArrayExpress
2020-01-21 | E-MTAB-7371 | ArrayExpress