Transcriptomics

Dataset Information

0

DNA microarray analysis of R1-9 ES cells


ABSTRACT: We have studied protein expression in mouse embryonic stem (mES) cells- R1-9 and identified the largest complement of proteins reported thus far for mES cells, using on-line LC-MS/MS analysis. With support from a transcriptome profile of the same mES cell line, we arrived at an integrated dataset of 9,370 protein expressions that may be useful to understand the stem cell dynamics. 6,278 protein identifications (IDs) were based on two or more peptide signatures and 3,092 with single peptide signatures. These single peptide-based protein IDs were also supported by transcriptome data. The proteomic analysis demonstrated the expression of 1,369 genes (among 9,370) that were not seen in RNA transcripts. Pathway analysis of the protein IDs in our dataset, using KEGG and BioCarta and their gene ontology classification, revealed high representation of the proteins belonging to many regulatory pathways and showed enrichment for protein IDs involved in transcription regulation, signal transduction, cell-cycle, and differentiation. Interestingly, the data support the expression of 1,299 putative open reading frames. In silico domain comparisons of some of these sequences matched with regulatory proteins such as transcription factors or signal transduction molecules. Over 60% of the transcript complement is represented in our protein dataset implying a corresponding richness of the dynamic network of proteins in a pluripotent mES cells.have thus arrived at a dataset of 10428 expressed proteins with high confidence which was also verified by microarray analysis of the expressed mRNAs R1-9 ES cells. Pathway analysis of these expressed proteins was carried out using KEGG, IPA and their gene ontology classification revealed the presence of large number of transcription regulators, signal transducers, cell cycle and differentiation molecules along with other general classes of proteins belonging to a number of regulatory pathways. The database also includes IDs of proteins corresponding to many still unidentified / unannotated ORFs from the mouse genome and with putative regulatory functions. Our study thus represents the largest database of expressed proteins in mouse ES cells that would be a valuable molecular resource for experimental designs in targeted proteomics investigations using mES cell lines.

ORGANISM(S): Mus musculus

PROVIDER: GSE15028 | GEO | 2019/04/22

REPOSITORIES: GEO

Similar Datasets

2022-08-20 | E-MTAB-11655 | biostudies-arrayexpress
2022-10-05 | PXD031929 | Pride
2020-11-10 | E-MTAB-9404 | biostudies-arrayexpress
2015-12-15 | E-GEOD-73227 | biostudies-arrayexpress
2010-09-21 | E-MTAB-382 | biostudies-arrayexpress
2008-06-01 | E-MEXP-1257 | biostudies-arrayexpress
2018-02-27 | PXD008432 | Pride
2019-06-11 | PXD012247 | Pride
2015-12-08 | GSE71883 | GEO
2015-12-08 | GSE71882 | GEO