Dataset Information


The conserved organization of the human and mouse transcriptomes

ABSTRACT: We characterized by RNA-seq the transcriptional profiles of a large and heterogeneous collection of mouse tissues, augmenting the mouse transcriptome with thousands of novel transcript candidates. Comparison with transcriptome profiles obtained in human cell lines reveals substantial conservation of transcriptional programs and uncovers a distinct class of genes whose levels of expression across cell types and species have been constrained early in vertebrate evolution. This core set of genes captures a substantial and constant fraction of the transcriptional output of mammalian cells and participates in basic functional and structural housekeeping processes common to all cell types. Perturbation of these constrained genes is associated with significant phenotypes, including embryonic lethality and cancer. Evolutionary constraint in gene expression levels is not reflected in the conservation of the genomic sequences, but it is associated with strong and conserved epigenetic marking, as well as with a characteristic post-transcriptional regulatory program in which sub-cellular localization and alternative splicing play comparatively large roles. Comparison of human and mouse transcriptome profiles has uncovered a distinct class of genes (6600, one third of all expressed genes in both human and mouse) whose variation in expression levels has been constrained irrespective of the cell types and species in which they are expressed. Such constraint appears to have developed early in vertebrate evolution, since it is seen in multiple other species. This constraint is not associated with the conservation of the genomic sequences found in each species. Finally, this core set of genes helps in interpreting how non-human organisms like the mouse can better be used as models for human disease and why perturbation of these constrained genes is associated with significant phenotypes, including embryonic lethality and cancer.

ORGANISM(S): Mus musculus

TISSUE(S): Heart, Brain, Adipose, Skeletal Muscle, Liver, Lung, Lymph Node, Adrenal, Kidney, Ovary, Breast, Colon, White Blood Cell, Testes, Prostate, Thyroid

SUBMITTER: Chris Zaleski, Meagan Fastuca, Mark Gerstein, Thomas R Gingeras, Alessandra Breschi, Huaien Wang, Baikang Pei, Julien Lagarde, Sarah Djebali, Micheal A Beer, Suganthi Balasubramanian, Alex Dobin, Cedric Notredame, Jorg Drenkow, Jean Monlong, Dmitri D Pervouchine, Andrea Tanzer, Roderic Guigo, Pablo P Barja, Lei-Hoon See, Carrie A Davis, Giovanni Bussotti, Arif Harmanci, ENCODE DCC

PROVIDER: E-GEOD-49417 | ArrayExpress | 2014-11-20



Similar Datasets

2010-08-22 | GSE22549 | GEO
2010-08-22 | E-GEOD-22549 | ArrayExpress
2011-07-29 | E-MTAB-424 | ArrayExpress
2011-07-29 | E-MTAB-958 | ArrayExpress
| GSE87528 | GEO
| GSE72208 | GEO
2011-10-05 | GSE32647 | GEO
2013-01-22 | E-GEOD-43512 | ArrayExpress
2011-10-04 | E-GEOD-32647 | ArrayExpress
2010-12-15 | E-GEOD-23963 | ArrayExpress