Dataset Information


Differential use of signal peptides and membrane domains is a common occurrence in the protein output of transcriptional units.

ABSTRACT: Membrane organization describes the orientation of a protein with respect to the membrane and can be determined by the presence, or absence, and organization within the protein sequence of two features: endoplasmic reticulum signal peptides and alpha-helical transmembrane domains. These features allow protein sequences to be classified into one of five membrane organization categories: soluble intracellular proteins, soluble secreted proteins, type I membrane proteins, type II membrane proteins, and multi-spanning membrane proteins. Generation of protein isoforms with variable membrane organizations can change a protein's subcellular localization or association with the membrane. Application of MemO, a membrane organization annotation pipeline, to the FANTOM3 Isoform Protein Sequence mouse protein set revealed that within the 8,032 transcriptional units (TUs) with multiple protein isoforms, 573 had variation in their use of signal peptides, 1,527 had variation in their use of transmembrane domains, and 615 generated protein isoforms from distinct membrane organization classes. The mechanisms underlying these transcript variations were analyzed. While TUs were identified encoding all pairwise combinations of membrane organization categories, the most common was conversion of membrane proteins to soluble proteins. Observed within our high-confidence set were 156 TUs predicted to generate both extracellular soluble and membrane proteins, and 217 TUs generating both intracellular soluble and membrane proteins. The differential use of endoplasmic reticulum signal peptides and transmembrane domains is a common occurrence within the variable protein output of TUs. The generation of protein isoforms that are targeted to multiple subcellular locations represents a major functional consequence of transcript variation within the mouse transcriptome.

PROVIDER: S-EPMC1449889 | BioStudies |

REPOSITORIES: biostudies

Similar Datasets

| S-EPMC18552 | BioStudies
2004-01-01 | S-EPMC522031 | BioStudies
| S-EPMC5915466 | BioStudies
| S-EPMC3077683 | BioStudies
| S-EPMC2975600 | BioStudies
| S-EPMC5559180 | BioStudies
| S-EPMC2904353 | BioStudies
| S-EPMC2764890 | BioStudies
| S-EPMC2717299 | BioStudies
| S-EPMC6277224 | BioStudies