Unknown

Dataset Information

0

Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation.


ABSTRACT: A lack of the complete pig proteome has left a gap in our knowledge of the pig genome and has restricted the feasibility of using pigs as a biomedical model. In this study, we developed a tissue-based proteome map using 34 major normal pig tissues. A total of 5841 unknown protein isoforms were identified and systematically characterized, including 2225 novel protein isoforms, 669 protein isoforms from 460 genes symbolized beginning with LOC, and 2947 protein isoforms without clear NCBI annotation in the current pig reference genome. These newly identified protein isoforms were functionally annotated through profiling the pig transcriptome with high-throughput RNA sequencing of the same pig tissues, further improving the genome annotation of the corresponding protein-coding genes. Combining the well-annotated genes that have parallel expression pattern and subcellular witness, we predicted the tissue-related subcellularlocations and potential functions for these unknown proteins. Finally, we mined 3081 orthologous genes for 52.7% of unknown protein isoforms across multiple species, referring to 68 KEGG pathways as well as 23 disease signaling pathways. These findings provide valuable insights and a rich resource for enhancing studies of pig genomics and biology, as well as biomedical model application to human medicine.

SUBMITTER: Zhao P 

PROVIDER: S-EPMC9170766 | biostudies-literature | 2021 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mining Unknown Porcine Protein Isoforms by Tissue-based Map of Proteome Enhances Pig Genome Annotation.

Zhao Pengju P   Zheng Xianrui X   Yu Ying Y   Hou Zhuocheng Z   Diao Chenguang C   Wang Haifei H   Kang Huimin H   Ning Chao C   Li Junhui J   Feng Wen W   Wang Wen W   Liu George E GE   Li Bugao B   Smith Jacqueline J   Chamba Yangzom Y   Liu Jian-Feng JF  

Genomics, proteomics & bioinformatics 20210223 5


A lack of the complete pig proteome has left a gap in our knowledge of the pig genome and has restricted the feasibility of using pigs as a biomedical model. In this study, we developed a tissue-based proteome map using 34 major normal pig tissues. A total of 5841 unknown protein isoforms were identified and systematically characterized, including 2225 novel protein isoforms, 669 protein isoforms from 460 genes symbolized beginning with LOC, and 2947 protein isoforms without clear NCBI annotatio  ...[more]

Similar Datasets

| S-EPMC6661141 | biostudies-literature
| S-EPMC4510380 | biostudies-literature
| S-EPMC6030718 | biostudies-literature
| S-EPMC10159913 | biostudies-literature
| S-EPMC4510071 | biostudies-literature
| S-EPMC8494738 | biostudies-literature
| S-EPMC4431823 | biostudies-literature
2019-02-14 | PXD020169 |
| S-EPMC2374703 | biostudies-literature
| S-EPMC6427241 | biostudies-literature