Unknown

Dataset Information

0

Classification of human Herpesviridae proteins using Domain-architecture Aware Inference of Orthologs (DAIO).


ABSTRACT: We developed a computational approach called Domain-architecture Aware Inference of Orthologs (DAIO) for the analysis of protein orthology by combining phylogenetic and protein domain-architecture information. Using DAIO, we performed a systematic study of the proteomes of all human Herpesviridae species to define Strict Ortholog Groups (SOGs). In addition to assessing the taxonomic distribution for each protein based on sequence similarity, we performed a protein domain-architecture analysis for every protein family and computationally inferred gene duplication events. While many herpesvirus proteins have evolved without any detectable gene duplications or domain rearrangements, numerous herpesvirus protein families do exhibit complex evolutionary histories. Some proteins acquired additional domains (e.g., DNA polymerase), whereas others show a combination of domain acquisition and gene duplication (e.g., betaherpesvirus US22 family), with possible functional implications. This novel classification system of SOGs for human Herpesviridae proteins is available through the Virus Pathogen Resource (ViPR, www.viprbrc.org).

SUBMITTER: Zmasek CM 

PROVIDER: S-EPMC6502252 | biostudies-literature | 2019 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Classification of human Herpesviridae proteins using Domain-architecture Aware Inference of Orthologs (DAIO).

Zmasek Christian M CM   Knipe David M DM   Pellett Philip E PE   Scheuermann Richard H RH  

Virology 20190106


We developed a computational approach called Domain-architecture Aware Inference of Orthologs (DAIO) for the analysis of protein orthology by combining phylogenetic and protein domain-architecture information. Using DAIO, we performed a systematic study of the proteomes of all human Herpesviridae species to define Strict Ortholog Groups (SOGs). In addition to assessing the taxonomic distribution for each protein based on sequence similarity, we performed a protein domain-architecture analysis fo  ...[more]

Similar Datasets

| S-EPMC3215765 | biostudies-literature
| S-EPMC116988 | biostudies-literature
| S-EPMC3832408 | biostudies-literature
| S-EPMC4383905 | biostudies-literature
| S-EPMC5066062 | biostudies-literature
| S-EPMC6003080 | biostudies-other
| S-EPMC3516229 | biostudies-literature
| S-EPMC4893229 | biostudies-literature
| S-EPMC2612752 | biostudies-literature