Dataset Information

FlowerPower: clustering proteins into domain architecture classes for phylogenomic inference of protein function.

ABSTRACT: Function prediction by transfer of annotation from the top database hit in a homology search has been shown to be prone to systematic error. Phylogenomic analysis reduces these errors by inferring protein function within the evolutionary context of the entire family. However, accuracy of function prediction for multi-domain proteins depends on all members having the same overall domain structure. By contrast, most common homolog detection methods are optimized for retrieving local homologs, and do not address this requirement.We present FlowerPower, a novel clustering algorithm designed for the identification of global homologs as a precursor to structural phylogenomic analysis. Similar to methods such as PSIBLAST, FlowerPower employs an iterative approach to clustering sequences. However, rather than using a single HMM or profile to expand the cluster, FlowerPower identifies subfamilies using the SCI-PHY algorithm and then selects and aligns new homologs using subfamily hidden Markov models. FlowerPower is shown to outperform BLAST, PSI-BLAST and the UCSC SAM-Target 2K methods at discrimination between proteins in the same domain architecture class and those having different overall domain structures.Structural phylogenomic analysis enables biologists to avoid the systematic errors associated with annotation transfer; clustering sequences based on sharing the same domain architecture is a critical first step in this process. FlowerPower is shown to consistently identify homologous sequences having the same domain architecture as the query.FlowerPower is available as a webserver at http://phylogenomics.berkeley.edu/flowerpower/.

SUBMITTER: Krishnamurthy N

PROVIDER: S-EPMC1796606 | biostudies-literature | 2007 Feb

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

FlowerPower: clustering proteins into domain architecture classes for phylogenomic inference of protein function.

Krishnamurthy Nandini N Brown Duncan D Sjölander Kimmen K

BMC evolutionary biology 20070208

<h4>Background</h4>Function prediction by transfer of annotation from the top database hit in a homology search has been shown to be prone to systematic error. Phylogenomic analysis reduces these errors by inferring protein function within the evolutionary context of the entire family. However, accuracy of function prediction for multi-domain proteins depends on all members having the same overall domain structure. By contrast, most common homolog detection methods are optimized for retrieving l ...[more]

PMID: 17288570

Dataset Information

FlowerPower: clustering proteins into domain architecture classes for phylogenomic inference of protein function.

Publications

FlowerPower: clustering proteins into domain architecture classes for phylogenomic inference of protein function.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Redefining the architecture of ferlin proteins: Insights into multi-domain protein structure and function.
| S-EPMC9333456 | biostudies-literature

Classification of human Herpesviridae proteins using Domain-architecture Aware Inference of Orthologs (DAIO).
| S-EPMC6502252 | biostudies-literature

PANDA: Protein function prediction using domain architecture and affinity propagation.
| S-EPMC5823857 | biostudies-literature

A simple, fast, and accurate method of phylogenomic inference.
| S-EPMC2760878 | biostudies-other

Phylogenomic and Evolutionary Analyses Reveal Diversifications of SET-Domain Proteins in Fungi.
| S-EPMC9692433 | biostudies-literature

Chloroplast Phylogenomic Inference of Green Algae Relationships.
| S-EPMC4742797 | biostudies-literature

Knowledge-guided inference of domain-domain interactions from incomplete protein-protein interaction networks.
| S-EPMC2752622 | biostudies-literature

Comparative structural modeling and inference of conserved protein classes in Drosophila seminal fluid.
| S-EPMC518759 | biostudies-literature

Repeated evolution of identical domain architecture in metazoan netrin domain-containing proteins.
| S-EPMC3516229 | biostudies-literature

The origin of modern metabolic networks inferred from phylogenomic analysis of protein architecture.
| S-EPMC1890499 | biostudies-literature