Unknown

Dataset Information

0

Inverse Potts model improves accuracy of phylogenetic profiling.


ABSTRACT: Phylogenetic profiling is a powerful computational method for revealing the functions of function-unknown genes. Although conventional similarity metrics in phylogenetic profiling achieved high prediction accuracy, they have two estimation biases: an evolutionary bias and a spurious correlation bias. While previous studies reduced the evolutionary bias by considering a phylogenetic tree, few studies have analyzed the spurious correlation bias. To reduce the spurious correlation bias, we developed metrics based on the inverse Potts model (IPM) for phylogenetic profiling. We also developed a metric based on both the IPM and a phylogenetic tree. In an empirical dataset analysis, we demonstrated that these IPM-based metrics improved the prediction performance of phylogenetic profiling. In addition, we found that the integration of several metrics, including the IPM-based metrics, had superior performance to a single metric. The source code is freely available at https://github.com/fukunagatsu/Ipm. Supplementary data are available at Bioinformatics online.

SUBMITTER: Fukunaga T 

PROVIDER: S-EPMC8963296 | biostudies-literature | 2022 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Inverse Potts model improves accuracy of phylogenetic profiling.

Fukunaga Tsukasa T   Iwasaki Wataru W  

Bioinformatics (Oxford, England) 20220301 7


<h4>Motivation</h4>Phylogenetic profiling is a powerful computational method for revealing the functions of function-unknown genes. Although conventional similarity metrics in phylogenetic profiling achieved high prediction accuracy, they have two estimation biases: an evolutionary bias and a spurious correlation bias. While previous studies reduced the evolutionary bias by considering a phylogenetic tree, few studies have analyzed the spurious correlation bias.<h4>Results</h4>To reduce the spur  ...[more]

Similar Datasets

| S-EPMC3299401 | biostudies-literature
| S-EPMC11404401 | biostudies-literature
| S-EPMC2821295 | biostudies-literature
| S-EPMC9452544 | biostudies-literature
| S-EPMC4104320 | biostudies-literature
| S-EPMC4598229 | biostudies-literature
| S-EPMC10909357 | biostudies-literature
2023-01-30 | GSE215240 | GEO
| S-EPMC6797059 | biostudies-literature
| S-EPMC7188920 | biostudies-literature