Unknown

Dataset Information

0

Information content-based gene ontology semantic similarity approaches: toward a unified framework theory.


ABSTRACT: Several approaches have been proposed for computing term information content (IC) and semantic similarity scores within the gene ontology (GO) directed acyclic graph (DAG). These approaches contributed to improving protein analyses at the functional level. Considering the recent proliferation of these approaches, a unified theory in a well-defined mathematical framework is necessary in order to provide a theoretical basis for validating these approaches. We review the existing IC-based ontological similarity approaches developed in the context of biomedical and bioinformatics fields to propose a general framework and unified description of all these measures. We have conducted an experimental evaluation to assess the impact of IC approaches, different normalization models, and correction factors on the performance of a functional similarity metric. Results reveal that considering only parents or only children of terms when assessing information content or semantic similarity scores negatively impacts the approach under consideration. This study produces a unified framework for current and future GO semantic similarity measures and provides theoretical basics for comparing different approaches. The experimental evaluation of different approaches based on different term information content models paves the way towards a solution to the issue of scoring a term's specificity in the GO DAG.

SUBMITTER: Mazandu GK 

PROVIDER: S-EPMC3775452 | BioStudies | 2013-01-01

SECONDARY ACCESSION(S): GO:0003678

REPOSITORIES: biostudies

Similar Datasets

2014-01-01 | S-EPMC3904913 | BioStudies
2013-01-01 | S-EPMC3849277 | BioStudies
2014-01-01 | S-EPMC4253309 | BioStudies
2018-01-01 | S-EPMC6180005 | BioStudies
1000-01-01 | S-EPMC3533586 | BioStudies
2014-01-01 | S-EPMC4256219 | BioStudies
2012-01-01 | S-EPMC3422825 | BioStudies
2008-01-01 | S-EPMC2518162 | BioStudies
2016-01-01 | S-EPMC5260111 | BioStudies
2016-01-01 | S-EPMC4966780 | BioStudies