Unknown

Dataset Information

0

Interpretation errors related to the GO annotation file format.


ABSTRACT: The Gene Ontology (GO) is the most widely used ontology for creating biomedical annotations. GO annotations are statements associating a biological entity with a GO term. These statements comprise a large dataset of biological knowledge that is used widely in biomedical research. GO Annotations are available as "gene association files" from the GO website in a tab-delimited file format (GO Annotation File Format) composed of rows of 15 tab-delimited fields. This simple format lacks the knowledge representation (KR) capabilities to represent unambiguously semantic relationships between each field. This paper demonstrates that this KR shortcoming leads users to interpret the files in ways that can be erroneous. We propose a complementary format to represent GO annotation files as knowledge bases using the W3C recommended Web Ontology Language (OWL).

SUBMITTER: Moreira DA 

PROVIDER: S-EPMC2655813 | BioStudies | 2007-01-01

REPOSITORIES: biostudies

Similar Datasets

2009-01-01 | S-EPMC2689195 | BioStudies
2010-01-01 | S-EPMC2945790 | BioStudies
2013-01-01 | S-EPMC3598649 | BioStudies
2012-01-01 | S-EPMC3245010 | BioStudies
2008-01-01 | S-EPMC2238903 | BioStudies
2019-01-01 | S-EPMC6407916 | BioStudies
2008-01-01 | S-EPMC2259407 | BioStudies
1000-01-01 | S-EPMC2951084 | BioStudies
2018-01-01 | S-EPMC6030999 | BioStudies
2004-01-01 | S-EPMC308756 | BioStudies