Unknown

Dataset Information

0

Detecting Gene Ontology misannotations using taxon-specific rate ratio comparisons.


ABSTRACT:

Motivation

Many protein function databases are built on automated or semi-automated curations and can contain various annotation errors. The correction of such misannotations is critical to improving the accuracy and reliability of the databases.

Results

We proposed a new approach to detect potentially incorrect Gene Ontology (GO) annotations by comparing the ratio of annotation rates (RAR) for the same GO term across different taxonomic groups, where those with a relatively low RAR usually correspond to incorrect annotations. As an illustration, we applied the approach to 20 commonly studied species in two recent UniProt-GOA releases and identified 250 potential misannotations in the 2018-11-6 release, where only 25% of them were corrected in the 2019-6-3 release. Importantly, 56% of the misannotations are 'Inferred from Biological aspect of Ancestor (IBA)' which is in contradiction with previous observations that attributed misannotations mainly to 'Inferred from Sequence or structural Similarity (ISS)', probably reflecting an error source shift due to the new developments of function annotation databases. The results demonstrated a simple but efficient misannotation detection approach that is useful for large-scale comparative protein function studies.

Availability and implementation

https://zhanglab.ccmb.med.umich.edu/RAR.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Wei X 

PROVIDER: S-EPMC7751014 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detecting Gene Ontology misannotations using taxon-specific rate ratio comparisons.

Wei Xiaoqiong X   Zhang Chengxin C   Freddolino Peter L PL   Zhang Yang Y  

Bioinformatics (Oxford, England) 20200801 16


<h4>Motivation</h4>Many protein function databases are built on automated or semi-automated curations and can contain various annotation errors. The correction of such misannotations is critical to improving the accuracy and reliability of the databases.<h4>Results</h4>We proposed a new approach to detect potentially incorrect Gene Ontology (GO) annotations by comparing the ratio of annotation rates (RAR) for the same GO term across different taxonomic groups, where those with a relatively low R  ...[more]

Similar Datasets

| S-EPMC7171811 | biostudies-literature
| S-EPMC4823264 | biostudies-literature
2010-08-05 | GSE21325 | GEO
| S-EPMC3098080 | biostudies-literature
2010-08-05 | E-GEOD-21325 | biostudies-arrayexpress
| S-EPMC4062467 | biostudies-literature
| S-EPMC3572115 | biostudies-literature
| S-EPMC3098089 | biostudies-literature
| S-EPMC7355306 | biostudies-literature
| S-EPMC9106748 | biostudies-literature