Unknown

Dataset Information

0

Annotation of biologically relevant ligands in UniProtKB using ChEBI.


ABSTRACT:

Motivation

To provide high quality, computationally tractable annotation of binding sites for biologically relevant (cognate) ligands in UniProtKB using the chemical ontology ChEBI (Chemical Entities of Biological Interest), to better support efforts to study and predict functionally relevant interactions between protein sequences and structures and small molecule ligands.

Results

We structured the data model for cognate ligand binding site annotations in UniProtKB and performed a complete reannotation of all cognate ligand binding sites using stable unique identifiers from ChEBI, which we now use as the reference vocabulary for all such annotations. We developed improved search and query facilities for cognate ligands in the UniProt website, REST API and SPARQL endpoint that leverage the chemical structure data, nomenclature and classification that ChEBI provides.

Availability and implementation

Binding site annotations for cognate ligands described using ChEBI are available for UniProtKB protein sequence records in several formats (text, XML and RDF) and are freely available to query and download through the UniProt website (www.uniprot.org), REST API (www.uniprot.org/help/api), SPARQL endpoint (sparql.uniprot.org/) and FTP site (https://ftp.uniprot.org/pub/databases/uniprot/).

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Coudert E 

PROVIDER: S-EPMC9825770 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Annotation of biologically relevant ligands in UniProtKB using ChEBI.

Coudert Elisabeth E   Gehant Sebastien S   de Castro Edouard E   Pozzato Monica M   Baratin Delphine D   Neto Teresa T   Sigrist Christian J A CJA   Redaschi Nicole N   Bridge Alan A  

Bioinformatics (Oxford, England) 20230101 1


<h4>Motivation</h4>To provide high quality, computationally tractable annotation of binding sites for biologically relevant (cognate) ligands in UniProtKB using the chemical ontology ChEBI (Chemical Entities of Biological Interest), to better support efforts to study and predict functionally relevant interactions between protein sequences and structures and small molecule ligands.<h4>Results</h4>We structured the data model for cognate ligand binding site annotations in UniProtKB and performed a  ...[more]

Similar Datasets

| S-EPMC3531142 | biostudies-literature
| S-EPMC7162351 | biostudies-literature
| S-EPMC3740789 | biostudies-literature
2013-03-04 | GSE42409 | GEO
| S-EPMC5095671 | biostudies-literature
| S-EPMC7160037 | biostudies-literature
| S-EPMC2689360 | biostudies-literature
| S-EPMC3797126 | biostudies-literature
2013-03-04 | E-GEOD-42409 | biostudies-arrayexpress
| S-EPMC2790310 | biostudies-literature