Unknown

Dataset Information

0

GRaSP-web: a machine learning strategy to predict binding sites based on residue neighborhood graphs.


ABSTRACT: Proteins are essential macromolecules for the maintenance of living systems. Many of them perform their function by interacting with other molecules in regions called binding sites. The identification and characterization of these regions are of fundamental importance to determine protein function, being a fundamental step in processes such as drug design and discovery. However, identifying such binding regions is not trivial due to the drawbacks of experimental methods, which are costly and time-consuming. Here we propose GRaSP-web, a web server that uses GRaSP (Graph-based Residue neighborhood Strategy to Predict binding sites), a residue-centric method based on graphs that uses machine learning to predict putative ligand binding site residues. The method outperformed 6 state-of-the-art residue-centric methods (MCC of 0.61). Also, GRaSP-web is scalable as it takes 10-20 seconds to predict binding sites for a protein complex (the state-of-the-art residue-centric method takes 2-5h on the average). It proved to be consistent in predicting binding sites for bound/unbound structures (MCC 0.61 for both) and for a large dataset of multi-chain proteins (4500 entries, MCC 0.61). GRaSPWeb is freely available at https://grasp.ufv.br.

SUBMITTER: Santana CA 

PROVIDER: S-EPMC9252730 | biostudies-literature | 2022 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

GRaSP-web: a machine learning strategy to predict binding sites based on residue neighborhood graphs.

Santana Charles A CA   Izidoro Sandro C SC   de Melo-Minardi Raquel C RC   Tyzack Jonathan D JD   Ribeiro António J M AJM   Pires Douglas E V DEV   Thornton Janet M JM   de A Silveira Sabrina S  

Nucleic acids research 20220701 W1


Proteins are essential macromolecules for the maintenance of living systems. Many of them perform their function by interacting with other molecules in regions called binding sites. The identification and characterization of these regions are of fundamental importance to determine protein function, being a fundamental step in processes such as drug design and discovery. However, identifying such binding regions is not trivial due to the drawbacks of experimental methods, which are costly and tim  ...[more]

Similar Datasets

| S-EPMC10962094 | biostudies-literature
| S-EPMC2887807 | biostudies-literature
2021-12-08 | GSE171994 | GEO
| S-EPMC6796762 | biostudies-literature
| S-EPMC10842082 | biostudies-literature
| S-EPMC9445105 | biostudies-literature
| S-EPMC6172270 | biostudies-literature
| S-EPMC7474663 | biostudies-literature
| S-EPMC11893853 | biostudies-literature
| S-EPMC8596748 | biostudies-literature