Dataset Information

Structure-based neural network protein-carbohydrate interaction predictions at the residue level.


ABSTRACT: Carbohydrates dynamically and transiently interact with proteins for cell-cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate-binding sites on any given protein. Here, we present two deep learning (DL) models named CArbohydrate-Protein interaction Site IdentiFier (CAPSIF) that predict non-covalent carbohydrate-binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate-binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2-predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein-carbohydrate structures.
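
The Dice score and MCC quoted in the abstract are standard metrics for binary classification. The sketch below is illustrative only and is not the authors' evaluation code. It assumes each residue carries a binary label (1 = carbohydrate-binding, 0 = non-binding); the function names and the toy labels are hypothetical.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary per-residue labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def dice_score(y_true, y_pred):
    """Dice coefficient: 2*TP / (2*TP + FP + FN)."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred)
    denom = 2 * tp + fp + fn
    # Assumption: two empty label sets are treated as a perfect match.
    return 2 * tp / denom if denom else 1.0


def matthews_corrcoef(y_true, y_pred):
    """MCC: (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Assumption: return 0.0 when the denominator vanishes (a common convention).
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy example: 8 residues, 3 of which truly bind carbohydrate.
y_true = [1, 1, 0, 0, 1, 0, 0, 0]
y_pred = [1, 0, 0, 0, 1, 1, 0, 0]
dice = dice_score(y_true, y_pred)
mcc = matthews_corrcoef(y_true, y_pred)
print(f"Dice = {dice:.3f}, MCC = {mcc:.3f}")
```

Dice ignores true negatives, whereas MCC uses all four confusion-matrix entries, so the two metrics can diverge when binding residues are a small fraction of the protein.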

SUBMITTER: Canner SW 

PROVIDER: S-EPMC10318439 | biostudies-literature | 2023

REPOSITORIES: biostudies-literature

Publications

Structure-based neural network protein-carbohydrate interaction predictions at the residue level.

Samuel W. Canner, Sudhanshu Shanker, Jeffrey J. Gray

Frontiers in Bioinformatics, 2023-06-20


Similar Datasets

| S-EPMC10054975 | biostudies-literature
| S-EPMC9118482 | biostudies-literature
| S-EPMC8698800 | biostudies-literature
| S-EPMC8058773 | biostudies-literature
| S-EPMC9464414 | biostudies-literature
| S-EPMC5998897 | biostudies-literature
| S-EPMC10849033 | biostudies-literature
| S-EPMC6372335 | biostudies-literature
| S-EPMC3785744 | biostudies-literature
| S-EPMC1993824 | biostudies-literature