Unknown

Dataset Information

0

PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications.


ABSTRACT: Computational methods and recently modern machine learning methods have played a key role in structure-based drug design. Though several benchmarking datasets are available for machine learning applications in virtual screening, accurate prediction of binding affinity for a protein-ligand complex remains a major challenge. New datasets that allow for the development of models for predicting binding affinities better than the state-of-the-art scoring functions are important. For the first time, we have developed a dataset, PLAS-5k comprised of 5000 protein-ligand complexes chosen from PDB database. The dataset consists of binding affinities along with energy components like electrostatic, van der Waals, polar and non-polar solvation energy calculated from molecular dynamics simulations using MMPBSA (Molecular Mechanics Poisson-Boltzmann Surface Area) method. The calculated binding affinities outperformed docking scores and showed a good correlation with the available experimental values. The availability of energy components may enable optimization of desired components during machine learning-based drug design. Further, OnionNet model has been retrained on PLAS-5k dataset and is provided as a baseline for the prediction of binding affinities.

SUBMITTER: Korlepara DB 

PROVIDER: S-EPMC9451116 | biostudies-literature | 2022 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications


Computational methods and recently modern machine learning methods have played a key role in structure-based drug design. Though several benchmarking datasets are available for machine learning applications in virtual screening, accurate prediction of binding affinity for a protein-ligand complex remains a major challenge. New datasets that allow for the development of models for predicting binding affinities better than the state-of-the-art scoring functions are important. For the first time, w  ...[more]

Similar Datasets

| S-EPMC10858175 | biostudies-literature
| S-EPMC7459320 | biostudies-literature
| S-EPMC10333426 | biostudies-literature
| S-EPMC9925849 | biostudies-literature
| S-EPMC8874395 | biostudies-literature
| S-EPMC9679474 | biostudies-literature
| S-EPMC3524828 | biostudies-literature
| S-EPMC10964057 | biostudies-literature
| S-EPMC8933537 | biostudies-literature
| S-EPMC8668825 | biostudies-literature