Dataset Information

Electron density-based GPT for optimization and suggestion of host-guest binders.


ABSTRACT: Here we present a machine learning model trained on electron density for the production of host-guest binders. These are read out as simplified molecular-input line-entry system (SMILES) format with >98% accuracy, enabling a complete characterization of the molecules in two dimensions. Our model generates three-dimensional representations of the electron density and electrostatic potentials of host-guest systems using a variational autoencoder, and then utilizes these representations to optimize the generation of guests via gradient descent. Finally, the guests are converted to SMILES using a transformer. The successful practical application of our model to established molecular host systems, cucurbit[n]uril and metal-organic cages, resulted in the discovery of 9 previously validated guests for CB[6] and 7 unreported guests (with association constant Ka ranging from 13.5 M⁻¹ to 5,470 M⁻¹) and the discovery of 4 unreported guests for [Pd214]4+ (with Ka ranging from 44 M⁻¹ to 529 M⁻¹).
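To put the association constants quoted in the abstract on an energy scale, the standard relation ΔG° = -RT ln(Ka) can be applied. The sketch below is illustrative only. The temperature of 298.15 K is an assumption (the record does not state the measurement temperature), the function name is hypothetical, and Ka is treated as dimensionless relative to a 1 M standard state.

```python
import math

R_J_PER_MOL_K = 8.314462618      # molar gas constant, J/(mol*K)
ASSUMED_TEMPERATURE_K = 298.15   # assumption: not stated in the record


def binding_free_energy_kj_per_mol(ka, temperature_k=ASSUMED_TEMPERATURE_K):
    """Return the standard binding free energy in kJ/mol for an association constant.

    Uses dG = -R * T * ln(Ka), with Ka taken relative to a 1 M standard state.
    """
    if ka <= 0:
        raise ValueError("association constant must be positive")
    return -R_J_PER_MOL_K * temperature_k * math.log(ka) / 1000.0


if __name__ == "__main__":
    # Ka values (in M^-1) quoted in the abstract for the two host systems.
    for label, ka in [
        ("CB[6], weakest", 13.5),
        ("CB[6], strongest", 5470.0),
        ("[Pd214]4+, weakest", 44.0),
        ("[Pd214]4+, strongest", 529.0),
    ]:
        print(f"{label}: Ka = {ka} M^-1, dG = {binding_free_energy_kj_per_mol(ka):.1f} kJ/mol")
```

Under these assumptions a larger Ka gives a more negative ΔG°, so the strongest CB[6] guest in the reported range binds more favourably than the weakest one by roughly 15 kJ/mol.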

SUBMITTER: Parrilla-Gutierrez JM 

PROVIDER: S-EPMC10965440 | biostudies-literature | 2024 Mar

REPOSITORIES: biostudies-literature

Publications

Electron density-based GPT for optimization and suggestion of host-guest binders.

Parrilla-Gutiérrez JM, Granda JM, Ayme JF, Bajczyk MD, Wilbraham L, Cronin L

Nature Computational Science, 2024-03-08, issue 3


Similar Datasets

S-EPMC9093170 | biostudies-literature
S-EPMC6039243 | biostudies-literature
S-EPMC9552185 | biostudies-literature
S-EPMC5811089 | biostudies-literature
S-EPMC12664294 | biostudies-literature
S-EPMC3072439 | biostudies-literature
S-EPMC8396050 | biostudies-literature
S-EPMC6792245 | biostudies-literature
S-EPMC9053703 | biostudies-literature