<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Parrilla-Gutierrez JM</submitter><funding>RCUK | Engineering and Physical Sciences Research Council (EPSRC)</funding><funding>European Research Council</funding><funding>RCUK | Engineering and Physical Sciences Research Council</funding><pagination>200-209</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC10965440</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>4(3)</volume><pubmed_abstract>Here we present a machine learning model trained on electron density for the production of host-guest binders. These are read out as simplified molecular-input line-entry system (SMILES) format with >98% accuracy, enabling a complete characterization of the molecules in two dimensions. Our model generates three-dimensional representations of the electron density and electrostatic potentials of host-guest systems using a variational autoencoder, and then utilizes these representations to optimize the generation of guests via gradient descent. Finally the guests are converted to SMILES using a transformer. The successful practical application of our model to established molecular host systems, cucurbit[n]uril and metal-organic cages, resulted in the discovery of 9 previously validated guests for CB[6] and 7 unreported guests (with association constant K&lt;sub>a&lt;/sub> ranging from 13.5 M&lt;sup>-1&lt;/sup> to 5,470 M&lt;sup>-1&lt;/sup>) and the discovery of 4 unreported guests for [Pd&lt;sub>2&lt;/sub>1&lt;sub>4&lt;/sub>]&lt;sup>4+&lt;/sup> (with K&lt;sub>a&lt;/sub> ranging from 44 M&lt;sup>-1&lt;/sup> to 529 M&lt;sup>-1&lt;/sup>).</pubmed_abstract><journal>Nature computational science</journal><pubmed_title>Electron density-based GPT for optimization and suggestion of host-guest binders.</pubmed_title><pmcid>PMC10965440</pmcid><funding_grant_id>EP/L023652/1, EP/R020914/1, EP/S030603/1, EP/R01308X/1, EP/S017046/1, and EP/S019472/1</funding_grant_id><funding_grant_id>670467</funding_grant_id><pubmed_authors>Ayme JF</pubmed_authors><pubmed_authors>Cronin L</pubmed_authors><pubmed_authors>Wilbraham L</pubmed_authors><pubmed_authors>Parrilla-Gutierrez JM</pubmed_authors><pubmed_authors>Granda JM</pubmed_authors><pubmed_authors>Bajczyk MD</pubmed_authors></additional><is_claimable>false</is_claimable><name>Electron density-based GPT for optimization and suggestion of host-guest binders.</name><description>Here we present a machine learning model trained on electron density for the production of host-guest binders. These are read out as simplified molecular-input line-entry system (SMILES) format with >98% accuracy, enabling a complete characterization of the molecules in two dimensions. Our model generates three-dimensional representations of the electron density and electrostatic potentials of host-guest systems using a variational autoencoder, and then utilizes these representations to optimize the generation of guests via gradient descent. Finally the guests are converted to SMILES using a transformer. The successful practical application of our model to established molecular host systems, cucurbit[n]uril and metal-organic cages, resulted in the discovery of 9 previously validated guests for CB[6] and 7 unreported guests (with association constant K&lt;sub>a&lt;/sub> ranging from 13.5 M&lt;sup>-1&lt;/sup> to 5,470 M&lt;sup>-1&lt;/sup>) and the discovery of 4 unreported guests for [Pd&lt;sub>2&lt;/sub>1&lt;sub>4&lt;/sub>]&lt;sup>4+&lt;/sup> (with K&lt;sub>a&lt;/sub> ranging from 44 M&lt;sup>-1&lt;/sup> to 529 M&lt;sup>-1&lt;/sup>).</description><dates><release>2024-01-01T00:00:00Z</release><publication>2024 Mar</publication><modification>2025-04-27T03:11:25.513Z</modification><creation>2025-04-06T18:45:09.016Z</creation></dates><accession>S-EPMC10965440</accession><cross_references><pubmed>38459272</pubmed><doi>10.1038/s43588-024-00602-x</doi></cross_references></HashMap>