Dataset Information

Variational autoencoder-based chemical latent space for large molecular structures with 3D complexity.


ABSTRACT: The structural diversity of chemical libraries, which are systematic collections of compounds with the potential to bind to biomolecules, can be represented by a chemical latent space. A chemical latent space is a projection of compound structures into a mathematical space based on several molecular features; it expresses the structural diversity within a compound library, which makes it possible to explore a broader chemical space and generate novel compound structures as drug candidates. In this study, we developed a deep-learning method called NP-VAE (Natural Product-oriented Variational Autoencoder), based on a variational autoencoder, for handling hard-to-analyze datasets from DrugBank and large molecular structures such as natural compounds with chirality, an essential factor in the 3D complexity of compounds. NP-VAE successfully constructed a chemical latent space from large compounds that existing methods could not handle, achieved higher reconstruction accuracy, and showed stable performance as a generative model across various indices. Furthermore, by exploring the acquired latent space, we comprehensively analyzed a compound library containing natural compounds and generated novel compound structures with optimized functions.
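
As a rough illustration of what working with a variational-autoencoder latent space involves, the sketch below shows three generic operations: sampling a latent vector with the reparameterization trick, computing the closed-form KL term against a standard normal prior, and interpolating between two latent points. It is not NP-VAE's implementation; the function names, the two-dimensional latent vectors, and the example values are all assumptions made for illustration, and the encoder and decoder that would map compounds to and from this space are omitted.

```python
import math
import random

def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1), element-wise."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]

def kl_to_standard_normal(mu, log_var):
    """Closed-form KL( N(mu, diag(sigma^2)) || N(0, I) )."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))

def interpolate(z_a, z_b, t):
    """Linear interpolation between two latent points, 0 <= t <= 1."""
    return [(1.0 - t) * a + t * b for a, b in zip(z_a, z_b)]

# Hypothetical encoder outputs for one compound (illustrative values only).
rng = random.Random(0)
mu = [0.5, -1.0]
log_var = [0.0, -2.0]

z = reparameterize(mu, log_var, rng)       # a point in the latent space
kl = kl_to_standard_normal(mu, log_var)    # regularization term of the VAE loss
midpoint = interpolate([0.0, 0.0], [2.0, 4.0], 0.5)
```

In a full model, decoding points such as `midpoint` is what would turn exploration of the latent space into candidate structures; that step depends on the decoder and is not shown here.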

SUBMITTER: Ochiai T 

PROVIDER: S-EPMC10654724 | biostudies-literature | 2023 Nov

REPOSITORIES: biostudies-literature

Publications

Variational autoencoder-based chemical latent space for large molecular structures with 3D complexity.

Ochiai Toshiki, Inukai Tensei, Akiyama Manato, Furui Kairi, Ohue Masahito, Matsumori Nobuaki, Inuki Shinsuke, Uesugi Motonari, Sunazuka Toshiaki, Kikuchi Kazuya, Kakeya Hideaki, Sakakibara Yasubumi

Communications Chemistry, 2023-11-16, issue 1


Similar Datasets

| S-EPMC11008089 | biostudies-literature
| S-EPMC6316879 | biostudies-literature
| S-EPMC8906577 | biostudies-literature
| S-EPMC8633506 | biostudies-literature
| S-EPMC10782437 | biostudies-literature
| S-EPMC8842480 | biostudies-literature
| S-EPMC9388855 | biostudies-literature
| S-EPMC11862945 | biostudies-literature
| S-EPMC9814385 | biostudies-literature
| S-EPMC6081979 | biostudies-literature