Unknown

Dataset Information

0

Leveraging molecular structure and bioactivity with chemical language models for de novo drug design.


ABSTRACT: Generative chemical language models (CLMs) can be used for de novo molecular structure generation by learning from a textual representation of molecules. Here, we show that hybrid CLMs can additionally leverage the bioactivity information available for the training compounds. To computationally design ligands of phosphoinositide 3-kinase gamma (PI3Kγ), a collection of virtual molecules was created with a generative CLM. This virtual compound library was refined using a CLM-based classifier for bioactivity prediction. This second hybrid CLM was pretrained with patented molecular structures and fine-tuned with known PI3Kγ ligands. Several of the computer-generated molecular designs were commercially available, enabling fast prescreening and preliminary experimental validation. A new PI3Kγ ligand with sub-micromolar activity was identified, highlighting the method's scaffold-hopping potential. Chemical synthesis and biochemical testing of two of the top-ranked de novo designed molecules and their derivatives corroborated the model's ability to generate PI3Kγ ligands with medium to low nanomolar activity for hit-to-lead expansion. The most potent compounds led to pronounced inhibition of PI3K-dependent Akt phosphorylation in a medulloblastoma cell model, demonstrating efficacy of PI3Kγ ligands in PI3K/Akt pathway repression in human tumor cells. The results positively advocate hybrid CLMs for virtual compound screening and activity-focused molecular design.

SUBMITTER: Moret M 

PROVIDER: S-EPMC9825622 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Leveraging molecular structure and bioactivity with chemical language models for de novo drug design.

Moret Michael M   Pachon Angona Irene I   Cotos Leandro L   Yan Shen S   Atz Kenneth K   Brunner Cyrill C   Baumgartner Martin M   Grisoni Francesca F   Schneider Gisbert G  

Nature communications 20230107 1


Generative chemical language models (CLMs) can be used for de novo molecular structure generation by learning from a textual representation of molecules. Here, we show that hybrid CLMs can additionally leverage the bioactivity information available for the training compounds. To computationally design ligands of phosphoinositide 3-kinase gamma (PI3Kγ), a collection of virtual molecules was created with a generative CLM. This virtual compound library was refined using a CLM-based classifier for b  ...[more]

Similar Datasets

| S-EPMC11558674 | biostudies-literature
| S-EPMC8549794 | biostudies-literature
| S-EPMC3607962 | biostudies-literature
| S-EPMC6059760 | biostudies-literature
| S-EPMC10631243 | biostudies-literature
| S-EPMC8760751 | biostudies-literature
| S-EPMC11444397 | biostudies-literature
| S-EPMC2683404 | biostudies-literature
| S-EPMC9223405 | biostudies-literature
| S-EPMC8949797 | biostudies-literature