ABSTRACT: Motivation
Molecular representation learning plays an indispensable role in crucial tasks such as property prediction and drug design. Despite the notable achievements of molecular pre-training models, current methods often fail to capture both the structural and feature semantics of molecular graphs. Moreover, while graph contrastive learning has opened up new prospects, existing augmentation techniques often struggle to retain the core semantics of molecular graphs. To overcome these limitations, we propose a gradient-compensated encoder parameter perturbation approach that ensures efficient and stable feature augmentation. By combining augmentation strategies based on attribute masking and parameter perturbation, we introduce MoleMCL, a new MOLEcular pre-training model based on multi-level contrastive learning.
Results
Experimental results demonstrate that MoleMCL adeptly captures the structural and feature semantics of molecular graphs, surpassing current state-of-the-art models on molecular property prediction tasks and paving a new avenue for molecular modeling.
Availability and implementation
The code and data underlying this work are available on GitHub at https://github.com/BioSequenceAnalysis/MoleMCL.
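The record above only summarizes the parameter-perturbation augmentation; the sketch below is a minimal, hypothetical PyTorch illustration of the general idea, in which a copy of the encoder's weights is perturbed to produce a second contrastive view, with the random noise projected away from the current gradient direction ("gradient-compensated" in one plausible reading). The function names nt_xent and perturbed_view, the toy MLP encoder, and all hyperparameters are assumptions for illustration and are not taken from the MoleMCL implementation.

```python
import copy
import torch
import torch.nn.functional as F

def nt_xent(z1, z2, tau=0.5):
    # Normalized-temperature cross-entropy loss over a batch of paired views.
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    logits = z1 @ z2.t() / tau          # pairwise similarities
    labels = torch.arange(z1.size(0))   # positives lie on the diagonal
    return F.cross_entropy(logits, labels)

def perturbed_view(encoder, x, ref_loss, eps=1e-2):
    # Build a perturbed copy of the encoder: add small random noise to each
    # weight tensor after removing the noise component along the gradient of
    # ref_loss, so the perturbation avoids the steepest-ascent direction.
    grads = torch.autograd.grad(ref_loss, list(encoder.parameters()),
                                retain_graph=True)
    shadow = copy.deepcopy(encoder)
    with torch.no_grad():
        for p, g in zip(shadow.parameters(), grads):
            noise = torch.randn_like(p)
            noise -= (noise * g).sum() / (g.norm() ** 2 + 1e-12) * g
            p.add_(eps * noise)
        return shadow(x)  # second view, treated as a fixed target

# Toy usage with an MLP standing in for a molecular graph encoder.
encoder = torch.nn.Sequential(torch.nn.Linear(16, 32), torch.nn.ReLU(),
                              torch.nn.Linear(32, 32))
x = torch.randn(8, 16)                            # 8 "molecules", 16 features each
z_anchor = encoder(x)
ref_loss = nt_xent(z_anchor, z_anchor.detach())   # placeholder loss to obtain a gradient
z_aug = perturbed_view(encoder, x, ref_loss)
nt_xent(z_anchor, z_aug).backward()               # contrast the clean and perturbed views
```

In the actual model, the perturbed view would presumably come from a GNN encoder and be paired with an attribute-masked view under the multi-level contrastive scheme described in the abstract; see the linked repository for the authors' implementation.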
SUBMITTER: Zhang X
PROVIDER: S-EPMC11001485 | biostudies-literature | 2024 Mar
REPOSITORIES: biostudies-literature

Zhang Xinyi, Xu Yanni, Jiang Changzhi, Shen Lian, Liu Xiangrong
Bioinformatics (Oxford, England) 20240301 4