Dataset Information

CLCLSA: Cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data.

ABSTRACT: Integration of heterogeneous and high-dimensional multi-omics data is becoming increasingly important in understanding etiology of complex genetic diseases. Each omics technique only provides a limited view of the underlying biological process and integrating heterogeneous omics layers simultaneously would lead to a more comprehensive and detailed understanding of diseases and phenotypes. However, one obstacle faced when performing multi-omics data integration is the existence of unpaired multi-omics data due to instrument sensitivity and cost. Studies may fail if certain aspects of the subjects are missing or incomplete. In this paper, we propose a deep learning method for multi-omics integration with incomplete data by Cross-omics Linked unified embedding with Contrastive Learning and Self Attention (CLCLSA). Utilizing complete multi-omics data as supervision, the model employs cross-omics autoencoders to learn the feature representation across different types of biological data. The multi-omics contrastive learning is employed, which maximizes the mutual information between different types of omics. In addition, the feature-level self-attention and omics-level self-attention are employed to dynamically identify the most informative features for multi-omics data integration. Finally, a Softmax classifier is employed to perform multi-omics data classification. Extensive experiments were conducted on four public multi-omics datasets. The experimental results indicate that our proposed CLCLSA produces promising results in multi-omics data classification using both complete and incomplete multi-omics data.

SUBMITTER: Zhao C

PROVIDER: S-EPMC10959569 | biostudies-literature | 2024 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

CLCLSA: Cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data.

Zhao Chen C Liu Anqi A Zhang Xiao X Cao Xuewei X Ding Zhengming Z Sha Qiuying Q Shen Hui H Deng Hong-Wen HW Zhou Weihua W

Computers in biology and medicine 20240128

Integration of heterogeneous and high-dimensional multi-omics data is becoming increasingly important in understanding etiology of complex genetic diseases. Each omics technique only provides a limited view of the underlying biological process and integrating heterogeneous omics layers simultaneously would lead to a more comprehensive and detailed understanding of diseases and phenotypes. However, one obstacle faced when performing multi-omics data integration is the existence of unpaired multi- ...[more]

PMID: 38295477

Dataset Information

CLCLSA: Cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data.

Publications

CLCLSA: Cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A multi-view contrastive learning for heterogeneous network embedding.
| S-EPMC10130187 | biostudies-literature

Spatial domain detection using contrastive self-supervised learning for spatial multi-omics technologies.
| S-EPMC10862910 | biostudies-literature

Con-AAE: contrastive cycle adversarial autoencoders for single-cell multi-omics alignment and integration.
| S-EPMC10101696 | biostudies-literature

Single-cell multi-omics integration for unpaired data by a siamese network with graph-based contrastive loss.
| S-EPMC9812356 | biostudies-literature

BRANEnet: embedding multilayer networks for omics data integration.
| S-EPMC9575224 | biostudies-literature

Supervised multiple kernel learning approaches for multi-omics data integration.
| S-EPMC11585117 | biostudies-literature

A roadmap for multi-omics data integration using deep learning.
| S-EPMC8769688 | biostudies-literature

MOMA: a multi-task attention learning algorithm for multi-omics data interpretation and classification.
| S-EPMC10060719 | biostudies-literature

CustOmics: A versatile deep-learning based strategy for multi-omics integration.
| S-EPMC10019780 | biostudies-literature

MOGAT: A Multi-Omics Integration Framework Using Graph Attention Networks for Cancer Subtype Prediction
| S-EPMC10932030 | biostudies-literature