Unknown

Dataset Information

0

TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution.


ABSTRACT: The widespread of SARS-CoV-2 presents a significant threat to human society, as well as public health and economic development. Extensive efforts have been undertaken to battle against the pandemic, whereas effective approaches such as vaccination would be weakened by the continuous mutations, leading to considerable attention being attracted to the mutation prediction. However, most previous studies lack attention to phylogenetics. In this paper, we propose a novel and effective model TEMPO for predicting the mutation of SARS-CoV-2 evolution. Specifically, we design a phylogenetic tree-based sampling method to generate sequence evolution data. Then, a transformer-based model is presented for the site mutation prediction after learning the high-level representation of these sequence data. We conduct experiments to verify the effectiveness of TEMPO, leveraging a large-scale SARS-CoV- 2 dataset. Experimental results show that TEMPO is effective for mutation prediction of SARS- CoV-2 evolution and outperforms several state-of-the-art baseline methods. We further perform mutation prediction experiments of other infectious viruses, to explore the feasibility and robustness of TEMPO, and experimental results verify its superiority. The codes and datasets are freely available at https://github.com/ZJUDataIntelligence/TEMPO.

SUBMITTER: Zhou B 

PROVIDER: S-EPMC9747230 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

TEMPO: A transformer-based mutation prediction framework for SARS-CoV-2 evolution.

Zhou Binbin B   Zhou Hang H   Zhang Xue X   Xu Xiaobin X   Chai Yi Y   Zheng Zengwei Z   Kot Alex Chichung AC   Zhou Zhan Z  

Computers in biology and medicine 20221214


The widespread of SARS-CoV-2 presents a significant threat to human society, as well as public health and economic development. Extensive efforts have been undertaken to battle against the pandemic, whereas effective approaches such as vaccination would be weakened by the continuous mutations, leading to considerable attention being attracted to the mutation prediction. However, most previous studies lack attention to phylogenetics. In this paper, we propose a novel and effective model TEMPO for  ...[more]

Similar Datasets

| S-EPMC9272809 | biostudies-literature
| S-EPMC10528219 | biostudies-literature
2024-09-13 | GSE262953 | GEO
| S-EPMC10538282 | biostudies-literature
| S-EPMC10490908 | biostudies-literature
| S-EPMC10041520 | biostudies-literature
| S-EPMC10726525 | biostudies-literature
| S-EPMC11622836 | biostudies-literature
| S-EPMC7857040 | biostudies-literature
| S-EPMC8842338 | biostudies-literature