Unknown

Dataset Information

0

In-depth investigation of the point mutation pattern of HIV-1.


ABSTRACT: Mutations may produce highly transmissible and damaging HIV variants, which increase the genetic diversity, and pose a challenge to develop vaccines. Therefore, it is of great significance to understand how mutations drive the virulence of HIV. Based on the 11897 reliable genomes of HIV-1 retrieved from HIV sequence Database, we analyze the 12 types of point mutation (A>C, A>G, A>T, C>A, C>G, C>T, G>A, G>C, G>T, T>A, T>C, T>G) from multiple statistical perspectives for the first time. The global/geographical location/subtype/k-mer analysis results report that A>G, G>A, C>T and T>C account for nearly 64% among all SNPs, which suggest that APOBEC-editing and ADAR-editing may play an important role in HIV-1 infectivity. Time analysis shows that most genomes with abnormal mutation numbers comes from African countries. Finally, we use natural vector method to check the k-mer distribution changing patterns in the genome, and find that there is an important substitution pattern between nucleotides A and G, and 2-mer CG may have a significant impact on viral infectivity. This paper provides an insight into the single mutation of HIV-1 by using the latest data in the HIV sequence Database.

SUBMITTER: Sun N 

PROVIDER: S-EPMC9705751 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

In-depth investigation of the point mutation pattern of HIV-1.

Sun Nan N   Yau Stephen S-T SS  

Frontiers in cellular and infection microbiology 20221115


Mutations may produce highly transmissible and damaging HIV variants, which increase the genetic diversity, and pose a challenge to develop vaccines. Therefore, it is of great significance to understand how mutations drive the virulence of HIV. Based on the 11897 reliable genomes of HIV-1 retrieved from HIV sequence Database, we analyze the 12 types of point mutation (A>C, A>G, A>T, C>A, C>G, C>T, G>A, G>C, G>T, T>A, T>C, T>G) from multiple statistical perspectives for the first time. The global  ...[more]

Similar Datasets

| S-EPMC5118795 | biostudies-literature
| S-EPMC8933506 | biostudies-literature
| S-EPMC8284934 | biostudies-literature
| S-EPMC6947549 | biostudies-literature
| S-EPMC2766332 | biostudies-literature
2014-06-13 | E-GEOD-47904 | biostudies-arrayexpress
| S-EPMC6078319 | biostudies-literature
2014-06-13 | GSE47904 | GEO
| S-EPMC4227133 | biostudies-literature
| S-EPMC310660 | biostudies-literature