Unknown

Dataset Information

0

Artificial Intelligence and Machine learning based prediction of resistant and susceptible mutations in Mycobacterium tuberculosis.


ABSTRACT: Tuberculosis (TB), an infectious disease caused by Mycobacterium tuberculosis (M.tb), causes highest number of deaths globally for any bacterial disease necessitating novel diagnosis and treatment strategies. High-throughput sequencing methods generate a large amount of data which could be exploited in determining multi-drug resistant (MDR-TB) associated mutations. The present work is a computational framework that uses artificial intelligence (AI) based machine learning (ML) approaches for predicting resistance in the genes rpoB, inhA, katG, pncA, gyrA and gyrB for the drugs rifampicin, isoniazid, pyrazinamide and fluoroquinolones. The single nucleotide variations were represented by several sequence and structural features that indicate the influence of mutations on the target protein coded by each gene. We used ML algorithms - naïve bayes, k nearest neighbor, support vector machine, and artificial neural network, to build the prediction models. The classification models had an average accuracy of 85% across all examined genes and were evaluated on an external unseen dataset to demonstrate their application. Further, molecular docking and molecular dynamics simulations were performed for wild type and predicted resistance causing mutant protein and anti-TB drug complexes to study their impact on the conformation of proteins to confirm the observed phenotype.

SUBMITTER: Jamal S 

PROVIDER: S-EPMC7099008 | BioStudies | 2020-01-01

REPOSITORIES: biostudies

Similar Datasets

2019-01-01 | S-EPMC6894303 | BioStudies
2019-01-01 | S-EPMC6624858 | BioStudies
1000-01-01 | S-EPMC3088277 | BioStudies
2018-01-01 | S-EPMC5895860 | BioStudies
1000-01-01 | S-EPMC3404943 | BioStudies
2017-01-01 | S-EPMC5417650 | BioStudies
2020-01-01 | S-EPMC7265455 | BioStudies
2020-01-01 | S-EPMC6947908 | BioStudies
2014-01-01 | S-EPMC4068477 | BioStudies
2012-01-01 | S-EPMC3502959 | BioStudies