Unknown

Dataset Information

0

HaploCart: Human mtDNA haplogroup classification using a pangenomic reference graph.


ABSTRACT: Current mitochondrial DNA (mtDNA) haplogroup classification tools map reads to a single reference genome and perform inference based on the detected mutations to this reference. This approach biases haplogroup assignments towards the reference and prohibits accurate calculations of the uncertainty in assignment. We present HaploCart, a probabilistic mtDNA haplogroup classifier which uses a pangenomic reference graph framework together with principles of Bayesian inference. We demonstrate that our approach significantly outperforms available tools by being more robust to lower coverage or incomplete consensus sequences and producing phylogenetically-aware confidence scores that are unbiased towards any haplogroup. HaploCart is available both as a command-line tool and through a user-friendly web interface. The C++ program accepts as input consensus FASTA, FASTQ, or GAM files, and outputs a text file with the haplogroup assignments of the samples along with the level of confidence in the assignments. Our work considerably reduces the amount of data required to obtain a confident mitochondrial haplogroup assignment.

SUBMITTER: Rubin JD 

PROVIDER: S-EPMC10281577 | biostudies-literature | 2023 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

HaploCart: Human mtDNA haplogroup classification using a pangenomic reference graph.

Rubin Joshua Daniel JD   Vogel Nicola Alexandra NA   Gopalakrishnan Shyam S   Sackett Peter Wad PW   Renaud Gabriel G  

PLoS computational biology 20230607 6


Current mitochondrial DNA (mtDNA) haplogroup classification tools map reads to a single reference genome and perform inference based on the detected mutations to this reference. This approach biases haplogroup assignments towards the reference and prohibits accurate calculations of the uncertainty in assignment. We present HaploCart, a probabilistic mtDNA haplogroup classifier which uses a pangenomic reference graph framework together with principles of Bayesian inference. We demonstrate that ou  ...[more]

Similar Datasets

| S-EPMC4852222 | biostudies-literature
| S-EPMC4234434 | biostudies-literature
| S-EPMC8956381 | biostudies-literature
| S-EPMC2841866 | biostudies-literature
| S-EPMC2529308 | biostudies-literature
| S-EPMC4904161 | biostudies-literature
| S-EPMC8268184 | biostudies-literature
| S-EPMC9949352 | biostudies-literature
| S-EPMC7799442 | biostudies-literature
| S-EPMC8275337 | biostudies-literature