Dataset Information

Maast: genotyping thousands of microbial strains efficiently.


ABSTRACT: Existing single nucleotide polymorphism (SNP) genotyping algorithms do not scale for species with thousands of sequenced strains, nor do they account for conspecific redundancy. Here we present a bioinformatics tool, Maast, which empowers population genetic meta-analysis of microbes at an unrivaled scale. Maast implements a novel algorithm to heuristically identify a minimal set of diverse conspecific genomes, then constructs a reliable SNP panel for each species, and enables rapid and accurate genotyping using a hybrid of whole-genome alignment and k-mer exact matching. We demonstrate Maast's utility by genotyping thousands of Helicobacter pylori strains and tracking SARS-CoV-2 diversification.
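The k-mer exact-matching step mentioned in the abstract can be illustrated with a toy sketch. This is not Maast's code or its interface: the function names `build_kmer_index` and `genotype`, the tiny reference sequence, the value `k=5`, and the majority-vote allele call are all assumptions made for illustration. The sketch also skips filtering k-mers that occur elsewhere in the genome, which a real tool would need.

```python
from collections import Counter, defaultdict


def build_kmer_index(reference, snps, k=5):
    """Map each k-mer that covers a SNP site to (position, allele).

    reference: reference genome as a string (hypothetical toy input).
    snps: dict of 0-based position -> iterable of alleles at that site.
    """
    index = {}
    ambiguous = set()
    for pos, alleles in snps.items():
        for allele in alleles:
            # Reference sequence carrying this allele at the SNP site.
            seq = reference[:pos] + allele + reference[pos + 1:]
            # Enumerate every length-k window that contains the SNP position.
            first = max(0, pos - k + 1)
            last = min(pos, len(seq) - k)
            for start in range(first, last + 1):
                kmer = seq[start:start + k]
                if kmer in index and index[kmer] != (pos, allele):
                    ambiguous.add(kmer)
                index[kmer] = (pos, allele)
    # Discard k-mers that would point to more than one site or allele.
    for kmer in ambiguous:
        del index[kmer]
    return index


def genotype(reads, index, k=5):
    """Call one allele per SNP site by counting exact k-mer hits in reads."""
    counts = defaultdict(Counter)
    for read in reads:
        for i in range(len(read) - k + 1):
            hit = index.get(read[i:i + k])
            if hit is not None:
                pos, allele = hit
                counts[pos][allele] += 1
    # Assumed calling rule: report the allele with the most k-mer hits.
    return {pos: c.most_common(1)[0][0] for pos, c in counts.items()}


# Toy usage: one SNP at position 7 (G in the reference, A as the alternative).
reference = "ACGTACGGTTCAGT"
snp_panel = {7: ("G", "A")}
index = build_kmer_index(reference, snp_panel, k=5)
reads = ["TACGATTC", "CGATTCAG"]  # both reads carry the A allele
print(genotype(reads, index, k=5))  # {7: 'A'}
```

Because lookups are exact dictionary hits, reads never need to be aligned to the reference, which is the general reason k-mer matching tends to be fast; sites with no covering k-mers in the reads are simply left uncalled in this sketch.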

SUBMITTER: Shi ZJ 

PROVIDER: S-EPMC10416524 | biostudies-literature | 2023 Aug

REPOSITORIES: biostudies-literature

Publications

Maast: genotyping thousands of microbial strains efficiently.

Zhou Jason Shi, Stephen Nayfach, Katherine S. Pollard

Genome Biology, 2023-08-10, issue 1


Similar Datasets

| S-EPMC3483277 | biostudies-literature
| S-EPMC8576353 | biostudies-literature
2011-12-31 | GSE31543 | GEO
| S-EPMC11923105 | biostudies-literature
| S-EPMC9511215 | biostudies-literature
| S-EPMC3575377 | biostudies-literature
| S-EPMC5079060 | biostudies-literature
| S-EPMC3716655 | biostudies-other
| S-EPMC8891347 | biostudies-literature
| S-EPMC2438477 | biostudies-literature