Unknown

Dataset Information

0

BlastFrost: fast querying of 100,000s of bacterial genomes in Bifrost graphs.


ABSTRACT: BlastFrost is a highly efficient method for querying 100,000s of genome assemblies, building on Bifrost, a dynamic data structure for compacted and colored de Bruijn graphs. BlastFrost queries a Bifrost data structure for sequences of interest and extracts local subgraphs, enabling the identification of the presence or absence of individual genes or single nucleotide sequence variants. We show two examples using Salmonella genomes: finding within minutes the presence of genes in the SPI-2 pathogenicity island in a collection of 926 genomes and identifying single nucleotide polymorphisms associated with fluoroquinolone resistance in three genes among 190,209 genomes. BlastFrost is available at https://github.com/nluhmann/BlastFrost/tree/master/data .

SUBMITTER: Luhmann N 

PROVIDER: S-EPMC7798312 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

BlastFrost: fast querying of 100,000s of bacterial genomes in Bifrost graphs.

Luhmann Nina N   Holley Guillaume G   Achtman Mark M  

Genome biology 20210111 1


BlastFrost is a highly efficient method for querying 100,000s of genome assemblies, building on Bifrost, a dynamic data structure for compacted and colored de Bruijn graphs. BlastFrost queries a Bifrost data structure for sequences of interest and extracts local subgraphs, enabling the identification of the presence or absence of individual genes or single nucleotide sequence variants. We show two examples using Salmonella genomes: finding within minutes the presence of genes in the SPI-2 pathog  ...[more]

Similar Datasets

| S-EPMC10538363 | biostudies-literature
| S-EPMC6452717 | biostudies-literature
| S-EPMC10691542 | biostudies-literature
| S-EPMC10927331 | biostudies-literature
| S-EPMC4081498 | biostudies-literature
| S-EPMC1160258 | biostudies-literature
| PRJNA700858 | ENA
| S-EPMC10507293 | biostudies-literature
| S-EPMC7320612 | biostudies-literature
| S-EPMC5289856 | biostudies-literature