Unknown

Dataset Information

0

Pandora: nucleotide-resolution bacterial pan-genomics with reference graphs.


ABSTRACT: We present pandora, a novel pan-genome graph structure and algorithms for identifying variants across the full bacterial pan-genome. As much bacterial adaptability hinges on the accessory genome, methods which analyze SNPs in just the core genome have unsatisfactory limitations. Pandora approximates a sequenced genome as a recombinant of references, detects novel variation and pan-genotypes multiple samples. Using a reference graph of 578 Escherichia coli genomes, we compare 20 diverse isolates. Pandora recovers more rare SNPs than single-reference-based tools, is significantly better than picking the closest RefSeq reference, and provides a stable framework for analyzing diverse samples without reference bias.

SUBMITTER: Colquhoun RM 

PROVIDER: S-EPMC8442373 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5120557 | biostudies-literature
| S-EPMC4822748 | biostudies-literature
| S-EPMC3122890 | biostudies-literature
| S-EPMC8311964 | biostudies-literature
| S-EPMC5870562 | biostudies-literature
| S-EPMC5289856 | biostudies-other
| S-EPMC7568353 | biostudies-literature
| PRJNA715669 | ENA
| S-EPMC4299219 | biostudies-literature