Unknown

Dataset Information

0

Efficient inference of large prokaryotic pangenomes with PanTA.


ABSTRACT: Pangenome inference is an indispensable step in bacterial genomics, yet its scalability poses a challenge due to the rapid growth of genomic collections. This paper presents PanTA, a software package designed for constructing pangenomes of large bacterial datasets, showing unprecedented efficiency levels multiple times higher than existing tools. PanTA introduces a novel mechanism to construct the pangenome progressively without rebuilding the accumulated collection from scratch. The progressive mode is shown to consume orders of magnitude less computational resources than existing solutions in managing growing datasets. The software is open source and is publicly available at https://github.com/amromics/panta and at 10.6084/m9.figshare.23724705 .

SUBMITTER: Le DQ 

PROVIDER: S-EPMC11304767 | biostudies-literature | 2024 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications


Pangenome inference is an indispensable step in bacterial genomics, yet its scalability poses a challenge due to the rapid growth of genomic collections. This paper presents PanTA, a software package designed for constructing pangenomes of large bacterial datasets, showing unprecedented efficiency levels multiple times higher than existing tools. PanTA introduces a novel mechanism to construct the pangenome progressively without rebuilding the accumulated collection from scratch. The progressive  ...[more]

Similar Datasets

| S-EPMC7376924 | biostudies-literature
| S-EPMC4547308 | biostudies-literature
| S-EPMC11722392 | biostudies-literature
| S-EPMC10540461 | biostudies-literature
| S-EPMC6158922 | biostudies-literature
| S-EPMC10160065 | biostudies-literature
| S-EPMC6107288 | biostudies-literature
| S-EPMC11126918 | biostudies-literature
| S-EPMC7038658 | biostudies-literature
| S-EPMC4315300 | biostudies-literature