Dataset Information


SNIT: SNP identification for strain typing.

ABSTRACT: With ever-increasing numbers of microbial genomes being sequenced, efficient tools are needed to perform strain-level identification of any newly sequenced genome. Here, we present the SNP identification for strain typing (SNIT) pipeline, a fast and accurate software system that compares a newly sequenced bacterial genome with other genomes of the same species to identify single nucleotide polymorphisms (SNPs) and small insertions/deletions (indels). Based on this information, the pipeline analyzes the polymorphic loci present in all input genomes to identify the genome that has the fewest differences with the newly sequenced genome. Similarly, for each of the other genomes, SNIT identifies the input genome with the fewest differences. Results from five bacterial species show that the SNIT pipeline identifies the correct closest neighbor with 75% to 100% accuracy. The SNIT pipeline is available for download at http://www.bhsai.org/snit.html.

SUBMITTER: Vijaya Satya R 

PROVIDER: S-EPMC3182885 | BioStudies | 2011-01-01T00:00:00Z

REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC2596143 | BioStudies
2015-01-01 | S-EPMC4331810 | BioStudies
1000-01-01 | S-EPMC4248810 | BioStudies
2010-01-01 | S-EPMC2905370 | BioStudies
2011-01-01 | S-EPMC3219749 | BioStudies
2019-01-01 | S-EPMC6774505 | BioStudies
2014-01-01 | S-EPMC4155085 | BioStudies
2013-01-01 | S-EPMC3630956 | BioStudies
2010-01-01 | S-EPMC2957689 | BioStudies
2017-01-01 | S-EPMC5733104 | BioStudies