Dataset Information

CRISPRdisco: An Automated Pipeline for the Discovery and Analysis of CRISPR-Cas Systems.


ABSTRACT: CRISPR-Cas adaptive immune systems of bacteria and archaea have catapulted into the scientific spotlight as genome editing tools. To aid researchers in the field, we have developed an automated pipeline, named CRISPRdisco (CRISPR discovery), to identify CRISPR repeats and cas genes in genome assemblies, determine type and subtype, and describe system completeness. All six major types and 23 currently recognized subtypes and novel putative V-U types are detected. Here, we use the pipeline to identify and classify putative CRISPR-Cas systems in 2,777 complete genomes from the NCBI RefSeq database. This allows comparison to previous publications and investigation of the occurrence and size of CRISPR-Cas systems. Software available at http://github.com/crisprlab/CRISPRdisco provides reproducible, standardized, accessible, transparent, and high-throughput analysis methods available to all researchers in and beyond the CRISPR-Cas research community. This tool opens new avenues to enable classification within a complex nomenclature and provides analytical methods in a field that has evolved rapidly.

SUBMITTER: Crawley AB 

PROVIDER: S-EPMC6636876 | biostudies-literature | 2018 Apr

REPOSITORIES: biostudies-literature

Publications

CRISPRdisco: An Automated Pipeline for the Discovery and Analysis of CRISPR-Cas Systems.

Alexandra B. Crawley, James R. Henriksen, Rodolphe Barrangou

The CRISPR Journal, 2018-04-09


Similar Datasets

| S-EPMC4660269 | biostudies-literature
| S-EPMC6709367 | biostudies-literature
| S-EPMC4531879 | biostudies-literature
| S-EPMC10734277 | biostudies-literature
| S-EPMC4507336 | biostudies-literature
| S-SCDT-10_1038-S44319-025-00399-4 | biostudies-other
| S-EPMC11560957 | biostudies-literature
| S-EPMC6546389 | biostudies-literature
2020-11-01 | GSE152684 | GEO
| S-EPMC10379622 | biostudies-literature