Dataset Information


Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S--a novel motif scan algorithm with optional secondary structure constraints.

ABSTRACT: Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering.


PROVIDER: S-EPMC2465807 | BioStudies | 2008-01-01

REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC2492507 | BioStudies
1000-01-01 | S-EPMC4551934 | BioStudies
2008-01-01 | S-EPMC2441816 | BioStudies
2008-01-01 | S-EPMC2459302 | BioStudies
2012-01-01 | S-EPMC3384240 | BioStudies
2007-01-01 | S-EPMC1965518 | BioStudies
2005-01-01 | S-EPMC548357 | BioStudies
2007-01-01 | S-EPMC1874628 | BioStudies
2013-01-01 | S-EPMC3850674 | BioStudies
2015-01-01 | S-EPMC4417163 | BioStudies