Unknown

Dataset Information

0

UProC: tools for ultra-fast protein domain classification.


ABSTRACT: MOTIVATION: With rapidly increasing volumes of biological sequence data the functional analysis of new sequences in terms of similarities to known protein families challenges classical bioinformatics. RESULTS: The ultrafast protein classification (UProC) toolbox implements a novel algorithm ('Mosaic Matching') for large-scale sequence analysis. UProC is by three orders of magnitude faster than profile-based methods and in a metagenome simulation study achieved up to 80% higher sensitivity on unassembled 100?bp reads. AVAILABILITY AND IMPLEMENTATION: UProC is available as an open-source software at https://github.com/gobics/uproc. Precompiled databases (Pfam) are linked on the UProC homepage: http://uproc.gobics.de/. CONTACT: peter@gobics.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

SUBMITTER: Meinicke P 

PROVIDER: S-EPMC4410661 | biostudies-literature | 2015 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

UProC: tools for ultra-fast protein domain classification.

Meinicke Peter P  

Bioinformatics (Oxford, England) 20141223 9


<h4>Motivation</h4>With rapidly increasing volumes of biological sequence data the functional analysis of new sequences in terms of similarities to known protein families challenges classical bioinformatics.<h4>Results</h4>The ultrafast protein classification (UProC) toolbox implements a novel algorithm ('Mosaic Matching') for large-scale sequence analysis. UProC is by three orders of magnitude faster than profile-based methods and in a metagenome simulation study achieved up to 80% higher sensi  ...[more]

Similar Datasets

2023-08-08 | GSE237874 | GEO
| EGAS00001007475 | EGA
2019-08-01 | GSE118265 | GEO
2019-10-01 | GSE121115 | GEO
2016-12-06 | GSE60865 | GEO
| S-EPMC1479088 | biostudies-literature
| S-EPMC4920111 | biostudies-literature
| S-EPMC7378889 | biostudies-literature