Unknown

Dataset Information

0

ProGeRF: proteome and genome repeat finder utilizing a fast parallel hash function.


ABSTRACT: Repetitive element sequences are adjacent, repeating patterns, also called motifs, and can be of different lengths; repetitions can involve their exact or approximate copies. They have been widely used as molecular markers in population biology. Given the sizes of sequenced genomes, various bioinformatics tools have been developed for the extraction of repetitive elements from DNA sequences. However, currently available tools do not provide options for identifying repetitive elements in the genome or proteome, displaying a user-friendly web interface, and performing-exhaustive searches. ProGeRF is a web site for extracting repetitive regions from genome and proteome sequences. It was designed to be efficient, fast, and accurate and primarily user-friendly web tool allowing many ways to view and analyse the results. ProGeRF (Proteome and Genome Repeat Finder) is freely available as a stand-alone program, from which the users can download the source code, and as a web tool. It was developed using the hash table approach to extract perfect and imperfect repetitive regions in a (multi)FASTA file, while allowing a linear time complexity.

SUBMITTER: Lopes Rda S 

PROVIDER: S-EPMC4355816 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

ProGeRF: proteome and genome repeat finder utilizing a fast parallel hash function.

Lopes Robson da Silva Rda S   Moraes Walas Jhony Lopes WJ   Rodrigues Thiago de Souza Tde S   Bartholomeu Daniella Castanheira DC  

BioMed research international 20150225


Repetitive element sequences are adjacent, repeating patterns, also called motifs, and can be of different lengths; repetitions can involve their exact or approximate copies. They have been widely used as molecular markers in population biology. Given the sizes of sequenced genomes, various bioinformatics tools have been developed for the extraction of repetitive elements from DNA sequences. However, currently available tools do not provide options for identifying repetitive elements in the geno  ...[more]

Similar Datasets

| S-EPMC4464037 | biostudies-literature
| S-EPMC4525727 | biostudies-literature
| S-EPMC4798648 | biostudies-literature
| S-EPMC9027990 | biostudies-literature
| S-EPMC5845352 | biostudies-literature
| S-EPMC6935933 | biostudies-literature
| S-EPMC6853708 | biostudies-literature
| S-EPMC3333191 | biostudies-literature