Dataset Information

Marker discovery in the large.


ABSTRACT:

Motivation

Markers for diagnostic polymerase chain reactions are routinely constructed by taking regions common to the genomes of a target organism and subtracting the regions found in the target's closest relatives, its neighbors. This approach is implemented in the published package Fur, which originally required memory proportional to the number of nucleotides in the neighborhood. This does not scale well to large neighborhoods.
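The "common to targets, minus neighbors" idea described above can be illustrated with a toy sketch. This is not Fur's algorithm: it is a deliberately simplified k-mer set version, and the function names `kmers` and `candidate_markers`, the parameter `k`, and the example sequences are all assumptions made for illustration. It also ignores reverse complements and works on exact matches only.

```python
def kmers(seq, k):
    """Return the set of all substrings of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def candidate_markers(targets, neighbors, k):
    """Toy illustration (not Fur): k-mers found in every target
    sequence and in none of the neighbor sequences."""
    if not targets:
        raise ValueError("at least one target sequence is required")
    # Step 1: regions common to all targets.
    common = set.intersection(*(kmers(t, k) for t in targets))
    # Step 2: subtract what also occurs in any neighbor,
    # handling one neighbor at a time.
    for n in neighbors:
        common -= kmers(n, k)
    return common


# Hypothetical example sequences.
targets = ["ACGTACGGA", "TTACGGAAC"]
neighbors = ["GTACGA"]
markers = candidate_markers(targets, neighbors, k=4)
```

In this example the two targets share the 4-mers TACG, ACGG, and CGGA; TACG also occurs in the neighbor and is subtracted, leaving ACGG and CGGA as candidate markers.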

Results

Here, we describe a new version of Fur that only requires memory proportional to the longest neighbor. In spite of its greater memory efficiency, the new Fur remains fast and is accurate. We demonstrate this by applying it to simulated sequences and comparing it to an efficient alternative. Then we use the new Fur to extract markers from 120 reference bacteria. To make this feasible, we also introduce software for automatically finding target and neighbor genomes and for assessing markers. We pick the best primers from the 10 most sequenced reference bacteria and show their excellent in silico sensitivity and specificity.
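As a rough illustration of what in silico sensitivity and specificity mean for a primer pair, the sketch below uses the standard definitions: sensitivity is the fraction of target genomes that are amplified, and specificity is the fraction of neighbor genomes that are not. The function name, the argument layout, and the genome identifiers are hypothetical, and the paper's own assessment procedure may differ in detail.

```python
def sensitivity_specificity(hits, targets, neighbors):
    """Standard sensitivity and specificity from sets of genome ids.

    hits:      genomes the primer pair amplifies in silico
    targets:   genomes that should be amplified
    neighbors: genomes that should not be amplified
    """
    if not targets or not neighbors:
        raise ValueError("targets and neighbors must be non-empty")
    tp = len(hits & targets)      # targets correctly amplified
    fn = len(targets - hits)      # targets missed
    fp = len(hits & neighbors)    # neighbors wrongly amplified
    tn = len(neighbors - hits)    # neighbors correctly left alone
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


# Hypothetical genome identifiers.
targets = {"t1", "t2", "t3"}
neighbors = {"n1", "n2", "n3", "n4"}
hits = {"t1", "t2", "n1"}
sens, spec = sensitivity_specificity(hits, targets, neighbors)
```

Here two of three targets are amplified (sensitivity 2/3) and three of four neighbors are not (specificity 3/4).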

Availability and implementation

Fur is available from github.com/evolbioinf/fur, in the Docker image hub.docker.com/r/beatrizvm/mapro, and in the Code Ocean capsule 10.24433/CO.7955947.v1.

SUBMITTER: Vieira Mourato B 

PROVIDER: S-EPMC11310107 | biostudies-literature | 2024

REPOSITORIES: biostudies-literature

Publications

Marker discovery in the large.

Beatriz Vieira Mourato, Ivan Tsers, Svenja Denker, Fabian Klötzl, Bernhard Haubold

Bioinformatics Advances 2024-07-27 1


Similar Datasets

| S-EPMC3633874 | biostudies-other
| EGAC00001000661 | EGA
| S-EPMC3113791 | biostudies-literature
| S-EPMC6737103 | biostudies-literature
| S-EPMC4370664 | biostudies-literature
| S-EPMC5768334 | biostudies-literature
| S-EPMC3566585 | biostudies-literature
2010-08-01 | GSE17673 | GEO
2013-04-01 | GSE42847 | GEO
2010-08-01 | E-GEOD-17673 | biostudies-arrayexpress