Unknown

Dataset Information

0

Contamination detection in genomic data: more is not enough.


ABSTRACT: The decreasing cost of sequencing and concomitant augmentation of publicly available genomes have created an acute need for automated software to assess genomic contamination. During the last 6 years, 18 programs have been published, each with its own strengths and weaknesses. Deciding which tools to use becomes more and more difficult without an understanding of the underlying algorithms. We review these programs, benchmarking six of them, and present their main operating principles. This article is intended to guide researchers in the selection of appropriate tools for specific applications. Finally, we present future challenges in the developing field of contamination detection.

SUBMITTER: Cornet L 

PROVIDER: S-EPMC8862208 | biostudies-literature | 2022 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Contamination detection in genomic data: more is not enough.

Cornet Luc L   Baurain Denis D  

Genome biology 20220221 1


The decreasing cost of sequencing and concomitant augmentation of publicly available genomes have created an acute need for automated software to assess genomic contamination. During the last 6 years, 18 programs have been published, each with its own strengths and weaknesses. Deciding which tools to use becomes more and more difficult without an understanding of the underlying algorithms. We review these programs, benchmarking six of them, and present their main operating principles. This artic  ...[more]

Similar Datasets

| S-EPMC11639764 | biostudies-literature
| S-EPMC11652266 | biostudies-literature
| S-EPMC11540323 | biostudies-literature
| S-EPMC10576407 | biostudies-literature
| S-EPMC3057953 | biostudies-literature
| S-EPMC9481068 | biostudies-literature
| S-EPMC9710612 | biostudies-literature
| S-EPMC5370491 | biostudies-literature
| S-EPMC9630725 | biostudies-literature
| S-EPMC8048386 | biostudies-literature