Dataset Information

Using false discovery rates to benchmark SNP-callers in next-generation sequencing projects.


ABSTRACT: Sequence alignments form the basis for many comparative and population genomic studies. Alignment tools provide a range of accuracies dependent on the divergence between the sequences and the alignment methods. Despite widespread use, there is no standard method for assessing the accuracy of a dataset and alignment strategy after resequencing. We present a framework and tool for determining the overall accuracies of an input read dataset, alignment and SNP-calling method, provided that an isolate in that dataset has a corresponding or closely related reference sequence available. In addition to this tool for comparing False Discovery Rates (FDR), we include a method for determining homozygous and heterozygous positions from an alignment using binomial probabilities for an expected error rate. We benchmark this method against other SNP callers using our FDR method with three fungal genomes, finding that it was able to achieve a high level of accuracy. These tools are available at http://cfdr.sourceforge.net/.
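
The two ideas named in the abstract, calling positions with binomial probabilities under an expected error rate and scoring a caller by its false discovery rate, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the default `error_rate` of 0.01, the `alpha` cutoff of 1e-3, and the three-way decision rule are all assumptions chosen for illustration, and FDR is taken in its usual sense of false positives divided by all positive calls.

```python
from math import comb


def binomial_tail(k, n, p):
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def classify_position(alt_reads, depth, error_rate=0.01, alpha=1e-3):
    """Hypothetical genotype call at one aligned position.

    alt_reads: reads supporting the non-reference base
    depth: total reads covering the position
    error_rate, alpha: assumed values, not taken from the source
    """
    if depth == 0:
        return "no_call"
    # Could this many non-reference reads arise from sequencing error alone?
    if binomial_tail(alt_reads, depth, error_rate) >= alpha:
        return "reference"
    # Could the remaining reference reads arise from error alone?
    ref_reads = depth - alt_reads
    if binomial_tail(ref_reads, depth, error_rate) >= alpha:
        return "homozygous_alt"
    # Neither allele is explained by error, so both are treated as real.
    return "heterozygous"


def false_discovery_rate(called, truth):
    """FDR = false positives / all positive calls, over sets of positions."""
    called, truth = set(called), set(truth)
    if not called:
        return 0.0
    false_positives = len(called - truth)
    return false_positives / len(called)
```

For example, under these assumed settings `classify_position(15, 30)` is labelled heterozygous, and a caller that reports four SNPs of which one is absent from the known truth set has an FDR of 0.25.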

SUBMITTER: Farrer RA 

PROVIDER: S-EPMC3604800 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

Using false discovery rates to benchmark SNP-callers in next-generation sequencing projects.

Farrer RA, Henk DA, MacLean D, Studholme DJ, Fisher MC

Scientific Reports, 2013-01-01


Similar Datasets

| S-EPMC3785481 | biostudies-literature
| S-EPMC7293574 | biostudies-literature
| S-EPMC4697941 | biostudies-literature
| S-EPMC3413699 | biostudies-literature
| S-EPMC3557168 | biostudies-literature
| S-EPMC3907553 | biostudies-literature
| S-EPMC3933208 | biostudies-other
2017-04-03 | PXD003804 | Pride
| S-EPMC6932819 | biostudies-literature
| S-EPMC4188281 | biostudies-literature