Dataset Information


Long Single-Molecule Reads Can Resolve the Complexity of the Influenza Virus Composed of Rare, Closely Related Mutant Variants.

ABSTRACT: As a result of a high rate of mutations and recombination events, an RNA virus exists as a heterogeneous "swarm" of mutant variants. The long read length offered by single-molecule sequencing technologies allows each mutant variant to be sequenced in a single pass. However, the high error rate limits the ability to reconstruct a heterogeneous viral population composed of rare, related mutant variants. In this article, we present two single-nucleotide variants (2SNV), a method able to tolerate the high error rate of the single-molecule protocol and reconstruct mutant variants. 2SNV uses linkage between single-nucleotide variations to efficiently distinguish them from read errors. To benchmark the sensitivity of 2SNV, we performed a single-molecule sequencing experiment on a sample containing a titrated level of known viral mutant variants. Our method is able to accurately reconstruct a clone with a frequency of 0.2% and distinguish clones that differed in only two nucleotides distantly located on the genome. 2SNV outperforms existing methods for full-length viral mutant reconstruction.
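
The linkage idea mentioned in the abstract can be illustrated with a small sketch. This is not the 2SNV implementation; it is a simplified illustration that assumes reads are already aligned to equal length, treats the most common base at each position as the consensus, and flags a pair of non-consensus bases as linked when they co-occur far more often than independent errors would predict. The function name `linked_variant_pairs` and the thresholds `min_ratio` and `min_count` are hypothetical choices made for this example.

```python
from collections import Counter
from itertools import combinations


def linked_variant_pairs(reads, min_ratio=5.0, min_count=3):
    """Find pairs of non-consensus bases that co-occur more than chance predicts.

    Illustrative only: assumes pre-aligned, equal-length reads. Returns a list
    of (pos_i, base_i, pos_j, base_j, observed, expected) tuples.
    """
    n = len(reads)
    length = len(reads[0])

    # Count how often each base appears at each position.
    counts = [Counter(read[i] for read in reads) for i in range(length)]
    # Take the most frequent base at each position as the consensus.
    consensus = [c.most_common(1)[0][0] for c in counts]

    # Count how often each pair of non-consensus bases appears in the same read.
    pair_counts = Counter()
    for read in reads:
        variants = [(i, read[i]) for i in range(length) if read[i] != consensus[i]]
        for v1, v2 in combinations(variants, 2):
            pair_counts[(v1, v2)] += 1

    linked = []
    for ((i, a), (j, b)), observed in pair_counts.items():
        # Expected co-occurrences if the two bases arose independently.
        expected = counts[i][a] * counts[j][b] / n
        if observed >= min_count and observed > min_ratio * expected:
            linked.append((i, a, j, b, observed, expected))
    return linked
```

In this sketch, random sequencing errors at two positions rarely land in the same read, so their joint count stays near the independence expectation, while two mutations carried by a real variant appear together in every read from that variant.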

SUBMITTER: Artyomenko A

PROVIDER: S-EPMC5467126 | BioStudies | 2017-01-01

REPOSITORIES: biostudies

Similar Datasets

2018-01-01 | S-EPMC5864009 | BioStudies
2013-01-01 | S-EPMC3655449 | BioStudies
2015-01-01 | S-EPMC4787755 | BioStudies
2017-01-01 | S-EPMC5685235 | BioStudies
2018-01-01 | S-EPMC6264523 | BioStudies
2015-01-01 | S-EPMC4333345 | BioStudies
2019-01-01 | S-EPMC6765106 | BioStudies
2016-01-01 | S-EPMC4752602 | BioStudies
2014-01-01 | S-EPMC4253835 | BioStudies
2014-01-01 | S-EPMC3967922 | BioStudies