Dataset Information

High-frequency, low-coverage "false positives" mutations may be true in GS Junior sequencing studies.


ABSTRACT: The GS Junior sequencer provides simplified procedures for library preparation and data processing. Errors in pyrosequencing generate some biases during library construction and emulsion PCR amplification. False-positive mutations are identified by related characteristics described in the manufacturer's manual, and some detected mutations may have 'borderline' characteristics when they are detected in few reads or at low frequency. Among these mutations, however, some may be true positives. This study aimed to improve the accuracy of identifying true positives among mutations with borderline false-positive characteristics detected with GS Junior sequencing. Mutations with the borderline features were tested for validity with Sanger sequencing. We examined 10 mutations detected at coverages <20-fold and frequencies >30% (group A) and 16 mutations detected at coverages >20-fold and frequencies <30% (group B). In group A, two mutations were not confirmed, and two mutations with 100% frequency were confirmed as heterozygous alleles. No mutation in group B was confirmed. The two groups had significantly different false-positive prevalences (p = 0.001). These results suggest that mutations detected at frequencies less than 30% can be confidently identified as false positives, but that mutations detected at frequencies over 30%, despite coverages less than 20-fold, should be verified with Sanger sequencing.
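
The triage rule suggested by the abstract's conclusion can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: the `Variant` class, the `triage` function, and the returned labels are hypothetical names. The abstract does not say how values of exactly 20-fold or exactly 30% were treated, nor what to do with calls at both high coverage and high frequency, so those cases are assumptions here.

```python
from dataclasses import dataclass

# Thresholds taken from the abstract. How exact boundary values (20-fold, 30%)
# are handled is an assumption, since the abstract does not specify it.
COVERAGE_CUTOFF = 20      # read depth, in fold
FREQUENCY_CUTOFF = 30.0   # variant frequency, in percent


@dataclass
class Variant:
    """Hypothetical container for one detected mutation."""
    name: str
    coverage: int       # number of reads covering the position
    frequency: float    # percentage of reads carrying the mutation


def triage(variant: Variant) -> str:
    """Apply the rule of thumb suggested in the abstract's last sentence."""
    if variant.frequency < FREQUENCY_CUTOFF:
        # Group B pattern: no such mutation was confirmed by Sanger sequencing.
        return "likely false positive"
    if variant.coverage < COVERAGE_CUTOFF:
        # Group A pattern: most were confirmed, so verification is recommended.
        return "verify with Sanger sequencing"
    # Not a borderline case in the study; this label is an assumption.
    return "not borderline"


if __name__ == "__main__":
    examples = [
        Variant("low-coverage, high-frequency", coverage=12, frequency=100.0),
        Variant("high-coverage, low-frequency", coverage=85, frequency=8.5),
        Variant("high-coverage, high-frequency", coverage=60, frequency=48.0),
    ]
    for v in examples:
        print(f"{v.name}: {triage(v)}")
```

The example variants are invented for illustration and are not taken from the dataset. Note that the study tested low-frequency calls only at coverages above 20-fold, so applying the "likely false positive" label to low-frequency calls at lower coverage follows the abstract's general conclusion rather than a directly tested group.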

SUBMITTER: Yang Z 

PROVIDER: S-EPMC5653793 | BioStudies | 2017-01-01

SECONDARY ACCESSION(S): rs34605667

REPOSITORIES: biostudies

Similar Datasets

2017-01-01 | S-EPMC5327365 | BioStudies
2013-01-01 | S-EPMC3877031 | BioStudies
2014-01-01 | S-EPMC4098986 | BioStudies
2016-01-01 | S-EPMC5008380 | BioStudies
2014-01-01 | S-EPMC3960061 | BioStudies
2011-03-04 | GSE27659 | GEO
2010-01-01 | S-EPMC2884316 | BioStudies
2010-01-01 | S-EPMC2871986 | BioStudies
2011-03-04 | E-GEOD-27659 | ArrayExpress
1000-01-01 | S-EPMC4803105 | BioStudies