Ontology highlight
ABSTRACT: Background
Next generation sequencing (NGS) platforms are currently being utilized for targeted sequencing of candidate genes or genomic intervals to perform sequence-based association studies. To evaluate these platforms for this application, we analyzed human sequence generated by the Roche 454, Illumina GA, and the ABI SOLiD technologies for the same 260 kb in four individuals.Results
Local sequence characteristics contribute to systematic variability in sequence coverage (>100-fold difference in per-base coverage), resulting in patterns for each NGS technology that are highly correlated between samples. A comparison of the base calls to 88 kb of overlapping ABI 3730xL Sanger sequence generated for the same samples showed that the NGS platforms all have high sensitivity, identifying >95% of variant sites. At high coverage, depth base calling errors are systematic, resulting from local sequence contexts; as the coverage is lowered additional 'random sampling' errors in base calling occur.Conclusions
Our study provides important insights into systematic biases and data variability that need to be considered when utilizing NGS platforms for population targeted sequencing studies.
SUBMITTER: Harismendy O
PROVIDER: S-EPMC2691003 | biostudies-literature | 2009
REPOSITORIES: biostudies-literature
Harismendy Olivier O Ng Pauline C PC Strausberg Robert L RL Wang Xiaoyun X Stockwell Timothy B TB Beeson Karen Y KY Schork Nicholas J NJ Murray Sarah S SS Topol Eric J EJ Levy Samuel S Frazer Kelly A KA
Genome biology 20090327 3
<h4>Background</h4>Next generation sequencing (NGS) platforms are currently being utilized for targeted sequencing of candidate genes or genomic intervals to perform sequence-based association studies. To evaluate these platforms for this application, we analyzed human sequence generated by the Roche 454, Illumina GA, and the ABI SOLiD technologies for the same 260 kb in four individuals.<h4>Results</h4>Local sequence characteristics contribute to systematic variability in sequence coverage (>10 ...[more]