Dataset Information


Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells.

ABSTRACT: Recent advances in whole-genome sequencing have brought the vision of personal genomics and genomic medicine closer to reality. However, current methods lack clinical accuracy and the ability to describe the context (haplotypes) in which genome variants co-occur in a cost-effective manner. Here we describe a low-cost DNA sequencing and haplotyping process, long fragment read (LFR) technology, which is similar to sequencing long single DNA molecules without cloning or separation of metaphase chromosomes. In this study, ten LFR libraries were made using only ~100 picograms of human DNA per sample. Up to 97% of the heterozygous single nucleotide variants were assembled into long haplotype contigs. Removal of false positive single nucleotide variants not phased by multiple LFR haplotypes resulted in a final genome error rate of 1 in 10 megabases. Cost-effective and accurate genome sequencing and haplotyping from 10-20 human cells, as demonstrated here, will enable comprehensive genetic studies and diverse clinical applications.

PROVIDER: S-EPMC3397394 | BioStudies

REPOSITORIES: biostudies

Similar Datasets

| S-EPMC4937318 | BioStudies
| S-EPMC3619281 | BioStudies
| S-EPMC4053695 | BioStudies
| S-EPMC1802573 | BioStudies
| S-EPMC4786454 | BioStudies
| S-EPMC5703283 | BioStudies
| S-EPMC3638138 | BioStudies
| S-EPMC3299995 | BioStudies
| S-EPMC9226495 | BioStudies
| S-EPMC4073643 | BioStudies