Dataset Information


Whole-genome haplotyping by dilution, amplification, and sequencing.

ABSTRACT: Standard whole-genome genotyping technologies are unable to determine haplotypes. Here we describe a method for rapid and cost-effective long-range haplotyping. Genomic DNA is diluted and distributed into multiple aliquots such that each aliquot receives a fraction of a haploid copy. The DNA template in each aliquot is amplified by multiple displacement amplification, converted into barcoded sequencing libraries using Nextera technology, and sequenced in multiplexed pools. To assess the performance of our method, we combined two male genomic DNA samples at equal ratios, resulting in a sample with diploid X chromosomes with known haplotypes. Pools of the multiplexed sequencing libraries were subjected to targeted pull-down of a 1-Mb contiguous region of the X-chromosome Duchenne muscular dystrophy gene. We were able to phase the Duchenne muscular dystrophy region into two contiguous haplotype blocks with a mean length of 494 kb. The haplotypes showed 99% agreement with the consensus base calls made by sequencing the individual DNAs. We subsequently used the strategy to haplotype two human genomes. Standard genomic sequencing to identify all heterozygous SNPs in the sample was combined with dilution-amplification-based sequencing data to resolve the phase of identified heterozygous SNPs. Using this procedure, we were able to phase >95% of the heterozygous SNPs from the diploid sequence data. The N50 for a Yoruba male DNA was 702 kb whereas the N50 for a European female DNA was 358 kb. Therefore, the strategy described here is suitable for haplotyping of a set of targeted regions as well as of the entire genome.
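The abstract reports N50 values for the phased haplotype blocks but does not say how they were computed. As a minimal sketch, assuming the common definition (the block length at which blocks of that size or longer cover at least half of the total phased length), the statistic can be computed as below. The function name `n50` and the block lengths in the example are hypothetical and are not taken from the dataset.

```python
def n50(block_lengths):
    """Return the N50 of a collection of haplotype block lengths.

    Assumes the common definition: sort blocks from longest to shortest
    and return the length of the block at which the running total first
    reaches half of the summed length of all blocks.
    """
    lengths = sorted(block_lengths, reverse=True)
    if not lengths:
        raise ValueError("block_lengths must not be empty")

    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length


# Hypothetical block lengths in kb, for illustration only.
example_blocks_kb = [10, 20, 30, 40]
print(n50(example_blocks_kb))
```

Because N50 is weighted toward long blocks, it can differ noticeably from the mean block length, so the two figures quoted in the abstract (a mean for the targeted region, N50 for the whole genomes) are not directly comparable.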

PROVIDER: S-EPMC3619281 | BioStudies

REPOSITORIES: biostudies

Similar Datasets

S-EPMC3299995 | BioStudies
S-EPMC4053695 | BioStudies
S-EPMC3638138 | BioStudies
S-EPMC4162929 | BioStudies
S-EPMC4858692 | BioStudies
S-EPMC5703283 | BioStudies
S-EPMC3397394 | BioStudies
S-EPMC2490767 | BioStudies
S-EPMC5989205 | BioStudies
S-EPMC4937318 | BioStudies