Dataset Information

Reference-based phasing using the Haplotype Reference Consortium panel.


ABSTRACT: Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts but attains lower accuracy in smaller cohorts. Here we instead explore the paradigm of reference-based phasing. We introduce a new phasing algorithm, Eagle2, that attains high accuracy across a broad range of cohort sizes by efficiently leveraging information from large external reference panels (such as the Haplotype Reference Consortium; HRC) using a new data structure based on the positional Burrows-Wheeler transform. We demonstrate that Eagle2 attains a ≈20× speedup and ≈10% increase in accuracy compared to reference-based phasing using SHAPEIT2. On European-ancestry samples, Eagle2 with the HRC panel achieves >2× the accuracy of 1000 Genomes-based phasing. Eagle2 is open source and freely available for HRC-based phasing via the Sanger Imputation Service and the Michigan Imputation Server.
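
For readers unfamiliar with the positional Burrows-Wheeler transform (PBWT) mentioned in the abstract, the following is a minimal sketch of its core step: building the positional prefix arrays for a small panel of binary haplotypes. It illustrates the basic textbook PBWT only, not Eagle2's own data structure, which the abstract describes only as "based on" the PBWT. The function name `pbwt_prefix_arrays` and the toy panel are hypothetical and chosen for illustration.

```python
from typing import List, Sequence


def pbwt_prefix_arrays(haps: Sequence[Sequence[int]]) -> List[List[int]]:
    """Return positional prefix arrays a_0..a_N for binary haplotypes.

    haps[h][k] is the allele (0 or 1) of haplotype h at site k.
    arrays[k] lists haplotype indices sorted by their reversed prefix
    haps[h][k-1], haps[h][k-2], ..., haps[h][0], so haplotypes that share
    a long match ending just before site k sit next to each other.
    """
    m = len(haps)
    n = len(haps[0]) if m else 0
    order = list(range(m))   # a_0: original input order
    arrays = [order[:]]
    for k in range(n):
        zeros, ones = [], []
        # Stable partition by the allele at site k: this is one pass of a
        # radix sort, so the ordering from earlier sites is preserved.
        for h in order:
            (ones if haps[h][k] else zeros).append(h)
        order = zeros + ones
        arrays.append(order[:])
    return arrays


# Toy panel (hypothetical data): 4 haplotypes, 3 biallelic sites.
panel = [
    [0, 1, 0],
    [1, 0, 1],
    [0, 1, 1],
    [0, 0, 0],
]
prefix_arrays = pbwt_prefix_arrays(panel)
```

Each site costs one linear pass over the haplotypes, which is why PBWT-style structures scale to large reference panels; a practical implementation would also track divergence (match-length) values, which this sketch omits.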

SUBMITTER: Loh PR 

PROVIDER: S-EPMC5096458 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4920110 | biostudies-literature
| S-EPMC4579394 | biostudies-literature
| S-EPMC9314402 | biostudies-literature
| EGAS00001001710 | EGA
| S-EPMC3777110 | biostudies-literature
| S-EPMC6504677 | biostudies-literature
| S-EPMC4756330 | biostudies-literature
| S-EPMC6199332 | biostudies-literature
| S-EPMC6822470 | biostudies-literature
| S-EPMC4338501 | biostudies-literature