Dataset Information


Phasebook: haplotype-aware de novo assembly of diploid genomes from long reads

ABSTRACT: Haplotype-aware diploid genome assembly is crucial in genomics, precision medicine, and many other disciplines. Long-read sequencing technologies have greatly improved genome assembly. However, current long-read assemblers are either reference-based, and therefore introduce biases, or fail to capture the haplotype diversity of diploid genomes. We present phasebook, a de novo approach for reconstructing the haplotypes of diploid genomes from long reads. phasebook outperforms other approaches in haplotype coverage by large margins, while achieving competitive performance in assembly errors and assembly contiguity.

Supplementary Information

The online version contains supplementary material available at 10.1186/s13059-021-02512-x.

PROVIDER: S-EPMC8549298 | BioStudies

REPOSITORIES: biostudies
