Unknown

Dataset Information

0

Pangenomics enables genotyping of known structural variants in 5202 diverse genomes.


ABSTRACT: We introduce Giraffe, a pangenome short-read mapper that can efficiently map to a collection of haplotypes threaded through a sequence graph. Giraffe maps sequencing reads to thousands of human genomes at a speed comparable to that of standard methods mapping to a single reference genome. The increased mapping accuracy enables downstream improvements in genome-wide genotyping pipelines for both small variants and larger structural variants. We used Giraffe to genotype 167,000 structural variants, discovered in long-read studies, in 5202 diverse human genomes that were sequenced using short reads. We conclude that pangenomics facilitates a more comprehensive characterization of variation and, as a result, has the potential to improve many genomic analyses.

SUBMITTER: Siren J 

PROVIDER: S-EPMC9365333 | biostudies-literature | 2021 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications


We introduce Giraffe, a pangenome short-read mapper that can efficiently map to a collection of haplotypes threaded through a sequence graph. Giraffe maps sequencing reads to thousands of human genomes at a speed comparable to that of standard methods mapping to a single reference genome. The increased mapping accuracy enables downstream improvements in genome-wide genotyping pipelines for both small variants and larger structural variants. We used Giraffe to genotype 167,000 structural variants  ...[more]

Similar Datasets

| S-EPMC9780226 | biostudies-literature
| S-EPMC11897243 | biostudies-literature
| S-EPMC6664100 | biostudies-literature
| S-EPMC10664547 | biostudies-literature
| S-EPMC6499320 | biostudies-literature
| S-EPMC8496513 | biostudies-literature
| S-EPMC6881350 | biostudies-literature
| S-EPMC6724671 | biostudies-literature
| S-EPMC9637116 | biostudies-literature
2024-12-01 | GSE282636 | GEO