Unknown

Dataset Information

0

A comparative study of structural variant calling in WGS from Alzheimer's disease families.


ABSTRACT: Detecting structural variants (SVs) in whole-genome sequencing poses significant challenges. We present a protocol for variant calling, merging, genotyping, sensitivity analysis, and laboratory validation for generating a high-quality SV call set in whole-genome sequencing from the Alzheimer's Disease Sequencing Project comprising 578 individuals from 111 families. Employing two complementary pipelines, Scalpel and Parliament, for SV/indel calling, we assessed sensitivity through sample replicates (N = 9) with in silico variant spike-ins. We developed a novel metric, D-score, to evaluate caller specificity for deletions. The accuracy of deletions was evaluated by Sanger sequencing. We generated a high-quality call set of 152,301 deletions of diverse sizes. Sanger sequencing validated 114 of 146 detected deletions (78.1%). Scalpel excelled in accuracy for deletions ≤100 bp, whereas Parliament was optimal for deletions >900 bp. Overall, 83.0% and 72.5% of calls by Scalpel and Parliament were validated, respectively, including all 11 deletions called by both Parliament and Scalpel between 101 and 900 bp. Our flexible protocol successfully generated a high-quality deletion call set and a truth set of Sanger sequencing-validated deletions with precise breakpoints spanning 1-17,000 bp.

SUBMITTER: Malamon JS 

PROVIDER: S-EPMC10902710 | biostudies-literature | 2024 May

REPOSITORIES: biostudies-literature

altmetric image

Publications


Detecting structural variants (SVs) in whole-genome sequencing poses significant challenges. We present a protocol for variant calling, merging, genotyping, sensitivity analysis, and laboratory validation for generating a high-quality SV call set in whole-genome sequencing from the Alzheimer's Disease Sequencing Project comprising 578 individuals from 111 families. Employing two complementary pipelines, Scalpel and Parliament, for SV/indel calling, we assessed sensitivity through sample replicat  ...[more]

Similar Datasets

| S-EPMC7336445 | biostudies-literature
| S-EPMC11291474 | biostudies-literature
| S-EPMC8712436 | biostudies-literature
| S-EPMC9843587 | biostudies-literature
| S-EPMC7751401 | biostudies-literature
| S-EPMC9294411 | biostudies-literature
| S-EPMC6513159 | biostudies-literature
| S-EPMC6164060 | biostudies-literature
| S-EPMC8458033 | biostudies-literature
| S-EPMC6868818 | biostudies-literature