Unknown

Dataset Information

0

ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data.


ABSTRACT:

Background

A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically-relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, typically 1-5 Mb, but currently available CNV callers have been developed and benchmarked for the discovery of smaller variants. Thus, the ability of these programs to detect tens of real syndromic CNVs remains largely unknown.

Results

Here we present ConanVarvar, a tool which implements a complete workflow for the targeted analysis of large germline CNVs from WGS data. ConanVarvar comes with an intuitive R Shiny graphical user interface and annotates identified variants with information about 56 associated syndromic conditions. We benchmarked ConanVarvar and four other programs on a dataset containing real and simulated syndromic CNVs larger than 1 Mb. In comparison to other tools, ConanVarvar reports 10-30 times less false-positive variants without compromising sensitivity and is quicker to run, especially on large batches of samples.

Conclusions

ConanVarvar is a useful instrument for primary analysis in disease sequencing studies, where large CNVs could be the cause of disease.

SUBMITTER: Gudkov M 

PROVIDER: S-EPMC9930243 | biostudies-literature | 2023 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data.

Gudkov Mikhail M   Thibaut Loïc L   Khushi Matloob M   Blue Gillian M GM   Winlaw David S DS   Dunwoodie Sally L SL   Giannoulatou Eleni E  

BMC bioinformatics 20230215 1


<h4>Background</h4>A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically-relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, typically 1-5 Mb, but currently available CNV callers have been developed and benchmarked for the discovery of smaller variants. Thus, the ability of these programs to detect tens of real syndrom  ...[more]

Similar Datasets

| S-EPMC3481445 | biostudies-literature
| S-EPMC7520931 | biostudies-literature
| S-EPMC7059689 | biostudies-literature
| S-EPMC10403694 | biostudies-literature
| S-EPMC10710158 | biostudies-literature
| S-EPMC4381046 | biostudies-literature
| S-EPMC7701142 | biostudies-literature
| S-EPMC8612020 | biostudies-literature
| S-EPMC9995335 | biostudies-literature
| S-EPMC9468235 | biostudies-literature