Dataset Information


CNVineta: a data mining tool for large case-control copy number variation datasets.

ABSTRACT: MOTIVATION: Copy number variation (CNV), a major contributor to human genetic variation, comprises >/= 1 kb genomic deletions and insertions. Yet, the identification of CNVs from microarray data is still hampered by high false negative and positive prediction rates due to the noisy nature of the raw data. Here, we present CNVineta, an R package for rapid data mining and visualization of CNVs in large case-control datasets genotyped with single nucleotide polymorphism oligonucleotide arrays. CNVineta is compatible with various established CNV prediction algorithms, can be used for genome-wide association analysis of rare and common CNVs and enables rapid and serial display of log(2) of raw data ratios as well as B-allele frequencies for visual quality inspection. In summary, CNVineta aides in the interpretation of large-scale CNV datasets and prioritization of target regions for follow-up experiments. AVAILABILITY AND IMPLEMENTATION: CNVineta is available as an R package and can be downloaded from http://www.ikmb.uni-kiel.de/CNVineta/; the package contains a tutorial outlining a typical workflow. The CNVineta compatible HapMap dataset can also be downloaded from the link above.

PROVIDER: S-EPMC2922892 | BioStudies | 2010-01-01T00:00:00Z

REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC2913665 | BioStudies
2015-01-01 | S-EPMC4676147 | BioStudies
2019-01-01 | S-EPMC6411622 | BioStudies
2019-01-01 | S-EPMC6468244 | BioStudies
| S-EPMC4896365 | BioStudies
2012-01-01 | S-EPMC3476336 | BioStudies
1000-01-01 | S-EPMC2915992 | BioStudies
2015-01-01 | S-EPMC4510559 | BioStudies
2017-01-01 | S-EPMC5515741 | BioStudies
2016-01-01 | S-EPMC4976506 | BioStudies