
Dataset Information


Optimizing copy number variation analysis using genome-wide short sequence oligonucleotide arrays.


ABSTRACT: The detection of copy number variants (CNV) by array-based platforms provides valuable insight into understanding human diversity. However, suboptimal study design and data processing negatively affect CNV assessment. We quantitatively evaluate their impact when short-sequence oligonucleotide arrays are applied (Affymetrix Genome-Wide Human SNP Array 6.0) by evaluating 42 HapMap samples for CNV detection. Several processing and segmentation strategies are implemented, and results are compared to CNV assessment obtained using an oligonucleotide array CGH platform designed to query CNVs at high resolution (Agilent). We quantitatively demonstrate that different reference models (e.g. single versus pooled sample reference) used to detect CNVs are a major source of inter-platform discrepancy (up to 30%) and that CNVs residing within segmental duplication regions (higher reference copy number) are significantly harder to detect (P < 0.0001). After adjusting Affymetrix data to mimic the Agilent experimental design (reference sample effect), we applied several common segmentation approaches and evaluated differential sensitivity and specificity for CNV detection, ranging 39-77% and 86-100% for non-segmental duplication regions, respectively, and 18-55% and 39-77% for segmental duplications. Our results are relevant to any array-based CNV study and provide guidelines to optimize performance based on study-specific objectives.
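
The sensitivity and specificity figures quoted in the abstract come from comparing CNV calls on one platform against calls on a reference platform. The following is a minimal sketch of that kind of comparison, not the authors' pipeline: it assumes calls have already been reduced to one yes/no value per (sample, region) pair, that a pair missing from the test set means "no CNV called", and that the function name, sample labels, and region labels are all hypothetical.

```python
from typing import Dict, Tuple

# (sample_id, region_id); both identifiers are hypothetical placeholders.
Key = Tuple[str, str]


def sensitivity_specificity(
    reference: Dict[Key, bool], test: Dict[Key, bool]
) -> Tuple[float, float]:
    """Compare test-platform CNV calls against reference-platform calls.

    Only (sample, region) pairs present in `reference` are scored.
    Assumption: a pair absent from `test` counts as "no CNV called".
    """
    tp = fp = tn = fn = 0
    for key, ref_call in reference.items():
        test_call = test.get(key, False)
        if ref_call and test_call:
            tp += 1  # CNV in reference, detected by test platform
        elif ref_call:
            fn += 1  # CNV in reference, missed by test platform
        elif test_call:
            fp += 1  # no CNV in reference, but called by test platform
        else:
            tn += 1  # no CNV on either platform
    sens = tp / (tp + fn) if (tp + fn) else float("nan")
    spec = tn / (tn + fp) if (tn + fp) else float("nan")
    return sens, spec


# Toy data for illustration only; not taken from the study.
reference_calls = {
    ("sample_A", "region_1"): True,
    ("sample_A", "region_2"): True,
    ("sample_A", "region_3"): False,
    ("sample_B", "region_1"): False,
}
test_calls = {
    ("sample_A", "region_1"): True,
    ("sample_A", "region_3"): False,
    ("sample_B", "region_1"): True,
}
sens, spec = sensitivity_specificity(reference_calls, test_calls)
```

Scoring regions inside and outside segmental duplications separately, as the abstract reports, would amount to calling the function once per subset of regions.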

SUBMITTER: Oldridge DA 

PROVIDER: S-EPMC2879534 | biostudies-literature | 2010 Jun

REPOSITORIES: biostudies-literature


Publications

Optimizing copy number variation analysis using genome-wide short sequence oligonucleotide arrays.

Oldridge DA, Banerjee S, Setlur SR, Sboner A, Demichelis F

Nucleic Acids Research, 2010-02-15, issue 10



Similar Datasets

| S-EPMC1665641 | biostudies-literature
| S-EPMC3525261 | biostudies-literature
| S-EPMC4490277 | biostudies-other
| S-EPMC3153341 | biostudies-literature
| S-EPMC5345089 | biostudies-other
| S-EPMC4875690 | biostudies-literature
| S-EPMC2761258 | biostudies-literature
| S-EPMC2925224 | biostudies-literature
| S-EPMC2981564 | biostudies-literature
| S-EPMC3464621 | biostudies-literature