Dataset Information


Joint estimation of DNA copy number from multiple platforms.

ABSTRACT: DNA copy number variants (CNVs) are gains and losses of segments of chromosomes, and comprise an important class of genetic variation. Recently, various microarray hybridization-based techniques have been developed for high-throughput measurement of DNA copy number. In many studies, multiple technical platforms or different versions of the same platform were used to interrogate the same samples; and it became necessary to pool information across these multiple sources to derive a consensus molecular profile for each sample. An integrated analysis is expected to maximize resolution and accuracy, yet currently there is no well-formulated statistical method to address the between-platform differences in probe coverage, assay methods, sensitivity and analytical complexity.The conventional approach is to apply one of the CNV detection ('segmentation') algorithms to search for DNA segments of altered signal intensity. The results from multiple platforms are combined after segmentation. Here we propose a new method, Multi-Platform Circular Binary Segmentation (MPCBS), which pools statistical evidence across platforms during segmentation, and does not require pre-standardization of different data sources. It involves a weighted sum of t-statistics, which arises naturally from the generalized log-likelihood ratio of a multi-platform model. We show by comparing the integrated analysis of Affymetrix and Illumina SNP array data with Agilent and fosmid clone end-sequencing results on eight HapMap samples that MPCBS achieves improved spatial resolution, detection power and provides a natural consensus across platforms. We also apply the new method to analyze multi-platform data for tumor samples.The R package for MPCBS is registered on R-Forge (http://r-forge.r-project.org/) under project name MPCBS.Supplementary data are available at Bioinformatics online.


PROVIDER: S-EPMC2852203 | BioStudies | 2010-01-01

REPOSITORIES: biostudies

Similar Datasets

2013-01-01 | S-EPMC4053982 | BioStudies
2012-01-01 | S-EPMC3571941 | BioStudies
1000-01-01 | S-EPMC2685605 | BioStudies
1000-01-01 | S-EPMC3072561 | BioStudies
2016-01-01 | S-EPMC5870135 | BioStudies
2016-01-01 | S-EPMC5101202 | BioStudies
2009-01-01 | S-EPMC2866077 | BioStudies
2010-01-01 | S-EPMC2879534 | BioStudies
2019-01-01 | S-EPMC6736450 | BioStudies
2012-01-01 | S-EPMC3582591 | BioStudies