Unknown

Dataset Information

0

Inference of chromosome-specific copy numbers using population haplotypes.


ABSTRACT:

Background

Using microarray and sequencing platforms, a large number of copy number variations (CNVs) have been identified in humans. In practice, because our human genome is a diploid, these platforms are limited to or more accurate for detecting total copy numbers rather than chromosome-specific copy numbers at each of the two homologous chromosomes. Nevertheless, the analysis of linkage disequilibrium (LD) between CNVs and SNPs indicates that distinct copy numbers often sit on their own background haplotypes.

Results

We propose new computational models for inferring chromosome-specific copy numbers by distinguishing background haplotypes of each copy number. The formulated problems are shown to be NP-hard and approximation/heuristic algorithms are developed. Simulation indicates that our method is accurate and outperforms the existing approach. By testing the program in 60 parent-offspring trios, the inferred chromosome-specific copy numbers are highly consistent with the law of Mendelian inheritance. The distributions of copy numbers at chromosomal level are provided for 270 individuals in three HapMap panels.

Conclusions

The estimation of chromosome-specific copy numbers using microarray or sequencing platforms was often confounded by a number of factors. This study showed that the integration of background haplotypes is able to improve the accuracies of copy number estimation at chromosome level, especially for the CNVs having strong LD with SNPs in proximity.

SUBMITTER: Huang YT 

PROVIDER: S-EPMC3128032 | biostudies-literature | 2011 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Inference of chromosome-specific copy numbers using population haplotypes.

Huang Yao-Ting YT   Wu Min-Han MH  

BMC bioinformatics 20110524


<h4>Background</h4>Using microarray and sequencing platforms, a large number of copy number variations (CNVs) have been identified in humans. In practice, because our human genome is a diploid, these platforms are limited to or more accurate for detecting total copy numbers rather than chromosome-specific copy numbers at each of the two homologous chromosomes. Nevertheless, the analysis of linkage disequilibrium (LD) between CNVs and SNPs indicates that distinct copy numbers often sit on their o  ...[more]

Similar Datasets

| S-EPMC7743722 | biostudies-literature
| S-EPMC3249356 | biostudies-literature
| S-EPMC5625343 | biostudies-literature
| S-EPMC8466715 | biostudies-literature
| S-EPMC7205152 | biostudies-literature
| S-EPMC4341064 | biostudies-literature
| S-EPMC4331703 | biostudies-literature