Dataset Information

Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation.


ABSTRACT: Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to off-target allele sequences, which can affect the accuracy greatly. In this study, we address this issue and demonstrate that hybridization with mismatch nucleotides (HWMMN) occurs in all SNP probe-sets and has a critical effect on the estimation of allelic concentrations (ACs). We study sequence binding through binding free energy and then binding affinity, and develop a probe intensity composite representation (PICR) model. The PICR model allows the estimation of ACs at a given SNP through statistical regression. Furthermore, we demonstrate with cell-line data of known true copy numbers that the PICR model can achieve reasonable accuracy in copy number estimation at a single SNP locus, by using the ratio of the estimated AC of each sample to that of the reference sample, and can reveal subtle genotype structure of SNPs at abnormal loci. We also demonstrate with HapMap data that the PICR model yields accurate SNP genotype calls consistently across samples, laboratories and even across array platforms.
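The regression and ratio steps described in the abstract can be illustrated with a minimal sketch. This is not the published PICR formulation. It assumes a simplified linear model in which each probe's intensity is a baseline plus the concentration of each allele weighted by that probe's binding affinity for it, so that every probe picks up signal from both its target and its off-target allele. The function names, the affinity values, the synthetic intensities, and the two-copy reference are all hypothetical choices made for illustration.

```python
def solve_linear_system(matrix, rhs):
    """Solve a small square system by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    solution = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][k] * solution[k] for k in range(row + 1, n))
        solution[row] = (aug[row][n] - tail) / aug[row][row]
    return solution


def estimate_allelic_concentrations(intensities, affinity_a, affinity_b):
    """Ordinary least squares for I_k = baseline + N_A * a_k + N_B * b_k.

    Assumed model: a_k and b_k are known per-probe binding affinities
    for alleles A and B. Returns (baseline, N_A, N_B).
    """
    rows = [[1.0, a, b] for a, b in zip(affinity_a, affinity_b)]
    # Normal equations: (X^T X) beta = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, intensities)) for i in range(3)]
    baseline, conc_a, conc_b = solve_linear_system(xtx, xty)
    return baseline, conc_a, conc_b


def copy_number(sample_total, reference_total, reference_copies=2.0):
    """Scale the sample-to-reference concentration ratio by the assumed reference copy number."""
    return reference_copies * sample_total / reference_total


# Hypothetical affinities for six probes in one probe-set.
affinity_a = [1.0, 0.8, 0.3, 0.2, 0.6, 0.1]
affinity_b = [0.2, 0.1, 0.9, 1.0, 0.5, 0.4]

# Synthetic, noise-free intensities: reference is AB (50, 50), sample is AAB (100, 50).
reference_intensities = [100 + 50 * a + 50 * b for a, b in zip(affinity_a, affinity_b)]
sample_intensities = [100 + 100 * a + 50 * b for a, b in zip(affinity_a, affinity_b)]

_, ref_a, ref_b = estimate_allelic_concentrations(reference_intensities, affinity_a, affinity_b)
_, smp_a, smp_b = estimate_allelic_concentrations(sample_intensities, affinity_a, affinity_b)

estimated_copy_number = copy_number(smp_a + smp_b, ref_a + ref_b)
print(round(estimated_copy_number, 3))
```

With this noise-free toy data the estimate comes out at about three copies, and the separate A and B estimates show which allele was gained. On real arrays the affinities would have to be derived from probe sequences, and the intensities carry noise, so the fit would be approximate.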

SUBMITTER: Wan L 

PROVIDER: S-EPMC2761258 | biostudies-literature | 2009 Sep

REPOSITORIES: biostudies-literature

Publications

Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation.

Wan L, Sun K, Ding Q, Cui Y, Li M, Wen Y, Elston RC, Qian M, Fu WJ

Nucleic Acids Research, 2009-07-07, issue 17


Similar Datasets

S-EPMC3525261 | biostudies-literature
S-EPMC3006124 | biostudies-literature
S-EPMC1665641 | biostudies-literature
S-EPMC3029233 | biostudies-literature
S-EPMC2732310 | biostudies-other
S-EPMC2879534 | biostudies-literature
S-EPMC1175992 | biostudies-literature
S-EPMC3834792 | biostudies-literature
S-EPMC3023756 | biostudies-literature