Unknown

Dataset Information

0

Linkage disequilibrium based genotype calling from low-coverage shotgun sequencing reads.


ABSTRACT: BACKGROUND: Recent technology advances have enabled sequencing of individual genomes, promising to revolutionize biomedical research. However, deep sequencing remains more expensive than microarrays for performing whole-genome SNP genotyping. RESULTS: In this paper we introduce a new multi-locus statistical model and computationally efficient genotype calling algorithms that integrate shotgun sequencing data with linkage disequilibrium (LD) information extracted from reference population panels such as Hapmap or the 1000 genomes project. Experiments on publicly available 454, Illumina, and ABI SOLiD sequencing datasets suggest that integration of LD information results in genotype calling accuracy comparable to that of microarray platforms from sequencing data of low-coverage. A software package implementing our algorithm, released under the GNU General Public License, is available at http://dna.engr.uconn.edu/software/GeneSeq/. CONCLUSIONS: Integration of LD information leads to significant improvements in genotype calling accuracy compared to prior LD-oblivious methods, rendering low-coverage sequencing as a viable alternative to microarrays for conducting large-scale genome-wide association studies.

SUBMITTER: Duitama J 

PROVIDER: S-EPMC3044311 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

Linkage disequilibrium based genotype calling from low-coverage shotgun sequencing reads.

Duitama Jorge J   Kennedy Justin J   Dinakar Sanjiv S   Hernández Yözen Y   Wu Yufeng Y   Măndoiu Ion I II  

BMC bioinformatics 20110215


<h4>Background</h4>Recent technology advances have enabled sequencing of individual genomes, promising to revolutionize biomedical research. However, deep sequencing remains more expensive than microarrays for performing whole-genome SNP genotyping.<h4>Results</h4>In this paper we introduce a new multi-locus statistical model and computationally efficient genotype calling algorithms that integrate shotgun sequencing data with linkage disequilibrium (LD) information extracted from reference popul  ...[more]

Similar Datasets

| S-EPMC5972415 | biostudies-other
| S-EPMC4244156 | biostudies-literature
| S-EPMC3493122 | biostudies-literature
| S-EPMC3848615 | biostudies-literature
| S-EPMC8329942 | biostudies-literature
| S-EPMC5427492 | biostudies-literature
| S-EPMC3777110 | biostudies-literature
| S-EPMC1142312 | biostudies-literature
| S-EPMC2577856 | biostudies-literature