Metabolomics,Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Diversity of Human Copy Number Variation and Multicopy Genes


ABSTRACT: Copy number variants (CNVs) affect both disease and normal phenotypic variation but those lying within heavily duplicated, highly identical sequence have been difficult to assay. By analyzing short-read mapping depth for 159 human genomes, we demonstrate accurate estimation of absolute copy number for duplications as small as 1.9 kbp, ranging from 0-48 copies. We identified 4.1 million ‘singly unique nucleotide’ (SUN) positions informative in distinguishing specific copies, and use them to genotype the copy and content of specific paralogs within highly duplicated gene families. These data identify human-specific expansions in genes associated with brain development, reveal extensive population genetic diversity, and detect signatures consistent with gene conversion in the human species. Our approach makes ~1000 genes accessible to genetic studies of disease association. This dataset complements the results from short read sequencing by performing validation on five individuals. We analyzed the 17q21.1 locus in 5 HapMap individuals by array CGH on a custom Agilent 4-plex 310k array performing 1 experiment for each sample. This array was targeted at high density (1 probe/105bp) to 7 genomic loci, including 17q21. The reference individual used was NA19240.

ORGANISM(S): Homo sapiens

SUBMITTER: Peter Sudmant 

PROVIDER: E-GEOD-24334 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

Similar Datasets

2010-02-26 | E-MEXP-2534 | biostudies-arrayexpress
2010-09-01 | E-MEXP-2788 | biostudies-arrayexpress
2011-10-27 | E-MEXP-2964 | biostudies-arrayexpress
2012-02-27 | E-MEXP-3569 | biostudies-arrayexpress
2011-06-11 | E-MEXP-3002 | biostudies-arrayexpress
2011-06-11 | E-MEXP-2996 | biostudies-arrayexpress
2011-09-07 | E-MEXP-2909 | biostudies-arrayexpress
2011-09-23 | E-MEXP-2922 | biostudies-arrayexpress
2011-09-02 | E-MEXP-2967 | biostudies-arrayexpress
2010-10-18 | E-MEXP-2928 | biostudies-arrayexpress