Genomics

Dataset Information

0

Array-CGH and Next-generation sequencing of duplication CNVs reveals that most are tandem and some disrupt genes at breakpoints


ABSTRACT: Interpreting the genomic and phenotypic consequences of copy number variation (CNV) is essential to understand the etiology of genetic disorders. Whereas deletion CNVs obviously lead to haploinsufficiency, duplications may cause disease through triplosensitivity, gene disruption, or gene fusion at breakpoints. The mutational spectrum of duplications has been studied at certain loci and in some cases these copy number gains are complex chromosome rearrangements involving triplications and/or inversions. However, the organization of clinically relevant duplications throughout the genome has not been investigated on a large scale. Here, we fine mapped 184 germline duplications (14.7 kb-25.3 Mb; median 532 kb) ascertained from individuals referred for diagnostic cytogenetics testing. We performed next-generation sequencing (NGS) and whole-genome sequencing (WGS) to sequence 130 breakpoints from 112 subjects with 119 CNVs and found that most (83%) were tandem duplications in direct orientation. The remainder were triplications embedded within duplications (8.4%), adjacent duplications (4.2%), insertional translocations (2.5%), or other complex rearrangements (1.7%). In addition, we predicted six in-frame fusion genes at sequenced duplication breakpoints. Four gene fusions were formed by tandem duplications, one by two interconnected duplications, and one by duplication inserted at another locus. These novel fusion genes could be related to clinical phenotypes and warrant further study. Though most duplications are positioned head-to-tail adjacent to the original locus, those that are inverted, triplicated, or inserted can disrupt or fuse genes in a manner that may not be predicted by conventional copy number analysis. Thus, interpreting the genetic consequences of duplication CNVs requires breakpoint-level analysis.

ORGANISM(S): Homo sapiens

PROVIDER: GSE62657 | GEO | 2014/10/25

SECONDARY ACCESSION(S): PRJNA264832

REPOSITORIES: GEO

Similar Datasets

2006-08-10 | GSE5373 | GEO
| E-GEOD-45395 | biostudies-arrayexpress
2013-04-03 | GSE45395 | GEO
| E-GEOD-26954 | biostudies-arrayexpress
| E-GEOD-27080 | biostudies-arrayexpress
2011-04-06 | GSE26954 | GEO
2011-02-28 | GSE27080 | GEO
| E-GEOD-57246 | biostudies-arrayexpress
2014-06-30 | GSE57246 | GEO
| E-GEOD-67871 | biostudies-arrayexpress