Project description:Studying the functional consequences of structural variants (SVs) in mammalian genomes is challenging because: 1) SVs arise much less commonly than single nucleotide variants or small indels; and 2) methods to generate, map and characterize SVs in model systems are underdeveloped. To address these challenges, we developed Genome-Shuffle-seq, a method that enables the multiplex generation and mapping of thousands of SVs (deletions, inversions, translocations, extrachromosomal circles) throughout mammalian genomes. We also demonstrate the co-capture of SV identity with single-cell transcriptomes, facilitating the measurement of SVs’ impact on gene expression. We anticipate Genome-Shuffle-seq will be broadly useful for the systematic exploration of the functional consequences of SVs on gene expression, chromatin landscape, and 3D nuclear architecture, while also initiating a path towards a minimal mammalian genome.
Project description:Studying the functional consequences of structural variants (SVs) in mammalian genomes is challenging because: 1) SVs arise much less commonly than single nucleotide variants or small indels; and 2) methods to generate, map and characterize SVs in model systems are underdeveloped. To address these challenges, we developed Genome-Shuffle-seq, a method that enables the multiplex generation and mapping of thousands of SVs (deletions, inversions, translocations, extrachromosomal circles) throughout mammalian genomes. We also demonstrate the co-capture of SV identity with single-cell transcriptomes, facilitating the measurement of SVs’ impact on gene expression. We anticipate Genome-Shuffle-seq will be broadly useful for the systematic exploration of the functional consequences of SVs on gene expression, chromatin landscape, and 3D nuclear architecture, while also initiating a path towards a minimal mammalian genome.
Project description:Studying the functional consequences of structural variants (SVs) in mammalian genomes is challenging because: 1) SVs arise much less commonly than single nucleotide variants or small indels; and 2) methods to generate, map and characterize SVs in model systems are underdeveloped. To address these challenges, we developed Genome-Shuffle-seq, a method that enables the multiplex generation and mapping of thousands of SVs (deletions, inversions, translocations, extrachromosomal circles) throughout mammalian genomes. We also demonstrate the co-capture of SV identity with single-cell transcriptomes, facilitating the measurement of SVs’ impact on gene expression. We anticipate Genome-Shuffle-seq will be broadly useful for the systematic exploration of the functional consequences of SVs on gene expression, chromatin landscape, and 3D nuclear architecture, while also initiating a path towards a minimal mammalian genome.
Project description:Genetic variation amongst individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single-nucleotide changes. In this manuscript we explore variation on an intermediate scale-particularly insertions, deletions, and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number among individuals. Sequencing of a subset of structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence-map of human structural variation-an important standard for genotyping platforms and a prelude to future individual genome sequencing projects. Keywords: comparitive genomic hybridization, copy number variation, structural variation, fosmid end sequencing CGH analysis targeted against sites identified by fosmid end sequencing. 8 HapMap samples (sources of libraries ABC7-ABC14) are hybed against NA15510 (source of fosmid library G248).