Project description:The process of producing the GRC zebrafish assembly included the high-quality sequencing and finishing of clones representing alternate haplotypes to corresponding regions in the current primary assembly. This project reports the variation between those alternate haplotype clones and the primary assembly.
Project description:Constructing high-quality haplotype-resolved genome assemblies has substantially improved the ability to detect and characterize genetic variants. A targeted approach providing readily access to the rich information from haplotype-resolved genome assemblies will be appealing to groups of basic researchers and medical scientists focused on specific genomic regions. Here, using the 4.5 megabase, notoriously difficult-to-assemble major histocompatibility complex (MHC) region as an example, we demonstrated an approach to construct haplotype-resolved assembly of the targeted genomic region with the CRISPR-based enrichment. Compared to the results from haplotype-resolved genome assembly, our targeted approach achieved comparable completeness and accuracy with reduced computing complexity, sequencing cost, as well as the amount of starting materials. Moreover, using the targeted assembled personal MHC haplotypes as the reference both improves the quantification accuracy for sequencing data and enables allele-specific functional genomics analyses of the MHC region. Given its highly efficient use of resources, our approach can greatly facilitate population genetic studies of targeted regions, and may pave a new way to elucidate the molecular mechanisms in disease etiology.
Project description:Constructing high-quality haplotype-resolved genome assemblies has substantially improved the ability to detect and characterize genetic variants. A targeted approach providing readily access to the rich information from haplotype-resolved genome assemblies will be appealing to groups of basic researchers and medical scientists focused on specific genomic regions. Here, using the 4.5 megabase, notoriously difficult-to-assemble major histocompatibility complex (MHC) region as an example, we demonstrated an approach to construct haplotype-resolved assembly of the targeted genomic region with the CRISPR-based enrichment. Compared to the results from haplotype-resolved genome assembly, our targeted approach achieved comparable completeness and accuracy with reduced computing complexity, sequencing cost, as well as the amount of starting materials. Moreover, using the targeted assembled personal MHC haplotypes as the reference both improves the quantification accuracy for sequencing data and enables allele-specific functional genomics analyses of the MHC region. Given its highly efficient use of resources, our approach can greatly facilitate population genetic studies of targeted regions, and may pave a new way to elucidate the molecular mechanisms in disease etiology.
Project description:Dikaryotic rust fungi maintain two distinct haploid nuclei for most of their life cycle, making their large, repeat-rich genomes difficult to assemble and phase. Here we present haplotype-phased, near chromosome-scale genome assemblies for the poplar rust pathogens Melampsora larici-populina 98AG31 and Melampsora allii-populina 12AY07, generated using PacBio HiFi sequencing and Hi-C-guided scaffolding. For each species, we resolve 18 chromosomes per haplotype, providing the first chromosome-level representations of poplar rust fungal species. M. larici-populina diploid assembly spans ~203 Mb, while M. allii-populina reaches ~416 Mb, with high completeness and strong collinearity between haplotypes.