Unknown

Dataset Information

0

HiCHap: a package to correct and analyze the diploid Hi-C data.


ABSTRACT:

Background

In diploid cells, it is important to construct maternal and paternal Hi-C contact maps respectively since the two homologous chromosomes can differ in chromatin three-dimensional (3D) organization. Though previous softwares could construct diploid (maternal and paternal) Hi-C contact maps by using phased genetic variants, they all neglected the systematic biases in diploid Hi-C contact maps caused by variable genetic variant density in the genome. In addition, few of softwares provided quantitative analyses on allele-specific chromatin 3D organization, including compartment, topological domain and chromatin loop.

Results

In this work, we revealed the feature of allele-assignment bias caused by the variable genetic variant density, and then proposed a novel strategy to correct the systematic biases in diploid Hi-C contact maps. Based on the bias correction, we developed an integrated tool, called HiCHap, to perform read mapping, contact map construction, whole-genome identification of compartments, topological domains and chromatin loops, and allele-specific testing for diploid Hi-C data. Our results show that the correction on allele-assignment bias in HiCHap does significantly improve the quality of diploid Hi-C contact maps, which subsequently facilitates the whole-genome identification of diploid chromatin 3D organization, including compartments, topological domains and chromatin loops. Finally, HiCHap also supports the data analysis for haploid Hi-C maps without distinguishing two homologous chromosomes.

Conclusions

We provided an integrated package HiCHap to perform the data processing, bias correction and structural analysis for diploid Hi-C data. The source code and tutorial of software HiCHap are freely available at https://pypi.org/project/HiCHap/ .

SUBMITTER: Luo H 

PROVIDER: S-EPMC7590616 | BioStudies | 2020-01-01

REPOSITORIES: biostudies

Similar Datasets

2020-01-01 | S-EPMC7708071 | BioStudies
2020-01-01 | S-EPMC7648276 | BioStudies
2020-01-01 | S-EPMC7036197 | BioStudies
2018-01-01 | S-EPMC6043163 | BioStudies
2019-01-01 | S-EPMC6612869 | BioStudies
2018-01-01 | S-EPMC6084597 | BioStudies
2017-01-01 | S-EPMC5591081 | BioStudies
2016-01-01 | S-EPMC4937202 | BioStudies
2020-01-01 | S-EPMC7528378 | BioStudies
2012-01-01 | S-EPMC3509491 | BioStudies