Unknown

Dataset Information

0

GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data.


ABSTRACT: Graph and image are two common representations of Hi-C cis-contact maps. Existing computational tools have only adopted Hi-C data modeled as unitary data structures but neglected the potential advantages of synergizing the information of different views. Here we propose GILoop, a dual-branch neural network that learns from both representations to identify genome-wide CTCF-mediated loops. With GILoop, we explore the combined strength of integrating the two view representations of Hi-C data and corroborate the complementary relationship between the views. In particular, the model outperforms the state-of-the-art loop calling framework and is also more robust against low-quality Hi-C libraries. We also uncover distinct preferences for matrix density by graph-based and image-based models, revealing interesting insights into Hi-C data elucidation. Finally, along with multiple transfer-learning case studies, we demonstrate that GILoop can accurately model the organizational and functional patterns of CTCF-mediated looping across different cell lines.

SUBMITTER: Wang F 

PROVIDER: S-EPMC9700007 | biostudies-literature | 2022 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

GILoop: Robust chromatin loop calling across multiple sequencing depths on Hi-C data.

Wang Fuzhou F   Gao Tingxiao T   Lin Jiecong J   Zheng Zetian Z   Huang Lei L   Toseef Muhammad M   Li Xiangtao X   Wong Ka-Chun KC  

iScience 20221110 12


Graph and image are two common representations of Hi-C <i>cis</i>-contact maps. Existing computational tools have only adopted Hi-C data modeled as unitary data structures but neglected the potential advantages of synergizing the information of different views. Here we propose GILoop, a dual-branch neural network that learns from both representations to identify genome-wide CTCF-mediated loops. With GILoop, we explore the combined strength of integrating the two view representations of Hi-C data  ...[more]

Similar Datasets

| S-EPMC10958853 | biostudies-literature
| S-EPMC11444182 | biostudies-literature
| S-EPMC2099500 | biostudies-literature
| S-EPMC9270730 | biostudies-literature
| S-EPMC3799458 | biostudies-literature
| S-EPMC11106031 | biostudies-literature
| S-EPMC10954456 | biostudies-literature
| S-EPMC5427492 | biostudies-literature
| S-EPMC2575221 | biostudies-literature
| S-EPMC6343296 | biostudies-literature