Unknown

Dataset Information

0

InsuLock: A Weakly Supervised Learning Approach for Accurate Insulator Prediction, and Variant Impact Quantification.


ABSTRACT: Mapping chromatin insulator loops is crucial to investigating genome evolution, elucidating critical biological functions, and ultimately quantifying variant impact in diseases. However, chromatin conformation profiling assays are usually expensive, time-consuming, and may report fuzzy insulator annotations with low resolution. Therefore, we propose a weakly supervised deep learning method, InsuLock, to address these challenges. Specifically, InsuLock first utilizes a Siamese neural network to predict the existence of insulators within a given region (up to 2000 bp). Then, it uses an object detection module for precise insulator boundary localization via gradient-weighted class activation mapping (~40 bp resolution). Finally, it quantifies variant impacts by comparing the insulator score differences between the wild-type and mutant alleles. We applied InsuLock on various bulk and single-cell datasets for performance testing and benchmarking. We showed that it outperformed existing methods with an AUROC of ~0.96 and condensed insulator annotations to ~2.5% of their original size while still demonstrating higher conservation scores and better motif enrichments. Finally, we utilized InsuLock to make cell-type-specific variant impacts from brain scATAC-seq data and identified a schizophrenia GWAS variant disrupting an insulator loop proximal to a known risk gene, indicating a possible new mechanism of action for the disease.

SUBMITTER: Srinivasan SS 

PROVIDER: S-EPMC9026820 | biostudies-literature | 2022 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

InsuLock: A Weakly Supervised Learning Approach for Accurate Insulator Prediction, and Variant Impact Quantification.

Srinivasan Shushrruth Sai SS   Gong Yanwen Y   Xu Siwei S   Hwang Ahyeon A   Xu Min M   Girgenti Matthew J MJ   Zhang Jing J  

Genes 20220330 4


Mapping chromatin insulator loops is crucial to investigating genome evolution, elucidating critical biological functions, and ultimately quantifying variant impact in diseases. However, chromatin conformation profiling assays are usually expensive, time-consuming, and may report fuzzy insulator annotations with low resolution. Therefore, we propose a weakly supervised deep learning method, InsuLock, to address these challenges. Specifically, InsuLock first utilizes a Siamese neural network to p  ...[more]

Similar Datasets

| S-EPMC9307817 | biostudies-literature
| S-EPMC9312957 | biostudies-literature
| S-EPMC10773112 | biostudies-literature
| S-EPMC10508757 | biostudies-literature
| S-EPMC8934548 | biostudies-literature
| S-EPMC11258139 | biostudies-literature
| S-EPMC11331614 | biostudies-literature
| S-EPMC10216803 | biostudies-literature
| S-EPMC8345494 | biostudies-literature
| S-EPMC9017143 | biostudies-literature