Genomics

Dataset Information

High dimensional association detection in large-scale genomic data


ABSTRACT: Joint analyses of genomic datasets obtained under multiple conditions are essential for understanding the biological mechanism that drives tissue specificity and cell differentiation, but they remain computationally challenging. To address this, we introduce CLIMB (Composite LIkelihood eMpirical Bayes), a statistical methodology that learns patterns of condition specificity present in genomic data. CLIMB provides a generic framework that facilitates a host of analyses, such as clustering genomic features that share similar condition-specific patterns and identifying which of these features are involved in cell fate commitment. Our approach improves upon existing methods by boosting statistical power to identify meaningful signals while retaining interpretability and computational tractability. We illustrate CLIMB's value on two sets of hematopoietic data: one studying CTCF ChIP-seq measured in 17 different cell populations, and another examining RNA-seq measured across constituent cell populations in three committed lineages. These analyses demonstrate that CLIMB captures biologically relevant clusters in the data and improves upon the commonly used pairwise comparisons and unsupervised clusterings typical of genomic analyses.

ORGANISM(S): Mus musculus, Homo sapiens

PROVIDER: GSE156074 | GEO | 2020/11/18

REPOSITORIES: GEO

Similar Datasets

2015-12-10 | E-MTAB-4119 | biostudies-arrayexpress
2017-06-30 | GSE88824 | GEO
2023-09-05 | PXD044264 | Pride
2016-06-15 | E-GEOD-76763 | biostudies-arrayexpress
2016-09-23 | GSE87209 | GEO
2016-06-15 | GSE76763 | GEO
2022-10-05 | GSE214633 | GEO
2019-08-11 | E-MTAB-8192 | biostudies-arrayexpress
2018-04-18 | GSE103964 | GEO
| PRJNA312932 | ENA