Unknown

Dataset Information

0

KCML: a machine-learning framework for inference of multi-scale gene functions from genetic perturbation screens.


ABSTRACT: Characterising context-dependent gene functions is crucial for understanding the genetic bases of health and disease. To date, inference of gene functions from large-scale genetic perturbation screens is based on ad hoc analysis pipelines involving unsupervised clustering and functional enrichment. We present Knowledge- and Context-driven Machine Learning (KCML), a framework that systematically predicts multiple context-specific functions for a given gene based on the similarity of its perturbation phenotype to those with known function. As a proof of concept, we test KCML on three datasets describing phenotypes at the molecular, cellular and population levels and show that it outperforms traditional analysis pipelines. In particular, KCML identified an abnormal multicellular organisation phenotype associated with the depletion of olfactory receptors, and TGFβ and WNT signalling genes in colorectal cancer cells. We validate these predictions in colorectal cancer patients and show that olfactory receptors expression is predictive of worse patient outcomes. These results highlight KCML as a systematic framework for discovering novel scale-crossing and context-dependent gene functions. KCML is highly generalisable and applicable to various large-scale genetic perturbation screens.

SUBMITTER: Sailem HZ 

PROVIDER: S-EPMC7059140 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

KCML: a machine-learning framework for inference of multi-scale gene functions from genetic perturbation screens.

Sailem Heba Z HZ   Rittscher Jens J   Pelkmans Lucas L  

Molecular systems biology 20200301 3


Characterising context-dependent gene functions is crucial for understanding the genetic bases of health and disease. To date, inference of gene functions from large-scale genetic perturbation screens is based on ad hoc analysis pipelines involving unsupervised clustering and functional enrichment. We present Knowledge- and Context-driven Machine Learning (KCML), a framework that systematically predicts multiple context-specific functions for a given gene based on the similarity of its perturbat  ...[more]

Similar Datasets

| S-EPMC8277066 | biostudies-literature
2023-08-07 | GSE231345 | GEO
| S-EPMC10277617 | biostudies-literature
2023-08-07 | GSE231344 | GEO
2023-08-07 | GSE231343 | GEO
| S-EPMC9654597 | biostudies-literature
| S-EPMC11807228 | biostudies-literature
| S-EPMC9751287 | biostudies-literature
| S-EPMC10245690 | biostudies-literature
| S-EPMC10812086 | biostudies-literature