Genomics

Dataset Information


HydRA: Deep-learning models for predicting RNA-binding capacity from protein interaction association context and protein sequence


ABSTRACT: RNA-binding proteins (RBPs) control RNA metabolism to orchestrate gene expression, and dysfunctional RBPs underlie many human diseases. Proteome-wide discovery efforts predict thousands of novel RBPs, many of which lack canonical RNA-binding domains. Here, we present a hybrid ensemble RBP classifier (HydRA) that leverages information from both intermolecular protein interactions and internal protein sequence patterns to predict RNA-binding capacity with unparalleled specificity and sensitivity, using support vector machines, convolutional neural networks and transformer-based protein language models. HydRA enables Occlusion Mapping to robustly detect known RNA-binding domains and to predict hundreds of uncharacterized RNA-binding domains. Enhanced CLIP validation for a diverse collection of RBP candidates reveals genome-wide targets and confirms RNA-binding activity for HydRA-predicted domains. The HydRA computational framework accelerates construction of a comprehensive RBP catalogue and expands the set of known RNA-binding protein domains.
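
NOTE: The abstract names Occlusion Mapping but does not describe its procedure. The sketch below illustrates the general idea of occlusion analysis on a protein sequence: mask a sliding window of residues, re-score the sequence, and attribute the score drop to the masked positions. It is not HydRA's implementation. The function names, the window size, the mask character "X", and the toy scorer (fraction of R/K residues) are all assumptions made for illustration; in practice the scorer would be a trained classifier.

```python
from typing import Callable, List


def occlusion_map(
    seq: str,
    score: Callable[[str], float],
    window: int = 5,
    mask: str = "X",
) -> List[float]:
    """Return, per residue, the mean score drop over all masked windows covering it."""
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and len(seq)")
    baseline = score(seq)
    totals = [0.0] * len(seq)
    counts = [0] * len(seq)
    for start in range(len(seq) - window + 1):
        # Replace one window of residues with the mask character and re-score.
        occluded = seq[:start] + mask * window + seq[start + window:]
        drop = baseline - score(occluded)
        for i in range(start, start + window):
            totals[i] += drop
            counts[i] += 1
    return [t / c for t, c in zip(totals, counts)]


def toy_score(seq: str) -> float:
    # Hypothetical stand-in for a trained RBP classifier (assumption):
    # the fraction of positively charged R/K residues in the sequence.
    return sum(r in "RK" for r in seq) / len(seq)


seq = "AAAAARKRKRAAAAA"  # made-up sequence with an R/K-rich stretch in the middle
drops = occlusion_map(seq, toy_score, window=3)
# Index of the residue whose occlusion lowers the score the most.
peak = max(range(len(seq)), key=drops.__getitem__)
```

Residues with the largest mean drop are the ones the scorer depends on most, so a contiguous run of high values would mark a candidate region for follow-up.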

ORGANISM(S): Homo sapiens

PROVIDER: GSE221870 | GEO | 2023/07/10

REPOSITORIES: GEO

Similar Datasets

2016-10-21 | MSV000080265 | MassIVE
2016-10-06 | GSE86035 | GEO
2021-09-01 | E-MTAB-9612 | biostudies-arrayexpress
2013-11-06 | E-GEOD-49309 | biostudies-arrayexpress
2022-07-09 | GSE207597 | GEO
2018-08-29 | PXD008914 | Pride
2021-08-25 | PXD024601 | Pride
2016-03-31 | PXD003664 | Pride
2017-01-16 | PXD003451 | Pride
2013-11-06 | GSE49309 | GEO