Unknown

Dataset Information

0

D-SCRIPT translates genome to phenome with sequence-based, structure-aware, genome-scale predictions of protein-protein interactions.


ABSTRACT: We combine advances in neural language modeling and structurally motivated design to develop D-SCRIPT, an interpretable and generalizable deep-learning model, which predicts interaction between two proteins using only their sequence and maintains high accuracy with limited training data and across species. We show that a D-SCRIPT model trained on 38,345 human PPIs enables significantly improved functional characterization of fly proteins compared with the state-of-the-art approach. Evaluating the same D-SCRIPT model on protein complexes with known 3D structure, we find that the inter-protein contact map output by D-SCRIPT has significant overlap with the ground truth. We apply D-SCRIPT to screen for PPIs in cow (Bos taurus) at a genome-wide scale and focusing on rumen physiology, identify functional gene modules related to metabolism and immune response. The predicted interactions can then be leveraged for function prediction at scale, addressing the genome-to-phenome challenge, especially in species where little data are available.

SUBMITTER: Sledzieski S 

PROVIDER: S-EPMC8586911 | biostudies-literature | 2021 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

D-SCRIPT translates genome to phenome with sequence-based, structure-aware, genome-scale predictions of protein-protein interactions.

Sledzieski Samuel S   Singh Rohit R   Cowen Lenore L   Berger Bonnie B  

Cell systems 20211009 10


We combine advances in neural language modeling and structurally motivated design to develop D-SCRIPT, an interpretable and generalizable deep-learning model, which predicts interaction between two proteins using only their sequence and maintains high accuracy with limited training data and across species. We show that a D-SCRIPT model trained on 38,345 human PPIs enables significantly improved functional characterization of fly proteins compared with the state-of-the-art approach. Evaluating th  ...[more]

Similar Datasets

| S-EPMC11223820 | biostudies-literature
| S-EPMC1988853 | biostudies-literature
| S-EPMC5860606 | biostudies-literature
| S-EPMC10288729 | biostudies-literature
| S-EPMC10837428 | biostudies-literature
| S-EPMC3482288 | biostudies-literature
| S-EPMC11272779 | biostudies-literature
2023-01-08 | GSE211000 | GEO
| S-EPMC10373252 | biostudies-literature
2016-09-24 | GSE87233 | GEO