Dataset Information

Accurate and efficient target prediction using a potency-sensitive influence-relevance voter.

ABSTRACT: BACKGROUND:A number of algorithms have been proposed to predict the biological targets of diverse molecules. Some are structure-based, but the most common are ligand-based and use chemical fingerprints and the notion of chemical similarity. These methods tend to be computationally faster than others, making them particularly attractive tools as the amount of available data grows. RESULTS:Using a ChEMBL-derived database covering 490,760 molecule-protein interactions and 3236 protein targets, we conduct a large-scale assessment of the performance of several target-prediction algorithms at predicting drug-target activity. We assess algorithm performance using three validation procedures: standard tenfold cross-validation, tenfold cross-validation in a simulated screen that includes random inactive molecules, and validation on an external test set composed of molecules not present in our database. CONCLUSIONS:We present two improvements over current practice. First, using a modified version of the influence-relevance voter (IRV), we show that using molecule potency data can improve target prediction. Second, we demonstrate that random inactive molecules added during training can boost the accuracy of several algorithms in realistic target-prediction experiments. Our potency-sensitive version of the IRV (PS-IRV) obtains the best results on large test sets in most of the experiments. Models and software are publicly accessible through the chemoinformatics portal at http://chemdb.ics.uci.edu/.

SUBMITTER: Lusci A

PROVIDER: S-EPMC4696267 | biostudies-other | 2015

REPOSITORIES: biostudies-other

ACCESS DATA

Publications

Accurate and efficient target prediction using a potency-sensitive influence-relevance voter.

Lusci Alessandro A Browning Michael M Fooshee David D Swamidass Joshua J Baldi Pierre P

Journal of cheminformatics 20151229

<h4>Background</h4>A number of algorithms have been proposed to predict the biological targets of diverse molecules. Some are structure-based, but the most common are ligand-based and use chemical fingerprints and the notion of chemical similarity. These methods tend to be computationally faster than others, making them particularly attractive tools as the amount of available data grows.<h4>Results</h4>Using a ChEMBL-derived database covering 490,760 molecule-protein interactions and 3236 protei ...[more]

PMID: 26719774

Dataset Information

Accurate and efficient target prediction using a potency-sensitive influence-relevance voter.

Publications

Accurate and efficient target prediction using a potency-sensitive influence-relevance voter.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Prediction-based highly sensitive CRISPR off-target validation using target-specific DNA enrichment.
| S-EPMC7368065 | biostudies-literature

Accurate indel prediction using paired-end short reads.
| S-EPMC3614465 | biostudies-other

A Bayesian framework for efficient and accurate variant prediction.
| S-EPMC6136750 | biostudies-literature

Accurate microRNA Target Prediction Using Detailed Binding Site Accessibility and Machine Learning on Proteomics Data.
| S-EPMC3265086 | biostudies-literature

Accurate and transferable drug-target interaction prediction with DrugLAMP.
| S-EPMC11629708 | biostudies-literature

CONSULT: accurate contamination removal using locality-sensitive hashing.
| S-EPMC8340999 | biostudies-literature

Prediction of skin sensitization potency using machine learning approaches.
| S-EPMC5435511 | biostudies-literature

Efficient use of accessibility in microRNA target prediction.
| S-EPMC3017612 | biostudies-literature

Accurate Prediction of Kinase-Substrate Networks Using Knowledge Graphs
2022-02-20 | PXD018905 | Pride

Accurate Prediction of Children's Target Height from Their Mid-Parental Height.
| S-EPMC11352326 | biostudies-literature