Project description:Drug combination is now a hot research topic in the pharmaceutical industry, but experiment-based methodologies are extremely costly in time and money. Many computational methods have been proposed to address these problems by starting from existing drug combinations. However, in most cases, only molecular structure information is included, which covers too limited a set of drug characteristics to efficiently screen drug combinations. Here, we integrated similarity-based multifeature drug data to improve the prediction accuracy by using the neighbor recommender method combined with ensemble learning algorithms. By conducting feature assessment analysis, we selected the most useful drug features and achieved 0.964 AUC in the ensemble models. The comparison results showed that the ensemble models outperform traditional machine learning algorithms such as support vector machine (SVM), naïve Bayes (NB), and logistic regression (GLM). Furthermore, we predicted 7 candidate drug combinations for a specific drug, paclitaxel, and successfully verified that the two of the predicted combinations have promising effects.
Project description:Combination therapy has shown an obvious efficacy on complex diseases and can greatly reduce the development of drug resistance. However, even with high-throughput screens, experimental methods are insufficient to explore novel drug combinations. In order to reduce the search space of drug combinations, there is an urgent need to develop more efficient computational methods to predict novel drug combinations. In recent decades, more and more machine learning (ML) algorithms have been applied to improve the predictive performance. The object of this study is to introduce and discuss the recent applications of ML methods and the widely used databases in drug combination prediction. In this study, we first describe the concept and controversy of synergism between drug combinations. Then, we investigate various publicly available data resources and tools for prediction tasks. Next, ML methods including classic ML and deep learning methods applied in drug combination prediction are introduced. Finally, we summarize the challenges to ML methods in prediction tasks and provide a discussion on future work.
Project description:Gene expression profiles were generated from 199 primary breast cancer patients. Samples 1-176 were used in another study, GEO Series GSE22820, and form the training data set in this study. Sample numbers 200-222 form a validation set. This data is used to model a machine learning classifier for Estrogen Receptor Status. RNA was isolated from 199 primary breast cancer patients. A machine learning classifier was built to predict ER status using only three gene features.
Project description:Clinical trials of novel therapeutics for Alzheimer's Disease (AD) have consumed a large amount of time and resources with largely negative results. Repurposing drugs already approved by the Food and Drug Administration (FDA) for another indication is a more rapid and less expensive option. In this study, we profile 80 FDA-approved and clinically tested drugs in neural cell cultures, with the goal of producing a ranked list of possible repurposing candidates.
Project description:Transcription factors (TFs) have been shown to play a key role in the occurrence and development of tumors, including triple-negative breast cancer (TNBC), with a worse prognosis. Machine learning is widely used for establishing prediction models and screening key tumor drivers. Current studies lack TF integration in TNBC, so targeted research on TF prognostic models and targeted drugs is beneficial to improve clinical translational application. The purpose of this study was to use the Least Absolute Shrinkage and Selection Operator to build a prognostic TFs model after cohort normalization based on housekeeping gene expression levels. Potential targeted drugs were then screened on the basis of molecular docking, and a multi-drug combination strategy was used for both in vivo and in vitro experimental studies. The machine learning model of TFs built by E2F8, FOXM1, and MYBL2 has broad applicability, with an AUC value of up to 0.877 at one year. As a high-risk clinical factor, its abnormal disorder may lead to upregulation of the activity of pathways related to cell proliferation. This model can also be used to predict the adverse effects of immunotherapy in patients with TNBC. Molecular docking was used to screen three drugs that target TFs: Trichostatin A (TSA), Doxorubicin (DOX), and Calcitriol. In vitro and in vivo experiments showed that TSA + DOX was able to effectively reduce DOX dosage, and TSA + DOX + Calcitriol may be able to effectively reduce the toxic side effects of DOX on the heart. In conclusion, the machine learning model based on three TFs provides new biomarkers for clinical and prognostic diagnosis of TNBC, and the combination targeted drug strategy offers a novel research perspective for TNBC treatment.
Project description:BackgroundThe mechanism of action for most cancer drugs is not clear. Large-scale pharmacogenomic cancer cell line datasets offer a rich resource to obtain this knowledge. Here, we present an analysis strategy for revealing biological pathways that contribute to drug response using publicly available pharmacogenomic cancer cell line datasets.MethodsWe present a custom machine-learning based approach for identifying biological pathways involved in cancer drug response. We test the utility of our approach with a pan-cancer analysis of ML210, an inhibitor of GPX4, and a melanoma-focused analysis of inhibitors of BRAFV600. We apply our approach to reveal determinants of drug resistance to microtubule inhibitors.ResultsOur method implicated lipid metabolism and Rac1/cytoskeleton signaling in the context of ML210 and BRAF inhibitor response, respectively. These findings are consistent with current knowledge of how these drugs work. For microtubule inhibitors, our approach implicated Notch and Akt signaling as pathways that associated with response.ConclusionsOur results demonstrate the utility of combining informed feature selection and machine learning algorithms in understanding cancer drug response.
Project description:Lung cancers with a mutated epidermal growth factor receptor (EGFR) are a major contributor to cancer fatalities globally. Targeted tyrosine kinase inhibitors (TKIs) have been developed against EGFR and show encouraging results for survival rate and quality of life. However, drug resistance may affect treatment plans and treatment efficacy may be lost after about a year. Predicting the response to EGFR-TKIs for EGFR-mutated lung cancer patients is a key research area. In this study, we propose a personalized drug response prediction model (PDRP), based on molecular dynamics simulations and machine learning, to predict the response of first generation FDA-approved small molecule EGFR-TKIs, Gefitinib/Erlotinib, in lung cancer patients. The patient's mutation status is taken into consideration in molecular dynamics (MD) simulation. Each patient's unique mutation status was modeled considering MD simulation to extract molecular-level geometric features. Moreover, additional clinical features were incorporated into machine learning model for drug response prediction. The complete feature set includes demographic and clinical information (DCI), geometrical properties of the drug-target binding site, and the binding free energy of the drug-target complex from the MD simulation. PDRP incorporates an XGBoost classifier, which achieves state-of-the-art performance with 97.5% accuracy, 93% recall, 96.5% precision, and 94% F1-score, for a 4-class drug response prediction task. We found that modeling the geometry of the binding pocket combined with binding free energy is a good predictor for drug response. However, we observed that clinical information had a little impact on the performance of the model. The proposed model could be tested on other types of cancers. We believe PDRP will support the planning of effective treatment regimes based on clinical-genomic information. The source code and related files are available on GitHub at: https://github.com/rizwanqureshi123/PDRP/ .
Project description:Chemotherapy resistance is a major challenge to the effective treatment of cancer. Thus, a systematic pipeline for the efficient identification of effective combination treatments could bring huge biomedical benefit. In order to facilitate rational design of combination therapies, we developed a comprehensive computational model that incorporates the available biological knowledge and relevant experimental data on the life-and-death response of individual cancer cells to cisplatin or cisplatin combined with the TNF-related apoptosis-inducing ligand (TRAIL). The model's predictions, that a combination treatment of cisplatin and TRAIL would enhance cancer cell death and exhibit a "two-wave killing" temporal pattern, was validated by measuring the dynamics of p53 accumulation, cell fate, and cell death in single cells. The validated model was then subjected to a systematic analysis with an ensemble of diverse machine learning methods. Though each method is characterized by a different algorithm, they collectively identified several molecular players that can sensitize tumor cells to cisplatin-induced apoptosis (sensitizers). The identified sensitizers are consistent with previous experimental observations. Overall, we have illustrated that machine learning analysis of an experimentally validated mechanistic model can convert our available knowledge into the identity of biologically meaningful sensitizers. This knowledge can then be leveraged to design treatment strategies that could improve the efficacy of chemotherapy.
Project description:Understanding drug-drug interactions is an essential step to reduce the risk of adverse drug events before clinical drug co-prescription. Existing methods, commonly integrating heterogeneous data to increase model performance, often suffer from a high model complexity, As such, how to elucidate the molecular mechanisms underlying drug-drug interactions while preserving rational biological interpretability is a challenging task in computational modeling for drug discovery. In this study, we attempt to investigate drug-drug interactions via the associations between genes that two drugs target. For this purpose, we propose a simple f drug target profile representation to depict drugs and drug pairs, from which an l2-regularized logistic regression model is built to predict drug-drug interactions. Furthermore, we define several statistical metrics in the context of human protein-protein interaction networks and signaling pathways to measure the interaction intensity, interaction efficacy and action range between two drugs. Large-scale empirical studies including both cross validation and independent test show that the proposed drug target profiles-based machine learning framework outperforms existing data integration-based methods. The proposed statistical metrics show that two drugs easily interact in the cases that they target common genes; or their target genes connect via short paths in protein-protein interaction networks; or their target genes are located at signaling pathways that have cross-talks. The unravelled mechanisms could provide biological insights into potential adverse drug reactions of co-prescribed drugs.
Project description:Gene expression profiles were generated from 199 primary breast cancer patients. Samples 1-176 were used in another study, GEO Series GSE22820, and form the training data set in this study. Sample numbers 200-222 form a validation set. This data is used to model a machine learning classifier for Estrogen Receptor Status.