Unknown

Dataset Information

0

Comprehensive assessment of computational algorithms in predicting cancer driver mutations.


ABSTRACT:

Background

The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient's tumor cells is a central task in the era of precision cancer medicine. Over the decade, many computational algorithms have been developed to predict the effects of missense single-nucleotide variants, and they are frequently employed to prioritize mutation candidates. These algorithms employ diverse molecular features to build predictive models, and while some algorithms are cancer-specific, others are not. However, the relative performance of these algorithms has not been rigorously assessed.

Results

We construct five complementary benchmark datasets: mutation clustering patterns in the protein 3D structures, literature annotation based on OncoKB, TP53 mutations based on their effects on target-gene transactivation, effects of cancer mutations on tumor formation in xenograft experiments, and functional annotation based on in vitro cell viability assays we developed including a new dataset of ~ 200 mutations. We evaluate the performance of 33 algorithms and found that CHASM, CTAT-cancer, DEOGEN2, and PrimateAI show consistently better performance than the other algorithms. Moreover, cancer-specific algorithms show much better performance than those designed for a general purpose.

Conclusions

Our study is a comprehensive assessment of the performance of different algorithms in predicting cancer driver mutations and provides deep insights into the best practice of computationally prioritizing cancer mutation candidates for end-users and for the future development of new algorithms.

SUBMITTER: Chen H 

PROVIDER: S-EPMC7033911 | biostudies-literature | 2020 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Comprehensive assessment of computational algorithms in predicting cancer driver mutations.

Chen Hu H   Li Jun J   Wang Yumeng Y   Ng Patrick Kwok-Shing PK   Tsang Yiu Huen YH   Shaw Kenna R KR   Mills Gordon B GB   Liang Han H  

Genome biology 20200220 1


<h4>Background</h4>The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient's tumor cells is a central task in the era of precision cancer medicine. Over the decade, many computational algorithms have been developed to predict the effects of missense single-nucleotide variants, and they are frequently employed to prioritize mutation  ...[more]

Similar Datasets

| S-EPMC8921613 | biostudies-literature
| S-EPMC6029450 | biostudies-literature
| S-EPMC2763410 | biostudies-literature
| S-EPMC6589134 | biostudies-literature
| S-EPMC11382650 | biostudies-literature
| S-EPMC11537269 | biostudies-literature
| S-EPMC2922245 | biostudies-literature
| S-EPMC10805075 | biostudies-literature
| S-EPMC5564847 | biostudies-literature
| S-EPMC3665581 | biostudies-literature