Unknown

Dataset Information

0

DeepCellEss: cell line-specific essential protein prediction with attention-based interpretable deep learning.


ABSTRACT:

Motivation

Protein essentiality is usually accepted to be a conditional trait and strongly affected by cellular environments. However, existing computational methods often do not take such characteristics into account, preferring to incorporate all available data and train a general model for all cell lines. In addition, the lack of model interpretability limits further exploration and analysis of essential protein predictions.

Results

In this study, we proposed DeepCellEss, a sequence-based interpretable deep learning framework for cell line-specific essential protein predictions. DeepCellEss utilizes a convolutional neural network and bidirectional long short-term memory to learn short- and long-range latent information from protein sequences. Further, a multi-head self-attention mechanism is used to provide residue-level model interpretability. For model construction, we collected extremely large-scale benchmark datasets across 323 cell lines. Extensive computational experiments demonstrate that DeepCellEss yields effective prediction performance for different cell lines and outperforms existing sequence-based methods as well as network-based centrality measures. Finally, we conducted some case studies to illustrate the necessity of considering specific cell lines and the superiority of DeepCellEss. We believe that DeepCellEss can serve as a useful tool for predicting essential proteins across different cell lines.

Availability and implementation

The DeepCellEss web server is available at http://csuligroup.com:8000/DeepCellEss. The source code and data underlying this study can be obtained from https://github.com/CSUBioGroup/DeepCellEss.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Li Y 

PROVIDER: S-EPMC9825760 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

DeepCellEss: cell line-specific essential protein prediction with attention-based interpretable deep learning.

Li Yiming Y   Zeng Min M   Zhang Fuhao F   Wu Fang-Xiang FX   Li Min M  

Bioinformatics (Oxford, England) 20230101 1


<h4>Motivation</h4>Protein essentiality is usually accepted to be a conditional trait and strongly affected by cellular environments. However, existing computational methods often do not take such characteristics into account, preferring to incorporate all available data and train a general model for all cell lines. In addition, the lack of model interpretability limits further exploration and analysis of essential protein predictions.<h4>Results</h4>In this study, we proposed DeepCellEss, a seq  ...[more]

Similar Datasets

| S-EPMC8848015 | biostudies-literature
2022-02-15 | PXD019987 | Pride
| S-EPMC11528319 | biostudies-literature
| S-EPMC10547708 | biostudies-literature
| S-EPMC11262459 | biostudies-literature
| S-EPMC11815866 | biostudies-literature
| S-EPMC8421337 | biostudies-literature
| S-EPMC8055695 | biostudies-literature
| S-EPMC9712484 | biostudies-literature
| S-EPMC6300887 | biostudies-other