Dataset Information

IBPred: A sequence-based predictor for identifying ion binding protein in phage.


ABSTRACT: Ion binding proteins (IBPs) can selectively and non-covalently interact with ions. IBPs in phages also play an important role in biological processes. Therefore, accurate identification of IBPs is necessary for understanding their biological functions and the molecular mechanisms that involve ion binding. Because experimental methods in molecular biology remain labor-intensive and costly for identifying IBPs, computational methods that identify IBPs quickly and efficiently are helpful. In this work, a random forest (RF)-based model was constructed to identify IBPs. Using protein sequence information and the physicochemical properties of residues, features were extracted by combining dipeptide composition with the physicochemical correlation between two residues. A feature selection technique, analysis of variance (ANOVA), was used to exclude redundant information. Comparison with other classification methods demonstrated that our method could identify IBPs accurately. Based on the model, a Python package named IBPred was built; its source code can be accessed at https://github.com/ShishiYuan/IBPred.
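
The two feature steps named in the abstract, dipeptide composition and ANOVA-based feature scoring, can be illustrated with a minimal standard-library sketch. This is not the IBPred source code or its API: the function names `dipeptide_composition` and `anova_f` are hypothetical, the physicochemical correlation features are omitted, and the random forest step is left out. It assumes the 20 standard amino acids and skips any pair containing another character.

```python
from itertools import product

# Assumption: the 20 standard amino acids, giving 20 x 20 = 400 dipeptides.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(seq):
    """Return the 400-dimensional frequency vector of adjacent residue pairs."""
    seq = seq.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:  # skip pairs with non-standard residues
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


def anova_f(groups):
    """One-way ANOVA F statistic for one feature, given its values per class."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    # Variance between class means, weighted by class size.
    between = sum(len(g) * (m - grand_mean) ** 2
                  for g, m in zip(groups, means)) / (k - 1)
    # Variance of samples around their own class mean.
    within = sum((x - m) ** 2
                 for g, m in zip(groups, means) for x in g) / (n - k)
    if within == 0:
        return 0.0 if between == 0 else float("inf")
    return between / within


if __name__ == "__main__":
    vec = dipeptide_composition("ACAC")
    print(len(vec), vec[DIPEPTIDES.index("AC")])
    print(anova_f([[1, 2, 3], [4, 5, 6]]))
```

A feature with a higher F value separates the classes (here, IBPs and non-IBPs) more strongly, so ranking the 400 dipeptide features by `anova_f` and keeping the top-scoring ones is one plausible way to discard redundant features before training a classifier.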

SUBMITTER: Yuan SS 

PROVIDER: S-EPMC9474292 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

Publications

IBPred: A sequence-based predictor for identifying ion binding protein in phage.

Yuan Shi-Shi, Gao Dong, Xie Xue-Qin, Ma Cai-Yi, Su Wei, Zhang Zhao-Yue, Zheng Yan, Ding Hui

Computational and Structural Biotechnology Journal, 2022-08-28

