Unknown

Dataset Information

0

C10Pred: A First Machine Learning Based Tool to Predict C10 Family Cysteine Peptidases Using Sequence-Derived Features.


ABSTRACT: Streptococcus pyogenes, or group A Streptococcus (GAS), a gram-positive bacterium, is implicated in a wide range of clinical manifestations and life-threatening diseases. One of the key virulence factors of GAS is streptopain, a C10 family cysteine peptidase. Since its discovery, various homologs of streptopain have been reported from other bacterial species. With the increased affordability of sequencing, a significant increase in the number of potential C10 family-like sequences in the public databases is anticipated, posing a challenge in classifying such sequences. Sequence-similarity-based tools are the methods of choice to identify such streptopain-like sequences. However, these methods depend on some level of sequence similarity between the existing C10 family and the target sequences. Therefore, in this work, we propose a novel predictor, C10Pred, for the prediction of C10 peptidases using sequence-derived optimal features. C10Pred is a support vector machine (SVM) based model which is efficient in predicting C10 enzymes with an overall accuracy of 92.7% and Matthews' correlation coefficient (MCC) value of 0.855 when tested on an independent dataset. We anticipate that C10Pred will serve as a handy tool to classify novel streptopain-like proteins belonging to the C10 family and offer essential information.

SUBMITTER: Malik A 

PROVIDER: S-EPMC9455582 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

C10Pred: A First Machine Learning Based Tool to Predict C10 Family Cysteine Peptidases Using Sequence-Derived Features.

Malik Adeel A   Mahajan Nitin N   Dar Tanveer Ali TA   Kim Chang-Bae CB  

International journal of molecular sciences 20220823 17


<i>Streptococcus pyogenes</i>, or group A <i>Streptococcus</i> (GAS), a gram-positive bacterium, is implicated in a wide range of clinical manifestations and life-threatening diseases. One of the key virulence factors of GAS is streptopain, a C10 family cysteine peptidase. Since its discovery, various homologs of streptopain have been reported from other bacterial species. With the increased affordability of sequencing, a significant increase in the number of potential C10 family-like sequences  ...[more]

Similar Datasets

| S-EPMC10380794 | biostudies-literature
2023-01-09 | GSE221703 | GEO
| S-EPMC4357240 | biostudies-literature
| S-EPMC7643032 | biostudies-literature
| S-EPMC4336737 | biostudies-literature
| S-EPMC3064834 | biostudies-literature
2016-03-24 | E-GEOD-71126 | biostudies-arrayexpress
2016-03-24 | GSE71126 | GEO
| S-EPMC2873492 | biostudies-literature
| S-EPMC3407033 | biostudies-literature