Unknown

Dataset Information

0

A GHKNN model based on the physicochemical property extraction method to identify SNARE proteins.


ABSTRACT: There is a great deal of importance to SNARE proteins, and their absence from function can lead to a variety of diseases. The SNARE protein is known as a membrane fusion protein, and it is crucial for mediating vesicle fusion. The identification of SNARE proteins must therefore be conducted with an accurate method. Through extensive experiments, we have developed a model based on graph-regularized k-local hyperplane distance nearest neighbor model (GHKNN) binary classification. In this, the model uses the physicochemical property extraction method to extract protein sequence features and the SMOTE method to upsample protein sequence features. The combination achieves the most accurate performance for identifying all protein sequences. Finally, we compare the model based on GHKNN binary classification with other classifiers and measure them using four different metrics: SN, SP, ACC, and MCC. In experiments, the model performs significantly better than other classifiers.

SUBMITTER: Gu X 

PROVIDER: S-EPMC9727185 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

A GHKNN model based on the physicochemical property extraction method to identify SNARE proteins.

Gu Xingyue X   Ding Yijie Y   Xiao Pengfeng P   He Tao T  

Frontiers in genetics 20221123


There is a great deal of importance to SNARE proteins, and their absence from function can lead to a variety of diseases. The SNARE protein is known as a membrane fusion protein, and it is crucial for mediating vesicle fusion. The identification of SNARE proteins must therefore be conducted with an accurate method. Through extensive experiments, we have developed a model based on graph-regularized k-local hyperplane distance nearest neighbor model (GHKNN) binary classification. In this, the mode  ...[more]

Similar Datasets

| S-EPMC5080432 | biostudies-literature
| S-EPMC8699583 | biostudies-literature
| S-EPMC6006778 | biostudies-literature
| S-EPMC11393682 | biostudies-literature
2011-01-21 | E-GEOD-17222 | biostudies-arrayexpress
| S-EPMC5561268 | biostudies-literature
| S-EPMC3182212 | biostudies-literature
2011-01-21 | GSE17222 | GEO
| S-EPMC11781884 | biostudies-literature
| S-EPMC6641880 | biostudies-literature