Dataset Information

Improved Species-Specific Lysine Acetylation Site Prediction Based on a Large Variety of Features Set.


ABSTRACT: Lysine acetylation is a major post-translational modification. It plays a vital role in numerous essential biological processes, such as gene expression and metabolism, and is associated with some human diseases. Identifying acetylation sites is the essential first step toward understanding the regulatory mechanism of acetylation. However, experimental identification of protein acetylation sites is often time-consuming and expensive, so alternative computational methods are necessary. Here, we developed a novel tool, KA-predictor, to predict species-specific lysine acetylation sites based on a support vector machine (SVM) classifier. We incorporated different types of features and applied an efficient feature selection procedure to each type to form the final optimal feature set for model learning. Our predictor was highly competitive for the majority of species when compared with other methods. Feature contribution analysis indicated that HSE features, introduced here for the first time for lysine acetylation prediction, significantly improved the predictive performance. In addition, we constructed a high-accuracy structural dataset of H. sapiens from PDB to analyze the structural properties around lysine acetylation sites. Our datasets and a user-friendly local version of KA-predictor are freely available at http://sourceforge.net/p/ka-predictor.
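
The abstract does not spell out how candidate sites are turned into SVM inputs. As a rough illustration only, the sketch below shows one common first step in site prediction: cutting a fixed-length sequence window around each lysine and one-hot encoding it. The function names, the flank size of 10, the "X" padding character, and the one-hot encoding are all assumptions made for this example; they are not taken from KA-predictor, and the structural features (such as HSE) named in the abstract are not reproduced here.

```python
from typing import List, Tuple

# Hypothetical alphabet: 20 standard amino acids plus "X" for padding/unknown.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def lysine_windows(sequence: str, flank: int = 10, pad: str = "X") -> List[Tuple[int, str]]:
    """Return (1-based position, window) for every lysine (K) in the sequence.

    Each window has length 2 * flank + 1 with the lysine at its center;
    positions beyond the sequence ends are filled with the pad character.
    """
    seq = sequence.upper()
    padded = pad * flank + seq + pad * flank
    windows = []
    for i, residue in enumerate(seq):
        if residue == "K":
            # Residue i of the original sequence sits at index i + flank in
            # the padded string, so the window starts at index i.
            windows.append((i + 1, padded[i:i + 2 * flank + 1]))
    return windows


def one_hot(window: str, alphabet: str = ALPHABET) -> List[int]:
    """Flatten a window into a binary vector, one block per residue."""
    vector = []
    for residue in window:
        # Residues outside the alphabet are treated as unknown ("X").
        index = alphabet.find(residue)
        if index == -1:
            index = alphabet.index("X")
        block = [0] * len(alphabet)
        block[index] = 1
        vector.extend(block)
    return vector


if __name__ == "__main__":
    for position, window in lysine_windows("MKTAYIAKQR", flank=3):
        print(position, window, sum(one_hot(window)))
```

Vectors of this kind could then be passed to any SVM implementation, alongside whatever additional feature types and feature selection a real predictor uses.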

SUBMITTER: Wuyun Q 

PROVIDER: S-EPMC4868276 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

Publications

Improved Species-Specific Lysine Acetylation Site Prediction Based on a Large Variety of Features Set.

Wuyun Q, Zheng W, Zhang Y, Ruan J, Hu G

PLoS One, 2016-05-16, issue 5


Similar Datasets

| S-EPMC3930742 | biostudies-literature
| S-EPMC3500252 | biostudies-other
| S-EPMC8769686 | biostudies-literature
| S-EPMC4301072 | biostudies-literature
| S-EPMC1852326 | biostudies-literature
| S-EPMC4118097 | biostudies-literature
| S-EPMC3229533 | biostudies-literature
| S-EPMC3251012 | biostudies-literature
| S-EPMC4223480 | biostudies-literature
| S-EPMC2975415 | biostudies-literature