Dataset Information


A method for WD40 repeat detection and secondary structure prediction.

ABSTRACT: WD40-repeat proteins (WD40s), as one of the largest protein families in eukaryotes, play vital roles in assembling protein-protein/DNA/RNA complexes. WD40s fold into similar ?-propeller structures despite diversified sequences. A program WDSP (WD40 repeat protein Structure Predictor) has been developed to accurately identify WD40 repeats and predict their secondary structures. The method is designed specifically for WD40 proteins by incorporating both local residue information and non-local family-specific structural features. It overcomes the problem of highly diversified protein sequences and variable loops. In addition, WDSP achieves a better prediction in identifying multiple WD40-domain proteins by taking the global combination of repeats into consideration. In secondary structure prediction, the average Q3 accuracy of WDSP in jack-knife test reaches 93.7%. A disease related protein LRRK2 was used as a representive example to demonstrate the structure prediction.


PROVIDER: S-EPMC3679165 | BioStudies | 2013-01-01

REPOSITORIES: biostudies

Similar Datasets

2017-01-01 | S-EPMC5587647 | BioStudies
2018-01-01 | S-EPMC6113231 | BioStudies
2014-01-01 | S-EPMC3900672 | BioStudies
2014-01-01 | S-EPMC4054300 | BioStudies
2010-01-01 | S-EPMC3003394 | BioStudies
2019-01-01 | S-EPMC6358694 | BioStudies
2009-01-01 | S-EPMC2794542 | BioStudies
2010-01-01 | S-EPMC2871033 | BioStudies
1000-01-01 | S-EPMC1462119 | BioStudies
2007-01-01 | S-EPMC2526319 | BioStudies