Dataset Information


AlignHUSH: alignment of HMMs using structure and hydrophobicity information.

ABSTRACT: BACKGROUND: Sensitive remote homology detection and accurate alignments especially in the midnight zone of sequence similarity are needed for better function annotation and structural modeling of proteins. An algorithm, AlignHUSH for HMM-HMM alignment has been developed which is capable of recognizing distantly related domain families The method uses structural information, in the form of predicted secondary structure probabilities, and hydrophobicity of amino acids to align HMMs of two sets of aligned sequences. The effect of using adjoining column(s) information has also been investigated and is found to increase the sensitivity of HMM-HMM alignments and remote homology detection. RESULTS: We have assessed the performance of AlignHUSH using known evolutionary relationships available in SCOP. AlignHUSH performs better than the best HMM-HMM alignment methods and is observed to be even more sensitive at higher error rates. Accuracy of the alignments obtained using AlignHUSH has been assessed using the structure-based alignments available in BaliBASE. The alignment length and the alignment quality are found to be appropriate for homology modeling and function annotation. The alignment accuracy is found to be comparable to existing methods for profile-profile alignments. CONCLUSIONS: A new method to align HMMs has been developed and is shown to have better sensitivity at error rates of 10% and above when compared to other available programs. The proposed method could effectively aid obtaining clues to functions of proteins of yet unknown function. A web-server incorporating the AlignHUSH method is available at http://crick.mbu.iisc.ernet.in/~alignhush/

SUBMITTER: Krishnadev O 

PROVIDER: S-EPMC3228556 | BioStudies | 2011-01-01

REPOSITORIES: biostudies

Similar Datasets

2007-01-01 | S-EPMC1852395 | BioStudies
2006-01-01 | S-EPMC1523218 | BioStudies
2010-01-01 | S-EPMC2879284 | BioStudies
2005-01-01 | S-EPMC1160169 | BioStudies
2016-01-01 | S-EPMC5119741 | BioStudies
2016-01-01 | S-EPMC4777721 | BioStudies
2011-01-01 | S-EPMC3085309 | BioStudies
2007-01-01 | S-EPMC1950344 | BioStudies
2014-01-01 | S-EPMC4082353 | BioStudies
2016-01-01 | S-EPMC4709342 | BioStudies