Unknown

Dataset Information

0

Alignment-free similarity analysis for protein sequences based on fuzzy integral.


ABSTRACT: Sequence comparison is an essential part of modern molecular biology research. In this study, we estimated the parameters of Markov chain by considering the frequencies of occurrence of the all possible amino acid pairs from each alignment-free protein sequence. These estimated Markov chain parameters were used to calculate similarity between two protein sequences based on a fuzzy integral algorithm. For validation, our result was compared with both alignment-based (ClustalW) and alignment-free methods on six benchmark datasets. The results indicate that our developed algorithm has a better clustering performance for protein sequence comparison.

SUBMITTER: Saw AK 

PROVIDER: S-EPMC6391537 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6403383 | biostudies-literature
| S-EPMC2722654 | biostudies-literature
| S-EPMC1131888 | biostudies-literature
| S-EPMC6355110 | biostudies-literature
| S-EPMC7446192 | biostudies-literature
| S-EPMC6436989 | biostudies-literature