Unknown

Dataset Information

0

Performance evaluation for MOTIFSIM.


ABSTRACT: BackgroundPrevious studies show various results obtained from different motif finders for an identical dataset. This is largely due to the fact that these tools use different strategies and possess unique features for discovering the motifs. Hence, using multiple tools and methods has been suggested because the motifs commonly reported by them are more likely to be biologically significant.ResultsThe common significant motifs from multiple tools can be obtained by using MOTIFSIM tool. In this work, we evaluated the performance of MOTIFSIM in three aspects. First, we compared the pair-wise comparison technique of MOTIFSIM with the un-gapped Smith-Waterman algorithm and four common distance metrics: average Kullback-Leibler, average log-likelihood ratio, Chi-Square distance, and Pearson Correlation Coefficient. Second, we compared the performance of MOTIFSIM with RSAT Matrix-clustering tool for motif clustering. Lastly, we evaluated the performances of nineteen motif finders and the reliability of MOTIFSIM for identifying the common significant motifs from multiple tools.ConclusionsThe pair-wise comparison results reveal that MOTIFSIM attains better performance than the un-gapped Smith-Waterman algorithm and four distance metrics. The clustering results also demonstrate that MOTIFSIM achieves similar or even better performance than RSAT Matrix-clustering. Furthermore, the findings indicate if the motif detection does not require a special tool for detecting a specific type of motif then using multiple motif finders and combining with MOTIFSIM for obtaining the common significant motifs, it improved the results for DNA motif detection.Electronic supplementary materialThe online version of this article (10.1186/s12575-018-0088-3) contains supplementary material, which is available to authorized users.

SUBMITTER: Tran NTL 

PROVIDER: S-EPMC6299673 | biostudies-other | 2018

REPOSITORIES: biostudies-other

Similar Datasets

2010-06-23 | E-GEOD-19248 | biostudies-arrayexpress
2019-06-27 | GSE131398 | GEO
2010-03-18 | GSE19248 | GEO
2019-06-27 | GSE131397 | GEO
2019-06-27 | GSE131396 | GEO
2006-04-08 | GSE4632 | GEO
2024-06-16 | PXD044349 | Pride
| PRJNA543480 | ENA
2023-07-31 | PXD044222 |
2021-01-26 | GSE144127 | GEO