Unknown

Dataset Information

0

A novel harmony search-K means hybrid algorithm for clustering gene expression data.


ABSTRACT: Recent progress in bioinformatics research has led to the accumulation of huge quantities of biological data at various data sources. The DNA microarray technology makes it possible to simultaneously analyze large number of genes across different samples. Clustering of microarray data can reveal the hidden gene expression patterns from large quantities of expression data that in turn offers tremendous possibilities in functional genomics, comparative genomics, disease diagnosis and drug development. The k- ¬means clustering algorithm is widely used for many practical applications. But the original k-¬means algorithm has several drawbacks. It is computationally expensive and generates locally optimal solutions based on the random choice of the initial centroids. Several methods have been proposed in the literature for improving the performance of the k-¬means algorithm. A meta-heuristic optimization algorithm named harmony search helps find out near-global optimal solutions by searching the entire solution space. Low clustering accuracy of the existing algorithms limits their use in many crucial applications of life sciences. In this paper we propose a novel Harmony Search-K means Hybrid (HSKH) algorithm for clustering the gene expression data. Experimental results show that the proposed algorithm produces clusters with better accuracy in comparison with the existing algorithms.

SUBMITTER: Nazeer KA 

PROVIDER: S-EPMC3563403 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel harmony search-K means hybrid algorithm for clustering gene expression data.

Nazeer Ka Abdul KA   Sebastian Mp M   Kumar Sd Madhu SM  

Bioinformation 20130118 2


Recent progress in bioinformatics research has led to the accumulation of huge quantities of biological data at various data sources. The DNA microarray technology makes it possible to simultaneously analyze large number of genes across different samples. Clustering of microarray data can reveal the hidden gene expression patterns from large quantities of expression data that in turn offers tremendous possibilities in functional genomics, comparative genomics, disease diagnosis and drug developm  ...[more]

Similar Datasets

| S-EPMC3984869 | biostudies-other
| S-EPMC5154524 | biostudies-literature
| S-EPMC2758278 | biostudies-literature
| S-EPMC7180207 | biostudies-literature
| S-EPMC5599559 | biostudies-literature
| S-EPMC3443659 | biostudies-literature
| S-EPMC5395818 | biostudies-literature
| S-EPMC4807955 | biostudies-literature
| S-EPMC6409843 | biostudies-other