Unknown

Dataset Information

0

Diffusion model based spectral clustering for protein-protein interaction networks.


ABSTRACT:

Background

A goal of systems biology is to analyze large-scale molecular networks including gene expressions and protein-protein interactions, revealing the relationships between network structures and their biological functions. Dividing a protein-protein interaction (PPI) network into naturally grouped parts is an essential way to investigate the relationship between topology of networks and their functions. However, clear modular decomposition is often hard due to the heterogeneous or scale-free properties of PPI networks.

Methodology/principal findings

To address this problem, we propose a diffusion model-based spectral clustering algorithm, which analytically solves the cluster structure of PPI networks as a problem of random walks in the diffusion process in them. To cope with the heterogeneity of the networks, the power factor is introduced to adjust the diffusion matrix by weighting the transition (adjacency) matrix according to a node degree matrix. This algorithm is named adjustable diffusion matrix-based spectral clustering (ADMSC). To demonstrate the feasibility of ADMSC, we apply it to decomposition of a yeast PPI network, identifying biologically significant clusters with approximately equal size. Compared with other established algorithms, ADMSC facilitates clear and fast decomposition of PPI networks.

Conclusions/significance

ADMSC is proposed by introducing the power factor that adjusts the diffusion matrix to the heterogeneity of the PPI networks. ADMSC effectively partitions PPI networks into biologically significant clusters with almost equal sizes, while being very fast, robust and appealing simple.

SUBMITTER: Inoue K 

PROVIDER: S-EPMC2935381 | biostudies-literature | 2010 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Diffusion model based spectral clustering for protein-protein interaction networks.

Inoue Kentaro K   Li Weijiang W   Kurata Hiroyuki H  

PloS one 20100907 9


<h4>Background</h4>A goal of systems biology is to analyze large-scale molecular networks including gene expressions and protein-protein interactions, revealing the relationships between network structures and their biological functions. Dividing a protein-protein interaction (PPI) network into naturally grouped parts is an essential way to investigate the relationship between topology of networks and their functions. However, clear modular decomposition is often hard due to the heterogeneous or  ...[more]

Similar Datasets

| S-EPMC5798376 | biostudies-literature
| S-EPMC1637120 | biostudies-literature
| S-EPMC1716184 | biostudies-literature
| S-EPMC1409676 | biostudies-literature
| S-EPMC4074043 | biostudies-literature
| S-EPMC3958373 | biostudies-literature
| S-EPMC6454479 | biostudies-literature
| S-EPMC6150027 | biostudies-literature