Dataset Information

Identifying robust communities and multi-community nodes by combining top-down and bottom-up approaches to clustering.


ABSTRACT: Biological functions are carried out by groups of interacting molecules, cells or tissues, known as communities. Membership in these communities may overlap when biological components are involved in multiple functions. However, traditional clustering methods detect non-overlapping communities. These detected communities may also be unstable and difficult to replicate, because traditional methods are sensitive to noise and parameter settings. These aspects of traditional clustering methods limit our ability to detect biological communities, and therefore our ability to understand biological functions. To address these limitations and detect robust overlapping biological communities, we propose an unorthodox clustering method called SpeakEasy which identifies communities using top-down and bottom-up approaches simultaneously. Specifically, nodes join communities based on their local connections, as well as global information about the network structure. This method can quantify the stability of each community, automatically identify the number of communities, and quickly cluster networks with hundreds of thousands of nodes. SpeakEasy shows top performance on synthetic clustering benchmarks and accurately identifies meaningful biological communities in a range of datasets, including: gene microarrays, protein interactions, sorted cell populations, electrophysiology and fMRI brain imaging.

SUBMITTER: Gaiteri C 

PROVIDER: S-EPMC4637843 | biostudies-other | 2015 Nov

REPOSITORIES: biostudies-other

Publications

Identifying robust communities and multi-community nodes by combining top-down and bottom-up approaches to clustering.

Gaiteri C, Chen M, Szymanski B, Kuzmin K, Xie J, Lee C, Blanche T, Chaibub Neto E, Huang SC, Grabowski T, Madhyastha T, Komashko V

Scientific Reports, 2015-11-09

Similar Datasets

| S-EPMC8106998 | biostudies-literature
| S-EPMC7145108 | biostudies-literature
| S-EPMC5508256 | biostudies-other
| S-EPMC4770414 | biostudies-literature
| S-EPMC3653887 | biostudies-other