Dataset Information

Active link selection for efficient semi-supervised community detection.


ABSTRACT: Several semi-supervised community detection algorithms have been proposed recently to improve the performance of traditional topology-based methods. However, most of them focus on how to integrate supervised information with topology information; few consider which information is critical for improving performance. As a result, they demand large amounts of supervised information, which is expensive or difficult to obtain in most fields. To address this problem, we propose an active link selection framework: we actively select the most uncertain and informative links for human labeling so that the supervised information is used efficiently. We also disconnect the most likely inter-community edges to further improve efficiency. Our main idea is that, by connecting uncertain nodes to their community hubs and disconnecting inter-community edges, one can sharpen the block structure of the adjacency matrix more efficiently than by randomly labeling links, as existing methods do. Experiments on both synthetic and real networks demonstrate that our new approach significantly outperforms existing methods in terms of the efficiency of using supervised information. It needs ~13% of the supervised information to achieve a performance similar to that of the original semi-supervised approaches.
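
The main idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a soft membership matrix is already available from some topology-based detector, uses entropy as the uncertainty measure and the highest-degree member as the community hub, and assumes the human labeler confirms each queried link. The function names `select_links` and `apply_labels` and the toy network are hypothetical.

```python
import math

def entropy(probs):
    # Shannon entropy of one node's community membership distribution.
    return -sum(p * math.log(p) for p in probs if p > 0)

def select_links(adj, membership, budget):
    """Return up to `budget` (node, hub) pairs, most uncertain node first.

    Assumption: uncertainty = membership entropy, hub = highest-degree
    node among those assigned to the same community.
    """
    n, k = len(adj), len(membership[0])
    labels = [max(range(k), key=lambda c: membership[i][c]) for i in range(n)]
    degree = [sum(row) for row in adj]
    hubs = {}
    for c in set(labels):
        members = [i for i in range(n) if labels[i] == c]
        hubs[c] = max(members, key=lambda i: degree[i])
    ranked = sorted(range(n), key=lambda i: entropy(membership[i]), reverse=True)
    pairs = []
    for i in ranked:
        if len(pairs) >= budget:
            break
        hub = hubs[labels[i]]
        if i != hub:
            pairs.append((i, hub))
    return pairs

def apply_labels(adj, must_link, cannot_link):
    """Return a copy of `adj` with labeled links added and removed."""
    new = [row[:] for row in adj]
    for i, j in must_link:
        new[i][j] = new[j][i] = 1
    for i, j in cannot_link:
        new[i][j] = new[j][i] = 0
    return new

# Toy network: community A = {0..4} with hub 0, community B = {5, 6, 7},
# and one inter-community edge 4-5.
edges = [(0, 1), (0, 2), (0, 3), (3, 4), (5, 6), (5, 7), (6, 7), (4, 5)]
adj = [[0] * 8 for _ in range(8)]
for i, j in edges:
    adj[i][j] = adj[j][i] = 1

# Hypothetical soft memberships: node 4 is the only uncertain node.
membership = [[0.95, 0.05]] * 4 + [[0.55, 0.45]] + [[0.05, 0.95]] * 3

queries = select_links(adj, membership, budget=1)
# Assume the labeler confirms the queried link and flags edge 4-5
# as inter-community.
sharpened = apply_labels(adj, must_link=queries, cannot_link=[(4, 5)])
```

In this toy case the single query links node 4 to hub 0, and removing edge 4-5 leaves the two diagonal blocks of the adjacency matrix fully separated.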

SUBMITTER: Yang L 

PROVIDER: S-EPMC4649850 | biostudies-other | 2015

REPOSITORIES: biostudies-other

Similar Datasets

| S-EPMC5441628 | biostudies-literature
2019-11-13 | GSE140262 | GEO
| S-EPMC7514320 | biostudies-literature
| S-EPMC3359096 | biostudies-literature
| S-EPMC3205936 | biostudies-literature
| S-EPMC4382902 | biostudies-literature
| S-EPMC8406783 | biostudies-literature
| S-EPMC3956069 | biostudies-literature