Dataset Information

Robust integrative biclustering for multi-view data.


ABSTRACT: In many biomedical studies, multiple views of data (e.g. genomics, proteomics) are available, and a particular interest might be the detection of sample subgroups characterized by specific groups of variables. Biclustering methods are well suited to this problem because they assume that specific groups of variables might be relevant only to specific groups of samples. Many biclustering methods exist for detecting row-column clusters in a single view, but few exist for data from multiple views. The few existing algorithms depend heavily on regularization parameters to obtain row-column clusters, which imposes an unnecessary burden on users and limits their use in practice. We extend an existing biclustering method based on sparse singular value decomposition for single-view data to data from multiple views. Our method, integrative sparse singular value decomposition (iSSVD), incorporates stability selection to control Type I error rates, estimates the probability that samples and variables belong to a bicluster, finds stable biclusters, and yields interpretable row-column associations. Simulations and real data analyses show that iSSVD outperforms several other single- and multi-view biclustering methods and is able to detect meaningful biclusters. iSSVD is a user-friendly, computationally efficient algorithm that will be useful in many disease subtyping applications.
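
To make the idea of biclustering through a sparse singular value decomposition concrete, the following is a minimal sketch and not the authors' iSSVD implementation. It assumes two views that share the same samples, concatenates them column-wise, and extracts one sparse rank-1 layer by alternating soft-thresholded power iterations. The function names (`soft_threshold`, `sparse_rank1`), the thresholding rule (a fixed fraction of the largest absolute entry), and the simulated data are all hypothetical choices made for illustration. The stability selection step described in the abstract is not included.

```python
import numpy as np


def soft_threshold(x, lam):
    """Shrink every entry toward zero by lam; entries within lam of zero become 0."""
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)


def sparse_rank1(X, frac_u=0.5, frac_v=0.5, n_iter=100, tol=1e-8):
    """Return sparse left/right vectors (u, v) for one rank-1 layer of X.

    frac_u and frac_v are hypothetical tuning knobs: each threshold is set to
    that fraction of the largest absolute entry of the current update.
    """
    # Start from the leading left singular vector of the ordinary SVD.
    U, _, _ = np.linalg.svd(X, full_matrices=False)
    u = U[:, 0]
    v = np.zeros(X.shape[1])
    for _ in range(n_iter):
        # Update the variable loadings, then sparsify and renormalize.
        z = X.T @ u
        v_new = soft_threshold(z, frac_v * np.abs(z).max())
        if not v_new.any():
            break
        v_new /= np.linalg.norm(v_new)
        # Update the sample loadings in the same way.
        w = X @ v_new
        u_new = soft_threshold(w, frac_u * np.abs(w).max())
        if not u_new.any():
            break
        u_new /= np.linalg.norm(u_new)
        done = (np.linalg.norm(u_new - u) < tol
                and np.linalg.norm(v_new - v) < tol)
        u, v = u_new, v_new
        if done:
            break
    return u, v


# Simulated example: 40 shared samples, two views with 30 and 20 variables.
rng = np.random.default_rng(0)
n, p1, p2 = 40, 30, 20
view1 = rng.normal(0.0, 0.3, size=(n, p1))
view2 = rng.normal(0.0, 0.3, size=(n, p2))
# Plant one bicluster: samples 0-9, variables 0-4 (view 1) and 0-3 (view 2).
view1[:10, :5] += 3.0
view2[:10, :4] += 3.0

X = np.hstack([view1, view2])  # samples are rows, so views join column-wise
u, v = sparse_rank1(X)

rows = np.flatnonzero(u)             # samples assigned to the bicluster
cols_view1 = np.flatnonzero(v[:p1])  # selected variables in view 1
cols_view2 = np.flatnonzero(v[p1:])  # selected variables in view 2
print(rows, cols_view1, cols_view2)
```

The nonzero entries of `u` mark the samples in the bicluster, and the nonzero entries of `v`, split back by view, mark the variables from each view that characterize it. In this sketch the thresholds are fixed by hand, which is exactly the kind of tuning-parameter dependence the abstract says iSSVD aims to reduce.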

SUBMITTER: Zhang W 

PROVIDER: S-EPMC10153449 | biostudies-literature | 2022 Nov

REPOSITORIES: biostudies-literature

Publications

Robust integrative biclustering for multi-view data.

Zhang W, Wendt C, Bowler R, Hersh CP, Safo SE

Statistical Methods in Medical Research, 2022-09-13, issue 11



Similar Datasets

S-EPMC10701104 | biostudies-literature
S-EPMC2965388 | biostudies-literature
S-EPMC6751173 | biostudies-literature
S-EPMC11784738 | biostudies-literature
S-EPMC3019228 | biostudies-literature
S-EPMC9214338 | biostudies-literature
S-EPMC6849205 | biostudies-literature
S-EPMC11470236 | biostudies-literature
S-EPMC7423957 | biostudies-literature
S-EPMC6805321 | biostudies-literature