Dataset Information

Network-based cancer genomic data integration for pattern discovery.


ABSTRACT:

Background

Because genes involved in the same biological module usually show correlated expression profiles, many computational methods have been proposed to identify gene functional modules from expression profile data. Recently, the Sparse Singular Value Decomposition (SSVD) method was proposed to bicluster gene expression data and identify gene modules. However, this model handles only gene expression data and cannot integrate gene interaction information. Ignoring prior gene interaction information can make the identified gene modules hard to interpret biologically.

Results

In this paper, we develop a Sparse Network-regularized SVD (SNSVD) method that integrates gene expression data with a prior gene interaction network, derived from a protein-protein interaction network, to identify underlying gene functional modules. Results on simulated data show that SNSVD is more effective than traditional SVD-based methods. Further experiments on real cancer genomic data show that most co-expressed modules are not only significantly enriched in GO/KEGG pathways but also correspond to dense sub-networks in the prior gene interaction network. We also use our method to identify ten differentially co-expressed miRNA-gene modules by integrating matched miRNA and mRNA expression data of breast cancer from The Cancer Genome Atlas (TCGA). Several important breast cancer-related miRNA-gene modules are discovered.
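
The abstract does not state the SNSVD objective or its solver, so the following is only an illustrative sketch of the general idea of a sparse, network-regularized rank-1 SVD, not the authors' algorithm. It assumes a genes-by-samples matrix `X`, a symmetric gene adjacency matrix `A`, a normalized graph Laplacian penalty on the gene loading vector, and soft-thresholding with thresholds set relative to the largest entry. The function names and the parameters `lam_u`, `lam_v` and `lam_net` are hypothetical.

```python
import numpy as np

def soft_threshold(z, lam):
    """Shrink each entry of z toward zero by lam (the L1 proximal step)."""
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)

def unit(x):
    """Scale x to unit Euclidean norm; return it unchanged if it is all zeros."""
    norm = np.linalg.norm(x)
    return x / norm if norm > 0 else x

def normalized_laplacian(A):
    """Normalized Laplacian of a symmetric adjacency matrix.
    Rows of isolated nodes are left as zeros."""
    A = np.asarray(A, dtype=float)
    deg = A.sum(axis=1)
    has_edge = deg > 0
    d_inv_sqrt = np.zeros_like(deg)
    d_inv_sqrt[has_edge] = 1.0 / np.sqrt(deg[has_edge])
    return np.diag(has_edge.astype(float)) - d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]

def network_sparse_rank1(X, A, lam_u=0.3, lam_v=0.0, lam_net=0.5,
                         n_iter=200, tol=1e-8):
    """Heuristic alternating updates for one sparse, network-smoothed
    pair of loading vectors (u for genes, v for samples).

    Assumption: thresholds are fractions of the largest absolute entry,
    so lam_u and lam_v are expected to lie in [0, 1)."""
    L = normalized_laplacian(A)
    # Start from the leading singular vectors of the plain SVD.
    U, _, Vt = np.linalg.svd(X, full_matrices=False)
    u, v = U[:, 0], Vt[0]
    for _ in range(n_iter):
        u_old = u
        # Gene step: data fit minus the Laplacian smoothness gradient.
        z = X @ v - lam_net * (L @ u)
        u = unit(soft_threshold(z, lam_u * np.abs(z).max()))
        # Sample step: ordinary sparse power-iteration update.
        w = X.T @ u
        v = unit(soft_threshold(w, lam_v * np.abs(w).max()))
        if np.linalg.norm(u - u_old) < tol:
            break
    d = float(u @ X @ v)  # strength of the recovered rank-1 layer
    return u, d, v
```

Under these assumptions, the genes with nonzero entries in `u` and the samples with nonzero entries in `v` would form one candidate module. Setting `lam_net=0` drops the network term and leaves a plain sparse rank-1 update, which is closer in spirit to an SSVD-style bicluster.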

Conclusions

Together, these results demonstrate that SNSVD can overcome the drawbacks of SSVD and capture more biologically relevant functional modules by incorporating a prior gene interaction network. The identified functional modules may provide a new perspective for understanding the diagnosis, occurrence and progression of cancer.

SUBMITTER: Zhu F 

PROVIDER: S-EPMC8662848 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC7241240 | biostudies-literature
| S-EPMC3600490 | biostudies-literature
| S-EPMC6616462 | biostudies-literature
| S-EPMC4258423 | biostudies-literature
| S-EPMC6041755 | biostudies-literature
| S-EPMC3760887 | biostudies-literature
| S-EPMC4684294 | biostudies-literature
| S-EPMC3866686 | biostudies-literature
| S-EPMC5985961 | biostudies-literature