Unknown

Dataset Information

0

PPIGCF: A Protein-Protein Interaction-Based Gene Correlation Filter for Optimal Gene Selection.


ABSTRACT: Biological data at the omics level are highly complex, requiring powerful computational approaches to identifying significant intrinsic characteristics to further search for informative markers involved in the studied phenotype. In this paper, we propose a novel dimension reduction technique, protein-protein interaction-based gene correlation filtration (PPIGCF), which builds on gene ontology (GO) and protein-protein interaction (PPI) structures to analyze microarray gene expression data. PPIGCF first extracts the gene symbols with their expression from the experimental dataset, and then, classifies them based on GO biological process (BP) and cellular component (CC) annotations. Every classification group inherits all the information on its CCs, corresponding to the BPs, to establish a PPI network. Then, the gene correlation filter (regarding gene rank and the proposed correlation coefficient) is computed on every network and eradicates a few weakly correlated genes connected with their corresponding networks. PPIGCF finds the information content (IC) of the other genes related to the PPI network and takes only the genes with the highest IC values. The satisfactory results of PPIGCF are used to prioritize significant genes. We performed a comparison with current methods to demonstrate our technique's efficiency. From the experiment, it can be concluded that PPIGCF needs fewer genes to reach reasonable accuracy (~99%) for cancer classification. This paper reduces the computational complexity and enhances the time complexity of biomarker discovery from datasets.

SUBMITTER: Pati SK 

PROVIDER: S-EPMC10218330 | biostudies-literature | 2023 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

PPIGCF: A Protein-Protein Interaction-Based Gene Correlation Filter for Optimal Gene Selection.

Pati Soumen Kumar SK   Gupta Manan Kumar MK   Banerjee Ayan A   Mallik Saurav S   Zhao Zhongming Z  

Genes 20230510 5


Biological data at the omics level are highly complex, requiring powerful computational approaches to identifying significant intrinsic characteristics to further search for informative markers involved in the studied phenotype. In this paper, we propose a novel dimension reduction technique, protein-protein interaction-based gene correlation filtration (PPIGCF), which builds on gene ontology (GO) and protein-protein interaction (PPI) structures to analyze microarray gene expression data. PPIGCF  ...[more]

Similar Datasets

| S-EPMC10153705 | biostudies-literature
| S-EPMC5803242 | biostudies-literature
| S-EPMC10363294 | biostudies-literature
| S-EPMC5002968 | biostudies-literature
| S-EPMC2374374 | biostudies-literature
| S-EPMC5054498 | biostudies-literature
| S-EPMC7017824 | biostudies-literature
| S-EPMC6795673 | biostudies-literature
| S-EPMC5322573 | biostudies-literature
| S-EPMC9340570 | biostudies-literature