
Dataset Information


Network-based penalized regression with application to genomic data.


ABSTRACT: Penalized regression approaches are attractive in dealing with high-dimensional data such as arising in high-throughput genomic studies. New methods have been introduced to utilize the network structure of predictors, for example, gene networks, to improve parameter estimation and variable selection. All the existing network-based penalized methods are based on an assumption that parameters, for example, regression coefficients, of neighboring nodes in a network are close in magnitude, which however may not hold. Here we propose a novel penalized regression method based on a weaker prior assumption that the parameters of neighboring nodes in a network are likely to be zero (or non-zero) at the same time, regardless of their specific magnitudes. We propose a novel non-convex penalty function to incorporate this prior, and an algorithm based on difference convex programming. We use simulated data and two breast cancer gene expression datasets to demonstrate the advantages of the proposed methods over some existing methods. Our proposed methods can be applied to more general problems for group variable selection.
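The contrast the abstract draws between the two prior assumptions can be illustrated with a small sketch. Nothing below is taken from the paper: the function names, the edge list, the threshold tau, and in particular the surrogate min(|b|/tau, 1) for the indicator "coefficient is non-zero" are assumptions chosen only to show the idea. The first penalty charges neighboring coefficients for differing in magnitude; the second charges them only when one is (near) zero and the other is not.

```python
# Illustrative sketch only: these functions are NOT the penalty from the paper.
# All names, the toy network, and the truncated surrogate are assumptions.

def magnitude_penalty(beta, edges):
    """Sum of |b_i - b_j| over edges.

    Penalizes neighboring nodes whose coefficients differ in size.
    """
    return sum(abs(beta[i] - beta[j]) for i, j in edges)


def truncated_indicator(b, tau):
    """Continuous stand-in for the indicator I(b != 0): min(|b| / tau, 1)."""
    return min(abs(b) / tau, 1.0)


def support_penalty(beta, edges, tau=0.1):
    """Sum of |J(b_i) - J(b_j)| over edges, with J the truncated indicator.

    Penalizes neighbors only when one is (near) zero and the other is not,
    regardless of how large the non-zero coefficients are.
    """
    return sum(
        abs(truncated_indicator(beta[i], tau) - truncated_indicator(beta[j], tau))
        for i, j in edges
    )


# Hypothetical 3-node chain: node 0 -- node 1 -- node 2.
edges = [(0, 1), (1, 2)]
# Nodes 0 and 1 are both non-zero but very different in size; node 2 is zero.
beta = [5.0, 0.5, 0.0]

print(magnitude_penalty(beta, edges))  # charges the 5.0 vs 0.5 gap heavily
print(support_penalty(beta, edges))    # charges only the non-zero/zero edge
```

On this toy input the magnitude-based penalty is dominated by the edge between nodes 0 and 1, while the support-based penalty ignores that edge entirely and is driven by the edge between nodes 1 and 2. The surrogate is non-convex, which is consistent with the abstract's mention of a non-convex penalty, but the exact form used by the authors should be taken from the publication itself.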

SUBMITTER: Kim S 

PROVIDER: S-EPMC4007772 | biostudies-literature | 2013 Sep

REPOSITORIES: biostudies-literature


Publications

Network-based penalized regression with application to genomic data.

Sunkyung Kim, Wei Pan, Xiaotong Shen

Biometrics, 2013-07-03, issue 3



Similar Datasets

| S-EPMC3232376 | biostudies-literature
| S-EPMC3338337 | biostudies-literature
2015-08-04 | GSE71669 | GEO
2015-08-04 | E-GEOD-71669 | biostudies-arrayexpress
2015-08-04 | GSE71576 | GEO
2015-08-04 | GSE71666 | GEO
| S-EPMC4143805 | biostudies-literature
| S-EPMC4672920 | biostudies-literature
2015-08-04 | E-GEOD-71576 | biostudies-arrayexpress
2015-08-04 | E-GEOD-71666 | biostudies-arrayexpress