Unknown

Dataset Information

0

It's all relative: Regression analysis with compositional predictors.


ABSTRACT: Compositional data reside in a simplex and measure fractions or proportions of parts to a whole. Most existing regression methods for such data rely on log-ratio transformations that are inadequate or inappropriate in modeling high-dimensional data with excessive zeros and hierarchical structures. Moreover, such models usually lack a straightforward interpretation due to the interrelation between parts of a composition. We develop a novel relative-shift regression framework that directly uses proportions as predictors. The new framework provides a paradigm shift for regression analysis with compositional predictors and offers a superior interpretation of how shifting concentration between parts affects the response. New equi-sparsity and tree-guided regularization methods and an efficient smoothing proximal gradient algorithm are developed to facilitate feature aggregation and dimension reduction in regression. A unified finite-sample prediction error bound is derived for the proposed regularized estimators. We demonstrate the efficacy of the proposed methods in extensive simulation studies and a real gut microbiome study. Guided by the taxonomy of the microbiome data, the framework identifies important taxa at different taxonomic levels associated with the neurodevelopment of preterm infants.

SUBMITTER: Li G 

PROVIDER: S-EPMC9767704 | biostudies-literature | 2023 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

It's all relative: Regression analysis with compositional predictors.

Li Gen G   Li Yan Y   Chen Kun K  

Biometrics 20220711 2


Compositional data reside in a simplex and measure fractions or proportions of parts to a whole. Most existing regression methods for such data rely on log-ratio transformations that are inadequate or inappropriate in modeling high-dimensional data with excessive zeros and hierarchical structures. Moreover, such models usually lack a straightforward interpretation due to the interrelation between parts of a composition. We develop a novel relative-shift regression framework that directly uses pr  ...[more]

Similar Datasets

| S-EPMC8218926 | biostudies-literature
| S-EPMC9387759 | biostudies-literature
| S-EPMC7831267 | biostudies-literature
| S-EPMC7410344 | biostudies-literature
| S-EPMC11017126 | biostudies-literature
| S-EPMC7868056 | biostudies-literature
| S-EPMC10201722 | biostudies-literature
| S-EPMC10361693 | biostudies-literature
| S-EPMC8216648 | biostudies-literature
| S-EPMC3842620 | biostudies-literature