
Dataset Information


RFtest: A Robust and Flexible Community-Level Test for Microbiome Data Powerfully Detects Phylogenetically Clustered Signals.


ABSTRACT: Random forest is considered one of the most successful machine learning algorithms and has been widely used to construct microbiome-based predictive models. However, its use as a statistical testing method has not been explored. In this study, we propose "Random Forest Test" (RFtest), a global (community-level) test based on random forest for high-dimensional and phylogenetically structured microbiome data. RFtest is a permutation test that uses the generalization error of random forest as the test statistic. Our simulations demonstrate that RFtest has controlled type I error rates, that its power is superior to competing methods for phylogenetically clustered signals, and that it is robust to outliers and adaptive to interaction effects and non-linear associations. Finally, we apply RFtest to two real microbiome datasets to ascertain whether the microbial communities are associated with the outcome variables.
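The general idea of a permutation test built on a classifier's generalization error can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' RFtest implementation: the function names, the toy data, and the number of permutations are hypothetical, and a leave-one-out nearest-centroid error stands in for the random forest error so that the sketch runs with the Python standard library alone. With scikit-learn installed, one could plausibly pass an error function based on `RandomForestClassifier(oob_score=True)` and its `oob_score_` attribute instead.

```python
import random


def permutation_p_value(X, y, error_fn, n_perm=199, seed=0):
    """One-sided permutation p-value for 'the error is smaller than chance'.

    error_fn(X, y) returns an estimate of generalization error. Labels are
    shuffled to break any association between X and y, and the p-value is the
    share of permutations whose error is at most the observed error.
    """
    rng = random.Random(seed)
    observed = error_fn(X, y)
    y_perm = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(y_perm)
        if error_fn(X, y_perm) <= observed:
            hits += 1
    # The +1 terms count the observed labelling as one of the permutations.
    return (hits + 1) / (n_perm + 1)


def loo_nearest_centroid_error(X, y):
    """Stand-in error estimate (assumption): leave-one-out misclassification
    rate of a nearest-centroid classifier. RFtest itself uses random forest.
    """
    n = len(y)
    errors = 0
    for i in range(n):
        sums, counts = {}, {}
        for j in range(n):
            if j == i:
                continue  # hold out sample i
            s = sums.setdefault(y[j], [0.0] * len(X[j]))
            for k, v in enumerate(X[j]):
                s[k] += v
            counts[y[j]] = counts.get(y[j], 0) + 1

        def sq_dist(c):
            return sum((X[i][k] - sums[c][k] / counts[c]) ** 2
                       for k in range(len(X[i])))

        predicted = min(sums, key=sq_dist)
        errors += predicted != y[i]
    return errors / n


# Hypothetical toy data: 30 samples, 3 features, only feature 0 carries signal.
data_rng = random.Random(1)
y = [0] * 15 + [1] * 15
X = [[data_rng.gauss(4.0 * label, 1.0),
      data_rng.gauss(0.0, 1.0),
      data_rng.gauss(0.0, 1.0)] for label in y]

p_signal = permutation_p_value(X, y, loo_nearest_centroid_error)
print(f"permutation p-value: {p_signal:.3f}")
```

Because the statistic is recomputed on every shuffled labelling, the null distribution is obtained empirically rather than from a parametric assumption; the cost is one model evaluation per permutation.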

SUBMITTER: Zhang L 

PROVIDER: S-EPMC8819960 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature


Publications

RFtest: A Robust and Flexible Community-Level Test for Microbiome Data Powerfully Detects Phylogenetically Clustered Signals.

Zhang Lujun, Wang Yanshan, Chen Jingwen, Chen Jun

Frontiers in Genetics, 2022-01-24



Similar Datasets

| S-EPMC4992081 | biostudies-literature
| S-EPMC3154466 | biostudies-other
| S-EPMC11887327 | biostudies-literature
| S-EPMC107003 | biostudies-literature
| S-EPMC2842741 | biostudies-literature
| S-EPMC5592124 | biostudies-literature
| S-EPMC5142122 | biostudies-literature
| S-EPMC4214555 | biostudies-literature
| S-EPMC3390866 | biostudies-other
| S-EPMC9996644 | biostudies-literature