Unknown

Dataset Information

0

kernInt: A Kernel Framework for Integrating Supervised and Unsupervised Analyses in Spatio-Temporal Metagenomic Datasets.


ABSTRACT: The advent of next-generation sequencing technologies allowed relative quantification of microbiome communities and their spatial and temporal variation. In recent years, supervised learning (i.e., prediction of a phenotype of interest) from taxonomic abundances has become increasingly common in the microbiome field. However, a gap exists between supervised and classical unsupervised analyses, based on computing ecological dissimilarities for visualization or clustering. Despite this, both approaches face common challenges, like the compositional nature of next-generation sequencing data or the integration of the spatial and temporal dimensions. Here we propose a kernel framework to place on a common ground the unsupervised and supervised microbiome analyses, including the retrieval of microbial signatures (taxa importances). We define two compositional kernels (Aitchison-RBF and compositional linear) and discuss how to transform non-compositional beta-dissimilarity measures into kernels. Spatial data is integrated with multiple kernel learning, while longitudinal data is evaluated by specific kernels. We illustrate our framework through a single point soil dataset, a human dataset with a spatial component, and a previously unpublished longitudinal dataset concerning pig production. The proposed framework and the case studies are freely available in the kernInt package at https://github.com/elies-ramon/kernInt.

SUBMITTER: Ramon E 

PROVIDER: S-EPMC7876079 | biostudies-literature | 2021

REPOSITORIES: biostudies-literature

altmetric image

Publications

<i>kernInt</i>: A Kernel Framework for Integrating Supervised and Unsupervised Analyses in Spatio-Temporal Metagenomic Datasets.

Ramon Elies E   Belanche-Muñoz Lluís L   Molist Francesc F   Quintanilla Raquel R   Perez-Enciso Miguel M   Ramayo-Caldas Yuliaxis Y  

Frontiers in microbiology 20210128


The advent of next-generation sequencing technologies allowed relative quantification of microbiome communities and their spatial and temporal variation. In recent years, supervised learning (i.e., prediction of a phenotype of interest) from taxonomic abundances has become increasingly common in the microbiome field. However, a gap exists between supervised and classical unsupervised analyses, based on computing ecological dissimilarities for visualization or clustering. Despite this, both appro  ...[more]

Similar Datasets

| S-EPMC10939423 | biostudies-literature
| S-EPMC7397036 | biostudies-literature
| S-EPMC3057986 | biostudies-literature
| S-EPMC5098425 | biostudies-literature
| S-EPMC4641738 | biostudies-literature
| S-EPMC7990963 | biostudies-literature
| S-EPMC7338024 | biostudies-literature
| S-EPMC10785655 | biostudies-literature
| S-EPMC6363393 | biostudies-literature
| S-EPMC9746591 | biostudies-literature