Unknown

Dataset Information

0

Scalable in-memory processing of omics workflows.


ABSTRACT: We present a proof of concept implementation of the in-memory computing paradigm that we use to facilitate the analysis of metagenomic sequencing reads. In doing so we compare the performance of POSIX™file systems and key-value storage for omics data, and we show the potential for integrating high-performance computing (HPC) and cloud native technologies. We show that in-memory key-value storage offers possibilities for improved handling of omics data through more flexible and faster data processing. We envision fully containerized workflows and their deployment in portable micro-pipelines with multiple instances working concurrently with the same distributed in-memory storage. To highlight the potential usage of this technology for event driven and real-time data processing, we use a biological case study focused on the growing threat of antimicrobial resistance (AMR). We develop a workflow encompassing bioinformatics and explainable machine learning (ML) to predict life expectancy of a population based on the microbiome of its sewage while providing a description of AMR contribution to the prediction. We propose that in future, performing such analyses in 'real-time' would allow us to assess the potential risk to the population based on changes in the AMR profile of the community.

SUBMITTER: Elisseev V 

PROVIDER: S-EPMC9052061 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

Scalable in-memory processing of omics workflows.

Elisseev Vadim V   Gardiner Laura-Jayne LJ   Krishna Ritesh R  

Computational and structural biotechnology journal 20220420


We present a proof of concept implementation of the in-memory computing paradigm that we use to facilitate the analysis of metagenomic sequencing reads. In doing so we compare the performance of POSIX™file systems and key-value storage for omics data, and we show the potential for integrating high-performance computing (HPC) and cloud native technologies. We show that in-memory key-value storage offers possibilities for improved handling of omics data through more flexible and faster data proces  ...[more]

Similar Datasets

| S-EPMC5488373 | biostudies-literature
| S-EPMC8514239 | biostudies-literature
| S-EPMC8299072 | biostudies-literature
| S-EPMC3431220 | biostudies-literature
| S-EPMC7479590 | biostudies-literature
| S-EPMC8837709 | biostudies-literature
| S-EPMC11832004 | biostudies-literature
| S-EPMC8828470 | biostudies-literature
| S-EPMC4638048 | biostudies-literature
| S-EPMC8247461 | biostudies-literature