Unknown

Dataset Information

0

MEDUSA: A Pipeline for Sensitive Taxonomic Classification and Flexible Functional Annotation of Metagenomic Shotgun Sequences.


ABSTRACT: Metagenomic studies unravel details about the taxonomic composition and the functions performed by microbial communities. As a complete metagenomic analysis requires different tools for different purposes, the selection and setup of these tools remain challenging. Furthermore, the chosen toolset will affect the accuracy, the formatting, and the functional identifiers reported in the results, impacting the results interpretation and the biological answer obtained. Thus, we surveyed state-of-the-art tools available in the literature, created simulated datasets, and performed benchmarks to design a sensitive and flexible metagenomic analysis pipeline. Here we present MEDUSA, an efficient pipeline to conduct comprehensive metagenomic analyses. It performs preprocessing, assembly, alignment, taxonomic classification, and functional annotation on shotgun data, supporting user-built dictionaries to transfer annotations to any functional identifier. MEDUSA includes several tools, as fastp, Bowtie2, DIAMOND, Kaiju, MEGAHIT, and a novel tool implemented in Python to transfer annotations to BLAST/DIAMOND alignment results. These tools are installed via Conda, and the workflow is managed by Snakemake, easing the setup and execution. Compared with MEGAN 6 Community Edition, MEDUSA correctly identifies more species, especially the less abundant, and is more suited for functional analysis using Gene Ontology identifiers.

SUBMITTER: Morais DAA 

PROVIDER: S-EPMC8940201 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

MEDUSA: A Pipeline for Sensitive Taxonomic Classification and Flexible Functional Annotation of Metagenomic Shotgun Sequences.

Morais Diego A A DAA   Cavalcante João V F JVF   Monteiro Shênia S SS   Pasquali Matheus A B MAB   Dalmolin Rodrigo J S RJS  

Frontiers in genetics 20220307


Metagenomic studies unravel details about the taxonomic composition and the functions performed by microbial communities. As a complete metagenomic analysis requires different tools for different purposes, the selection and setup of these tools remain challenging. Furthermore, the chosen toolset will affect the accuracy, the formatting, and the functional identifiers reported in the results, impacting the results interpretation and the biological answer obtained. Thus, we surveyed state-of-the-a  ...[more]

Similar Datasets

| S-EPMC3152360 | biostudies-literature
| S-EPMC5131823 | biostudies-literature
| S-EPMC11865343 | biostudies-literature
| S-EPMC7579964 | biostudies-literature
| S-EPMC9749362 | biostudies-literature
| S-EPMC2957682 | biostudies-literature
| S-EPMC8921650 | biostudies-literature
| S-EPMC9026403 | biostudies-literature
| S-EPMC3953531 | biostudies-literature
2021-07-26 | E-MTAB-9191 | biostudies-arrayexpress