Dataset Information


Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes.

ABSTRACT: Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or 'bin' sequences assembled from metagenomic reads into individual groups. Metagenomic binning without consideration of reference sequences enables the comprehensive discovery of new microbial organisms and aids in the microbial genome reconstruction process. Here we present MyCC, an automated binning tool that combines genomic signatures, marker genes and optional contig coverages within one or multiple samples, in order to visualize the metagenomes and to identify the reconstructed genomic fragments. We demonstrate the superior performance of MyCC compared to other binning tools including CONCOCT, GroopM, MaxBin and MetaBAT on both synthetic and real human gut communities with a small sample size (one to 11 samples), as well as on a large metagenome dataset (over 250 samples). Moreover, we demonstrate the visualization of metagenomes in MyCC to aid in the reconstruction of genomes from distinct bins. MyCC is freely available at http://sourceforge.net/projects/sb2nhri/files/MyCC/.

PROVIDER: S-EPMC4828714 | BioStudies |

REPOSITORIES: biostudies

Similar Datasets

| S-EPMC6873667 | BioStudies
| S-EPMC5751405 | BioStudies
| S-EPMC6821242 | BioStudies
| S-EPMC4129434 | BioStudies
| S-EPMC8296540 | BioStudies
| S-EPMC3514610 | BioStudies
| S-EPMC5414344 | BioStudies
| S-EPMC6889603 | BioStudies
| S-EPMC5706307 | BioStudies
| S-EPMC6662567 | BioStudies