Unknown

Dataset Information

0

MetaSim: a sequencing simulator for genomics and metagenomics.


ABSTRACT: BACKGROUND: The new research field of metagenomics is providing exciting insights into various, previously unclassified ecological systems. Next-generation sequencing technologies are producing a rapid increase of environmental data in public databases. There is great need for specialized software solutions and statistical methods for dealing with complex metagenome data sets. METHODOLOGY/PRINCIPAL FINDINGS: To facilitate the development and improvement of metagenomic tools and the planning of metagenomic projects, we introduce a sequencing simulator called MetaSim. Our software can be used to generate collections of synthetic reads that reflect the diverse taxonomical composition of typical metagenome data sets. Based on a database of given genomes, the program allows the user to design a metagenome by specifying the number of genomes present at different levels of the NCBI taxonomy, and then to collect reads from the metagenome using a simulation of a number of different sequencing technologies. A population sampler optionally produces evolved sequences based on source genomes and a given evolutionary tree. CONCLUSIONS/SIGNIFICANCE: MetaSim allows the user to simulate individual read datasets that can be used as standardized test scenarios for planning sequencing projects or for benchmarking metagenomic software.

SUBMITTER: Richter DC 

PROVIDER: S-EPMC2556396 | biostudies-literature | 2008

REPOSITORIES: biostudies-literature

altmetric image

Publications

MetaSim: a sequencing simulator for genomics and metagenomics.

Richter Daniel C DC   Ott Felix F   Auch Alexander F AF   Schmid Ramona R   Huson Daniel H DH  

PloS one 20081008 10


<h4>Background</h4>The new research field of metagenomics is providing exciting insights into various, previously unclassified ecological systems. Next-generation sequencing technologies are producing a rapid increase of environmental data in public databases. There is great need for specialized software solutions and statistical methods for dealing with complex metagenome data sets.<h4>Methodology/principal findings</h4>To facilitate the development and improvement of metagenomic tools and the  ...[more]

Similar Datasets

| S-EPMC3790878 | biostudies-literature
| S-EPMC4168713 | biostudies-literature
| S-EPMC6588853 | biostudies-other
| S-EPMC6129308 | biostudies-literature
| S-EPMC3278762 | biostudies-literature
2019-01-31 | PXD010137 | Pride
| S-EPMC2911387 | biostudies-literature
| S-EPMC5869506 | biostudies-literature
| S-EPMC6414365 | biostudies-literature
| S-EPMC6873395 | biostudies-literature