Unknown

Dataset Information

0

EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data.


ABSTRACT: Recovery of ribosomal small subunit genes by assembly of short read community DNA sequence data generally fails, making taxonomic characterization difficult. Here, we solve this problem with a novel iterative method, based on the expectation maximization algorithm, that reconstructs full-length small subunit gene sequences and provides estimates of relative taxon abundances. We apply the method to natural and simulated microbial communities, and correctly recover community structure from known and previously unreported rRNA gene sequences. An implementation of the method is freely available at https://github.com/csmiller/EMIRGE.

SUBMITTER: Miller CS 

PROVIDER: S-EPMC3219967 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data.

Miller Christopher S CS   Baker Brett J BJ   Thomas Brian C BC   Singer Steven W SW   Banfield Jillian F JF  

Genome biology 20110519 5


Recovery of ribosomal small subunit genes by assembly of short read community DNA sequence data generally fails, making taxonomic characterization difficult. Here, we solve this problem with a novel iterative method, based on the expectation maximization algorithm, that reconstructs full-length small subunit gene sequences and provides estimates of relative taxon abundances. We apply the method to natural and simulated microbial communities, and correctly recover community structure from known a  ...[more]

Similar Datasets

2021-10-11 | GSE158985 | GEO
| PRJEB17706 | ENA
| S-EPMC4915659 | biostudies-literature
| S-EPMC3566076 | biostudies-literature
| S-EPMC5657049 | biostudies-literature
| S-EPMC5132105 | biostudies-literature