Unknown

Dataset Information

0

Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering.


ABSTRACT:

Motivation

With the advancements of next-generation sequencing technology, it is now possible to study samples directly obtained from the environment. Particularly, 16S rRNA gene sequences have been frequently used to profile the diversity of organisms in a sample. However, such studies are still taxed to determine both the number of operational taxonomic units (OTUs) and their relative abundance in a sample.

Results

To address these challenges, we propose an unsupervised Bayesian clustering method termed Clustering 16S rRNA for OTU Prediction (CROP). CROP can find clusters based on the natural organization of data without setting a hard cut-off threshold (3%/5%) as required by hierarchical clustering methods. By applying our method to several datasets, we demonstrate that CROP is robust against sequencing errors and that it produces more accurate results than conventional hierarchical clustering methods.

Availability and implementation

Source code freely available at the following URL: http://code.google.com/p/crop-tingchenlab/, implemented in C++ and supported on Linux and MS Windows.

SUBMITTER: Hao X 

PROVIDER: S-EPMC3042185 | biostudies-literature | 2011 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering.

Hao Xiaolin X   Jiang Rui R   Chen Ting T  

Bioinformatics (Oxford, England) 20110113 5


<h4>Motivation</h4>With the advancements of next-generation sequencing technology, it is now possible to study samples directly obtained from the environment. Particularly, 16S rRNA gene sequences have been frequently used to profile the diversity of organisms in a sample. However, such studies are still taxed to determine both the number of operational taxonomic units (OTUs) and their relative abundance in a sample.<h4>Results</h4>To address these challenges, we propose an unsupervised Bayesian  ...[more]

Similar Datasets

| S-EPMC4593230 | biostudies-literature
| S-EPMC3599145 | biostudies-literature
| S-EPMC4274427 | biostudies-literature
| S-EPMC6422886 | biostudies-literature
| S-EPMC3641076 | biostudies-literature
| S-EPMC6476724 | biostudies-literature
| S-EPMC8925046 | biostudies-literature
| S-EPMC6134716 | biostudies-literature
| S-EPMC3742672 | biostudies-literature
| S-EPMC10164572 | biostudies-literature