Dataset Information

From genus to phylum: large-subunit and internal transcribed spacer rRNA operon regions show similar classification accuracies influenced by database composition.

ABSTRACT: We compared the classification accuracy of two sections of the fungal internal transcribed spacer (ITS) region, individually and combined, and the 5' section (about 600 bp) of the large-subunit rRNA (LSU), using a naive Bayesian classifier and BLASTN. A hand-curated ITS-LSU training set of 1,091 sequences and a larger training set of 8,967 ITS region sequences were used. Of the factors evaluated, database composition and quality had the largest effect on classification accuracy, followed by fragment size and use of a bootstrap cutoff to improve classification confidence. The naive Bayesian classifier and BLASTN gave similar results at higher taxonomic levels, but the classifier was faster and more accurate at the genus level when a bootstrap cutoff was used. All of the ITS and LSU sections performed well (>97.7% accuracy) at higher taxonomic ranks from kingdom to family, and differences between them were small at the genus level (within 0.66 to 1.23%). When full-length sequence sections were used, the LSU outperformed the ITS1 and ITS2 fragments at the genus level, but the ITS1 and ITS2 showed higher accuracy when smaller fragment sizes of the same length and a 50% bootstrap cutoff were used. In a comparison using the larger ITS training set, ITS1 and ITS2 had very similar accuracy classification for fragments between 100 and 200 bp. Collectively, the results show that any of the ITS or LSU sections we tested provided comparable classification accuracy to the genus level and underscore the need for larger and more diverse classification training sets.

SUBMITTER: Porras-Alfaro A

PROVIDER: S-EPMC3911224 | biostudies-literature | 2014 Feb

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

From genus to phylum: large-subunit and internal transcribed spacer rRNA operon regions show similar classification accuracies influenced by database composition.

Porras-Alfaro Andrea A Liu Kuan-Liang KL Kuske Cheryl R CR Xie Gary G

Applied and environmental microbiology 20131115 3

We compared the classification accuracy of two sections of the fungal internal transcribed spacer (ITS) region, individually and combined, and the 5' section (about 600 bp) of the large-subunit rRNA (LSU), using a naive Bayesian classifier and BLASTN. A hand-curated ITS-LSU training set of 1,091 sequences and a larger training set of 8,967 ITS region sequences were used. Of the factors evaluated, database composition and quality had the largest effect on classification accuracy, followed by frag ...[more]

PMID: 24242255

Dataset Information

From genus to phylum: large-subunit and internal transcribed spacer rRNA operon regions show similar classification accuracies influenced by database composition.

Publications

From genus to phylum: large-subunit and internal transcribed spacer rRNA operon regions show similar classification accuracies influenced by database composition.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Internal transcribed spacer regions of rRNA genes of Pneumocystis carinii from monkeys.
| S-EPMC96091 | biostudies-literature

Genotyping of Pneumocystis jirovecii by Use of a New Simplified Nomenclature System Based on the Internal Transcribed Spacer Regions and 5.8S rRNA Gene of the rRNA Operon.
| S-EPMC6535588 | biostudies-literature

Analysis of the 16S-23S rRNA gene internal transcribed spacer region in Klebsiella species.
| S-EPMC2576583 | biostudies-literature

High-resolution differentiation of Cyanobacteria by using rRNA-internal transcribed spacer denaturing gradient gel electrophoresis.
| S-EPMC262283 | biostudies-literature

Differentiation Among <i>Rodentibacter</i> Species Based on 16S-23S rRNA Internal Transcribed Spacer Analysis.
| S-EPMC7754199 | biostudies-literature

Internal transcribed spacer rRNA gene sequencing analysis of fungal diversity in Kansas City indoor environments.
| S-EPMC3966654 | biostudies-literature

Improved resolution of bacteria by high throughput sequence analysis of the rRNA internal transcribed spacer.
| S-EPMC4160368 | biostudies-literature

Diverse and unique picocyanobacteria in Chesapeake Bay, revealed by 16S-23S rRNA internal transcribed spacer sequences.
| S-EPMC1393199 | biostudies-literature

Metagenomic data of fungal internal transcribed Spacer and 18S rRNA gene sequences from Lonar lake sediment, India.
| S-EPMC4510552 | biostudies-literature

Bioprospection of Basidiomycetes and molecular phylogenetic analysis using internal transcribed spacer (ITS) and 5.8S rRNA gene sequence.
| S-EPMC6048145 | biostudies-literature