Unknown

Dataset Information

0

Binpairs: utilization of Illumina paired-end information for improving efficiency of taxonomic binning of metagenomic sequences.


ABSTRACT:

Motivation

Paired-end sequencing protocols, offered by next generation sequencing (NGS) platforms like Illumia, generate a pair of reads for every DNA fragment in a sample. Although this protocol has been utilized for several metagenomics studies, most taxonomic binning approaches classify each of the reads (forming a pair), independently. The present work explores some simple but effective strategies of utilizing pairing-information of Illumina short reads for improving the accuracy of taxonomic binning of metagenomic datasets. The strategies proposed can be used in conjunction with all genres of existing binning methods.

Results

Validation results suggest that employment of these "Binpairs" strategies can provide significant improvements in the binning outcome. The quality of the taxonomic assignments thus obtained are often comparable to those that can only be achieved with relatively longer reads obtained using other NGS platforms (such as Roche).

Availability

An implementation of the proposed strategies of utilizing pairing information is freely available for academic users at https://metagenomics.atc.tcs.com/binning/binpairs.

SUBMITTER: Dutta A 

PROVIDER: S-EPMC4281075 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Binpairs: utilization of Illumina paired-end information for improving efficiency of taxonomic binning of metagenomic sequences.

Dutta Anirban A   Tandon Disha D   Mohammed M H MH   Bose Tungadri T   Mande Sharmila S SS  

PloS one 20141231 12


<h4>Motivation</h4>Paired-end sequencing protocols, offered by next generation sequencing (NGS) platforms like Illumia, generate a pair of reads for every DNA fragment in a sample. Although this protocol has been utilized for several metagenomics studies, most taxonomic binning approaches classify each of the reads (forming a pair), independently. The present work explores some simple but effective strategies of utilizing pairing-information of Illumina short reads for improving the accuracy of  ...[more]

Similar Datasets

| S-EPMC3471323 | biostudies-literature
| S-EPMC3483553 | biostudies-literature
| S-EPMC8097841 | biostudies-literature
| S-EPMC7071698 | biostudies-literature
| S-EPMC6393434 | biostudies-literature
| S-EPMC4023940 | biostudies-literature
| S-EPMC6330020 | biostudies-literature
| S-EPMC4545970 | biostudies-literature