Dataset Information


An optimistic protein assembly from sequence reads salvaged an uncharacterized segment of mouse picobirnavirus.

ABSTRACT: Advances in next-generation sequencing technologies have enabled the generation of millions of sequences from microorganisms. However, distinguishing the sequence of a novel species from sequencing errors remains a technical challenge when the novel species is highly divergent from the closest known species. To address this problem, we developed a new method called Optimistic Protein Assembly from Reads (OPAR). This method is based on the assumption that protein sequences can be more conserved than the nucleotide sequences encoding them. By combining metagenomics, bioinformatics and conventional Sanger sequencing, our method successfully identified all coding regions of the mouse picobirnavirus for the first time. The salvaged sequences indicated that segment 1 of this virus was more divergent from its homologues in other Picobirnaviridae species than segment 2. For this reason, only segment 2 of mouse picobirnavirus had been detected in previous studies. The OPAR web tool is available at http://bioinformatics.czc.hokudai.ac.jp/opar/.

SUBMITTER: Gonzalez G 

PROVIDER: S-EPMC5223137 | BioStudies | 2017-01-01


REPOSITORIES: biostudies

Similar Datasets

| S-EPMC6316005 | BioStudies
2012-01-01 | S-EPMC3372223 | BioStudies
2011-01-01 | S-EPMC3077240 | BioStudies
2020-01-01 | S-EPMC7102571 | BioStudies
2016-01-01 | S-EPMC7127629 | BioStudies
2014-01-01 | S-EPMC4202271 | BioStudies
2018-01-01 | S-EPMC6142670 | BioStudies
| PRJNA788895 | ENA
2010-01-01 | S-EPMC2863890 | BioStudies
2009-01-01 | S-EPMC2693148 | BioStudies