Unknown

Dataset Information

0

Illumina reads correction: evaluation and improvements.


ABSTRACT: The paper focuses on the correction of Illumina WGS sequencing reads. We provide an extensive evaluation of the existing correctors. To this end, we measure an impact of the correction on variant calling (VC) as well as de novo assembly. It shows, that in selected cases read correction improves the VC results quality. We also examine the algorithms behaviour in a processing of Illumina NovaSeq reads, with different reads quality characteristics than in older sequencers. We show that most of the algorithms are ready to cope with such reads. Finally, we introduce a new version of RECKONER, our read corrector, by optimizing it and equipping with a new correction strategy. Currently, RECKONER allows to correct high-coverage human reads in less than 2.5 h, is able to cope with two types of reads errors: indels and substitutions, and utilizes a new, based on a two lengths of oligomers, correction verification technique.

SUBMITTER: Dlugosz M 

PROVIDER: S-EPMC11222498 | biostudies-literature | 2024 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Illumina reads correction: evaluation and improvements.

Długosz Maciej M   Deorowicz Sebastian S  

Scientific reports 20240126 1


The paper focuses on the correction of Illumina WGS sequencing reads. We provide an extensive evaluation of the existing correctors. To this end, we measure an impact of the correction on variant calling (VC) as well as de novo assembly. It shows, that in selected cases read correction improves the VC results quality. We also examine the algorithms behaviour in a processing of Illumina NovaSeq reads, with different reads quality characteristics than in older sequencers. We show that most of the  ...[more]

Similar Datasets

| S-EPMC4615873 | biostudies-literature
| S-EPMC3822393 | biostudies-literature
| S-EPMC4471408 | biostudies-literature
| S-EPMC5563063 | biostudies-other
| S-EPMC4191382 | biostudies-literature
| S-EPMC6362602 | biostudies-literature
| S-EPMC7071698 | biostudies-literature
| S-EPMC5097354 | biostudies-literature
| S-EPMC2610436 | biostudies-literature
| S-EPMC7671326 | biostudies-literature