Transcriptomics

Dataset Information

Sequencing artifacts produced by mispriming during reverse transcription in multiple RNA-seq technologies


ABSTRACT: The quality of RNA sequencing data relies on specific priming by the primer used for reverse transcription (RT-primer). Non-specific annealing of the RT-primer to the RNA template (RT mispriming) can generate reads with incorrect cDNA ends and can cause misinterpretation of data. This kind of artifact in RNA-seq-based technologies is underappreciated, and currently no adequate tools exist to computationally remove such artifacts from published datasets. We show that mispriming can occur with as few as 2 bases of complementarity at the 3' end of the primer followed by intermittent regions of complementarity. We propose an experimental solution to avoid RT mispriming by performing RNA-seq using thermostable group II intron-derived reverse transcriptase (TGIRT-seq).
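
The complementarity criterion described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' analysis and not a tool for filtering datasets; it only shows what "2 bases of complementarity at the 3' end of the primer" means for sequences. The function names, the default `k = 2`, and the example primer are hypothetical, and the template is assumed to be given 5'->3' in the DNA alphabet, starting immediately downstream of the apparent cDNA end.

```python
# Hypothetical illustration only: names and sequences are assumptions,
# not part of the dataset or the associated study's pipeline.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def primer_3prime_match(primer: str, downstream: str, k: int = 2) -> bool:
    """True if the first k template bases downstream of the cDNA end
    could base-pair with the last k bases (3' end) of the RT-primer."""
    if k <= 0 or len(primer) < k or len(downstream) < k:
        return False
    site = reverse_complement(primer[-k:]).upper()
    return downstream[:k].upper() == site


def count_complementary_positions(primer: str, downstream: str) -> int:
    """Count positions where the template could pair with the primer,
    allowing the complementarity to be intermittent (gapless comparison)."""
    site = reverse_complement(primer).upper()
    return sum(a == b for a, b in zip(site, downstream.upper()))


if __name__ == "__main__":
    primer = "ACGTGA"          # made-up primer, written 5'->3'
    template = "TCAAAA"        # made-up template downstream of a read end
    print(primer_3prime_match(primer, template))
    print(count_complementary_positions(primer, template))
```

A site flagged this way is only a candidate: a 2-base match is expected by chance at roughly 1 in 16 positions, so this check alone cannot distinguish mispriming from genuine cDNA ends.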

ORGANISM(S): Homo sapiens

PROVIDER: GSE85163 | GEO | 2018/06/26

REPOSITORIES: GEO

Similar Datasets

2016-02-19 | GSE78059 | GEO
2016-02-19 | E-GEOD-78059 | biostudies-arrayexpress
2019-11-11 | GSE138200 | GEO
2019-07-09 | GSE130951 | GEO
2020-04-22 | GSE149061 | GEO
2016-09-16 | GSE84537 | GEO
2023-03-30 | GSE228595 | GEO
2020-05-09 | GSE149087 | GEO
| PRJNA386569 | ENA
2020-05-17 | GSE147103 | GEO