Genomics

Dataset Information

Bias in ligation-based small RNA sequencing library construction is determined by adaptor and RNA structure


ABSTRACT: High-throughput sequencing (HTS) has become a powerful tool for the detection and sequence characterization of microRNAs (miRNA) and other small RNAs (sRNA). Unfortunately, the use of HTS data to determine the relative quantity of different miRNAs in a sample has been shown to be inconsistent with quantitative PCR and Northern blot results. Several recent studies have concluded that the major contributor to this inconsistency is bias introduced during the construction of sRNA libraries for HTS, and that the bias derives primarily from the adaptor ligation steps, in which single-stranded adaptors are sequentially ligated to the 3' and 5' ends of sRNAs using T4 RNA ligases. In this study we investigated the effects of ligation bias by using a pool of randomized ligation substrates, defined mixtures of miRNA sequences, and several combinations of adaptors in HTS library construction. We show that, like the 3' adaptor ligation step, the 5' adaptor ligation is also biased, not because of primary sequence but because of the secondary structures of the two ligation substrates. We find that multiple secondary structural factors influence final representation in HTS results. Our results provide insight into the nature of ligation bias and allowed us to design adaptors that reduce ligation bias and produce HTS results that more accurately reflect the actual concentrations of miRNAs in the defined starting material.
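
ILLUSTRATION: The comparison the abstract describes, between the representation of each miRNA in HTS results and its known concentration in a defined mixture, can be sketched as a log2 ratio of observed to expected read fraction. This is a minimal sketch and not the authors' analysis pipeline: the function name `ligation_bias`, the sequence labels, and the read counts are hypothetical, and an equimolar input pool is assumed when no expected fractions are supplied.

```python
import math


def ligation_bias(read_counts, expected_fractions=None):
    """Return log2(observed fraction / expected fraction) per sequence.

    read_counts: dict mapping a sequence name to its read count.
    expected_fractions: optional dict of known input fractions;
        if omitted, an equimolar pool is assumed (assumption, not
        taken from the dataset record).
    """
    total = sum(read_counts.values())
    if total == 0:
        raise ValueError("read_counts contains no reads")

    n = len(read_counts)
    bias = {}
    for name, count in read_counts.items():
        expected = expected_fractions[name] if expected_fractions else 1.0 / n
        observed = count / total
        # A sequence with zero reads is infinitely under-represented.
        bias[name] = math.log2(observed / expected) if count > 0 else float("-inf")
    return bias


# Hypothetical counts for a four-member equimolar pool.
counts = {"miR-A": 400, "miR-B": 100, "miR-C": 200, "miR-D": 100}
for name, value in ligation_bias(counts).items():
    print(f"{name}\t{value:+.2f}")
```

A value of 0 means a sequence is represented in proportion to its input; positive values indicate over-representation and negative values under-representation. A set of adaptors that reduces ligation bias would be expected to pull these values toward 0.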

ORGANISM(S): synthetic construct

PROVIDER: GSE67053 | GEO | 2015/05/15

SECONDARY ACCESSION(S): PRJNA278810

REPOSITORIES: GEO

Similar Datasets

2015-05-15 | E-GEOD-67053 | biostudies-arrayexpress
| PRJNA278810 | ENA
2014-06-24 | E-MTAB-2566 | biostudies-arrayexpress
2019-10-01 | GSE102510 | GEO
2014-02-20 | E-MTAB-2226 | biostudies-arrayexpress
2018-12-12 | GSE123627 | GEO
2021-06-12 | GSE177036 | GEO
2016-07-18 | PXD004105 | Pride
2022-10-22 | E-MTAB-12275 | biostudies-arrayexpress
2009-09-30 | GSE18031 | GEO