Metabolomics,Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Spliced synthetic genes as internal controls in RNA sequencing experiments.


ABSTRACT: RNA sequencing (RNAseq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNAseq analysis. We have developed a set of spike-in RNA standards, termed ‘sequins’ (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNAseq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome. Detailed transcriptomic analysis of a human cell-type with synthetic RNA spike-ins ('sequins'). Sequins were initially combined at equimolar concentrations (a "flat" mix) and sequenced neat (i.e. without any natural RNA added). We then prepared two staggered mixtures (Mix A & B) and sequenced them neat. Mix A was then spiked into total RNA extracted from the K562 cell-type. Finally, we prepared a staggered mixture of fusion sequins and sequenced it neat.

ORGANISM(S): Homo sapiens

SUBMITTER: Simon Hardwick 

PROVIDER: E-GEOD-77072 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

Similar Datasets

2016-07-21 | GSE77072 | GEO
| PRJEB45064 | ENA
2021-04-15 | GSE172118 | GEO
| PRJNA422917 | ENA
2022-11-21 | E-NASC-43 | biostudies-arrayexpress
2007-01-08 | GSE5739 | GEO
2019-01-01 | E-MTAB-7820 | biostudies-arrayexpress
2019-04-03 | GSE129240 | GEO
2017-07-19 | GSE98553 | GEO
2008-06-14 | E-GEOD-5739 | biostudies-arrayexpress