Metabolomics,Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Measure transcript integrity using RNA-seq data


ABSTRACT: Achieved biospecimens annotated with patient clinical characteristics are unique resources for translational research. However, RNA extracted from the achieved tissues is often degraded. RNA degradation can have a significant impact on the measure of transcript abundance that can lead to an increase rate of erroneous differentially expressed genes. Here, we are presenting the transcript integrity number (TIN) algorithm to measure the RNA degradation at transcript level. When applied to RNA-seq datasets generated from human brain Glioblastome cell line, human peripheral blood mononuclear cells, and metastatic castration resistant prostate cancer (mCRPC) clinical tissues, TIN provided a more reliable and more sensitive measure of RNA degradation than RIN, as demonstrated by much higher concordance with the RNA fragment size estimated from read pairs. More importantly, when comparing 10 mCRPC samples with lower RNA quality to another 10 samples with higher RNA quality, we demonstrated that calibrating gene quantification with TIN scores could mitigate RNA degradation effects and greatly improve gene expression analysis. The detected differentially expressed genes before TIN correction were predominantly ribosomal genes. However, when we adjusted gene quantifications with the corresponding TIN scores, we found differentially expressed genes were highly enriched in prostate cancer specific pathways. When further evaluating the performance of TIN correction using synthetic spike-in transcripts with predetermined abundance in RNA-seq data generated from Sequencing Control Consortium (SEQC), we found TIN adjustment had a better control of false positives and false negatives (sensitivity = 0.89, specificity = 0.91), as compared to gene expression analysis results without TIN correction (sensitivity =0.98, specificity = 0.50). RNA sequencing of 20 bone-metastatic castration resistant prostate cancer (mCRPC) using Illumina HiSeq 2500. Out of 20 mCRPC samples, 10 samples have relative low RNA integrity and another 10 samples have relative higher RNA integrity as measured by Agilent RIN score.

ORGANISM(S): Homo sapiens

SUBMITTER: LIGUO WANG 

PROVIDER: E-GEOD-70285 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

altmetric image

Publications


<h4>Background</h4>Stored biological samples with pathology information and medical records are invaluable resources for translational medical research. However, RNAs extracted from the archived clinical tissues are often substantially degraded. RNA degradation distorts the RNA-seq read coverage in a gene-specific manner, and has profound influences on whole-genome gene expression profiling.<h4>Result</h4>We developed the transcript integrity number (TIN) to measure RNA degradation. When applied  ...[more]

Similar Datasets

2016-02-19 | GSE70285 | GEO
2015-06-30 | E-GEOD-70380 | biostudies-arrayexpress
2016-03-03 | E-GEOD-71120 | biostudies-arrayexpress
2016-03-03 | E-GEOD-71119 | biostudies-arrayexpress
2015-11-19 | E-GEOD-73144 | biostudies-arrayexpress
2019-11-12 | PXD010845 | Pride
2015-11-04 | E-GEOD-66928 | biostudies-arrayexpress
2016-02-09 | E-GEOD-67667 | biostudies-arrayexpress
| EGAS00001006542 | EGA
2015-02-09 | E-GEOD-64927 | biostudies-arrayexpress