Dataset Information

Detecting, Categorizing, and Correcting Coverage Anomalies of RNA-Seq Quantification.


ABSTRACT: Because of incomplete reference transcriptomes, incomplete sequencing bias models, or other modeling defects, algorithms to infer isoform expression from RNA sequencing (RNA-seq) sometimes do not accurately model expression. We present a computational method to detect instances where a quantification algorithm could not completely explain the input reads. Our approach identifies regions where the read coverage significantly deviates from expectation. We call these regions "expression anomalies." We further present a method to attribute their cause to either the incompleteness of the reference transcriptome or algorithmic mistakes. We detect anomalies for 30 GEUVADIS and 16 Human Body Map samples. By correcting anomalies when possible, we reduce the number of falsely predicted instances of differential expression. Anomalies that cannot be corrected are suspected to indicate the existence of isoforms unannotated by the reference. We detected 88 common anomalies of this type and find that they tend to have a lower-than-expected coverage toward their 3' ends.
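The idea of flagging regions whose read coverage deviates from expectation can be illustrated with a small sketch. This is not the authors' method: it swaps the statistical significance test described in the abstract for a fixed threshold on the difference between observed and expected coverage shares. The function name `find_coverage_anomalies`, the `window` and `threshold` parameters, and the toy coverage values are all hypothetical, and positions are assumed to run from the 5' end to the 3' end of a transcript.

```python
from typing import List, Sequence, Tuple


def find_coverage_anomalies(
    observed: Sequence[float],
    expected: Sequence[float],
    window: int = 3,
    threshold: float = 0.2,
) -> List[Tuple[int, int, str]]:
    """Flag windows whose share of observed coverage differs from the
    expected share by more than `threshold` (illustrative stand-in for
    a significance test; all names and defaults are assumptions)."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    if window < 1:
        raise ValueError("window must be at least 1")

    obs_total = sum(observed)
    exp_total = sum(expected)
    if obs_total == 0 or exp_total == 0:
        return []  # nothing to compare without coverage

    anomalies = []
    for start in range(0, len(observed), window):
        end = min(start + window, len(observed))
        # Normalize to shares so that overall expression level cancels out
        obs_share = sum(observed[start:end]) / obs_total
        exp_share = sum(expected[start:end]) / exp_total
        diff = obs_share - exp_share
        if abs(diff) > threshold:
            kind = "over" if diff > 0 else "under"
            anomalies.append((start, end, kind))
    return anomalies


# Toy transcript: coverage drops toward the 3' end, expectation is uniform
observed = [10, 10, 10, 10, 10, 10, 1, 1, 1]
expected = [10] * 9
print(find_coverage_anomalies(observed, expected))
```

With these toy values only the last window is reported, as under-covered. Comparing shares rather than raw depths is a design choice made here so that a transcript that is simply more or less expressed than predicted is not flagged along its whole length.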

SUBMITTER: Ma C 

PROVIDER: S-EPMC6938679 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

Publications

Detecting, Categorizing, and Correcting Coverage Anomalies of RNA-Seq Quantification.

Ma C, Kingsford C

Cell Systems, 27 Nov 2019, issue 6


Similar Datasets

| S-EPMC8092017 | biostudies-literature
| S-EPMC5003039 | biostudies-literature
| S-EPMC3758689 | biostudies-literature
| S-EPMC3129672 | biostudies-literature
| S-EPMC4160374 | biostudies-literature
| S-EPMC5860083 | biostudies-literature
| S-EPMC4071332 | biostudies-literature
| S-EPMC4842274 | biostudies-literature
| S-EPMC5952827 | biostudies-other