Genomics

Dataset Information

0

Small RNA-seq of undiseased human brain


ABSTRACT: The surprising observation that virtually the entire human genome is transcribed means we know very little about the function of many emerging classes of RNAs, except their astounding diversity. Traditional RNA function prediction methods rely on sequence or alignment information, which are limited in their ability to classify classes of non-coding RNAs (ncRNAs). To address this, we developed CoRAL, a machine learning-based approach for classification of RNA molecules. CoRAL uses biologically interpretable features including fragment length, cleavage specificity, and antisense transcription to distinguish between different ncRNA classes. We evaluated CoRAL using genome-wide small RNA sequencing (smRNA-seq) datasets from two human tissue types (brain and skin [GSE31037]), and were able to classify six different types of RNA transcripts with 79~80% accuracy in cross-validation experiments, and with 71~73% accuracy when CoRAL uses one tissue type for training and the other as validation. Analysis by CoRAL revealed that long intergenic ncRNAs, small cytoplasmic RNAs, and small nuclear RNAs show more tissue specificity, while microRNAs, small nucleolar, and transposon-derived RNAs are highly discernible and consistent across the two tissue types. The ability to consistently annotate loci across tissue types demonstrates the potential of CoRAL to characterize ncRNAs using smRNA-seq data in less characterized organisms.

ORGANISM(S): Homo sapiens

PROVIDER: GSE43335 | GEO | 2013/07/07

SECONDARY ACCESSION(S): PRJNA185476

REPOSITORIES: GEO

Similar Datasets

2013-07-07 | E-GEOD-43335 | biostudies-arrayexpress
2015-10-31 | E-GEOD-66224 | biostudies-arrayexpress
2015-04-27 | E-GEOD-60400 | biostudies-arrayexpress
2014-01-29 | GSE45397 | GEO
2020-08-18 | GSE156205 | GEO
2015-04-27 | GSE60400 | GEO
2014-01-29 | GSE45398 | GEO
2012-03-31 | GSE36971 | GEO
2014-01-29 | E-GEOD-45398 | biostudies-arrayexpress
2012-03-31 | E-GEOD-36971 | biostudies-arrayexpress