Unknown

Dataset Information

0

Seq-ing improved gene expression estimates from microarrays using machine learning.


ABSTRACT: BACKGROUND:Quantifying gene expression by RNA-Seq has several advantages over microarrays, including greater dynamic range and gene expression estimates on an absolute, rather than a relative scale. Nevertheless, microarrays remain in widespread use, demonstrated by the ever-growing numbers of samples deposited in public repositories. RESULTS:We propose a novel approach to microarray analysis that attains many of the advantages of RNA-Seq. This method, called Machine Learning of Transcript Expression (MaLTE), leverages samples for which both microarray and RNA-Seq data are available, using a Random Forest to learn the relationship between the fluorescence intensity of sets of microarray probes and RNA-Seq transcript expression estimates. We trained MaLTE on data from the Genotype-Tissue Expression (GTEx) project, consisting of Affymetrix gene arrays and RNA-Seq from over 700 samples across a broad range of human tissues. CONCLUSION:This approach can be used to accurately estimate absolute expression levels from microarray data, at both gene and transcript level, which has not previously been possible. This methodology will facilitate re-analysis of archived microarray data and broaden the utility of the vast quantities of data still being generated.

SUBMITTER: Korir PK 

PROVIDER: S-EPMC4559919 | biostudies-literature | 2015 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Seq-ing improved gene expression estimates from microarrays using machine learning.

Korir Paul K PK   Geeleher Paul P   Seoighe Cathal C  

BMC bioinformatics 20150904


<h4>Background</h4>Quantifying gene expression by RNA-Seq has several advantages over microarrays, including greater dynamic range and gene expression estimates on an absolute, rather than a relative scale. Nevertheless, microarrays remain in widespread use, demonstrated by the ever-growing numbers of samples deposited in public repositories.<h4>Results</h4>We propose a novel approach to microarray analysis that attains many of the advantages of RNA-Seq. This method, called Machine Learning of T  ...[more]

Similar Datasets

| S-EPMC8113180 | biostudies-literature
| S-EPMC8631472 | biostudies-literature
| S-EPMC3229535 | biostudies-literature
| S-EPMC3063777 | biostudies-literature
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
| S-EPMC3404069 | biostudies-literature
| S-EPMC4766705 | biostudies-literature
| S-EPMC7644310 | biostudies-literature