Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

A merged microarray meta-dataset for studying transcriptome-wide changes during neoplastic progression of colorectal cancer


ABSTRACT: Transcriptional profiling of pre-malignant and malignant colorectal cancer lesions provides a means for temporally monitoring key molecular events underlying neoplastic progression. Unfortunately, the most widely used central dataset for colorectal cancer samples from The Cancer Genome Atlas (TCGA) does not contain adenoma samples, putting a greater reliance of in silico analyses and pre-clinical modelling on a handful of independent microarray experiments. Due to the differences in sample acquisition, preparation, downstream analysis and other parameters, results are often incongruent, hindering consensus building. Here, we developed a microarray meta-dataset consisting of 231 normal, 132 adenoma, and 342 colon cancer tissue samples (705 samples total) sourced from 12 independent microarray studies all using the Affymetrix HG U133 Plus 2.0 (GPL570) chip platform including GSE4183, GSE8671,GSE9348, GSE15960, GSE20916, GSE21510, GSE22598, GSE23194, GSE23878, GSE32323, GSE33113, and GSE37364. Individual datasets were pre-processed and normalized by frozen robust multiarray averaging (fRMA) before merging by matching probe sets. Batch effects were subsequently identified by Principal Component Analysis (PCA) and removed using ComBat. In addition, low variant probes were filtered from the meta-dataset before downstream analysis. Finally, biological signatures corresponding to cancer and adenoma samples were both quantitatively and functionally validated. Quantitative validation was performed by correlation analysis of LogFC values with the TCGA-COAD or other external GEO microarray datasets, respectively. Functional validation was carried out through predictive analyses using Ingenuity Pathway Analysis (IPA) and Gene Set Enrichment Analysis (GSEA). Overall, our meta-dataset provides a powerful tool for studying transcriptome-wide changes which occur during early dysplasia and malignant transformation of adenomas as well as colorectal cancer in general.

ORGANISM(S): Homo sapiens

SUBMITTER: Michael Rohr 

PROVIDER: E-MTAB-10089 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

Similar Datasets

2019-04-11 | E-MTAB-6690 | biostudies-arrayexpress
2019-04-11 | E-MTAB-6696 | biostudies-arrayexpress
2019-04-11 | E-MTAB-6697 | biostudies-arrayexpress
2019-04-11 | E-MTAB-6692 | biostudies-arrayexpress
2019-04-11 | E-MTAB-6694 | biostudies-arrayexpress
2019-04-11 | E-MTAB-6693 | biostudies-arrayexpress
2019-04-11 | E-MTAB-6695 | biostudies-arrayexpress
2019-04-11 | E-MTAB-6691 | biostudies-arrayexpress
2019-04-11 | E-MTAB-6699 | biostudies-arrayexpress
2019-04-11 | E-MTAB-6698 | biostudies-arrayexpress