Dataset Information


Association of Protein Translation and Extracellular Matrix Gene Sets with Breast Cancer Metastasis: Findings Uncovered on Analysis of Multiple Publicly Available Datasets Using Individual Patient Data Approach.

ABSTRACT: INTRODUCTION:Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. AIM:The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. METHODS:Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate - adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). RESULTS:Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. CONCLUSION:To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This methodology seems to yield new and interesting results and may be used as a tool to guide new research.

SUBMITTER: Chowdhury N 

PROVIDER: S-EPMC4469303 | BioStudies | 2015-01-01

REPOSITORIES: biostudies

Similar Datasets

2015-01-01 | S-EPMC4637883 | BioStudies
2008-01-01 | S-EPMC2563019 | BioStudies
2014-01-01 | S-EPMC4535974 | BioStudies
2014-01-01 | S-EPMC4101981 | BioStudies
2014-01-01 | S-EPMC4071199 | BioStudies
2012-01-01 | S-EPMC3446348 | BioStudies
2014-01-01 | S-EPMC4716673 | BioStudies
2020-01-01 | S-EPMC6930435 | BioStudies
2020-01-01 | S-EPMC7582264 | BioStudies
1000-01-01 | S-EPMC4653029 | BioStudies