Dataset Information

Breast cancer subtype predictors revisited: from consensus to concordance?

ABSTRACT:

Background

At the molecular level, breast cancer comprises a heterogeneous set of subtypes associated with clear differences in gene expression and clinical outcomes. Single sample predictors (SSPs) are built via a two-stage approach: samples are first clustered, and a subtype predictor is then constructed from the cluster labels of the individual cases. SSPs have been criticized because their subtype assignments for the same samples were only moderately concordant (Cohen's κ<0.6).

Methods

We propose a semi-supervised approach in which, for each of five datasets, a consensus set was constructed from those samples that were concordantly subtyped by a number of different predictors. Next, nine subtype predictors - three SSPs, three subtype classification models (SCMs) and three novel rule-based predictors based on the St. Gallen surrogate intrinsic subtype definitions (STGs) - were constructed on the five consensus sets and their associated consensus subtype labels. The predictors were validated on a compendium of over 4,000 uniformly preprocessed Affymetrix microarrays. Concordance between subtype predictors was assessed using Cohen's kappa statistic.
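The two building blocks named above, a consensus set and Cohen's kappa, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the toy labels, and the choice to require unanimous agreement for consensus membership are assumptions made for illustration (the abstract only says samples were "concordantly subtyped by a number of different predictors").

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two label sequences covering the same samples."""
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of samples given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of the marginal label frequencies, summed over labels.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[k] * freq_b[k] for k in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both predictors use a single identical label throughout
    return (observed - expected) / (1.0 - expected)


def consensus_set(predictions):
    """Return {sample index: label} for samples on which all predictors agree.

    `predictions` maps a (hypothetical) predictor name to its list of labels.
    Unanimity is an assumption of this sketch, not a detail from the abstract.
    """
    columns = list(predictions.values())
    return {
        i: labels[0]
        for i, labels in enumerate(zip(*columns))
        if len(set(labels)) == 1
    }


# Toy subtype calls from two hypothetical predictors on six samples.
pred_1 = ["LumA", "LumA", "Basal", "Her2", "LumB", "Basal"]
pred_2 = ["LumA", "LumB", "Basal", "Her2", "LumB", "Basal"]

kappa = cohen_kappa(pred_1, pred_2)
consensus = consensus_set({"pred_1": pred_1, "pred_2": pred_2})
```

In this toy case the predictors disagree only on the second sample, so the consensus set keeps the other five samples; a real analysis would likely use an established implementation of kappa rather than this hand-written one.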

Results

In this standardized setup, subtype predictors of the same type (either SCM, SSP, or STG) but with a different gene list and/or consensus training set were associated with almost perfect levels of agreement (median κ>0.8). Interestingly, for a given predictor type a change in consensus set led to higher concordance than a change to another gene list. The more challenging scenario where the predictor type, gene list and training set were all different resulted in predictors with only substantial levels of concordance (median κ=0.74) on independent validation data.
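The qualitative labels used here ("moderate", "substantial", "almost perfect") match the widely used Landis and Koch bands for interpreting kappa. The abstract does not name that scale, so the mapping below is an assumption, and the function name is hypothetical; it is a sketch of how the reported medians translate into those labels.

```python
def agreement_level(kappa):
    """Map a kappa value to a Landis and Koch style agreement label.

    Assumed bands: <0 poor, 0-0.20 slight, 0.21-0.40 fair, 0.41-0.60 moderate,
    0.61-0.80 substantial, 0.81-1.00 almost perfect.
    """
    if kappa > 1.0:
        raise ValueError("kappa cannot exceed 1")
    if kappa < 0.0:
        return "poor"
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    # Return the label of the first band whose upper bound covers kappa.
    for upper, label in bands:
        if kappa <= upper:
            return label


# The median reported for predictors differing in type, gene list and training set.
cross_type_label = agreement_level(0.74)
```

Under these bands a median of 0.74 is labelled "substantial", while any value above 0.8 is labelled "almost perfect", which is consistent with the wording of the results.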

Conclusions

Our results demonstrate that, for a given subtype predictor type, stringent standardization of the preprocessing stage combined with carefully devised consensus training sets leads to predictors that show almost perfect levels of concordance. However, predictors of different types are only substantially concordant, despite reaching almost perfect levels of concordance on training data.

SUBMITTER: Sontrop HMJ

PROVIDER: S-EPMC4893290 | biostudies-literature | 2016 Jun

REPOSITORIES: biostudies-literature

Publications

Breast cancer subtype predictors revisited: from consensus to concordance?

Sontrop HMJ, Reinders MJT, Moerland PD

BMC Medical Genomics, 2016-06-03, issue 1

PMID: 27259591

Similar Datasets

Association of molecular subtype concordance and survival outcome in synchronous and metachronous bilateral breast cancer.
| S-EPMC8027898 | biostudies-literature

Concordance among gene expression-based predictors for ER-positive breast cancer treated with adjuvant tamoxifen.
| S-EPMC3477878 | biostudies-literature

Consensus molecular subtype transition during progression of colorectal cancer
2023-07-13 | GSE237249 | GEO

Early Gnathostome Phylogeny Revisited: Multiple Method Consensus.
| S-EPMC5029804 | biostudies-literature

A consensus hypoxia signature in breast cancer
2018-12-26 | GSE111653 | GEO

Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers.
| S-EPMC7428878 | biostudies-literature

Consensus molecular subtype transition during progression of colorectal cancer [NanoClassifier Gene Set]
2023-07-13 | GSE237247 | GEO

Evaluation of cross-platform and interlaboratory concordance via consensus modelling of genomic measurements
2018-09-01 | GSE113372 | GEO

Implications of Intratumor Heterogeneity on Consensus Molecular Subtype (CMS) in Colorectal Cancer.
| S-EPMC8507736 | biostudies-literature

Concordance of genomic alterations between primary and recurrent breast cancer.
| S-EPMC4348062 | biostudies-literature