
Dataset Information


Copy number variations distinguish lung adenocarcinomas from squamous cell carcinomas


ABSTRACT: For a number of clinical and biological reasons, accurate classification of non-small cell lung carcinoma (NSCLC) into adenocarcinoma (ADC) and squamous cell carcinoma (SCC) is essential. DNA-based tests, which are not currently used, are more robust when applied to formalin-fixed paraffin-embedded tissues. To develop a molecular classification of NSCLC based on genome-wide copy number variations (CNVs), the corresponding TCGA, SPORE and CANARY patient datasets were used as training and independent validation data. The signature genes were selected by advanced supervised classification algorithms and restricted to known important oncogenes/tumor suppressors, resulting in a final 27-gene signature that distinguished ADC from SCC with accuracies of 0.85-0.87 on the SPORE validation sets and 0.96-0.98 on the CANARY validation sets. Even with only the top 7 genes of this signature, the validation accuracies remained as high as 0.80 and 0.97, respectively. The signature genes also distinguished adenocarcinomas and squamous cell carcinomas from non-malignant lung samples with accuracies of 91-97%.
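The train-then-validate workflow described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not name the supervised algorithms used, so a simple nearest-centroid classifier is swapped in here, and the copy-number values are synthetic, generated only for illustration. All function names, the 7-gene feature count (borrowed from the "top 7 genes" remark) and the `shift`/`noise` parameters are assumptions.

```python
import random

def centroid(rows):
    """Per-gene mean copy-number value across a list of samples."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def train(samples, labels):
    """Return one centroid per class label (nearest-centroid model)."""
    by_class = {}
    for row, lab in zip(samples, labels):
        by_class.setdefault(lab, []).append(row)
    return {lab: centroid(rows) for lab, rows in by_class.items()}

def predict(model, row):
    """Assign the class whose centroid is closest in squared Euclidean distance."""
    def dist(c):
        return sum((a - b) ** 2 for a, b in zip(row, c))
    return min(model, key=lambda lab: dist(model[lab]))

def accuracy(model, samples, labels):
    """Fraction of samples whose predicted label matches the true label."""
    hits = sum(predict(model, r) == l for r, l in zip(samples, labels))
    return hits / len(labels)

def synthetic_cohort(rng, n_per_class, n_genes=7, shift=0.8, noise=0.3):
    """Synthetic log2 copy-number ratios; NOT real TCGA/SPORE/CANARY data."""
    samples, labels = [], []
    for lab, sign in (("ADC", -1.0), ("SCC", 1.0)):
        for _ in range(n_per_class):
            samples.append([rng.gauss(sign * shift / 2, noise) for _ in range(n_genes)])
            labels.append(lab)
    return samples, labels

rng = random.Random(0)                           # fixed seed for reproducibility
train_x, train_y = synthetic_cohort(rng, 50)     # stands in for a training cohort
valid_x, valid_y = synthetic_cohort(rng, 30)     # stands in for an independent validation cohort
model = train(train_x, train_y)
valid_acc = accuracy(model, valid_x, valid_y)
print(f"validation accuracy: {valid_acc:.2f}")
```

The point of the sketch is only the separation of roles: the model is fitted on one cohort and its accuracy is reported on a cohort it never saw, which is how the validation accuracies quoted above should be read.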

ORGANISM(S): Homo sapiens

PROVIDER: GSE74948 | GEO | 2015/11/13

SECONDARY ACCESSION(S): PRJNA301991

REPOSITORIES: GEO

Similar Datasets

2016-06-01 | E-GEOD-75037 | biostudies-arrayexpress
2016-06-01 | GSE75037 | GEO
2014-01-28 | PXD000438 | Pride
2010-07-09 | E-GEOD-13937 | biostudies-arrayexpress
2016-07-02 | MSV000079881 | MassIVE
2018-10-11 | GSE121090 | GEO
2015-08-20 | GSE72194 | GEO
2015-08-20 | GSE72192 | GEO
2010-07-09 | GSE13937 | GEO
2017-02-02 | GSE94365 | GEO