Genomics

Dataset Information

DNA methylation signature reliably distinguishes pulmonary enteric adenocarcinoma from metastatic colorectal cancer


ABSTRACT: Pulmonary enteric adenocarcinoma (PEAD) is a rare non-small cell lung cancer subtype. It is poorly characterized and cannot be distinguished from metastatic colorectal or upper gastrointestinal adenocarcinomas (ADCs) by routine pathological methods. As DNA methylation patterns are known to be highly tissue specific, we aimed to develop a methylation-based algorithm to differentiate these entities. To this end, genome-wide methylation profiles of 600 primary pulmonary, colorectal and upper gastrointestinal ADCs obtained from The Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus (GEO) database were used as a reference cohort to train a machine learning algorithm. The resulting classifier correctly classified all samples from a validation cohort consisting of 680 primary pulmonary, colorectal and upper gastrointestinal ADCs from TCGA and the GEO database, demonstrating the ability of the algorithm to reliably distinguish these three entities. We then analyzed DNA methylation data of 15 PEADs as well as four pulmonary metastases and four primary colorectal ADCs with the algorithm. All 15 PEADs were reliably classified as primary pulmonary tumors, and all four metastases as well as all four primary colorectal ADC samples were identified as primary colorectal ADCs. In a t-distributed stochastic neighbor embedding analysis, the PEAD samples did not form a separate methylation subclass but rather diffusely intermixed with other pulmonary ADCs. Additional characterization of the PEAD series using fluorescence in situ hybridization, next-generation sequencing and copy number analysis revealed KRAS mutations in nine of 15 samples (60%) and a high number of structural chromosomal changes. Except for an unusually high rate of chromosome 20 gain (66.7%), the molecular data were mostly reminiscent of standard pulmonary ADCs.
In conclusion, we provide sound evidence of the pulmonary origin of PEAD and, in addition, provide a publicly available machine-learning-based algorithm to reliably distinguish PEAD from metastatic colorectal and upper gastrointestinal adenocarcinomas.
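
The workflow described in the abstract (train on a reference cohort, check on a validation cohort, then classify new samples) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not name the machine learning method, so a simple nearest-centroid classifier is swapped in here, and the methylation beta values are simulated rather than taken from TCGA or GEO. The probe count, entity labels, noise level and all function names are assumptions made for illustration only.

```python
import random

# Hypothetical labels for the three entities named in the abstract.
ENTITIES = ("pulmonary", "colorectal", "upper_gi")
N_PROBES = 50  # assumed number of CpG probes; real arrays have far more


def make_profiles(rng, n_per_entity, signatures, noise=0.05):
    """Simulate beta values (clipped to 0..1) around each entity's signature."""
    samples = []
    for entity in ENTITIES:
        for _ in range(n_per_entity):
            betas = [min(1.0, max(0.0, rng.gauss(mu, noise)))
                     for mu in signatures[entity]]
            samples.append((betas, entity))
    return samples


def fit_centroids(samples):
    """Average the reference profiles per entity, probe by probe."""
    centroids = {}
    for entity in ENTITIES:
        rows = [betas for betas, label in samples if label == entity]
        centroids[entity] = [sum(col) / len(col) for col in zip(*rows)]
    return centroids


def classify(betas, centroids):
    """Return the entity whose centroid is closest in squared Euclidean distance."""
    def sq_dist(centroid):
        return sum((b - m) ** 2 for b, m in zip(betas, centroid))
    return min(centroids, key=lambda entity: sq_dist(centroids[entity]))


rng = random.Random(0)
# Simulated tissue-specific methylation signatures (an assumption, not real data).
signatures = {e: [rng.random() for _ in range(N_PROBES)] for e in ENTITIES}

reference = make_profiles(rng, 20, signatures)   # stands in for the reference cohort
validation = make_profiles(rng, 20, signatures)  # stands in for the validation cohort

centroids = fit_centroids(reference)
correct = sum(classify(betas, centroids) == label for betas, label in validation)
accuracy = correct / len(validation)
print(f"validation accuracy: {accuracy:.2f}")
```

A new sample, such as a PEAD profile, would be passed to `classify` in the same way as the validation profiles; the returned label corresponds to the predicted tissue of origin under this simplified model.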

ORGANISM(S): Homo sapiens

PROVIDER: GSE116699 | GEO | 2019/01/04

REPOSITORIES: GEO

Similar Datasets

| PRJNA479888 | ENA
2011-11-01 | E-GEOD-31210 | biostudies-arrayexpress
2011-11-01 | GSE31210 | GEO
| PRJEB23497 | ENA
2018-08-02 | MSV000082703 | GNPS
2018-08-02 | MSV000082698 | GNPS
2019-05-15 | GSE114989 | GEO
| EGAD00001005827 | EGA
2012-09-04 | E-GEOD-36459 | biostudies-arrayexpress
2012-09-04 | E-GEOD-36458 | biostudies-arrayexpress