Genomics

Dataset Information

0

Improved lung cancer classification by employing diverse molecular features of microRNAs


ABSTRACT: Lung adenocarcinoma (LUAD) is one of the most common pathological and histological subtypes of primary lung cancer, with high morbidity and mortality. MicroRNAs (miRNAs) are endogenous small non-coding RNAs that regulate the expression of genes at post-transcriptional level. It was reported that A-to-I miRNA editing was decreased in tumors, suggesting the potential value of miRNA editing in cancer classification. However, existing miRNA-based cancer classification models mainly used the frequencies of miRNAs. In order to validate the contribution of miRNA editing information in cancer classification, we extracted three types of miRNA features, including the abundances of original miRNAs, the abundances of edited miRNAs, and the editing levels of miRNA editing sites. Our results show that four classification algorithms selected, i.e., kNN, C4.5, RF and SVM, generally had better performances on all features than on the abundances of miRNAs alone. Since the number of features were large, we used three feature selection (FS) methods to further improve the classification models. One of the FS methods, the DFL algorithm, selected only three features, i.e., the frequencies of hsa-miR-135b-5p, hsa-miR-210-3p and hsa-miR-182 48u (an edited miRNA), from 316 training samples. And all of the four classification algorithms achieved 100% accuracy on these three features for 79 independent testing samples. These results indicate that the additional information of miRNA editing are useful in improving the classification of LUAD samples. And the three miRNAs selected by DFL potentially represent an effective molecular signature for LUAD diagnosis.

ORGANISM(S): Homo sapiens

PROVIDER: GSE244311 | GEO | 2024/02/22

REPOSITORIES: GEO

Similar Datasets

2014-10-02 | E-GEOD-61066 | biostudies-arrayexpress
2014-10-02 | E-GEOD-61067 | biostudies-arrayexpress
2017-10-17 | GSE105027 | GEO
| EGAD00001006844 | EGA
2023-10-05 | GSE242861 | GEO
2014-10-02 | GSE61067 | GEO
2014-10-02 | GSE61066 | GEO
2010-04-26 | E-GEOD-20384 | biostudies-arrayexpress
2013-06-14 | GSE47939 | GEO
2012-06-28 | GSE30656 | GEO