Dataset Information


Alternative Polyadenylation Modification Patterns Reveal Essential Posttranscription Regulatory Mechanisms of Tumorigenesis in Multiple Tumor Types.

ABSTRACT: Among various risk factors for the initiation and progression of cancer, alternative polyadenylation (APA) is a remarkable endogenous contributor that directly triggers the malignant phenotype of cancer cells. APA affects biological processes at a transcriptional level in various ways. As such, APA can be involved in tumorigenesis through gene expression, protein subcellular localization, or transcription splicing pattern. The APA sites and status of different cancer types may have diverse modification patterns and regulatory mechanisms on transcripts. Potential APA sites were screened by applying several machine learning algorithms on a TCGA-APA dataset. First, a powerful feature selection method, minimum redundancy maximum relevancy, was applied on the dataset, resulting in a feature list. Then, the feature list was fed into the incremental feature selection, which incorporated the support vector machine as the classification algorithm, to extract key APA features and build a classifier. The classifier can classify cancer patients into cancer types with perfect performance. The key APA-modified genes had a potential prognosis ability because of their significant power in the survival analysis of TCGA pan-cancer data.


PROVIDER: S-EPMC7315320 | BioStudies | 2020-01-01

REPOSITORIES: biostudies

Similar Datasets

1000-01-01 | S-EPMC6045855 | BioStudies
2019-01-01 | S-EPMC6533131 | BioStudies
2019-01-01 | S-EPMC6522811 | BioStudies
2018-01-01 | S-EPMC5975616 | BioStudies
2017-01-01 | S-EPMC5476451 | BioStudies
2014-01-01 | S-EPMC4467577 | BioStudies
2020-01-01 | S-EPMC6943033 | BioStudies
2019-01-01 | S-EPMC6871504 | BioStudies
2016-01-01 | S-EPMC4878608 | BioStudies
2018-01-01 | S-EPMC5802200 | BioStudies