Unknown

Dataset Information

0

Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions.


ABSTRACT: Methylation is one of the most common and considerable modifications in biological systems mediated by multiple enzymes. Recent studies have shown that methylation has been widely identified in different RNA molecules. RNA methylation modifications have various kinds, such as 5-methylcytosine (m5C). However, for individual methylation sites, their functions still remain to be elucidated. Testing of all methylation sites relies heavily on high-throughput sequencing technology, which is expensive and labor consuming. Thus, computational prediction approaches could serve as a substitute. In this study, multiple machine learning models were used to predict possible RNA m5C sites on the basis of mRNA sequences in human and mouse. Each site was represented by several features derived from k-mers of an RNA subsequence containing such site as center. The powerful max-relevance and min-redundancy (mRMR) feature selection method was employed to analyse these features. The outcome feature list was fed into incremental feature selection method, incorporating four classification algorithms, to build efficient models. Furthermore, the sites related to features used in the models were also investigated.

SUBMITTER: Chen L 

PROVIDER: S-EPMC8776474 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions.

Chen Lei L   Li ZhanDong Z   Zhang ShiQi S   Zhang Yu-Hang YH   Huang Tao T   Cai Yu-Dong YD  

BioMed research international 20220113


Methylation is one of the most common and considerable modifications in biological systems mediated by multiple enzymes. Recent studies have shown that methylation has been widely identified in different RNA molecules. RNA methylation modifications have various kinds, such as 5-methylcytosine (m<sup>5</sup>C). However, for individual methylation sites, their functions still remain to be elucidated. Testing of all methylation sites relies heavily on high-throughput sequencing technology, which is  ...[more]

Similar Datasets

| S-EPMC6251864 | biostudies-literature
| S-EPMC4290605 | biostudies-literature
| S-EPMC6651575 | biostudies-literature
| S-EPMC7488740 | biostudies-literature
| S-EPMC7285933 | biostudies-literature
| S-EPMC8599298 | biostudies-literature
| S-EPMC3322362 | biostudies-literature
| S-EPMC9022248 | biostudies-literature
| S-EPMC6099560 | biostudies-literature
| S-EPMC8034527 | biostudies-literature