Dataset Information

Staem5: A novel computational approach for accurate prediction of m5C site.


ABSTRACT: 5-Methylcytosine (m5C) is an important post-transcriptional modification that has been extensively found in multiple types of RNAs. Many studies have shown that m5C plays vital roles in many biological functions, such as RNA structure stability and metabolism. Computational approaches act as an efficient way to identify m5C sites from high-throughput RNA sequence data and help interpret the functional mechanism of this important modification. This study proposed a novel species-specific computational approach, Staem5, to accurately predict RNA m5C sites in Mus musculus and Arabidopsis thaliana. Staem5 was developed by employing feature fusion tactics to leverage informatic sequence profiles, and a stacking ensemble learning framework combined five popular machine learning algorithms. Extensive benchmarking tests demonstrated that Staem5 outperformed state-of-the-art approaches in both cross-validation and independent tests. We provide the source code of Staem5, which is publicly available at https://github.com/Cxd-626/Staem5.git.
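The "feature fusion" idea mentioned in the abstract can be illustrated with a minimal sketch. This record does not say which sequence encodings Staem5 actually uses, so the two encodings below (one-hot and k-mer frequency), the function names, and the example window are all assumptions chosen for illustration; the authors' repository is the reference for the real method. Fusion is shown here as plain concatenation of the two feature vectors.

```python
from itertools import product

ALPHABET = "ACGU"  # RNA nucleotides


def one_hot(seq):
    """Position-wise binary encoding: four values per nucleotide."""
    vec = []
    for base in seq:
        vec.extend(1.0 if base == letter else 0.0 for letter in ALPHABET)
    return vec


def kmer_frequencies(seq, k=2):
    """Normalised counts of every k-mer over ALPHABET, in lexicographic order."""
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = max(len(seq) - k + 1, 1)  # avoid division by zero on short input
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip k-mers containing unknown symbols
            counts[window] += 1
    return [counts[m] / total for m in kmers]


def fuse_features(seq, k=2):
    """Concatenate the two (assumed) encodings into one feature vector."""
    seq = seq.upper().replace("T", "U")  # accept DNA-style input as well
    return one_hot(seq) + kmer_frequencies(seq, k)


# Hypothetical 11-nt window centred on a candidate cytosine (index 5).
window = "AGGUACGUUCA"
features = fuse_features(window)
print(len(features))  # 11 * 4 one-hot values + 16 dinucleotide frequencies = 60
```

A fused vector of this kind would then be passed to the base learners of a stacking ensemble; which learners and which meta-learner Staem5 uses is not stated in this record.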

SUBMITTER: Chai D 

PROVIDER: S-EPMC8571400 | biostudies-literature | 2021 Dec

REPOSITORIES: biostudies-literature

Publications

Staem5: A novel computational approach for accurate prediction of m5C site.

Chai Di, Jia Cangzhi, Zheng Jia, Zou Quan, Li Fuyi

Molecular Therapy. Nucleic Acids, 2021-10-20


Similar Datasets

| S-EPMC5660750 | biostudies-literature
| S-EPMC5479036 | biostudies-literature
| S-EPMC11340604 | biostudies-literature
| S-EPMC7049599 | biostudies-literature
| S-EPMC10604046 | biostudies-literature
| S-EPMC3675099 | biostudies-literature
| S-EPMC2241647 | biostudies-literature
| S-EPMC8636726 | biostudies-literature
| S-EPMC9898109 | biostudies-literature
2021-12-15 | GSE128318 | GEO