Transcriptomics

Dataset Information

0

Convolutional neural network modelling: advancing identification of true mRNA cleavage sites


ABSTRACT: Degradome sequencing is commonly used to generate high-throughput information on mRNA cleavages by small RNAs. Here we developed an extension module based on a deep learning convolutional neural network (CNN) in a machine learning environment to discriminate false from true cleavage sites applied on datasets from potato (Solanum tuberosum, St) and the oomycete pathogen Phytophthora infestans (Pi). The core of the CNN module is a stochastic gradient descent optimizer with cyclical learning rate (CLR) which together with Bayesian optimization scored a validation accuracy of 100%. To verify the recognition of cleavage sites we applied the module on Arabidopsis thaliana microRNA cleavages. Our module managed to recognize all cleavages, confirming the reliability of the module. When applying this new model to evaluate our data, 7.3% of all cleavage windows represented true cleavages distributed as 214 sites in P. infestans and 444 sites in potato. The sRNA landscape of the potato-P. infestans interaction is complex with uneven sRNA production and cleavage regions. In total, 222 endogenous Pi-sRNAs, 565 endogenous St-sRNAs, 91 trans-acting Pi-sRNAs and 14 trans-acting St-sRNAs were discovered from our datasets. Groups of self-regulatory sRNAs, and sRNAs generated from effector sequences like RXLR and Crinklers suggest dual effector functions. In the potato genome, resistance genes were most targeted mainly as self-regulation but also as results of a trans-action event. Our new analytic model is freely accessible for anyone working on complex biological systems.

ORGANISM(S): Phytophthora infestans Solanum tuberosum

PROVIDER: GSE163382 | GEO | 2021/05/05

REPOSITORIES: GEO

Similar Datasets

2021-06-30 | GSE119230 | GEO
2014-12-12 | E-GEOD-63292 | biostudies-arrayexpress
2014-12-12 | E-GEOD-62674 | biostudies-arrayexpress
2014-12-12 | GSE63292 | GEO
2014-12-12 | GSE62674 | GEO
2020-12-01 | GSE159015 | GEO
2009-06-13 | GSE11781 | GEO
2014-06-26 | E-MTAB-1712 | biostudies-arrayexpress
2009-06-13 | E-GEOD-11781 | biostudies-arrayexpress
2012-09-01 | E-GEOD-33240 | biostudies-arrayexpress