Dataset Information


Prediction of Recombination Spots Using Novel Hybrid Feature Extraction Method via Deep Learning Approach.

ABSTRACT: Meiotic recombination is the driving force of evolutionary development and an important source of genetic variation. The meiotic recombination does not take place randomly in a chromosome but occurs in some regions of the chromosome. A region in chromosomes with higher rate of meiotic recombination events are considered as hotspots and a region where frequencies of the recombination events are lower are called coldspots. Prediction of meiotic recombination spots provides useful information about the basic functionality of inheritance and genome diversity. This study proposes an intelligent computational predictor called iRSpots-DNN for the identification of recombination spots. The proposed predictor is based on a novel feature extraction method and an optimized deep neural network (DNN). The DNN was employed as a classification engine whereas, the novel features extraction method was developed to extract meaningful features for the identification of hotspots and coldspots across the yeast genome. Unlike previous algorithms, the proposed feature extraction avoids bias among different selected features and preserved the sequence discriminant properties along with the sequence-structure information simultaneously. This study also considered other effective classifiers named support vector machine (SVM), K-nearest neighbor (KNN), and random forest (RF) to predict recombination spots. Experimental results on a benchmark dataset with 10-fold cross-validation showed that iRSpots-DNN achieved the highest accuracy, i.e., 95.81%. Additionally, the performance of the proposed iRSpots-DNN is significantly better than the existing predictors on a benchmark dataset. The relevant benchmark dataset and source code are freely available at: https://github.com/Fatima-Khan12/iRspot_DNN/tree/master/iRspot_DNN.


PROVIDER: S-EPMC7527634 | BioStudies | 2020-01-01

REPOSITORIES: biostudies

Similar Datasets

2013-01-01 | S-EPMC3616736 | BioStudies
1000-01-01 | S-EPMC1463011 | BioStudies
2016-01-01 | S-EPMC4945094 | BioStudies
2017-01-01 | S-EPMC5499175 | BioStudies
2007-01-01 | S-EPMC1805699 | BioStudies
2019-01-01 | S-EPMC6754380 | BioStudies
2014-01-01 | S-EPMC3958819 | BioStudies
2007-02-01 | GSE6245 | GEO
2018-01-01 | S-EPMC5880243 | BioStudies
2016-01-01 | S-EPMC4747544 | BioStudies