Unknown

Dataset Information

0

Fast splice site detection using information content and feature reduction.


ABSTRACT:

Background

Accurate identification of splice sites in DNA sequences plays a key role in the prediction of gene structure in eukaryotes. Already many computational methods have been proposed for the detection of splice sites and some of them showed high prediction accuracy. However, most of these methods are limited in terms of their long computation time when applied to whole genome sequence data.

Results

In this paper we propose a hybrid algorithm which combines several effective and informative input features with the state of the art support vector machine (SVM). To obtain the input features we employ information content method based on Shannon's information theory, Shapiro's score scheme, and Markovian probabilities. We also use a feature elimination scheme to reduce the less informative features from the input data.

Conclusion

In this study we propose a new feature based splice site detection method that shows improved acceptor and donor splice site detection in DNA sequences when the performance is compared with various state of the art and well known methods.

SUBMITTER: Baten AK 

PROVIDER: S-EPMC2638148 | biostudies-literature | 2008 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Fast splice site detection using information content and feature reduction.

Baten A K M A AK   Halgamuge S K SK   Chang B C H BC  

BMC bioinformatics 20081212


<h4>Background</h4>Accurate identification of splice sites in DNA sequences plays a key role in the prediction of gene structure in eukaryotes. Already many computational methods have been proposed for the detection of splice sites and some of them showed high prediction accuracy. However, most of these methods are limited in terms of their long computation time when applied to whole genome sequence data.<h4>Results</h4>In this paper we propose a hybrid algorithm which combines several effective  ...[more]

Similar Datasets

| S-EPMC4452346 | biostudies-literature
| S-EPMC134237 | biostudies-literature
| S-EPMC3624888 | biostudies-literature
| S-EPMC5956828 | biostudies-literature
| S-EPMC9299264 | biostudies-literature
| S-EPMC3942325 | biostudies-literature
| S-EPMC5479904 | biostudies-literature
| S-EPMC1764471 | biostudies-literature
| S-EPMC4718645 | biostudies-literature
| S-EPMC8693380 | biostudies-literature