ABSTRACT:
SUBMITTER: Sarkar BK
PROVIDER: S-EPMC8249421 | biostudies-literature | 2021 Jul
REPOSITORIES: biostudies-literature
Sarkar BK, Sharma AR, Bhattacharya M, Sharma G, Lee SS, Chakraborty C
Scientific Reports, 2021-07-01, issue 1
We describe a novel algorithm for recovering information from DNA sequences by using a digital filter. This work proposes a three-part algorithm to determine the k-mer (q-gram) word density. A finite impulse response (FIR) digital filter is used to calculate the k-mer or q-gram word density of a sequence. Principal component analysis is then applied to the word density distribution to analyze the dissimilarity between sequences. A dissimilarity matrix is thus formed and shows the appearance of cluster fo ...
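
The pipeline outlined in the abstract (word density via an FIR filter, then principal component analysis, then a dissimilarity matrix) can be illustrated with a minimal sketch. It is not the authors' implementation. The moving-average filter coefficients, the window length, the use of normalized k-mer frequency profiles as PCA input, the Euclidean distance in principal component space, and all function names (`word_indicator`, `fir_word_density`, `kmer_profile`, `pca_dissimilarity`) are assumptions made for illustration only.

```python
from itertools import product

import numpy as np


def word_indicator(seq, word):
    """Binary signal: 1.0 where `word` starts in `seq`, else 0.0."""
    k = len(word)
    return np.array(
        [1.0 if seq[i:i + k] == word else 0.0 for i in range(len(seq) - k + 1)]
    )


def fir_word_density(seq, word, window):
    """Local density of `word`, obtained by FIR-filtering the indicator signal.

    Assumption: the filter is a plain moving average with `window` equal taps.
    """
    x = word_indicator(seq, word)
    h = np.ones(window) / window  # FIR coefficients (hypothetical choice)
    return np.convolve(x, h, mode="valid")


def kmer_profile(seq, k):
    """Normalized frequency of every k-mer over the alphabet ACGT."""
    words = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = np.array([word_indicator(seq, w).sum() for w in words])
    return counts / counts.sum()


def pca_dissimilarity(profiles, n_components=2):
    """Project profiles onto leading principal components, return pairwise distances."""
    X = np.asarray(profiles, dtype=float)
    Xc = X - X.mean(axis=0)  # center each feature
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)  # rows of Vt are the PCs
    scores = Xc @ Vt[:n_components].T
    diff = scores[:, None, :] - scores[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))  # Euclidean distance (assumption)


if __name__ == "__main__":
    seqs = ["ACACACAC", "ACACACAT", "GGGGTTTT"]  # toy sequences, not real data
    print(fir_word_density(seqs[0], "AC", window=2))
    D = pca_dissimilarity([kmer_profile(s, 2) for s in seqs])
    print(np.round(D, 3))
```

In this toy run the first two sequences share most of their dimers, so their entry in the dissimilarity matrix is smaller than either one's distance to the third sequence. Grouping of this kind is what a clustering step would pick up.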