A Novel Method for Comparative Analysis of DNA Sequences by Ramanujan-Fourier Transform

03/06/2014
by   Changchuan Yin, et al.
0

Alignment-free sequence analysis approaches provide important alternatives over multiple sequence alignment (MSA) in biological sequence analysis because alignment-free approaches have low computation complexity and are not dependent on high level of sequence identity, however, most of the existing alignment-free methods do not employ true full information content of sequences and thus can not accurately reveal similarities and differences among DNA sequences. We present a novel alignment-free computational method for sequence analysis based on Ramanujan-Fourier transform (RFT), in which complete information of DNA sequences is retained. We represent DNA sequences as four binary indicator sequences and apply RFT on the indicator sequences to convert them into frequency domain. The Euclidean distance of the complete RFT coefficients of DNA sequences are used as similarity measure. To address the different lengths in Euclidean space of RFT coefficients, we pad zeros to short DNA binary sequences so that the binary sequences equal the longest length in the comparison sequence data. Thus, the DNA sequences are compared in the same dimensional frequency space without information loss. We demonstrate the usefulness of the proposed method by presenting experimental results on hierarchical clustering of genes and genomes. The proposed method opens a new channel to biological sequence analysis, classification, and structural module identification.

READ FULL TEXT
research
11/24/2022

Estimation of Similarity between DNA Sequences and Its Graphical Representation

Bioinformatics, which is now a well known field of study, originated in ...
research
07/14/2013

Map of Life: Measuring and Visualizing Species' Relatedness with "Molecular Distance Maps"

We propose a novel combination of methods that (i) portrays quantitative...
research
09/21/2020

A high-performance MEMRISTOR-based Smith-Waterman DNA sequence alignment Using FPNI structure

This paper aims to present a new re-configuration sequencing method for ...
research
09/04/2023

Blind Biological Sequence Denoising with Self-Supervised Set Learning

Biological sequence analysis relies on the ability to denoise the imprec...
research
07/10/2023

A Linear Time Quantum Algorithm for Pairwise Sequence Alignment

Sequence Alignment is the process of aligning biological sequences in or...
research
10/11/2019

Statistical Linear Models in Virus Genomic Alignment-free Classification: Application to Hepatitis C Viruses

Viral sequence classification is an important task in pathogen detection...

Please sign up or login with your details

Forgot password? Click here to reset