On Conflict Free DNA Codes

02/12/2019
by   Krishna Gopal Benerjee, et al.
0

DNA storage has emerged as an important area of research. The reliability of DNA storage system depends on designing the DNA strings (called DNA codes) that are sufficiently dissimilar. In this work, we introduce DNA codes that satisfy a special constraint. Each codeword of the DNA code has a specific property that any two consecutive sub-strings of the DNA codeword will not be the same (a generalization of homo-polymers constraint). This is in addition to the usual constraints such as Hamming, reverse, reverse-complement and GC-content. We believe that the new constraint will help further in reducing the errors during reading and writing data into the synthetic DNA strings. We also present a construction (based on a variant of stochastic local search algorithm) to calculate the size of the DNA codes with all the above constraints, which improves the lower bounds from the existing literature, for some specific cases. Moreover, a recursive isometric map between binary vectors and DNA strings is proposed. Using the map and the well known binary codes we obtain few classes of DNA codes with all the constraints including the property that the constructed DNA codewords are free from the hairpin-like secondary structures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/04/2021

Reversible G^k-Codes with Applications to DNA Codes

In this paper, we give a matrix construction method for designing DNA co...
research
02/15/2023

Indel Error Correction Codes for DNA Digital Data Storage and Retrieval

A procedure for storage and retrieval of Digital information in DNA stri...
research
02/13/2021

DNA codes over two noncommutative rings of order four

DNA codes based on error-correcting codes have been successful in DNA-ba...
research
10/18/2021

DNA Codes over the Ring ℤ_4 + wℤ_4

In this present work, we generalize the study of construction of DNA cod...
research
11/30/2020

Batch Optimization for DNA Synthesis

Large pools of synthetic DNA molecules have been recently used to reliab...
research
11/26/2018

An optimized Parallel Failure-less Aho-Corasick algorithm for DNA sequence matching

The Aho-Corasick algorithm is multiple patterns searching algorithm runn...
research
12/14/2018

Properties and constructions of constrained codes for DNA-based data storage

We describe properties and constructions of constraint-based codes for D...

Please sign up or login with your details

Forgot password? Click here to reset