Small Longest Tandem Scattered Subsequences

06/24/2020
by   Luís M. S. Russo, et al.
0

We consider the problem of identifying tandem scattered subsequences within a string. Our algorithm identifies a longest subsequence which occurs twice without overlap in a string. This algorithm is based on the Hunt-Szymanski algorithm, therefore its performance improves if the string is not self similar. This occurs naturally on strings over large alphabets. Our algorithm relies on new results for data structures that support dynamic longest increasing sub-sequences. In the process we also obtain improved algorithms for the decremental string comparison problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2023

An Algorithm for the Constrained Longest Common Subsequence and Substring Problem

Let Σ be an alphabet. For two strings X, Y, and a constrained string P o...
research
04/09/2020

Pattern Discovery in Colored Strings

We consider the problem of identifying patterns of interest in colored s...
research
03/21/2019

Scalable Similarity Joins of Tokenized Strings

This work tackles the problem of fuzzy joining of strings that naturally...
research
03/17/2020

An Efficient Implementation of Manacher's Algorithm

Manacher's algorithm has been shown to be optimal to the longest palindr...
research
01/12/2018

Cosmic String Detection with Tree-Based Machine Learning

We explore the use of random forest and gradient boosting, two powerful ...
research
03/29/2019

Data structures to represent sets of k-long DNA sequences

The analysis of biological sequencing data has been one of the biggest a...
research
02/05/2021

Effect of forename string on author name disambiguation

In author name disambiguation, author forenames are used to decide which...

Please sign up or login with your details

Forgot password? Click here to reset