Space efficient merging of de Bruijn graphs and Wheeler graphs

09/05/2020
by   Lavinia Egidi, et al.
0

The merging of succinct data structures is a well established technique for the space efficient construction of large succinct indexes. In the first part of the paper we propose a new algorithm for merging succinct representations of de Bruijn graphs. Our algorithm has the same asymptotic cost of the state of the art algorithm for the same problem but it uses less than half of its working space. A novel important feature of our algorithm, not found in any of the existing tools, is that it can compute the Variable Order succinct representation of the union graph within the same asymptotic time/space bounds. In the second part of the paper we consider the more general problem of merging succinct representations of Wheeler graphs, a recently introduced graph family which includes as special cases de Bruijn graphs and many other known succinct indexes based on the BWT or one of its variants. We show that Wheeler graphs merging is in general a much more difficult problem, and we provide a space efficient algorithm for the slightly simplified problem of determining whether the union graph has an ordering that satisfies the Wheeler conditions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/07/2019

Space-efficient merging of succinct de Bruijn graphs

We propose a new algorithm for merging succinct representations of de Br...
research
03/04/2019

Lightweight merging of compressed indices based on BWT variants

In this paper we propose a flexible and lightweight technique for mergin...
research
12/01/2015

Dynamic Parallel and Distributed Graph Cuts

Graph-cuts are widely used in computer vision. In order to speed up the ...
research
07/03/2023

Dynamical Graph Echo State Networks with Snapshot Merging for Dissemination Process Classification

The Dissemination Process Classification (DPC) is a popular application ...
research
02/01/2017

A Hybrid Evolutionary Algorithm Based on Solution Merging for the Longest Arc-Preserving Common Subsequence Problem

The longest arc-preserving common subsequence problem is an NP-hard comb...
research
12/07/2018

CSS Minification via Constraint Solving (Technical Report)

Minification is a widely-accepted technique which aims at reducing the s...
research
06/10/2023

On Improving the Cohesiveness of Graphs by Merging Nodes: Formulation, Analysis, and Algorithms

Graphs are a powerful mathematical model, and they are used to represent...

Please sign up or login with your details

Forgot password? Click here to reset