Re-Pair In-Place

08/14/2019
by   Dominik Köppl, et al.
0

Re-Pair is a grammar compression scheme with favorably good compression rates. The computation of Re-Pair comes with the cost of maintaining large frequency tables, which makes it hard to compute Re-Pair on large scale data sets. As a solution for this problem we present, given a text of length n whose characters are drawn from an integer alphabet, an O(n^2) time algorithm computing Re-Pair in n τ bits of space including the text space, where τ is the maximum of n and the number of terminals and non-terminals. The algorithm works in the restore model, supporting the recovery of the original input in O(n^2) time with O( n) additional bits of working space. We give variants of our solution working in parallel or in the external memory model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/12/2022

Efficient Construction of the BWT for Repetitive Text Using String Compression

We present a new semi-external algorithm that builds the Burrows-Wheeler...
research
11/05/2018

RePair in Compressed Space and Time

Given a string T of length N, the goal of grammar compression is to cons...
research
11/08/2019

Space Efficient Construction of Lyndon Arrays in Linear Time

We present the first linear time algorithm to construct the 2n-bit versi...
research
01/11/2021

Strictly In-Place Algorithms for Permuting and Inverting Permutations

We revisit the problem of permuting an array of length n according to a ...
research
07/07/2019

Bidirectional Text Compression in External Memory

Bidirectional compression algorithms work by substituting repeated subst...
research
02/22/2021

Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding

Latent variable models have been successfully applied in lossless compre...
research
01/30/2019

A study for Image compression using Re-Pair algorithm

The compression is an important topic in computer science which allows w...

Please sign up or login with your details

Forgot password? Click here to reset