Encoding of probability distributions for Asymmetric Numeral Systems

06/11/2021
by   Jarek Duda, et al.

Many data compressors regularly encode probability distributions for entropy coding, which calls for minimal description length (MDL) type optimizations. Canonical prefix (Huffman) coding usually just writes the lengths of the bit sequences, thereby approximating probabilities with powers of 2. Operating on more accurate probabilities usually allows for better compression ratios, and is possible using, e.g., arithmetic coding or the Asymmetric Numeral Systems (ANS) family. In particular, the multiplication-free tabled variant of the latter (tANS) builds an automaton that often replaces Huffman coding thanks to better compression at similar computational cost, e.g. in the popular Facebook Zstandard and Apple LZFSE compressors. This paper discusses the encoding of probability distributions for such applications, especially using a Pyramid Vector Quantizer (PVQ)-based approach with deformation, as well as a tuned symbol spread for tANS.
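To make the powers-of-2 remark concrete, the following is a minimal sketch that compares the penalty (Kullback-Leibler divergence, in bits per symbol) of the probabilities implied by Huffman code lengths with that of a finer quantization to fractions with denominator L, as a tANS table of size L would use. The function names, the example distribution, and L = 32 are all assumptions made for illustration, and the quantizer is a naive rounding scheme, not the PVQ-based method the abstract refers to.

```python
import heapq
import math


def huffman_lengths(probs):
    """Return Huffman code lengths (in bits) for the given probabilities."""
    # Heap entries: (probability, unique tie-breaker, symbols in this subtree).
    heap = [(p, i, [i]) for i, p in enumerate(probs)]
    heapq.heapify(heap)
    lengths = [0] * len(probs)
    counter = len(probs)
    while len(heap) > 1:
        p1, _, s1 = heapq.heappop(heap)
        p2, _, s2 = heapq.heappop(heap)
        # Merging two subtrees adds one bit to every symbol inside them.
        for s in s1 + s2:
            lengths[s] += 1
        heapq.heappush(heap, (p1 + p2, counter, s1 + s2))
        counter += 1
    return lengths


def kl_bits(p, q):
    """Kullback-Leibler divergence D(p || q) in bits per symbol."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def naive_quantize(probs, total):
    """Round probabilities to positive integer counts summing to `total`.

    Illustrative only: a simple rounding-and-repair scheme, assuming
    total >= len(probs). It is not the PVQ-based approach of the paper.
    """
    counts = [max(1, round(p * total)) for p in probs]
    while sum(counts) != total:
        step = 1 if sum(counts) < total else -1
        # Adjust the symbol whose new count stays closest to its ideal value.
        candidates = [i for i, c in enumerate(counts) if c + step >= 1]
        best = min(candidates,
                   key=lambda i: abs(counts[i] + step - probs[i] * total))
        counts[best] += step
    return counts


# Hypothetical example distribution and table size.
probs = [0.6, 0.25, 0.1, 0.05]
L = 32

lengths = huffman_lengths(probs)
huffman_q = [2.0 ** -n for n in lengths]   # probabilities implied by lengths
counts = naive_quantize(probs, L)
table_q = [c / L for c in counts]          # probabilities with denominator L

print("Huffman lengths:", lengths, "penalty:", kl_bits(probs, huffman_q))
print("Counts for L=32:", counts, "penalty:", kl_bits(probs, table_q))
```

For this particular distribution the finer quantization gives a smaller penalty than the power-of-2 approximation, at the price of a header that has to describe the counts rather than only the code lengths; balancing those two costs is the kind of trade-off the abstract alludes to.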


