Categorical SDEs with Simplex Diffusion

10/26/2022
by   Pierre H. Richemond, et al.
0

Diffusion models typically operate in the standard framework of generative modelling by producing continuously-valued datapoints. To this end, they rely on a progressive Gaussian smoothing of the original data distribution, which admits an SDE interpretation involving increments of a standard Brownian motion. However, some applications such as text generation or reinforcement learning might naturally be better served by diffusing categorical-valued data, i.e., lifting the diffusion to a space of probability distributions. To this end, this short theoretical note proposes Simplex Diffusion, a means to directly diffuse datapoints located on an n-dimensional probability simplex. We show how this relates to the Dirichlet distribution on the simplex and how the analogous SDE is realized thanks to a multi-dimensional Cox-Ingersoll-Ross process (abbreviated as CIR), previously used in economics and mathematical finance. Finally, we make remarks as to the numerical implementation of trajectories of the CIR process, and discuss some limitations of our approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/05/2023

Diffusion on the Probability Simplex

Diffusion models learn to reverse the progressive noising of a data dist...
research
08/11/2023

Mirror Diffusion Models

Diffusion models have successfully been applied to generative tasks in v...
research
02/20/2020

The continuous categorical: a novel simplex-valued exponential family

Simplex-valued data appear throughout statistics and machine learning, f...
research
09/02/2022

First Hitting Diffusion Models

We propose a family of First Hitting Diffusion Models (FHDM), deep gener...
research
03/31/2022

Equivariant Diffusion for Molecule Generation in 3D

This work introduces a diffusion model for molecule generation in 3D tha...
research
11/15/2022

Donsker Theorems for Occupation Measures of Multi-Dimensional Periodic Diffusions

We study the empirical process arising from a multi-dimensional diffusio...
research
04/28/2022

On the Normalizing Constant of the Continuous Categorical Distribution

Probability distributions supported on the simplex enjoy a wide range of...

Please sign up or login with your details

Forgot password? Click here to reset