How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data?

07/26/2023
by Huazheng Wang, et al.

Transformer-based pretrained language models (PLMs) have achieved great success in modern NLP. An important advantage of PLMs is their good out-of-distribution (OOD) robustness. Recently, diffusion models have attracted considerable attention, and much work has applied diffusion to PLMs. However, how diffusion influences PLMs on OOD data remains under-explored. The core of diffusion models is a forward diffusion process, which gradually applies Gaussian noise to inputs, and a reverse denoising process, which removes the noise. Reconstructing noised inputs is a fundamental ability of diffusion models. We directly analyze OOD robustness by measuring the reconstruction loss, testing both the ability to reconstruct OOD data and the ability to detect OOD samples. Experiments analyze different training parameters and statistical features of the data on eight datasets. The results show that finetuning PLMs with diffusion degrades the reconstruction ability on OOD data. The comparison also shows that diffusion models can effectively detect OOD samples, achieving state-of-the-art performance on most of the datasets with an absolute accuracy improvement up to 18
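
The forward noising step and the reconstruction-loss test described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the linear beta schedule, the mean-squared-error loss, the threshold value, and above all `toy_denoiser`, a hypothetical stand-in for the finetuned PLM that simply returns the posterior mean under a zero-mean Gaussian prior.

```python
import numpy as np

def linear_alpha_bar(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative signal fraction alpha_bar_t for an assumed linear beta schedule."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def forward_noise(x0, t, alpha_bar, rng):
    """Forward process: x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps."""
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps

def toy_denoiser(xt, t, alpha_bar, prior_std=0.1):
    """Hypothetical stand-in for the learned reverse process.

    Returns E[x0 | x_t] under the assumption x0 ~ N(0, prior_std^2 I).
    """
    a = alpha_bar[t]
    gain = np.sqrt(a) * prior_std**2 / (a * prior_std**2 + (1.0 - a))
    return gain * xt

def reconstruction_loss(x0, x0_hat):
    """Per-sample mean squared error between the input and its reconstruction."""
    return np.mean((x0 - x0_hat) ** 2, axis=-1)

def ood_flags(x0, t, alpha_bar, rng, threshold=1.0):
    """Noise the input, reconstruct it, and flag samples whose loss exceeds the threshold."""
    xt = forward_noise(x0, t, alpha_bar, rng)
    x0_hat = toy_denoiser(xt, t, alpha_bar)
    return reconstruction_loss(x0, x0_hat) > threshold

rng = np.random.default_rng(0)
alpha_bar = linear_alpha_bar(100)
in_dist = 0.1 * rng.standard_normal((8, 16))          # matches the assumed prior
out_dist = 3.0 + 0.1 * rng.standard_normal((8, 16))   # shifted far from the prior
print(ood_flags(in_dist, 50, alpha_bar, rng))
print(ood_flags(out_dist, 50, alpha_bar, rng))
```

In this toy setup, samples far from the assumed prior are reconstructed poorly and receive a much larger loss, which is the signal a reconstruction-based OOD detector thresholds on. A real experiment would replace `toy_denoiser` with the trained model and choose the threshold on held-out data.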


research
06/14/2021

Non Gaussian Denoising Diffusion Models

Generative diffusion processes are an emerging and effective tool for im...
research
06/06/2023

Protecting the Intellectual Property of Diffusion Models by the Watermark Diffusion Process

Diffusion models have emerged as state-of-the-art deep generative archit...
research
05/18/2023

Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces

Typical generative diffusion models rely on a Gaussian diffusion process...
research
09/29/2022

Analyzing Diffusion as Serial Reproduction

Diffusion models are a class of generative models that learn to synthesi...
research
11/14/2022

Denoising Diffusion Models for Out-of-Distribution Detection

Out-of-distribution detection is crucial to the safe deployment of machi...
research
11/01/2022

DensePure: Understanding Diffusion Models towards Adversarial Robustness

Diffusion models have been recently employed to improve certified robust...
research
06/02/2023

PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models

This paper presents PolyDiffuse, a novel structured reconstruction algor...
