Chat as Expected: Learning to Manipulate Black-box Neural Dialogue Models

05/27/2020
by Haochen Liu, et al.

Neural network based dialogue systems have recently become ubiquitous in our increasingly digitalized society. However, due to their inherent opaqueness, recently raised concerns about using neural models are starting to be taken seriously. In fact, intentional or unintentional behaviors could lead a dialogue system to generate inappropriate responses. In this paper, we therefore investigate whether we can learn to craft input sentences that manipulate a black-box neural dialogue model into producing outputs that contain target words or match target sentences. We propose a reinforcement learning based model that generates such inputs automatically. Extensive experiments on a popular, well-trained, state-of-the-art neural dialogue model show that our method can successfully find inputs that lead to the target outputs in a considerable portion of cases. Our work thus reveals the potential of neural dialogue models to be manipulated, which motivates and opens the door to developing strategies to defend them.

research
09/13/2019

Say What I Want: Towards the Dark Side of Neural Dialogue Models

Neural dialogue models have been widely adopted in various chatbot appli...
research
09/11/2018

Detecting egregious responses in neural sequence-to-sequence models

In this work, we attempt to answer a critical question: whether there ex...
research
03/02/2020

Learning from Easy to Complex: Adaptive Multi-curricula Learning for Neural Dialogue Generation

Current state-of-the-art neural dialogue systems are mainly data-driven ...
research
04/03/2022

DST: Dynamic Substitute Training for Data-free Black-box Attack

With the wide applications of deep neural network models in various comp...
research
08/13/2020

Dialogue State Induction Using Neural Latent Variable Models

Dialogue state modules are a useful component in a task-oriented dialogu...
research
11/15/2018

Generating Responses Expressing Emotion in an Open-domain Dialogue System

Neural network-based Open-ended conversational agents automatically gene...
research
06/02/2021

DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues

To successfully negotiate a deal, it is not enough to communicate fluent...