MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation

05/22/2023
by   Jia-Chen Gu, et al.
0

Modeling multi-party conversations (MPCs) with graph neural networks has been proven effective at capturing complicated and graphical information flows. However, existing methods rely heavily on the necessary addressee labels and can only be applied to an ideal setting where each utterance must be tagged with an addressee label. To study the scarcity of addressee labels which is a common issue in MPCs, we propose MADNet that maximizes addressee deduction expectation in heterogeneous graph neural networks for MPC generation. Given an MPC with a few addressee labels missing, existing methods fail to build a consecutively connected conversation graph, but only a few separate conversation fragments instead. To ensure message passing between these conversation fragments, four additional types of latent edges are designed to complete a fully-connected graph. Besides, to optimize the edge-type-dependent message passing for those utterances without addressee labels, an Expectation-Maximization-based method that iteratively generates silver addressee labels (E step), and optimizes the quality of generated responses (M step), is designed. Experimental results on two Ubuntu IRC channel benchmarks show that MADNet outperforms various baseline models on the task of MPC generation, especially under the more common and challenging setting where part of addressee labels are missing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/16/2022

HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations

Recently, various response generation models for two-party conversations...
research
06/03/2021

MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding

Recently, various neural models for multi-party conversation (MPC) have ...
research
05/16/2023

GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding

Addressing the issues of who saying what to whom in multi-party conversa...
research
05/21/2023

EM Pre-training for Multi-party Dialogue Response Generation

Dialogue response generation requires an agent to generate a response ac...
research
04/17/2019

Neural Message Passing for Multi-Label Classification

Multi-label classification (MLC) is the task of assigning a set of targe...
research
09/07/2021

Unsupervised Conversation Disentanglement through Co-Training

Conversation disentanglement aims to separate intermingled messages into...

Please sign up or login with your details

Forgot password? Click here to reset