DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations

07/13/2023
by   Bo-Ru Lu, et al.
0

Applications that could benefit from automatic understanding of human-human conversations often come with challenges associated with private information in real-world data such as call center or clinical conversations. Working with protected data also increases costs of annotation, which limits technology development. To address these challenges, we propose DIALGEN, a human-in-the-loop semi-automated dialogue generation framework. DIALGEN uses a language model (ChatGPT) that can follow schema and style specifications to produce fluent conversational text, generating a complex conversation through iteratively generating subdialogues and using human feedback to correct inconsistencies or redirect the flow. In experiments on structured summarization of agent-client information gathering calls, framed as dialogue state tracking, we show that DIALGEN data enables significant improvement in model performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/20/2019

Dialogue Design and Management for Multi-Session Casual Conversation with Older Adults

We address the problem of designing a conversational avatar capable of a...
research
05/24/2022

Unsupervised Learning of Hierarchical Conversation Structure

Human conversations can evolve in many different ways, creating challeng...
research
10/21/2019

On Automating Conversations

From 2016 to 2018, we developed and deployed Chorus, a system that blend...
research
03/08/2019

Fast Prototyping a Dialogue Comprehension System for Nurse-Patient Conversations on Symptom Monitoring

Data for human-human spoken dialogues for research and development are c...
research
05/03/2023

Clinical Note Generation from Doctor-Patient Conversations using Large Language Models: Insights from MEDIQA-Chat

This paper describes our submission to the MEDIQA-Chat 2023 shared task ...
research
04/03/2023

The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents

We introduce the StatCan Dialogue Dataset consisting of 19,379 conversat...
research
05/13/2020

Large Scale Multi-Actor Generative Dialog Modeling

Non-goal oriented dialog agents (i.e. chatbots) aim to produce varying a...

Please sign up or login with your details

Forgot password? Click here to reset