Interactively Learning to Summarise Timelines by Reinforcement Learning

11/14/2022
by   Yuxuan Ye, et al.
0

Timeline summarisation (TLS) aims to create a time-ordered summary list concisely describing a series of events with corresponding dates. This differs from general summarisation tasks because it requires the method to capture temporal information besides the main idea of the input documents. This paper proposes a TLS system which can interactively learn from the user's feedback via reinforcement learning and generate timelines satisfying the user's interests. We define a compound reward function that can update automatically according to the received feedback through interaction with the user. The system utilises the reward function to fine-tune an abstractive summarisation model via reinforcement learning to guarantee topical coherence, factual consistency and linguistic fluency of the generated summaries. The proposed system avoids the need of preference feedback from individual users. The experiments show that our system outperforms the baseline on the benchmark TLS dataset and can generate accurate and timeline precises that better satisfy real users.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2020

A Composable Specification Language for Reinforcement Learning Tasks

Reinforcement learning is a promising approach for learning control poli...
research
08/30/2023

Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification

A well-defined reward function is crucial for successful training of an ...
research
12/19/2022

Optimizing Prompts for Text-to-Image Generation

Well-designed prompts can guide text-to-image models to generate amazing...
research
06/07/2019

Preference-based Interactive Multi-Document Summarisation

Interactive NLP is a promising paradigm to close the gap between automat...
research
08/09/2021

Knowledge accumulating: The general pattern of learning

Artificial Intelligence has been developed for decades with the achievem...
research
11/26/2021

Learning Long-Term Reward Redistribution via Randomized Return Decomposition

Many practical applications of reinforcement learning require agents to ...
research
12/18/2018

Reinforcement Learning for Online Information Seeking

Information seeking techniques, satisfying users' information needs by s...

Please sign up or login with your details

Forgot password? Click here to reset