QTSumm: A New Benchmark for Query-Focused Table Summarization

05/23/2023
by   Yilun Zhao, et al.
16

People primarily consult tables to conduct data analysis or answer specific questions. Text generation systems that can provide accurate table summaries tailored to users' information needs can facilitate more efficient access to relevant data insights. However, existing table-to-text generation studies primarily focus on converting tabular data into coherent statements, rather than addressing information-seeking purposes. In this paper, we define a new query-focused table summarization task, where text generation models have to perform human-like reasoning and analysis over the given table to generate a tailored summary, and we introduce a new benchmark named QTSumm for this task. QTSumm consists of 5,625 human-annotated query-summary pairs over 2,437 tables on diverse topics. Moreover, we investigate state-of-the-art models (i.e., text generation, table-to-text generation, and large language models) on the QTSumm dataset. Experimental results and manual analysis reveal that our benchmark presents significant challenges in table-to-text generation for future research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/16/2021

Learning to Reason for Text Generation from Scientific Tables

In this paper, we introduce SciGen, a new challenge dataset for the task...
research
05/24/2023

Large Language Models are Effective Table-to-Text Generators, Evaluators, and Feedback Providers

Large language models (LLMs) have shown remarkable ability on controllab...
research
01/12/2020

Revisiting Challenges in Data-to-Text Generation with Fact Grounding

Data-to-text generation models face challenges in ensuring data fidelity...
research
05/23/2020

Summarizing and Exploring Tabular Data in Conversational Search

Tabular data provide answers to a significant portion of search queries....
research
05/24/2022

Medical Scientific Table-to-Text Generation with Human-in-the-Loop under the Data Sparsity Constraint

Structured (tabular) data in the preclinical and clinical domains contai...
research
09/04/2023

NumHG: A Dataset for Number-Focused Headline Generation

Headline generation, a key task in abstractive summarization, strives to...
research
05/23/2021

Controlling Text Edition by Changing Answers of Specific Questions

In this paper, we introduce the new task of controllable text edition, i...

Please sign up or login with your details

Forgot password? Click here to reset