Exploring Sentence Vector Spaces through Automatic Summarization

10/16/2018
by   Adly Templeton, et al.
0

Given vector representations for individual words, it is necessary to compute vector representations of sentences for many applications in a compositional manner, often using artificial neural networks. Relatively little work has explored the internal structure and properties of such sentence vectors. In this paper, we explore the properties of sentence vectors in the context of automatic summarization. In particular, we show that cosine similarity between sentence vectors and document vectors is strongly correlated with sentence importance and that vector semantics can identify and correct gaps between the sentences chosen so far and the document. In addition, we identify specific dimensions which are linked to effective summaries. To our knowledge, this is the first time specific dimensions of sentence embeddings have been connected to sentence properties. We also compare the features of different methods of sentence embeddings. Many of these insights have applications in uses of sentence embeddings far beyond summarization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/09/2020

Sentence Analogies: Exploring Linguistic Relationships and Regularities in Sentence Embeddings

While important properties of word vector representations have been stud...
research
02/23/2018

High-Dimensional Vector Semantics

In this paper we explore the "vector semantics" problem from the perspec...
research
10/11/2012

Artex is AnotheR TEXt summarizer

This paper describes Artex, another algorithm for Automatic Text Summari...
research
08/25/2019

A Method for Estimating the Proximity of Vector Representation Groups in Multidimensional Space. On the Example of the Paraphrase Task

The following paper presents a method of comparing two sets of vectors. ...
research
07/22/2020

Exploratory Search with Sentence Embeddings

Exploratory search aims to guide users through a corpus rather than pinp...
research
09/16/2020

Unsupervised Summarization by Jointly Extracting Sentences and Keywords

We present RepRank, an unsupervised graph-based ranking model for extrac...
research
09/17/2023

Do Large GPT Models Discover Moral Dimensions in Language Representations? A Topological Study Of Sentence Embeddings

As Large Language Models are deployed within Artificial Intelligence sys...

Please sign up or login with your details

Forgot password? Click here to reset