Using Apache Lucene to Search Vector of Locally Aggregated Descriptors

by Giuseppe Amato, et al.

Surrogate Text Representation (STR) is an effective approach to efficient similarity search in metric spaces using conventional text search engines, such as Apache Lucene. The technique compares permutations of a set of reference objects in place of the original metric distance. However, the Achilles heel of the STR approach is the need to reorder the result set of the search according to the original metric distance. This forces the use of a support database to store the original objects, which in turn requires efficient random I/O on fast secondary memory (such as flash-based storage). In this paper, we propose an extension of the Surrogate Text Representation that specifically addresses a class of visual metric objects known as Vector of Locally Aggregated Descriptors (VLAD). The approach represents each of the sub-vectors forming the VLAD vector with the STR, providing a finer representation of the vector and allowing us to eliminate the reordering phase. Experiments on a publicly available dataset show that the extended STR outperforms the baseline STR, achieving satisfactory performance close to that obtained with the original VLAD vectors.
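
The idea of a permutation-based surrogate text can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the term naming scheme (`p0`, `c0p1`, ...), the use of Euclidean distance, the repetition of a term to encode its rank, and the choice to share one set of reference objects across all sub-vectors are assumptions made for illustration only. Each object is described by its `k` nearest reference objects, with nearer ones repeated more often so that a text search engine's term-frequency scoring can approximate the comparison of permutations.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def surrogate_text(obj, pivots, k, prefix="p"):
    """Encode obj as text from the ordering of its k nearest pivots.

    The nearest pivot's term is repeated k times, the second nearest
    k - 1 times, and so on (an assumed way of encoding rank as term
    frequency).
    """
    order = sorted(range(len(pivots)), key=lambda i: euclidean(obj, pivots[i]))
    terms = []
    for rank, i in enumerate(order[:k]):
        terms.extend([f"{prefix}{i}"] * (k - rank))
    return " ".join(terms)


def vlad_surrogate_text(vlad, n_subvectors, pivots, k):
    """Apply surrogate_text to each sub-vector of a VLAD-like vector.

    Terms are prefixed with the sub-vector index so that different
    sub-vectors never share a term. Sharing one pivot set across all
    sub-vectors is a simplifying assumption.
    """
    d = len(vlad) // n_subvectors
    parts = []
    for j in range(n_subvectors):
        sub = vlad[j * d:(j + 1) * d]
        parts.append(surrogate_text(sub, pivots, k, prefix=f"c{j}p"))
    return " ".join(parts)


# Toy data: three 2-D reference objects and a 4-D vector split in two.
pivots = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
baseline = surrogate_text((0.9, 0.1), pivots, k=2)
extended = vlad_surrogate_text((0.9, 0.1, 0.1, 0.8), 2, pivots, k=2)
print(baseline)  # p1 p1 p0
print(extended)  # c0p1 c0p1 c0p0 c1p2 c1p2 c1p0
```

The resulting strings could be indexed as ordinary text fields by an engine such as Apache Lucene; the indexing and querying steps are outside the scope of this sketch.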




Vectors of Locally Aggregated Centers for Compact Video Representation

We propose a novel vector aggregation technique for compact video repres...

Vector of Locally-Aggregated Word Embeddings (VLAWE): A Novel Document-level Representation

In this paper, we propose a novel representation for text documents base...

Interferences in match kernels

We consider the design of an image representation that embeds and aggreg...

Image search using multilingual texts: a cross-modal learning approach between image and text

Multilingual (or cross-lingual) embeddings represent several languages i...

Memory vectors for similarity search in high-dimensional spaces

We study an indexing architecture to store and search in a database of h...

When VLAD met Hilbert

Vectors of Locally Aggregated Descriptors (VLAD) have emerged as powerfu...
