A Pooling Approach to Modelling Spatial Relations for Image Retrieval and Annotation

11/19/2014
by   Mateusz Malinowski, et al.
0

Over the last two decades we have witnessed strong progress on modeling visual object classes, scenes and attributes that have significantly contributed to automated image understanding. On the other hand, surprisingly little progress has been made on incorporating a spatial representation and reasoning in the inference process. In this work, we propose a pooling interpretation of spatial relations and show how it improves image retrieval and annotations tasks involving spatial language. Due to the complexity of the spatial language, we argue for a learning-based approach that acquires a representation of spatial relations by learning parameters of the pooling operator. We show improvements on previous work on two datasets and two different tasks as well as provide additional insights on a new dataset with an explicit focus on spatial relations.

READ FULL TEXT

page 1

page 7

page 8

research
07/16/2021

All the attention you need: Global-local, spatial-channel attention for image retrieval

We address representation learning for large-scale instance-level image ...
research
09/19/2020

City-Scale Visual Place Recognition with Deep Local Features Based on Multi-Scale Ordered VLAD Pooling

Visual place recognition is the task of recognizing a place depicted in ...
research
04/21/2020

Image Retrieval using Multi-scale CNN Features Pooling

In this paper, we address the problem of image retrieval by learning ima...
research
03/15/2023

A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval

Content-based image retrieval is the process of retrieving a subset of i...
research
07/19/2020

Understanding Spatial Relations through Multiple Modalities

Recognizing spatial relations and reasoning about them is essential in m...
research
11/10/2019

Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries

This paper explores the task of interactive image retrieval using natura...
research
01/15/2013

Learnable Pooling Regions for Image Classification

Biologically inspired, from the early HMAX model to Spatial Pyramid Matc...

Please sign up or login with your details

Forgot password? Click here to reset