A Probabilistic Hard Attention Model For Sequentially Observed Scenes

11/15/2021
by Samrudhdhi B Rangrej, et al.

A visual hard attention model actively selects and observes a sequence of subregions in an image to make a prediction. Most hard attention models determine the attention-worthy regions by first analyzing a complete image. However, the entire image may not be available initially and may instead be sensed gradually through a series of partial observations. In this paper, we design an efficient hard attention model for classifying such sequentially observed scenes. The presented model never observes an image completely. To select informative regions under partial observability, the model uses Bayesian Optimal Experiment Design. First, it synthesizes the features of the unobserved regions based on the already observed regions. Then, it uses the predicted features to estimate the expected information gain (EIG) that would be attained by attending to each candidate region. Finally, the model attends to the actual content at the location where this EIG is maximum. The model uses a) a recurrent feature aggregator to maintain a recurrent state, b) a linear classifier to predict the class label, and c) a Partial variational autoencoder (Partial VAE) to predict the features of unobserved regions. We use normalizing flows in the Partial VAE to handle multi-modality in the feature-synthesis problem. We train our model using a differentiable objective and test it on five datasets. Our model gains 2-10% higher accuracy than the baseline models when both have seen only a couple of glimpses.
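
The selection step described above can be illustrated with a minimal sketch. It assumes, as is common in Bayesian experiment design, that the EIG of a location is the expected KL divergence between the class posterior after seeing synthesized features at that location and the current class posterior, averaged over sampled features. The abstract does not spell out this formula, and the function names, array shapes, and NumPy implementation below are hypothetical and not taken from the paper.

```python
import numpy as np


def expected_information_gain(prior, posteriors, eps=1e-12):
    """Monte Carlo EIG estimate per candidate location (illustrative only).

    prior:      shape (C,), current class distribution p(y | observed).
    posteriors: shape (L, S, C), class distributions the classifier would
                output after adding each of S synthesized feature samples
                at each of L candidate locations (assumed to be given).
    Returns an array of shape (L,) with one EIG estimate per location.
    """
    prior = np.asarray(prior, dtype=float)
    posteriors = np.asarray(posteriors, dtype=float)
    # KL[p(y | observed, f) || p(y | observed)] for every location and sample.
    log_ratio = np.log(posteriors + eps) - np.log(prior + eps)
    kl = np.sum(posteriors * log_ratio, axis=-1)  # shape (L, S)
    # Average over the synthesized feature samples.
    return kl.mean(axis=-1)  # shape (L,)


def select_next_location(prior, posteriors):
    """Return the index of the location with the highest estimated EIG."""
    eig = expected_information_gain(prior, posteriors)
    return int(np.argmax(eig)), eig
```

In this sketch, a location whose synthesized features leave the class distribution unchanged scores an EIG of zero, while a location whose samples each push the prediction confidently toward some class scores higher and would be attended next.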

research
11/14/2017

Saliency-based Sequential Image Attention with Multiset Prediction

Humans process visual scenes selectively and sequentially using attentio...
research
04/01/2022

Consistency driven Sequential Transformers Attention Model for Partially Observable Scenes

Most hard attention models initially observe a complete scene to locate ...
research
04/01/2021

Visual Attention in Imaginative Agents

We present a recurrent agent who perceives surroundings through a series...
research
11/03/2021

Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled Attention

Most feedforward convolutional neural networks spend roughly the same ef...
research
08/20/2019

Saccader: Improving Accuracy of Hard Attention Models for Vision

Although deep convolutional neural networks achieve state-of-the-art per...
research
06/13/2021

A test for partial correlation between repeatedly observed nonstationary nonlinear timeseries

We describe a family of statistical tests to measure partial correlation...
research
12/06/2022

Probabilistic Shape Completion by Estimating Canonical Factors with Hierarchical VAE

We propose a novel method for 3D shape completion from a partial observa...
