Simulating Human Gaze with Neural Visual Attention

by   Leo Schwinn, et al.

Existing models of human visual attention are generally unable to incorporate direct task guidance and therefore cannot model an intent or goal when exploring a scene. To integrate guidance of any downstream visual task into attention modeling, we propose the Neural Visual Attention (NeVA) algorithm. To this end, we impose to neural networks the biological constraint of foveated vision and train an attention mechanism to generate visual explorations that maximize the performance with respect to the downstream task. We observe that biologically constrained neural networks generate human-like scanpaths without being trained for this objective. Extensive experiments on three common benchmark datasets show that our method outperforms state-of-the-art unsupervised human attention models in generating human-like scanpaths.


Behind the Machine's Gaze: Biologically Constrained Neural Networks Exhibit Human-like Visual Attention

By and large, existing computational models of visual attention tacitly ...

CNN-based search model underestimates attention guidance by simple visual features

Recently, Zhang et al. (2018) proposed an interesting model of attention...

A probabilistic tour of visual attention and gaze shift computational models

In this paper a number of problems are considered which are related to t...

Pre-training Attention Mechanisms

Recurrent neural networks with differentiable attention mechanisms have ...

Contrastive Language-Image Pretrained Models are Zero-Shot Human Scanpath Predictors

Understanding the mechanisms underlying human attention is a fundamental...

Peripheral Vision Transformer

Human vision possesses a special type of visual processing systems calle...

Please sign up or login with your details

Forgot password? Click here to reset