Where's YOUR focus: Personalized Attention

by Sikun Lin, et al.

Human visual attention is subjective and biased by the personal preferences of the viewer. Current saliency detection methods, however, are general and objective and do not account for the observer, which makes attention prediction for a particular person less accurate. In this work, we present the novel idea of personalized attention prediction and develop the Personalized Attention Network (PANet), a convolutional network that predicts saliency in images according to personal preference. The model consists of two streams that share common feature extraction layers: one stream is responsible for saliency prediction, while the other is adapted from a detection model and used to fit user preference. We automatically collect user preferences from their albums and leave users free to define which categories their preferences are divided into and how many there are. To train PANet, we dynamically generate ground truth saliency maps from existing detection labels and saliency labels, with generation parameters based on a dataset of 1k images that we collected. We evaluate the model with saliency prediction metrics and test the trained model on different preference vectors. The results show that our system performs much better than general models in personalized saliency prediction and is efficient to use with different preferences.
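
The idea of combining saliency labels, detection labels, and a preference vector can be illustrated with a minimal sketch. This is not the authors' generation procedure, whose parameters are not given in the abstract. It assumes a simple multiplicative boost inside detected boxes, and the function name `personalize_saliency`, the `strength` parameter, and the box format are all hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical box format: (top, left, bottom, right), bottom/right exclusive.
Box = Tuple[int, int, int, int]


def personalize_saliency(
    saliency: List[List[float]],
    detections: List[Tuple[str, Box]],
    preference: Dict[str, float],
    strength: float = 1.0,
) -> List[List[float]]:
    """Boost a general saliency map inside detected boxes by preference weight.

    Assumption: preference weights lie in [0, 1]; categories missing from
    the preference vector get weight 0 and are left unchanged.
    """
    height, width = len(saliency), len(saliency[0])
    gain = [[1.0] * width for _ in range(height)]
    for category, (top, left, bottom, right) in detections:
        weight = preference.get(category, 0.0)
        for y in range(max(top, 0), min(bottom, height)):
            for x in range(max(left, 0), min(right, width)):
                # Where boxes overlap, keep the largest gain.
                gain[y][x] = max(gain[y][x], 1.0 + strength * weight)
    boosted = [
        [saliency[y][x] * gain[y][x] for x in range(width)]
        for y in range(height)
    ]
    peak = max(max(row) for row in boosted)
    if peak <= 0.0:
        return boosted  # all-zero map: nothing to normalize
    # Rescale so the most salient pixel has value 1.
    return [[value / peak for value in row] for row in boosted]
```

Calling the function with the same image and a different `preference` dictionary yields a different target map, which loosely mirrors how the abstract describes testing one trained model on several preference vectors.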




Personalized Saliency and its Prediction

Almost all existing visual saliency models focus on predicting a univers...

Personalization of Saliency Estimation

Most existing saliency models use low-level features or task description...

End-to-end Convolutional Network for Saliency Prediction

The prediction of saliency areas in images has been traditionally addres...

A General Framework for Saliency Detection Methods

Saliency detection is one of the most challenging problems in the fields...

Context-empowered Visual Attention Prediction in Pedestrian Scenarios

Effective and flexible allocation of visual attention is key for pedestr...

Few-Shot Personalized Saliency Prediction Using Tensor Regression for Preserving Structural Global Information

This paper presents a few-shot personalized saliency prediction using te...

PR-Net: Preference Reasoning for Personalized Video Highlight Detection

Personalized video highlight detection aims to shorten a long video to i...
