Saliency Driven Object recognition in egocentric videos with deep CNN

The problem of object recognition in natural scenes has been recently successfully addressed with Deep Convolutional Neuronal Networks giving a significant break-through in recognition scores. The computational efficiency of Deep CNNs as a function of their depth, allows for their use in real-time applications. One of the key issues here is to reduce the number of windows selected from images to be submitted to a Deep CNN. This is usually solved by preliminary segmentation and selection of specific windows, having outstanding "objectiveness" or other value of indicators of possible location of objects. In this paper we propose a Deep CNN approach and the general framework for recognition of objects in a real-time scenario and in an egocentric perspective. Here the window of interest is built on the basis of visual attention map computed over gaze fixations measured by a glass-worn eye-tracker. The application of this set-up is an interactive user-friendly environment for upper-limb amputees. Vision has to help the subject to control his worn neuro-prosthesis in case of a small amount of remaining muscles when the EMG control becomes unefficient. The recognition results on a specifically recorded corpus of 151 videos with simple geometrical objects show the mAP of 64,6% and the computational time at the generalization lower than a time of a visual fixation on the object-of-interest.

READ FULL TEXT

page 7

page 10

page 11

page 12

page 14

research
12/02/2007

Learning View Generalization Functions

Learning object models from views in 3D visual object recognition is usu...
research
01/10/2017

What are the visual features underlying human versus machine vision?

Although Deep Convolutional Networks (DCNs) are approaching the accuracy...
research
08/23/2013

Suspicious Object Recognition Method in Video Stream Based on Visual Attention

We propose a state of the art method for intelligent object recognition ...
research
10/28/2014

A hierarchical framework for object recognition

Object recognition in the presence of background clutter and distractors...
research
06/21/2022

Guiding Visual Attention in Deep Convolutional Neural Networks Based on Human Eye Movements

Deep Convolutional Neural Networks (DCNNs) were originally inspired by p...
research
02/04/2016

NeRD: a Neural Response Divergence Approach to Visual Salience Detection

In this paper, a novel approach to visual salience detection via Neural ...

Please sign up or login with your details

Forgot password? Click here to reset