Sequential Person Recognition in Photo Albums with a Recurrent Network

11/30/2016
by Yao Li, et al.

Recognizing the identities of people in everyday photos remains a very challenging problem for machine vision because of non-frontal faces and changes in clothing, location, lighting, and similar factors. Recent studies have shown that rich relational information between people in the same photo can help in recognizing their identities. In this work, we propose to model the relational information between people as a sequence prediction task. At the core of our work is a novel recurrent network architecture, in which relational information between instances' labels and appearance are modeled jointly. In addition to relational cues, scene context is incorporated in our sequence prediction model at no additional cost. In this sense, our approach is a unified framework for modeling both contextual cues and the visual appearance of person instances. Our model is trained end-to-end with a sequence of annotated instances in a photo as inputs and a sequence of corresponding labels as targets. We demonstrate that this simple but elegant formulation achieves state-of-the-art performance on the newly released People In Photo Albums (PIPA) dataset.
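The sequence formulation described in the abstract can be illustrated with a small sketch. The code below is not the authors' architecture: it assumes a plain tanh recurrent cell, random untrained weights, toy feature vectors, and greedy decoding, and every name in it (`predict_sequence`, `init`, the weight matrices) is hypothetical. It only shows the general idea that each person instance in a photo is labeled in turn, with each step seeing the instance's appearance feature, the previously predicted label, and a hidden state that carries information about earlier instances.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def init(rows, cols, rng):
    """Small random weights; a trained model would learn these instead."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def predict_sequence(features, params, num_ids):
    """Label the instances of one photo in order (greedy decoding).

    Assumption: a plain tanh recurrent cell stands in for whatever
    recurrent unit the paper actually uses.
    """
    W_x, W_y, W_h, W_o = params
    h = [0.0] * len(W_h)      # hidden state, empty before the first instance
    prev = [0.0] * num_ids    # one-hot of the previous label (none yet)
    labels, probs = [], []
    for x in features:
        # Combine appearance, previous label, and history in one update.
        pre = [a + b + c for a, b, c in
               zip(matvec(W_x, x), matvec(W_y, prev), matvec(W_h, h))]
        h = [math.tanh(v) for v in pre]
        p = softmax(matvec(W_o, h))
        k = max(range(num_ids), key=p.__getitem__)
        prev = [1.0 if i == k else 0.0 for i in range(num_ids)]
        labels.append(k)
        probs.append(p)
    return labels, probs


# Toy usage: one photo with three person instances, three known identities.
rng = random.Random(0)
feat_dim, hidden, num_ids = 4, 5, 3
params = (init(hidden, feat_dim, rng), init(hidden, num_ids, rng),
          init(hidden, hidden, rng), init(num_ids, hidden, rng))
photo = [[rng.random() for _ in range(feat_dim)] for _ in range(3)]
labels, probs = predict_sequence(photo, params, num_ids)
```

With untrained weights the predicted labels are meaningless; the point is only the data flow, in which the label chosen for one instance feeds into the prediction for the next. End-to-end training, as the abstract describes, would fit the weights against the sequence of ground-truth labels.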


research
01/23/2015

Beyond Frontal Faces: Improving Person Recognition Using Multiple Cues

We explore the task of recognizing peoples' identities in photo albums i...
research
09/24/2018

Zoom-RNN: A Novel Method for Person Recognition Using Recurrent Neural Networks

The overwhelming popularity of social media has resulted in bulk amounts...
research
06/08/2018

Unifying Identification and Context Learning for Person Recognition

Despite the great success of face recognition techniques, recognizing pe...
research
09/11/2015

Person Recognition in Personal Photo Collections

Recognising persons in everyday photos presents major challenges (occlud...
research
02/02/2019

Hierarchical Photo-Scene Encoder for Album Storytelling

In this paper, we propose a novel model with a hierarchical photo-scene ...
research
05/20/2021

Egocentric Activity Recognition and Localization on a 3D Map

Given a video captured from a first person perspective and recorded in a...
research
06/02/2015

What Makes Kevin Spacey Look Like Kevin Spacey

We reconstruct a controllable model of a person from a large photo colle...
