Domain Generalization through Audio-Visual Relative Norm Alignment in First Person Action Recognition

10/19/2021
by Mirco Planamente, et al.

First person action recognition is an increasingly researched area thanks to the rising popularity of wearable cameras. This growth is bringing to light cross-domain issues that have yet to be addressed in this context. Indeed, the information extracted from learned representations suffers from an intrinsic "environmental bias". This bias strongly affects the ability to generalize to unseen scenarios, limiting the application of current methods to real settings where labeled data are not available during training. In this work, we introduce the first domain generalization approach for egocentric activity recognition by proposing a new audio-visual loss, called Relative Norm Alignment loss. It re-balances the contributions of the two modalities during training, across different domains, by aligning their feature norm representations. Our approach leads to strong results in domain generalization on both EPIC-Kitchens-55 and EPIC-Kitchens-100, as demonstrated by extensive experiments, and can also be extended to domain adaptation settings with competitive results.
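
The abstract only states that the loss aligns the feature norms of the two modalities; it does not give a formula. As a rough illustration, the sketch below assumes one plausible form: the squared deviation from 1 of the ratio between the mean L2 norm of the visual features and the mean L2 norm of the audio features. The function name, the `eps` parameter, and the synthetic feature arrays are all hypothetical and are not taken from the paper.

```python
import numpy as np

def relative_norm_alignment_loss(visual_feats, audio_feats, eps=1e-8):
    """Assumed form of a norm-alignment penalty (not confirmed by the abstract).

    visual_feats, audio_feats: arrays of shape (batch, feature_dim).
    Returns (mean_visual_norm / mean_audio_norm - 1)^2 as a Python float.
    """
    # Mean L2 norm of the per-sample feature vectors, one scalar per modality.
    visual_norm = np.linalg.norm(visual_feats, axis=1).mean()
    audio_norm = np.linalg.norm(audio_feats, axis=1).mean()
    # The penalty is zero when both modalities have the same mean norm.
    return float((visual_norm / (audio_norm + eps) - 1.0) ** 2)

# Synthetic features: the "visual" stream is scaled to have larger norms.
rng = np.random.default_rng(0)
audio = rng.normal(size=(8, 16))
visual = rng.normal(size=(8, 16)) * 3.0

loss_unbalanced = relative_norm_alignment_loss(visual, audio)
loss_balanced = relative_norm_alignment_loss(audio, audio)
```

Under this assumed form, the penalty grows as one modality's features dominate in magnitude and vanishes when the mean norms match, which is one way to read "re-balancing the contributions" of the two streams.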

research
06/03/2021

Cross-Domain First Person Audio-Visual Action Recognition through Relative Norm Alignment

First person action recognition is an increasingly researched topic beca...
research
07/01/2021

PoliTO-IIT Submission to the EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition

In this report, we describe the technical details of our submission to t...
research
09/09/2022

PoliTO-IIT-CINI Submission to the EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition

In this report, we describe the technical details of our submission to t...
research
06/14/2022

Semantic-Discriminative Mixup for Generalizable Sensor-based Cross-domain Activity Recognition

It is expensive and time-consuming to collect sufficient labeled data to...
research
07/21/2022

Domain Generalization for Activity Recognition via Adaptive Feature Fusion

Human activity recognition requires the efforts to build a generalizable...
research
03/27/2022

Audio-Adaptive Activity Recognition Across Video Domains

This paper strives for activity recognition under domain shift, for exam...
research
04/20/2022

CALI: Coarse-to-Fine ALIgnments Based Unsupervised Domain Adaptation of Traversability Prediction for Deployable Autonomous Navigation

Traversability prediction is a fundamental perception capability for aut...
