Domain Adaptive Hand Keypoint and Pixel Localization in the Wild

03/16/2022
by   Takehiko Ohkawa, et al.

We aim to improve hand keypoint regression and pixel-level hand mask segmentation under new imaging conditions (e.g., outdoors) when labeled images are available only for very different conditions (e.g., indoors). In the real world, a model trained for both tasks must work under a wide range of imaging conditions, but the variation covered by existing labeled hand datasets is limited. It is therefore necessary to adapt a model trained on labeled images (source) to unlabeled images (target) with unseen imaging conditions. Self-training domain adaptation methods (i.e., learning from unlabeled target images in a self-supervised manner) have been developed for both tasks, but their training may degrade performance when the predictions on the target images are noisy. To avoid this, it is crucial to assign a low importance (confidence) weight to noisy predictions during self-training. In this paper, we propose to use the divergence of two predictions to estimate the confidence of each target image for both tasks. The predictions come from two separate networks, and their divergence helps identify noisy predictions. To integrate this confidence estimation into self-training, we propose a teacher-student framework in which the two networks (teachers) supervise a third network (student) during self-training, and the teachers are learned from the student by knowledge distillation. Our experiments show that the method outperforms state-of-the-art methods in adaptation settings with different lighting, grasped objects, backgrounds, and camera viewpoints. It improves the multi-task score on HO3D by 4% compared to the latest adversarial adaptation method. We also validate our method on Ego4D, a dataset of egocentric videos with rapid changes in imaging conditions outdoors.
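The confidence idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function names, the use of mean keypoint distance as the divergence, the exponential mapping to a weight, the `temperature` parameter, and averaging the two teachers into a pseudo-label are all assumptions made for illustration. The point is only that a larger disagreement between the two teacher predictions yields a smaller weight on the student's self-training loss.

```python
import math


def disagreement(kp_a, kp_b):
    """Mean Euclidean distance between two sets of 2D keypoints.

    Stands in for the "divergence of two predictions"; the actual
    divergence measure used in the paper is not given in the abstract.
    """
    if len(kp_a) != len(kp_b) or not kp_a:
        raise ValueError("keypoint sets must be non-empty and equally long")
    return sum(math.dist(p, q) for p, q in zip(kp_a, kp_b)) / len(kp_a)


def confidence(kp_a, kp_b, temperature=10.0):
    """Map disagreement to a weight in (0, 1].

    exp(-d / temperature) is an assumed form: identical predictions
    give weight 1, and the weight decays as the teachers diverge.
    """
    return math.exp(-disagreement(kp_a, kp_b) / temperature)


def weighted_self_training_loss(student_kp, teacher_a_kp, teacher_b_kp,
                                temperature=10.0):
    """Confidence-weighted squared error between student and pseudo-label.

    The pseudo-label is the average of the two teacher predictions
    (an assumption), and the loss is scaled by the confidence weight.
    """
    pseudo = [((ax + bx) / 2.0, (ay + by) / 2.0)
              for (ax, ay), (bx, by) in zip(teacher_a_kp, teacher_b_kp)]
    weight = confidence(teacher_a_kp, teacher_b_kp, temperature)
    error = sum(math.dist(s, p) ** 2
                for s, p in zip(student_kp, pseudo)) / len(pseudo)
    return weight * error


if __name__ == "__main__":
    student = [(0.0, 0.0)]
    # Teachers agree: full weight on the student's error.
    print(weighted_self_training_loss(student, [(1.0, 0.0)], [(1.0, 0.0)]))
    # Teachers disagree strongly: the same kind of error is down-weighted.
    print(weighted_self_training_loss(student, [(0.0, 0.0)], [(6.0, 8.0)]))
```

In a full teacher-student loop, the teachers would in turn be updated from the student by knowledge distillation, which this sketch leaves out.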


research
07/21/2021

S4T: Source-free domain adaptation for semantic segmentation via self-supervised selective self-training

Most modern approaches for domain adaptive semantic segmentation rely on...
research
11/29/2021

Semi-supervised Domain Adaptation via Sample-to-Sample Self-Distillation

Semi-supervised domain adaptation (SSDA) is to adapt a learner to a new ...
research
03/25/2022

Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music

Lack of large-scale note-level labeled data is the major obstacle to sin...
research
12/08/2022

Self-training via Metric Learning for Source-Free Domain Adaptation of Semantic Segmentation

Unsupervised source-free domain adaptation methods aim to train a model ...
research
02/28/2018

Joint Pixel and Feature-level Domain Adaptation in the Wild

Recent developments in deep domain adaptation have allowed knowledge tra...
research
02/09/2023

MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection

Existing cross-domain keypoint detection methods always require accessin...
research
12/15/2020

Teach me to segment with mixed supervision: Confident students become masters

Deep segmentation neural networks require large training datasets with p...
