Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild

04/10/2023
by   Gyeongsik Moon, et al.
0

Recovering 3D human mesh in the wild is greatly challenging as in-the-wild (ITW) datasets provide only 2D pose ground truths (GTs). Recently, 3D pseudo-GTs have been widely used to train 3D human mesh estimation networks as the 3D pseudo-GTs enable 3D mesh supervision when training the networks on ITW datasets. However, despite the great potential of the 3D pseudo-GTs, there has been no extensive analysis that investigates which factors are important to make more beneficial 3D pseudo-GTs. In this paper, we provide three recipes to obtain highly beneficial 3D pseudo-GTs of ITW datasets. The main challenge is that only 2D-based weak supervision is allowed when obtaining the 3D pseudo-GTs. Each of our three recipes addresses the challenge in each aspect: depth ambiguity, sub-optimality of weak supervision, and implausible articulation. Experimental results show that simply re-training state-of-the-art networks with our new 3D pseudo-GTs elevates their performance to the next level without bells and whistles. The 3D pseudo-GT is publicly available in https://github.com/mks0601/NeuralAnnot_RELEASE.

READ FULL TEXT

page 1

page 2

page 3

page 10

page 11

page 12

research
11/23/2020

NeuralAnnot: Neural Annotator for in-the-wild Expressive 3D Human Pose and Mesh Training Sets

Recovering expressive 3D human pose and mesh from in-the-wild images is ...
research
07/20/2022

3D Clothed Human Reconstruction in the Wild

Although much progress has been made in 3D clothed human reconstruction,...
research
07/25/2022

W2N:Switching From Weak Supervision to Noisy Supervision for Object Detection

Weakly-supervised object detection (WSOD) aims to train an object detect...
research
04/04/2020

Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild

We introduce a simple and effective network architecture for monocular 3...
research
07/03/2020

LOOC: Localize Overlapping Objects with Count Supervision

Acquiring count annotations generally requires less human effort than po...
research
10/27/2020

Synthetic Training for Monocular Human Mesh Recovery

Recovering 3D human mesh from monocular images is a popular topic in com...
research
07/08/2021

Affect Expression Behaviour Analysis in the Wild using Consensual Collaborative Training

Facial expression recognition (FER) in the wild is crucial for building ...

Please sign up or login with your details

Forgot password? Click here to reset