Chained Representation Cycling: Learning to Estimate 3D Human Pose and Shape by Cycling Between Representations

01/06/2020
by   Nadine Rueegg, et al.
18

The goal of many computer vision systems is to transform image pixels into 3D representations. Recent popular models use neural networks to regress directly from pixels to 3D object parameters. Such an approach works well when supervision is available, but in problems like human pose and shape estimation, it is difficult to obtain natural images with 3D ground truth. To go one step further, we propose a new architecture that facilitates unsupervised, or lightly supervised, learning. The idea is to break the problem into a series of transformations between increasingly abstract representations. Each step involves a cycle designed to be learnable without annotated training data, and the chain of cycles delivers the final solution. Specifically, we use 2D body part segments as an intermediate representation that contains enough information to be lifted to 3D, and at the same time is simple enough to be learned in an unsupervised way. We demonstrate the method by learning 3D human pose and shape from un-paired and un-annotated images. We also explore varying amounts of paired data and show that cycling greatly alleviates the need for paired data. While we present results for modeling humans, our formulation is general and can be applied to other vision problems.

READ FULL TEXT

page 1

page 5

page 6

page 7

research
09/30/2019

DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare

We present DenseRaC, a novel end-to-end framework for jointly estimating...
research
05/23/2021

Heuristic Weakly Supervised 3D Human Pose Estimation in Novel Contexts without Any 3D Pose Ground Truth

Monocular 3D human pose estimation from a single RGB image has received ...
research
03/28/2016

Shuffle and Learn: Unsupervised Learning using Temporal Order Verification

In this paper, we present an approach for learning a visual representati...
research
05/10/2018

Ordinal Depth Supervision for 3D Human Pose Estimation

Our ability to train end-to-end systems for 3D human pose estimation fro...
research
03/23/2020

Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows

Monocular 3D human pose and shape estimation is challenging due to the m...
research
11/26/2020

Multi-view Human Pose and Shape Estimation Using Learnable Volumetric Aggregation

Human pose and shape estimation from RGB images is a highly sought after...
research
04/22/2019

Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids

Abstracting complex 3D shapes with parsimonious part-based representatio...

Please sign up or login with your details

Forgot password? Click here to reset