Self-supervision on Unlabelled OR Data for Multi-person 2D/3D Human Pose Estimation

by Vinkle Srivastav, et al.
Université de Strasbourg

2D/3D human pose estimation is needed to develop novel intelligent tools for the operating room (OR) that can analyze and support clinical activities. However, the lack of annotated data and the complexity of state-of-the-art pose estimation approaches limit the deployment of such techniques inside the OR. In this work, we propose to use knowledge distillation in a teacher/student framework to train a lightweight network for joint 2D/3D pose estimation, harnessing the knowledge present both in a large-scale non-annotated dataset and in an accurate but complex multi-stage teacher network. The teacher network exploits the unlabeled data to generate both hard and soft labels, which help improve the student predictions. The easily deployable network trained using this self-supervision strategy performs on par with the teacher network on MVOR+, an extension of the public MVOR dataset in which all persons have been fully annotated, thus providing a viable solution for real-time 2D/3D human pose estimation in the OR.
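As a rough illustration of how hard and soft teacher labels can be combined when training a student, the sketch below mixes two mean-squared-error terms over per-keypoint confidence scores. It is a minimal sketch under stated assumptions, not the loss used in the paper: the function names (`harden`, `distillation_loss`), the thresholding rule used to derive hard labels, the weight `alpha`, and the choice of mean squared error are all hypothetical and chosen only for clarity.

```python
from typing import List, Sequence


def mse(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean squared error between two equally sized score vectors."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def harden(soft: Sequence[float], threshold: float = 0.5) -> List[float]:
    """Turn soft teacher scores into hard 0/1 pseudo-labels.

    Assumption: a simple confidence threshold; the paper may derive
    its hard labels differently.
    """
    return [1.0 if s >= threshold else 0.0 for s in soft]


def distillation_loss(
    student: Sequence[float],
    teacher_soft: Sequence[float],
    alpha: float = 0.5,
    threshold: float = 0.5,
) -> float:
    """Weighted sum of a hard-label term and a soft-label term.

    alpha weights the hard-label term; (1 - alpha) weights the
    soft-label term. Both the weighting and the MSE are illustrative.
    """
    hard = harden(teacher_soft, threshold)
    hard_term = mse(student, hard)
    soft_term = mse(student, teacher_soft)
    return alpha * hard_term + (1.0 - alpha) * soft_term


if __name__ == "__main__":
    teacher_scores = [0.9, 0.1]   # hypothetical teacher confidences
    student_scores = [0.8, 0.3]   # hypothetical student predictions
    print(distillation_loss(student_scores, teacher_scores))
```

With `alpha = 0` the student is trained only to match the teacher's soft scores; with `alpha = 1` it is trained only on the thresholded pseudo-labels. Intermediate values trade off the two signals.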



