A Baseline Framework for Part-level Action Parsing and Action Recognition

10/07/2021
by   Xiaodong Chen, et al.
0

This technical report introduces our 2nd place solution to Kinetics-TPS Track on Part-level Action Parsing in ICCV DeeperAction Workshop 2021. Our entry is mainly based on YOLOF for instance and part detection, HRNet for human pose estimation, and CSN for video-level action recognition and frame-level part state parsing. We describe technical details for the Kinetics-TPS dataset, together with some experimental results. In the competition, we achieved 61.37 mAP on the test set of Kinetics-TPS.

READ FULL TEXT
research
03/13/2023

An Improved Baseline Framework for Pose Estimation Challenge at ECCV 2022 Visual Perception for Navigation in Human Environments Workshop

This technical report describes our first-place solution to the pose est...
research
11/05/2021

Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action Parsing

Part-level Action Parsing aims at part state parsing for boosting action...
research
06/14/2021

Quality-Aware Network for Face Parsing

This is a very short technical report, which introduces the solution of ...
research
07/16/2023

Integrating Human Parsing and Pose Network for Human Action Recognition

Human skeletons and RGB sequences are both widely-adopted input modaliti...
research
06/13/2021

A Stronger Baseline for Ego-Centric Action Detection

This technical report analyzes an egocentric video action detection meth...
research
11/30/2018

Parsing R-CNN for Instance-Level Human Analysis

Instance-level human analysis is common in real-life scenarios and has m...
research
08/18/2021

The Multi-Modal Video Reasoning and Analyzing Competition

In this paper, we introduce the Multi-Modal Video Reasoning and Analyzin...

Please sign up or login with your details

Forgot password? Click here to reset