Multi-View Features and Hybrid Reward Strategies for Vatex Video Captioning Challenge 2019

10/17/2019
by   Xinxin Zhu, et al.
0

This document describes our solution for the VATEX Captioning Challenge 2019, which requires generating descriptions for the videos in both English and Chinese languages. We identified three crucial factors that improve the performance, namely: multi-view features, hybrid reward, and diverse ensemble. Our method achieves the 2nd and the 3rd places on the Chinese and English video captioning tracks, respectively.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset