ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System

04/27/2023
by   Junke Wang, et al.
0

Existing deep video models are limited by specific tasks, fixed input-output spaces, and poor generalization capabilities, making it difficult to deploy them in real-world scenarios. In this paper, we present our vision for multimodal and versatile video understanding and propose a prototype system, . Our system is built upon a tracklet-centric paradigm, which treats tracklets as the basic video unit and employs various Video Foundation Models (ViFMs) to annotate their properties e.g., appearance, motion, . All the detected tracklets are stored in a database and interact with the user through a database manager. We have conducted extensive case studies on different types of in-the-wild videos, which demonstrates the effectiveness of our method in answering various video-related problems. Our project is available at https://www.wangjunke.info/ChatVideo/

READ FULL TEXT

page 3

page 5

page 6

page 7

page 8

research
08/28/2023

MagicAvatar: Multimodal Avatar Generation and Animation

This report presents MagicAvatar, a framework for multimodal video gener...
research
09/14/2022

WildQA: In-the-Wild Video Question Answering

Existing video understanding datasets mostly focus on human interactions...
research
07/13/2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

This paper introduces InternVid, a large-scale video-centric multimodal ...
research
05/24/2023

ECHo: Event Causality Inference via Human-centric Reasoning

We introduce ECHo, a diagnostic dataset of event causality inference gro...
research
05/10/2023

VideoChat: Chat-Centric Video Understanding

In this study, we initiate an exploration into video understanding by in...
research
10/05/2020

Improving Generative Imagination in Object-Centric World Models

The remarkable recent advances in object-centric generative world models...

Please sign up or login with your details

Forgot password? Click here to reset