Video Summarization Based on Video-text Modelling

01/07/2022
by   Li Haopeng, et al.
0

Modern video summarization methods are based on deep neural networks which require a large amount of annotated data for training. However, existing datasets for video summarization are small-scale, easily leading to over-fitting of the deep models. Considering that the annotation of large-scale datasets is time-consuming, we propose a multimodal self-supervised learning framework to obtain semantic representations of videos, which benefits the video summarization task. Specifically, we explore the semantic consistency between the visual information and text information of videos, for the self-supervised pretraining of a multimodal encoder on a newly-collected dataset of video-text pairs. Additionally, we introduce a progressive video summarization method, where the important content in a video is pinpointed progressively to generate better summaries. Finally, an objective evaluation framework is proposed to measure the quality of video summaries based on video classification. Extensive experiments have proved the effectiveness and superiority of our method in rank correlation coefficients, F-score, and the proposed objective evaluation compared to the state of the art.

READ FULL TEXT

page 3

page 4

research
03/13/2023

Align and Attend: Multimodal Summarization with Dual Contrastive Losses

The goal of multimodal summarization is to extract the most important in...
research
12/27/2021

Video Joint Modelling Based on Hierarchical Transformer for Co-summarization

Video summarization aims to automatically generate a summary (storyboard...
research
01/30/2015

Co-Regularized Deep Representations for Video Summarization

Compact keyframe-based video summaries are a popular way of generating v...
research
03/28/2023

SELF-VS: Self-supervised Encoding Learning For Video Summarization

Despite its wide range of applications, video summarization is still hel...
research
04/24/2019

A General Framework for Edited Video and Raw Video Summarization

In this paper, we build a general summarization framework for both of ed...
research
05/27/2021

Self-Supervised Multimodal Opinion Summarization

Recently, opinion summarization, which is the generation of a summary fr...
research
03/08/2023

Sample Efficient Multimodal Semantic Augmentation for Incremental Summarization

In this work, we develop a prompting approach for incremental summarizat...

Please sign up or login with your details

Forgot password? Click here to reset