AutoAD: Movie Description in Context

03/29/2023
by   Tengda Han, et al.
14

The objective of this paper is an automatic Audio Description (AD) model that ingests movies and outputs AD in text form. Generating high-quality movie AD is challenging due to the dependency of the descriptions on context, and the limited amount of training data available. In this work, we leverage the power of pretrained foundation models, such as GPT and CLIP, and only train a mapping network that bridges the two models for visually-conditioned text generation. In order to obtain high-quality AD, we make the following four contributions: (i) we incorporate context from the movie clip, AD from previous clips, as well as the subtitles; (ii) we address the lack of training data by pretraining on large-scale datasets, where visual or contextual information is unavailable, e.g. text-only AD without movies or visual captioning datasets without context; (iii) we improve on the currently available AD datasets, by removing label noise in the MAD dataset, and adding character naming information; and (iv) we obtain strong results on the movie AD task compared with previous methods.

READ FULL TEXT

page 1

page 5

page 8

page 15

page 16

page 18

research
05/12/2016

Movie Description

Audio Description (AD) provides linguistic descriptions of movies and al...
research
05/08/2020

Condensed Movies: Story Based Retrieval with Contextual Embeddings

Our objective in this work is the long range understanding of the narrat...
research
06/04/2015

The Long-Short Story of Movie Description

Generating descriptions for videos has many applications including assis...
research
09/19/2018

MTLE: A Multitask Learning Encoder of Visual Feature Representations for Video and Movie Description

Learning visual feature representations for video analysis is a daunting...
research
05/20/2023

Movie101: A New Movie Understanding Benchmark

To help the visually impaired enjoy movies, automatic movie narrating sy...
research
08/17/2020

Learning to Create Better Ads: Generation and Ranking Approaches for Ad Creative Refinement

In the online advertising industry, the process of designing an ad creat...
research
12/05/2022

Framework for 2D Ad placements in LinearTV

Virtual Product placement(VPP) is the advertising technique of digitally...

Please sign up or login with your details

Forgot password? Click here to reset