Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels

by   Bang Yang, et al.

Generating an informative and attractive title for the product is a crucial task for e-commerce. Most existing works follow the standard multimodal natural language generation approaches, e.g., image captioning, and employ the large scale of human-labelled datasets to train desirable models. However, for novel products, especially in a different domain, there are few existing labelled data. In this paper, we propose a prompt-based approach, i.e., the Multimodal Prompt Learning framework, to accurately and efficiently generate titles for novel products with limited labels. We observe that the core challenges of novel product title generation are the understanding of novel product characteristics and the generation of titles in a novel writing style. To this end, we build a set of multimodal prompts from different modalities to preserve the corresponding characteristics and writing styles of novel products. As a result, with extremely limited labels for training, the proposed method can retrieve the multimodal prompts to generate desirable titles for novel products. The experiments and analyses are conducted on five novel product categories under both the in-domain and out-of-domain experimental settings. The results show that, with only 1 our proposed approach achieves the best few-shot results and even achieves competitive results with fully-supervised methods trained on 100 data; With the full labelled data for training, our method achieves state-of-the-art results.


page 1

page 2

page 3

page 4


A Multimodal Late Fusion Model for E-Commerce Product Classification

The cataloging of product listings is a fundamental problem for most e-c...

Automatic Generation of Product-Image Sequence in E-commerce

Product images are essential for providing desirable user experience in ...

Multimodal Dialogue Response Generation

Responsing with image has been recognized as an important capability for...

A Multimodal Recommender System for Large-scale Assortment Generation in E-commerce

E-commerce platforms surface interesting products largely through produc...

Product Image Recognition with Guidance Learning and Noisy Supervision

This paper considers recognizing products from daily photos, which is an...

Unimodal Intermediate Training for Multimodal Meme Sentiment Classification

Internet Memes remain a challenging form of user-generated content for a...

A labeled dataset of cloud types using data from GOES-16 and CloudSat

In this paper we present the development of a dataset consisting of 91 M...

Please sign up or login with your details

Forgot password? Click here to reset