MM-GEF: Multi-modal representation meet collaborative filtering

08/14/2023
by   Hao Wu, et al.
0

In modern e-commerce, item content features in various modalities offer accurate yet comprehensive information to recommender systems. The majority of previous work either focuses on learning effective item representation during modelling user-item interactions, or exploring item-item relationships by analysing multi-modal features. Those methods, however, fail to incorporate the collaborative item-user-item relationships into the multi-modal feature-based item structure. In this work, we propose a graph-based item structure enhancement method MM-GEF: Multi-Modal recommendation with Graph Early-Fusion, which effectively combines the latent item structure underlying multi-modal contents with the collaborative signals. Instead of processing the content feature in different modalities separately, we show that the early-fusion of multi-modal features provides significant improvement. MM-GEF learns refined item representations by injecting structural information obtained from both multi-modal and collaborative signals. Through extensive experiments on four publicly available datasets, we demonstrate systematical improvements of our method over state-of-the-art multi-modal recommendation methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/19/2021

Mining Latent Structures for Multimedia Recommendation

Multimedia content is of predominance in the modern Web era. Investigati...
research
08/08/2023

Online Distillation-enhanced Multi-modal Transformer for Sequential Recommendation

Multi-modal recommendation systems, which integrate diverse types of inf...
research
05/24/2023

Collaborative Recommendation Model Based on Multi-modal Multi-view Attention Network: Movie and literature cases

The existing collaborative recommendation models that use multi-modal in...
research
08/30/2023

Adaptive Multi-Modalities Fusion in Sequential Recommendation Systems

In sequential recommendation, multi-modal information (e.g., text or ima...
research
05/13/2020

Multi-modal Embedding Fusion-based Recommender

Recommendation systems have lately been popularized globally, with prima...
research
02/21/2023

Multi-Modal Self-Supervised Learning for Recommendation

The online emergence of multi-modal sharing platforms (eg, TikTok, Youtu...
research
07/22/2019

Multi-Modal Adversarial Autoencoders for Recommendations of Citations and Subject Labels

We present multi-modal adversarial autoencoders for recommendation and e...

Please sign up or login with your details

Forgot password? Click here to reset