Exposure-Aware Recommendation using Contextual Bandits

09/04/2022
by   Masoud Mansoury, et al.
0

Exposure bias is a well-known issue in recommender systems where items and suppliers are not equally represented in the recommendation results. This is especially problematic when bias is amplified over time as a few items (e.g., popular ones) are repeatedly over-represented in recommendation lists and users' interactions with those items will amplify bias towards those items over time resulting in a feedback loop. This issue has been extensively studied in the literature on model-based or neighborhood-based recommendation algorithms, but less work has been done on online recommendation models, such as those based on top-K contextual bandits, where recommendation models are dynamically updated with ongoing user feedback. In this paper, we study exposure bias in a class of well-known contextual bandit algorithms known as Linear Cascading Bandits. We analyze these algorithms on their ability to handle exposure bias and provide a fair representation for items in the recommendation results. Our analysis reveals that these algorithms tend to amplify exposure disparity among items over time. In particular, we observe that these algorithms do not properly adapt to the feedback provided by the users and frequently recommend certain items even when those items are not selected by users. To mitigate this bias, we propose an Exposure-Aware (EA) reward model that updates the model parameters based on two factors: 1) user feedback (i.e., clicked or not), and 2) position of the item in the recommendation list. This way, the proposed model controls the utility assigned to items based on their exposure in the recommendation list. Extensive experiments on two real-world datasets using three contextual bandit algorithms show that the proposed reward model reduces exposure bias amplification in long run while maintaining the recommendation accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/07/2021

Unbiased Cascade Bandits: Mitigating Exposure Bias in Online Learning to Rank Recommendation

Exposure bias is a well-known issue in recommender systems where items a...
research
09/05/2023

Fairness of Exposure in Dynamic Recommendation

Exposure bias is a well-known issue in recommender systems where the exp...
research
08/21/2020

Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation

Online recommendation services recommend multiple commodities to users. ...
research
11/16/2022

Mitigating Frequency Bias in Next-Basket Recommendation via Deconfounders

Recent studies on Next-basket Recommendation (NBR) have achieved much pr...
research
06/13/2021

Correcting Exposure Bias for Link Recommendation

Link prediction methods are frequently applied in recommender systems, e...
research
03/26/2021

Analysing the Effect of Recommendation Algorithms on the Amplification of Misinformation

Recommendation algorithms have been pointed out as one of the major culp...
research
09/13/2021

Correcting the User Feedback-Loop Bias for Recommendation Systems

Selection bias is prevalent in the data for training and evaluating reco...

Please sign up or login with your details

Forgot password? Click here to reset