The Evaluation of Rating Systems in Online Free-for-All Games

08/15/2020
by Arman Dehpanah, et al.

Online competitive games have become increasingly popular. To ensure an exciting and competitive environment, these games routinely attempt to match players with similar skill levels. Matching players is often accomplished through a rating system. There has been an increasing amount of research on developing such rating systems. However, less attention has been given to the metrics used to evaluate these systems. In this paper, we present an exhaustive analysis of six metrics for evaluating rating systems in online competitive games. We compare traditional metrics such as accuracy. We then introduce other metrics adapted from the field of information retrieval. We evaluate these metrics against several well-known rating systems on a large real-world dataset of over 100,000 free-for-all matches. Our results show stark differences in their utility. Some metrics do not consider the size of the deviation between two ranks. Others are inordinately impacted by new players. Many do not capture the importance of distinguishing between errors in higher ranks and errors in lower ranks. Among all metrics studied, we recommend Normalized Discounted Cumulative Gain (NDCG) because it not only resolves the issues faced by the other metrics but also offers the flexibility to adjust the evaluation based on the goals of the system.
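To make the recommendation concrete, the following is a minimal sketch of how NDCG could score a predicted ranking against the observed result of one free-for-all match. It uses the textbook NDCG definition with a log2 position discount; the abstract does not specify the exact formulation used in the paper, so the function names (`dcg`, `ndcg`) and the relevance assignment (a player who finished in place r out of n gets relevance n - r + 1) are assumptions made purely for illustration.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain: each gain is divided by log2(position + 1),
    with positions counted from 1, so early positions weigh the most."""
    return sum(rel / math.log2(pos + 2) for pos, rel in enumerate(relevances))


def ndcg(predicted_order, observed_rank):
    """NDCG of a predicted ranking for a single match.

    predicted_order: player ids, best predicted player first.
    observed_rank:   dict mapping player id -> observed place (1 = winner).

    Assumption (not from the paper): relevance is n - place + 1, so the
    winner has the highest relevance and the last player has relevance 1.
    """
    n = len(predicted_order)
    gains = [n - observed_rank[p] + 1 for p in predicted_order]
    ideal = sorted(gains, reverse=True)  # gains in the best possible order
    return dcg(gains) / dcg(ideal)


# Hypothetical four-player match: a finished first, d finished last.
observed = {"a": 1, "b": 2, "c": 3, "d": 4}

perfect = ndcg(["a", "b", "c", "d"], observed)
top_swap = ndcg(["b", "a", "c", "d"], observed)     # error among top ranks
bottom_swap = ndcg(["a", "b", "d", "c"], observed)  # error among bottom ranks

print(perfect, top_swap, bottom_swap)
```

Under these assumptions, a perfect prediction scores 1, and swapping the top two players lowers the score more than swapping the bottom two, which is the rank-sensitive behavior the abstract attributes to NDCG. Changing the relevance assignment or the discount is one way the metric could be adjusted to the goals of a particular system.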

research
05/28/2021

The Evaluation of Rating Systems in Team-based Battle Royale Games

Online competitive games have become a mainstream entertainment platform...
research
07/24/2023

Deep Bradley-Terry Rating: Estimate Properties Without Metric of Unseen Items

Many properties in the real world, such as desirability or strength in c...
research
10/02/2019

Understanding and Pushing the Limits of the Elo Rating Algorithm

This work is concerned with the rating of players/teams in face-to-face ...
research
08/04/2023

A State-Space Perspective on Modelling and Inference for Online Skill Rating

This paper offers a comprehensive review of the main methodologies used ...
research
08/14/2018

Skill Rating for Generative Models

We explore a new way to evaluate generative models using insights from e...
research
01/02/2021

An Elo-like System for Massive Multiplayer Competitions

Rating systems play an important role in competitive sports and games. T...
research
01/12/2022

Learning to Identify Top Elo Ratings: A Dueling Bandits Approach

The Elo rating system is widely adopted to evaluate the skills of (chess...
