Reliable Fidelity and Diversity Metrics for Generative Models

02/23/2020
by Muhammad Ferjad Naeem, et al.

Devising indicative evaluation metrics for the image generation task remains an open problem. The most widely used metric for measuring the similarity between real and generated images has been the Fréchet Inception Distance (FID) score. Because it does not differentiate between the fidelity and diversity aspects of the generated images, recent papers have introduced variants of precision and recall metrics to diagnose those properties separately. In this paper, we show that even the latest versions of the precision and recall metrics are not yet reliable. For example, they fail to detect the match between two identical distributions, they are not robust against outliers, and their evaluation hyperparameters are selected arbitrarily. We propose density and coverage metrics that solve the above issues. We analytically and experimentally show that density and coverage provide more interpretable and reliable signals for practitioners than the existing metrics. Code: https://github.com/clovaai/generative-evaluation-prdc.
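
As a rough illustration of the two proposed metrics, the sketch below computes density and coverage with NumPy from k-nearest-neighbour balls around the real samples. The definitions are not spelled out in this abstract; they follow the full paper and the linked repository as best understood here. The function names, the brute-force distance computation, and the default k=5 are assumptions made for this example. In practice the inputs would be embedding features of the images rather than raw pixels.

```python
import numpy as np


def pairwise_distances(a, b):
    """Euclidean distances between rows of a (N, D) and rows of b (M, D)."""
    diff = a[:, None, :] - b[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def density_and_coverage(real, fake, k=5):
    """Sketch of density and coverage (hypothetical helper, not the official API).

    real: array of shape (N, D) with features of real samples.
    fake: array of shape (M, D) with features of generated samples.
    k:    number of nearest neighbours defining each real sample's ball.
    """
    # Radius of each real sample's ball: distance to its k-th nearest
    # real neighbour. Index 0 after sorting is the sample itself.
    real_real = pairwise_distances(real, real)
    radii = np.sort(real_real, axis=1)[:, k]

    # real_fake[i, j] is the distance from real sample i to fake sample j.
    real_fake = pairwise_distances(real, fake)
    inside = real_fake < radii[:, None]

    # Density: average number of real balls containing a fake sample,
    # normalised by k (so it can exceed 1).
    density = inside.sum(axis=0).mean() / k

    # Coverage: fraction of real balls containing at least one fake sample.
    coverage = (real_fake.min(axis=1) < radii).mean()
    return float(density), float(coverage)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    real_features = rng.normal(size=(500, 16))
    fake_features = rng.normal(size=(500, 16))
    print(density_and_coverage(real_features, fake_features, k=5))
```

For two identical sample sets with no tied distances, both values come out as 1, which is the behaviour the abstract says earlier precision and recall variants fail to deliver. The brute-force distance matrix uses memory proportional to N × M × D, so this sketch only suits small feature sets; the linked repository provides its own implementation for real evaluations.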

Code Repositories

generative-evaluation-prdc

Code base for the precision, recall, density, and coverage metrics for generative models. ICML 2020.



gan-metrics

Lots of evaluation metrics for generative adversarial networks in PyTorch


