Efficient Graph-Friendly COCO Metric Computation for Train-Time Model Evaluation

07/21/2022
by   Luke Wood, et al.
0

Evaluating the COCO mean average precision (MaP) and COCO recall metrics as part of the static computation graph of modern deep learning frameworks poses a unique set of challenges. These challenges include the need for maintaining a dynamic-sized state to compute mean average precision, reliance on global dataset-level statistics to compute the metrics, and managing differing numbers of bounding boxes between images in a batch. As a consequence, it is common practice for researchers and practitioners to evaluate COCO metrics as a post training evaluation step. With a graph-friendly algorithm to compute COCO Mean Average Precision and recall, these metrics could be evaluated at training time, improving visibility into the evolution of the metrics through training curve plots, and decreasing iteration time when prototyping new model versions. Our contributions include an accurate approximation algorithm for Mean Average Precision, an open source implementation of both COCO mean average precision and COCO recall, extensive numerical benchmarks to verify the accuracy of our implementations, and an open-source training loop that include train-time evaluation of mean average precision and recall.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/19/2022

A Parallel Implementation of Computing Mean Average Precision

Mean Average Precision (mAP) has been widely used for evaluating the qua...
research
11/22/2022

Ontology-aware Learning and Evaluation for Audio Tagging

This study defines a new evaluation metric for audio tagging tasks to ov...
research
02/23/2020

Reliable Fidelity and Diversity Metrics for Generative Models

Devising indicative evaluation metrics for the image generation task rem...
research
07/28/2020

EasierPath: An Open-source Tool for Human-in-the-loop Deep Learning of Renal Pathology

Considerable morphological phenotyping studies in nephrology have emerge...
research
08/28/2017

It's Time to Consider "Time" when Evaluating Recommender-System Algorithms [Proposal]

In this position paper, we question the current practice of calculating ...
research
04/07/2022

ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO

Image-Test matching (ITM) is a common task for evaluating the quality of...
research
09/15/2023

The Use of Multi-Scale Fiducial Markers To Aid Takeoff and Landing Navigation by Rotorcraft

This paper quantifies the impact of adverse environmental conditions on ...

Please sign up or login with your details

Forgot password? Click here to reset