EvalAI: Towards Better Evaluation Systems for AI Agents

02/10/2019
by Deshraj Yadav, et al.

We introduce EvalAI, an open-source platform for evaluating and comparing machine learning (ML) and artificial intelligence (AI) algorithms at scale. EvalAI is built to give the research community a scalable way to meet the critical need of evaluating machine learning models and agents acting in an environment, either against annotations or with a human in the loop. This will help researchers, students, and data scientists create, collaborate on, and participate in AI challenges organized around the globe. By simplifying and standardizing the process of benchmarking these models, EvalAI seeks to lower the barrier to entry for participating in the global scientific effort to push the frontiers of machine learning and artificial intelligence, thereby increasing the rate of measurable progress in this domain.
