Prediction Factory: automated development and collaborative evaluation of predictive models

11/29/2018
by Gaurav Sheni, et al.

In this paper, we present a data science automation system called Prediction Factory. The system uses several key automation algorithms to enable data scientists to rapidly develop predictive models and share them with domain experts. To assess the system's impact, we implemented three different interfaces for creating predictive modeling projects: baseline automation, full automation, and optional automation. Using a dataset of online grocery shopper behaviors, we divided data scientists among the interfaces to specify prediction problems, learn and evaluate models, and write a report that domain experts then judged to decide whether or not to fund continued work on the project. In total, 22 data scientists created 94 reports that were judged 296 times by 26 experts. In a head-to-head trial, reports generated with the full automation interface were funded 57.5% of the time, while reports generated with baseline automation were funded only 42.5% of the time. Reports generated with the interface that supports optional automation were funded 58.6% [...]. Full automation and optional automation reports were funded about equally when put head-to-head. These results demonstrate that Prediction Factory has implemented a critical amount of automation to augment the role of data scientists and improve business outcomes.


