Towards Green Automated Machine Learning: Status Quo and Future Directions

11/10/2021
by   Tanja Tornede, et al.
0

Automated machine learning (AutoML) strives for the automatic configuration of machine learning algorithms and their composition into an overall (software) solution - a machine learning pipeline - tailored to the learning task (dataset) at hand. Over the last decade, AutoML has become a hot research topic with hundreds of contributions. While AutoML offers many prospects, it is also known to be quite resource-intensive, which is one of its major points of criticism. The primary cause for a high resource consumption is that many approaches rely on the (costly) evaluation of many ML pipelines while searching for good candidates. This problem is amplified in the context of research on AutoML methods, due to large scale experiments conducted with many datasets and approaches, each of them being run with several repetitions to rule out random effects. In the spirit of recent work on Green AI, this paper is written in an attempt to raise the awareness of AutoML researchers for the problem and to elaborate on possible remedies. To this end, we identify four categories of actions the community may take towards more sustainable research on AutoML, namely approach design, benchmarking, research incentives, and transparency.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/11/2018

MARVIN: An Open Machine Learning Corpus and Environment for Automated Machine Learning Primitive Annotation and Execution

In this demo paper, we introduce the DARPA D3M program for automatic mac...
research
01/30/2020

AVATAR – Machine Learning Pipeline Evaluation Using Surrogate Model

The evaluation of machine learning (ML) pipelines is essential during au...
research
11/21/2020

AutoWeka4MCPS-AVATAR: Accelerating Automated Machine Learning Pipeline Composition and Optimisation

Automated machine learning pipeline (ML) composition and optimisation ai...
research
07/04/2020

Lale: Consistent Automated Machine Learning

Automated machine learning makes it easier for data scientists to develo...
research
07/29/2017

MLBench: How Good Are Machine Learning Clouds for Binary Classification Tasks on Structured Data?

We conduct an empirical study of machine learning functionalities provid...
research
08/08/2022

On Taking Advantage of Opportunistic Meta-knowledge to Reduce Configuration Spaces for Automated Machine Learning

The automated machine learning (AutoML) process can require searching th...
research
11/29/2021

The CSIRO Crown-of-Thorn Starfish Detection Dataset

Crown-of-Thorn Starfish (COTS) outbreaks are a major cause of coral loss...

Please sign up or login with your details

Forgot password? Click here to reset