Better Software Analytics via "DUO": Data Mining Algorithms Using/Used-by Optimizers

12/04/2018
by Amritanshu Agrawal, et al.

This paper claims that a new field of empirical software engineering research and practice is emerging: data mining using/used-by optimizers for empirical studies, or DUO. For example, data miners can generate the models that optimizers then explore. Conversely, optimizers can advise how best to adjust the control parameters of a data miner. This combined approach acts like an agent leaning over an analyst's shoulder and advising "ask this question next" or "ignore that problem, it is not relevant to your goals". Further, such agents can help us build "better" predictive models, where "better" can mean either greater predictive accuracy or faster modeling time (which, in turn, enables the exploration of a wider range of options). We also caution that the era of papers that just use data miners is coming to an end. Results obtained from an unoptimized data miner can be quickly refuted, just by applying an optimizer to produce a different (and better performing) model. Our conclusion, hence, is that for software analytics it is possible, useful and necessary to combine data mining and optimization using DUO.
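
To make the "optimizer advises the data miner" direction concrete, the following is a minimal sketch and not the authors' method. It assumes a toy data miner (a 1-D k-nearest-neighbor classifier), a toy optimizer (plain random search over the control parameter `k`), and synthetic data. Every name here (`make_data`, `knn_predict`, `random_search`, the default `k=1`) is hypothetical and chosen only for illustration.

```python
import random


def make_data(n, rng):
    """Synthetic 1-D data (an assumption): label is 1 when x > 0.5, 10% of labels flipped."""
    data = []
    for _ in range(n):
        x = rng.random()
        label = int(x > 0.5)
        if rng.random() < 0.1:  # inject label noise
            label = 1 - label
        data.append((x, label))
    return data


def knn_predict(train, x, k):
    """Toy data miner: majority vote among the k nearest training points."""
    nearest = sorted(train, key=lambda row: abs(row[0] - x))[:k]
    votes = sum(label for _, label in nearest)
    return int(2 * votes > k)


def accuracy(train, valid, k):
    """Fraction of validation points the k-NN miner labels correctly."""
    hits = sum(knn_predict(train, x, k) == y for x, y in valid)
    return hits / len(valid)


def random_search(train, valid, candidates, default_k, budget, rng):
    """Toy optimizer: score the default first, then try `budget` random values of k."""
    best_k, best_acc = default_k, accuracy(train, valid, default_k)
    for _ in range(budget):
        k = rng.choice(candidates)
        acc = accuracy(train, valid, k)
        if acc > best_acc:
            best_k, best_acc = k, acc
    return best_k, best_acc


rng = random.Random(1)
train = make_data(200, rng)
valid = make_data(100, rng)

candidates = list(range(1, 32, 2))  # odd k only, so votes never tie
default_acc = accuracy(train, valid, 1)  # the "unoptimized" miner
best_k, best_acc = random_search(train, valid, candidates, 1, 10, rng)

print(f"default k=1 accuracy: {default_acc:.2f}")
print(f"tuned k={best_k} accuracy: {best_acc:.2f}")
```

Because the search scores the default setting first, the tuned miner can never report a lower validation accuracy than the untuned one. A real study would replace random search with a stronger optimizer and would score the final model on held-out test data rather than on the validation set used for tuning.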

