A Critical Analysis of Recursive Model Indexes

06/30/2021
by   Marcel Maltry, et al.
0

The recursive model index (RMI) has recently been introduced as a machine-learned replacement for traditional indexes over sorted data, achieving remarkably fast lookups. Follow-up work focused on explaining RMI's performance and automatically configuring RMIs through enumeration. Unfortunately, configuring RMIs involves setting several hyperparameters, the enumeration of which is often too time-consuming in practice. Therefore, in this work, we conduct the first inventor-independent broad analysis of RMIs with the goal of understanding the impact of each hyperparameter on performance. In particular, we show that in addition to model types and layer size, error bounds and search algorithms must be considered to achieve the best possible performance. Based on our findings, we develop a simple-to-follow guideline for configuring RMIs. We evaluate our guideline by comparing the resulting RMIs with a number of state-of-the-art indexes, both learned and traditional. We show that our simple guideline is sufficient to achieve competitive performance with other learned indexes and RMIs whose configuration was determined using an expensive enumeration procedure. In addition, while carefully reimplementing RMIs, we are able to improve the build time by 2.5x to 6.3x.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2018

Using Known Information to Accelerate HyperParameters Optimization Based on SMBO

Automl is the key technology for machine learning problem. Current state...
research
06/24/2020

Hyperparameter Ensembles for Robustness and Uncertainty Quantification

Ensembles over neural network weights trained from different random init...
research
08/11/2021

Towards Practical Learned Indexing

Latest research proposes to replace existing index structures with learn...
research
12/23/2021

Using Sequential Statistical Tests to Improve the Performance of Random Search in hyperparameter Tuning

Hyperparamter tuning is one of the the most time-consuming parts in mach...
research
04/30/2020

RadixSpline: A Single-Pass Learned Index

Recent research has shown that learned models can outperform state-of-th...
research
09/16/2019

A Tsetlin Machine with Multigranular Clauses

The recently introduced Tsetlin Machine (TM) has provided competitive pa...

Please sign up or login with your details

Forgot password? Click here to reset