Flexible Modeling and Multitask Learning using Differentiable Tree Ensembles

05/19/2022
by   Shibal Ibrahim, et al.
0

Decision tree ensembles are widely used and competitive learning models. Despite their success, popular toolkits for learning tree ensembles have limited modeling capabilities. For instance, these toolkits support a limited number of loss functions and are restricted to single task learning. We propose a flexible framework for learning tree ensembles, which goes beyond existing toolkits to support arbitrary loss functions, missing responses, and multi-task learning. Our framework builds on differentiable (a.k.a. soft) tree ensembles, which can be trained using first-order methods. However, unlike classical trees, differentiable trees are difficult to scale. We therefore propose a novel tensor-based formulation of differentiable trees that allows for efficient vectorization on GPUs. We perform experiments on a collection of 28 real open-source and proprietary datasets, which demonstrate that our framework can lead to 100x more compact and 23 by popular toolkits.

READ FULL TEXT
research
09/10/2021

A Neural Tangent Kernel Perspective of Infinite Tree Ensembles

In practical situations, the ensemble tree model is one of the most popu...
research
02/18/2020

The Tree Ensemble Layer: Differentiability meets Conditional Computation

Neural networks and tree ensembles are state-of-the-art learners, each w...
research
05/25/2022

A Neural Tangent Kernel Formula for Ensembles of Soft Trees with Arbitrary Architectures

A soft tree is an actively studied variant of a decision tree that updat...
research
05/05/2023

Verifiable Learning for Robust Tree Ensembles

Verifying the robustness of machine learning models against evasion atta...
research
06/08/2022

TreeFlow: Going beyond Tree-based Gaussian Probabilistic Regression

The tree-based ensembles are known for their outstanding performance for...
research
05/31/2022

ForestPrune: Compact Depth-Controlled Tree Ensembles

Tree ensembles are versatile supervised learning algorithms that achieve...
research
06/01/2023

Loss-Optimal Classification Trees: A Generalized Framework and the Logistic Case

The Classification Tree (CT) is one of the most common models in interpr...

Please sign up or login with your details

Forgot password? Click here to reset