Fast Counting in Machine Learning Applications

04/12/2018
by   Subhadeep Karan, et al.
University at Buffalo
0

We propose scalable methods to execute counting queries in machine learning applications. To achieve memory and computational efficiency, we abstract counting queries and their context such that the counts can be aggregated as a stream. We demonstrate performance and scalability of the resulting approach on random queries, and through extensive experimentation using Bayesian networks learning and association rule mining. Our methods significantly outperform commonly used ADtrees and hash tables, and are practical alternatives for processing large-scale data.

READ FULL TEXT
03/01/1998

Cached Sufficient Statistics for Efficient Machine Learning with Large Datasets

This paper introduces new algorithms and data structures for quick count...
12/02/2021

A short note on the counting complexity of conjunctive queries

This note closes a minor gap in the literature on the counting complexit...
06/01/2020

F-IVM: Learning over Fast-Evolving Relational Data

F-IVM is a system for real-time analytics such as machine learning appli...
11/20/2012

A Traveling Salesman Learns Bayesian Networks

Structure learning of Bayesian networks is an important problem that ari...
06/13/2015

Graphlet Decomposition: Framework, Algorithms, and Applications

From social science to biology, numerous applications often rely on grap...
06/21/2019

Learning to Sample: Counting with Complex Queries

In this paper we present a suite of methods to efficiently estimate coun...
10/13/2020

Data Engineering for HPC with Python

Data engineering is becoming an increasingly important part of scientifi...

Please sign up or login with your details

Forgot password? Click here to reset