TargetUM: Targeted High-Utility Itemset Querying

10/30/2021
by   Jinbao Miao, et al.
0

Traditional high-utility itemset mining (HUIM) aims to determine all high-utility itemsets (HUIs) that satisfy the minimum utility threshold (minUtil) in transaction databases. However, in most applications, not all HUIs are interesting because only specific parts are required. Thus, targeted mining based on user preferences is more important than traditional mining tasks. This paper is the first to propose a target-based HUIM problem and to provide a clear formulation of the targeted utility mining task in a quantitative transaction database. A tree-based algorithm known as Target-based high-Utility iteMset querying using (TargetUM) is proposed. The algorithm uses a lexicographic querying tree and three effective pruning strategies to improve the mining efficiency. We implemented experimental validation on several real and synthetic databases, and the results demonstrate that the performance of TargetUM is satisfactory, complete, and correct. Finally, owing to the lexicographic querying tree, the database no longer needs to be scanned repeatedly for multiple queries.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2023

Targeted Mining of Top-k High Utility Itemsets

Finding high-importance patterns in data is an emerging data mining task...
research
11/17/2019

A one-phase tree-based algorithm for mining high-utility itemsets from a transaction database

High-utility itemset mining finds itemsets from a transaction database w...
research
12/25/2019

Utility-Driven Mining of Trend Information for Intelligent System

Useful knowledge, embedded in a database, is likely to change over time....
research
03/18/2018

A Guided FP-growth algorithm for multitude-targeted mining of big data

In this paper we present the GFP-growth (Guided FP-growth) algorithm, a ...
research
12/27/2021

An efficient mining scheme for high utility itemsets

Knowledge discovery in databases aims at finding useful information, whi...
research
08/18/2020

Discovering High Utility-Occupancy Patterns from Uncertain Data

It is widely known that there is a lot of useful information hidden in b...
research
07/06/2023

Finding Favourite Tuples on Data Streams with Provably Few Comparisons

One of the most fundamental tasks in data science is to assist a user wi...

Please sign up or login with your details

Forgot password? Click here to reset