Multi-label Stream Classification with Self-Organizing Maps

04/20/2020
by Ricardo Cerri, et al.

Several learning algorithms have been proposed for offline multi-label classification. However, applications in areas such as traffic monitoring, social networks, and sensors produce data continuously, as so-called data streams, which poses challenges to batch multi-label learning. Because the distribution of a data stream is not stationary, new algorithms are needed that adapt online to such changes (concept drift). Moreover, in realistic applications, changes occur in scenarios with infinitely delayed labels, where the true classes of the arriving instances are never available. We propose an online, unsupervised, incremental method based on self-organizing maps for multi-label stream classification with infinitely delayed labels. In the classification phase, we use a k-nearest neighbors strategy to compute the winning neurons in the maps, and we adapt to concept drift by adjusting the neuron weight vectors and the dataset label cardinality online. We predict labels for each instance using the Bayes rule and the outputs of each neuron, adapting the probabilities and conditional probabilities of the classes in the stream. Experiments using synthetic and real datasets show that our method is highly competitive with several methods from the literature, in both stationary and concept drift scenarios.
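
To make the classification phase described above more concrete, here is a minimal sketch in Python. It is not the authors' implementation. The function names, the learning rate `lr`, the per-neuron label estimates `label_probs`, and the rule of keeping as many labels as the rounded label cardinality are all assumptions made for illustration. In particular, the plain averaging of neuron outputs is a simplified stand-in for the Bayes-rule step mentioned in the abstract, and the online update of the label cardinality is left out.

```python
import math


def k_nearest_neurons(weights, x, k):
    """Return indices of the k neurons whose weight vectors are closest to x."""
    dists = [math.dist(w, x) for w in weights]
    return sorted(range(len(weights)), key=lambda i: dists[i])[:k]


def classify_and_adapt(weights, label_probs, x, k, cardinality, lr=0.1):
    """Predict labels for x, then nudge the winning neurons toward x.

    weights      -- list of neuron weight vectors (mutated in place)
    label_probs  -- label_probs[i][j]: assumed estimate of P(label j | neuron i)
    cardinality  -- current estimate of the average number of labels per instance
    lr           -- hypothetical learning rate for the weight update
    """
    winners = k_nearest_neurons(weights, x, k)
    n_labels = len(label_probs[0])
    # Average the per-neuron label estimates over the k winners
    # (a simplification, not the Bayes-rule computation of the paper).
    scores = [sum(label_probs[i][j] for i in winners) / k for j in range(n_labels)]
    # Keep as many labels as the rounded cardinality suggests (at least one).
    z = max(1, round(cardinality))
    predicted = sorted(sorted(range(n_labels), key=lambda j: -scores[j])[:z])
    # Unsupervised adaptation: move each winner a small step toward x.
    for i in winners:
        weights[i] = [w + lr * (xi - w) for w, xi in zip(weights[i], x)]
    return predicted, scores


# Toy usage: three neurons in 2-D, three possible labels.
weights = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
label_probs = [[0.9, 0.1, 0.2], [0.8, 0.3, 0.1], [0.1, 0.9, 0.9]]
predicted, scores = classify_and_adapt(
    weights, label_probs, x=[0.5, 0.4], k=2, cardinality=1.2
)
print(predicted, scores)
```

No true label is used anywhere in the update, which reflects the infinitely delayed label setting: only the instance itself moves the neurons, so the map can follow a drifting input distribution without supervision.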

research
01/31/2022

Implicit Concept Drift Detection for Multi-label Data Streams

Many real-world applications adopt multi-label data streams as the need ...
research
09/23/2016

A Novel Progressive Multi-label Classifier for Classincremental Data

In this paper, a progressive learning algorithm for multi-label classifi...
research
03/15/2022

Improved Multi-label Classification under Temporal Concept Drift: Rethinking Group-Robust Algorithms in a Label-Wise Setting

In document classification for, e.g., legal and biomedical text, we ofte...
research
10/06/2022

Evaluating k-NN in the Classification of Data Streams with Concept Drift

Data streams are often defined as large amounts of data flowing continuo...
research
02/10/2019

Hybrid Forest: A Concept Drift Aware Data Stream Mining Algorithm

Nowadays with a growing number of online controlling systems in the orga...
research
03/01/2021

STUDD: A Student-Teacher Method for Unsupervised Concept Drift Detection

Concept drift detection is a crucial task in data stream evolving enviro...
research
03/29/2022

Evolving Multi-Label Fuzzy Classifier

Multi-label classification has attracted much attention in the machine l...
