Adaptive Preferential Attached kNN Graph with Distribution-Awareness

08/04/2023
by Shaojie Min, et al.
Chongqing University

Graph-based kNN algorithms are widely used in machine learning tasks for their simplicity and effectiveness. However, because real-world data often follow complex distributions, the conventional kNN graph's reliance on a single, uniform k value can hinder its performance. A crucial factor behind this limitation is the presence of ambiguous samples along decision boundaries, which are inherently more prone to misclassification. To address this, we propose the Preferential Attached k-Nearest Neighbors Graph (paNNG), which incorporates a distribution-aware adaptive k into graph construction. By incorporating distribution information as a cohesive entity, paNNG can significantly improve performance on ambiguous samples by "pulling" them towards their original classes and hence enhance overall generalization capability. In rigorous evaluations on diverse datasets, paNNG outperforms state-of-the-art algorithms, showcasing its adaptability and efficacy across various real-world scenarios.
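
To make the contrast between a uniform k and a per-sample k concrete, the following is a minimal sketch of kNN graph construction that accepts either form. It is not paNNG itself: the abstract does not specify how the adaptive k values are chosen or how preferential attachment is applied, so the per-sample values are simply passed in by the caller. The function name `knn_graph`, its parameters, and the use of Euclidean distance are illustrative assumptions.

```python
import numpy as np


def knn_graph(X, k):
    """Build directed kNN adjacency lists (hypothetical helper, not paNNG).

    X : (n, d) array-like of samples.
    k : a single int (uniform k) or a length-n sequence of ints
        (one k per sample). How an adaptive method would choose
        these values is NOT specified here; they are inputs.
    """
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    ks = np.broadcast_to(np.asarray(k), (n,))

    # Pairwise squared Euclidean distances (assumed metric).
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    np.fill_diagonal(d2, np.inf)  # a sample is never its own neighbor

    # Sort each row by distance and keep the first k_i indices.
    order = np.argsort(d2, axis=1)
    return [order[i, : ks[i]].tolist() for i in range(n)]


# Four 1-D points; the last one is isolated from the rest.
X = [[0.0], [1.0], [3.0], [10.0]]
uniform = knn_graph(X, 1)               # same k for every sample
per_sample = knn_graph(X, [1, 2, 1, 3])  # a different k per sample
```

With a uniform k every sample gets the same number of edges regardless of where it sits in the distribution; the per-sample form lets a construction rule assign more or fewer neighbors to individual samples, which is the degree of freedom an adaptive-k method relies on.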

01/29/2021

A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search

Approximate nearest neighbor search (ANNS) constitutes an important oper...
05/26/2023

GVdoc: Graph-based Visual Document Classification

The robustness of a model for real-world deployment is decided by how we...
06/03/2016

A Graph-Based Semi-Supervised k Nearest-Neighbor Method for Nonlinear Manifold Distributed Data Classification

k Nearest Neighbors (kNN) is one of the most widely used supervised lear...
11/30/2017

Graph Centrality Measures for Boosting Popularity-Based Entity Linking

Many Entity Linking systems use collective graph-based methods to disamb...
08/25/2021

NGC: A Unified Framework for Learning with Open-World Noisy Data

The existence of noisy data is prevalent in both the training and testin...
05/30/2023

GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks

Label errors have been found to be prevalent in popular text, vision, an...
04/22/2022

Metric Learning and Adaptive Boundary for Out-of-Domain Detection

Conversational agents are usually designed for closed-world environments...
