Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH

04/15/2017
by Luis Argerich, et al.

In this paper we propose generic LSH families for the angular distance based on Johnson-Lindenstrauss (J-L) projections. We show that feature hashing is a valid J-L projection and propose two new LSH families based on it. Tested on both synthetic and real datasets, the new families give very good results and a considerable performance improvement over other LSH families. Although the theoretical analysis covers only the angular distance, the families can also be used in practice for the Euclidean distance [2], and our tests on real datasets show that the proposed LSH functions work well for it.
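
The abstract does not spell out the two proposed families, so the following is only an illustrative sketch of the general idea, not the authors' construction. It assumes a feature-hashing projection (each input coordinate is added, with a random sign, to one randomly chosen output bucket) followed by sign quantization in the style of random-hyperplane LSH. The names `make_feature_hasher`, `sign_bits`, and `hamming`, the seeded lookup tables standing in for real hash functions, and all parameter values are hypothetical.

```python
import random


def make_feature_hasher(dim, n_buckets, seed=0):
    """Return project(x): a feature-hashing map from R^dim to R^n_buckets."""
    rng = random.Random(seed)
    # Hypothetical stand-ins for the two hash functions of feature hashing:
    # bucket[i] picks the output coordinate, sign[i] picks +1 or -1.
    bucket = [rng.randrange(n_buckets) for _ in range(dim)]
    sign = [rng.choice((-1.0, 1.0)) for _ in range(dim)]

    def project(x):
        y = [0.0] * n_buckets
        for i, value in enumerate(x):
            y[bucket[i]] += sign[i] * value
        return y

    return project


def sign_bits(y):
    """Quantize each projected coordinate to one bit (1 if >= 0, else 0)."""
    return tuple(1 if v >= 0 else 0 for v in y)


def hamming(a, b):
    """Count the positions where two bit tuples differ."""
    return sum(p != q for p, q in zip(a, b))


if __name__ == "__main__":
    rng = random.Random(42)
    dim, n_buckets = 256, 32
    project = make_feature_hasher(dim, n_buckets, seed=7)

    base = [rng.gauss(0, 1) for _ in range(dim)]
    near = [v + 0.1 * rng.gauss(0, 1) for v in base]  # small angle to base
    far = [rng.gauss(0, 1) for _ in range(dim)]       # unrelated direction

    code = sign_bits(project(base))
    print("bits differing for near vector:", hamming(code, sign_bits(project(near))))
    print("bits differing for far vector: ", hamming(code, sign_bits(project(far))))
```

Because the projection is linear and only the signs are kept, rescaling a vector by a positive factor leaves its bit code unchanged, which is why a scheme of this kind depends on the angle between vectors rather than on their lengths.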


