Efficient Training on Very Large Corpora via Gramian Estimation

07/18/2018
by   Walid Krichene, et al.

We study the problem of learning similarity functions over very large corpora using neural network embedding models. These models are typically trained using SGD with sampling of random observed and unobserved pairs, with a number of samples that grows quadratically with the corpus size, making it expensive to scale to very large corpora. We propose new efficient methods to train these models without having to sample unobserved pairs. Inspired by matrix factorization, our approach relies on adding a global quadratic penalty to all pairs of examples and expressing this term as the matrix-inner-product of two generalized Gramians. We show that the gradient of this term can be efficiently computed by maintaining estimates of the Gramians, and develop variance reduction schemes to improve the quality of the estimates. We conduct large-scale experiments that show a significant improvement in training time and generalization quality compared to traditional sampling methods.
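The Gramian reformulation mentioned above rests on a standard identity: for embedding matrices U (n x d) and V (m x d), the sum over all pairs of squared inner products equals the Frobenius inner product of the two d x d Gramians UᵀU and VᵀV. The NumPy sketch below illustrates this in the simplest unweighted case. It is not the authors' implementation: the function names, the uniform mini-batch sampling, and the moving-average update are all assumptions made for illustration, and the paper's generalized Gramians and variance reduction schemes may differ.

```python
import numpy as np


def pairwise_penalty(U, V):
    """Naive penalty: sum over all pairs (i, j) of <u_i, v_j>^2. Costs O(n * m * d)."""
    return float(np.sum((U @ V.T) ** 2))


def gramian_penalty(U, V):
    """Same quantity as the Frobenius inner product of the Gramians. Costs O((n + m) * d^2)."""
    G_u = U.T @ U  # d x d Gramian of the left embeddings
    G_v = V.T @ V  # d x d Gramian of the right embeddings
    return float(np.sum(G_u * G_v))


def penalty_grad_wrt_U(U, G_v):
    """Gradient of the penalty with respect to every row u_i, which is 2 * G_v @ u_i."""
    return 2.0 * U @ G_v  # G_v is symmetric, so row-wise U @ G_v matches G_v @ u_i


def update_gramian_estimate(G_est, batch, n_total, decay=0.9):
    """Hypothetical helper: moving average of rescaled mini-batch Gramians.

    Assumes `batch` holds rows sampled uniformly from a matrix with n_total rows,
    so (n_total / batch_size) * batch.T @ batch is an unbiased Gramian estimate.
    """
    batch_gram = (n_total / batch.shape[0]) * (batch.T @ batch)
    return decay * G_est + (1.0 - decay) * batch_gram


rng = np.random.default_rng(0)
U = rng.normal(size=(200, 8))
V = rng.normal(size=(300, 8))

naive = pairwise_penalty(U, V)
fast = gramian_penalty(U, V)
grad = penalty_grad_wrt_U(U, V.T @ V)
```

Here `naive` and `fast` agree up to floating-point error, but only the first ever forms the n x m matrix of pairwise scores. Given an estimate of the opposite side's Gramian, the gradient for a single embedding costs O(d^2), independent of corpus size, which is what makes maintaining Gramian estimates during training attractive.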

