Dynamic Database Embeddings with FoRWaRD

03/11/2021
by   Jan Toenshoff, et al.
0

We study the problem of computing an embedding of the tuples of a relational database in a manner that is extensible to dynamic changes of the database. Importantly, the embedding of existing tuples should not change due to the embedding of newly inserted tuples (as database applications might rely on existing embeddings), while the embedding of all tuples, old and new, should retain high quality. This task is challenging since state-of-the-art embedding techniques for structured data, such as (adaptations of) embeddings on graphs, have inherent inter-dependencies among the embeddings of different entities. We present the FoRWaRD algorithm (Foreign Key Random Walk Embeddings for Relational Databases) that draws from embedding techniques for general graphs and knowledge graphs, and is inherently utilizing the schema and its key and foreign-key constraints. We compare FoRWaRD to an alternative approach that we devise by adapting node embeddings for graphs (Node2Vec) to dynamic databases. We show that FoRWaRD is comparable and sometimes superior to state-of-the-art embeddings in the static (traditional) setting, using a collection of downstream tasks of column prediction over geographical and biological domains. More importantly, in the dynamic setting FoRWaRD outperforms the alternatives consistently and often considerably, and features only a mild reduction of quality even when the database consists of mostly newly inserted tuples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/22/2021

Updating Embeddings for Dynamic Knowledge Graphs

Data in Knowledge Graphs often represents part of the current state of t...
research
10/24/2021

A Broader Picture of Random-walk Based Graph Embedding

Graph embedding based on random-walks supports effective solutions for m...
research
05/13/2020

On Embeddings in Relational Databases

We address the problem of learning a distributed representation of entit...
research
08/27/2018

One-Shot Relational Learning for Knowledge Graphs

Knowledge graphs (KGs) are the key components of various natural languag...
research
11/28/2019

RETRO: Relation Retrofitting For In-Database Machine Learning on Textual Data

There are massive amounts of textual data residing in databases, valuabl...
research
09/19/2022

Walk-and-Relate: A Random-Walk-based Algorithm for Representation Learning on Sparse Knowledge Graphs

Knowledge graph (KG) embedding techniques use structured relationships b...
research
03/12/2018

Concept2vec: Metrics for Evaluating Quality of Embeddings for Ontological Concepts

Although there is an emerging trend towards generating embeddings for pr...

Please sign up or login with your details

Forgot password? Click here to reset