Macau: Scalable Bayesian Multi-relational Factorization with Side Information using MCMC

09/15/2015
by   Jaak Simm, et al.
0

We propose Macau, a powerful and flexible Bayesian factorization method for heterogeneous data. Our model can factorize any set of entities and relations that can be represented by a relational model, including tensors and also multiple relations for each entity. Macau can also incorporate side information, specifically entity and relation features, which are crucial for predicting sparsely observed relations. Macau scales to millions of entity instances, hundred millions of observations, and sparse entity features with millions of dimensions. To achieve the scale up, we specially designed sampling procedure for entity and relation features that relies primarily on noise injection in linear regressions. We show performance and advanced features of Macau in a set of experiments, including challenging drug-protein activity prediction task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/01/2015

Highly Scalable Tensor Factorization for Prediction of Drug-Protein Interaction Type

The understanding of the type of inhibitory interaction plays an importa...
research
08/28/2020

HittER: Hierarchical Transformers for Knowledge Graph Embeddings

This paper examines the challenging problem of learning representations ...
research
06/18/2021

PRGC: Potential Relation and Global Correspondence Based Joint Relational Triple Extraction

Joint extraction of entities and relations from unstructured texts is a ...
research
06/23/2023

Mutually Guided Few-shot Learning for Relational Triple Extraction

Knowledge graphs (KGs), containing many entity-relation-entity triples, ...
research
07/29/2018

DataJoint: A Simpler Relational Data Model

The relational data model offers unrivaled rigor and precision in defini...
research
02/04/2019

Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers

Most approaches to extraction multiple relations from a paragraph requir...
research
11/05/2019

OMXWare, A Cloud-Based Platform for Studying Microbial Life at Scale

The rapid growth in biological sequence data is revolutionizing our unde...

Please sign up or login with your details

Forgot password? Click here to reset