Row Conditional-TGAN for generating synthetic relational databases

11/14/2022
by Mohamed Gueye, et al.

Generating a synthetic relational database requires more than reproducing the tabular data properties of each standalone table: the relationships between related tables must also be modeled. In this paper, we propose the Row Conditional-Tabular Generative Adversarial Network (RC-TGAN), a novel generative adversarial network (GAN) model that extends the tabular GAN to support modeling and synthesizing relational databases. The RC-TGAN models relationship information between tables by incorporating conditional data of parent rows into the design of the child table's GAN. We further extend the RC-TGAN to model the influence that grandparent table rows may have on their grandchild rows, so that this connection is not lost when the rows of the parent table fail to transfer the relationship information. Experimental results on eight real relational databases show significant improvements in the quality of the synthesized relational databases compared to the benchmark system, demonstrating the effectiveness of the RC-TGAN in preserving relationships between tables of the original database.
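
The conditioning idea described in the abstract can be illustrated with a small data-preparation sketch. This is not the authors' implementation and contains no GAN: it only shows, under assumptions, how each child row might be paired with the features of its parent row (and optionally its grandparent row) so that a conditional generator could be trained on those pairs. The toy schema, the column names, and the `conditioning_pairs` helper are all hypothetical.

```python
# Hypothetical toy schema: grandparent -> parent -> child, linked by foreign keys.
grandparents = {1: {"region": "north"}, 2: {"region": "south"}}
parents = {
    10: {"grandparent_id": 1, "segment": "retail"},
    11: {"grandparent_id": 2, "segment": "corporate"},
}
children = [
    {"parent_id": 10, "amount": 25.0},
    {"parent_id": 10, "amount": 40.0},
    {"parent_id": 11, "amount": 310.0},
]


def conditioning_pairs(children, parents, grandparents=None):
    """Pair each child row with the features of its parent row.

    If `grandparents` is given, grandparent features are appended to the
    condition as well, mirroring the grandparent extension in the abstract.
    Columns ending in "_id" are treated as keys and left out of both sides.
    """
    pairs = []
    for child in children:
        parent = parents[child["parent_id"]]
        # Condition: non-key features of the parent row.
        condition = {
            f"parent.{k}": v for k, v in parent.items() if not k.endswith("_id")
        }
        if grandparents is not None:
            grandparent = grandparents[parent["grandparent_id"]]
            condition.update(
                {
                    f"grandparent.{k}": v
                    for k, v in grandparent.items()
                    if not k.endswith("_id")
                }
            )
        # Target: non-key features of the child row to be synthesized.
        target = {k: v for k, v in child.items() if not k.endswith("_id")}
        pairs.append((condition, target))
    return pairs


for condition, target in conditioning_pairs(children, parents, grandparents):
    print(condition, "->", target)
```

In this sketch, passing `grandparents` keeps the grandparent's `region` available to the child generator even when the parent's own features carry no trace of it, which is the situation the abstract's grandparent extension is meant to address.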

research
02/04/2023

REaLTabFormer: Generating Realistic Relational and Tabular Data using Transformers

Tabular data is a common form of organizing data. Multiple models are av...
research
04/03/2019

Extracting Tables from Documents using Conditional Generative Adversarial Networks and Genetic Algorithms

Extracting information from tables in documents presents a significant c...
research
05/13/2020

On Embeddings in Relational Databases

We address the problem of learning a distributed representation of entit...
research
01/13/2020

Identifying Table Structure in Documents using Conditional Generative Adversarial Networks

In many industries, as well as in academic research, information is prim...
research
05/24/2023

Towards Foundation Models for Relational Databases [Vision Paper]

Tabular representation learning has recently gained a lot of attention. ...
research
01/22/2018

Prioritizing Technical Debt in Database Normalization Using Portfolio Theory and Data Quality Metrics

Database normalization is one of the main principles for designing relat...
research
07/01/2019

Modeling Tabular data using Conditional GAN

Modeling the probability distribution of rows in tabular data and genera...
