DIALITE: Discover, Align and Integrate Open Data Tables

by Aamod Khatiwada, et al.

We demonstrate DIALITE, a novel table discovery pipeline that allows users to discover, integrate, and analyze open data tables. DIALITE has three main stages. First, it allows users to discover tables from open data platforms using state-of-the-art table discovery techniques. Second, it integrates the discovered tables to produce a single integrated table. Finally, it allows users to analyze the integration result by applying different downstream tasks to it. The pipeline is flexible: users can easily add and compare additional discovery and integration algorithms.
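To make the three-stage structure concrete, here is a minimal sketch of a pipeline with pluggable discovery, integration, and analysis steps. It does not use DIALITE's actual API or algorithms: every function name is hypothetical, header-overlap ranking stands in for a real table discovery technique, and a plain outer union stands in for a real integration algorithm.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]
Table = List[Row]


def columns(table: Table) -> set:
    """Return the set of column names used by any row of the table."""
    return {name for row in table for name in row}


def discover_by_header_overlap(query: Table, lake: Dict[str, Table], k: int = 2) -> List[str]:
    """Hypothetical discovery step: rank lake tables by Jaccard similarity
    between their column names and the query table's column names."""
    q = columns(query)

    def score(name: str) -> float:
        c = columns(lake[name])
        union = q | c
        return len(q & c) / len(union) if union else 0.0

    # Highest score first; ties are broken by table name for determinism.
    ranked = sorted(lake, key=lambda n: (-score(n), n))
    return [n for n in ranked[:k] if score(n) > 0]


def integrate_by_outer_union(tables: List[Table]) -> Table:
    """Hypothetical integration step: stack all rows over the union of
    columns, filling values a row does not have with None."""
    all_cols = sorted(set().union(*(columns(t) for t in tables)))
    return [{c: row.get(c) for c in all_cols} for t in tables for row in t]


def run_pipeline(
    query: Table,
    lake: Dict[str, Table],
    discover: Callable[[Table, Dict[str, Table]], List[str]] = discover_by_header_overlap,
    integrate: Callable[[List[Table]], Table] = integrate_by_outer_union,
    analyze: Callable[[Table], object] = len,
) -> object:
    """Discover related tables, integrate them with the query, then analyze."""
    names = discover(query, lake)
    integrated = integrate([query] + [lake[n] for n in names])
    return analyze(integrated)


# Toy data, invented for illustration only.
query = [{"city": "Boston", "population": 650000}]
lake = {
    "parks": [{"city": "Boston", "parks": 300}],
    "weather": [{"station": "KBOS", "temp_c": 12}],
    "schools": [{"city": "Toronto", "schools": 500, "population": 2800000}],
}

found = discover_by_header_overlap(query, lake)
integrated = integrate_by_outer_union([query] + [lake[n] for n in found])
row_count = run_pipeline(query, lake)
```

Because each stage is passed in as a plain function, swapping in a different discovery or integration algorithm and comparing the results only requires calling `run_pipeline` with another callable.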




WarpGate: A Semantic Join Discovery System for Cloud Data Warehouses

Data discovery is a major challenge in enterprise data analysis: users o...

Optimizing Organizations for Navigating Data Lakes

Navigation is known to be an effective complement to search. In addition...

MATE: Multi-Attribute Table Extraction

A core operation in data discovery is to find joinable tables for a give...

Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search

Recent work has made significant progress in helping users to automate s...

TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets

Tables have been an ever-existing structure to store data. There exist n...

Detecting Table Region in PDF Documents Using Distant Supervision

Superior to state-of-the-art approaches which compete in table recogniti...

DisCoveR: Accurate Efficient Discovery of Declarative Process Models

Declarative process modeling formalisms - which capture high-level proce...
