POTATO: exPlainable infOrmation exTrAcTion framewOrk

01/31/2022
by   Ádám Kovács, et al.
0

We present POTATO, a task- and languageindependent framework for human-in-the-loop (HITL) learning of rule-based text classifiers using graph-based features. POTATO handles any type of directed graph and supports parsing text into Abstract Meaning Representations (AMR), Universal Dependencies (UD), and 4lang semantic graphs. A streamlit-based user interface allows users to build rule systems from graph patterns, provides real-time evaluation based on ground truth data, and suggests rules by ranking graph features using interpretable machine learning models. Users can also provide patterns over graphs using regular expressions, and POTATO can recommend refinements of such rules. POTATO is applied in projects across domains and languages, including classification tasks on German legal text and English social media data. All components of our system are written in Python, can be installed via pip, and are released under an MIT License on GitHub.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2022

PyTAIL: Interactive and Incremental Learning of NLP Models with Human in the Loop for Online Data

Online data streams make training machine learning models hard because o...
research
03/24/2020

Commutators for Stochastic Rewriting Systems: Theory and Implementation in Z3

In the semantics of stochastic rewriting systems (SRSs) based on rule al...
research
08/30/2018

Rule-based OWL Modeling with ROWLTab Protege Plugin

It has been argued that it is much easier to convey logical statements u...
research
05/22/2018

Rule-Based Drawing, Analysis and Generation of Graphs for Mason's Mark Design

We are developing a rule-based implementation of a tool to analyse and g...
research
06/30/2016

SnapToGrid: From Statistical to Interpretable Models for Biomedical Information Extraction

We propose an approach for biomedical information extraction that marrie...
research
06/01/2020

Efficient EUD Parsing

We present the system submission from the FASTPARSE team for the EUD Sha...
research
06/28/2015

WYSIWYE: An Algebra for Expressing Spatial and Textual Rules for Visual Information Extraction

The visual layout of a webpage can provide valuable clues for certain ty...

Please sign up or login with your details

Forgot password? Click here to reset