A Multilingual FrameNet-based Grammar and Lexicon for Controlled Natural Language

11/12/2015
by   Normunds Grūzītis, et al.
0

Berkeley FrameNet is a lexico-semantic resource for English based on the theory of frame semantics. It has been exploited in a range of natural language processing applications and has inspired the development of framenets for many languages. We present a methodological approach to the extraction and generation of a computational multilingual FrameNet-based grammar and lexicon. The approach leverages FrameNet-annotated corpora to automatically extract a set of cross-lingual semantico-syntactic valence patterns. Based on data from Berkeley FrameNet and Swedish FrameNet, the proposed approach has been implemented in Grammatical Framework (GF), a categorial grammar formalism specialized for multilingual grammars. The implementation of the grammar and lexicon is supported by the design of FrameNet, providing a frame semantic abstraction layer, an interlingual semantic API (application programming interface), over the interlingual syntactic API already provided by GF Resource Grammar Library. The evaluation of the acquired grammar and lexicon shows the feasibility of the approach. Additionally, we illustrate how the FrameNet-based grammar and lexicon are exploited in two distinct multilingual controlled natural language applications. The produced resources are available under an open source license.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/02/2018

Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing

Addressing the cross-lingual variation of grammatical structures and mea...
research
01/16/2023

XNLI 2.0: Improving XNLI dataset and performance on Cross Lingual Understanding (XLU)

Natural Language Processing systems are heavily dependent on the availab...
research
04/07/2021

GrammarTagger: A Multilingual, Minimally-Supervised Grammar Profiler for Language Education

We present GrammarTagger, an open-source grammar profiler which, given a...
research
10/05/2020

The Grammar of Emergent Languages

In this paper, we consider the syntactic properties of languages emerged...
research
04/28/2018

Specifying and Verbalising Answer Set Programs in Controlled Natural Language

We show how a bi-directional grammar can be used to specify and verbalis...
research
01/08/2020

From Natural Language Instructions to Complex Processes: Issues in Chaining Trigger Action Rules

Automation services for complex business processes usually require a hig...
research
08/03/2018

Lightweight Call-Graph Construction for Multilingual Software Analysis

Analysis of multilingual codebases is a topic of increasing importance. ...

Please sign up or login with your details

Forgot password? Click here to reset