A Corpus Study and Annotation Schema for Named Entity Recognition and Relation Extraction of Business Products

04/07/2020
by Saskia Schön, et al.

Recognizing non-standard entity types and relations, such as B2B products, product classes and their producers, in news and forum texts is important in application areas such as supply chain monitoring and market research. However, there is a decided lack of annotated corpora and annotation guidelines in this domain. In this work, we present a corpus study, an annotation schema, and associated guidelines for the annotation of product entity and company-product relation mentions. We find that although product mentions are often realized as noun phrases, defining their exact extent is difficult due to high boundary ambiguity and the broad syntactic and semantic variety of their surface realizations. We also describe our ongoing annotation effort and present a preliminary corpus of English web and social media documents annotated according to the proposed guidelines.


