A Library Perspective on Nearly-Unsupervised Information Extraction Workflows in Digital Libraries

05/02/2022
by Hermann Kroll, et al.

Information extraction can support novel and effective access paths for digital libraries. Nevertheless, designing reliable extraction workflows can be cost-intensive in practice. On the one hand, suitable extraction methods rely on domain-specific training data. On the other hand, unsupervised and open extraction methods usually produce non-canonicalized extraction results. This paper tackles the question of how digital libraries can handle such extractions and whether their quality is sufficient in practice. We focus on unsupervised extraction workflows by analyzing them in case studies in the domains of encyclopedias (Wikipedia), pharmacy, and political sciences. We report on opportunities and limitations. Finally, we discuss best practices for unsupervised extraction workflows.

