Semi-Automated Construction of Food Composition Knowledge Base

01/24/2023
by Jason Youn, et al.

A food composition knowledge base, which stores the essential phyto-, micro-, and macro-nutrients of foods, is useful for both research and industrial applications. Although many existing knowledge bases attempt to curate such information, they are often limited by time-consuming manual curation. Outside the food science domain, natural language processing methods built on pre-trained language models have recently shown promising results for extracting knowledge from unstructured text. In this work, we propose a semi-automated framework for constructing a food composition knowledge base from the scientific literature available online. To this end, we use a pre-trained BioBERT language model in an active learning setup that allows the optimal use of limited training data. Our work demonstrates how human-in-the-loop models are a step toward AI-assisted food systems that scale with ever-increasing volumes of data.
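
The abstract does not say which query strategy the active learning setup uses. As an illustration only, the sketch below assumes uncertainty sampling, one common choice: the sentences whose predicted probability is closest to 0.5 are sent to a human annotator first. The names `entropy`, `select_for_annotation`, and `toy_scores` are hypothetical, and the hard-coded scores stand in for the output of a fine-tuned classifier such as BioBERT.

```python
import math
from typing import Callable, List, Sequence


def entropy(p: float) -> float:
    """Binary entropy (in nats) of a predicted probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p))


def select_for_annotation(
    sentences: Sequence[str],
    predict_proba: Callable[[str], float],
    batch_size: int,
) -> List[str]:
    """Return the batch_size sentences the model is least certain about."""
    ranked = sorted(
        sentences,
        key=lambda s: entropy(predict_proba(s)),
        reverse=True,  # highest entropy (most uncertain) first
    )
    return ranked[:batch_size]


# Assumption: toy probabilities that a sentence states a food-chemical
# relation. A real system would obtain these from the trained model.
toy_scores = {
    "Garlic contains allicin.": 0.97,
    "Apples were stored at 4 C.": 0.05,
    "Quercetin was detected in onion extracts.": 0.55,
}

to_label = select_for_annotation(
    list(toy_scores), lambda s: toy_scores[s], batch_size=1
)
print(to_label)
```

In a full human-in-the-loop cycle, the selected sentences would be labeled by an annotator, added to the training set, and the model retrained before the next selection round.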

research
08/20/2023

FoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt

Currently, the construction of large language models in specific domains...
research
01/02/2022

Topical Classification of Food Safety Publications with a Knowledge Base

The vast body of scientific publications presents an increasing challeng...
research
06/29/2022

OASYS: Domain-Agnostic Automated System for Constructing Knowledge Base from Unstructured Text

In recent years, creating and managing knowledge bases have become cruci...
research
06/20/2023

Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications

This paper explores new frontiers in agricultural natural language proce...
research
09/26/2022

Towards Fine-Dining Recipe Generation with Generative Pre-trained Transformers

Food is essential to human survival. So much so that we have developed d...
research
06/15/2023

Leveraging Human Salience to Improve Calorie Estimation

The following paper investigates the effectiveness of incorporating huma...
research
04/19/2017

Using Contexts and Constraints for Improved Geotagging of Human Trafficking Webpages

Extracting geographical tags from webpages is a well-motivated applicati...
