Product/Brand extraction from WikiPedia

12/12/2012
by   K. Massoudi, et al.
0

In this paper we describe the task of extracting product and brand pages from wikipedia. We present an experimental environment and setup built on top of a dataset of wikipedia pages we collected. We introduce a method for recognition of product pages modelled as a boolean probabilistic classification task. We show that this approach can lead to promising results and we discuss alternative approaches we considered.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/19/2011

Edit wars in Wikipedia

We present a new, efficient method for automatically detecting severe co...
research
07/16/2020

Wikipedia's Network Bias on Controversial Topics

The most important feature of Wikipedia is the presence of hyperlinks in...
research
09/23/2017

Language Independent Acquisition of Abbreviations

This paper addresses automatic extraction of abbreviations (encompassing...
research
12/16/2022

How to disagree well: Investigating the dispute tactics used on Wikipedia

Disagreements are frequently studied from the perspective of either dete...
research
06/21/2022

Transformer-Based Multi-modal Proposal and Re-Rank for Wikipedia Image-Caption Matching

With the increased accessibility of web and online encyclopedias, the am...
research
10/25/2022

Wikinformetrics: Construction and description of an open Wikipedia knowledge graph dataset for informetric purposes

Wikipedia is one of the most visited websites in the world and is also a...

Please sign up or login with your details

Forgot password? Click here to reset