Don't Classify, Translate: Multi-Level E-Commerce Product Categorization Via Machine Translation

12/14/2018
by Maggie Yundi Li, et al.

E-commerce platforms categorize their products into a multi-level taxonomy tree with thousands of leaf categories. Conventional methods for product categorization are typically based on machine learning classification algorithms. These algorithms take product information as input (e.g., titles and descriptions) to classify a product into a leaf category. In this paper, we propose a new paradigm based on machine translation. In our approach, we translate a product's natural language description into a sequence of tokens representing a root-to-leaf path in a product taxonomy. In our experiments on two large real-world datasets, we show that our approach achieves better predictive accuracy than a state-of-the-art classification system for product categorization. In addition, we demonstrate that our machine translation models can propose meaningful new paths between previously unconnected nodes in a taxonomy tree, thereby transforming the taxonomy into a directed acyclic graph (DAG). We discuss how the resultant taxonomy DAG promotes user-friendly navigation, and how it is more adaptable to new products.
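
To make the translation framing concrete, here is a minimal sketch of how a labeled product could be turned into a source/target sequence pair, and how a predicted path could be checked for edges that the original taxonomy tree lacks. The category names, the whitespace tokenizer, and the helper functions (`to_translation_pair`, `tree_edges`, `new_edges`) are illustrative assumptions, not taken from the paper, and no translation model is trained here.

```python
def to_translation_pair(title, path):
    """Build a (source, target) pair for a sequence-to-sequence model.

    Source: tokens of the product title (naive whitespace split, an assumption).
    Target: one token per taxonomy node, ordered from root to leaf.
    """
    source = title.lower().split()
    target = list(path)
    return source, target


def tree_edges(paths):
    """Collect the parent -> child edges implied by known root-to-leaf paths."""
    edges = set()
    for path in paths:
        edges.update(zip(path, path[1:]))
    return edges


def new_edges(predicted_path, known_edges):
    """Return edges in a predicted path that are absent from the known taxonomy."""
    return [e for e in zip(predicted_path, predicted_path[1:]) if e not in known_edges]


# Hypothetical two-path taxonomy tree.
taxonomy_paths = [
    ("Electronics", "Audio", "Headphones"),
    ("Sports", "Fitness", "Activity Trackers"),
]
known = tree_edges(taxonomy_paths)

# One training example framed as translation.
src, tgt = to_translation_pair("Wireless Noise Cancelling Headphones", taxonomy_paths[0])

# A hypothetical model output that links two previously unconnected nodes.
predicted = ("Electronics", "Fitness", "Activity Trackers")
proposed = new_edges(predicted, known)
```

In this toy case `proposed` contains the single edge from "Electronics" to "Fitness". Accepting it would give "Fitness" two parents, which is the sense in which the tree becomes a DAG.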

research
04/15/2020

TXtract: Taxonomy-Aware Knowledge Extraction for Thousands of Product Categories

Extracting structured knowledge from product profiles is crucial for var...
research
06/20/2016

Product Classification in E-Commerce using Distributional Semantics

Product classification is the task of automatically predicting a taxonom...
research
06/09/2016

e-Commerce product classification: our participation at cDiscount 2015 challenge

This report describes our participation in the cDiscount 2015 challenge ...
research
05/09/2023

Consistent Text Categorization using Data Augmentation in e-Commerce

The categorization of massive e-Commerce data is a crucial, well-studied...
research
07/29/2023

Multi-output Headed Ensembles for Product Item Classification

In this paper, we revisit the problem of product item classification for...
research
11/03/2016

Probabilistic Modeling of Progressive Filtering

Progressive filtering is a simple way to perform hierarchical classifica...
research
09/02/2021

Text Classification for Predicting Multi-level Product Categories

In an online shopping platform, a detailed classification of the product...
