SOTAVerified

An Empirical Comparison of Generative Approaches for Product Attribute-Value Identification

2024-07-01Code Available0· sign in to hype

Kassem Sabeh, Robert Litschko, Mouna Kacimi, Barbara Plank, Johann Gamper

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Product attributes are crucial for e-commerce platforms, supporting applications like search, recommendation, and question answering. The task of Product Attribute and Value Identification (PAVI) involves identifying both attributes and their values from product information. In this paper, we formulate PAVI as a generation task and provide, to the best of our knowledge, the most comprehensive evaluation of PAVI so far. We compare three different attribute-value generation (AVG) strategies based on fine-tuning encoder-decoder models on three datasets. Experiments show that end-to-end AVG approach, which is computationally efficient, outperforms other strategies. However, there are differences depending on model sizes and the underlying language model. The code to reproduce all experiments is available at: https://github.com/kassemsabeh/pavi-avg

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
AE-110kT5 Large - End2EndF1-score84.29Unverified
MAVET5 Large - End2EndF1-score95.19Unverified
OA-Mine - annotationsT5 Large - End2EndF1-score86.28Unverified

Reproductions