BERT Goes Shopping: Comparing Distributional Models for Product Representations

2020-12-17ACL (ECNLP) 2021Code Available1· sign in to hype

Federico Bianchi, Bingqing Yu, Jacopo Tagliabue

Code Available — Be the first to reproduce this paper.

Code

github.com/vinid/prodb
OfficialIn papertf★ 18

Abstract

Word embeddings (e.g., word2vec) have been applied successfully to eCommerce products through~prod2vec. Inspired by the recent performance improvements on several NLP tasks brought by contextualized embeddings, we propose to transfer BERT-like architectures to eCommerce: our model -- ~Prod2BERT -- is trained to generate representations of products through masked session modeling. Through extensive experiments over multiple shops, different tasks, and a range of design choices, we systematically compare the accuracy of~Prod2BERT and~prod2vec embeddings: while~Prod2BERT is found to be superior in several scenarios, we highlight the importance of resources and hyperparameters in the best performing models. Finally, we provide guidelines to practitioners for training embeddings under a variety of computational and data constraints.

Tasks

Language Modelling Product Recommendation Word Embeddings

BERT Goes Shopping: Comparing Distributional Models for Product Representations

Code

Abstract

Tasks

Reproductions