SOTAVerified

Leveraging Similar Users for Personalized Language Modeling with Limited Data

2022-05-01ACL 2022Unverified0· sign in to hype

Charles Welch, Chenxi Gu, Jonathan Kummerfeld, Veronica Perez-Rosas, Rada Mihalcea

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Personalized language models are designed and trained to capture language patterns specific to individual users. This makes them more accurate at predicting what a user will write. However, when a new user joins a platform and not enough text is available, it is harder to build effective personalized language models. We propose a solution for this problem, using a model trained on users that are similar to a new user. In this paper, we explore strategies for finding the similarity between new users and existing ones and methods for using the data from existing users who are a good match. We further explore the trade-off between available data for new users and how well their language can be modeled.

Tasks

Reproductions