Continual Prompt Tuning for Dialog State Tracking

2022-03-13ACL 2022Code Available1· sign in to hype

Qi Zhu, Bing Li, Fei Mi, Xiaoyan Zhu, Minlie Huang

Code Available — Be the first to reproduce this paper.

Code

github.com/thu-coai/cpt4dst
OfficialIn paperjax★ 27

Abstract

A desirable dialog system should be able to continually learn new skills without forgetting old ones, and thereby adapt to new domains or tasks in its life cycle. However, continually training a model often leads to a well-known catastrophic forgetting issue. In this paper, we present Continual Prompt Tuning, a parameter-efficient framework that not only avoids forgetting but also enables knowledge transfer between tasks. To avoid forgetting, we only learn and store a few prompt tokens' embeddings for each task while freezing the backbone pre-trained model. To achieve bi-directional knowledge transfer among tasks, we propose several techniques (continual prompt initialization, query fusion, and memory replay) to transfer knowledge from preceding tasks and a memory-guided technique to transfer knowledge from subsequent tasks. Extensive experiments demonstrate the effectiveness and efficiency of our proposed method on continual learning for dialog state tracking, compared with state-of-the-art baselines.

Tasks

Continual Learning dialog state tracking Transfer Learning

Continual Prompt Tuning for Dialog State Tracking

Code

Abstract

Tasks

Reproductions