TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
Thomas Wolf, Victor Sanh, Julien Chaumond, Clement Delangue
Code
- github.com/thu-coai/CDial-GPT (PyTorch, ★ 1,938)
- github.com/ErikEkstedt/TurnGPT (PyTorch, ★ 65)
- github.com/noriyukipy/gptchat (PyTorch, ★ 0)
- github.com/the-pythoncoder/counsel-chat (PyTorch, ★ 0)
- github.com/KhueNguyen312/Persona-Chatbot (PyTorch, ★ 0)
- github.com/pranavgollamudi/Chatbot (PyTorch, ★ 0)
- github.com/dladustn95/enLanguageModel (PyTorch, ★ 0)
- github.com/BSlience/end2end-conversational-ai (PyTorch, ★ 0)
- github.com/samsonleegh/convai_smile (PyTorch, ★ 0)
- github.com/huggingface/transfer-learning-conv-ai (PyTorch, ★ 0)
Abstract
We introduce TransferTransfo, a new approach to generative data-driven dialogue systems (e.g. chatbots) that combines a transfer-learning-based training scheme with a high-capacity Transformer model. Fine-tuning is performed with a multi-task objective that combines several unsupervised prediction tasks. The resulting fine-tuned model shows strong improvements over current state-of-the-art end-to-end conversational models such as memory-augmented seq2seq and information-retrieval models. On the privately held PERSONA-CHAT dataset of the Conversational Intelligence Challenge 2, this approach obtains a new state of the art, with perplexity, Hits@1 and F1 metrics of 16.28 (45% absolute improvement), 80.7 (46% absolute improvement) and 19.5 (20% absolute improvement), respectively.
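The multi-task objective mentioned in the abstract can be sketched as a weighted sum of a language-modeling loss over the reply tokens and a next-utterance classification loss (picking the gold reply among distractors). The helper below is an illustrative, dependency-free sketch, not the authors' implementation; the loss coefficients `lm_coef` and `mc_coef` are assumed values for demonstration.

```python
import math

def softmax(xs):
    # numerically stable softmax over a list of logits
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def cross_entropy(logits, target):
    # negative log-likelihood of the target index
    return -math.log(softmax(logits)[target])

def multitask_loss(lm_logits, lm_targets, mc_logits, mc_target,
                   lm_coef=2.0, mc_coef=1.0):
    """Combined TransferTransfo-style fine-tuning loss (illustrative).

    lm_logits:  per-position next-token logits over the vocabulary
    lm_targets: gold next-token ids
    mc_logits:  one logit per candidate reply (gold + distractors)
    mc_target:  index of the gold reply
    """
    # language-modeling loss: mean next-token cross-entropy
    lm_loss = sum(cross_entropy(l, t)
                  for l, t in zip(lm_logits, lm_targets)) / len(lm_targets)
    # next-utterance classification loss
    mc_loss = cross_entropy(mc_logits, mc_target)
    return lm_coef * lm_loss + mc_coef * mc_loss
```

In practice both losses are computed from two heads on top of the same Transformer and backpropagated jointly; the coefficients balance fluency against the model's ability to rank the correct reply.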
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Persona-Chat | TransferTransfo | Avg F1 | 19.09 | — | Unverified |