An Effective Domain Adaptive Post-Training Method for BERT in Response Selection

2019-08-13

Taesun Whang, Dongyub Lee, Chanhee Lee, Kisu Yang, Dongsuk Oh, Heuiseok Lim

Code

github.com/taesunwhang/BERT-ResSel
PyTorch

Abstract

We focus on multi-turn response selection in a retrieval-based dialog system. In this paper, we apply the powerful pre-trained language model Bidirectional Encoder Representations from Transformers (BERT) to a multi-turn dialog system and propose a highly effective post-training method on a domain-specific corpus. Although BERT is easily adapted to various NLP tasks and outperforms the previous baselines on each of them, it still has limitations when the task corpus is too focused on a certain domain. Post-training on a domain-specific corpus (e.g., Ubuntu Corpus) helps the model learn contextualized representations and words that do not appear in a general corpus (e.g., English Wikipedia). Experimental results show that our approach achieves new state-of-the-art results on two response selection benchmarks (i.e., Ubuntu Corpus V1 and Advising Corpus), improving R@1 by 5.9% and 6%, respectively.

Tasks

Conversational Response Selection, Language Modeling, Retrieval

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Douban	BERT	MAP	0.59	—	Unverified
RRS	BERT	MAP	0.63	—	Unverified
RRS Ranking Test	BERT	NDCG@3	0.63	—	Unverified
Ubuntu Dialogue (v1, Ranking)	BERT-VFT	R10@1	0.86	—	Unverified
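
The R10@1 and R@1 figures above are recall-at-k scores: for each dialog context the model ranks a set of candidate responses, and the metric measures how often a correct response lands in the top k. The following is a minimal sketch of that computation, assuming each test example is given as a list of model scores with matching binary labels. The function names are illustrative and are not taken from the authors' repository.

```python
from typing import Sequence, Tuple


def recall_at_k(scores: Sequence[float], labels: Sequence[int], k: int = 1) -> float:
    """Fraction of the correct candidates that appear among the k top-scored ones."""
    total_positives = sum(labels)
    if total_positives == 0:
        raise ValueError("each example needs at least one correct candidate")
    # Candidate indices ordered from highest to lowest score.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = sum(labels[i] for i in ranked[:k])
    return hits / total_positives


def mean_recall_at_k(
    examples: Sequence[Tuple[Sequence[float], Sequence[int]]], k: int = 1
) -> float:
    """Average recall@k over (scores, labels) pairs, one pair per dialog context."""
    return sum(recall_at_k(s, l, k) for s, l in examples) / len(examples)


# Two toy contexts with three candidates each (hypothetical scores).
examples = [
    ([0.9, 0.1, 0.3], [1, 0, 0]),  # correct response ranked first
    ([0.2, 0.8, 0.5], [1, 0, 0]),  # correct response ranked last
]
print(mean_recall_at_k(examples, k=1))
```

With 10 candidates per context and exactly one correct response, mean_recall_at_k(examples, k=1) corresponds to what the table labels R10@1.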
