Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task

2020-05-01 · LREC 2020

Md Tahmid Rahman Laskar, Jimmy Xiangji Huang, Enamul Hoque

Abstract

Word embeddings that consider context have attracted great attention for various natural language processing tasks in recent years. In this paper, we utilize contextualized word embeddings with the transformer encoder for sentence similarity modeling in the answer selection task. We present two different approaches (feature-based and fine-tuning-based) for answer selection. In the feature-based approach, we utilize two types of contextualized embeddings, namely Embeddings from Language Models (ELMo) and Bidirectional Encoder Representations from Transformers (BERT), and integrate each of them with the transformer encoder. We find that integrating these contextual embeddings with the transformer encoder is effective in improving the performance of sentence similarity modeling. In the second approach, we fine-tune two pre-trained transformer encoder models for the answer selection task. Based on our experiments on six datasets, we find that the fine-tuning approach outperforms the feature-based approach on all of them. Among our fine-tuned models, the Robustly Optimized BERT Pretraining Approach (RoBERTa) model achieves new state-of-the-art performance on five datasets.
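As a rough illustration of the fine-tuning approach described above, the sketch below frames answer selection as relevance scoring of a (question, candidate answer) pair with RoBERTa. This is not the authors' released code: the use of the Hugging Face `transformers` library, the `roberta-base` checkpoint, the binary-relevance framing, and the example pair are all assumptions made for illustration.

```python
# Hypothetical sketch (not the authors' implementation): scoring a
# question/candidate-answer pair with RoBERTa for answer selection.
import torch
from transformers import RobertaTokenizer, RobertaForSequenceClassification

tokenizer = RobertaTokenizer.from_pretrained("roberta-base")
# Two labels: candidate is / is not a correct answer to the question.
model = RobertaForSequenceClassification.from_pretrained(
    "roberta-base", num_labels=2
)
model.eval()

question = "What year was the Eiffel Tower completed?"   # illustrative example
candidate = "The Eiffel Tower was completed in 1889."    # illustrative example

# Encode the pair as a single sequence; the tokenizer inserts RoBERTa's
# separator tokens between the two segments.
inputs = tokenizer(question, candidate, return_tensors="pt", truncation=True)

with torch.no_grad():
    logits = model(**inputs).logits

# Probability that the candidate answers the question.
score = torch.softmax(logits, dim=-1)[0, 1].item()
print(f"relevance score: {score:.3f}")
```

Note that the classification head here is randomly initialized, so the model would first have to be fine-tuned on labeled question-answer pairs before the scores are meaningful; at evaluation time, candidates are ranked per question by this score, as answer selection benchmarks are typically evaluated with ranking metrics such as MAP and MRR.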
