Dynamic Self-Attention : Computing Attention over Words Dynamically for Sentence Embedding

2018-08-22Code Available1· sign in to hype

Deunsol Yoon, Dongbok Lee, SangKeun Lee

Code Available — Be the first to reproduce this paper.

Code

github.com/dsindex/iclassifier
pytorch★ 44

Abstract

In this paper, we propose Dynamic Self-Attention (DSA), a new self-attention mechanism for sentence embedding. We design DSA by modifying dynamic routing in capsule network (Sabouretal.,2017) for natural language processing. DSA attends to informative words with a dynamic weight vector. We achieve new state-of-the-art results among sentence encoding methods in Stanford Natural Language Inference (SNLI) dataset with the least number of parameters, while showing comparative results in Stanford Sentiment Treebank (SST) dataset.

Tasks

Natural Language Inference Sentence Sentence Embedding Sentence-Embedding

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
SNLI	2400D Multiple-Dynamic Self-Attention Model	% Test Accuracy	87.4	—	Unverified
SNLI	600D Dynamic Self-Attention Model	% Test Accuracy	86.8	—	Unverified

Dynamic Self-Attention : Computing Attention over Words Dynamically for Sentence Embedding

Code

Abstract

Tasks

Benchmark Results

Reproductions