Dynamic Self-Attention : Computing Attention over Words Dynamically for Sentence Embedding
2018-08-22Code Available1· sign in to hype
Deunsol Yoon, Dongbok Lee, SangKeun Lee
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/dsindex/iclassifierpytorch★ 44
Abstract
In this paper, we propose Dynamic Self-Attention (DSA), a new self-attention mechanism for sentence embedding. We design DSA by modifying dynamic routing in capsule network (Sabouretal.,2017) for natural language processing. DSA attends to informative words with a dynamic weight vector. We achieve new state-of-the-art results among sentence encoding methods in Stanford Natural Language Inference (SNLI) dataset with the least number of parameters, while showing comparative results in Stanford Sentiment Treebank (SST) dataset.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| SNLI | 2400D Multiple-Dynamic Self-Attention Model | % Test Accuracy | 87.4 | — | Unverified |
| SNLI | 600D Dynamic Self-Attention Model | % Test Accuracy | 86.8 | — | Unverified |