Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

2017-05-05 · EMNLP 2017 · Code Available

Alexis Conneau, Douwe Kiela, Holger Schwenk, Loic Barrault, Antoine Bordes


Code

github.com/cdpierse/transformers-interpret (pytorch, ★ 1,413)
github.com/boknilev/nmt-repr-analysis (pytorch, ★ 38)
github.com/menajosep/AleatoricSent (tf, ★ 2)
github.com/AmanDaVinci/Universal-Sentence-Representations (pytorch, ★ 0)
github.com/avinassh/kylo (none, ★ 0)
github.com/MirkoLenz/ReCAP-Argument-Graph-Retrieval (tf, ★ 0)
github.com/galkesten/Domestic-Violence-Classifier (pytorch, ★ 0)
github.com/duynguyen158/wann-nlp (pytorch, ★ 0)
github.com/rockandroll123/natural-language-inference (none, ★ 0)
github.com/sidak/SentEval (pytorch, ★ 0)

Abstract

Many modern NLP systems rely on word embeddings, previously trained in an unsupervised manner on large corpora, as base features. Efforts to obtain embeddings for larger chunks of text, such as sentences, have however not been so successful. Several attempts at learning unsupervised representations of sentences have not reached satisfactory enough performance to be widely adopted. In this paper, we show how universal sentence representations trained using the supervised data of the Stanford Natural Language Inference datasets can consistently outperform unsupervised methods like SkipThought vectors on a wide range of transfer tasks. Much like how computer vision uses ImageNet to obtain features, which can then be transferred to other tasks, our work tends to indicate the suitability of natural language inference for transfer learning to other NLP tasks. Our encoder is publicly available.

Tasks

Cross-Lingual Natural Language Inference · Natural Language Inference · Semantic Textual Similarity · Sentence · Transfer Learning · Word Embeddings

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
SNLI	4096D BiLSTM with max-pooling	% Test Accuracy	84.5	—	Unverified
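
The model name in the table refers to a bidirectional LSTM whose per-token hidden states are reduced to a single fixed-size sentence vector by max-pooling. The following is a minimal sketch of the pooling step only, assuming the hidden states have already been computed. The function name `max_pool_over_time` and the toy 4-dimensional vectors are illustrative and are not taken from the paper's released code.

```python
from typing import List


def max_pool_over_time(hidden_states: List[List[float]]) -> List[float]:
    """Element-wise maximum over the time (token) axis.

    hidden_states holds one vector per token; every vector must have the
    same length. The result has that same length, regardless of how many
    tokens the sentence contains.
    """
    if not hidden_states:
        raise ValueError("need at least one time step")
    dim = len(hidden_states[0])
    if any(len(h) != dim for h in hidden_states):
        raise ValueError("all time steps must share one dimensionality")
    return [max(h[i] for h in hidden_states) for i in range(dim)]


# Toy input: 3 tokens with 4-dimensional states (hypothetical values;
# the listed model would produce 4096-dimensional states instead).
states = [
    [0.1, -0.5, 0.3, 0.0],
    [0.7, -0.2, -0.1, 0.4],
    [-0.3, 0.9, 0.2, 0.1],
]
sentence_vector = max_pool_over_time(states)
print(sentence_vector)  # [0.7, 0.9, 0.3, 0.4]
```

Because the maximum is taken per dimension, sentences of any length map to a vector of the same size, which is what lets the representation be reused as a fixed-length feature in other tasks.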
