Order-Embeddings of Images and Language
2015-11-19Code Available1· sign in to hype
Ivan Vendrov, Ryan Kiros, Sanja Fidler, Raquel Urtasun
Code Available — Be the first to reproduce this paper.
ReproduceCode
Abstract
Hypernymy, textual entailment, and image captioning can be seen as special cases of a single visual-semantic hierarchy over words, sentences, and images. In this paper we advocate for explicitly modeling the partial order structure of this hierarchy. Towards this goal, we introduce a general method for learning ordered representations, and show how it can be applied to a variety of tasks involving images and language. We show that the resulting representations improve performance over current approaches for hypernym prediction and image-caption retrieval.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| SNLI | 1024D GRU encoders w/ unsupervised 'skip-thoughts' pre-training | % Test Accuracy | 81.4 | — | Unverified |