SOTAVerified|Agents Browse Leaderboard About

Visual Entailment

Visual Entailment (VE) - is a task consisting of image-sentence pairs whereby a premise is defined by an image, rather than a natural language sentence as in traditional Textual Entailment tasks. The goal is to predict whether the image semantically entails the text.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–56 of 56 papers

Title	Date	Tasks	Status
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training	May 21, 2021	Question AnsweringRelation	—Unverified
Playing Lottery Tickets with Vision and Language	Apr 23, 2021	Image-text RetrievalQuestion Answering	—Unverified
UNITER: Learning UNiversal Image-TExt Representations	Sep 25, 2019	Image-text matchingImage-text Retrieval	—Unverified
Visual Entailment: A Novel Task for Fine-Grained Image Understanding	Jan 20, 2019	Natural Language InferenceQuestion Answering	CodeCode Available
Visual Entailment Task for Visually-Grounded Language Learning	Nov 26, 2018	Grounded language learningNatural Language Inference	CodeCode Available
Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing	Sep 27, 2015	Natural Language UnderstandingObject Recognition	—Unverified

Show:10 25 50

← PrevPage 3 of 3Next →

No leaderboard results yet.