
Visual Entailment

Visual Entailment (VE) is a task built on image-sentence pairs in which the premise is an image rather than a natural language sentence, as it would be in traditional Textual Entailment tasks. The goal is to predict whether the image semantically entails the text.
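To make the task format concrete, here is a minimal sketch of how a VE example and an accuracy score might be represented. The names `Label`, `VEExample`, and `accuracy` are hypothetical and used only for illustration. The three-way label set (entailment, neutral, contradiction) is an assumption borrowed from the SNLI-VE convention; the description above only states that the model decides whether the image entails the text.

```python
from dataclasses import dataclass
from enum import Enum


class Label(Enum):
    # Assumed three-way label set (SNLI-VE style); not stated in the task description.
    ENTAILMENT = "entailment"
    NEUTRAL = "neutral"
    CONTRADICTION = "contradiction"


@dataclass(frozen=True)
class VEExample:
    image_path: str  # the premise: an image (path is a hypothetical placeholder)
    hypothesis: str  # the text whose truth is judged against the image
    label: Label     # gold relation between image and text


def accuracy(predictions, examples):
    """Fraction of predicted labels that match the gold labels."""
    if len(predictions) != len(examples):
        raise ValueError("predictions and examples must have the same length")
    if not examples:
        return 0.0
    correct = sum(pred == ex.label for pred, ex in zip(predictions, examples))
    return correct / len(examples)
```

A model for this task would map each `(image, hypothesis)` pair to one `Label`; `accuracy` then compares those predictions with the gold labels.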

Papers


Showing 51–56 of 56 papers

Title	Date	Tasks	Status
Chunk-aware Alignment and Lexical Constraint for Visual Entailment with Natural Language Explanations	Jul 23, 2022	Decision Making, Explanation Generation	Code Available
Prompt Tuning for Generative Multimodal Pretrained Models	Aug 4, 2022	Image Captioning, Visual Entailment	Code Available
Visual Entailment: A Novel Task for Fine-Grained Image Understanding	Jan 20, 2019	Natural Language Inference, Question Answering	Code Available
VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing	Mar 5, 2024	Multimodal Reasoning, Sentence	Code Available
p-Laplacian Adaptation for Generative Pre-trained Vision-Language Models	Dec 17, 2023	Image Captioning, Question Answering	Code Available
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework	Feb 7, 2022	Image Captioning, Image Classification	Code Available



No leaderboard results yet.