SOTAVerified

Visual Entailment

Visual Entailment (VE) is a task consisting of image-sentence pairs in which the premise is an image rather than a natural-language sentence, as in traditional Textual Entailment tasks. The goal is to predict whether the image semantically entails the text.
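The pairing of an image premise with a text hypothesis can be sketched with a minimal data structure and a trivial baseline. This is a hypothetical illustration, not code from any listed paper; the three-way label set follows the SNLI-VE convention (entailment / neutral / contradiction), which is an assumption here since the description above phrases the task as binary.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Literal

# Assumed three-way label set (SNLI-VE-style); the task can also be framed as binary.
Label = Literal["entailment", "neutral", "contradiction"]

@dataclass
class VEExample:
    image_path: str   # the premise is an image, not a sentence
    hypothesis: str   # natural-language hypothesis to check against the image
    label: Label

def majority_baseline(examples: List[VEExample]) -> Label:
    """Trivial baseline: always predict the most frequent training label."""
    return Counter(e.label for e in examples).most_common(1)[0][0]

# Toy data with hypothetical file names, for illustration only.
train = [
    VEExample("img1.jpg", "A dog runs on grass.", "entailment"),
    VEExample("img2.jpg", "The person is indoors.", "contradiction"),
    VEExample("img3.jpg", "A child plays.", "entailment"),
]
print(majority_baseline(train))  # prints the majority label of the toy set
```

A real VE system would replace the baseline with a vision-language model that encodes the image and hypothesis jointly before classifying.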

Papers

Showing 21–30 of 56 papers

Title | Status | Hype
NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks | Code | 1
Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Code | 1
UNITER: UNiversal Image-TExt Representation Learning | Code | 1
Understanding Figurative Meaning through Explainable Visual Entailment | Code | 1
Visual Spatial Reasoning | Code | 1
VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing | Code | 0
Prompt Tuning for Generative Multimodal Pretrained Models | Code | 0
Visual Entailment: A Novel Task for Fine-Grained Image Understanding | Code | 0
Visual Entailment Task for Visually-Grounded Language Learning | Code | 0
Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages | Code | 0
Page 3 of 6

No leaderboard results yet.