SOTAVerified

Explanatory Visual Question Answering

Explanatory Visual Question Answering (EVQA) requires answering visual questions and generating multimodal explanations for the reasoning processes.

Papers

Showing 1–5 of 5 papers

| Title | Status | Hype |
|---|---|---|
| LININ: Logic Integrated Neural Inference Network for Explanatory Visual Question Answering | Code | 0 |
| Variational Causal Inference Network for Explanatory Visual Question Answering | Code | 1 |
| REX: Reasoning-aware and Grounded Explanation | Code | 1 |
| Faithful Multimodal Explanation for Visual Question Answering | Code | 1 |
| VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions | | 0 |

Benchmark Results

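| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VCIN | BLEU-4 | 58.65 | | Unverified |
| 2 | REX-LXMERT | BLEU-4 | 54.79 | | Unverified |
| 3 | REX-VisualBert | BLEU-4 | 54.59 | | Unverified |
| 4 | VQAE | BLEU-4 | 42.56 | | Unverified |
| 5 | EXP | BLEU-4 | 42.45 | | Unverified |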