SOTAVerified

Explanation Generation

Papers

Showing 1120 of 235 papers

TitleStatusHype
Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?Code1
End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and ModelsCode1
Harnessing the Power of Multi-Task Pretraining for Ground-Truth Level Natural Language ExplanationsCode1
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-CheckingCode1
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question AnsweringCode1
EX-FEVER: A Dataset for Multi-hop Explainable Fact VerificationCode1
Retrieval augmentation of large language models for lay language generationCode1
A Survey on Interpretable Cross-modal ReasoningCode1
CodeExp: Explanatory Code Document GenerationCode1
Explainable Legal Case Matching via Inverse Optimal Transport-based Rationale ExtractionCode1
Show:102550
← PrevPage 2 of 24Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VLIS (Lynx)Accuracy80Unverified
2VLIS (LLaVA)Accuracy73Unverified
3Ground-truth Caption -> GPT3 (Oracle)Human (%)68Unverified
4Predicted Caption -> GPT3Human (%)33Unverified
5BLIP2 FlanT5-XXL (Fine-tuned)Human (%)27Unverified
6BLIP2 FlanT5-XL (Fine-tuned)Human (%)15Unverified
7BLIP2 FlanT5-XXL (Zero-shot)Human (%)0Unverified
#ModelMetricClaimedVerifiedStatus
1PJ-XB487.4Unverified
2FMB478.8Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-XHuman Explanation Rating85.7Unverified
2OFA-X-MTHuman Explanation Rating80.4Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-X-MTHuman Explanation Rating77.3Unverified
2OFA-XHuman Explanation Rating68.9Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-XHuman Explanation Rating89.5Unverified
2OFA-X-MTHuman Explanation Rating87.8Unverified