SOTAVerified

Explanation Generation

Papers

Showing 3140 of 235 papers

TitleStatusHype
Rethinking the Evaluation for Conversational Recommendation in the Era of Large Language ModelsCode1
Retrieval augmentation of large language models for lay language generationCode1
Harnessing the Power of Multi-Task Pretraining for Ground-Truth Level Natural Language ExplanationsCode1
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-CheckingCode1
End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and ModelsCode1
CLEVR-X: A Visual Reasoning Dataset for Natural Language ExplanationsCode1
CodeExp: Explanatory Code Document GenerationCode1
A Survey on Interpretable Cross-modal ReasoningCode1
Explainable Automated Fact-Checking for Public Health ClaimsCode1
Advisable Learning for Self-Driving Vehicles by Internalizing Observation-to-Action RulesCode0
Show:102550
← PrevPage 4 of 24Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VLIS (Lynx)Accuracy80Unverified
2VLIS (LLaVA)Accuracy73Unverified
3Ground-truth Caption -> GPT3 (Oracle)Human (%)68Unverified
4Predicted Caption -> GPT3Human (%)33Unverified
5BLIP2 FlanT5-XXL (Fine-tuned)Human (%)27Unverified
6BLIP2 FlanT5-XL (Fine-tuned)Human (%)15Unverified
7BLIP2 FlanT5-XXL (Zero-shot)Human (%)0Unverified
#ModelMetricClaimedVerifiedStatus
1PJ-XB487.4Unverified
2FMB478.8Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-XHuman Explanation Rating85.7Unverified
2OFA-X-MTHuman Explanation Rating80.4Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-X-MTHuman Explanation Rating77.3Unverified
2OFA-XHuman Explanation Rating68.9Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-XHuman Explanation Rating89.5Unverified
2OFA-X-MTHuman Explanation Rating87.8Unverified