SOTAVerified

Explanation Generation

Papers

Showing 151175 of 235 papers

TitleStatusHype
TE2Rules: Explaining Tree Ensembles using RulesCode1
End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and ModelsCode1
M6-Rec: Generative Pretrained Language Models are Open-Ended Recommender Systems0
Textual Explanations and Critiques in Recommendation Systems0
Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasksCode1
Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation0
Explainable Assessment of Healthcare Articles with QA0
Improving Personalized Explanation Generation through Visualization0
Calibrating Trust of Multi-Hop Question Answering Systems with Decompositional Probes0
CLEVR-X: A Visual Reasoning Dataset for Natural Language ExplanationsCode1
E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning0
REX: Reasoning-aware and Grounded ExplanationCode1
Counterfactual Explanations for Predictive Business Process Monitoring0
Best of Both Worlds: A Hybrid Approach for Multi-Hop Explanation with Declarative Facts0
Generating Fluent Fact Checking Explanations with Unsupervised Post-Editing0
Generating High-Quality Explanations for Navigation in Partially-Revealed EnvironmentsCode0
E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning0
A Deep Generative XAI Framework for Natural Language Inference Explanations Generation0
Counterfactual Explanations for Models of CodeCode0
Parameterized Explanations for Investor / Company Matching0
Hierarchical Aspect-guided Explanation Generation for Explainable Recommendation0
Explainable Assessment of Healthcare Articles with QA0
A Framework for Rationale Extraction for Deep QA models0
Truth Table Deep Convolutional Neural Network, A New SAT-Encodable Architecture - Application To Complete Robustness0
Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy LogicCode0
Show:102550
← PrevPage 7 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VLIS (Lynx)Accuracy80Unverified
2VLIS (LLaVA)Accuracy73Unverified
3Ground-truth Caption -> GPT3 (Oracle)Human (%)68Unverified
4Predicted Caption -> GPT3Human (%)33Unverified
5BLIP2 FlanT5-XXL (Fine-tuned)Human (%)27Unverified
6BLIP2 FlanT5-XL (Fine-tuned)Human (%)15Unverified
7BLIP2 FlanT5-XXL (Zero-shot)Human (%)0Unverified
#ModelMetricClaimedVerifiedStatus
1PJ-XB487.4Unverified
2FMB478.8Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-XHuman Explanation Rating85.7Unverified
2OFA-X-MTHuman Explanation Rating80.4Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-X-MTHuman Explanation Rating77.3Unverified
2OFA-XHuman Explanation Rating68.9Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-XHuman Explanation Rating89.5Unverified
2OFA-X-MTHuman Explanation Rating87.8Unverified