SOTAVerified

Explanation Generation

Papers

Showing 125 of 235 papers

TitleStatusHype
Hierarchical Interaction Summarization and Contrastive Prompting for Explainable Recommendations0
The Future is Agentic: Definitions, Perspectives, and Open Challenges of Multi-Agent Recommender Systems0
RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-CheckingCode0
LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language InferenceCode0
Does Rationale Quality Matter? Enhancing Mental Disorder Detection via Selective Reasoning DistillationCode0
Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models0
SNAPE-PM: Building and Utilizing Dynamic Partner Models for Adaptive Explanation GenerationCode0
Towards Budget-Friendly Model-Agnostic Explanation Generation for Large Language Models0
Generating Skyline Explanations for Graph Neural Networks0
Harnessing LLMs Explanations to Boost Surrogate Models in Tabular Data Classification0
ChartQA-X: Generating Explanations for Charts0
TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context0
Rubrik's Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset0
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face DetectorCode2
Explainable Synthetic Image Detection through Diffusion Timestep Ensembling0
EXCLAIM: An Explainable Cross-Modal Agentic System for Misinformation Detection with Hierarchical Retrieval0
MemeIntel: Explainable Detection of Propagandistic and Hateful Memes0
Reasoning About Persuasion: Can LLMs Enable Explainable Propaganda Detection?0
Coherency Improved Explainable Recommendation via Large Language Model0
Accelerating Anchors via Specialization and Feature Transformation0
Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation GenerationCode0
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasksCode0
Boosting Knowledge Graph-based Recommendations through Confidence-Aware Augmentation with Large Language Models0
Multimodal Fake News Video Explanation: Dataset, Analysis and Evaluation0
Quantifying Relational Exploration in Cultural Heritage Knowledge Graphs with LLMs: A Neuro-Symbolic Approach0
Show:102550
← PrevPage 1 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VLIS (Lynx)Accuracy80Unverified
2VLIS (LLaVA)Accuracy73Unverified
3Ground-truth Caption -> GPT3 (Oracle)Human (%)68Unverified
4Predicted Caption -> GPT3Human (%)33Unverified
5BLIP2 FlanT5-XXL (Fine-tuned)Human (%)27Unverified
6BLIP2 FlanT5-XL (Fine-tuned)Human (%)15Unverified
7BLIP2 FlanT5-XXL (Zero-shot)Human (%)0Unverified
#ModelMetricClaimedVerifiedStatus
1PJ-XB487.4Unverified
2FMB478.8Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-XHuman Explanation Rating85.7Unverified
2OFA-X-MTHuman Explanation Rating80.4Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-X-MTHuman Explanation Rating77.3Unverified
2OFA-XHuman Explanation Rating68.9Unverified
#ModelMetricClaimedVerifiedStatus
1OFA-XHuman Explanation Rating89.5Unverified
2OFA-X-MTHuman Explanation Rating87.8Unverified