| Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector | Mar 26, 2025 | Binary ClassificationDeepFake Detection | CodeCode Available | 2 |
| MACRec: a Multi-Agent Collaboration Framework for Recommendation | Feb 23, 2024 | Conversational RecommendationDecision Making | CodeCode Available | 2 |
| Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations? | Apr 29, 2024 | Answer GenerationBenchmarking | CodeCode Available | 1 |
| Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond | Mar 15, 2024 | Explanation GenerationImage Generation | CodeCode Available | 1 |
| XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs | Nov 15, 2023 | Decision MakingDecoder | CodeCode Available | 1 |
| A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering | Nov 13, 2023 | Decision MakingExplanation Generation | CodeCode Available | 1 |
| VLIS: Unimodal Language Models Guide Multimodal Language Generation | Oct 15, 2023 | Caption GenerationExplanation Generation | CodeCode Available | 1 |
| EX-FEVER: A Dataset for Multi-hop Explainable Fact Verification | Oct 15, 2023 | Claim VerificationExplanation Generation | CodeCode Available | 1 |
| LLM4Vis: Explainable Visualization Recommendation using ChatGPT | Oct 11, 2023 | Data VisualizationExplanation Generation | CodeCode Available | 1 |
| Fin-Fact: A Benchmark Dataset for Multimodal Financial Fact Checking and Explanation Generation | Sep 15, 2023 | Explanation GenerationFact Checking | CodeCode Available | 1 |