| Evaluating ChatGPT as a Question Answering System: A Comprehensive Analysis and Comparison with Existing Models | Dec 11, 2023 | Hallucination, Language Modelling | Unverified | 0 |
| Context Tuning for Retrieval Augmented Generation | Dec 9, 2023 | Hallucination, RAG | Unverified | 0 |
| DelucionQA: Detecting Hallucinations in Domain-specific Question Answering | Dec 8, 2023 | Hallucination, Information Retrieval | Unverified | 0 |
| HALO: An Ontology for Representing and Categorizing Hallucinations in Large Language Models | Dec 8, 2023 | Hallucination | Unverified | 0 |
| Behind the Magic, MERLIM: Multi-modal Evaluation Benchmark for Large Image-Language Models | Dec 3, 2023 | Hallucination, Visual Grounding | Code Available | 0 |
| On Exploring the Reasoning Capability of Large Language Models with Knowledge Graphs | Dec 1, 2023 | Hallucination, Knowledge Graphs | Unverified | 0 |
| How to Build an AI Tutor That Can Adapt to Any Course Using Knowledge Graph-Enhanced Retrieval-Augmented Generation (KG-RAG) | Nov 29, 2023 | Hallucination, Knowledge Graphs | Unverified | 0 |
| Understanding Your Agent: Leveraging Large Language Models for Behavior Explanation | Nov 29, 2023 | Counterfactual, Hallucination | Unverified | 0 |
| Combating the "Sameness" in AI Art: Reflections on the Interactive AI Installation Fencing Hallucination | Nov 28, 2023 | Hallucination | Unverified | 0 |
| Mitigating Hallucination in Visual Language Models with Visual Supervision | Nov 27, 2023 | Hallucination | Unverified | 0 |
| Deficiency of Large Language Models in Finance: An Empirical Examination of Hallucination | Nov 27, 2023 | Few-Shot Learning, Hallucination | Unverified | 0 |
| Calibrated Language Models Must Hallucinate | Nov 24, 2023 | Articles, Hallucination | Unverified | 0 |
| Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach | Nov 23, 2023 | Decision Making, Hallucination | Unverified | 0 |
| Challenges of Large Language Models for Mental Health Counseling | Nov 23, 2023 | Hallucination, Navigate | Unverified | 0 |
| Minimizing Factual Inconsistency and Hallucination in Large Language Models | Nov 23, 2023 | Hallucination, RAG | Unverified | 0 |
| Mitigating Large Language Model Hallucinations via Autonomous Knowledge Graph-based Retrofitting | Nov 22, 2023 | Hallucination, Language Modeling | Unverified | 0 |
| KNVQA: A Benchmark for evaluation knowledge-based VQA | Nov 21, 2023 | Hallucination, Object Hallucination | Unverified | 0 |
| Adapting LLMs for Efficient, Personalized Information Retrieval: Methods and Implications | Nov 21, 2023 | Chatbot, Hallucination | Unverified | 0 |
| Control in Hybrid Chatbots | Nov 20, 2023 | Chatbot, Hallucination | Unverified | 0 |
| GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration | Nov 20, 2023 | Hallucination, Language Modeling | Unverified | 0 |
| Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object Detection | Nov 19, 2023 | Counterfactual, Hallucination | Code Available | 0 |
| Journey of Hallucination-minimized Generative AI Solutions for Financial Decision Makers | Nov 18, 2023 | Answer Generation, Decision Making | Unverified | 0 |
| Crafting In-context Examples according to LMs' Parametric Knowledge | Nov 16, 2023 | Hallucination, In-Context Learning | Code Available | 0 |
| Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination? | Nov 16, 2023 | Hallucination, Sentence | Code Available | 0 |
| How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities | Nov 15, 2023 | Ethics, Fairness | Code Available | 0 |