| Phare: A Safety Probe for Large Language Models | May 16, 2025 | DiagnosticHallucination | CodeCode Available | 1 | 5 |
| IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking | Oct 9, 2024 | ARCCode Generation | CodeCode Available | 1 | 5 |
| Entity-level Factual Consistency of Abstractive Text Summarization | Feb 18, 2021 | Abstractive Text SummarizationHallucination | CodeCode Available | 1 | 5 |
| KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection | Oct 13, 2023 | Abstractive Text SummarizationHallucination | CodeCode Available | 1 | 5 |
| Multimodal LLMs Can Reason about Aesthetics in Zero-Shot | Jan 15, 2025 | BenchmarkingHallucination | CodeCode Available | 1 | 5 |
| Evaluation and Analysis of Hallucination in Large Vision-Language Models | Aug 29, 2023 | HallucinationHallucination Evaluation | CodeCode Available | 1 | 5 |
| Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models | Jun 24, 2024 | Common Sense ReasoningHallucination | CodeCode Available | 1 | 5 |
| NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination | Jul 27, 2020 | HallucinationObject Detection | CodeCode Available | 1 | 5 |
| Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus | Nov 22, 2023 | HallucinationRetrieval | CodeCode Available | 1 | 5 |
| MMRel: A Relation Understanding Benchmark in the MLLM Era | Jun 13, 2024 | DiversityHallucination | CodeCode Available | 1 | 5 |
| Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling | May 16, 2024 | Contrastive LearningHallucination | CodeCode Available | 1 | 5 |
| Knowledge Graph-Enhanced Large Language Models via Path Selection | Jun 19, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 1 | 5 |
| Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | May 31, 2024 | HallucinationMulti-Task Learning | CodeCode Available | 1 | 5 |
| Mitigating Open-Vocabulary Caption Hallucinations | Dec 6, 2023 | DiversityHallucination | CodeCode Available | 1 | 5 |
| Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Oct 6, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| Entity-Based Knowledge Conflicts in Question Answering | Sep 10, 2021 | HallucinationOut-of-Distribution Generalization | CodeCode Available | 1 | 5 |
| Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | Apr 15, 2024 | BenchmarkingBias Detection | CodeCode Available | 1 | 5 |
| Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback | Apr 22, 2024 | AttributeHallucination | CodeCode Available | 1 | 5 |
| Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources | May 22, 2023 | HallucinationLanguage Modelling | CodeCode Available | 1 | 5 |
| Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards | May 7, 2025 | BenchmarkingHallucination | CodeCode Available | 1 | 5 |
| A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs | May 13, 2025 | HallucinationUncertainty Quantification | CodeCode Available | 1 | 5 |
| Detecting Hallucinated Content in Conditional Neural Sequence Generation | Nov 5, 2020 | Abstractive Text SummarizationHallucination | CodeCode Available | 1 | 5 |
| Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations | Apr 18, 2025 | Hallucination | CodeCode Available | 1 | 5 |
| Accuracy and Political Bias of News Source Credibility Ratings by Large Language Models | Apr 1, 2023 | Fact CheckingHallucination | CodeCode Available | 1 | 5 |
| Enhancing LLM's Cognition via Structurization | Jul 23, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 1 | 5 |
| EventHallusion: Diagnosing Event Hallucinations in Video LLMs | Sep 25, 2024 | HallucinationInstruction Following | CodeCode Available | 1 | 5 |
| "Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation | Dec 18, 2023 | HallucinationLanguage Modelling | CodeCode Available | 1 | 5 |
| Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model | Aug 2, 2023 | HallucinationImage Captioning | CodeCode Available | 1 | 5 |
| Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization | Nov 28, 2023 | HallucinationMME | CodeCode Available | 1 | 5 |
| Learning From Correctness Without Prompting Makes LLM Efficient Reasoner | Mar 28, 2024 | Hallucination | CodeCode Available | 1 | 5 |
| Mitigating Object Hallucinations via Sentence-Level Early Intervention | Jul 16, 2025 | HallucinationMM-Vet | CodeCode Available | 1 | 5 |
| Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method | May 22, 2023 | BenchmarkingHallucination | CodeCode Available | 1 | 5 |
| Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Apr 4, 2025 | ClusteringHallucination | CodeCode Available | 1 | 5 |
| DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models | Mar 1, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 1 | 5 |
| EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models | Feb 15, 2024 | HallucinationObject Hallucination | CodeCode Available | 1 | 5 |
| EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control | Apr 14, 2025 | Hallucination | CodeCode Available | 1 | 5 |
| MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Jul 5, 2024 | HallucinationImage Generation | CodeCode Available | 1 | 5 |
| Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond | Jun 16, 2023 | BenchmarkingEvidence Selection | CodeCode Available | 1 | 5 |
| AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language Generation | Oct 4, 2023 | HallucinationText Generation | CodeCode Available | 1 | 5 |
| Knowledge Verification to Nip Hallucination in the Bud | Jan 19, 2024 | HallucinationWorld Knowledge | CodeCode Available | 1 | 5 |
| ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Jan 9, 2025 | FairnessHallucination | CodeCode Available | 1 | 5 |
| EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset | Oct 11, 2021 | BenchmarkingFace Hallucination | CodeCode Available | 1 | 5 |
| Mitigating Hallucinations in Large Vision-Language Models via Summary-Guided Decoding | Oct 17, 2024 | HallucinationObject Hallucination | CodeCode Available | 1 | 5 |
| Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow | Feb 28, 2025 | HallucinationObject | CodeCode Available | 1 | 5 |
| DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Jun 9, 2024 | Common Sense ReasoningDenoising | CodeCode Available | 1 | 5 |
| Doc2Query--: When Less is More | Jan 9, 2023 | HallucinationRetrieval | CodeCode Available | 1 | 5 |
| Improving Simultaneous Machine Translation with Monolingual Data | Dec 2, 2022 | HallucinationKnowledge Distillation | CodeCode Available | 1 | 5 |
| Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression | May 22, 2025 | HallucinationImage Description | CodeCode Available | 1 | 5 |
| No-Reference Image Quality Assessment by Hallucinating Pristine Features | Aug 9, 2021 | DisentanglementHallucination | CodeCode Available | 1 | 5 |
| "Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge Graphs | Sep 15, 2023 | HallucinationKnowledge Graphs | CodeCode Available | 0 | 5 |