| Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | Apr 15, 2024 | BenchmarkingBias Detection | CodeCode Available | 1 |
| Analyzing and Mitigating Object Hallucination in Large Vision-Language Models | Oct 1, 2023 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| High-resolution Face Swapping via Latent Semantics Disentanglement | Mar 30, 2022 | DisentanglementFace Swapping | CodeCode Available | 1 |
| Into the Unknown: Self-Learning Large Language Models | Feb 14, 2024 | HallucinationSelf-Learning | CodeCode Available | 1 |
| Hallucination Detection in LLMs Using Spectral Features of Attention Maps | Feb 24, 2025 | Hallucination | CodeCode Available | 1 |
| Hallucination Augmented Contrastive Learning for Multimodal Large Language Model | Dec 12, 2023 | Contrastive LearningHallucination | CodeCode Available | 1 |
| Hallucinated Neural Radiance Fields in the Wild | Nov 30, 2021 | HallucinationNeRF | CodeCode Available | 1 |
| Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models | May 11, 2025 | DescriptiveDiagnostic | CodeCode Available | 1 |
| Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards | May 7, 2025 | BenchmarkingHallucination | CodeCode Available | 1 |
| HallE-Control: Controlling Object Hallucination in Large Multimodal Models | Oct 3, 2023 | AttributeDecoder | CodeCode Available | 1 |
| Phare: A Safety Probe for Large Language Models | May 16, 2025 | DiagnosticHallucination | CodeCode Available | 1 |
| Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Oct 6, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model | Aug 2, 2023 | HallucinationImage Captioning | CodeCode Available | 1 |
| Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization | Nov 28, 2023 | HallucinationMME | CodeCode Available | 1 |
| HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data | Nov 22, 2023 | Attributecounterfactual | CodeCode Available | 1 |
| A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity | Feb 8, 2023 | Code GenerationHallucination | CodeCode Available | 1 |
| How well can a large language model explain business processes as perceived by users? | Jan 23, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models | Sep 23, 2023 | Code CompletionHallucination | CodeCode Available | 1 |
| Grounded Chain-of-Thought for Multimodal Large Language Models | Mar 17, 2025 | HallucinationSpatial Reasoning | CodeCode Available | 1 |
| Balanced Classification: A Unified Framework for Long-Tailed Object Detection | Aug 4, 2023 | HallucinationLong-tailed Object Detection | CodeCode Available | 1 |
| BIGPrior: Towards Decoupling Learned Prior Hallucination and Data Fidelity in Image Restoration | Nov 3, 2020 | ColorizationDenoising | CodeCode Available | 1 |
| InterrogateLLM: Zero-Resource Hallucination Detection in LLM-Generated Answers | Mar 5, 2024 | Hallucination | CodeCode Available | 1 |
| BachGAN: High-Resolution Image Synthesis from Salient Object Layout | Mar 26, 2020 | Generative Adversarial NetworkHallucination | CodeCode Available | 1 |
| GraphArena: Benchmarking Large Language Models on Graph Computational Problems | Jun 29, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning | Jul 22, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models | Jun 30, 2024 | Hallucinationmultimodal interaction | CodeCode Available | 1 |
| Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity | Feb 9, 2024 | Conformal PredictionHallucination | CodeCode Available | 1 |
| IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking | Oct 9, 2024 | ARCCode Generation | CodeCode Available | 1 |
| Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models | Jun 5, 2025 | DiagnosticHallucination | CodeCode Available | 1 |
| KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection | Oct 13, 2023 | Abstractive Text SummarizationHallucination | CodeCode Available | 1 |
| LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation | Apr 22, 2024 | HallucinationRAG | CodeCode Available | 1 |
| FlySearch: Exploring how vision-language models explore | Jun 3, 2025 | HallucinationTask Planning | CodeCode Available | 1 |
| Automatic Curriculum Expert Iteration for Reliable LLM Reasoning | Oct 10, 2024 | HallucinationLogical Reasoning | CodeCode Available | 1 |
| 3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior | Mar 31, 2020 | 3D Semantic Scene Completion3D Semantic Scene Completion from a single RGB image | CodeCode Available | 1 |
| Advancing TTP Analysis: Harnessing the Power of Large Language Models with Retrieval Augmented Generation | Dec 30, 2023 | DecoderHallucination | CodeCode Available | 1 |
| BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models | Oct 2, 2023 | HallucinationRetrieval | CodeCode Available | 1 |
| AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation | Nov 13, 2023 | AttributeHallucination | CodeCode Available | 1 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| AdaPlanner: Adaptive Planning from Feedback with Language Models | May 26, 2023 | Decision MakingHallucination | CodeCode Available | 1 |
| Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations | Feb 10, 2024 | DiagnosticHallucination | CodeCode Available | 1 |
| LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video Reconstruction | Aug 22, 2023 | HallucinationMotion Compensation | CodeCode Available | 1 |
| Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning | Jan 31, 2023 | HallucinationSemantic Parsing | CodeCode Available | 1 |
| Automated Review Generation Method Based on Large Language Models | Jul 30, 2024 | ArticlesHallucination | CodeCode Available | 1 |
| FineSurE: Fine-grained Summarization Evaluation using LLMs | Jul 1, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception | Apr 29, 2025 | counterfactualHallucination | CodeCode Available | 1 |
| Can Knowledge Editing Really Correct Hallucinations? | Oct 21, 2024 | Hallucinationknowledge editing | CodeCode Available | 1 |
| Automated Multi-level Preference for MLLMs | May 18, 2024 | Dataset GenerationHallucination | CodeCode Available | 1 |
| Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented Generation | May 16, 2025 | HallucinationRAG | CodeCode Available | 1 |
| LiDAR-based 4D Occupancy Completion and Forecasting | Oct 17, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model | Jan 21, 2025 | HallucinationImage Captioning | CodeCode Available | 1 |