| A Survey of Hallucination in Large Foundation Models | Sep 12, 2023 | HallucinationSurvey | CodeCode Available | 1 |
| Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations | Feb 10, 2024 | DiagnosticHallucination | CodeCode Available | 1 |
| FlySearch: Exploring how vision-language models explore | Jun 3, 2025 | HallucinationTask Planning | CodeCode Available | 1 |
| Citation-Enhanced Generation for LLM-based Chatbots | Feb 25, 2024 | ChatbotCitation Prediction | CodeCode Available | 1 |
| Circuit Transformer: A Transformer That Preserves Logical Equivalence | Mar 14, 2024 | Hallucination | CodeCode Available | 1 |
| PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model | Jan 21, 2025 | HallucinationImage Captioning | CodeCode Available | 1 |
| FineSurE: Fine-grained Summarization Evaluation using LLMs | Jul 1, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented Generation | May 16, 2025 | HallucinationRAG | CodeCode Available | 1 |
| AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant | Nov 11, 2024 | Decision MakingHallucination | CodeCode Available | 1 |
| CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools | Jul 28, 2023 | Hallucination | CodeCode Available | 1 |
| Federated Recommendation via Hybrid Retrieval Augmented Generation | Mar 7, 2024 | HallucinationPrivacy Preserving | CodeCode Available | 1 |
| CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification | Apr 30, 2024 | Code GenerationHallucination | CodeCode Available | 1 |
| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Hallucinated Neural Radiance Fields in the Wild | Nov 30, 2021 | HallucinationNeRF | CodeCode Available | 1 |
| Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion | Dec 12, 2024 | HallucinationKnowledge Graph Completion | CodeCode Available | 1 |
| Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Jun 9, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| FaithDial: A Faithful Benchmark for Information-Seeking Dialogue | Apr 22, 2022 | Dialogue GenerationHallucination | CodeCode Available | 1 |
| CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL | Nov 2, 2023 | HallucinationRetrieval | CodeCode Available | 1 |
| ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries | Apr 26, 2023 | Data SummarizationHallucination | CodeCode Available | 1 |
| ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark Dataset | Jan 16, 2025 | HallucinationSentence | CodeCode Available | 1 |
| A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation | Apr 18, 2021 | FormHallucination | CodeCode Available | 1 |
| High-resolution Face Swapping via Latent Semantics Disentanglement | Mar 30, 2022 | DisentanglementFace Swapping | CodeCode Available | 1 |
| Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding | Jun 18, 2024 | Hallucination | CodeCode Available | 1 |
| Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers | Nov 13, 2023 | Hallucinationknowledge editing | CodeCode Available | 1 |
| HyperPocket: Generative Point Cloud Completion | Feb 11, 2021 | HallucinationObject Hallucination | CodeCode Available | 1 |
| Generating Natural Language Proofs with Verifier-Guided Search | May 25, 2022 | Hallucinationvalid | CodeCode Available | 1 |
| Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration | Apr 15, 2024 | Hallucination | CodeCode Available | 1 |
| Extract Free Dense Misalignment from CLIP | Dec 24, 2024 | HallucinationImage Generation | CodeCode Available | 1 |
| Into the Unknown: Self-Learning Large Language Models | Feb 14, 2024 | HallucinationSelf-Learning | CodeCode Available | 1 |
| Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity | Feb 9, 2024 | Conformal PredictionHallucination | CodeCode Available | 1 |
| CR-LT-KGQA: A Knowledge Graph Question Answering Dataset Requiring Commonsense Reasoning and Long-Tail Knowledge | Mar 3, 2024 | Claim VerificationGraph Question Answering | CodeCode Available | 1 |
| IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking | Oct 9, 2024 | ARCCode Generation | CodeCode Available | 1 |
| Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models | Jun 5, 2025 | DiagnosticHallucination | CodeCode Available | 1 |
| KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection | Oct 13, 2023 | Abstractive Text SummarizationHallucination | CodeCode Available | 1 |
| Face Hallucination via Split-Attention in Split-Attention Network | Oct 22, 2020 | Face DetectionFace Hallucination | CodeCode Available | 1 |
| Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation | Mar 25, 2025 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| Context-aware Decoding Reduces Hallucination in Query-focused Summarization | Dec 21, 2023 | HallucinationLanguage Modelling | CodeCode Available | 1 |
| Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning | Sep 3, 2018 | Face HallucinationHallucination | CodeCode Available | 1 |
| Exploring the Transferability of Visual Prompting for Multimodal Large Language Models | Apr 17, 2024 | HallucinationMultimodal Reasoning | CodeCode Available | 1 |
| FactAlign: Long-form Factuality Alignment of Large Language Models | Oct 2, 2024 | FormHallucination | CodeCode Available | 1 |
| Evaluation and Analysis of Hallucination in Large Vision-Language Models | Aug 29, 2023 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| Know Or Not: a library for evaluating out-of-knowledge base robustness | May 19, 2025 | HallucinationRAG | CodeCode Available | 1 |
| Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models | Jun 24, 2024 | Hallucination | CodeCode Available | 1 |
| Controllable Neural Dialogue Summarization with Personal Named Entity Planning | Sep 27, 2021 | dialogue summaryHallucination | CodeCode Available | 1 |
| Label Hallucination for Few-Shot Classification | Dec 6, 2021 | ClassificationFew-Shot Learning | CodeCode Available | 1 |
| LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video Reconstruction | Aug 22, 2023 | HallucinationMotion Compensation | CodeCode Available | 1 |
| EventHallusion: Diagnosing Event Hallucinations in Video LLMs | Sep 25, 2024 | HallucinationInstruction Following | CodeCode Available | 1 |
| Large Language Models for Multi-Robot Systems: A Survey | Feb 6, 2025 | Action GenerationBenchmarking | CodeCode Available | 1 |
| Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering | Sep 19, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers | Oct 16, 2023 | 16kHallucination | CodeCode Available | 1 |