| Towards An End-to-End Framework for Flow-Guided Video Inpainting | Apr 6, 2022 | Hallucination, Optical Flow Estimation | Code Available | 3 |
| LLMDFA: Analyzing Dataflow in Code with Large Language Models | Feb 16, 2024 | Hallucination | Code Available | 3 |
| Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models | Sep 3, 2023 | Hallucination, World Knowledge | Code Available | 3 |
| Evaluating Hallucinations in Chinese Large Language Models | Oct 5, 2023 | Hallucination, Question Answering | Code Available | 3 |
| Automated Hypothesis Validation with Agentic Sequential Falsifications | Feb 14, 2025 | Decision Making, Hallucination | Code Available | 3 |
| RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework | Aug 2, 2024 | Benchmarking, Dataset Generation | Code Available | 3 |
| RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models | May 23, 2024 | Hallucination, Sentence | Code Available | 3 |
| Retrieval Head Mechanistically Explains Long-Context Factuality | Apr 24, 2024 | Continual Pretraining, Hallucination | Code Available | 3 |
| RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing | Apr 30, 2024 | Computational Efficiency, Hallucination | Code Available | 3 |
| PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models | Mar 8, 2020 | Face Hallucination, Hallucination | Code Available | 3 |
| RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Mar 8, 2024 | Code Generation, Hallucination | Code Available | 3 |
| AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models | May 22, 2025 | Benchmarking, Fairness | Code Available | 3 |
| PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models | Feb 12, 2024 | Answer Generation, Hallucination | Code Available | 3 |
| Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Oct 9, 2024 | Benchmarking, Decision Making | Code Available | 3 |
| Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models | Mar 19, 2024 | Hallucination | Code Available | 3 |
| PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Feb 2, 2024 | Action Generation, Decision Making | Code Available | 3 |
| Mitigating Object Hallucination via Concentric Causal Attention | Oct 21, 2024 | Hallucination, Object | Code Available | 2 |
| Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Jan 1, 2025 | Hallucination, Response Generation | Code Available | 2 |
| MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation | Oct 15, 2024 | Hallucination, Language Modeling | Code Available | 2 |
| Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models | Aug 4, 2024 | Hallucination | Code Available | 2 |
| Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding | Nov 28, 2023 | Hallucination, Object | Code Available | 2 |
| MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models | Aug 17, 2023 | Decision Making, Hallucination | Code Available | 2 |
| A Diffusion-Based Generative Equalizer for Music Restoration | Mar 27, 2024 | Bandwidth Extension, Hallucination | Code Available | 2 |
| Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key | Jan 16, 2025 | 16k, Hallucination | Code Available | 2 |
| Medical Hallucinations in Foundation Models and Their Impact on Healthcare | Feb 26, 2025 | Benchmarking, Hallucination | Code Available | 2 |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Jun 17, 2024 | Hallucination, Language Modeling | Code Available | 2 |
| MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation | Jul 2, 2024 | Hallucination, RAG | Code Available | 2 |
| Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality | Oct 7, 2024 | Causal Inference, counterfactual | Code Available | 2 |
| MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning | Sep 14, 2023 | Hallucination, In-Context Learning | Code Available | 2 |
| 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination | Jun 7, 2024 | Hallucination | Code Available | 2 |
| Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Jul 9, 2024 | Articles, Hallucination | Code Available | 2 |
| Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie Worksheets | Jul 8, 2024 | Hallucination, Navigate | Code Available | 2 |
| Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models | Oct 4, 2024 | Decoder, Hallucination | Code Available | 2 |
| Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective | Feb 22, 2024 | Hallucination, Sentence | Code Available | 2 |
| ANAH: Analytical Annotation of Hallucinations in Large Language Models | May 30, 2024 | Generative Question Answering, Hallucination | Code Available | 2 |
| KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking | Apr 3, 2024 | Fact Checking, Form | Code Available | 2 |
| Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions | Dec 20, 2022 | Hallucination, Question Answering | Code Available | 2 |
| Lawyer LLaMA Technical Report | May 24, 2023 | Articles, Hallucination | Code Available | 2 |
| LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation | Jan 30, 2024 | Hallucination, Knowledge Distillation | Code Available | 2 |
| LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models | Jun 15, 2023 | Hallucination, Image Captioning | Code Available | 2 |
| CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs | Jan 28, 2025 | Hallucination | Code Available | 2 |
| HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models | May 19, 2023 | Hallucination, Hallucination Evaluation | Code Available | 2 |
| HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Oct 23, 2023 | Diagnostic, Hallucination | Code Available | 2 |
| HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation | May 19, 2023 | Hallucination, Machine Translation | Code Available | 2 |
| VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Mar 29, 2024 | Hallucination, Image Captioning | Code Available | 2 |
| HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding | Mar 1, 2024 | Hallucination, Object | Code Available | 2 |
| Calibrated Self-Rewarding Vision Language Models | May 23, 2024 | Hallucination, Language Modelling | Code Available | 2 |
| Knowledge Graph-Guided Retrieval Augmented Generation | Feb 8, 2025 | Diversity, Hallucination | Code Available | 2 |
| Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Jun 11, 2024 | Hallucination, Image Description | Code Available | 2 |
| Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering | Apr 23, 2024 | Graph Question Answering, Hallucination | Code Available | 2 |