| RARE: Retrieval-Augmented Reasoning Modeling | Mar 30, 2025 | Hallucination, Memorization | Code Available | 2 |
| An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering | Mar 30, 2025 | Hallucination, Multi-hop Question Answering | Unverified | 0 |
| Learning to Instruct for Visual Instruction Tuning | Mar 28, 2025 | Hallucination, Instruction Following | Unverified | 0 |
| Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best? | Mar 27, 2025 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search | Mar 27, 2025 | Hallucination, Knowledge Distillation | Unverified | 0 |
| Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack | Mar 27, 2025 | Hallucination, RAG | Unverified | 0 |
| Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering | Mar 26, 2025 | Diagnostic, Hallucination | Unverified | 0 |
| Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs | Mar 26, 2025 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy | Mar 26, 2025 | Hallucination, Image Captioning | Unverified | 0 |
| GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization | Mar 26, 2025 | Hallucination, Prompt Learning | Code Available | 0 |
| TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes | Mar 26, 2025 | Hallucination | Unverified | 0 |
| KSHSeek: Data-Driven Approaches to Mitigating and Detecting Knowledge-Shortcut Hallucinations in Generative Models | Mar 25, 2025 | Hallucination, Question Answering | Unverified | 0 |
| LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Mar 25, 2025 | Cross-Modal Retrieval, Hallucination | Code Available | 1 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | Hallucination, Language Modeling | Code Available | 1 |
| HausaNLP at SemEval-2025 Task 3: Towards a Fine-Grained Model-Aware Hallucination Detection | Mar 25, 2025 | Hallucination, Natural Language Inference | Unverified | 0 |
| Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation | Mar 25, 2025 | Hallucination, Hallucination Evaluation | Code Available | 1 |
| ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices | Mar 23, 2025 | Hallucination, TriviaQA | Unverified | 0 |
| GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks | Mar 23, 2025 | Benchmarking, Hallucination | Code Available | 1 |
| good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval | Mar 22, 2025 | Diversity, Hallucination | Unverified | 0 |
| Judge Anything: MLLM as a Judge Across Any Modality | Mar 21, 2025 | Hallucination | Unverified | 0 |
| FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs | Mar 21, 2025 | Hallucination, Knowledge Graphs | Unverified | 0 |
| ProDehaze: Prompting Diffusion Models Toward Faithful Image Dehazing | Mar 21, 2025 | Hallucination, Image Dehazing | Code Available | 1 |
| MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations | Mar 20, 2025 | Hallucination, Video Understanding | Unverified | 0 |
| Towards Lighter and Robust Evaluation for Retrieval Augmented Generation | Mar 20, 2025 | Hallucination, RAG | Code Available | 0 |
| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | Mar 20, 2025 | Benchmarking, Hallucination | Unverified | 0 |
| ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph | Mar 20, 2025 | Benchmarking, Hallucination | Unverified | 0 |
| Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models | Mar 19, 2025 | Fact Checking, Fact Verification | Unverified | 0 |
| R^2: A LLM Based Novel-to-Screenplay Generation Framework with Causal Plot Graphs | Mar 19, 2025 | graph construction, Hallucination | Unverified | 0 |
| MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models | Mar 19, 2025 | Adversarial Robustness, Autonomous Driving | Unverified | 0 |
| Enhancing LLM Generation with Knowledge Hypergraph for Evidence-Based Medicine | Mar 18, 2025 | Hallucination, RAG | Unverified | 0 |
| From "Hallucination" to "Suture": Insights from Language Philosophy to Enhance Large Language Models | Mar 18, 2025 | Hallucination, Philosophy | Unverified | 0 |
| Learning on LLM Output Signatures for gray-box LLM Behavior Analysis | Mar 18, 2025 | Hallucination | Code Available | 0 |
| RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving | Mar 18, 2025 | Autonomous Driving, Decision Making | Unverified | 0 |
| HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models | Mar 17, 2025 | Hallucination, Question Answering | Code Available | 0 |
| Grounded Chain-of-Thought for Multimodal Large Language Models | Mar 17, 2025 | Hallucination, Spatial Reasoning | Code Available | 1 |
| ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models | Mar 17, 2025 | Computational Efficiency, Hallucination | Code Available | 2 |
| LLMSeR: Enhancing Sequential Recommendation via LLM-based Data Augmentation | Mar 16, 2025 | Data Augmentation, Hallucination | Unverified | 0 |
| Applications of Large Language Model Reasoning in Feature Generation | Mar 15, 2025 | Computational Efficiency, Domain Adaptation | Unverified | 0 |
| RAG-KG-IL: A Multi-Agent Hybrid Framework for Reducing Hallucinations and Enhancing LLM Reasoning through RAG and Incremental Knowledge Graph Learning Integration | Mar 14, 2025 | Graph Learning, Hallucination | Unverified | 0 |
| LLM Agents for Education: Advances and Applications | Mar 14, 2025 | Fairness, Hallucination | Unverified | 0 |
| AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation | Mar 14, 2025 | Abstractive Text Summarization, Chunking | Code Available | 0 |
| Prompt Injection Detection and Mitigation via AI Multi-Agent NLP Frameworks | Mar 14, 2025 | Hallucination | Code Available | 0 |
| Learning to Inference Adaptively for Multimodal Large Language Models | Mar 13, 2025 | Hallucination, Question Answering | Unverified | 0 |
| TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention | Mar 13, 2025 | Hallucination, Object Hallucination | Code Available | 1 |
| Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding | Mar 13, 2025 | Hallucination, Text Generation | Code Available | 0 |
| Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets | Mar 12, 2025 | Answer Generation, Conversational Search | Code Available | 0 |
| Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection | Mar 12, 2025 | Decision Making, Fake News Detection | Unverified | 0 |
| NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model | Mar 12, 2025 | Hallucination, Language Modeling | Code Available | 0 |
| Attention Hijackers: Detect and Disentangle Attention Hijacking in LVLMs for Hallucination Mitigation | Mar 11, 2025 | Attribute, Disentanglement | Unverified | 0 |
| Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation | Mar 11, 2025 | Computational Efficiency, Hallucination | Unverified | 0 |