| RARE: Retrieval-Augmented Reasoning Modeling | Mar 30, 2025 | HallucinationMemorization | CodeCode Available | 2 |
| An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering | Mar 30, 2025 | HallucinationMulti-hop Question Answering | —Unverified | 0 |
| Learning to Instruct for Visual Instruction Tuning | Mar 28, 2025 | HallucinationInstruction Following | —Unverified | 0 |
| Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best? | Mar 27, 2025 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search | Mar 27, 2025 | HallucinationKnowledge Distillation | —Unverified | 0 |
| Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack | Mar 27, 2025 | HallucinationRAG | —Unverified | 0 |
| Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering | Mar 26, 2025 | DiagnosticHallucination | —Unverified | 0 |
| Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs | Mar 26, 2025 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy | Mar 26, 2025 | HallucinationImage Captioning | —Unverified | 0 |
| GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization | Mar 26, 2025 | HallucinationPrompt Learning | CodeCode Available | 0 |
| TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes | Mar 26, 2025 | Hallucination | —Unverified | 0 |
| KSHSeek: Data-Driven Approaches to Mitigating and Detecting Knowledge-Shortcut Hallucinations in Generative Models | Mar 25, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Mar 25, 2025 | Cross-Modal RetrievalHallucination | CodeCode Available | 1 |
| Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation | Mar 25, 2025 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| HausaNLP at SemEval-2025 Task 3: Towards a Fine-Grained Model-Aware Hallucination Detection | Mar 25, 2025 | HallucinationNatural Language Inference | —Unverified | 0 |
| ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices | Mar 23, 2025 | HallucinationTriviaQA | —Unverified | 0 |
| GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks | Mar 23, 2025 | BenchmarkingHallucination | CodeCode Available | 1 |
| good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval | Mar 22, 2025 | DiversityHallucination | —Unverified | 0 |
| FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs | Mar 21, 2025 | HallucinationKnowledge Graphs | —Unverified | 0 |
| Judge Anything: MLLM as a Judge Across Any Modality | Mar 21, 2025 | Hallucination | —Unverified | 0 |
| ProDehaze: Prompting Diffusion Models Toward Faithful Image Dehazing | Mar 21, 2025 | HallucinationImage Dehazing | CodeCode Available | 1 |
| ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph | Mar 20, 2025 | BenchmarkingHallucination | —Unverified | 0 |
| MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations | Mar 20, 2025 | HallucinationVideo Understanding | —Unverified | 0 |
| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | Mar 20, 2025 | BenchmarkingHallucination | —Unverified | 0 |