| LLM-Advisor: An LLM Benchmark for Cost-efficient Path Planning across Multiple Terrains | Mar 3, 2025 | Common Sense Reasoning, Hallucination | Unverified | 0 |
| Tackling Hallucination from Conditional Models for Medical Image Reconstruction with DynamicDPS | Mar 3, 2025 | Hallucination, Image Reconstruction | Unverified | 0 |
| Explainable Depression Detection in Clinical Interviews with Personalized Retrieval-Augmented Generation | Mar 3, 2025 | Depression Detection, Hallucination | Unverified | 0 |
| NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SeflCheckGPT | Mar 2, 2025 | Hallucination | Code Available | 0 |
| Unmasking Digital Falsehoods: A Comparative Analysis of LLM-Based Misinformation Detection Strategies | Mar 2, 2025 | Fact Checking, Federated Learning | Unverified | 0 |
| Steer LLM Latents for Hallucination Detection | Mar 1, 2025 | Hallucination | Unverified | 0 |
| U-NIAH: Unified RAG and LLM Evaluation for Long Context Needle-In-A-Haystack | Mar 1, 2025 | Hallucination, RAG | Code Available | 0 |
| UniFa: A unified feature hallucination framework for any-shot object detection | Mar 1, 2025 | Generalized Zero-Shot Object Detection, Hallucination | Unverified | 0 |
| Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs | Feb 28, 2025 | Hallucination | Unverified | 0 |
| MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Feb 28, 2025 | Decision Making, Hallucination | Code Available | 0 |
| Vision-Encoders (Already) Know What They See: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore | Feb 27, 2025 | Hallucination, Object | Code Available | 0 |
| On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | Feb 26, 2025 | Cross-Modal Retrieval, Hallucination | Unverified | 0 |
| Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents | Feb 26, 2025 | Hallucination, Knowledge Distillation | Unverified | 0 |
| Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization | Feb 26, 2025 | Hallucination | Unverified | 0 |
| BRIDO: Bringing Democratic Order to Abstractive Summarization | Feb 25, 2025 | Abstractive Text Summarization, Contrastive Learning | Unverified | 0 |
| Stealthy Backdoor Attack in Self-Supervised Learning Vision Encoders for Large Vision Language Models | Feb 25, 2025 | Backdoor Attack, Hallucination | Unverified | 0 |
| 'Generalization is hallucination' through the lens of tensor completions | Feb 24, 2025 | Hallucination, Position | Unverified | 0 |
| Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models | Feb 24, 2025 | Hallucination, Image Captioning | Unverified | 0 |
| Uncertainty-Aware Fusion: An Ensemble Framework for Mitigating Hallucinations in Large Language Models | Feb 22, 2025 | Hallucination, Question Answering | Unverified | 0 |
| The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination | Feb 22, 2025 | Hallucination, Text Generation | Unverified | 0 |
| ZiGong 1.0: A Large Language Model for Financial Credit | Feb 22, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| The Role of Background Information in Reducing Object Hallucination in Vision-Language Models: Insights from Cutoff API Prompting | Feb 21, 2025 | Hallucination, Object | Unverified | 0 |
| Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs | Feb 20, 2025 | Hallucination, Topic Models | Unverified | 0 |
| Hallucination Detection in Large Language Models with Metamorphic Relations | Feb 20, 2025 | Hallucination | Unverified | 0 |
| MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models | Feb 20, 2025 | Decision Making, Hallucination | Unverified | 0 |