| Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization | Mar 3, 2025 | HallucinationHallucination Evaluation | CodeCode Available | 0 |
| LLM-Advisor: An LLM Benchmark for Cost-efficient Path Planning across Multiple Terrains | Mar 3, 2025 | Common Sense ReasoningHallucination | —Unverified | 0 |
| Tackling Hallucination from Conditional Models for Medical Image Reconstruction with DynamicDPS | Mar 3, 2025 | HallucinationImage Reconstruction | —Unverified | 0 |
| Explainable Depression Detection in Clinical Interviews with Personalized Retrieval-Augmented Generation | Mar 3, 2025 | Depression DetectionHallucination | —Unverified | 0 |
| NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SeflCheckGPT | Mar 2, 2025 | Hallucination | CodeCode Available | 0 |
| Unmasking Digital Falsehoods: A Comparative Analysis of LLM-Based Misinformation Detection Strategies | Mar 2, 2025 | Fact CheckingFederated Learning | —Unverified | 0 |
| Steer LLM Latents for Hallucination Detection | Mar 1, 2025 | Hallucination | —Unverified | 0 |
| UniFa: A unified feature hallucination framework for any-shot object detection | Mar 1, 2025 | Generalized Zero-Shot Object DetectionHallucination | —Unverified | 0 |
| U-NIAH: Unified RAG and LLM Evaluation for Long Context Needle-In-A-Haystack | Mar 1, 2025 | HallucinationRAG | CodeCode Available | 0 |
| MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Feb 28, 2025 | Decision MakingHallucination | CodeCode Available | 0 |
| Towards General Visual-Linguistic Face Forgery Detection(V2) | Feb 28, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs | Feb 28, 2025 | Hallucination | —Unverified | 0 |
| Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow | Feb 28, 2025 | HallucinationObject | CodeCode Available | 1 |
| One-for-More: Continual Diffusion Model for Anomaly Detection | Feb 27, 2025 | Anomaly Detectioncontinual anomaly detection | CodeCode Available | 2 |
| ProAPO: Progressively Automatic Prompt Optimization for Visual Classification | Feb 27, 2025 | ClassificationHallucination | CodeCode Available | 1 |
| Vision-Encoders (Already) Know What They See: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore | Feb 27, 2025 | HallucinationObject | CodeCode Available | 0 |
| Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization | Feb 26, 2025 | Hallucination | —Unverified | 0 |
| Medical Hallucinations in Foundation Models and Their Impact on Healthcare | Feb 26, 2025 | BenchmarkingHallucination | CodeCode Available | 2 |
| On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | Feb 26, 2025 | Cross-Modal RetrievalHallucination | —Unverified | 0 |
| Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents | Feb 26, 2025 | HallucinationKnowledge Distillation | —Unverified | 0 |
| BRIDO: Bringing Democratic Order to Abstractive Summarization | Feb 25, 2025 | Abstractive Text SummarizationContrastive Learning | —Unverified | 0 |
| Verdict: A Library for Scaling Judge-Time Compute | Feb 25, 2025 | Fact CheckingHallucination | CodeCode Available | 3 |
| Stealthy Backdoor Attack in Self-Supervised Learning Vision Encoders for Large Vision Language Models | Feb 25, 2025 | Backdoor AttackHallucination | —Unverified | 0 |
| Hallucination Detection in LLMs Using Spectral Features of Attention Maps | Feb 24, 2025 | Hallucination | CodeCode Available | 1 |
| Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models | Feb 24, 2025 | HallucinationImage Captioning | —Unverified | 0 |