| U-NIAH: Unified RAG and LLM Evaluation for Long Context Needle-In-A-Haystack | Mar 1, 2025 | HallucinationRAG | CodeCode Available | 0 |
| Steer LLM Latents for Hallucination Detection | Mar 1, 2025 | Hallucination | —Unverified | 0 |
| UniFa: A unified feature hallucination framework for any-shot object detection | Mar 1, 2025 | Generalized Zero-Shot Object DetectionHallucination | —Unverified | 0 |
| MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Feb 28, 2025 | Decision MakingHallucination | CodeCode Available | 0 |
| Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs | Feb 28, 2025 | Hallucination | —Unverified | 0 |
| Vision-Encoders (Already) Know What They See: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore | Feb 27, 2025 | HallucinationObject | CodeCode Available | 0 |
| Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents | Feb 26, 2025 | HallucinationKnowledge Distillation | —Unverified | 0 |
| On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | Feb 26, 2025 | Cross-Modal RetrievalHallucination | —Unverified | 0 |
| Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization | Feb 26, 2025 | Hallucination | —Unverified | 0 |
| BRIDO: Bringing Democratic Order to Abstractive Summarization | Feb 25, 2025 | Abstractive Text SummarizationContrastive Learning | —Unverified | 0 |