| Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis | May 20, 2025 | Hallucination | —Unverified | 0 |
| Towards Omnidirectional Reasoning with 360-R1: A Dataset, Benchmark, and GRPO-based Method | May 20, 2025 | HallucinationObject Localization | —Unverified | 0 |
| Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents | May 20, 2025 | Hallucination | —Unverified | 0 |
| MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations | May 20, 2025 | Fact CheckingHallucination | CodeCode Available | 0 |
| The Hallucination Tax of Reinforcement Finetuning | May 20, 2025 | HallucinationMath | —Unverified | 0 |
| Aligning Attention Distribution to Information Flow for Hallucination Mitigation in Large Vision-Language Models | May 20, 2025 | HallucinationImage Captioning | —Unverified | 0 |
| Know Or Not: a library for evaluating out-of-knowledge base robustness | May 19, 2025 | HallucinationRAG | CodeCode Available | 1 |
| Selective Code Generation for Functional Guarantees | May 19, 2025 | Code GenerationHallucination | —Unverified | 0 |
| Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down | May 19, 2025 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Granary: Speech Recognition and Translation Dataset in 25 European Languages | May 19, 2025 | HallucinationPunctuation Restoration | —Unverified | 0 |