| The Hallucination Tax of Reinforcement Finetuning | May 20, 2025 | HallucinationMath | —Unverified | 0 |
| Aligning Attention Distribution to Information Flow for Hallucination Mitigation in Large Vision-Language Models | May 20, 2025 | HallucinationImage Captioning | —Unverified | 0 |
| Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents | May 20, 2025 | Hallucination | —Unverified | 0 |
| Plane Geometry Problem Solving with Multi-modal Reasoning: A Survey | May 20, 2025 | DecoderGeometry Problem Solving | —Unverified | 0 |
| Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down | May 19, 2025 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Granary: Speech Recognition and Translation Dataset in 25 European Languages | May 19, 2025 | HallucinationPunctuation Restoration | —Unverified | 0 |
| Mitigating Hallucination in VideoLLMs via Temporal-Aware Activation Engineering | May 19, 2025 | Hallucination | —Unverified | 0 |
| LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries | May 19, 2025 | HallucinationRetrieval | CodeCode Available | 0 |
| Detection and Mitigation of Hallucination in Large Reasoning Models: A Mechanistic Perspective | May 19, 2025 | Hallucination | —Unverified | 0 |
| Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice | May 19, 2025 | AllHallucination | —Unverified | 0 |
| Selective Code Generation for Functional Guarantees | May 19, 2025 | Code GenerationHallucination | —Unverified | 0 |
| Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation | May 18, 2025 | Fact CheckingForm | —Unverified | 0 |
| Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models | May 18, 2025 | HallucinationMME | —Unverified | 0 |
| The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models | May 18, 2025 | Hallucination | —Unverified | 0 |
| Mixture of Decoding: An Attention-Inspired Adaptive Decoding Strategy to Mitigate Hallucinations in Large Vision-Language Models | May 17, 2025 | Hallucination | CodeCode Available | 0 |
| Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning? | May 17, 2025 | HallucinationObject Counting | —Unverified | 0 |
| CCNU at SemEval-2025 Task 3: Leveraging Internal and External Knowledge of Large Language Models for Multilingual Hallucination Annotation | May 17, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| Diverging Towards Hallucination: Detection of Failures in Vision-Language Models via Multi-token Aggregation | May 16, 2025 | DiagnosticHallucination | —Unverified | 0 |
| EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models | May 16, 2025 | Hallucination | CodeCode Available | 0 |
| Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning | May 16, 2025 | HallucinationInformation Retrieval | —Unverified | 0 |
| DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented Generation | May 15, 2025 | graph constructionHallucination | CodeCode Available | 0 |
| AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges | May 15, 2025 | AI AgentData Summarization | —Unverified | 0 |
| Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications | May 14, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| The Impact of Large Language Models on Task Automation in Manufacturing Services | May 14, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| Beyond the Black Box: Interpretability of LLMs in Finance | May 14, 2025 | FairnessHallucination | —Unverified | 0 |