| OViP: Online Vision-Language Preference Learning | May 21, 2025 | Hallucination | —Unverified | 0 |
| Visual Instruction Bottleneck Tuning | May 20, 2025 | HallucinationObject Hallucination | —Unverified | 0 |
| MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations | May 20, 2025 | Fact CheckingHallucination | CodeCode Available | 0 |
| Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models | May 20, 2025 | Hallucinationscientific discovery | CodeCode Available | 0 |
| Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | May 20, 2025 | Anomaly DetectionDescriptive | —Unverified | 0 |
| Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents | May 20, 2025 | Hallucination | —Unverified | 0 |
| Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization | May 20, 2025 | HallucinationIn-Context Learning | —Unverified | 0 |
| The Hallucination Tax of Reinforcement Finetuning | May 20, 2025 | HallucinationMath | —Unverified | 0 |
| Foundations of Unknown-aware Machine Learning | May 20, 2025 | Hallucination | —Unverified | 0 |
| Plane Geometry Problem Solving with Multi-modal Reasoning: A Survey | May 20, 2025 | DecoderGeometry Problem Solving | —Unverified | 0 |