| Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding | May 22, 2025 | Causal InferenceHallucination | —Unverified | 0 |
| UNCLE: Uncertainty Expressions in Long-Form Generation | May 22, 2025 | 4kForm | —Unverified | 0 |
| AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models | May 22, 2025 | BenchmarkingFairness | CodeCode Available | 3 |
| Walk&Retrieve: Simple Yet Effective Zero-shot Retrieval-Augmented Generation via Knowledge Graph Walks | May 22, 2025 | HallucinationRAG | CodeCode Available | 0 |
| NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction | May 21, 2025 | BenchmarkingHallucination | —Unverified | 0 |
| OViP: Online Vision-Language Preference Learning | May 21, 2025 | Hallucination | —Unverified | 0 |
| Aug2Search: Enhancing Facebook Marketplace Search with LLM-Generated Synthetic Data Augmentation | May 21, 2025 | Data AugmentationDiversity | —Unverified | 0 |
| Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization | May 21, 2025 | Document SummarizationHallucination | —Unverified | 0 |
| Multilingual Prompting for Improving LLM Generation Diversity | May 21, 2025 | DiversityHallucination | —Unverified | 0 |
| KaFT: Knowledge-aware Fine-tuning for Boosting LLMs' Domain-specific Question-Answering Performance | May 21, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| RePPL: Recalibrating Perplexity by Uncertainty in Semantic Propagation and Language Generation for Explainable QA Hallucination Detection | May 21, 2025 | HallucinationText Generation | —Unverified | 0 |
| HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving | May 21, 2025 | Autonomous DrivingHallucination | —Unverified | 0 |
| Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization | May 20, 2025 | HallucinationIn-Context Learning | —Unverified | 0 |
| Foundations of Unknown-aware Machine Learning | May 20, 2025 | Hallucination | —Unverified | 0 |
| Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | May 20, 2025 | Anomaly DetectionDescriptive | —Unverified | 0 |
| Visual Instruction Bottleneck Tuning | May 20, 2025 | HallucinationObject Hallucination | —Unverified | 0 |
| JARVIS: A Multi-Agent Code Assistant for High-Quality EDA Script Generation | May 20, 2025 | HallucinationScript Generation | —Unverified | 0 |
| The Hallucination Tax of Reinforcement Finetuning | May 20, 2025 | HallucinationMath | —Unverified | 0 |
| Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models | May 20, 2025 | Hallucinationscientific discovery | CodeCode Available | 0 |
| DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning | May 20, 2025 | HallucinationMathematical Reasoning | CodeCode Available | 5 |
| MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations | May 20, 2025 | Fact CheckingHallucination | CodeCode Available | 0 |
| Towards Omnidirectional Reasoning with 360-R1: A Dataset, Benchmark, and GRPO-based Method | May 20, 2025 | HallucinationObject Localization | —Unverified | 0 |
| Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents | May 20, 2025 | Hallucination | —Unverified | 0 |
| Aligning Attention Distribution to Information Flow for Hallucination Mitigation in Large Vision-Language Models | May 20, 2025 | HallucinationImage Captioning | —Unverified | 0 |
| Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis | May 20, 2025 | Hallucination | —Unverified | 0 |