| Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models | Apr 19, 2025 | Adversarial Attack, Adversarial Defense | Unverified | 0 |
| Multi-Stage Retrieval for Operational Technology Cybersecurity Compliance Using Large Language Models: A Railway Casestudy | Apr 18, 2025 | Hallucination, Logical Reasoning | Unverified | 0 |
| Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations | Apr 18, 2025 | Hallucination | Code Available | 1 |
| Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training | Apr 17, 2025 | Caption Generation, Hallucination | Unverified | 0 |
| Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations | Apr 17, 2025 | Decoder, Hallucination | Code Available | 0 |
| VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models | Apr 17, 2025 | Hallucination, Video Understanding | Code Available | 1 |
| Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation | Apr 17, 2025 | Hallucination, In-Context Learning | Unverified | 0 |
| Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling | Apr 17, 2025 | Hallucination | Code Available | 2 |
| QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning? | Apr 17, 2025 | Hallucination, Multi-agent Reinforcement Learning | Unverified | 0 |
| Naming is framing: How cybersecurity's language problems are repeating in AI governance | Apr 16, 2025 | Hallucination | Unverified | 0 |
| SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration Mistakes | Apr 16, 2025 | Hallucination | Code Available | 0 |
| Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization | Apr 16, 2025 | Hallucination, Question Answering | Unverified | 0 |
| Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models - | Apr 16, 2025 | Hallucination | Unverified | 0 |
| Purposefully Induced Psychosis (PIP): Embracing Hallucination as Imagination in Large Language Models | Apr 16, 2025 | Ethics, Hallucination | Unverified | 0 |
| Hallucination-Aware Generative Pretrained Transformer for Cooperative Aerial Mobility Control | Apr 15, 2025 | Hallucination, Reinforcement Learning (RL) | Unverified | 0 |
| From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs | Apr 15, 2025 | Hallucination, Question Answering | Unverified | 0 |
| Hallucination Detection in LLMs via Topological Divergence on Attention Graphs | Apr 14, 2025 | Hallucination, Question Answering | Unverified | 0 |
| The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination | Apr 14, 2025 | Hallucination | Code Available | 1 |
| EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control | Apr 14, 2025 | Hallucination | Code Available | 1 |
| The Future of MLLM Prompting is Adaptive: A Comprehensive Experimental Evaluation of Prompt Engineering Methods for Robust Multimodal Performance | Apr 14, 2025 | Code Generation, Hallucination | Unverified | 0 |
| Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection | Apr 13, 2025 | Answer Selection, Automated Theorem Proving | Unverified | 0 |
| DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers | Apr 13, 2025 | Hallucination, Speech Enhancement | Unverified | 0 |
| HalluShift: Measuring Distribution Shifts towards Hallucination Detection in LLMs | Apr 13, 2025 | Hallucination, Misinformation | Code Available | 0 |
| SynthTRIPs: A Knowledge-Grounded Framework for Benchmark Query Generation for Personalized Tourism Recommenders | Apr 12, 2025 | Hallucination, Recommendation Systems | Unverified | 0 |
| The Other Side of the Coin: Exploring Fairness in Retrieval-Augmented Generation | Apr 11, 2025 | Fairness, Hallucination | Code Available | 0 |