| Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization | May 20, 2025 | HallucinationIn-Context Learning | —Unverified | 0 |
| Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis | May 20, 2025 | Hallucination | —Unverified | 0 |
| Visual Instruction Bottleneck Tuning | May 20, 2025 | HallucinationObject Hallucination | —Unverified | 0 |
| Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models | May 20, 2025 | Hallucinationscientific discovery | CodeCode Available | 0 |
| Mitigating Hallucination in VideoLLMs via Temporal-Aware Activation Engineering | May 19, 2025 | Hallucination | —Unverified | 0 |
| Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down | May 19, 2025 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries | May 19, 2025 | HallucinationRetrieval | CodeCode Available | 0 |
| Detection and Mitigation of Hallucination in Large Reasoning Models: A Mechanistic Perspective | May 19, 2025 | Hallucination | —Unverified | 0 |
| Granary: Speech Recognition and Translation Dataset in 25 European Languages | May 19, 2025 | HallucinationPunctuation Restoration | —Unverified | 0 |
| Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice | May 19, 2025 | AllHallucination | —Unverified | 0 |
| Selective Code Generation for Functional Guarantees | May 19, 2025 | Code GenerationHallucination | —Unverified | 0 |
| Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation | May 18, 2025 | Fact CheckingForm | —Unverified | 0 |
| Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models | May 18, 2025 | HallucinationMME | —Unverified | 0 |
| The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models | May 18, 2025 | Hallucination | —Unverified | 0 |
| Mixture of Decoding: An Attention-Inspired Adaptive Decoding Strategy to Mitigate Hallucinations in Large Vision-Language Models | May 17, 2025 | Hallucination | CodeCode Available | 0 |
| Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning? | May 17, 2025 | HallucinationObject Counting | —Unverified | 0 |
| CCNU at SemEval-2025 Task 3: Leveraging Internal and External Knowledge of Large Language Models for Multilingual Hallucination Annotation | May 17, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| Diverging Towards Hallucination: Detection of Failures in Vision-Language Models via Multi-token Aggregation | May 16, 2025 | DiagnosticHallucination | —Unverified | 0 |
| EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models | May 16, 2025 | Hallucination | CodeCode Available | 0 |
| Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning | May 16, 2025 | HallucinationInformation Retrieval | —Unverified | 0 |
| DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented Generation | May 15, 2025 | graph constructionHallucination | CodeCode Available | 0 |
| AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges | May 15, 2025 | AI AgentData Summarization | —Unverified | 0 |
| The Impact of Large Language Models on Task Automation in Manufacturing Services | May 14, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| Beyond the Black Box: Interpretability of LLMs in Finance | May 14, 2025 | FairnessHallucination | —Unverified | 0 |
| A Multimodal Multi-Agent Framework for Radiology Report Generation | May 14, 2025 | DiagnosticHallucination | —Unverified | 0 |
| Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications | May 14, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation | May 13, 2025 | Event ExtractionHallucination | —Unverified | 0 |
| Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training | May 13, 2025 | HallucinationLarge Language Model | CodeCode Available | 0 |
| Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification | May 13, 2025 | HallucinationRAG | —Unverified | 0 |
| On the Cost and Benefits of Training Context with Utterance or Full Conversation Training: A Comparative Stud | May 12, 2025 | GPUHallucination | —Unverified | 0 |
| SEReDeEP: Hallucination Detection in Retrieval-Augmented Models via Semantic Entropy and Context-Parameter Fusion | May 12, 2025 | HallucinationRAG | —Unverified | 0 |
| Critique Before Thinking: Mitigating Hallucination through Rationale-Augmented Instruction Tuning | May 12, 2025 | HallucinationMultimodal Reasoning | —Unverified | 0 |
| Multimodal Survival Modeling in the Age of Foundation Models | May 12, 2025 | HallucinationSurvival Prediction | CodeCode Available | 0 |
| TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking | May 11, 2025 | Fact CheckingFew-Shot Learning | —Unverified | 0 |
| Evolutionary thoughts: integration of large language models and evolutionary algorithms | May 9, 2025 | Evolutionary AlgorithmsHallucination | CodeCode Available | 0 |
| Osiris: A Lightweight Open-Source Hallucination Detection System | May 7, 2025 | HallucinationRAG | —Unverified | 0 |
| Interpretable Zero-shot Learning with Infinite Class Concepts | May 6, 2025 | HallucinationZero-Shot Learning | —Unverified | 0 |
| Mitigating Image Captioning Hallucinations in Vision-Language Models | May 6, 2025 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation | May 5, 2025 | Entity DisambiguationHallucination | —Unverified | 0 |
| UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM Output | May 5, 2025 | Hallucination | CodeCode Available | 0 |
| SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation | May 4, 2025 | HallucinationText Summarization | —Unverified | 0 |
| A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models | May 4, 2025 | AttributeHallucination | —Unverified | 0 |
| Regression is all you need for medical image translation | May 4, 2025 | AllHallucination | CodeCode Available | 0 |
| Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer | May 2, 2025 | document understandingHallucination | —Unverified | 0 |
| Multi-agents based User Values Mining for Recommendation | May 2, 2025 | HallucinationRecommendation Systems | —Unverified | 0 |
| SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation | May 1, 2025 | HallucinationNavigate | CodeCode Available | 0 |
| HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection | May 1, 2025 | Extractive Question-AnsweringHallucination | —Unverified | 0 |
| Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models | May 1, 2025 | Hallucination | —Unverified | 0 |
| Efficient and robust 3D blind harmonization for large domain gaps | Apr 30, 2025 | HallucinationImage Harmonization | —Unverified | 0 |
| Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models | Apr 30, 2025 | HallucinationObject | —Unverified | 0 |