| Mitigating Object Hallucinations via Sentence-Level Early Intervention | Jul 16, 2025 | HallucinationMM-Vet | CodeCode Available | 1 |
| ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way | Jul 11, 2025 | Depth EstimationHallucination | —Unverified | 0 |
| UQLM: A Python Package for Uncertainty Quantification in Large Language Models | Jul 8, 2025 | HallucinationUncertainty Quantification | CodeCode Available | 5 |
| ReLoop: "Seeing Twice and Thinking Backwards" via Closed-loop Training to Mitigate Hallucinations in Multimodal understanding | Jul 7, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| DeepRetro: Retrosynthetic Pathway Discovery using Iterative LLM Reasoning | Jul 7, 2025 | HallucinationLarge Language Model | —Unverified | 0 |
| The Future is Agentic: Definitions, Perspectives, and Open Challenges of Multi-Agent Recommender Systems | Jul 2, 2025 | Explanation GenerationHallucination | —Unverified | 0 |
| GAF-Guard: An Agentic Framework for Risk Management and Governance in Large Language Models | Jul 1, 2025 | HallucinationManagement | CodeCode Available | 0 |
| HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation | Jun 26, 2025 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits Calibration | Jun 26, 2025 | HallucinationText Generation | CodeCode Available | 0 |
| Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models | Jun 25, 2025 | document understandingHallucination | —Unverified | 0 |
| Feature Hallucination for Self-supervised Action Recognition | Jun 25, 2025 | Action RecognitionHallucination | —Unverified | 0 |
| KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality | Jun 24, 2025 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement | Jun 18, 2025 | Graph GenerationHallucination | CodeCode Available | 2 |
| Robust Instant Policy: Leveraging Student's t-Regression Model for Robust In-context Imitation Learning of Robot Manipulation | Jun 18, 2025 | HallucinationImitation Learning | —Unverified | 0 |
| HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models | Jun 18, 2025 | Hallucination | —Unverified | 0 |
| ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM | Jun 17, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Abstract Meaning Representation for Hospital Discharge Summarization | Jun 17, 2025 | Abstract Meaning RepresentationHallucination | CodeCode Available | 0 |
| DREAM: On hallucinations in AI-generated content for nuclear medicine imaging | Jun 16, 2025 | DiagnosticHallucination | —Unverified | 0 |
| Stress-Testing Multimodal Foundation Models for Crystallographic Reasoning | Jun 16, 2025 | HallucinationSpatial Interpolation | CodeCode Available | 0 |
| A Regret Perspective on Online Selective Generation | Jun 16, 2025 | HallucinationLEMMA | —Unverified | 0 |
| VL-GenRM: Enhancing Vision-Language Verification via Vision Experts and Iterative Training | Jun 16, 2025 | HallucinationMultimodal Reasoning | —Unverified | 0 |
| HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs | Jun 16, 2025 | HallucinationKnowledge Distillation | —Unverified | 0 |
| Second Order State Hallucinations for Adversarial Attack Mitigation in Formation Control of Multi-Agent Systems | Jun 14, 2025 | Adversarial AttackHallucination | —Unverified | 0 |
| DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion Models | Jun 13, 2025 | AllHallucination | CodeCode Available | 1 |
| HalLoc: Token-level Localization of Hallucinations for Vision Language Models | Jun 12, 2025 | HallucinationImage Captioning | CodeCode Available | 0 |
| Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers | Jun 12, 2025 | HallucinationOptical Character Recognition (OCR) | —Unverified | 0 |
| Attention Head Embeddings with Trainable Deep Kernels for Hallucination Detection in LLMs | Jun 11, 2025 | Hallucination | —Unverified | 0 |
| Text-Aware Image Restoration with Diffusion Models | Jun 11, 2025 | DenoisingHallucination | —Unverified | 0 |
| Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs | Jun 11, 2025 | Dependency ParsingHallucination | CodeCode Available | 0 |
| ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs | Jun 11, 2025 | Code GenerationDiagnostic | CodeCode Available | 1 |
| Revisit What You See: Disclose Language Prior in Vision Tokens for Efficient Guided Decoding of LVLMs | Jun 11, 2025 | HallucinationObject Hallucination | CodeCode Available | 1 |
| RHealthTwin: Towards Responsible and Multimodal Digital Twins for Personalized Well-being | Jun 10, 2025 | HallucinationInstruction Following | —Unverified | 0 |
| SECOND: Mitigating Perceptual Hallucination in Vision-Language Models via Selective and Contrastive Decoding | Jun 10, 2025 | HallucinationObject Hallucination | CodeCode Available | 0 |
| MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models | Jun 9, 2025 | DiagnosticHallucination | CodeCode Available | 1 |
| Uncertainty-o: One Model-agnostic Framework for Unveiling Uncertainty in Large Multimodal Models | Jun 9, 2025 | Hallucination | —Unverified | 0 |
| MEMOIR: Lifelong Model Editing with Minimal Overwrite and Informed Retention for LLMs | Jun 9, 2025 | HallucinationModel Editing | —Unverified | 0 |
| ARGUS: Hallucination and Omission Evaluation in Video-LLMs | Jun 9, 2025 | DescriptiveForm | —Unverified | 0 |
| Conservative Bias in Large Language Models: Measuring Relation Predictions | Jun 9, 2025 | HallucinationRelation | —Unverified | 0 |
| Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning | Jun 8, 2025 | AttributeHallucination | —Unverified | 0 |
| Reducing Object Hallucination in Large Audio-Language Models via Audio-Aware Decoding | Jun 8, 2025 | HallucinationObject Hallucination | —Unverified | 0 |
| QuantMCP: Grounding Large Language Models in Verifiable Financial Reality | Jun 7, 2025 | Decision MakingFinancial Analysis | —Unverified | 0 |
| Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models | Jun 5, 2025 | DiagnosticHallucination | CodeCode Available | 1 |
| Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification | Jun 5, 2025 | Automated Theorem ProvingHallucination | CodeCode Available | 1 |
| CLATTER: Comprehensive Entailment Reasoning for Hallucination Detection | Jun 5, 2025 | HallucinationNatural Language Inference | —Unverified | 0 |
| When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models | Jun 5, 2025 | HallucinationMisinformation | —Unverified | 0 |
| GOLFer: Smaller LM-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information Retrieval | Jun 5, 2025 | HallucinationInformation Retrieval | CodeCode Available | 0 |
| On the Fundamental Impossibility of Hallucination Control in Large Language Models | Jun 4, 2025 | Hallucination | —Unverified | 0 |
| CHIME: Conditional Hallucination and Integrated Multi-scale Enhancement for Time Series Diffusion Model | Jun 4, 2025 | DenoisingHallucination | —Unverified | 0 |
| OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis | Jun 4, 2025 | Action GenerationDecision Making | CodeCode Available | 1 |
| Magic Mushroom: A Customizable Benchmark for Fine-grained Analysis of Retrieval Noise Erosion in RAG Systems | Jun 4, 2025 | DenoisingHallucination | —Unverified | 0 |