| Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation | Jun 3, 2025 | Hallucination | —Unverified | 0 |
| Machine Mirages: Defining the Undefined | Jun 3, 2025 | Causal InferenceHallucination | —Unverified | 0 |
| FlySearch: Exploring how vision-language models explore | Jun 3, 2025 | HallucinationTask Planning | CodeCode Available | 1 |
| Tomographic Foundation Model -- FORCE: Flow-Oriented Reconstruction Conditioning Engine | Jun 2, 2025 | Computed Tomography (CT)Hallucination | —Unverified | 0 |
| TRUST -- Transformer-Driven U-Net for Sparse Target Recovery | Jun 1, 2025 | DecoderHallucination | —Unverified | 0 |
| Generative AI and Organizational Structure in the Knowledge Economy | May 31, 2025 | Hallucination | —Unverified | 0 |
| Measuring Faithfulness and Abstention: An Automated Pipeline for Evaluating LLM-Generated 3-ply Case-Based Legal Arguments | May 31, 2025 | Hallucination | —Unverified | 0 |
| An AI-powered Knowledge Hub for Potato Functional Genomics | May 30, 2025 | AI AgentHallucination | —Unverified | 0 |
| Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs | May 30, 2025 | Fact CheckingHallucination | —Unverified | 0 |
| LLM Inference Enhanced by External Knowledge: A Survey | May 30, 2025 | HallucinationKnowledge Graphs | CodeCode Available | 0 |
| The Hallucination Dilemma: Factuality-Aware Reinforcement Learning for Large Reasoning Models | May 30, 2025 | HallucinationMathematical Reasoning | CodeCode Available | 1 |
| BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models | May 30, 2025 | Hallucination | —Unverified | 0 |
| FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation | May 30, 2025 | Hallucination | CodeCode Available | 2 |
| MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM | May 30, 2025 | HallucinationMultimodal Reasoning | —Unverified | 0 |
| Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model | May 29, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation | May 29, 2025 | FormHallucination | —Unverified | 0 |
| MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration | May 29, 2025 | HallucinationMultimodal Reasoning | CodeCode Available | 0 |
| Are Reasoning Models More Prone to Hallucination? | May 29, 2025 | Hallucination | —Unverified | 0 |
| Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | May 29, 2025 | Decision MakingHallucination | —Unverified | 0 |
| Map&Make: Schema Guided Text to Table Generation | May 29, 2025 | HallucinationInformation Retrieval | —Unverified | 0 |
| Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs | May 29, 2025 | Dimensionality ReductionHallucination | —Unverified | 0 |
| Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information | May 29, 2025 | Hallucination | CodeCode Available | 0 |
| Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks | May 28, 2025 | Hallucination | —Unverified | 0 |
| SkewRoute: Training-Free LLM Routing for Knowledge Graph Retrieval-Augmented Generation via Score Skewness of Retrieved Context | May 28, 2025 | HallucinationRAG | —Unverified | 0 |
| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |