| Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications | May 14, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training | May 13, 2025 | HallucinationLarge Language Model | CodeCode Available | 0 |
| Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification | May 13, 2025 | HallucinationRAG | —Unverified | 0 |
| Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation | May 13, 2025 | Event ExtractionHallucination | —Unverified | 0 |
| A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs | May 13, 2025 | HallucinationUncertainty Quantification | CodeCode Available | 1 |
| On the Cost and Benefits of Training Context with Utterance or Full Conversation Training: A Comparative Stud | May 12, 2025 | GPUHallucination | —Unverified | 0 |
| SEReDeEP: Hallucination Detection in Retrieval-Augmented Models via Semantic Entropy and Context-Parameter Fusion | May 12, 2025 | HallucinationRAG | —Unverified | 0 |
| Multimodal Survival Modeling in the Age of Foundation Models | May 12, 2025 | HallucinationSurvival Prediction | CodeCode Available | 0 |
| Critique Before Thinking: Mitigating Hallucination through Rationale-Augmented Instruction Tuning | May 12, 2025 | HallucinationMultimodal Reasoning | —Unverified | 0 |
| TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking | May 11, 2025 | Fact CheckingFew-Shot Learning | —Unverified | 0 |
| Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models | May 11, 2025 | DescriptiveDiagnostic | CodeCode Available | 1 |
| Evolutionary thoughts: integration of large language models and evolutionary algorithms | May 9, 2025 | Evolutionary AlgorithmsHallucination | CodeCode Available | 0 |
| Osiris: A Lightweight Open-Source Hallucination Detection System | May 7, 2025 | HallucinationRAG | —Unverified | 0 |
| Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards | May 7, 2025 | BenchmarkingHallucination | CodeCode Available | 1 |
| Mitigating Image Captioning Hallucinations in Vision-Language Models | May 6, 2025 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Interpretable Zero-shot Learning with Infinite Class Concepts | May 6, 2025 | HallucinationZero-Shot Learning | —Unverified | 0 |
| UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM Output | May 5, 2025 | Hallucination | CodeCode Available | 0 |
| Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering | May 5, 2025 | HallucinationQuestion Answering | CodeCode Available | 1 |
| Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation | May 5, 2025 | Entity DisambiguationHallucination | —Unverified | 0 |
| A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models | May 4, 2025 | AttributeHallucination | —Unverified | 0 |
| SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation | May 4, 2025 | HallucinationText Summarization | —Unverified | 0 |
| Regression is all you need for medical image translation | May 4, 2025 | AllHallucination | CodeCode Available | 0 |
| Multi-agents based User Values Mining for Recommendation | May 2, 2025 | HallucinationRecommendation Systems | —Unverified | 0 |
| VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding | May 2, 2025 | Anomaly DetectionCommon Sense Reasoning | CodeCode Available | 1 |
| Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer | May 2, 2025 | document understandingHallucination | —Unverified | 0 |