| Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models | May 11, 2025 | DescriptiveDiagnostic | CodeCode Available | 1 |
| Evolutionary thoughts: integration of large language models and evolutionary algorithms | May 9, 2025 | Evolutionary AlgorithmsHallucination | CodeCode Available | 0 |
| Osiris: A Lightweight Open-Source Hallucination Detection System | May 7, 2025 | HallucinationRAG | —Unverified | 0 |
| Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards | May 7, 2025 | BenchmarkingHallucination | CodeCode Available | 1 |
| Interpretable Zero-shot Learning with Infinite Class Concepts | May 6, 2025 | HallucinationZero-Shot Learning | —Unverified | 0 |
| Mitigating Image Captioning Hallucinations in Vision-Language Models | May 6, 2025 | HallucinationHallucination Evaluation | —Unverified | 0 |
| UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM Output | May 5, 2025 | Hallucination | CodeCode Available | 0 |
| Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering | May 5, 2025 | HallucinationQuestion Answering | CodeCode Available | 1 |
| Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation | May 5, 2025 | Entity DisambiguationHallucination | —Unverified | 0 |
| A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models | May 4, 2025 | AttributeHallucination | —Unverified | 0 |