| VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images | Aug 28, 2024 | Hallucination | Code Available | 0 |
| Reducing Quantity Hallucinations in Abstractive Summarization | Sep 28, 2020 | Abstractive Text Summarization, Hallucination | Code Available | 0 |
| ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection | Jul 18, 2024 | Cross-Lingual Transfer, Hallucination | Code Available | 0 |
| HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN | Dec 10, 2019 | Action Anticipation, Action Classification | Code Available | 0 |
| Re-Ex: Revising after Explanation Reduces the Factual Errors in LLM Responses | Feb 27, 2024 | Hallucination | Code Available | 0 |
| Hallucination Reduction in Long Input Text Summarization | Sep 28, 2023 | Decoder, Hallucination | Code Available | 0 |
| DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models | Feb 22, 2024 | Hallucination | Code Available | 0 |
| Hallucination, Monofacts, and Miscalibration: An Empirical Investigation | Feb 11, 2025 | Decoder, Hallucination | Code Available | 0 |
| UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM Output | May 5, 2025 | Hallucination | Code Available | 0 |
| UFO: a Unified and Flexible Framework for Evaluating Factuality of Large Language Models | Feb 22, 2024 | Hallucination, Retrieval | Code Available | 0 |