| Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers | Mar 4, 2025 | Hallucination | CodeCode Available | 0 |
| SAFE: A Sparse Autoencoder-Based Framework for Robust Query Enrichment and Hallucination Mitigation in LLMs | Mar 4, 2025 | Hallucination | —Unverified | 0 |
| WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation | Mar 4, 2025 | Hallucination | CodeCode Available | 2 |
| MCiteBench: A Multimodal Benchmark for Generating Text with Citations | Mar 4, 2025 | HallucinationText Generation | CodeCode Available | 0 |
| Adaptively profiling models with task elicitation | Mar 3, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Explainable Depression Detection in Clinical Interviews with Personalized Retrieval-Augmented Generation | Mar 3, 2025 | Depression DetectionHallucination | —Unverified | 0 |
| Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization | Mar 3, 2025 | HallucinationHallucination Evaluation | CodeCode Available | 0 |
| LLM-Advisor: An LLM Benchmark for Cost-efficient Path Planning across Multiple Terrains | Mar 3, 2025 | Common Sense ReasoningHallucination | —Unverified | 0 |
| Tackling Hallucination from Conditional Models for Medical Image Reconstruction with DynamicDPS | Mar 3, 2025 | HallucinationImage Reconstruction | —Unverified | 0 |
| NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SeflCheckGPT | Mar 2, 2025 | Hallucination | CodeCode Available | 0 |