| TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes | Mar 26, 2025 | Hallucination | —Unverified | 0 |
| KSHSeek: Data-Driven Approaches to Mitigating and Detecting Knowledge-Shortcut Hallucinations in Generative Models | Mar 25, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| HausaNLP at SemEval-2025 Task 3: Towards a Fine-Grained Model-Aware Hallucination Detection | Mar 25, 2025 | HallucinationNatural Language Inference | —Unverified | 0 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Mar 25, 2025 | Cross-Modal RetrievalHallucination | CodeCode Available | 1 |
| Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation | Mar 25, 2025 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices | Mar 23, 2025 | HallucinationTriviaQA | —Unverified | 0 |
| GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks | Mar 23, 2025 | BenchmarkingHallucination | CodeCode Available | 1 |
| good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval | Mar 22, 2025 | DiversityHallucination | —Unverified | 0 |
| Judge Anything: MLLM as a Judge Across Any Modality | Mar 21, 2025 | Hallucination | —Unverified | 0 |