| MAVEN-Fact: A Large-scale Event Factuality Detection Dataset | Jul 22, 2024 | Hallucination | CodeCode Available | 0 | 5 |
| Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors | Jun 18, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 | 5 |
| Low to High Dimensional Modality Hallucination using Aggregated Fields of View | Jul 13, 2020 | HallucinationVocal Bursts Intensity Prediction | CodeCode Available | 0 | 5 |
| LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Mar 6, 2025 | BenchmarkingCommon Sense Reasoning | CodeCode Available | 0 | 5 |
| MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Feb 28, 2025 | Decision MakingHallucination | CodeCode Available | 0 | 5 |
| LLMs and Memorization: On Quality and Specificity of Copyright Compliance | May 28, 2024 | HallucinationMemorization | CodeCode Available | 0 | 5 |
| Analogy Models for Neural Word Inflection | Dec 1, 2020 | HallucinationLEMMA | CodeCode Available | 0 | 5 |
| Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models | Dec 9, 2024 | Hallucination | CodeCode Available | 0 | 5 |
| LLM Inference Enhanced by External Knowledge: A Survey | May 30, 2025 | HallucinationKnowledge Graphs | CodeCode Available | 0 | 5 |
| Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem | Mar 6, 2024 | BenchmarkingHallucination | CodeCode Available | 0 | 5 |