| Do Language Models Know When They're Hallucinating References? | May 29, 2023 | HallucinationLanguage Modeling | CodeCode Available | 0 | 5 |
| MedScore: Factuality Evaluation of Free-Form Medical Answers | May 24, 2025 | FormHallucination | CodeCode Available | 0 | 5 |
| MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA | Dec 19, 2023 | Document ClassificationHallucination | CodeCode Available | 0 | 5 |
| BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science | Jun 29, 2024 | AI AgentClaim Verification | CodeCode Available | 0 | 5 |
| Diving Deep into Modes of Fact Hallucinations in Dialogue Systems | Jan 11, 2023 | Hallucination | CodeCode Available | 0 | 5 |
| Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Mar 27, 2024 | AttributeDiagnostic | CodeCode Available | 0 | 5 |
| MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Feb 28, 2025 | Decision MakingHallucination | CodeCode Available | 0 | 5 |
| Mitigating Hallucination in Fictional Character Role-Play | Jun 25, 2024 | HallucinationWorld Knowledge | CodeCode Available | 0 | 5 |
| Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling | May 1, 2024 | HallucinationTopic Classification | CodeCode Available | 0 | 5 |
| Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Jul 23, 2024 | HallucinationMachine Translation | CodeCode Available | 0 | 5 |