| Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement | Mar 31, 2025 | Hallucination, RAG | Code Available | 2 |
| GPT-NER: Named Entity Recognition via Large Language Models | Apr 20, 2023 | Hallucination, named-entity-recognition | Code Available | 2 |
| FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation | May 30, 2025 | Hallucination | Code Available | 2 |
| FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation | Oct 5, 2023 | Hallucination, World Knowledge | Code Available | 2 |
| HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation | May 19, 2023 | Hallucination, Machine Translation | Code Available | 2 |
| Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph | Jan 24, 2025 | Community Detection, Hallucination | Code Available | 2 |
| A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions | Nov 9, 2023 | Hallucination, Information Retrieval | Code Available | 2 |
| FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" | Sep 30, 2024 | counterfactual, Hallucination | Code Available | 2 |
| CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs | Jan 28, 2025 | Hallucination | Code Available | 2 |
| From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models | Oct 13, 2023 | Hallucination, Image Captioning | Code Available | 2 |