| AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation | Nov 13, 2023 | AttributeHallucination | CodeCode Available | 1 |
| Advancing TTP Analysis: Harnessing the Power of Large Language Models with Retrieval Augmented Generation | Dec 30, 2023 | DecoderHallucination | CodeCode Available | 1 |
| 3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior | Mar 31, 2020 | 3D Semantic Scene Completion3D Semantic Scene Completion from a single RGB image | CodeCode Available | 1 |
| FlySearch: Exploring how vision-language models explore | Jun 3, 2025 | HallucinationTask Planning | CodeCode Available | 1 |
| ADeLA: Automatic Dense Labeling with Attention for Viewpoint Adaptation in Semantic Segmentation | Jul 29, 2021 | Domain AdaptationHallucination | CodeCode Available | 1 |
| Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations | Feb 10, 2024 | DiagnosticHallucination | CodeCode Available | 1 |
| Generating Natural Language Proofs with Verifier-Guided Search | May 25, 2022 | Hallucinationvalid | CodeCode Available | 1 |
| FineSurE: Fine-grained Summarization Evaluation using LLMs | Jul 1, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models | Jun 7, 2024 | Hallucinationparameter-efficient fine-tuning | CodeCode Available | 1 |
| Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented Generation | May 16, 2025 | HallucinationRAG | CodeCode Available | 1 |