| KNVQA: A Benchmark for evaluation knowledge-based VQA | Nov 21, 2023 | Hallucination, Object Hallucination | Unverified | 0 |
| From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models | Oct 13, 2023 | Hallucination, Image Captioning | Code Available | 2 |
| Ferret: Refer and Ground Anything Anywhere at Any Granularity | Oct 11, 2023 | Hallucination, Language Modeling | Code Available | 5 |
| Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models | Oct 9, 2023 | Hallucination, Object | Unverified | 0 |
| HallE-Control: Controlling Object Hallucination in Large Multimodal Models | Oct 3, 2023 | Attribute, Decoder | Code Available | 1 |
| Analyzing and Mitigating Object Hallucination in Large Vision-Language Models | Oct 1, 2023 | Hallucination, Hallucination Evaluation | Code Available | 1 |
| Detecting and Preventing Hallucinations in Large Vision Language Models | Aug 11, 2023 | 16k, Hallucination | Code Available | 1 |
| TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models | Aug 7, 2023 | Hallucination, Object Hallucination | Code Available | 2 |
| Transferable Decoding with Visual Entities for Zero-Shot Image Captioning | Jul 31, 2023 | Caption Generation, Hallucination | Code Available | 1 |
| LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models | Jun 15, 2023 | Hallucination, Image Captioning | Code Available | 2 |
| Evaluating Object Hallucination in Large Vision-Language Models | May 17, 2023 | Hallucination, Object | Code Available | 2 |
| Simple Token-Level Confidence Improves Caption Correctness | May 11, 2023 | Hallucination, Image Captioning | Unverified | 0 |
| Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training | Oct 14, 2022 | Hallucination, Image Augmentation | Code Available | 0 |
| Deep Learning Approaches on Image Captioning: A Review | Jan 31, 2022 | Caption Generation, Deep Learning | Unverified | 0 |
| Relational Graph Learning for Grounded Video Description Generation | Dec 2, 2021 | Graph Learning, Hallucination | Unverified | 0 |
| Consensus Graph Representation Learning for Better Grounded Image Captioning | Dec 2, 2021 | Graph Representation Learning, Hallucination | Unverified | 0 |
| Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning | Oct 4, 2021 | Hallucination, Image Captioning | Code Available | 1 |
| "I've Seen Things You People Wouldn't Believe": Hallucinating Entities in GuessWhat?! | Aug 1, 2021 | Hallucination, Image Captioning | Unverified | 0 |
| HyperPocket: Generative Point Cloud Completion | Feb 11, 2021 | Hallucination, Object Hallucination | Code Available | 1 |
| Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning Models | Jan 4, 2020 | Hallucination, Image Captioning | Code Available | 0 |
| Object Hallucination in Image Captioning | Sep 6, 2018 | Hallucination, Image Captioning | Code Available | 0 |