| DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception | May 24, 2024 | Hallucination | CodeCode Available | 1 |
| Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs | May 24, 2024 | HallucinationResponse Generation | CodeCode Available | 1 |
| The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG) | May 21, 2024 | HallucinationRAG | CodeCode Available | 1 |
| Automated Multi-level Preference for MLLMs | May 18, 2024 | Dataset GenerationHallucination | CodeCode Available | 1 |
| Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling | May 16, 2024 | Contrastive LearningHallucination | CodeCode Available | 1 |
| THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models | May 8, 2024 | AttributeData Augmentation | CodeCode Available | 1 |
| CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification | Apr 30, 2024 | Code GenerationHallucination | CodeCode Available | 1 |
| LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation | Apr 22, 2024 | HallucinationRAG | CodeCode Available | 1 |
| VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models | Apr 22, 2024 | HallucinationInformativeness | CodeCode Available | 1 |
| Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback | Apr 22, 2024 | AttributeHallucination | CodeCode Available | 1 |
| Exploring the Transferability of Visual Prompting for Multimodal Large Language Models | Apr 17, 2024 | HallucinationMultimodal Reasoning | CodeCode Available | 1 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Apr 17, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | Apr 15, 2024 | BenchmarkingBias Detection | CodeCode Available | 1 |
| Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration | Apr 15, 2024 | Hallucination | CodeCode Available | 1 |
| Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs | Apr 15, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| CuriousLLM: Elevating Multi-Document QA with Reasoning-Infused Knowledge Graph Prompting | Apr 13, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 1 |
| Tackling Structural Hallucination in Image Translation with Local Diffusion | Apr 9, 2024 | HallucinationImage Generation | CodeCode Available | 1 |
| Learning From Correctness Without Prompting Makes LLM Efficient Reasoner | Mar 28, 2024 | Hallucination | CodeCode Available | 1 |
| Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering | Mar 28, 2024 | HallucinationIn-Context Learning | CodeCode Available | 1 |
| JDocQA: Japanese Document Question Answering Dataset for Generative Language Models | Mar 28, 2024 | HallucinationQuestion Answering | CodeCode Available | 1 |
| UrbanVLP: Multi-Granularity Vision-Language Pretraining for Urban Socioeconomic Indicator Prediction | Mar 25, 2024 | HallucinationText Generation | CodeCode Available | 1 |
| Pensieve: Retrospect-then-Compare Mitigates Visual Hallucination | Mar 21, 2024 | HallucinationMME | CodeCode Available | 1 |
| What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-modal Models | Mar 20, 2024 | counterfactualHallucination | CodeCode Available | 1 |
| PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset | Mar 17, 2024 | AttributeCommon Sense Reasoning | CodeCode Available | 1 |
| Circuit Transformer: A Transformer That Preserves Logical Equivalence | Mar 14, 2024 | Hallucination | CodeCode Available | 1 |