| Automated Review Generation Method Based on Large Language Models | Jul 30, 2024 | ArticlesHallucination | CodeCode Available | 1 |
| Enhancing LLM's Cognition via Structurization | Jul 23, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning | Jul 22, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks | Jul 13, 2024 | HallucinationNavigate | CodeCode Available | 1 |
| Multi-Object Hallucination in Vision-Language Models | Jul 8, 2024 | HallucinationObject Hallucination | CodeCode Available | 1 |
| MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Jul 5, 2024 | HallucinationImage Generation | CodeCode Available | 1 |
| MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context | Jul 3, 2024 | HallucinationResponse Generation | CodeCode Available | 1 |
| FineSurE: Fine-grained Summarization Evaluation using LLMs | Jul 1, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models | Jun 30, 2024 | Hallucinationmultimodal interaction | CodeCode Available | 1 |
| GraphArena: Benchmarking Large Language Models on Graph Computational Problems | Jun 29, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models | Jun 28, 2024 | DiagnosticHallucination | CodeCode Available | 1 |
| Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models | Jun 24, 2024 | Hallucination | CodeCode Available | 1 |
| Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models | Jun 24, 2024 | Common Sense ReasoningHallucination | CodeCode Available | 1 |
| Knowledge Graph-Enhanced Large Language Models via Path Selection | Jun 19, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 1 |
| Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding | Jun 18, 2024 | Hallucination | CodeCode Available | 1 |
| Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector | Jun 17, 2024 | 2kHallucination | CodeCode Available | 1 |
| MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts | Jun 17, 2024 | HallucinationMixture-of-Experts | CodeCode Available | 1 |
| MMRel: A Relation Understanding Benchmark in the MLLM Era | Jun 13, 2024 | DiversityHallucination | CodeCode Available | 1 |
| We Have a Package for You! A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs | Jun 12, 2024 | Code GenerationHallucination | CodeCode Available | 1 |
| REAL Sampling: Boosting Factuality and Diversity of Open-Ended Generation via Asymptotic Entropy | Jun 11, 2024 | DiversityHallucination | CodeCode Available | 1 |
| DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Jun 9, 2024 | Common Sense ReasoningDenoising | CodeCode Available | 1 |
| An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models | Jun 7, 2024 | Hallucinationparameter-efficient fine-tuning | CodeCode Available | 1 |
| Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | May 31, 2024 | HallucinationMulti-Task Learning | CodeCode Available | 1 |
| TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models | May 28, 2024 | Hallucination | CodeCode Available | 1 |
| Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization | May 28, 2024 | Hallucination | CodeCode Available | 1 |
| DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception | May 24, 2024 | Hallucination | CodeCode Available | 1 |
| Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs | May 24, 2024 | HallucinationResponse Generation | CodeCode Available | 1 |
| The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG) | May 21, 2024 | HallucinationRAG | CodeCode Available | 1 |
| Automated Multi-level Preference for MLLMs | May 18, 2024 | Dataset GenerationHallucination | CodeCode Available | 1 |
| Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling | May 16, 2024 | Contrastive LearningHallucination | CodeCode Available | 1 |
| THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models | May 8, 2024 | AttributeData Augmentation | CodeCode Available | 1 |
| CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification | Apr 30, 2024 | Code GenerationHallucination | CodeCode Available | 1 |
| LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation | Apr 22, 2024 | HallucinationRAG | CodeCode Available | 1 |
| VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models | Apr 22, 2024 | HallucinationInformativeness | CodeCode Available | 1 |
| Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback | Apr 22, 2024 | AttributeHallucination | CodeCode Available | 1 |
| Exploring the Transferability of Visual Prompting for Multimodal Large Language Models | Apr 17, 2024 | HallucinationMultimodal Reasoning | CodeCode Available | 1 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Apr 17, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | Apr 15, 2024 | BenchmarkingBias Detection | CodeCode Available | 1 |
| Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration | Apr 15, 2024 | Hallucination | CodeCode Available | 1 |
| Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs | Apr 15, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| CuriousLLM: Elevating Multi-Document QA with Reasoning-Infused Knowledge Graph Prompting | Apr 13, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 1 |
| Tackling Structural Hallucination in Image Translation with Local Diffusion | Apr 9, 2024 | HallucinationImage Generation | CodeCode Available | 1 |
| Learning From Correctness Without Prompting Makes LLM Efficient Reasoner | Mar 28, 2024 | Hallucination | CodeCode Available | 1 |
| Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering | Mar 28, 2024 | HallucinationIn-Context Learning | CodeCode Available | 1 |
| JDocQA: Japanese Document Question Answering Dataset for Generative Language Models | Mar 28, 2024 | HallucinationQuestion Answering | CodeCode Available | 1 |
| UrbanVLP: Multi-Granularity Vision-Language Pretraining for Urban Socioeconomic Indicator Prediction | Mar 25, 2024 | HallucinationText Generation | CodeCode Available | 1 |
| Pensieve: Retrospect-then-Compare Mitigates Visual Hallucination | Mar 21, 2024 | HallucinationMME | CodeCode Available | 1 |
| What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-modal Models | Mar 20, 2024 | counterfactualHallucination | CodeCode Available | 1 |
| PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset | Mar 17, 2024 | AttributeCommon Sense Reasoning | CodeCode Available | 1 |
| Circuit Transformer: A Transformer That Preserves Logical Equivalence | Mar 14, 2024 | Hallucination | CodeCode Available | 1 |