| Automated Review Generation Method Based on Large Language Models | Jul 30, 2024 | ArticlesHallucination | CodeCode Available | 1 |
| Enhancing LLM's Cognition via Structurization | Jul 23, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 1 |
| HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning | Jul 22, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks | Jul 13, 2024 | HallucinationNavigate | CodeCode Available | 1 |
| Multi-Object Hallucination in Vision-Language Models | Jul 8, 2024 | HallucinationObject Hallucination | CodeCode Available | 1 |
| MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Jul 5, 2024 | HallucinationImage Generation | CodeCode Available | 1 |
| MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context | Jul 3, 2024 | HallucinationResponse Generation | CodeCode Available | 1 |
| FineSurE: Fine-grained Summarization Evaluation using LLMs | Jul 1, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models | Jun 30, 2024 | Hallucinationmultimodal interaction | CodeCode Available | 1 |
| GraphArena: Benchmarking Large Language Models on Graph Computational Problems | Jun 29, 2024 | BenchmarkingHallucination | CodeCode Available | 1 |
| ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models | Jun 28, 2024 | DiagnosticHallucination | CodeCode Available | 1 |
| Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models | Jun 24, 2024 | Hallucination | CodeCode Available | 1 |
| Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models | Jun 24, 2024 | Common Sense ReasoningHallucination | CodeCode Available | 1 |
| Knowledge Graph-Enhanced Large Language Models via Path Selection | Jun 19, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 1 |
| Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding | Jun 18, 2024 | Hallucination | CodeCode Available | 1 |
| Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector | Jun 17, 2024 | 2kHallucination | CodeCode Available | 1 |
| MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts | Jun 17, 2024 | HallucinationMixture-of-Experts | CodeCode Available | 1 |
| MMRel: A Relation Understanding Benchmark in the MLLM Era | Jun 13, 2024 | DiversityHallucination | CodeCode Available | 1 |
| We Have a Package for You! A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs | Jun 12, 2024 | Code GenerationHallucination | CodeCode Available | 1 |
| REAL Sampling: Boosting Factuality and Diversity of Open-Ended Generation via Asymptotic Entropy | Jun 11, 2024 | DiversityHallucination | CodeCode Available | 1 |
| DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Jun 9, 2024 | Common Sense ReasoningDenoising | CodeCode Available | 1 |
| An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models | Jun 7, 2024 | Hallucinationparameter-efficient fine-tuning | CodeCode Available | 1 |
| Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | May 31, 2024 | HallucinationMulti-Task Learning | CodeCode Available | 1 |
| TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models | May 28, 2024 | Hallucination | CodeCode Available | 1 |
| Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization | May 28, 2024 | Hallucination | CodeCode Available | 1 |