Hallucination

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–275 of 1816 papers

Title	Date	Tasks	Status	Hype
Automated Review Generation Method Based on Large Language Models	Jul 30, 2024	ArticlesHallucination	CodeCode Available	1
Enhancing LLM's Cognition via Structurization	Jul 23, 2024	HallucinationHallucination Evaluation	CodeCode Available	1
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning	Jul 22, 2024	BenchmarkingHallucination	CodeCode Available	1
Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks	Jul 13, 2024	HallucinationNavigate	CodeCode Available	1
Multi-Object Hallucination in Vision-Language Models	Jul 8, 2024	HallucinationObject Hallucination	CodeCode Available	1
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?	Jul 5, 2024	HallucinationImage Generation	CodeCode Available	1
MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context	Jul 3, 2024	HallucinationResponse Generation	CodeCode Available	1
FineSurE: Fine-grained Summarization Evaluation using LLMs	Jul 1, 2024	BenchmarkingHallucination	CodeCode Available	1
Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models	Jun 30, 2024	Hallucinationmultimodal interaction	CodeCode Available	1
GraphArena: Benchmarking Large Language Models on Graph Computational Problems	Jun 29, 2024	BenchmarkingHallucination	CodeCode Available	1
ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models	Jun 28, 2024	DiagnosticHallucination	CodeCode Available	1
Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models	Jun 24, 2024	Hallucination	CodeCode Available	1
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models	Jun 24, 2024	Common Sense ReasoningHallucination	CodeCode Available	1
Knowledge Graph-Enhanced Large Language Models via Path Selection	Jun 19, 2024	HallucinationKnowledge Graphs	CodeCode Available	1
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding	Jun 18, 2024	Hallucination	CodeCode Available	1
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector	Jun 17, 2024	2kHallucination	CodeCode Available	1
MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts	Jun 17, 2024	HallucinationMixture-of-Experts	CodeCode Available	1
MMRel: A Relation Understanding Benchmark in the MLLM Era	Jun 13, 2024	DiversityHallucination	CodeCode Available	1
We Have a Package for You! A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs	Jun 12, 2024	Code GenerationHallucination	CodeCode Available	1
REAL Sampling: Boosting Factuality and Diversity of Open-Ended Generation via Asymptotic Entropy	Jun 11, 2024	DiversityHallucination	CodeCode Available	1
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation	Jun 9, 2024	Common Sense ReasoningDenoising	CodeCode Available	1
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models	Jun 7, 2024	Hallucinationparameter-efficient fine-tuning	CodeCode Available	1
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training	May 31, 2024	HallucinationMulti-Task Learning	CodeCode Available	1
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models	May 28, 2024	Hallucination	CodeCode Available	1
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization	May 28, 2024	Hallucination	CodeCode Available	1

Show:10 25 50

← PrevPage 11 of 73Next →

No leaderboard results yet.