Hallucination Evaluation

Evaluate the ability of LLMs to generate hallucination-free text, or assess their capability to recognize hallucinations.
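
For intuition, here is a minimal, hypothetical sketch of the second kind of evaluation: flagging an answer as hallucinated when an off-the-shelf NLI model does not find it entailed by its source passage. The is_hallucinated helper and the entailment threshold are illustrative assumptions, not a protocol from any paper listed below.

from transformers import pipeline

# Illustrative sketch: treat an answer as hallucinated when an NLI model
# does not classify it as entailed by the source passage. Model choice and
# threshold are assumptions for demonstration, not a published method.
nli = pipeline("text-classification", model="microsoft/deberta-large-mnli")

def is_hallucinated(source: str, answer: str, threshold: float = 0.5) -> bool:
    """Return True if `answer` is not entailed by `source`."""
    result = nli({"text": source, "text_pair": answer})[0]
    # deberta-large-mnli labels: CONTRADICTION, NEUTRAL, ENTAILMENT
    return not (result["label"] == "ENTAILMENT" and result["score"] >= threshold)

source = "The Eiffel Tower was completed in 1889 in Paris."
print(is_hallucinated(source, "The Eiffel Tower opened in 1889."))  # expected: False
print(is_hallucinated(source, "The Eiffel Tower is in Berlin."))    # expected: True

NLI-based checks like this only cover hallucinations that contradict or go beyond a given source; benchmarks in the list below also evaluate open-ended generation and multimodal settings.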

Papers

Showing 41–49 of 49 papers

GraphEval: A Knowledge-Graph Based LLM Hallucination Evaluation Framework
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models
TextSquare: Scaling up Text-Centric Visual Instruction Tuning
HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation
Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs
TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs

Leaderboard

No leaderboard results yet.