Hallucination Evaluation

Evaluates the ability of LLMs to generate non-hallucinated text, or assesses the capability of LLMs to recognize hallucinations.
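
As a concrete illustration of the second setting (recognition), below is a minimal sketch of a hallucination-recognition metric. The dataset items, the naive_judge, and the hallucination_recognition_accuracy helper are all hypothetical, for illustration only, and not drawn from any paper listed here.

```python
# Minimal sketch: measure how well a judge model recognizes hallucinated
# claims against reference evidence. All names here are illustrative.

from typing import Callable, List, Tuple

def hallucination_recognition_accuracy(
    items: List[Tuple[str, str, bool]],     # (claim, evidence, is_hallucinated)
    judge: Callable[[str, str], bool],      # model under test: True = "hallucinated"
) -> float:
    """Fraction of claims whose hallucination label the judge predicts correctly."""
    correct = sum(judge(claim, evidence) == label for claim, evidence, label in items)
    return correct / len(items)

if __name__ == "__main__":
    # Toy judge: flags a claim as hallucinated unless it appears verbatim
    # in the evidence. A real evaluation would use an LLM or NLI model here.
    naive_judge = lambda claim, evidence: claim not in evidence

    data = [
        ("Paris is the capital of France.", "Paris is the capital of France.", False),
        ("The Eiffel Tower is in Rome.", "The Eiffel Tower is in Paris.", True),
    ]
    print(f"accuracy = {hallucination_recognition_accuracy(data, naive_judge):.2f}")
```

The string-containment judge is a stand-in; the point is only the shape of the evaluation loop, which the benchmarks below instantiate with stronger judges and curated labels.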

Papers

Showing 31–40 of 49 papers

Title | Status | Hype
Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models | Code | 0
Lynx: An Open Source Hallucination Evaluation Model | – | 0
A Survey of Hallucination in Large Visual Language Models | – | 0
FIHA: Autonomous Hallucination Evaluation in Vision-Language Models with Davidson Scene Graphs | – | 0
Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization | – | 0
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems | – | 0
Mitigating Image Captioning Hallucinations in Vision-Language Models | – | 0
Do Androids Know They're Only Dreaming of Electric Sheep? | – | 0
Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best? | – | 0
Exploring and Evaluating Hallucinations in LLM-Powered Code Generation | – | 0

Leaderboard

No leaderboard results yet.