
Hallucination Evaluation

Evaluate the ability of LLMs to generate text free of hallucinations, or assess their capability to recognize hallucinations.
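
Below is a minimal sketch of the second evaluation mode, scoring how well a model recognizes hallucinations, assuming a labeled set of (statement, is_hallucinated) pairs. All names here are hypothetical; `detect` stands in for any LLM-backed classifier, and real benchmarks typically report richer metrics than plain accuracy.

```python
# Hypothetical sketch: accuracy of an LLM-backed hallucination detector.
# `detect` is a stand-in for any classifier that labels a statement
# as hallucinated (True) or faithful (False).

from typing import Callable

def hallucination_recognition_accuracy(
    examples: list[tuple[str, bool]],   # (statement, is_hallucinated) pairs
    detect: Callable[[str], bool],      # model under test
) -> float:
    """Fraction of statements whose hallucination label is predicted correctly."""
    correct = sum(detect(text) == label for text, label in examples)
    return correct / len(examples)

# Usage with a trivial stand-in detector:
examples = [
    ("The Eiffel Tower is in Paris.", False),
    ("The Eiffel Tower was built in 1650.", True),
]
print(hallucination_recognition_accuracy(examples, detect=lambda s: "1650" in s))
```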

Papers

Showing 11–20 of 49 papers (page 2 of 5)

Title | Status | Hype
Alleviating Hallucinations of Large Language Models through Induced Hallucinations | Code | 1
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation | Code | 1
Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization | Code | 1
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models | Code | 1
Enhancing LLM's Cognition via Structurization | Code | 1
Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards | Code | 1
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data | Code | 1
Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering | Code | 1
Evaluation and Analysis of Hallucination in Large Vision-Language Models | Code | 1
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models | Code | 1

No leaderboard results yet.