
Hallucination Evaluation

Evaluate the ability of LLMs to generate non-hallucinated text, or assess their capability to recognize hallucinations.
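In the recognition setting, a benchmark such as HaluEval typically shows a model a statement labeled as hallucinated or faithful and scores how often the model's verdict matches that label. The sketch below illustrates only that scoring loop; the `keyword_judge` function and the three example records are illustrative placeholders (a real evaluation would prompt an LLM and parse its yes/no answer), not part of any specific benchmark listed here.

```python
from typing import Callable, Iterable, Tuple

# Each record: (text shown to the judge, gold label: True if it contains a hallucination).
# These examples are placeholders, not drawn from any benchmark.
EXAMPLES = [
    ("The Eiffel Tower is located in Paris, France.", False),
    ("The Eiffel Tower was completed in 1789 by Isaac Newton.", True),
    ("Water boils at 100 degrees Celsius at standard atmospheric pressure.", False),
]

def evaluate_recognition(
    judge: Callable[[str], bool],
    records: Iterable[Tuple[str, bool]],
) -> float:
    """Return the fraction of records where the judge's hallucination
    verdict (True = hallucinated) agrees with the gold label."""
    records = list(records)
    correct = sum(judge(text) == label for text, label in records)
    return correct / len(records)

def keyword_judge(text: str) -> bool:
    """Stand-in for an LLM judge; flags the one fabricated example above."""
    return "Isaac Newton" in text

if __name__ == "__main__":
    print(f"recognition accuracy: {evaluate_recognition(keyword_judge, EXAMPLES):.2f}")
```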

Papers

Showing 41-49 of 49 papers

Title | Status | Hype
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data | Code | 1
Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization | Code | 1
AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation | Code | 1
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Code | 2
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks | — | 0
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models | Code | 1
Evaluation and Analysis of Hallucination in Large Vision-Language Models | Code | 1
MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models | Code | 2
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models | Code | 2

Leaderboard

No leaderboard results yet.