
Hallucination Evaluation

Evaluate the ability of LLMs to generate text free of hallucinations, or assess their capability to recognize hallucinated content.
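To make the two evaluation modes concrete, below is a minimal, hypothetical sketch in Python. The function names, metrics, and data layout are illustrative assumptions, not the API of any benchmark listed on this page: generation quality is summarized as the fraction of generated claims judged hallucinated, and recognition ability as accuracy against gold hallucinated/faithful labels.

```python
from typing import List

def hallucination_rate(claim_labels: List[bool]) -> float:
    """Fraction of generated claims judged hallucinated (True = hallucinated).

    A lower rate means the model generates text more faithful to its sources.
    """
    return sum(claim_labels) / len(claim_labels) if claim_labels else 0.0

def recognition_accuracy(predictions: List[bool], gold: List[bool]) -> float:
    """Accuracy of a model's hallucinated/faithful judgments against gold labels.

    Measures how well the model recognizes hallucinations in given statements.
    """
    assert len(predictions) == len(gold), "prediction/label lengths must match"
    correct = sum(p == g for p, g in zip(predictions, gold))
    return correct / len(gold) if gold else 0.0

# Example: 2 of 5 generated claims were judged hallucinated (rate 0.4),
# and a judge model correctly classified 4 of 5 statements (accuracy 0.8).
print(hallucination_rate([True, False, True, False, False]))
print(recognition_accuracy([True, False, True, False, False],
                           [True, False, False, False, False]))
```

Real benchmarks differ mainly in how the boolean labels are obtained, for example via human annotation, reference checking, or an LLM judge.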

Papers

Showing 11–20 of 49 papers

Title | Status | Hype
Enhancing LLM's Cognition via Structurization | Code | 1
PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset | Code | 1
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models | Code | 1
Alleviating Hallucinations of Large Language Models through Induced Hallucinations | Code | 1
Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites | Code | 1
UHGEval: Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation | Code | 1
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data | Code | 1
Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization | Code | 1
AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation | Code | 1
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models | Code | 1
Page 2 of 5

Leaderboard

No leaderboard results yet.