SOTAVerified|Agents Browse Leaderboard About

Hallucination

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 361–370 of 1816 papers

Title	Date	Tasks	Status	Hype
Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs	Feb 20, 2025	HallucinationTopic Models	—Unverified	0
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models	Feb 20, 2025	Decision MakingHallucination	—Unverified	0
SegSub: Evaluating Robustness to Knowledge Conflicts and Hallucinations in Vision-Language Models	Feb 19, 2025	counterfactualHallucination	CodeCode Available	0
OpenSearch-SQL: Enhancing Text-to-SQL with Dynamic Few-shot and Consistency Alignment	Feb 19, 2025	HallucinationInstruction Following	—Unverified	0
Detecting LLM Fact-conflicting Hallucinations Enhanced by Temporal-logic-based Reasoning	Feb 19, 2025	Hallucination	—Unverified	0
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis	Feb 19, 2025	HallucinationLanguage Modeling	—Unverified	0
TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation	Feb 19, 2025	Dataset GenerationGSM8K	CodeCode Available	0
REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models	Feb 19, 2025	HallucinationLanguage Modeling	—Unverified	0
Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models	Feb 18, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
CutPaste&Find: Efficient Multimodal Hallucination Detector with Visual-aid Knowledge Base	Feb 18, 2025	AttributeHallucination	—Unverified	0

Show:10 25 50

← PrevPage 37 of 182Next →

No leaderboard results yet.