Hallucination

Papers

Showing 201–210 of 1816 papers

Title	Date	Tasks	Status	Hype
Generating Natural Language Proofs with Verifier-Guided Search	May 25, 2022	Hallucination, valid	Code Available	1
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity	Feb 8, 2023	Code Generation, Hallucination	Code Available	1
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models	Sep 23, 2023	Code Completion, Hallucination	Code Available	1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations	Feb 10, 2024	Diagnostic, Hallucination	Code Available	1
Balanced Classification: A Unified Framework for Long-Tailed Object Detection	Aug 4, 2023	Hallucination, Long-tailed Object Detection	Code Available	1
BachGAN: High-Resolution Image Synthesis from Salient Object Layout	Mar 26, 2020	Generative Adversarial Network, Hallucination	Code Available	1
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model	Jan 21, 2025	Hallucination, Image Captioning	Code Available	1
FlySearch: Exploring how vision-language models explore	Jun 3, 2025	Hallucination, Task Planning	Code Available	1
Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards	May 7, 2025	Benchmarking, Hallucination	Code Available	1
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning	Mar 25, 2025	Hallucination, Language Modeling	Code Available	1

