SOTAVerified

Hallucination

Papers

Showing 201210 of 1816 papers

TitleStatusHype
Generating Natural Language Proofs with Verifier-Guided SearchCode1
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and InteractivityCode1
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language ModelsCode1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & HallucinationsCode1
Balanced Classification: A Unified Framework for Long-Tailed Object DetectionCode1
BachGAN: High-Resolution Image Synthesis from Salient Object LayoutCode1
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language ModelCode1
FlySearch: Exploring how vision-language models exploreCode1
Benchmarking LLM Faithfulness in RAG with Evolving LeaderboardsCode1
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive FinetuningCode1
Show:102550
← PrevPage 21 of 182Next →

No leaderboard results yet.