SOTAVerified

Hallucination

Papers

Showing 661670 of 1816 papers

TitleStatusHype
Medico: Towards Hallucination Detection and Correction with Multi-source Evidence Fusion0
SkillAggregation: Reference-free LLM-Dependent Aggregation0
VideoAgent: Self-Improving Video GenerationCode2
LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language ModelsCode0
Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code0
Honest AI: Fine-Tuning "Small" Language Models to Say "I Don't Know", and Reducing Hallucination in RAG0
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment0
A Methodology for Evaluating RAG Systems: A Case Study On Configuration Dependency ValidationCode0
Measuring the Inconsistency of Large Language Models in Preferential Ranking0
VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video UnderstandingCode1
Show:102550
← PrevPage 67 of 182Next →

No leaderboard results yet.