SOTAVerified

Hallucination

Papers

Showing 426-450 of 1816 papers

| Title | Status | Hype |
| --- | --- | --- |
| EventHallusion: Diagnosing Event Hallucinations in Video LLMs | Code | 1 |
| "Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation | Code | 1 |
| Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model | Code | 1 |
| Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization | Code | 1 |
| Learning From Correctness Without Prompting Makes LLM Efficient Reasoner | Code | 1 |
| Mitigating Object Hallucinations via Sentence-Level Early Intervention | Code | 1 |
| Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method | Code | 1 |
| Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Code | 1 |
| DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models | Code | 1 |
| EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models | Code | 1 |
| EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control | Code | 1 |
| MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Code | 1 |
| Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond | Code | 1 |
| AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language Generation | Code | 1 |
| Knowledge Verification to Nip Hallucination in the Bud | Code | 1 |
| ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Code | 1 |
| EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset | Code | 1 |
| Mitigating Hallucinations in Large Vision-Language Models via Summary-Guided Decoding | Code | 1 |
| Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow | Code | 1 |
| DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Code | 1 |
| Doc2Query--: When Less is More | Code | 1 |
| Improving Simultaneous Machine Translation with Monolingual Data | Code | 1 |
| Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression | Code | 1 |
| No-Reference Image Quality Assessment by Hallucinating Pristine Features | Code | 1 |
| "Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge Graphs | Code | 0 |
Page 18 of 73

No leaderboard results yet.