SOTAVerified

Hallucination

Papers

Showing 401-450 of 1816 papers

Title | Status | Hype
Phare: A Safety Probe for Large Language Models | Code | 1
IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking | Code | 1
Entity-level Factual Consistency of Abstractive Text Summarization | Code | 1
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection | Code | 1
Multimodal LLMs Can Reason about Aesthetics in Zero-Shot | Code | 1
Evaluation and Analysis of Hallucination in Large Vision-Language Models | Code | 1
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models | Code | 1
NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination | Code | 1
Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus | Code | 1
MMRel: A Relation Understanding Benchmark in the MLLM Era | Code | 1
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling | Code | 1
Knowledge Graph-Enhanced Large Language Models via Path Selection | Code | 1
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | Code | 1
Mitigating Open-Vocabulary Caption Hallucinations | Code | 1
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Code | 1
Entity-Based Knowledge Conflicts in Question Answering | Code | 1
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | Code | 1
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback | Code | 1
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources | Code | 1
Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards | Code | 1
A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs | Code | 1
Detecting Hallucinated Content in Conditional Neural Sequence Generation | Code | 1
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations | Code | 1
Accuracy and Political Bias of News Source Credibility Ratings by Large Language Models | Code | 1
Enhancing LLM's Cognition via Structurization | Code | 1
EventHallusion: Diagnosing Event Hallucinations in Video LLMs | Code | 1
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation | Code | 1
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model | Code | 1
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization | Code | 1
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner | Code | 1
Mitigating Object Hallucinations via Sentence-Level Early Intervention | Code | 1
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method | Code | 1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Code | 1
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models | Code | 1
EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models | Code | 1
EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control | Code | 1
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Code | 1
Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond | Code | 1
AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language Generation | Code | 1
Knowledge Verification to Nip Hallucination in the Bud | Code | 1
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Code | 1
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset | Code | 1
Mitigating Hallucinations in Large Vision-Language Models via Summary-Guided Decoding | Code | 1
Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow | Code | 1
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Code | 1
Doc2Query--: When Less is More | Code | 1
Improving Simultaneous Machine Translation with Monolingual Data | Code | 1
Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression | Code | 1
No-Reference Image Quality Assessment by Hallucinating Pristine Features | Code | 1
"Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge Graphs | Code | 0
Page 9 of 37