SOTAVerified

Hallucination

Papers

Showing 451-500 of 1816 papers

Title | Status | Hype
Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion | | 0
Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking | | 0
Stop Learning it all to Mitigate Visual Hallucination, Focus on the Hallucination Target. | | 0
VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification | Code | 1
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Code | 2
Octopus: Alleviating Hallucination via Dynamic Contrastive Decoding | Code | 1
VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models | | 0
POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation | | 0
RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human Feedback | Code | 0
IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models | | 0
A review of faithfulness metrics for hallucination assessment in Large Language Models | | 0
Distilling Desired Comments for Enhanced Code Review with Large Language Models | | 0
HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models | Code | 0
Is Your Text-to-Image Model Robust to Caption Noise? | | 0
An End-to-End Depth-Based Pipeline for Selfie Image Rectification | | 0
MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models | | 0
From Hallucinations to Facts: Enhancing Language Models with Curated Knowledge Graphs | | 0
Extract Free Dense Misalignment from CLIP | Code | 1
Improving Factuality with Explicit Working Memory | | 0
Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation | Code | 1
Multimodal Preference Data Synthetic Alignment with Reward Model | Code | 0
CiteBART: Learning to Generate Citations for Local Citation Recommendation | Code | 0
AlzheimerRAG: Multimodal Retrieval Augmented Generation for PubMed articles | | 0
Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage | | 0
Logical Consistency of Large Language Models in Fact-checking | | 0
Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation | | 0
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling | | 0
A Comparative Study of DSPy Teleprompter Algorithms for Aligning Large Language Models Evaluation Metrics to Human Evaluation | | 0
Query pipeline optimization for cancer patient question answering systems | | 0
Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation | | 0
Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence | | 0
Are LLMs Good Literature Review Writers? Evaluating the Literature Review Writing Ability of Large Language Models | | 0
ReXTrust: A Model for Fine-Grained Hallucination Detection in AI-Generated Radiology Reports | | 0
A MapReduce Approach to Effectively Utilize Long Context Information in Retrieval Augmented Language Models | | 0
When to Speak, When to Abstain: Contrastive Decoding with Abstention | | 0
What External Knowledge is Preferred by LLMs? Characterizing and Exploring Chain of Evidence in Imperfect Context | | 0
A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection | Code | 0
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | Code | 2
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding | | 0
Task-Oriented Dialog Systems for the Senegalese Wolof Language | | 0
RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models | | 0
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning | | 0
Accelerating Retrieval-Augmented Generation | | 0
NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries | | 0
Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data | | 0
TACOMORE: Leveraging the Potential of LLMs in Corpus-based Discourse Analysis with Prompt Engineering | | 0
Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts | | 0
Benchmarking large language models for materials synthesis: the case of atomic layer deposition | | 0
Multi-Task Learning with LLMs for Implicit Sentiment Analysis: Data-level and Task-level Automatic Weight Learning | | 0
Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion | Code | 1

No leaderboard results yet.