Hallucination

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 1816 papers

Title	Date	Tasks	Status	Hype
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models	Jun 15, 2023	HallucinationImage Captioning	CodeCode Available	2
Lawyer LLaMA Technical Report	May 24, 2023	ArticlesHallucination	CodeCode Available	2
Enabling Large Language Models to Generate Text with Citations	May 24, 2023	HallucinationRetrieval	CodeCode Available	2
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models	May 19, 2023	HallucinationHallucination Evaluation	CodeCode Available	2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation	May 19, 2023	HallucinationMachine Translation	CodeCode Available	2
Evaluating Object Hallucination in Large Vision-Language Models	May 17, 2023	HallucinationObject	CodeCode Available	2
Exploring Human-Like Translation Strategy with Large Language Models	May 6, 2023	HallucinationMachine Translation	CodeCode Available	2
GPT-NER: Named Entity Recognition via Large Language Models	Apr 20, 2023	Hallucinationnamed-entity-recognition	CodeCode Available	2
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models	Mar 15, 2023	Fact CheckingHallucination	CodeCode Available	2
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization	Jan 28, 2023	HallucinationMultiple-choice	CodeCode Available	2
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions	Dec 20, 2022	HallucinationQuestion Answering	CodeCode Available	2
Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression	Jul 21, 2022	HallucinationImage Enhancement	CodeCode Available	2
PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision	Mar 29, 2022	3D Human Pose EstimationHallucination	CodeCode Available	2
Mitigating Object Hallucinations via Sentence-Level Early Intervention	Jul 16, 2025	HallucinationMM-Vet	CodeCode Available	1
KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality	Jun 24, 2025	HallucinationHallucination Evaluation	CodeCode Available	1
DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion Models	Jun 13, 2025	AllHallucination	CodeCode Available	1
Revisit What You See: Disclose Language Prior in Vision Tokens for Efficient Guided Decoding of LVLMs	Jun 11, 2025	HallucinationObject Hallucination	CodeCode Available	1
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs	Jun 11, 2025	Code GenerationDiagnostic	CodeCode Available	1
MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models	Jun 9, 2025	DiagnosticHallucination	CodeCode Available	1
Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models	Jun 5, 2025	DiagnosticHallucination	CodeCode Available	1
Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification	Jun 5, 2025	Automated Theorem ProvingHallucination	CodeCode Available	1
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis	Jun 4, 2025	Action GenerationDecision Making	CodeCode Available	1
FlySearch: Exploring how vision-language models explore	Jun 3, 2025	HallucinationTask Planning	CodeCode Available	1
The Hallucination Dilemma: Factuality-Aware Reinforcement Learning for Large Reasoning Models	May 30, 2025	HallucinationMathematical Reasoning	CodeCode Available	1
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models	May 27, 2025	HallucinationLanguage Modeling	CodeCode Available	1
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning	May 26, 2025	HallucinationRAG	CodeCode Available	1
Removal of Hallucination on Hallucination: Debate-Augmented RAG	May 24, 2025	HallucinationRAG	CodeCode Available	1
Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression	May 22, 2025	HallucinationImage Description	CodeCode Available	1
Know Or Not: a library for evaluating out-of-knowledge base robustness	May 19, 2025	HallucinationRAG	CodeCode Available	1
Phare: A Safety Probe for Large Language Models	May 16, 2025	DiagnosticHallucination	CodeCode Available	1
Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented Generation	May 16, 2025	HallucinationRAG	CodeCode Available	1
A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs	May 13, 2025	HallucinationUncertainty Quantification	CodeCode Available	1
Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models	May 11, 2025	DescriptiveDiagnostic	CodeCode Available	1
Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards	May 7, 2025	BenchmarkingHallucination	CodeCode Available	1
Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering	May 5, 2025	HallucinationQuestion Answering	CodeCode Available	1
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding	May 2, 2025	Anomaly DetectionCommon Sense Reasoning	CodeCode Available	1
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception	Apr 29, 2025	counterfactualHallucination	CodeCode Available	1
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations	Apr 18, 2025	Hallucination	CodeCode Available	1
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models	Apr 17, 2025	HallucinationVideo Understanding	CodeCode Available	1
EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control	Apr 14, 2025	Hallucination	CodeCode Available	1
The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination	Apr 14, 2025	Hallucination	CodeCode Available	1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation	Apr 4, 2025	ClusteringHallucination	CodeCode Available	1
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation	Mar 25, 2025	HallucinationHallucination Evaluation	CodeCode Available	1
LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text	Mar 25, 2025	Cross-Modal RetrievalHallucination	CodeCode Available	1
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning	Mar 25, 2025	HallucinationLanguage Modeling	CodeCode Available	1
GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks	Mar 23, 2025	BenchmarkingHallucination	CodeCode Available	1
ProDehaze: Prompting Diffusion Models Toward Faithful Image Dehazing	Mar 21, 2025	HallucinationImage Dehazing	CodeCode Available	1
Grounded Chain-of-Thought for Multimodal Large Language Models	Mar 17, 2025	HallucinationSpatial Reasoning	CodeCode Available	1
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention	Mar 13, 2025	HallucinationObject Hallucination	CodeCode Available	1
Towards General Visual-Linguistic Face Forgery Detection(V2)	Feb 28, 2025	HallucinationLanguage Modeling	CodeCode Available	1

Show:10 25 50

← PrevPage 4 of 37Next →

No leaderboard results yet.