SOTAVerified

Hallucination

Papers

Showing 501550 of 1816 papers

TitleStatusHype
Map&Make: Schema Guided Text to Table Generation0
Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model0
Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs0
Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation0
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation0
Are Reasoning Models More Prone to Hallucination?0
Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual InformationCode0
SkewRoute: Training-Free LLM Routing for Knowledge Graph Retrieval-Augmented Generation via Score Skewness of Retrieved Context0
Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks0
Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration0
A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction0
Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs0
Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs0
Retrieval Visual Contrastive Decoding to Mitigate Object Hallucinations in Large Vision-Language ModelsCode0
Attention! You Vision Language Model Could Be Maliciously Manipulated0
Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language ModelsCode0
Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical SupervisionCode0
Enhancing Visual Reliance in Text Generation: A Bayesian Perspective on Mitigating Hallucination in Large Vision-Language Models0
LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models0
GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling0
CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language ModelsCode0
MedScore: Factuality Evaluation of Free-Form Medical AnswersCode0
keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span DetectionCode0
More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models0
Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection0
Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs0
UNCLE: Uncertainty Expressions in Long-Form Generation0
Walk&Retrieve: Simple Yet Effective Zero-shot Retrieval-Augmented Generation via Knowledge Graph WalksCode0
LLM-Powered Agents for Navigating Venice's Historical Cadastre0
Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation0
Chain-of-Thought Poisoning Attacks against R1-based Retrieval-Augmented Generation Systems0
Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding0
Shadows in the Attention: Contextual Perturbation and Representation Drift in the Dynamics of Hallucination in LLMs0
Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization0
NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction0
Multilingual Prompting for Improving LLM Generation Diversity0
KaFT: Knowledge-aware Fine-tuning for Boosting LLMs' Domain-specific Question-Answering Performance0
HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving0
RePPL: Recalibrating Perplexity by Uncertainty in Semantic Propagation and Language Generation for Explainable QA Hallucination Detection0
Aug2Search: Enhancing Facebook Marketplace Search with LLM-Generated Synthetic Data Augmentation0
OViP: Online Vision-Language Preference Learning0
Visual Instruction Bottleneck Tuning0
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM HallucinationsCode0
Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language ModelsCode0
Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models0
Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents0
Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization0
The Hallucination Tax of Reinforcement Finetuning0
Foundations of Unknown-aware Machine Learning0
Plane Geometry Problem Solving with Multi-modal Reasoning: A Survey0
Show:102550
← PrevPage 11 of 37Next →

No leaderboard results yet.