SOTAVerified

Hallucination

Papers

Showing 176200 of 1816 papers

TitleStatusHype
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement LearningCode1
Removal of Hallucination on Hallucination: Debate-Augmented RAGCode1
Mitigating Hallucinations in Vision-Language Models through Image-Guided Head SuppressionCode1
Know Or Not: a library for evaluating out-of-knowledge base robustnessCode1
Phare: A Safety Probe for Large Language ModelsCode1
Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented GenerationCode1
A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM OutputsCode1
Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language ModelsCode1
Benchmarking LLM Faithfulness in RAG with Evolving LeaderboardsCode1
Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question AnsweringCode1
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video UnderstandingCode1
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object PerceptionCode1
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal RepresentationsCode1
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video ModelsCode1
EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot ControlCode1
The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal HallucinationCode1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-GenerationCode1
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and MitigationCode1
LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer TextCode1
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive FinetuningCode1
GeoBenchX: Benchmarking LLMs for Multistep Geospatial TasksCode1
ProDehaze: Prompting Diffusion Models Toward Faithful Image DehazingCode1
Grounded Chain-of-Thought for Multimodal Large Language ModelsCode1
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-InterventionCode1
Towards General Visual-Linguistic Face Forgery Detection(V2)Code1
Show:102550
← PrevPage 8 of 73Next →

No leaderboard results yet.