SOTAVerified

Hallucination

Papers

Showing 651675 of 1816 papers

TitleStatusHype
LargePiG: Your Large Language Model is Secretly a Pointer Generator0
ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability0
MLLM can see? Dynamic Correction Decoding for Hallucination MitigationCode2
Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language ModelsCode0
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs0
On the Capacity of Citation Generation by Large Language Models0
Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions0
AGENTiGraph: An Interactive Knowledge Graph Platform for LLM-based Chatbots Utilizing Private Data0
Can Structured Data Reduce Epistemic Uncertainty?0
Parenting: Optimizing Knowledge Selection of Retrieval-Augmented Language Models with Parameter Decoupling and Tailored Tuning0
SkillAggregation: Reference-free LLM-Dependent Aggregation0
Medico: Towards Hallucination Detection and Correction with Multi-source Evidence Fusion0
VideoAgent: Self-Improving Video GenerationCode2
LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language ModelsCode0
Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code0
Honest AI: Fine-Tuning "Small" Language Models to Say "I Don't Know", and Reducing Hallucination in RAG0
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment0
VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video UnderstandingCode1
A Methodology for Evaluating RAG Systems: A Case Study On Configuration Dependency ValidationCode0
Measuring the Inconsistency of Large Language Models in Preferential Ranking0
PublicHearingBR: A Brazilian Portuguese Dataset of Public Hearing Transcripts for Summarization of Long Documents0
Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study over Open-ended Question Answering0
Automatic Curriculum Expert Iteration for Reliable LLM ReasoningCode1
OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model PromptingCode1
LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts0
Show:102550
← PrevPage 27 of 73Next →

No leaderboard results yet.