SOTAVerified

Hallucination

Papers

Showing 826-850 of 1816 papers

| Title | Status | Hype |
| --- | --- | --- |
| VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models | | 0 |
| A review of faithfulness metrics for hallucination assessment in Large Language Models | | 0 |
| Distilling Desired Comments for Enhanced Code Review with Large Language Models | | 0 |
| HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models | Code | 0 |
| Is Your Text-to-Image Model Robust to Caption Noise? | | 0 |
| An End-to-End Depth-Based Pipeline for Selfie Image Rectification | | 0 |
| MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models | | 0 |
| Improving Factuality with Explicit Working Memory | | 0 |
| From Hallucinations to Facts: Enhancing Language Models with Curated Knowledge Graphs | | 0 |
| Multimodal Preference Data Synthetic Alignment with Reward Model | Code | 0 |
| CiteBART: Learning to Generate Citations for Local Citation Recommendation | Code | 0 |
| AlzheimerRAG: Multimodal Retrieval Augmented Generation for PubMed articles | | 0 |
| Logical Consistency of Large Language Models in Fact-checking | | 0 |
| Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage | | 0 |
| Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation | | 0 |
| Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation | | 0 |
| Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling | | 0 |
| Query pipeline optimization for cancer patient question answering systems | | 0 |
| A Comparative Study of DSPy Teleprompter Algorithms for Aligning Large Language Models Evaluation Metrics to Human Evaluation | | 0 |
| Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence | | 0 |
| Are LLMs Good Literature Review Writers? Evaluating the Literature Review Writing Ability of Large Language Models | | 0 |
| When to Speak, When to Abstain: Contrastive Decoding with Abstention | | 0 |
| A MapReduce Approach to Effectively Utilize Long Context Information in Retrieval Augmented Language Models | | 0 |
| What External Knowledge is Preferred by LLMs? Characterizing and Exploring Chain of Evidence in Imperfect Context | | 0 |
| ReXTrust: A Model for Fine-Grained Hallucination Detection in AI-Generated Radiology Reports | | 0 |
Page 34 of 73

No leaderboard results yet.