SOTAVerified

Hallucination

Papers

Showing 151175 of 1816 papers

TitleStatusHype
Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications0
Prioritizing Image-Related Tokens Enhances Vision-Language Pre-TrainingCode0
Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification0
Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation0
A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM OutputsCode1
On the Cost and Benefits of Training Context with Utterance or Full Conversation Training: A Comparative Stud0
SEReDeEP: Hallucination Detection in Retrieval-Augmented Models via Semantic Entropy and Context-Parameter Fusion0
Multimodal Survival Modeling in the Age of Foundation ModelsCode0
Critique Before Thinking: Mitigating Hallucination through Rationale-Augmented Instruction Tuning0
TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking0
Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language ModelsCode1
Evolutionary thoughts: integration of large language models and evolutionary algorithmsCode0
Osiris: A Lightweight Open-Source Hallucination Detection System0
Benchmarking LLM Faithfulness in RAG with Evolving LeaderboardsCode1
Mitigating Image Captioning Hallucinations in Vision-Language Models0
Interpretable Zero-shot Learning with Infinite Class Concepts0
UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM OutputCode0
Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question AnsweringCode1
Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation0
A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models0
SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation0
Regression is all you need for medical image translationCode0
Multi-agents based User Values Mining for Recommendation0
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video UnderstandingCode1
Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer0
Show:102550
← PrevPage 7 of 73Next →

No leaderboard results yet.