SOTAVerified

Hallucination

Papers

Showing 51-75 of 1816 papers

| Title | Status | Hype |
| --- | --- | --- |
| Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation | | 0 |
| Machine Mirages: Defining the Undefined | | 0 |
| FlySearch: Exploring how vision-language models explore | Code | 1 |
| Tomographic Foundation Model -- FORCE: Flow-Oriented Reconstruction Conditioning Engine | | 0 |
| TRUST -- Transformer-Driven U-Net for Sparse Target Recovery | | 0 |
| Generative AI and Organizational Structure in the Knowledge Economy | | 0 |
| Measuring Faithfulness and Abstention: An Automated Pipeline for Evaluating LLM-Generated 3-ply Case-Based Legal Arguments | | 0 |
| An AI-powered Knowledge Hub for Potato Functional Genomics | | 0 |
| Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs | | 0 |
| LLM Inference Enhanced by External Knowledge: A Survey | Code | 0 |
| The Hallucination Dilemma: Factuality-Aware Reinforcement Learning for Large Reasoning Models | Code | 1 |
| BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models | | 0 |
| FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation | Code | 2 |
| MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM | | 0 |
| Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model | | 0 |
| Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation | | 0 |
| MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration | Code | 0 |
| Are Reasoning Models More Prone to Hallucination? | | 0 |
| Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | | 0 |
| Map&Make: Schema Guided Text to Table Generation | | 0 |
| Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs | | 0 |
| Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information | Code | 0 |
| Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks | | 0 |
| SkewRoute: Training-Free LLM Routing for Knowledge Graph Retrieval-Augmented Generation via Score Skewness of Retrieved Context | | 0 |
| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | Code | 1 |
Page 3 of 73

No leaderboard results yet.