SOTAVerified

Hallucination

Papers

Showing 476500 of 1816 papers

TitleStatusHype
MEMOIR: Lifelong Model Editing with Minimal Overwrite and Informed Retention for LLMs0
Conservative Bias in Large Language Models: Measuring Relation Predictions0
Uncertainty-o: One Model-agnostic Framework for Unveiling Uncertainty in Large Multimodal Models0
ARGUS: Hallucination and Omission Evaluation in Video-LLMs0
Reducing Object Hallucination in Large Audio-Language Models via Audio-Aware Decoding0
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning0
QuantMCP: Grounding Large Language Models in Verifiable Financial Reality0
When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models0
CLATTER: Comprehensive Entailment Reasoning for Hallucination Detection0
GOLFer: Smaller LM-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information RetrievalCode0
Magic Mushroom: A Customizable Benchmark for Fine-grained Analysis of Retrieval Noise Erosion in RAG Systems0
On the Fundamental Impossibility of Hallucination Control in Large Language Models0
CHIME: Conditional Hallucination and Integrated Multi-scale Enhancement for Time Series Diffusion Model0
Machine Mirages: Defining the Undefined0
Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation0
Tomographic Foundation Model -- FORCE: Flow-Oriented Reconstruction Conditioning Engine0
TRUST -- Transformer-Driven U-Net for Sparse Target Recovery0
Measuring Faithfulness and Abstention: An Automated Pipeline for Evaluating LLM-Generated 3-ply Case-Based Legal Arguments0
Generative AI and Organizational Structure in the Knowledge Economy0
Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs0
BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models0
MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM0
An AI-powered Knowledge Hub for Potato Functional Genomics0
LLM Inference Enhanced by External Knowledge: A SurveyCode0
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation0
Show:102550
← PrevPage 20 of 73Next →

No leaderboard results yet.