SOTAVerified

Hallucination

Papers

Showing 751775 of 1816 papers

TitleStatusHype
Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs0
Detecting LLM Fact-conflicting Hallucinations Enhanced by Temporal-logic-based Reasoning0
SegSub: Evaluating Robustness to Knowledge Conflicts and Hallucinations in Vision-Language ModelsCode0
REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models0
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis0
OpenSearch-SQL: Enhancing Text-to-SQL with Dynamic Few-shot and Consistency Alignment0
TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination EvaluationCode0
CutPaste&Find: Efficient Multimodal Hallucination Detector with Visual-aid Knowledge Base0
Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models0
How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the WildCode0
Can Your Uncertainty Scores Detect Hallucinated Entity?0
Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation0
Valuable Hallucinations: Realizable Non-realistic Propositions0
A Survey of LLM-based Agents in Medicine: How far are we from Baymax?0
Enhancing RAG with Active Learning on Conversation Records: Reject Incapables and Answer Capables0
Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal ReasoningCode0
DeepSeek on a Trip: Inducing Targeted Visual Hallucinations via Representation Vulnerabilities0
Hallucination, Monofacts, and Miscalibration: An Empirical InvestigationCode0
Refine Knowledge of Large Language Models via Adaptive Contrastive Learning0
Hallucination Detection: A Probabilistic Framework Using Embeddings Distance Analysis0
Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language ModelsCode0
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasksCode0
ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework0
Enhancing Hallucination Detection through Noise Injection0
Linear Correlation in LM's Compositional Generalization and HallucinationCode0
Show:102550
← PrevPage 31 of 73Next →

No leaderboard results yet.