SOTAVerified

Hallucination

Papers

Showing 376400 of 1816 papers

TitleStatusHype
Valuable Hallucinations: Realizable Non-realistic Propositions0
A Survey of LLM-based Agents in Medicine: How far are we from Baymax?0
Automated Hypothesis Validation with Agentic Sequential FalsificationsCode3
Enhancing RAG with Active Learning on Conversation Records: Reject Incapables and Answer Capables0
DeepSeek on a Trip: Inducing Targeted Visual Hallucinations via Representation Vulnerabilities0
Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal ReasoningCode0
Hallucination, Monofacts, and Miscalibration: An Empirical InvestigationCode0
Refine Knowledge of Large Language Models via Adaptive Contrastive Learning0
Hallucination Detection: A Probabilistic Framework Using Embeddings Distance Analysis0
Knowledge Graph-Guided Retrieval Augmented GenerationCode2
Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language ModelsCode0
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasksCode0
VideoRoPE: What Makes for Good Video Rotary Position Embedding?Code3
ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework0
Linear Correlation in LM's Compositional Generalization and HallucinationCode0
TruthFlow: Truthful LLM Generation via Representation Flow Correction0
Large Language Models for Multi-Robot Systems: A SurveyCode1
Enhancing Hallucination Detection through Noise Injection0
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information SteeringCode2
A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs)0
DAMO: Data- and Model-aware Alignment of Multi-modal LLMsCode1
Mitigating Object Hallucinations in Large Vision-Language Models via Attention Calibration0
Eliciting Language Model Behaviors with Investigator Agents0
SelfCheckAgent: Zero-Resource Hallucination Detection in Generative Large Language Models0
MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation0
Show:102550
← PrevPage 16 of 73Next →

No leaderboard results yet.