SOTAVerified

Hallucination

Papers

Showing 1351-1375 of 1816 papers

Title | Status | Hype
Uncertainty Aware Review Hallucination for Science Article Classification | | 0
Uncertainty-o: One Model-agnostic Framework for Unveiling Uncertainty in Large Multimodal Models | | 0
UNCLE: Uncertainty Expressions in Long-Form Generation | | 0
Understanding Alignment in Multimodal LLMs: A Comprehensive Study | | 0
Understanding and predicting user dissatisfaction in a neural generative chatbot | | 0
Understanding Your Agent: Leveraging Large Language Models for Behavior Explanation | | 0
UniFa: A unified feature hallucination framework for any-shot object detection | | 0
Unleashing the potential of prompt engineering for large language models | | 0
Unmasking Digital Falsehoods: A Comparative Analysis of LLM-Based Misinformation Detection Strategies | | 0
Unsupervised Compressive Text Summarisation with Reinforcement Learning | | 0
Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP | | 0
A Comprehensive Survey of Hallucination in Large Language, Image, Video and Audio Foundation Models | | 0
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation | | 0
Urban Land Cover Classification with Missing Data Modalities Using Deep Convolutional Neural Networks | | 0
User-Controlled Knowledge Fusion in Large Language Models: Balancing Creativity and Hallucination | | 0
UserSumBench: A Benchmark Framework for Evaluating User Summarization Approaches | | 0
Using Mobile Data and Deep Models to Assess Auditory Verbal Hallucinations | | 0
Utilizing Large Language Models in an iterative paradigm with domain feedback for zero-shot molecule optimization | | 0
Validating Network Protocol Parsers with Traceable RFC Document Interpretation | | 0
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech | | 0
Valuable Hallucinations: Realizable Non-realistic Propositions | | 0
Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models | | 0
Verify when Uncertain: Beyond Self-Consistency in Black Box Hallucination Detection | | 0
VERITAS: A Unified Approach to Reliability Evaluation | | 0
ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models | | 0

No leaderboard results yet.