SOTAVerified

Hallucination

Papers

Showing 10511100 of 1816 papers

TitleStatusHype
Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption UtilizationCode0
Contrastive Learning for Knowledge-Based Question Generation in Large Language Models0
FIHA: Autonomous Hallucination Evaluation in Vision-Language Models with Davidson Scene Graphs0
A Multiple-Fill-in-the-Blank Exam Approach for Enhancing Zero-Resource Hallucination Detection in Large Language Models0
JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated ImagesCode0
LLMs Can Check Their Own Results to Mitigate Hallucinations in Traffic Understanding Tasks0
Textualized Agent-Style Reasoning for Complex Tasks by Multiple Round LLM Generation0
THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language ModelsCode0
Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling0
Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB0
Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to GiantCode0
Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection0
HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision MakingCode0
SFR-RAG: Towards Contextually Faithful LLMs0
Confidence Estimation for LLM-Based Dialogue State TrackingCode0
Explore the Hallucination on Low-level Perception for MLLMs0
ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models0
Winning Solution For Meta KDD Cup' 240
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications0
Safety challenges of AI in medicine in the era of large language models0
Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding0
LLMs Will Always Hallucinate, and We Need to Live With This0
Generating Faithful and Salient Text from Multimodal DataCode0
Detecting Buggy Contracts via Smart Testing0
Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering0
Vietnamese Legal Information Retrieval in Question-Answering System0
CLUE: Concept-Level Uncertainty Estimation for Large Language Models0
Improved Single Camera BEV Perception Using Multi-Camera Training0
Hallucination Detection in LLMs: Fast and Memory-Efficient Fine-Tuned ModelsCode0
Multi-Source Knowledge Pruning for Retrieval-Augmented Generation: A Benchmark and Empirical StudyCode0
Understanding Multimodal Hallucination with Parameter-Free Representation AlignmentCode0
What does it take to get state of the art in simultaneous speech-to-speech translation?0
LLMs Prompted for Graphs: Hallucinations and Generative Capabilities0
Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding Data0
UserSumBench: A Benchmark Framework for Evaluating User Summarization Approaches0
VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological ImagesCode0
Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation0
Evidence-Enhanced Triplet Generation Framework for Hallucination Alleviation in Generative Question Answering0
Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation0
Genetic Approach to Mitigate Hallucination in Generative IRCode0
Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models0
Internal and External Knowledge Interactive Refinement Framework for Knowledge-Intensive Question Answering0
Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning0
Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful ComparatorsCode0
RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference DataCode0
GRATR: Zero-Shot Evidence Graph Retrieval-Augmented Trustworthiness ReasoningCode0
MedDiT: A Knowledge-Controlled Diffusion Transformer Framework for Dynamic Medical Image Generation in Virtual Simulated Patient0
Towards Analyzing and Mitigating Sycophancy in Large Vision-Language Models0
RAG-Optimized Tibetan Tourism LLMs: Enhancing Accuracy and Personalization0
MAPLE: Enhancing Review Generation with Multi-Aspect Prompt LEarning in Explainable Recommendation0
Show:102550
← PrevPage 22 of 37Next →

No leaderboard results yet.