SOTAVerified

Hallucination

Papers

Showing 551600 of 1816 papers

TitleStatusHype
Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization0
Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis0
Visual Instruction Bottleneck Tuning0
Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language ModelsCode0
Mitigating Hallucination in VideoLLMs via Temporal-Aware Activation Engineering0
Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down0
LLM-based Query Expansion Fails for Unfamiliar and Ambiguous QueriesCode0
Detection and Mitigation of Hallucination in Large Reasoning Models: A Mechanistic Perspective0
Granary: Speech Recognition and Translation Dataset in 25 European Languages0
Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice0
Selective Code Generation for Functional Guarantees0
Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation0
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models0
The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models0
Mixture of Decoding: An Attention-Inspired Adaptive Decoding Strategy to Mitigate Hallucinations in Large Vision-Language ModelsCode0
Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning?0
CCNU at SemEval-2025 Task 3: Leveraging Internal and External Knowledge of Large Language Models for Multilingual Hallucination Annotation0
Diverging Towards Hallucination: Detection of Failures in Vision-Language Models via Multi-token Aggregation0
EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language ModelsCode0
Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning0
DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented GenerationCode0
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges0
The Impact of Large Language Models on Task Automation in Manufacturing Services0
Beyond the Black Box: Interpretability of LLMs in Finance0
A Multimodal Multi-Agent Framework for Radiology Report Generation0
Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications0
Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation0
Prioritizing Image-Related Tokens Enhances Vision-Language Pre-TrainingCode0
Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification0
On the Cost and Benefits of Training Context with Utterance or Full Conversation Training: A Comparative Stud0
SEReDeEP: Hallucination Detection in Retrieval-Augmented Models via Semantic Entropy and Context-Parameter Fusion0
Critique Before Thinking: Mitigating Hallucination through Rationale-Augmented Instruction Tuning0
Multimodal Survival Modeling in the Age of Foundation ModelsCode0
TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking0
Evolutionary thoughts: integration of large language models and evolutionary algorithmsCode0
Osiris: A Lightweight Open-Source Hallucination Detection System0
Interpretable Zero-shot Learning with Infinite Class Concepts0
Mitigating Image Captioning Hallucinations in Vision-Language Models0
Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation0
UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM OutputCode0
SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation0
A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models0
Regression is all you need for medical image translationCode0
Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer0
Multi-agents based User Values Mining for Recommendation0
SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided DistillationCode0
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection0
Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models0
Efficient and robust 3D blind harmonization for large domain gaps0
Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models0
Show:102550
← PrevPage 12 of 37Next →

No leaderboard results yet.