Hallucination

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 551–600 of 1816 papers

Title	Date	Tasks	Status
Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization	May 20, 2025	HallucinationIn-Context Learning	—Unverified
Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis	May 20, 2025	Hallucination	—Unverified
Visual Instruction Bottleneck Tuning	May 20, 2025	HallucinationObject Hallucination	—Unverified
Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models	May 20, 2025	Hallucinationscientific discovery	CodeCode Available
Mitigating Hallucination in VideoLLMs via Temporal-Aware Activation Engineering	May 19, 2025	Hallucination	—Unverified
Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down	May 19, 2025	Automatic Speech RecognitionDecoder	—Unverified
LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries	May 19, 2025	HallucinationRetrieval	CodeCode Available
Detection and Mitigation of Hallucination in Large Reasoning Models: A Mechanistic Perspective	May 19, 2025	Hallucination	—Unverified
Granary: Speech Recognition and Translation Dataset in 25 European Languages	May 19, 2025	HallucinationPunctuation Restoration	—Unverified
Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice	May 19, 2025	AllHallucination	—Unverified
Selective Code Generation for Functional Guarantees	May 19, 2025	Code GenerationHallucination	—Unverified
Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation	May 18, 2025	Fact CheckingForm	—Unverified
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models	May 18, 2025	HallucinationMME	—Unverified
The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models	May 18, 2025	Hallucination	—Unverified
Mixture of Decoding: An Attention-Inspired Adaptive Decoding Strategy to Mitigate Hallucinations in Large Vision-Language Models	May 17, 2025	Hallucination	CodeCode Available
Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning?	May 17, 2025	HallucinationObject Counting	—Unverified
CCNU at SemEval-2025 Task 3: Leveraging Internal and External Knowledge of Large Language Models for Multilingual Hallucination Annotation	May 17, 2025	HallucinationQuestion Answering	—Unverified
Diverging Towards Hallucination: Detection of Failures in Vision-Language Models via Multi-token Aggregation	May 16, 2025	DiagnosticHallucination	—Unverified
EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models	May 16, 2025	Hallucination	CodeCode Available
Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning	May 16, 2025	HallucinationInformation Retrieval	—Unverified
DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented Generation	May 15, 2025	graph constructionHallucination	CodeCode Available
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges	May 15, 2025	AI AgentData Summarization	—Unverified
The Impact of Large Language Models on Task Automation in Manufacturing Services	May 14, 2025	HallucinationQuestion Answering	—Unverified
Beyond the Black Box: Interpretability of LLMs in Finance	May 14, 2025	FairnessHallucination	—Unverified
A Multimodal Multi-Agent Framework for Radiology Report Generation	May 14, 2025	DiagnosticHallucination	—Unverified
Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications	May 14, 2025	HallucinationLanguage Modeling	—Unverified
Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation	May 13, 2025	Event ExtractionHallucination	—Unverified
Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training	May 13, 2025	HallucinationLarge Language Model	CodeCode Available
Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification	May 13, 2025	HallucinationRAG	—Unverified
On the Cost and Benefits of Training Context with Utterance or Full Conversation Training: A Comparative Stud	May 12, 2025	GPUHallucination	—Unverified
SEReDeEP: Hallucination Detection in Retrieval-Augmented Models via Semantic Entropy and Context-Parameter Fusion	May 12, 2025	HallucinationRAG	—Unverified
Critique Before Thinking: Mitigating Hallucination through Rationale-Augmented Instruction Tuning	May 12, 2025	HallucinationMultimodal Reasoning	—Unverified
Multimodal Survival Modeling in the Age of Foundation Models	May 12, 2025	HallucinationSurvival Prediction	CodeCode Available
TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking	May 11, 2025	Fact CheckingFew-Shot Learning	—Unverified
Evolutionary thoughts: integration of large language models and evolutionary algorithms	May 9, 2025	Evolutionary AlgorithmsHallucination	CodeCode Available
Osiris: A Lightweight Open-Source Hallucination Detection System	May 7, 2025	HallucinationRAG	—Unverified
Interpretable Zero-shot Learning with Infinite Class Concepts	May 6, 2025	HallucinationZero-Shot Learning	—Unverified
Mitigating Image Captioning Hallucinations in Vision-Language Models	May 6, 2025	HallucinationHallucination Evaluation	—Unverified
Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation	May 5, 2025	Entity DisambiguationHallucination	—Unverified
UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM Output	May 5, 2025	Hallucination	CodeCode Available
SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation	May 4, 2025	HallucinationText Summarization	—Unverified
A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models	May 4, 2025	AttributeHallucination	—Unverified
Regression is all you need for medical image translation	May 4, 2025	AllHallucination	CodeCode Available
Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer	May 2, 2025	document understandingHallucination	—Unverified
Multi-agents based User Values Mining for Recommendation	May 2, 2025	HallucinationRecommendation Systems	—Unverified
SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation	May 1, 2025	HallucinationNavigate	CodeCode Available
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection	May 1, 2025	Extractive Question-AnsweringHallucination	—Unverified
Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models	May 1, 2025	Hallucination	—Unverified
Efficient and robust 3D blind harmonization for large domain gaps	Apr 30, 2025	HallucinationImage Harmonization	—Unverified
Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models	Apr 30, 2025	HallucinationObject	—Unverified

Show:10 25 50

← PrevPage 12 of 37Next →

No leaderboard results yet.