SOTAVerified

Hallucination

Papers

Showing 601-650 of 1816 papers

Title | Status | Hype
MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness | | 0
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs | | 0
Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation? | | 0
Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges | | 0
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination | | 0
Explanatory Summarization with Discourse-Driven Planning | | 0
Validating Network Protocol Parsers with Traceable RFC Document Interpretation | | 0
Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction | | 0
Toward Personalizing Quantum Computing Education: An Evolutionary LLM-Powered Approach | | 0
The Dance of Atoms-De Novo Protein Design with Diffusion Model | | 0
(Im)possibility of Automated Hallucination Detection in Large Language Models | | 0
Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback | | 0
Grounded in Context: Retrieval-Based Method for Hallucination Detection | | 0
POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications | | 0
aiXamine: Simplified LLM Safety and Security | | 0
ResNetVLLM-2: Addressing ResNetVLLM's Multi-Modal Hallucinations | | 0
Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models | | 0
Density Measures for Language Generation | | 0
Multi-Stage Retrieval for Operational Technology Cybersecurity Compliance Using Large Language Models: A Railway Casestudy | | 0
Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations | Code | 0
Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation | | 0
QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning? | | 0
Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training | | 0
SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration Mistakes | Code | 0
Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models - | | 0
Naming is framing: How cybersecurity's language problems are repeating in AI governance | | 0
Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization | | 0
Purposefully Induced Psychosis (PIP): Embracing Hallucination as Imagination in Large Language Models | | 0
Hallucination-Aware Generative Pretrained Transformer for Cooperative Aerial Mobility Control | | 0
From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs | | 0
The Future of MLLM Prompting is Adaptive: A Comprehensive Experimental Evaluation of Prompt Engineering Methods for Robust Multimodal Performance | | 0
Hallucination Detection in LLMs via Topological Divergence on Attention Graphs | | 0
Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection | | 0
DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers | | 0
HalluShift: Measuring Distribution Shifts towards Hallucination Detection in LLMs | Code | 0
SynthTRIPs: A Knowledge-Grounded Framework for Benchmark Query Generation for Personalized Tourism Recommenders | | 0
MedHal: An Evaluation Dataset for Medical Hallucination Detection | | 0
The Other Side of the Coin: Exploring Fairness in Retrieval-Augmented Generation | Code | 0
Cross-Document Cross-Lingual NLI via RST-Enhanced Graph Fusion and Interpretability Prediction | | 0
Hallucination, reliability, and the role of generative AI in science | | 0
Learning Fine-grained Domain Generalization via Hyperbolic State Space Hallucination | Code | 0
Generative AI in Collaborative Academic Report Writing: Advantages, Disadvantages, and Ethical Considerations | | 0
Robust Hallucination Detection in LLMs via Adaptive Token Selection | | 0
How to Detect and Defeat Molecular Mirage: A Metric-Driven Benchmark for Hallucination in LLM-based Molecular Comprehension | | 0
Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation | | 0
Perception in Reflection | | 0
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens | | 0
Graph-based Approaches and Functionalities in Retrieval-Augmented Generation: A Comprehensive Survey | | 0
Capturing AI's Attention: Physics of Repetition, Hallucination, Bias and Beyond | | 0
TARAC: Mitigating Hallucination in LVLMs via Temporal Attention Real-time Accumulative Connection | | 0

No leaderboard results yet.