SOTAVerified

Hallucination

Papers

Showing 176200 of 1816 papers

TitleStatusHype
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection0
Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models0
SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided DistillationCode0
Efficient and robust 3D blind harmonization for large domain gaps0
MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness0
Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models0
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs0
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object PerceptionCode1
Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation?0
Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges0
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination0
Explanatory Summarization with Discourse-Driven Planning0
Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble ScorersCode5
Validating Network Protocol Parsers with Traceable RFC Document Interpretation0
Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction0
Toward Personalizing Quantum Computing Education: An Evolutionary LLM-Powered Approach0
The Dance of Atoms-De Novo Protein Design with Diffusion Model0
(Im)possibility of Automated Hallucination Detection in Large Language Models0
Grounded in Context: Retrieval-Based Method for Hallucination Detection0
Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback0
DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual UnderstandingCode2
POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications0
aiXamine: Simplified LLM Safety and Security0
ResNetVLLM-2: Addressing ResNetVLLM's Multi-Modal Hallucinations0
Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models0
Show:102550
← PrevPage 8 of 73Next →

No leaderboard results yet.