SOTAVerified

Hallucination

Papers

Showing 12011250 of 1816 papers

TitleStatusHype
Hallucination Mitigation Prompts Long-term Video UnderstandingCode0
Self-training Large Language Models through Knowledge DetectionCode0
Teaching Large Language Models to Express Knowledge Boundary from Their Own Signals0
Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded ConversationsCode0
Detecting and Evaluating Medical Hallucinations in Large Vision Language Models0
DefAn: Definitive Answer Dataset for LLMs Hallucination EvaluationCode0
HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination EvaluationCode0
Beyond Words: On Large Language Models Actionability in Mission-Critical Risk Analysis0
Progressive Query Expansion for Retrieval Over Cost-constrained Data Sources0
On the Hallucination in Simultaneous Machine TranslationCode0
Estimating the Hallucination Rate of Generative AI0
A Probabilistic Framework for LLM Hallucination Detection via Belief Tree PropagationCode0
Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation0
Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions0
Chaos with Keywords: Exposing Large Language Models Sycophantic Hallucination to Misleading Keywords and Evaluating Defense Strategies0
Confabulation: The Surprising Value of Large Language Model Hallucinations0
ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints0
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework0
Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends0
Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs0
How to Explore with Belief: State Entropy Maximization in POMDPs0
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models0
OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors DetectionCode0
Ask-EDA: A Design Assistant Empowered by LLM, Hybrid RAG and Abbreviation De-hallucination0
Large Language Model Assisted Optimal Bidding of BESS in FCAS Market: An AI-agent based Approach0
Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs0
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost0
Comprehensive Evaluation of Large Language Models for Topic Modeling0
DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language ModelsCode0
NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language ModelsCode0
Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts0
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools0
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification0
MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection0
Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study0
LLMs and Memorization: On Quality and Specificity of Copyright ComplianceCode0
Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action0
Data-augmented phrase-level alignment for mitigating object hallucination0
RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language Models0
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings0
Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language TasksCode0
GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases0
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced OptimizationCode0
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems0
Large Language Model Pruning0
Scaling Laws for Discriminative Classification in Large Language Models0
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models0
GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games0
Less for More: Enhanced Feedback-aligned Mixed LLMs for Molecule Caption Generation and Fine-Grained NLI Evaluation0
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models0
Show:102550
← PrevPage 25 of 37Next →

No leaderboard results yet.