SOTAVerified

Hallucination

Papers

Showing 16761700 of 1816 papers

TitleStatusHype
Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused0
Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training0
LR-to-HR Face Hallucination with an Adversarial Progressive Attribute-Induced Network0
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost0
Lynx: An Open Source Hallucination Evaluation Model0
M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation0
Machine learning techniques for the Schizophrenia diagnosis: A comprehensive review and future research directions0
Machine Mirages: Defining the Undefined0
MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness0
Magic Mushroom: A Customizable Benchmark for Fine-grained Analysis of Retrieval Noise Erosion in RAG Systems0
Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions0
MALTO at SemEval-2024 Task 6: Leveraging Synthetic Data for LLM Hallucination Detection0
Manipulating Attributes of Natural Scenes via Hallucination0
MAPLE: Enhancing Review Generation with Multi-Aspect Prompt LEarning in Explainable Recommendation0
Map&Make: Schema Guided Text to Table Generation0
MARCO: Multi-Agent Real-time Chat Orchestration0
MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations0
MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection0
Maximum Hallucination Standards for Domain-Specific Large Language Models0
Meaningless is better: hashing bias-inducing words in LLM prompts improves performance in logical reasoning and statistical learning0
Measuring and Mitigating Hallucinations in Vision-Language Dataset Generation for Remote Sensing0
Measuring and Reducing LLM Hallucination without Gold-Standard Answers0
Measuring Faithfulness and Abstention: An Automated Pipeline for Evaluating LLM-Generated 3-ply Case-Based Legal Arguments0
Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation0
Measuring the Inconsistency of Large Language Models in Preferential Ranking0
Show:102550
← PrevPage 68 of 73Next →

No leaderboard results yet.