SOTAVerified

Hallucination

Papers

Showing 1226-1250 of 1816 papers

Title | Status | Hype
Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs | - | 0
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost | - | 0
Comprehensive Evaluation of Large Language Models for Topic Modeling | - | 0
DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models | Code | 0
NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models | Code | 0
Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts | - | 0
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools | - | 0
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification | - | 0
MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection | - | 0
Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study | - | 0
LLMs and Memorization: On Quality and Specificity of Copyright Compliance | Code | 0
Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action | - | 0
Data-augmented phrase-level alignment for mitigating object hallucination | - | 0
RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language Models | - | 0
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings | - | 0
Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks | Code | 0
GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases | - | 0
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization | Code | 0
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems | - | 0
Large Language Model Pruning | - | 0
Scaling Laws for Discriminative Classification in Large Language Models | - | 0
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models | - | 0
GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games | - | 0
Less for More: Enhanced Feedback-aligned Mixed LLMs for Molecule Caption Generation and Fine-Grained NLI Evaluation | - | 0
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models | - | 0

No leaderboard results yet.