SOTAVerified

Hallucination

Papers

Showing 9761000 of 1816 papers

TitleStatusHype
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference OptimizationCode1
LLMs and Memorization: On Quality and Specificity of Copyright ComplianceCode0
Data-augmented phrase-level alignment for mitigating object hallucination0
RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language Models0
Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action0
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language ModelsCode1
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V TrustworthinessCode11
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings0
Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language TasksCode0
GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases0
Large Language Model Pruning0
Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-ImprovementCode2
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems0
Scaling Laws for Discriminative Classification in Large Language Models0
DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image PerceptionCode1
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced OptimizationCode0
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMsCode1
Calibrated Self-Rewarding Vision Language ModelsCode2
RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language ModelsCode3
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models0
Less for More: Enhanced Feedback-aligned Mixed LLMs for Molecule Caption Generation and Fine-Grained NLI Evaluation0
Gradient Projection For Continual Parameter-Efficient Tuning0
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models0
GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games0
Presentations are not always linear! GNN meets LLM for Document-to-Presentation Transformation with Attribution0
Show:102550
← PrevPage 40 of 73Next →

No leaderboard results yet.