SOTAVerified

Object Hallucination

Papers

Showing 2650 of 71 papers

TitleStatusHype
Multi-Object Hallucination in Vision-Language ModelsCode1
Revisit What You See: Disclose Language Prior in Vision Tokens for Efficient Guided Decoding of LVLMsCode1
Seeing is Believing: Mitigating Hallucination in Large Vision-Language Models via CLIP-Guided DecodingCode1
Transferable Decoding with Visual Entities for Zero-Shot Image CaptioningCode1
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-InterventionCode1
Retrieval Visual Contrastive Decoding to Mitigate Object Hallucinations in Large Vision-Language ModelsCode0
Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning ModelsCode0
Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-trainingCode0
Understanding Multimodal Hallucination with Parameter-Free Representation AlignmentCode0
SECOND: Mitigating Perceptual Hallucination in Vision-Language Models via Selective and Contrastive DecodingCode0
Unified Triplet-Level Hallucination Evaluation for Large Vision-Language ModelsCode0
HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language ModelsCode0
Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language TasksCode0
Vision-Encoders (Already) Know What They See: Mitigating Object Hallucination via Simple Fine-Grained CLIPScoreCode0
Instruction Makes a DifferenceCode0
Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) ModelsCode0
Object Hallucination in Image CaptioningCode0
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting0
Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs0
Reducing Object Hallucination in Large Audio-Language Models via Audio-Aware Decoding0
Relational Graph Learning for Grounded Video Description Generation0
ALOHa: A New Measure for Hallucination in Captioning Models0
RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language Models0
Consensus Graph Representation Learning for Better Grounded Image Captioning0
Seeing What's Not There: Spurious Correlation in Multimodal LLMs0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.