SOTAVerified

Object Hallucination

Papers

Showing 1–50 of 71 papers

Title | Status | Hype
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness | Code | 11
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models | Code | 7
Ferret: Refer and Ground Anything Anywhere at Any Granularity | Code | 5
Evaluating Object Hallucination in Large Vision-Language Models | Code | 2
ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models | Code | 2
Mitigating Object Hallucination via Concentric Causal Attention | Code | 2
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding | Code | 2
TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models | Code | 2
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding | Code | 2
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models | Code | 2
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models | Code | 2
Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models | Code | 2
Towards Enhanced Image Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency | Code | 1
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models | Code | 1
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Code | 1
Detecting and Preventing Hallucinations in Large Vision Language Models | Code | 1
EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models | Code | 1
Extract Free Dense Misalignment from CLIP | Code | 1
HallE-Control: Controlling Object Hallucination in Large Multimodal Models | Code | 1
HyperPocket: Generative Point Cloud Completion | Code | 1
Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning | Code | 1
Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models | Code | 1
Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites | Code | 1
Mitigating Hallucinations in Large Vision-Language Models via Summary-Guided Decoding | Code | 1
Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow | Code | 1
Multi-Object Hallucination in Vision-Language Models | Code | 1
Revisit What You See: Disclose Language Prior in Vision Tokens for Efficient Guided Decoding of LVLMs | Code | 1
Seeing is Believing: Mitigating Hallucination in Large Vision-Language Models via CLIP-Guided Decoding | Code | 1
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning | Code | 1
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention | Code | 1
Retrieval Visual Contrastive Decoding to Mitigate Object Hallucinations in Large Vision-Language Models | Code | 0
Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning Models | Code | 0
Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training | Code | 0
Understanding Multimodal Hallucination with Parameter-Free Representation Alignment | Code | 0
SECOND: Mitigating Perceptual Hallucination in Vision-Language Models via Selective and Contrastive Decoding | Code | 0
Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models | Code | 0
HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models | Code | 0
Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks | Code | 0
Vision-Encoders (Already) Know What They See: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore | Code | 0
Instruction Makes a Difference | Code | 0
Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Code | 0
Object Hallucination in Image Captioning | Code | 0
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting |  | 0
Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs |  | 0
Reducing Object Hallucination in Large Audio-Language Models via Audio-Aware Decoding |  | 0
Relational Graph Learning for Grounded Video Description Generation |  | 0
ALOHa: A New Measure for Hallucination in Captioning Models |  | 0
RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language Models |  | 0
Consensus Graph Representation Learning for Better Grounded Image Captioning |  | 0
Seeing What's Not There: Spurious Correlation in Multimodal LLMs |  | 0

No leaderboard results yet.