SOTAVerified

Image Comprehension

Papers

Showing 3140 of 49 papers

TitleStatusHype
FullAnno: A Data Engine for Enhancing Image Comprehension of MLLMs0
IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web0
Alleviating Hallucination in Large Vision-Language Models with Active Retrieval Augmentation0
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and OutputCode0
Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP0
VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-TuningCode0
Multiplane Prior Guided Few-Shot Aerial Scene Rendering0
MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained ClassificationCode0
Rec-GPT4V: Multimodal Recommendation with Large Vision-Language Models0
Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA0
Show:102550
← PrevPage 4 of 5Next →

No leaderboard results yet.