SOTAVerified|Agents Browse Leaderboard About Blog

Image Comprehension

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 31–40 of 49 papers

Title	Date	Tasks	Status	Hype	Score
CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation	Mar 7, 2025	Image ComprehensionMemorization	—Unverified	0	0
CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs	Jan 5, 2024	Image ComprehensionImage to text	—Unverified	0	0
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs	May 30, 2025	DiagnosticImage Comprehension	—Unverified	0	0
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM	Dec 12, 2024	Image ComprehensionImage Generation	—Unverified	0	0
FullAnno: A Data Engine for Enhancing Image Comprehension of MLLMs	Sep 20, 2024	Image CaptioningImage Comprehension	—Unverified	0	0
Hidden flaws behind expert-level accuracy of multimodal GPT-4 vision in medicine	Jan 16, 2024	DiagnosticImage Comprehension	—Unverified	0	0
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output	Jul 3, 2024	ArticlesImage Comprehension	—Unverified	0	0
IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web	Sep 14, 2024	Image Comprehension	—Unverified	0	0
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models	Jan 10, 2025	FormImage Comprehension	—Unverified	0	0
Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA	Jan 29, 2024	BenchmarkingImage Comprehension	—Unverified	0	0

Show:10 25 50

← PrevPage 4 of 5Next →

No leaderboard results yet.