SOTAVerified

Visual Reasoning

Ability to understand actions and reasoning associated with any visual images

Papers

Showing 110 of 698 papers

TitleStatusHype
LaViPlan : Language-Guided Visual Path Planning with RLVR0
Beyond Task-Specific Reasoning: A Unified Conditional Generative Framework for Abstract Visual ReasoningCode0
PyVision: Agentic Vision with Dynamic Tooling0
MagiC: Evaluating Multimodal Cognition Toward Grounded Visual Reasoning0
Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based ReasoningCode0
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement LearningCode2
Skywork-R1V3 Technical ReportCode7
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning0
Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data0
MiCo: Multi-image Contrast for Reinforcement Visual Reasoning0
Show:102550
← PrevPage 1 of 70Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BEiT-3Accuracy92.58Unverified
2X2-VLM (large)Accuracy89.4Unverified
3XFM (base)Accuracy88.4Unverified
4CoCaAccuracy87Unverified
5X2-VLM (base)Accuracy87Unverified
6VLMoAccuracy86.86Unverified
7SimVLMAccuracy85.15Unverified
8X-VLM (base)Accuracy84.76Unverified
9BLIP-129MAccuracy83.09Unverified
10ALBEF (14M)Accuracy82.55Unverified