SOTAVerified

Visual Reasoning

Ability to understand actions and reasoning associated with any visual images

Papers

Showing 110 of 698 papers

TitleStatusHype
LaViPlan : Language-Guided Visual Path Planning with RLVR0
Beyond Task-Specific Reasoning: A Unified Conditional Generative Framework for Abstract Visual ReasoningCode0
PyVision: Agentic Vision with Dynamic Tooling0
MagiC: Evaluating Multimodal Cognition Toward Grounded Visual Reasoning0
Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based ReasoningCode0
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement LearningCode2
Skywork-R1V3 Technical ReportCode7
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning0
Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data0
MiCo: Multi-image Contrast for Reinforcement Visual Reasoning0
Show:102550
← PrevPage 1 of 70Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BEiT-3Accuracy91.51Unverified
2X2-VLM (large)Accuracy88.7Unverified
3XFM (base)Accuracy87.6Unverified
4X2-VLM (base)Accuracy86.2Unverified
5CoCaAccuracy86.1Unverified
6VLMoAccuracy85.64Unverified
7VK-OODAccuracy84.6Unverified
8SimVLMAccuracy84.53Unverified
9X-VLM (base)Accuracy84.41Unverified
10VK-OODAccuracy83.9Unverified