SOTAVerified

Visual Prompting

Visual Prompting is the task of streamlining computer vision processes by harnessing the power of prompts, inspired by the breakthroughs of text prompting in NLP. This innovative approach involves using a few visual prompts to swiftly convert an unlabeled dataset into a deployed model, significantly reducing development time for both individual projects and enterprise solutions.

Papers

Showing 6170 of 127 papers

TitleStatusHype
Towards Online Multi-Modal Social Interaction UnderstandingCode0
Benchmarking Human and Automated Prompting in the Segment Anything ModelCode0
VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis0
WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning0
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model0
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM0
3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V0
3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o0
A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis0
Affordance-Guided Reinforcement Learning via Visual Prompting0
Show:102550
← PrevPage 7 of 13Next →

No leaderboard results yet.