SOTAVerified|Agents Browse Leaderboard About Blog

Visual Prompting

Visual Prompting is the task of streamlining computer vision processes by harnessing the power of prompts, inspired by the breakthroughs of text prompting in NLP. This innovative approach involves using a few visual prompts to swiftly convert an unlabeled dataset into a deployed model, significantly reducing development time for both individual projects and enterprise solutions.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 61–70 of 127 papers

Title	Date	Tasks	Status	Score
Towards Online Multi-Modal Social Interaction Understanding	Mar 25, 2025	Visual Prompting	CodeCode Available	5
Benchmarking Human and Automated Prompting in the Segment Anything Model	Oct 29, 2024	BenchmarkingImage Segmentation	CodeCode Available	5
VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis	Mar 20, 2025	parameter-efficient fine-tuningVisual Prompting	—Unverified	0
WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning	Nov 8, 2024	In-Context LearningQuestion Answering	—Unverified	0
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model	Aug 1, 2024	EgoSchemaLanguage Modeling	—Unverified	0
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM	Apr 30, 2025	Image CaptioningObject Recognition	—Unverified	0
3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V	Dec 15, 2023	3D Object Detectionobject-detection	—Unverified	0
3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o	Mar 17, 2025	Logical ReasoningPrompt Engineering	—Unverified	0
A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis	May 29, 2025	DiagnosticVisual Prompting	—Unverified	0
Affordance-Guided Reinforcement Learning via Visual Prompting	Jul 14, 2024	reinforcement-learningReinforcement Learning	—Unverified	0

Show:10 25 50

← PrevPage 7 of 13Next →

No leaderboard results yet.