SOTAVerified|Agents Browse Leaderboard About Blog

Visual Prompting

Visual Prompting is the task of streamlining computer vision processes by harnessing the power of prompts, inspired by the breakthroughs of text prompting in NLP. This innovative approach involves using a few visual prompts to swiftly convert an unlabeled dataset into a deployed model, significantly reducing development time for both individual projects and enterprise solutions.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 127 papers

Title	Date	Tasks	Status	Hype
Segment Anything	Apr 5, 2023	Event-based Object SegmentationImage Segmentation	CodeCode Available	5
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models	Jan 2, 2025	Scene Understandingtext annotation	CodeCode Available	4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models	Apr 19, 2024	Language ModelingLanguage Modelling	CodeCode Available	4
Visual In-Context Prompting	Nov 22, 2023	DecoderSegmentation	CodeCode Available	4
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V	Oct 17, 2023	Interactive SegmentationReferring Expression	CodeCode Available	4
Generative Multimodal Models are In-Context Learners	Dec 20, 2023	In-Context LearningPersonalized Image Generation	CodeCode Available	3
Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction	Mar 10, 2025	Autonomous DrivingScene Understanding	CodeCode Available	2
Attention Prompting on Image for Large Vision-Language Models	Sep 25, 2024	MM-VetVisual Prompting	CodeCode Available	2
Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models	Jun 5, 2024	Few-Shot LearningLanguage Modeling	CodeCode Available	2
Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning	May 9, 2024	parameter-efficient fine-tuningVisual Prompting	CodeCode Available	2

Show:10 25 50

← PrevPage 1 of 13Next →

No leaderboard results yet.