SOTAVerified

Visual Prompting

Visual Prompting is the task of streamlining computer vision processes by harnessing the power of prompts, inspired by the breakthroughs of text prompting in NLP. This innovative approach involves using a few visual prompts to swiftly convert an unlabeled dataset into a deployed model, significantly reducing development time for both individual projects and enterprise solutions.

Papers

Showing 110 of 127 papers

TitleStatusHype
Segment AnythingCode5
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language ModelsCode4
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4VCode4
Visual In-Context PromptingCode4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language ModelsCode4
Generative Multimodal Models are In-Context LearnersCode3
Chameleon: Fast-slow Neuro-symbolic Lane Topology ExtractionCode2
Memory-Space Visual Prompting for Efficient Vision-Language Fine-TuningCode2
Attention Prompting on Image for Large Vision-Language ModelsCode2
Explicit Visual Prompting for Low-Level Structure SegmentationsCode2
Show:102550
← PrevPage 1 of 13Next →

No leaderboard results yet.