SOTAVerified

Visual Prompting

Visual Prompting is the task of streamlining computer vision processes by harnessing the power of prompts, inspired by the breakthroughs of text prompting in NLP. This innovative approach involves using a few visual prompts to swiftly convert an unlabeled dataset into a deployed model, significantly reducing development time for both individual projects and enterprise solutions.

Papers

Showing 110 of 127 papers

TitleStatusHype
Segment AnythingCode5
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language ModelsCode4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language ModelsCode4
Visual In-Context PromptingCode4
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4VCode4
Generative Multimodal Models are In-Context LearnersCode3
Chameleon: Fast-slow Neuro-symbolic Lane Topology ExtractionCode2
Attention Prompting on Image for Large Vision-Language ModelsCode2
Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language ModelsCode2
Memory-Space Visual Prompting for Efficient Vision-Language Fine-TuningCode2
Show:102550
← PrevPage 1 of 13Next →

No leaderboard results yet.