Visual Prompting

Visual Prompting is the task of streamlining computer vision processes by harnessing the power of prompts, inspired by the breakthroughs of text prompting in NLP. This innovative approach involves using a few visual prompts to swiftly convert an unlabeled dataset into a deployed model, significantly reducing development time for both individual projects and enterprise solutions.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 127 papers

Title	Date	Tasks	Status	Hype	Score
Segment Anything	Apr 5, 2023	Event-based Object SegmentationImage Segmentation	CodeCode Available	5	5
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models	Apr 19, 2024	Language ModelingLanguage Modelling	CodeCode Available	4	5
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models	Jan 2, 2025	Scene Understandingtext annotation	CodeCode Available	4	5
Visual In-Context Prompting	Nov 22, 2023	DecoderSegmentation	CodeCode Available	4	5
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V	Oct 17, 2023	Interactive SegmentationReferring Expression	CodeCode Available	4	5
Generative Multimodal Models are In-Context Learners	Dec 20, 2023	In-Context LearningPersonalized Image Generation	CodeCode Available	3	5
Explicit Visual Prompting for Low-Level Structure Segmentations	Mar 20, 2023	Camouflaged Object SegmentationDefocus Blur Detection	CodeCode Available	2	5
Visual Prompting via Image Inpainting	Sep 1, 2022	ColorizationEdge Detection	CodeCode Available	2	5
Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning	May 9, 2024	parameter-efficient fine-tuningVisual Prompting	CodeCode Available	2	5
Exploring Visual Prompts for Adapting Large-Scale Models	Mar 31, 2022	Visual Prompting	CodeCode Available	2	5
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want	Mar 29, 2024	Instruction FollowingLanguage Modelling	CodeCode Available	2	5
Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction	Mar 10, 2025	Autonomous DrivingScene Understanding	CodeCode Available	2	5
Attention Prompting on Image for Large Vision-Language Models	Sep 25, 2024	MM-VetVisual Prompting	CodeCode Available	2	5
Explicit Visual Prompting for Universal Foreground Segmentations	May 29, 2023	Camouflaged Object SegmentationDefocus Blur Detection	CodeCode Available	2	5
Tokenize Anything via Prompting	Dec 14, 2023	DecoderVisual Prompting	CodeCode Available	2	5
Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models	Jun 5, 2024	Few-Shot LearningLanguage Modeling	CodeCode Available	2	5
BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning	Mar 26, 2023	Transfer LearningVisual Prompting	CodeCode Available	1	5
GeoSAM: Fine-tuning SAM with Multi-Modal Prompts for Mobility Infrastructure Segmentation	Nov 19, 2023	Image SegmentationLarge Language Model	CodeCode Available	1	5
Dynamic Domains, Dynamic Solutions: DPCore for Continual Test-Time Adaptation	Jun 15, 2024	Test-time AdaptationVisual Prompting	CodeCode Available	1	5
Fine-Grained Visual Prompting	Jun 7, 2023	Visual Prompting	CodeCode Available	1	5
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation	Feb 2, 2025	Inductive BiasVisual Prompting	CodeCode Available	1	5
Diversity-Aware Meta Visual Prompting	Mar 14, 2023	DiversityVisual Prompting	CodeCode Available	1	5
AutoVP: An Automated Visual Prompting Framework and Benchmark	Oct 12, 2023	image-classificationImage Classification	CodeCode Available	1	5
EZ-CLIP: Efficient Zeroshot Video Action Recognition	Dec 13, 2023	Action RecognitionGPU	CodeCode Available	1	5
Exploring the Transferability of Visual Prompting for Multimodal Large Language Models	Apr 17, 2024	HallucinationMultimodal Reasoning	CodeCode Available	1	5

Show:10 25 50

← PrevPage 1 of 6Next →

No leaderboard results yet.