Visual Prompting

Visual Prompting applies the prompting paradigm that proved successful for text in NLP to computer vision. A small number of visual prompts is used to turn an unlabeled dataset into a deployed model, which can substantially reduce development time for both individual projects and enterprise solutions.

Papers


Showing 1–25 of 127 papers

Title	Date	Tasks	Status	Hype
Stepwise Decomposition and Dual-stream Focus: A Novel Approach for Training-free Camouflaged Object Segmentation	Jun 7, 2025	Camouflaged Object Segmentation, Feature Correlation	Code Available	0
RSVP: Reasoning Segmentation via Visual Prompting and Multi-modal Chain-of-Thought	Jun 4, 2025	Multimodal Reasoning, Reasoning Segmentation	Unverified	0
Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering	May 30, 2025	Language Modeling, Language Modelling	Unverified	0
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models	May 29, 2025	Visual Prompting	Unverified	0
A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis	May 29, 2025	Diagnostic, Visual Prompting	Unverified	0
VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation	May 21, 2025	parameter-efficient fine-tuning, Semantic Segmentation	Unverified	0
Vision Graph Prompting via Semantic Low-Rank Decomposition	May 7, 2025	parameter-efficient fine-tuning, Visual Prompting	Code Available	1
Token Coordinated Prompt Attention is Needed for Visual Prompting	May 5, 2025	Diversity, Visual Prompting	Code Available	1
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM	Apr 30, 2025	Image Captioning, Object Recognition	Unverified	0
Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models	Apr 30, 2025	Hallucination, Object	Unverified	0
RadSAM: Segmenting 3D radiological images with a 2D promptable model	Apr 29, 2025	Image Segmentation, Medical Image Segmentation	Unverified	0
Visual and textual prompts for enhancing emotion recognition in video	Apr 24, 2025	Emotion Recognition, Video Emotion Recognition	Unverified	0
NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation	Apr 20, 2025	3D Instance Segmentation, 3D Open-Vocabulary Instance Segmentation	Unverified	0
Visual Prompting for One-shot Controllable Video Editing without Inversion	Apr 19, 2025	Video Editing, Visual Prompting	Unverified	0
EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery	Apr 17, 2025	Large Language Model, Multi-Task Learning	Unverified	0
Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval	Apr 2, 2025	Image Retrieval, Retrieval	Unverified	0
Is Temporal Prompting All We Need For Limited Labeled Action Recognition?	Apr 2, 2025	Action Recognition, All	Unverified	0
Towards Online Multi-Modal Social Interaction Understanding	Mar 25, 2025	Visual Prompting	Code Available	0
VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis	Mar 20, 2025	parameter-efficient fine-tuning, Visual Prompting	Unverified	0
3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o	Mar 17, 2025	Logical Reasoning, Prompt Engineering	Unverified	0
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation	Mar 13, 2025	Object, Visual Prompting	Unverified	0
Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction	Mar 10, 2025	Autonomous Driving, Scene Understanding	Code Available	2
Towards Universal Text-driven CT Image Segmentation	Mar 8, 2025	Computed Tomography (CT), Contrastive Learning	Code Available	0
Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity	Mar 8, 2025	Depth Estimation, Scene Understanding	Code Available	0
The Role of Background Information in Reducing Object Hallucination in Vision-Language Models: Insights from Cutoff API Prompting	Feb 21, 2025	Hallucination, Object	Unverified	0


No leaderboard results yet.