SOTAVerified

Visual Prompting

Visual Prompting is the task of streamlining computer vision processes by harnessing the power of prompts, inspired by the breakthroughs of text prompting in NLP. This innovative approach involves using a few visual prompts to swiftly convert an unlabeled dataset into a deployed model, significantly reducing development time for both individual projects and enterprise solutions.

Papers

Showing 5160 of 127 papers

TitleStatusHype
When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood PerspectiveCode0
Open-Vocabulary Action Localization with Iterative Visual PromptingCode1
Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models0
Targeted Visual Prompting for Medical Visual Question AnsweringCode0
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model0
Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM0
EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote SensingCode1
By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual PromptingCode1
Affordance-Guided Reinforcement Learning via Visual Prompting0
UICrit: Enhancing Automated Design Evaluation with a UICritique DatasetCode0
Show:102550
← PrevPage 6 of 13Next →

No leaderboard results yet.