SOTAVerified

Visual Prompting

Visual Prompting applies the prompting paradigm, popularized by text prompting in NLP, to computer vision. A few visual prompts are used to turn an unlabeled dataset into a deployed model, with the aim of reducing development time for both individual projects and enterprise solutions.

Papers

Showing 26–50 of 127 papers

Title | Status | Hype
From PowerPoint UI Sketches to Web-Based Applications: Pattern-Driven Code Generation for GIS Dashboard Development Using Knowledge-Augmented LLMs, Context-Aware Visual Prompting, and the React Framework | - | 0
Personalization Toolkit: Training Free Personalization of Large Vision Language Models | - | 0
Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling | - | 0
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation | Code | 1
IP-Prompter: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting | Code | 0
MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention | - | 0
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models | Code | 4
Query Efficient Black-Box Visual Prompting with Subspace Learning | - | 0
Visual Prompting with Iterative Refinement for Design Critique Generation | - | 0
Selective Visual Prompting in Vision Mamba | Code | 1
Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting | - | 0
Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning | Code | 1
MLLM-Search: A Zero-Shot Approach to Finding People using Multimodal Large Language Models | - | 0
Improved GUI Grounding via Iterative Narrowing | Code | 1
Prompting the Unseen: Detecting Hidden Backdoors in Black-Box Models | - | 0
WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning | - | 0
Benchmarking Human and Automated Prompting in the Segment Anything Model | Code | 0
Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms | - | 0
Visual Prompting in LLMs for Enhancing Emotion Recognition | - | 0
Improving Visual Object Tracking through Visual Prompting | Code | 1
GSON: A Group-based Social Navigation Framework with Large Multimodal Model | - | 0
Attention Prompting on Image for Large Vision-Language Models | Code | 2
Cycle-Consistency Uncertainty Estimation for Visual Prompting based One-Shot Defect Segmentation | - | 0
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting | - | 0
Visual Prompting in Multimodal Large Language Models: A Survey | - | 0

No leaderboard results yet.