SOTAVerified

Visual Prompting

Visual Prompting is the task of streamlining computer vision processes by harnessing the power of prompts, inspired by the breakthroughs of text prompting in NLP. This innovative approach involves using a few visual prompts to swiftly convert an unlabeled dataset into a deployed model, significantly reducing development time for both individual projects and enterprise solutions.

Papers

Showing 6170 of 127 papers

TitleStatusHype
Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling0
Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models0
BLINK: Multimodal Large Language Models Can See but Not Perceive0
Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM0
Cycle-Consistency Uncertainty Estimation for Visual Prompting based One-Shot Defect Segmentation0
DegustaBot: Zero-Shot Visual Preference Estimation for Personalized Multi-Object Rearrangement0
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models0
EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery0
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting0
Explore until Confident: Efficient Exploration for Embodied Question Answering0
Show:102550
← PrevPage 7 of 13Next →

No leaderboard results yet.