SOTAVerified

Image to text

Papers

Showing 4150 of 246 papers

TitleStatusHype
From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing0
Robotic State Recognition with Image-to-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization0
Semantic Editing Increment Benefits Zero-Shot Composed Image RetrievalCode2
Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)Code0
Beyond Color and Lines: Zero-Shot Style-Specific Image Variations with Coordinated Semantics0
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image0
An Online Learning Approach to Prompt-based Selection of Generative Models0
Patch is Enough: Naturalistic Adversarial Patch against Vision-Language Pre-training Models0
Backdooring Vision-Language Models with Out-Of-Distribution Data0
See then Tell: Enhancing Key Information Extraction with Vision Grounding0
Show:102550
← PrevPage 5 of 25Next →

No leaderboard results yet.