SOTAVerified

Image to text

Papers

Showing 81–90 of 246 papers

Title | Status | Hype
Libra: Building Decoupled Vision System on Large Language Models | Code | 2
DOCCI: Descriptions of Connected and Contrasting Images | | 0
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation | | 0
Leveraging AI to Generate Audio for User-generated Content in Video Games | | 0
VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations | Code | 0
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? | Code | 1
Do LLMs Understand Visual Anomalies? Uncovering LLM's Capabilities in Zero-shot Anomaly Detection | | 0
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Code | 2
OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation | | 0
Evaluating Text-to-Visual Generation with Image-to-Text Generation | Code | 3

No leaderboard results yet.