SOTAVerified

Image to text

Papers

Showing 5175 of 246 papers

TitleStatusHype
TrojVLM: Backdoor Attack Against Vision Language Models0
Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization0
Evaluating authenticity and quality of image captions via sentiment and semantic analyses0
See or Guess: Counterfactually Regularized Image CaptioningCode1
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and GenerationCode1
Ask, Attend, Attack: A Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models0
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic SegmentationCode2
Instruction Tuning-free Visual Token Complement for Multimodal LLMs0
GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language ModelsCode0
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local SimilaritiesCode2
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic0
GPC: Generative and General Pathology Image Classifier0
LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image RetrievalCode2
15M Multimodal Facial Image-Text Dataset0
Towards a text-based quantitative and explainable histopathology image analysisCode0
Vision-Braille: An End-to-End Tool for Chinese Braille Image-to-Text Translation0
HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels0
Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything0
A Data-Driven Guided Decoding Mechanism for Diagnostic CaptioningCode0
Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags0
BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image RetrievalCode0
CMC-Bench: Towards a New Paradigm of Visual Signal CompressionCode1
Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval0
Benchmarking Vision-Language Contrastive Methods for Medical Representation LearningCode0
AICoderEval: Improving AI Domain Code Generation of Large Language Models0
Show:102550
← PrevPage 3 of 10Next →

No leaderboard results yet.