SOTAVerified

Caption Generation

Papers

Showing 176200 of 310 papers

TitleStatusHype
Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning0
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards0
UNISON: Unpaired Cross-lingual Image Captioning0
ViCo: Engaging Video Comment Generation with Human Preference Rewards0
Video Caption Dataset for Describing Human Actions in Japanese0
Video Captioning in Compressed Video0
Video Captioning with Guidance of Multimodal Latent Topics0
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives0
Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning0
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation0
WAT2019: English-Hindi Translation on Hindi Visual Genome Dataset0
Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models0
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching0
What is not where: the challenge of integrating spatial representations into deep learning architectures0
Word to Sentence Visual Semantic Similarity for Caption Generation: Lessons Learned0
XMeCap: Meme Caption Generation with Sub-Image Adaptability0
YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension0
3G structure for image caption generation0
3M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model0
A Comparative Study of Pre-trained CNNs and GRU-Based Attention for Image Caption Generation0
A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism0
Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation0
AIC-AB NET: A Neural Network for Image Captioning with Spatial Attention and Text Attributes0
Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning0
Aligning Images and Text with Semantic Role Labels for Fine-Grained Cross-Modal Understanding0
Show:102550
← PrevPage 8 of 13Next →

No leaderboard results yet.