SOTAVerified

Caption Generation

Papers

Showing 151-200 of 310 papers

Title | Status | Hype
Sequence to Sequence - Video to Text | | 0
Set Prediction Guided by Semantic Concepts for Diverse Video Captioning | | 0
Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition | | 0
Skip-Gram − Zipf + Uniform = Vector Additivity | | 0
SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | | 0
Social Media Ready Caption Generation for Brands | | 0
Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection | | 0
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning | | 0
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning | | 0
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation | | 0
Structural and Functional Decomposition for Personality Image Captioning in a Communication Game | | 0
StyleNet: Generating Attractive Visual Captions With Styles | | 0
Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models | | 0
Temporal Knowledge-Aware Image Captioning | | 0
Temporal Object Captioning for Street Scene Videos from LiDAR Tracks | | 0
THE DCASE 2021 CHALLENGE TASK 6 SYSTEM: AUTOMATED AUDIO CAPTIONING WITH WEAKLY SUPERVISED PRE-TRAING AND WORD SELECTION METHODS | | 0
The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation | | 0
The Solution for the ICCV 2023 1st Scientific Figure Captioning Challenge | | 0
The Use of Object Labels and Spatial Prepositions as Keywords in a Web-Retrieval-Based Image Caption Generation System | | 0
Time Series Language Model for Descriptive Caption Generation | | 0
TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation | | 0
Topic Scene Graph Generation by Attention Distillation From Caption | | 0
TPsgtR: Neural-Symbolic Tensor Product Scene-Graph-Triplet Representation for Image Captioning | | 0
Uncertainty-Aware Image Captioning | | 0
Understanding How Paper Writers Use AI-Generated Captions in Figure Caption Writing | | 0
Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning | | 0
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards | | 0
UNISON: Unpaired Cross-lingual Image Captioning | | 0
ViCo: Engaging Video Comment Generation with Human Preference Rewards | | 0
Video Caption Dataset for Describing Human Actions in Japanese | | 0
Video Captioning in Compressed Video | | 0
Video Captioning with Guidance of Multimodal Latent Topics | | 0
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives | | 0
Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning | | 0
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation | | 0
WAT2019: English-Hindi Translation on Hindi Visual Genome Dataset | | 0
Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models | | 0
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching | | 0
What is not where: the challenge of integrating spatial representations into deep learning architectures | | 0
Word to Sentence Visual Semantic Similarity for Caption Generation: Lessons Learned | | 0
XMeCap: Meme Caption Generation with Sub-Image Adaptability | | 0
YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension | | 0
3G structure for image caption generation | | 0
3M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model | | 0
A Comparative Study of Pre-trained CNNs and GRU-Based Attention for Image Caption Generation | | 0
A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism | | 0
Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation | | 0
AIC-AB NET: A Neural Network for Image Captioning with Spatial Attention and Text Attributes | | 0
Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning | | 0
Aligning Images and Text with Semantic Role Labels for Fine-Grained Cross-Modal Understanding | | 0
Page 4 of 7

No leaderboard results yet.