SOTAVerified

Caption Generation

Papers

Showing 71–80 of 310 papers

| Title | Status | Hype |
| --- | --- | --- |
| D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding | | 0 |
| BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving | | 0 |
| EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer | | 0 |
| FaceGemma: Enhancing Image Captioning with Facial Attributes for Portrait Images | | 0 |
| Cross-modal Coherence Modeling for Caption Generation | | 0 |
| Cross-Lingual Image Caption Generation | | 0 |
| CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving | | 0 |
| Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains | | 0 |
| A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism | | 0 |
| Analysis of Convolutional Decoder for Image Caption Generation | | 0 |

No leaderboard results yet.