SOTAVerified

Caption Generation

Papers

Showing 151160 of 310 papers

TitleStatusHype
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding0
Controllable Video Captioning with an Exemplar SentenceCode1
Image Caption Generation Framework for Assamese News using Attention Mechanism0
Multi-modal Dependency Tree for Video Captioning0
CLIP Meets Video Captioning: Concept-Aware Representation Learning Does MatterCode0
SwinBERT: End-to-End Transformers with Sparse Attention for Video CaptioningCode1
E-MMAD: Multimodal Advertising Caption Generation Based on Structured Information0
Temporal Knowledge-Aware Image Captioning0
AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGSCode0
Rˆ3Net:Relation-embedded Representation Reconstruction Network for Change CaptioningCode0
Show:102550
← PrevPage 16 of 31Next →

No leaderboard results yet.