SOTAVerified

Caption Generation

Papers

Showing 151200 of 310 papers

TitleStatusHype
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding0
Controllable Video Captioning with an Exemplar SentenceCode1
Image Caption Generation Framework for Assamese News using Attention Mechanism0
Multi-modal Dependency Tree for Video Captioning0
CLIP Meets Video Captioning: Concept-Aware Representation Learning Does MatterCode0
SwinBERT: End-to-End Transformers with Sparse Attention for Video CaptioningCode1
E-MMAD: Multimodal Advertising Caption Generation Based on Structured Information0
Temporal Knowledge-Aware Image Captioning0
AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGSCode0
Rˆ3Net:Relation-embedded Representation Reconstruction Network for Change CaptioningCode0
Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder NetworkCode0
Cortico-cerebellar networks as decoupling neural interfacesCode0
R^3Net:Relation-embedded Representation Reconstruction Network for Change CaptioningCode0
Topic Scene Graph Generation by Attention Distillation from CaptionCode1
Geometry-Entangled Visual Semantic Transformer for Image Captioning0
Scene Graph Generation for Better Image Captioning?0
Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models0
COSMic: A Coherence-Aware Generation Metric for Image DescriptionsCode1
Journalistic Guidelines Aware News Image CaptioningCode0
LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption GenerationCode0
Goal-driven text descriptions for images0
Table Caption Generation in Scholarly Documents Leveraging Pre-trained Language ModelsCode0
End-to-End Dense Video Captioning with Parallel DecodingCode1
Caption Generation on Scenes with Seen and Unseen Object Categories0
O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning0
NLPHut’s Participation at WAT20210
A Thorough Review on Recent Deep Learning Methodologies for Image Captioning0
Global Object Proposals for Improving Multi-Sentence Video DescriptionsCode0
An encoder-decoder based framework for hindi image caption generation0
Controlled Caption Generation for Images Through Adversarial Attacks0
THE DCASE 2021 CHALLENGE TASK 6 SYSTEM: AUTOMATED AUDIO CAPTIONING WITH WEAKLY SUPERVISED PRE-TRAING AND WORD SELECTION METHODS0
Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object LocalizationCode1
Error Causal inference for Multi-Fusion models0
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching0
Empirical Analysis of Image Caption Generation using Deep Learning0
Connecting What to Say With Where to Look by Modeling Human Attention TracesCode1
Towards Accurate Text-based Image Captioning with Content Diversity ExplorationCode1
Human-like Controllable Image Captioning with Verb-specific Semantic RolesCode1
3M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model0
Knowledge driven Description Synthesis for Floor Plan Interpretation0
Relationship-based Neural Baby Talk0
Analysis of Convolutional Decoder for Image Caption Generation0
Comparative evaluation of CNN architectures for Image Caption GenerationCode0
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual ConceptsCode1
Video Captioning in Compressed Video0
Topic Scene Graph Generation by Attention Distillation From Caption0
Cortico-cerebellar networks as decoupled neural interfaces0
Image to Bengali Caption Generation Using Deep CNN and Bidirectional Gated Recurrent Unit0
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer NetworkCode1
TAP: Text-Aware Pre-training for Text-VQA and Text-CaptionCode1
Show:102550
← PrevPage 4 of 7Next →

No leaderboard results yet.