| NLPHut’s Participation at WAT2021 | Aug 1, 2021 | Caption Generation, Image Captioning | Unverified | 0 |
| A Thorough Review on Recent Deep Learning Methodologies for Image Captioning | Jul 28, 2021 | Caption Generation, Descriptive | Unverified | 0 |
| Global Object Proposals for Improving Multi-Sentence Video Descriptions | Jul 18, 2021 | Caption Generation, Dense Video Captioning | Code Available | 0 |
| An encoder-decoder based framework for hindi image caption generation | Jul 9, 2021 | Caption Generation, Decoder | Unverified | 0 |
| Controlled Caption Generation for Images Through Adversarial Attacks | Jul 7, 2021 | Caption Generation, Image Captioning | Unverified | 0 |
| THE DCASE 2021 CHALLENGE TASK 6 SYSTEM: AUTOMATED AUDIO CAPTIONING WITH WEAKLY SUPERVISED PRE-TRAING AND WORD SELECTION METHODS | Jul 6, 2021 | Audio captioning, Caption Generation | Unverified | 0 |
| Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization | Jun 11, 2021 | Caption Generation, Object | Code Available | 1 |
| Error Causal inference for Multi-Fusion models | Jun 1, 2021 | Caption Generation, Causal Inference | Unverified | 0 |
| Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching | May 18, 2021 | Caption Generation, Cross-Modal Retrieval | Unverified | 0 |
| Empirical Analysis of Image Caption Generation using Deep Learning | May 14, 2021 | Caption Generation, Decoder | Unverified | 0 |
| Connecting What to Say With Where to Look by Modeling Human Attention Traces | May 12, 2021 | Caption Generation, Image Captioning | Code Available | 1 |
| Towards Accurate Text-based Image Captioning with Content Diversity Exploration | Apr 23, 2021 | Caption Generation, Diversity | Code Available | 1 |
| Human-like Controllable Image Captioning with Verb-specific Semantic Roles | Mar 22, 2021 | Caption Generation, controllable image captioning | Code Available | 1 |
| 3M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model | Mar 20, 2021 | Caption Generation, Image Captioning | Unverified | 0 |
| Knowledge driven Description Synthesis for Floor Plan Interpretation | Mar 15, 2021 | Caption Generation, Descriptive | Unverified | 0 |
| Relationship-based Neural Baby Talk | Mar 8, 2021 | Caption Generation, Graph Attention | Unverified | 0 |
| Analysis of Convolutional Decoder for Image Caption Generation | Mar 8, 2021 | Caption Generation, Data Augmentation | Unverified | 0 |
| Comparative evaluation of CNN architectures for Image Caption Generation | Feb 23, 2021 | Caption Generation, Object Recognition | Code Available | 0 |
| Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts | Feb 17, 2021 | Caption Generation, Diversity | Code Available | 1 |
| Video Captioning in Compressed Video | Jan 2, 2021 | Caption Generation, Video Captioning | Unverified | 0 |
| Topic Scene Graph Generation by Attention Distillation From Caption | Jan 1, 2021 | Caption Generation, Graph Generation | Unverified | 0 |
| Cortico-cerebellar networks as decoupled neural interfaces | Jan 1, 2021 | Caption Generation | Unverified | 0 |
| Image to Bengali Caption Generation Using Deep CNN and Bidirectional Gated Recurrent Unit | Dec 22, 2020 | Caption Generation, Decoder | Unverified | 0 |
| Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network | Dec 13, 2020 | Caption Generation, Decoder | Code Available | 1 |
| TAP: Text-Aware Pre-training for Text-VQA and Text-Caption | Dec 8, 2020 | Caption Generation, Language Modeling | Code Available | 1 |