| Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder Network | Oct 24, 2021 | Caption Generation, Decoder | Code Available | 0 |
| Cortico-cerebellar networks as decoupling neural interfaces | Oct 21, 2021 | Caption Generation | Code Available | 0 |
| R^3Net: Relation-embedded Representation Reconstruction Network for Change Captioning | Oct 20, 2021 | Caption Generation, Relation | Code Available | 0 |
| Topic Scene Graph Generation by Attention Distillation from Caption | Oct 12, 2021 | Caption Generation, Graph Generation | Code Available | 1 |
| Geometry-Entangled Visual Semantic Transformer for Image Captioning | Sep 29, 2021 | Caption Generation, Image Captioning | Unverified | 0 |
| Scene Graph Generation for Better Image Captioning? | Sep 23, 2021 | Caption Generation, Graph Generation | Unverified | 0 |
| Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models | Sep 17, 2021 | Caption Generation, Denoising | Unverified | 0 |
| COSMic: A Coherence-Aware Generation Metric for Image Descriptions | Sep 11, 2021 | Caption Generation, Image Captioning | Code Available | 1 |
| Journalistic Guidelines Aware News Image Captioning | Sep 7, 2021 | Caption Generation, Descriptive | Code Available | 0 |
| LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation | Sep 4, 2021 | Caption Generation, Image Captioning | Code Available | 0 |