Deep Learning Approaches on Image Captioning: A Review | Jan 31, 2022 | Caption Generation, Deep Learning | Unverified | 0
A Frustratingly Simple Approach for End-to-End Image Captioning | Jan 30, 2022 | Decoder, Image Captioning | Unverified | 0
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | Jan 28, 2022 | Image Captioning, Image-text matching | Code Available | 5
An Integrated Approach for Video Captioning and Applications | Jan 23, 2022 | Image Captioning, Video Captioning | Unverified | 0
Visual Information Guided Zero-Shot Paraphrase Generation | Jan 22, 2022 | Diversity, Image Captioning | Code Available | 0
Discovering Non-Monotonic Autoregressive Ordering for Text Generation Models using Sinkhorn Distributions | Jan 17, 2022 | Code Generation, Decoder | Unverified | 0
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset | Jan 16, 2022 | Image Captioning, Model Selection | Unverified | 0
Transparent Human Evaluation for Image Captioning | Jan 16, 2022 | Image Captioning | Unverified | 0
All You May Need for VQA are Image Captions | Jan 16, 2022 | All, Image Captioning | Unverified | 0
Long-Tail Classification for Distinctive Image Captioning: A Simple yet Effective Remedy for Side Effects of Reinforcement Learning | Jan 16, 2022 | Image Captioning, Reinforcement Learning (RL) | Unverified | 0
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand | Jan 16, 2022 | Image Captioning, Machine Translation | Unverified | 0
Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training | Jan 11, 2022 | Decoder, Image Captioning | Unverified | 0
Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image Cropping | Jan 7, 2022 | Image Captioning, Image Cropping | Unverified | 0
Compact Bidirectional Transformer for Image Captioning | Jan 6, 2022 | Decoder, Image Captioning | Code Available | 1
Synthesizer Based Efficient Self-Attention for Vision Tasks | Jan 5, 2022 | Image Captioning, image-classification | Unverified | 0
Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety | Jan 4, 2022 | Decoder, Deep Learning | Unverified | 0
StyleM: Stylized Metrics for Image Captioning Built with Contrastive N-grams | Jan 4, 2022 | Image Captioning | Unverified | 0
DIFNet: Boosting Visual Information Flow for Image Captioning | Jan 1, 2022 | Image Captioning, Prediction | Unverified | 0
DeeCap: Dynamic Early Exiting for Efficient Image Captioning | Jan 1, 2022 | Image Captioning, Imitation Learning | Code Available | 1
Show, Deconfound and Tell: Image Captioning With Causal Inference | Jan 1, 2022 | Causal Inference, Decoder | Code Available | 1
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation | Dec 31, 2021 | Image Captioning, Image Generation | Code Available | 1
Knowledge Matters: Radiology Report Generation with General and Specific Knowledge | Dec 30, 2021 | Decoder, General Knowledge | Unverified | 0
Extended Self-Critical Pipeline for Transforming Videos to Text (TRECVID-VTT Task 2021) -- Team: MMCUniAugsburg | Dec 28, 2021 | Image Captioning | Unverified | 0
Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation | Dec 28, 2021 | Image Captioning, Machine Translation | Unverified | 0
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision | Dec 27, 2021 | Classification, Image Captioning | Unverified | 0
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks | Dec 13, 2021 | Image Captioning, Transfer Learning | Code Available | 1
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning | Dec 13, 2021 | Caption Generation, Descriptive | Unverified | 0
Injecting Semantic Concepts into End-to-End Image Captioning | Dec 9, 2021 | Caption Generation, Image Captioning | Code Available | 1
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand | Dec 8, 2021 | Image Captioning, Machine Translation | Code Available | 1
Protecting Intellectual Property of Language Generation APIs with Lexical Watermark | Dec 5, 2021 | Document Summarization, Image Captioning | Code Available | 0
Consensus Graph Representation Learning for Better Grounded Image Captioning | Dec 2, 2021 | Graph Representation Learning, Hallucination | Unverified | 0
Object-Centric Unsupervised Image Captioning | Dec 2, 2021 | Image Captioning, Object | Code Available | 0
Image2tweet: Datasets in Hindi and English for Generating Tweets from Images | Dec 1, 2021 | Image Captioning, World Knowledge | Code Available | 0
A Scaled Encoder Decoder Network for Image Captioning in Hindi | Dec 1, 2021 | Decoder, Deep Learning | Unverified | 0
Image Caption Generation Framework for Assamese News using Attention Mechanism | Dec 1, 2021 | Caption Generation, Decoder | Unverified | 0
Set Prediction in the Latent Space | Dec 1, 2021 | Image Captioning, object-detection | Code Available | 0
Neural Attention for Image Captioning: Review of Outstanding Methods | Nov 29, 2021 | Decoder, Deep Learning | Unverified | 0
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic | Nov 29, 2021 | Contrastive Learning, Descriptive | Code Available | 1
Scene Graph Generation with Geometric Context | Nov 25, 2021 | Activity Recognition, Graph Generation | Unverified | 0
Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets | Nov 24, 2021 | Descriptive, Image Captioning | Unverified | 0
Scaling Up Vision-Language Pre-training for Image Captioning | Nov 24, 2021 | Attribute, Image Captioning | Unverified | 0
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image Captioning, Image Description | Code Available | 1
L-Verse: Bidirectional Generation Between Image and Text | Nov 22, 2021 | Image Captioning, Image Generation | Code Available | 1
UFO: A UniFied TransfOrmer for Vision-Language Representation Learning | Nov 19, 2021 | Image Captioning, Image-text matching | Unverified | 0
ClipCap: CLIP Prefix for Image Captioning | Nov 18, 2021 | Image Captioning, Language Modeling | Code Available | 2
Transparent Human Evaluation for Image Captioning | Nov 17, 2021 | Image Captioning | Code Available | 1
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation | Nov 16, 2021 | Image Captioning, Knowledge Distillation | Unverified | 0
Leveraging Visual Knowledge in Language Tasks: An Empirical Study on Intermediate Pre-training for Cross-Modal Knowledge Transfer | Nov 16, 2021 | Image Captioning, Language Modeling | Unverified | 0
On Vision Features in Multimodal Machine Translation | Nov 16, 2021 | Image Captioning, Machine Translation | Unverified | 0
Temporal Knowledge-Aware Image Captioning | Nov 16, 2021 | Caption Generation, Image Captioning | Unverified | 0