Auto-Encoding Scene Graphs for Image Captioning Dec 6, 2018 Decoder Image Captioning
Code Code Available 05 Learning to Evaluate Image Captioning Jun 17, 2018 8k Data Augmentation
Code Code Available 05 Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning Oct 4, 2022 Image Captioning Sentence
Code Code Available 05 Learning Visually-Grounded Semantics from Contrastive Adversarial Samples Jun 27, 2018 Adversarial Attack Image Captioning
Code Code Available 05 Adapting Contrastive Language-Image Pretrained (CLIP) Models for Out-of-Distribution Detection Mar 10, 2023 Anomaly Detection Image Captioning
Code Code Available 05 Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning Apr 1, 2024 Image Captioning Instruction Following
Code Code Available 05 Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images Apr 25, 2015 Image Captioning Novel Concepts
Code Code Available 05 LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation Sep 4, 2021 Caption Generation Image Captioning
Code Code Available 05 Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes Jan 23, 2025 Emotion Classification Image Captioning
Code Code Available 05 Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning Nov 17, 2024 Image Captioning Language Modeling
Code Code Available 05 A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks Oct 10, 2024 Fairness Image Captioning
Code Code Available 05 Context-Aware Visual Policy Network for Fine-Grained Image Captioning Jun 6, 2019 Image Captioning Image Paragraph Captioning
Code Code Available 05 Context-Aware Visual Policy Network for Sequence-Level Image Captioning Aug 16, 2018 Deep Reinforcement Learning Image Captioning
Code Code Available 05 Language-Driven Region Pointer Advancement for Controllable Image Captioning Nov 30, 2020 controllable image captioning Image Captioning
Code Code Available 05 Language Models as Knowledge Bases for Visual Word Sense Disambiguation Oct 3, 2023 Image Captioning Multiple-choice
Code Code Available 05 Learning a Deep Embedding Model for Zero-Shot Learning Nov 15, 2016 Image Captioning Sentence
Code Code Available 05 Multimodal Learning for Hateful Memes Detection Nov 25, 2020 Image Captioning Multimodal Deep Learning
Code Code Available 05 ReFormer: The Relational Transformer for Image Captioning Jul 29, 2021 Graph Generation Image Captioning
Code Code Available 05 Context-aware Captions from Context-agnostic Supervision Jan 11, 2017 Image Captioning Language Modeling
Code Code Available 05 ContCap: A scalable framework for continual image captioning Sep 19, 2019 Continual Learning Image Captioning
Code Code Available 05 JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts Dec 18, 2024 Action Detection Descriptive
Code Code Available 05 KALE: An Artwork Image Captioning System Augmented with Heterogeneous Graph Sep 17, 2024 cross-modal alignment Image Captioning
Code Code Available 05 Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning Dec 6, 2016 Decoder Image Captioning
Code Code Available 05 JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models Nov 7, 2023 Image Captioning
Code Code Available 05 Journalistic Guidelines Aware News Image Captioning Sep 7, 2021 Caption Generation Descriptive
Code Code Available 05 Connecting Vision and Language with Localized Narratives Dec 6, 2019 Form Image Captioning
Code Code Available 05 A Benchmark for Multi-Lingual Vision-Language Learning in Remote Sensing Image Captioning Mar 6, 2025 Descriptive Image Captioning
Code Code Available 05 JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images Sep 19, 2024 Hallucination Image Captioning
Code Code Available 05 InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation Nov 30, 2023 Image Captioning Referring Expression
Code Code Available 05 iParaphrasing: Extracting Visually Grounded Paraphrases via an Image Jun 12, 2018 Image Captioning Question Answering
Code Code Available 05 Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning Jul 1, 2018 Image Captioning
Code Code Available 05 AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search Mar 26, 2019 GPU Image Captioning
Code Code Available 05 iPIC-XAI: Improving PIC-XAI for Enhanced Image Captioning Explanation Sep 23, 2023 Image Captioning TAG
Code Code Available 05 Attention on Attention for Image Captioning Aug 19, 2019 Decoder Image Captioning
Code Code Available 05 Adaptive Testing of Computer Vision Models Dec 6, 2022 Image Captioning object-detection
Code Code Available 05 Kvasir-VQA: A Text-Image Pair GI Tract Dataset Sep 2, 2024 Image Captioning Image Generation
Code Code Available 05 Improving Image Captioning with Conditional Generative Adversarial Nets May 18, 2018 Decoder Image Captioning
Code Code Available 05 Composition and Deformance: Measuring Imageability with a Text-to-Image Model Jun 5, 2023 Image Captioning Image Generation
Code Code Available 05 Improving Reinforcement Learning Based Image Captioning with Natural Language Prior Sep 13, 2018 Image Captioning reinforcement-learning
Code Code Available 05 Compositional Image-Text Matching and Retrieval by Grounding Entities May 4, 2025 Image Captioning Image-text matching
Code Code Available 05 Compositional Generalization in Image Captioning Sep 10, 2019 Caption Generation Image Captioning
Code Code Available 05 Adaptively Clustering Neighbor Elements for Image-Text Generation Jan 5, 2023 Clustering Decoder
Code Code Available 05 Improved Image Captioning via Policy Gradient optimization of SPIDEr Dec 1, 2016 Image Captioning
Code Code Available 05 Attention-Based Models for Text-Dependent Speaker Verification Oct 28, 2017 Image Captioning Machine Translation
Code Code Available 05 Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset Mar 1, 2024 Image Captioning Image Generation
Code Code Available 05 Attend to You: Personalized Image Captioning with Context Sequence Memory Networks Apr 21, 2017 Descriptive Image Captioning
Code Code Available 05 IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images May 12, 2023 Hyperparameter Optimization Image Captioning
Code Code Available 05 COMIC: Towards A Compact Image Captioning Model with Attention Mar 4, 2019 Decoder Image Captioning
Code Code Available 05 Implicit Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis Sep 21, 2023 Cross-Modal Retrieval Image Captioning
Code Code Available 05 Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing May 6, 2019 Descriptive Image Captioning
Code Code Available 05