A Semi-supervised Framework for Image Captioning Nov 16, 2016 Decoder Image Captioning
Code Code Available 05 Language Models as Knowledge Bases for Visual Word Sense Disambiguation Oct 3, 2023 Image Captioning Multiple-choice
Code Code Available 05 Kvasir-VQA: A Text-Image Pair GI Tract Dataset Sep 2, 2024 Image Captioning Image Generation
Code Code Available 05 Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning Dec 6, 2016 Decoder Image Captioning
Code Code Available 05 Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning Sep 16, 2021 Decoder Image Captioning
Code Code Available 05 JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts Dec 18, 2024 Action Detection Descriptive
Code Code Available 05 Journalistic Guidelines Aware News Image Captioning Sep 7, 2021 Caption Generation Descriptive
Code Code Available 05 JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models Nov 7, 2023 Image Captioning
Code Code Available 05 JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images Sep 19, 2024 Hallucination Image Captioning
Code Code Available 05 KALE: An Artwork Image Captioning System Augmented with Heterogeneous Graph Sep 17, 2024 cross-modal alignment Image Captioning
Code Code Available 05 iParaphrasing: Extracting Visually Grounded Paraphrases via an Image Jun 12, 2018 Image Captioning Question Answering
Code Code Available 05 iPIC-XAI: Improving PIC-XAI for Enhanced Image Captioning Explanation Sep 23, 2023 Image Captioning TAG
Code Code Available 05 GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition Jun 9, 2025 Image Captioning
Code Code Available 05 InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation Nov 30, 2023 Image Captioning Referring Expression
Code Code Available 05 Leveraging Human Attention in Novel Object Captioning Aug 19, 2021 Image Captioning Object
Code Code Available 05 Caption Enriched Samples for Improving Hateful Memes Detection Sep 22, 2021 Image Captioning
Code Code Available 05 Evaluating and interpreting caption prediction for histopathology images Jul 8, 2020 Caption Generation Image Captioning
Code Code Available 05 Cascaded Revision Network for Novel Object Captioning Aug 6, 2019 Image Captioning Object
Code Code Available 05 Improving Reinforcement Learning Based Image Captioning with Natural Language Prior Sep 13, 2018 Image Captioning reinforcement-learning
Code Code Available 05 Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models May 16, 2025 Image Captioning Question Answering
Code Code Available 05 Improving Image Captioning with Conditional Generative Adversarial Nets May 18, 2018 Decoder Image Captioning
Code Code Available 05 Improved Image Captioning via Policy Gradient optimization of SPIDEr Dec 1, 2016 Image Captioning
Code Code Available 05 Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset Mar 1, 2024 Image Captioning Image Generation
Code Code Available 05 Enhancing Descriptive Image Captioning with Natural Language Inference Aug 1, 2021 Descriptive Image Captioning
Code Code Available 05 AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization Feb 19, 2024 Adversarial Attack Image Captioning
Code Code Available 05 Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes Jan 23, 2025 Emotion Classification Image Captioning
Code Code Available 05 LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation Sep 4, 2021 Caption Generation Image Captioning
Code Code Available 05 Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning Nov 17, 2024 Image Captioning Language Modeling
Code Code Available 05 Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model Feb 14, 2021 Decoder Image Captioning
Code Code Available 05 Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding Apr 20, 2025 Autonomous Driving Image Captioning
Code Code Available 05 Image Captioning with Deep Bidirectional LSTMs Apr 4, 2016 Caption Generation Data Augmentation
Code Code Available 05 IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images May 12, 2023 Hyperparameter Optimization Image Captioning
Code Code Available 05 CAPEEN: Image Captioning with Early Exits and Knowledge Distillation Oct 6, 2024 Descriptive Image Captioning
Code Code Available 05 Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering Nov 17, 2015 Image Captioning Question Answering
Code Code Available 05 End-to-End Instance Segmentation with Recurrent Attention May 30, 2016 Autonomous Driving Image Captioning
Code Code Available 05 Exploring the sequence length bottleneck in the Transformer for Image Captioning Jul 7, 2022 Image Captioning
Code Code Available 05 A Critical Review of Recurrent Neural Networks for Sequence Learning May 29, 2015 Handwriting Recognition Image Captioning
Code Code Available 05 Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing May 6, 2019 Descriptive Image Captioning
Code Code Available 05 Implicit Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis Sep 21, 2023 Cross-Modal Retrieval Image Captioning
Code Code Available 05 End-to-end Image Captioning Exploits Distributional Similarity in Multimodal Space Nov 1, 2018 Image Captioning Text Generation
Code Code Available 05 End-to-End Attention-based Image Captioning Apr 30, 2021 Image Captioning Translation
Code Code Available 05 Image Captioning: Transforming Objects into Words Jun 14, 2019 Decoder Image Captioning
Code Code Available 05 A Hybrid Model for Combining Neural Image Caption and k-Nearest Neighbor Approach for Image Captioning May 9, 2021 Image Captioning regression
Code Code Available 05 Image Captioning using Deep Neural Architectures Jan 17, 2018 Image Captioning Machine Translation
Code Code Available 05 A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation Dec 20, 2024 Image Captioning
Code Code Available 05 Image Captioning as an Assistive Technology: Lessons Learned from VizWiz 2020 Challenge Dec 21, 2020 Image Captioning Navigate
Code Code Available 05 Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering Sep 15, 2021 Image Captioning Knowledge Graphs
Code Code Available 05 Image Captioning via Dynamic Path Customization Jun 1, 2024 Diversity Image Captioning
Code Code Available 05 Image2tweet: Datasets in Hindi and English for Generating Tweets from Images Dec 1, 2021 Image Captioning World Knowledge
Code Code Available 05 Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks Aug 22, 2022 All Cross-Modal Retrieval
Code Code Available 05