Caption Enriched Samples for Improving Hateful Memes Detection Sep 22, 2021 Image Captioning
Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation Dec 31, 2019 Image Captioning Program Synthesis
Neural Twins Talk Sep 26, 2020 Image Captioning Sentence
Neural Twins Talk & Alternative Calculations Aug 5, 2021 Descriptive Image Captioning
Show, Translate and Tell Mar 14, 2019 Cross-Modal Retrieval Image Captioning
Guided Open Vocabulary Image Captioning with Constrained Beam Search Dec 2, 2016 Image Captioning TAG
Decoupled Novel Object Captioner Apr 11, 2018 Image Captioning Novel Concepts
Decoding fMRI Data into Captions using Prefix Language Modeling Jan 5, 2025 Brain Decoding Image Captioning
SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization Dec 21, 2024 Image Captioning Multimodal Reasoning
Translating speech with just images Jun 11, 2024 Image Captioning Translation
simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions Aug 27, 2018 Decoder Image Captioning
CAPEEN: Image Captioning with Early Exits and Knowledge Distillation Oct 6, 2024 Descriptive Image Captioning
What is image captioning made of? Jan 1, 2018 Image Captioning Text Generation
nocaps: novel object captioning at scale Dec 20, 2018 Image Captioning Object
The Role of Data Curation in Image Captioning May 5, 2023 Few-Shot Learning Image Captioning
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement Aug 18, 2022 Grounded Situation Recognition Image Captioning
No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling Apr 24, 2018 Image Captioning Reinforcement Learning
Counterfactual Maximum Likelihood Estimation for Training Deep Networks Jun 7, 2021 counterfactual Domain Generalization
What Is Missing in Multilingual Visual Reasoning and How to Fix It Mar 3, 2024 Image Captioning Visual Reasoning
Treble Counterfactual VLMs: A Causal Approach to Hallucination Mar 8, 2025 Autonomous Driving counterfactual
Group Relative Policy Optimization for Image Captioning Mar 3, 2025 Diversity Image Captioning
A Critical Review of Recurrent Neural Networks for Sequence Learning May 29, 2015 Handwriting Recognition Image Captioning
Top-Down Framework for Weakly-supervised Grounded Image Captioning Jun 13, 2023 Image Captioning Multi-Label Classification
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering Nov 17, 2015 Image Captioning Question Answering
Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain) May 26, 2025 Image Captioning
Object-Centric Unsupervised Image Captioning Dec 2, 2021 Image Captioning Object
TROPE: TRaining-Free Object-Part Enhancement for Seamlessly Improving Fine-Grained Zero-Shot Image Captioning Sep 30, 2024 Image Captioning Object
Object Hallucination in Image Captioning Sep 6, 2018 Hallucination Image Captioning
SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis Jun 2, 2021 Image Captioning
TS-RGBD Dataset: a Novel Dataset for Theatre Scenes Description for People with Visual Impairments Aug 2, 2023 Action Recognition Image Captioning
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language Apr 1, 2022 Diversity Image Captioning
Grand Challenge On Detecting Cheapfakes Apr 3, 2023 Image Captioning
OmniNet: A unified architecture for multi-modal multi-task learning Jul 17, 2019 Image Captioning Multi-Task Learning
Grad-CAM: Why did you say that? Nov 22, 2016 Image Captioning Visual Question Answering
TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models Apr 18, 2023 Data Augmentation Diversity
Sparse and Structured Visual Attention Feb 13, 2020 Image Captioning Question Answering
What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator? Aug 7, 2017 Image Captioning
A Semi-supervised Framework for Image Captioning Nov 16, 2016 Decoder Image Captioning
Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding Apr 20, 2025 Autonomous Driving Image Captioning
Vision Matters When It Should: Sanity Checking Multimodal Machine Translation Models Sep 8, 2021 Image Captioning Machine Translation
Can adversarial training learn image captioning ? Oct 31, 2019 Image Captioning Text Generation
On Measuring Gender Bias in Translation of Gender-neutral Pronouns May 28, 2019 Ethics Image Captioning
UdL at SemEval-2017 Task 1: Semantic Textual Similarity Estimation of English Sentence Pairs Using Regression Model over Pairwise Features Aug 1, 2017 Ensemble Learning Image Captioning
GPTs Are Multilingual Annotators for Sequence Generation Tasks Feb 8, 2024 Image Captioning
Core Tokensets for Data-efficient Sequential Training of Transformers Oct 8, 2024 Image Captioning image-classification
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights Jul 16, 2024 Image Captioning Multimodal Reasoning
On the Interpretability of Attention Networks Dec 30, 2022 Image Captioning
Adapting Contrastive Language-Image Pretrained (CLIP) Models for Out-of-Distribution Detection Mar 10, 2023 Anomaly Detection Image Captioning
A Hierarchical Approach for Generating Descriptive Image Paragraphs Nov 20, 2016 Dense Captioning Descriptive
Stack-Captioning: Coarse-to-Fine Learning for Image Captioning Sep 11, 2017 Decoder Image Captioning