Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training Oct 17, 2022 Image Captioning Network Interpretation
Code Code Available 0Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space Nov 30, 2016 Image Captioning Image Inpainting
Code Code Available 0Unifying Text, Tables, and Images for Multimodal Question Answering Dec 10, 2023 Image Captioning Question Answering
Code Code Available 0Fraternal Dropout Oct 31, 2017 Image Captioning Language Modeling
Code Code Available 0Unrestricted Adversarial Examples via Semantic Manipulation Apr 12, 2019 Colorization Image Captioning
Code Code Available 0Fluency-Guided Cross-Lingual Image Captioning Aug 15, 2017 Image Captioning
Code Code Available 0FLoRA: Enhancing Vision-Language Models with Parameter-Efficient Federated Learning Apr 12, 2024 Federated Learning Image Captioning
Code Code Available 0Fine-Grained Image Captioning with Global-Local Discriminative Objective Jul 21, 2020 Descriptive Image Captioning
Code Code Available 0#PraCegoVer: A Large Dataset for Image Captioning in Portuguese Mar 21, 2021 Image Captioning Sentence
Code Code Available 0UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning Dec 31, 2020 Contrastive Learning Cross-Modal Retrieval
Code Code Available 0Pragmatic Issue-Sensitive Image Captioning Apr 29, 2020 Descriptive Image Captioning
Code Code Available 0Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model Nov 7, 2024 Image Captioning Image Generation
Code Code Available 0A Benchmark for Multi-Lingual Vision-Language Learning in Remote Sensing Image Captioning Mar 6, 2025 Descriptive Image Captioning
Code Code Available 0Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video Jun 5, 2015 Gesture Recognition Image Captioning
Code Code Available 0Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution Dec 20, 2024 Answer Generation Image Captioning
Code Code Available 0Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length Image Tagging Apr 18, 2016 Image Captioning Machine Translation
Code Code Available 0Technical Report of NICE Challenge at CVPR 2024: Caption Re-ranking Evaluation Using Ensembled CLIP and Consensus Scores May 2, 2024 Image Captioning Re-Ranking
Code Code Available 0Pretrained Image-Text Models are Secretly Video Captioners Feb 19, 2025 Image Captioning Video Captioning
Code Code Available 0Finding beans in burgers: Deep semantic-visual embedding with localization Apr 5, 2018 Cross-Modal Retrieval Image Captioning
Code Code Available 0Visually-Aware Context Modeling for News Image Captioning Aug 16, 2023 Articles Image Captioning
Code Code Available 0"Wikily" Supervised Neural Translation Tailored to Cross-Lingual Tasks Apr 16, 2021 Cross-Lingual Transfer Cross-Lingual Word Embeddings
Code Code Available 0Fast and Simple Mixture of Softmaxes with BPE and Hybrid-LightRNN for Language Generation Sep 25, 2018 Image Captioning Machine Translation
Code Code Available 0An Eye for an Ear: Zero-shot Audio Description Leveraging an Image Captioner using Audiovisual Distribution Alignment Oct 8, 2024 Audio captioning Contrastive Learning
Code Code Available 0Face-Cap: Image Captioning using Facial Expression Analysis Jul 6, 2018 Descriptive Image Captioning
Code Code Available 0Zero-shot Translation of Attention Patterns in VQA Models to Natural Language Nov 8, 2023 Image Captioning Language Modeling
Code Code Available 0Context-Aware Visual Policy Network for Sequence-Level Image Captioning Aug 16, 2018 Deep Reinforcement Learning Image Captioning
Code Code Available 0Expressing Visual Relationships via Language Jun 18, 2019 Decoder Image Captioning
Code Code Available 0Context-aware Captions from Context-agnostic Supervision Jan 11, 2017 Image Captioning Language Modeling
Code Code Available 0Visual Question Answering: which investigated applications? Mar 4, 2021 Image Captioning Question Answering
Code Code Available 0TexLiDAR: Automated Text Understanding for Panoramic LiDAR Data Feb 5, 2025 Image Captioning object-detection
Code Code Available 0ContCap: A scalable framework for continual image captioning Sep 19, 2019 Continual Learning Image Captioning
Code Code Available 0Protecting Intellectual Property of Language Generation APIs with Lexical Watermark Dec 5, 2021 Document Summarization Image Captioning
Code Code Available 0Exploring the Synergy Between Vision-Language Pretraining and ChatGPT for Artwork Captioning: A Preliminary Study Jan 21, 2023 Image Captioning Informativeness
Code Code Available 0PR Product: A Substitute for Inner Product in Neural Networks Apr 30, 2019 General Classification Image Captioning
Code Code Available 0Exploring Nearest Neighbor Approaches for Image Captioning May 17, 2015 Image Captioning
Code Code Available 0Aesthetic Attributes Assessment of Images Jul 11, 2019 Attribute Image Captioning
Code Code Available 0Visual Semantic Relatedness Dataset for Image Captioning Jan 20, 2023 Image Captioning text similarity
Code Code Available 0Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models Dec 8, 2024 Image Captioning
Code Code Available 0An Examination of the Robustness of Reference-Free Image Captioning Evaluation Metrics May 24, 2023 Image Captioning Negation
Code Code Available 0Quality Estimation for Image Captions Based on Large-scale Human Evaluations Sep 8, 2019 Image Captioning Model Selection
Code Code Available 0Exploring Annotation-free Image Captioning with Retrieval-augmented Pseudo Sentence Generation Jul 27, 2023 Image Captioning Model Optimization
Code Code Available 0Quantifying the amount of visual information used by neural caption generators Oct 12, 2018 Image Captioning Position
Code Code Available 0Unsupervised Image Captioning Nov 27, 2018 Image Captioning Image Description
Code Code Available 0Quantifying the visual concreteness of words and topics in multimodal datasets Apr 18, 2018 BIG-bench Machine Learning Image Captioning
Code Code Available 0Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection Dec 25, 2019 Image Captioning Language Modeling
Code Code Available 0Experimenting with Self-Supervision using Rotation Prediction for Image Captioning Jul 28, 2021 Decoder Image Captioning
Code Code Available 0Exploring the sequence length bottleneck in the Transformer for Image Captioning Jul 7, 2022 Image Captioning
Code Code Available 0Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images Feb 8, 2024 Image Captioning Question Answering
Code Code Available 0Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis Jan 16, 2025 Decoder Image Captioning
Code Code Available 0Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables May 10, 2019 Adversarial Attack Image Captioning
Code Code Available 0