Language Models are General-Purpose Interfaces Jun 13, 2022 Causal Language Modeling Few-Shot Learning
— Unverified 0Intra-agent speech permits zero-shot task acquisition Jun 7, 2022 Image Captioning
— Unverified 0Improving Image Captioning with Control Signal of Sentence Quality Jun 7, 2022 Image Captioning Sentence
— Unverified 0Examining the Effects of Language-and-Vision Data Augmentation for Generation of Descriptions of Human Faces Jun 1, 2022 Caption Generation Data Augmentation
— Unverified 0Visual Transformer for Object Detection Jun 1, 2022 Image Captioning Machine Translation
— Unverified 0BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset May 28, 2022 Image Captioning Machine Translation
Code Code Available 0Variational Transformer: A Framework Beyond the Trade-off between Accuracy and Diversity for Image Captioning May 28, 2022 Diversity Image Captioning
Code Code Available 0Prompt-based Learning for Unpaired Image Captioning May 26, 2022 Image Captioning Image-text Retrieval
— Unverified 0Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset May 25, 2022 Image Captioning Image Retrieval
— Unverified 0Reassessing Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization May 24, 2022 Image Captioning Out-of-Distribution Generalization
— Unverified 0On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization May 24, 2022 Descriptive Image Captioning
— Unverified 0It Isn't Sh!tposting, It's My CAT Posting May 18, 2022 Image Captioning
— Unverified 0Understanding Transfer Learning for Chest Radiograph Clinical Report Generation with Modified Transformer Architectures May 5, 2022 Image Captioning Transfer Learning
— Unverified 0Diverse Image Captioning with Grounded Style May 3, 2022 Attribute Diversity
Code Code Available 0Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering May 2, 2022 Decoder Image Captioning
— Unverified 0Molecular Identification from AFM images using the IUPAC Nomenclature and Attribute Multimodal Recurrent Neural Networks May 1, 2022 Attribute Image Captioning
— Unverified 0Combine to Describe: Evaluating Compositional Generalization in Image Captioning May 1, 2022 Image Captioning
— Unverified 0Controllable Image Captioning Apr 28, 2022 controllable image captioning Decoder
— Unverified 0CapOnImage: Context-driven Dense-Captioning on Image Apr 27, 2022 Dense Captioning Diversity
— Unverified 0Cross-view Brain Decoding Apr 18, 2022 Brain Decoding Image Captioning
— Unverified 0Guiding Attention using Partial-Order Relationships for Image Captioning Apr 15, 2022 Caption Generation Image Captioning
— Unverified 0Robust Cross-Modal Representation Learning with Progressive Self-Distillation Apr 10, 2022 Contrastive Learning Image Captioning
— Unverified 0Semantic Exploration from Language Abstractions and Pretrained Representations Apr 8, 2022 Image Captioning Reinforcement Learning (RL)
— Unverified 0Multimodal Quasi-AutoRegression: Forecasting the visual popularity of new fashion products Apr 8, 2022 Image Captioning image-classification
— Unverified 0On Distinctive Image Captioning via Comparing and Reweighting Apr 8, 2022 Image Captioning Retrieval
— Unverified 0Learning Audio-Video Modalities from Image Captions Apr 1, 2022 Image Captioning Retrieval
— Unverified 0Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language Apr 1, 2022 Diversity Image Captioning
Code Code Available 0WuDaoMM: A large-scale Multi-Modal Dataset for Pre-training models Mar 22, 2022 Image Captioning Image Generation
— Unverified 0AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation Mar 18, 2022 Descriptive Image Captioning
— Unverified 0DU-VLG: Unifying Vision-and-Language Generation via Dual Sequence-to-Sequence Pre-training Mar 17, 2022 Denoising Image Captioning
— Unverified 0Leveraging Visual Knowledge in Language Tasks: An Empirical Study on Intermediate Pre-training for Cross-modal Knowledge Transfer Mar 14, 2022 Image Captioning Language Modeling
— Unverified 0Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language Representations Mar 14, 2022 Image Captioning Semantic Textual Similarity
— Unverified 0Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation Mar 12, 2022 Image Captioning Knowledge Distillation
— Unverified 0Taking an Emotional Look at Video Paragraph Captioning Mar 12, 2022 Image Captioning
— Unverified 0Geodesic Multi-Modal Mixup for Robust Fine-Tuning Mar 8, 2022 Image Captioning zero-shot-classification
Code Code Available 0Semantic Distillation Guided Salient Object Detection Mar 8, 2022 Image Captioning Object
— Unverified 0Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept Recognition Mar 7, 2022 Graph Neural Network Image Captioning
— Unverified 0A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism Mar 3, 2022 Caption Generation Decoder
— Unverified 0Interactive Machine Learning for Image Captioning Feb 28, 2022 BIG-bench Machine Learning Data Augmentation
— Unverified 0I-Tuning: Tuning Frozen Language Models with Image for Lightweight Image Captioning Feb 14, 2022 Decoder Image Captioning
— Unverified 0Bench-Marking And Improving Arabic Automatic Image Captioning Through The Use Of Multi-Task Learning Paradigm Feb 11, 2022 Image Captioning Multi-Task Learning
— Unverified 0Describing image focused in cognitive and visual details for visually impaired people: An approach to generating inclusive paragraphs Feb 10, 2022 Dense Captioning Image Captioning
— Unverified 0OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework Feb 7, 2022 Image Captioning image-classification
Code Code Available 0Deep Learning Approaches on Image Captioning: A Review Jan 31, 2022 Caption Generation Deep Learning
— Unverified 0A Frustratingly Simple Approach for End-to-End Image Captioning Jan 30, 2022 Decoder Image Captioning
— Unverified 0An Integrated Approach for Video Captioning and Applications Jan 23, 2022 Image Captioning Video Captioning
— Unverified 0Visual Information Guided Zero-Shot Paraphrase Generation Jan 22, 2022 Diversity Image Captioning
Code Code Available 0Discovering Non-Monotonic Autoregressive Ordering for Text Generation Models using Sinkhorn Distributions Jan 17, 2022 Code Generation Decoder
— Unverified 0All You May Need for VQA are Image Captions Jan 16, 2022 All Image Captioning
— Unverified 0Long-Tail Classification for Distinctive Image Captioning: A Simple yet Effective Remedy for Side Effects of Reinforcement Learning Jan 16, 2022 Image Captioning Reinforcement Learning (RL)
— Unverified 0