On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization May 24, 2022 Descriptive Image Captioning
— Unverified 0mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections May 24, 2022 Computational Efficiency cross-modal alignment
Code Code Available 1Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search May 19, 2022 Decision Making Image Captioning
Code Code Available 1It Isn't Sh!tposting, It's My CAT Posting May 18, 2022 Image Captioning
— Unverified 0Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning May 9, 2022 Image Captioning Object
Code Code Available 1Understanding Transfer Learning for Chest Radiograph Clinical Report Generation with Modified Transformer Architectures May 5, 2022 Image Captioning Transfer Learning
— Unverified 0Language Models Can See: Plugging Visual Controls in Text Generation May 5, 2022 Image Captioning Image-text matching
Code Code Available 2CoCa: Contrastive Captioners are Image-Text Foundation Models May 4, 2022 Action Classification Decoder
Code Code Available 1All You May Need for VQA are Image Captions May 4, 2022 All Image Captioning
Code Code Available 3Diverse Image Captioning with Grounded Style May 3, 2022 Attribute Diversity
Code Code Available 0Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering May 2, 2022 Decoder Image Captioning
— Unverified 0Combine to Describe: Evaluating Compositional Generalization in Image Captioning May 1, 2022 Image Captioning
— Unverified 0Molecular Identification from AFM images using the IUPAC Nomenclature and Attribute Multimodal Recurrent Neural Networks May 1, 2022 Attribute Image Captioning
— Unverified 0Controllable Image Captioning Apr 28, 2022 controllable image captioning Decoder
— Unverified 0CapOnImage: Context-driven Dense-Captioning on Image Apr 27, 2022 Dense Captioning Diversity
— Unverified 0Cross-view Brain Decoding Apr 18, 2022 Brain Decoding Image Captioning
— Unverified 0It is Okay to Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection Apr 15, 2022 Image Captioning
Code Code Available 1Guiding Attention using Partial-Order Relationships for Image Captioning Apr 15, 2022 Caption Generation Image Captioning
— Unverified 0Image Captioning In the Transformer Age Apr 15, 2022 Decoder Image Captioning
Code Code Available 1Robust Cross-Modal Representation Learning with Progressive Self-Distillation Apr 10, 2022 Contrastive Learning Image Captioning
— Unverified 0Semantic Exploration from Language Abstractions and Pretrained Representations Apr 8, 2022 Image Captioning Reinforcement Learning (RL)
— Unverified 0On Distinctive Image Captioning via Comparing and Reweighting Apr 8, 2022 Image Captioning Retrieval
— Unverified 0Multimodal Quasi-AutoRegression: Forecasting the visual popularity of new fashion products Apr 8, 2022 Image Captioning image-classification
— Unverified 0Learning Audio-Video Modalities from Image Captions Apr 1, 2022 Image Captioning Retrieval
— Unverified 0Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language Apr 1, 2022 Diversity Image Captioning
Code Code Available 0Quantifying Societal Bias Amplification in Image Captioning Mar 29, 2022 Attribute Image Captioning
Code Code Available 1End-to-End Transformer Based Model for Image Captioning Mar 29, 2022 Decoder Image Captioning
Code Code Available 1Linking Emergent and Natural Languages via Corpus Transfer Mar 24, 2022 Attribute Disentanglement
Code Code Available 1WuDaoMM: A large-scale Multi-Modal Dataset for Pre-training models Mar 22, 2022 Image Captioning Image Generation
— Unverified 0AlignTransformer: Hierarchical Alignment of Visual Regions and Disease Tags for Medical Report Generation Mar 18, 2022 Descriptive Image Captioning
— Unverified 0DU-VLG: Unifying Vision-and-Language Generation via Dual Sequence-to-Sequence Pre-training Mar 17, 2022 Denoising Image Captioning
— Unverified 0On Vision Features in Multimodal Machine Translation Mar 17, 2022 Image Captioning Machine Translation
Code Code Available 1Leveraging Visual Knowledge in Language Tasks: An Empirical Study on Intermediate Pre-training for Cross-modal Knowledge Transfer Mar 14, 2022 Image Captioning Language Modeling
— Unverified 0Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language Representations Mar 14, 2022 Image Captioning Semantic Textual Similarity
— Unverified 0Chart-to-Text: A Large-Scale Benchmark for Chart Summarization Mar 12, 2022 Data-to-Text Generation Image Captioning
Code Code Available 1Taking an Emotional Look at Video Paragraph Captioning Mar 12, 2022 Image Captioning
— Unverified 0Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation Mar 12, 2022 Image Captioning Knowledge Distillation
— Unverified 0Geodesic Multi-Modal Mixup for Robust Fine-Tuning Mar 8, 2022 Image Captioning zero-shot-classification
Code Code Available 0Semantic Distillation Guided Salient Object Detection Mar 8, 2022 Image Captioning Object
— Unverified 0Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept Recognition Mar 7, 2022 Graph Neural Network Image Captioning
— Unverified 0FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context Mar 4, 2022 Decoder Image Captioning
Code Code Available 1A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism Mar 3, 2022 Caption Generation Decoder
— Unverified 0Interactive Machine Learning for Image Captioning Feb 28, 2022 BIG-bench Machine Learning Data Augmentation
— Unverified 0CaMEL: Mean Teacher Learning for Image Captioning Feb 21, 2022 Image Captioning Knowledge Distillation
Code Code Available 1I-Tuning: Tuning Frozen Language Models with Image for Lightweight Image Captioning Feb 14, 2022 Decoder Image Captioning
— Unverified 0ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning Feb 11, 2022 Image Captioning Relation
Code Code Available 1Bench-Marking And Improving Arabic Automatic Image Captioning Through The Use Of Multi-Task Learning Paradigm Feb 11, 2022 Image Captioning Multi-Task Learning
— Unverified 0Describing image focused in cognitive and visual details for visually impaired people: An approach to generating inclusive paragraphs Feb 10, 2022 Dense Captioning Image Captioning
— Unverified 0DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models Feb 8, 2022 Diagnostic Image Captioning
Code Code Available 3OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework Feb 7, 2022 Image Captioning image-classification
Code Code Available 0