Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training May 24, 2021 Image Captioning Medical Visual Question Answering
Code Code Available 1Visual representation of negation: Real world data analysis on comic image design May 21, 2021 Image Captioning image-classification
— Unverified 0More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching May 20, 2021 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Dependent Multi-Task Learning with Causal Intervention for Image Captioning May 18, 2021 Image Captioning Multi-agent Reinforcement Learning
— Unverified 0Multi-Modal Image Captioning for the Visually Impaired May 17, 2021 Image Captioning
— Unverified 0Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval May 16, 2021 Graph Generation Image Captioning
— Unverified 0Empirical Analysis of Image Caption Generation using Deep Learning May 14, 2021 Caption Generation Decoder
— Unverified 0Connecting What to Say With Where to Look by Modeling Human Attention Traces May 12, 2021 Caption Generation Image Captioning
Code Code Available 1Instance-aware Remote Sensing Image Captioning with Cross-hierarchy Attention May 11, 2021 Decoder Diversity
— Unverified 0A Hybrid Model for Combining Neural Image Caption and k-Nearest Neighbor Approach for Image Captioning May 9, 2021 Image Captioning regression
Code Code Available 0Passage Retrieval for Outside-Knowledge Visual Question Answering May 9, 2021 Image Captioning Object
Code Code Available 1Exploring Explicit and Implicit Visual Relationships for Image Captioning May 6, 2021 Decoder Image Captioning
— Unverified 0End-to-End Attention-based Image Captioning Apr 30, 2021 Image Captioning Translation
Code Code Available 0Removing Word-Level Spurious Alignment between Images and Pseudo-Captions in Unsupervised Image Captioning Apr 28, 2021 Image Captioning image-sentence alignment
Code Code Available 0Contextualized Keyword Representations for Multi-modal Retinal Image Captioning Apr 26, 2021 Avg Image Captioning
— Unverified 0RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition Apr 24, 2021 Image Captioning Object Recognition
Code Code Available 1Towards Accurate Text-based Image Captioning with Content Diversity Exploration Apr 23, 2021 Caption Generation Diversity
Code Code Available 1CLIPScore: A Reference-free Evaluation Metric for Image Captioning Apr 18, 2021 Hallucination Pair-wise Detection (1-ref) Hallucination Pair-wise Detection (4-ref)
Code Code Available 1Concadia: Towards Image-Based Text Generation with a Purpose Apr 16, 2021 Image Captioning Image to text
Code Code Available 1"Wikily" Supervised Neural Translation Tailored to Cross-Lingual Tasks Apr 16, 2021 Cross-Lingual Transfer Cross-Lingual Word Embeddings
Code Code Available 0HindSight: A Graph-Based Vision Model Architecture For Representing Part-Whole Hierarchies Apr 8, 2021 Image Captioning image-classification
— Unverified 0Compressing Visual-linguistic Model via Knowledge Distillation Apr 5, 2021 Image Captioning Knowledge Distillation
— Unverified 0Exploiting Image–Text Synergy for Contextual Image Captioning Apr 1, 2021 Articles Image Captioning
— Unverified 0Making Use of Latent Space in Language GANs for Generating Diverse Text without Pre-training Apr 1, 2021 Diversity Image Captioning
— Unverified 0On Hallucination and Predictive Uncertainty in Conditional Language Generation Mar 28, 2021 Data-to-Text Generation Hallucination
— Unverified 0Human-like Controllable Image Captioning with Verb-specific Semantic Roles Mar 22, 2021 Caption Generation controllable image captioning
Code Code Available 1#PraCegoVer: A Large Dataset for Image Captioning in Portuguese Mar 21, 2021 Image Captioning Sentence
Code Code Available 03M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model Mar 20, 2021 Caption Generation Image Captioning
— Unverified 0A Comprehensive Survey of Scene Graphs: Generation and Application Mar 17, 2021 Image Captioning Question Answering
— Unverified 0Knowledge driven Description Synthesis for Floor Plan Interpretation Mar 15, 2021 Caption Generation Descriptive
— Unverified 0WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training Mar 11, 2021 Contrastive Learning GPU
Code Code Available 1Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles Mar 8, 2021 Articles Diagnostic
Code Code Available 1Analysis of Convolutional Decoder for Image Caption Generation Mar 8, 2021 Caption Generation Data Augmentation
— Unverified 0Visual Question Answering: which investigated applications? Mar 4, 2021 Image Captioning Question Answering
Code Code Available 0DeepFN: Towards Generalizable Facial Action Unit Recognition with Deep Face Normalization Mar 3, 2021 Action Recognition Denoising
— Unverified 0Retrieval Augmentation for Deep Neural Networks Feb 25, 2021 Image Captioning Retrieval
Code Code Available 0Enhanced Modality Transition for Image Captioning Feb 23, 2021 Decoder Image Captioning
— Unverified 0Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation Feb 22, 2021 Data Augmentation Decoder
— Unverified 0VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning Feb 20, 2021 Decoder Image Captioning
Code Code Available 1Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts Feb 17, 2021 Caption Generation Diversity
Code Code Available 1Image Captioning using Multiple Transformers for Self-Attention Mechanism Feb 14, 2021 Image Captioning
— Unverified 0Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model Feb 14, 2021 Decoder Image Captioning
Code Code Available 0In Defense of Scene Graphs for Image Captioning Feb 9, 2021 Human-Object Interaction Detection Image Captioning
Code Code Available 1Iconographic Image Captioning for Artworks Feb 7, 2021 Image Captioning
Code Code Available 0Unifying Vision-and-Language Tasks via Text Generation Feb 4, 2021 Conditional Text Generation Decoder
Code Code Available 1L2C: Describing Visual Differences Needs Semantic Understanding of Individuals Feb 3, 2021 Image Captioning
— Unverified 0The Role of Syntactic Planning in Compositional Image Captioning Jan 28, 2021 Image Captioning
Code Code Available 0CPTR: Full Transformer Network for Image Captioning Jan 26, 2021 Decoder Image Captioning
— Unverified 0ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning Jan 25, 2021 Image Captioning Object
— Unverified 0Fast Sequence Generation with Multi-Agent Reinforcement Learning Jan 24, 2021 Image Captioning Machine Translation
— Unverified 0