Neural Fashion Image Captioning : Accounting for Data Diversity Jun 23, 2021 Decoder Diversity
Code Code Available 1RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words Jun 19, 2021 Decoder Image Captioning
Code Code Available 1Semi-Autoregressive Transformer for Image Captioning Jun 17, 2021 Image Captioning
Code Code Available 1Understanding and Evaluating Racial Biases in Image Captioning Jun 16, 2021 Benchmarking Image Captioning
Code Code Available 1BERTGEN: Multi-task Generation through BERT Jun 7, 2021 Decoder Image Captioning
Code Code Available 1Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training May 24, 2021 Image Captioning Medical Visual Question Answering
Code Code Available 1Connecting What to Say With Where to Look by Modeling Human Attention Traces May 12, 2021 Caption Generation Image Captioning
Code Code Available 1Passage Retrieval for Outside-Knowledge Visual Question Answering May 9, 2021 Image Captioning Object
Code Code Available 1RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition Apr 24, 2021 Image Captioning Object Recognition
Code Code Available 1Towards Accurate Text-based Image Captioning with Content Diversity Exploration Apr 23, 2021 Caption Generation Diversity
Code Code Available 1CLIPScore: A Reference-free Evaluation Metric for Image Captioning Apr 18, 2021 Hallucination Pair-wise Detection (1-ref) Hallucination Pair-wise Detection (4-ref)
Code Code Available 1Concadia: Towards Image-Based Text Generation with a Purpose Apr 16, 2021 Image Captioning Image to text
Code Code Available 1Human-like Controllable Image Captioning with Verb-specific Semantic Roles Mar 22, 2021 Caption Generation controllable image captioning
Code Code Available 1WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training Mar 11, 2021 Contrastive Learning GPU
Code Code Available 1Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles Mar 8, 2021 Articles Diagnostic
Code Code Available 1VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning Feb 20, 2021 Decoder Image Captioning
Code Code Available 1Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts Feb 17, 2021 Caption Generation Diversity
Code Code Available 1In Defense of Scene Graphs for Image Captioning Feb 9, 2021 Human-Object Interaction Detection Image Captioning
Code Code Available 1Unifying Vision-and-Language Tasks via Text Generation Feb 4, 2021 Conditional Text Generation Decoder
Code Code Available 1Dual-Level Collaborative Transformer for Image Captioning Jan 16, 2021 Descriptive Image Captioning
Code Code Available 1Self-Distillation for Few-Shot Image Captioning Jan 6, 2021 Image Captioning
Code Code Available 1Discovering Autoregressive Orderings with Variational Inference Jan 1, 2021 Code Generation Image Captioning
Code Code Available 1Text-Free Image-to-Speech Synthesis Using Learned Segmental Units Dec 31, 2020 Image Captioning Speech Synthesis
Code Code Available 1Detecting Hate Speech in Multi-modal Memes Dec 29, 2020 Binary Classification Hate Speech Detection
Code Code Available 1Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network Dec 13, 2020 Caption Generation Decoder
Code Code Available 1Confidence-aware Non-repetitive Multimodal Transformers for TextCaps Dec 7, 2020 Image Captioning Optical Character Recognition
Code Code Available 1Diverse Image Captioning with Context-Object Split Latent Spaces Nov 2, 2020 Diversity Image Captioning
Code Code Available 1ViLBERTScore: Evaluating Image Caption Using Vision-and-Language BERT Nov 1, 2020 Image Captioning Sentence
Code Code Available 1Can images help recognize entities? A study of the role of images for Multimodal NER Oct 23, 2020 Image Captioning named-entity-recognition
Code Code Available 1WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information Oct 21, 2020 Audio captioning Decoder
Code Code Available 1Bayesian Attention Modules Oct 20, 2020 Image Captioning Machine Translation
Code Code Available 1Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision Oct 14, 2020 Image Captioning Language Modeling
Code Code Available 1Visual News: Benchmark and Challenges in News Image Captioning Oct 8, 2020 Articles Image Captioning
Code Code Available 1Dense Relational Image Captioning via Multi-task Triple-Stream Networks Oct 8, 2020 Graph Generation Image Captioning
Code Code Available 1Pix2Prof: fast extraction of sequential information from galaxy imagery via a deep natural language 'captioning' model Oct 1, 2020 CPU Image Captioning
Code Code Available 1Are scene graphs good enough to improve Image Captioning? Sep 25, 2020 Decoder Graph Attention
Code Code Available 1X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers Sep 23, 2020 Image Captioning Image Generation
Code Code Available 1Towards Unique and Informative Captioning of Images Sep 8, 2020 Diversity Image Captioning
Code Code Available 1Protect, Show, Attend and Tell: Empowering Image Captioning Models with Ownership Protection Aug 25, 2020 Image Captioning image-classification
Code Code Available 1Text as Neural Operator: Image Manipulation by Text Instruction Aug 11, 2020 Conditional Image Generation Image Captioning
Code Code Available 1Describe What to Change: A Text-guided Unsupervised Image-to-Image Translation Approach Aug 10, 2020 Attribute Image Captioning
Code Code Available 1Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards Aug 6, 2020 Attribute Image Captioning
Code Code Available 1Learning to Generate Grounded Visual Captions without Localization Supervision Aug 1, 2020 Image Captioning Language Modelling
Code Code Available 1Comprehensive Image Captioning via Scene Graph Decomposition Jul 23, 2020 Diversity Image Captioning
Code Code Available 1Length-Controllable Image Captioning Jul 19, 2020 controllable image captioning Decoder
Code Code Available 1Consensus-Aware Visual-Semantic Embedding for Image-Text Matching Jul 17, 2020 Image Captioning Image-text matching
Code Code Available 1RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning Jul 13, 2020 Continual Learning Image Captioning
Code Code Available 1EDSL: An Encoder-Decoder Architecture with Symbol-Level Features for Printed Mathematical Expression Recognition Jul 6, 2020 Decoder Image Captioning
Code Code Available 1Graph Optimal Transport for Cross-Domain Alignment Jun 26, 2020 Graph Matching Image Captioning
Code Code Available 1Improving Image Captioning with Better Use of Captions Jun 21, 2020 Caption Generation Image Captioning
Code Code Available 1