Cross-Domain Image Captioning with Discriminative Finetuning Apr 4, 2023 Descriptive Image Captioning
— Unverified 0Grand Challenge On Detecting Cheapfakes Apr 3, 2023 Image Captioning
Code Code Available 0Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations Mar 29, 2023 Image Captioning Instance Segmentation
— Unverified 0AutoAD: Movie Description in Context Mar 29, 2023 Image Captioning Text Generation
Code Code Available 1Multimodal Image-Text Matching Improves Retrieval-based Chest X-Ray Report Generation Mar 29, 2023 Image Captioning Image-text matching
Code Code Available 1Variational Distribution Learning for Unsupervised Text-to-Image Generation Mar 28, 2023 Image Captioning Image Generation
— Unverified 0Open-Vocabulary Object Detection using Pseudo Caption Labels Mar 23, 2023 Image Captioning Knowledge Distillation
— Unverified 0MAGVLT: Masked Generative Vision-and-Language Transformer Mar 21, 2023 Image Captioning Image Generation
Code Code Available 1Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation Mar 21, 2023 Contrastive Learning Image Captioning
Code Code Available 1Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings Mar 20, 2023 Image Captioning Retrieval
— Unverified 0Multi-modal reward for visual relationships-based image captioning Mar 19, 2023 Caption Generation Deep Reinforcement Learning
— Unverified 0Visual Information Matters for ASR Error Correction Mar 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning Mar 15, 2023 Image Captioning
— Unverified 0Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images Mar 13, 2023 Common Sense Reasoning Explanation Generation
— Unverified 0ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions Mar 12, 2023 Image Captioning Question Answering
Code Code Available 2ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation Mar 11, 2023 Image Captioning Image to text
Code Code Available 1Learning Combinatorial Prompts for Universal Controllable Image Captioning Mar 11, 2023 controllable image captioning Image Captioning
— Unverified 0Adapting Contrastive Language-Image Pretrained (CLIP) Models for Out-of-Distribution Detection Mar 10, 2023 Anomaly Detection Image Captioning
Code Code Available 0Spawrious: A Benchmark for Fine Control of Spurious Correlation Biases Mar 9, 2023 Image Captioning image-classification
Code Code Available 1Interpretable Visual Question Answering Referring to Outside Knowledge Mar 8, 2023 Diversity Image Captioning
— Unverified 0Graph Neural Networks in Vision-Language Image Understanding: A Survey Mar 7, 2023 Image Captioning Image Retrieval
— Unverified 0DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training Mar 6, 2023 Decoder Image Captioning
Code Code Available 1Neighborhood Contrastive Transformer for Change Captioning Mar 6, 2023 Decoder Image Captioning
Code Code Available 1Comparative study of Transformer and LSTM Network with attention mechanism on Image Captioning Mar 5, 2023 Image Captioning
— Unverified 0Prismer: A Vision-Language Model with Multi-Task Experts Mar 4, 2023 Few-Shot Learning Image Captioning
Code Code Available 1ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing Mar 4, 2023 Diversity Image Captioning
Code Code Available 1FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks Mar 4, 2023 Cross-Modal Retrieval Image Captioning
Code Code Available 1ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax Mar 2, 2023 Descriptive Image Captioning
Code Code Available 1Language Is Not All You Need: Aligning Perception with Language Models Feb 27, 2023 All Image Captioning
— Unverified 0Retrieval-augmented Image Captioning Feb 16, 2023 Decoder Image Captioning
Code Code Available 1Tuning computer vision models with task rewards Feb 16, 2023 Colorization Image Captioning
— Unverified 0Towards Local Visual Modeling for Image Captioning Feb 13, 2023 Image Captioning Object Recognition
Code Code Available 1See Your Heart: Psychological states Interpretation through Visual Creations Feb 11, 2023 Emotion Classification Image Captioning
— Unverified 0Nemesis: Neural Mean Teacher Learning-Based Emotion-Centric Speaker Feb 9, 2023 Image Captioning
— Unverified 0Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning Feb 9, 2023 Few-Shot Learning Image Captioning
— Unverified 0Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning Feb 8, 2023 Caption Generation Decoder
— Unverified 0KENGIC: KEyword-driven and N-Gram Graph based Image Captioning Feb 7, 2023 Image Captioning
— Unverified 0Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning Feb 4, 2023 Caption Generation Coherence Evaluation
Code Code Available 0DEVICE: DEpth and VIsual ConcEpts Aware Transformer for TextCaps Feb 3, 2023 Image Captioning Optical Character Recognition (OCR)
— Unverified 0IC3: Image Captioning by Committee Consensus Feb 2, 2023 Image Captioning
Code Code Available 1UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers Jan 31, 2023 Image Captioning Image Classification
Code Code Available 1PromptMix: Text-to-image diffusion models enhance the performance of lightweight networks Jan 30, 2023 Crowd Counting Data Augmentation
— Unverified 0BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models Jan 30, 2023 Generative Visual Question Answering Image Captioning
Code Code Available 4Exploring External Knowledge for Accurate modeling of Visual and Language Problems Jan 27, 2023 Image Captioning Machine Translation
— Unverified 0Paraphrase Acquisition from Image Captions Jan 26, 2023 Articles Image Captioning
Code Code Available 0Style-Aware Contrastive Learning for Multi-Style Image Captioning Jan 26, 2023 Contrastive Learning Image Captioning
— Unverified 0Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data Jan 26, 2023 Image Captioning Relational Captioning
— Unverified 0Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation Jan 22, 2023 Common Sense Reasoning Image Captioning
— Unverified 0Exploring the Synergy Between Vision-Language Pretraining and ChatGPT for Artwork Captioning: A Preliminary Study Jan 21, 2023 Image Captioning Informativeness
Code Code Available 0Visual Semantic Relatedness Dataset for Image Captioning Jan 20, 2023 Image Captioning text similarity
Code Code Available 0