Brain Captioning: Decoding human brain activity into images and text May 19, 2023 Brain Decoding Depth Estimation
Code Code Available 1Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner May 19, 2023 Dense Captioning Image Captioning
Code Code Available 1Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models May 15, 2023 3D Object Detection Image Captioning
Code Code Available 1InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation May 10, 2023 Benchmarking Image Captioning
Code Code Available 1Vision-Language Models in Remote Sensing: Current Progress and Future Trends May 9, 2023 Image Captioning Image Generation
Code Code Available 1Transforming Visual Scene Graphs to Image Captions May 3, 2023 Attribute Decoder
Code Code Available 1From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping Apr 26, 2023 Decoder Image Captioning
Code Code Available 1Uncurated Image-Text Datasets: Shedding Light on Demographic Bias Apr 6, 2023 Image Captioning Image Generation
Code Code Available 1AutoAD: Movie Description in Context Mar 29, 2023 Image Captioning Text Generation
Code Code Available 1Multimodal Image-Text Matching Improves Retrieval-based Chest X-Ray Report Generation Mar 29, 2023 Image Captioning Image-text matching
Code Code Available 1Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation Mar 21, 2023 Contrastive Learning Image Captioning
Code Code Available 1MAGVLT: Masked Generative Vision-and-Language Transformer Mar 21, 2023 Image Captioning Image Generation
Code Code Available 1ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation Mar 11, 2023 Image Captioning Image to text
Code Code Available 1Spawrious: A Benchmark for Fine Control of Spurious Correlation Biases Mar 9, 2023 Image Captioning image-classification
Code Code Available 1Neighborhood Contrastive Transformer for Change Captioning Mar 6, 2023 Decoder Image Captioning
Code Code Available 1DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training Mar 6, 2023 Decoder Image Captioning
Code Code Available 1FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks Mar 4, 2023 Cross-Modal Retrieval Image Captioning
Code Code Available 1Prismer: A Vision-Language Model with Multi-Task Experts Mar 4, 2023 Few-Shot Learning Image Captioning
Code Code Available 1ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing Mar 4, 2023 Diversity Image Captioning
Code Code Available 1ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax Mar 2, 2023 Descriptive Image Captioning
Code Code Available 1Retrieval-augmented Image Captioning Feb 16, 2023 Decoder Image Captioning
Code Code Available 1Towards Local Visual Modeling for Image Captioning Feb 13, 2023 Image Captioning Object Recognition
Code Code Available 1IC3: Image Captioning by Committee Consensus Feb 2, 2023 Image Captioning
Code Code Available 1UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers Jan 31, 2023 Image Captioning Image Classification
Code Code Available 1See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning Jan 12, 2023 Few-Shot Learning Image Captioning
Code Code Available 1Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning Dec 27, 2022 Image Captioning Image Retrieval
Code Code Available 1On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective Dec 24, 2022 Decision Making Image Captioning
Code Code Available 1Position-guided Text Prompt for Vision-Language Pre-training Dec 19, 2022 Cross-Modal Retrieval Image Captioning
Code Code Available 1Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift Dec 15, 2022 Benchmarking Image Captioning
Code Code Available 1Aesthetically Relevant Image Captioning Nov 25, 2022 Image Captioning Sentence
Code Code Available 1Exploring Discrete Diffusion Models for Image Captioning Nov 21, 2022 Image Captioning Image Generation
Code Code Available 1I Can't Believe There's No Images! Learning Visual Tasks Using only Language Supervision Nov 17, 2022 Image Captioning Question Answering
Code Code Available 1Progressive Tree-Structured Prototype Network for End-to-End Image Captioning Nov 17, 2022 Image Captioning
Code Code Available 1PromptCap: Prompt-Guided Task-Aware Image Captioning Nov 15, 2022 Image Captioning Language Modelling
Code Code Available 1Large-Scale Bidirectional Training for Zero-Shot Image Captioning Nov 13, 2022 Image Captioning Keyword Extraction
Code Code Available 1DeltaNet:Conditional Medical Report Generation for COVID-19 Diagnosis Nov 12, 2022 COVID-19 Diagnosis Decoder
Code Code Available 1Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation Oct 20, 2022 Decoder Image Captioning
Code Code Available 1MAPL: Parameter-Efficient Adaptation of Unimodal Pre-Trained Models for Vision-Language Few-Shot Prompting Oct 13, 2022 Image Captioning Question Answering
Code Code Available 1Not All Errors are Equal: Learning Text Generation Metrics using Stratified Error Synthesis Oct 10, 2022 All Image Captioning
Code Code Available 1CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning Oct 10, 2022 Decoder Denoising
Code Code Available 1Towards Multi-Modal Sarcasm Detection via Hierarchical Congruity Modeling with Knowledge Enhancement Oct 7, 2022 Image Captioning Sarcasm Detection
Code Code Available 1SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation Sep 30, 2022 Decoder Image Captioning
Code Code Available 1Linearly Mapping from Image to Text Space Sep 30, 2022 Image Captioning Image to text
Code Code Available 1Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text Sep 28, 2022 Image Captioning Image Retrieval
Code Code Available 1Learning Distinct and Representative Styles for Image Captioning Sep 17, 2022 Diversity Image Captioning
Code Code Available 1Belief Revision based Caption Re-ranker with Visual Semantic Information Sep 16, 2022 Caption Generation Image Captioning
Code Code Available 1M^4I: Multi-modal Models Membership Inference Sep 15, 2022 Image Captioning Inference Attack
Code Code Available 1VAuLT: Augmenting the Vision-and-Language Transformer for Sentiment Classification on Social Media Aug 18, 2022 Descriptive Diversity
Code Code Available 1Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning Aug 13, 2022 Image Captioning
Code Code Available 1Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning Aug 8, 2022 Image Captioning Image Generation
Code Code Available 1