Text-based Person Search without Parallel Image-Text Data May 22, 2023 Image Captioning Language Modeling
— Unverified 0A request for clarity over the End of Sequence token in the Self-Critical Sequence Training May 20, 2023 Image Captioning Sentence
Code Code Available 0Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment May 20, 2023 Image Captioning Translation
— Unverified 0DiffCap: Exploring Continuous Diffusion on Image Captioning May 20, 2023 Caption Generation Diversity
— Unverified 0Semantic Composition in Visually Grounded Language Models May 15, 2023 Image Captioning Inductive Bias
— Unverified 0IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images May 12, 2023 Hyperparameter Optimization Image Captioning
Code Code Available 0Simple Token-Level Confidence Improves Caption Correctness May 11, 2023 Hallucination Image Captioning
— Unverified 0Towards L-System Captioning for Tree Reconstruction May 10, 2023 Image Captioning
— Unverified 0Exploiting Pseudo Image Captions for Multimodal Summarization May 9, 2023 Common Sense Reasoning Contrastive Learning
— Unverified 0UIT-OpenViIC: A Novel Benchmark for Evaluating Image Captioning in Vietnamese May 7, 2023 Image Captioning Vietnamese Image Captioning
— Unverified 0A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding May 5, 2023 Articles Image Captioning
Code Code Available 0The Role of Data Curation in Image Captioning May 5, 2023 Few-Shot Learning Image Captioning
Code Code Available 0Image Captioners Sometimes Tell More Than Images They See May 4, 2023 Descriptive Image Captioning
— Unverified 0Multimodal Data Augmentation for Image Captioning using Diffusion Models May 3, 2023 Data Augmentation Image Captioning
Code Code Available 0Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime May 3, 2023 Image Captioning Question Answering
— Unverified 0Fairness in AI Systems: Mitigating gender bias from language-vision models May 3, 2023 Fairness Image Captioning
— Unverified 0Quality-agnostic Image Captioning to Safely Assist People with Vision Impairment Apr 28, 2023 Data Augmentation Image Captioning
— Unverified 0Learning Human-Human Interactions in Images from Weak Textual Supervision Apr 27, 2023 Human-Human Interaction Recognition Image Captioning
— Unverified 0TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models Apr 18, 2023 Data Augmentation Diversity
Code Code Available 0A-CAP: Anticipation Captioning with Commonsense Knowledge Apr 13, 2023 Image Captioning Language Modeling
— Unverified 0Boosting Cross-task Transferability of Adversarial Patches with Visual Relations Apr 11, 2023 Image Captioning Object Recognition
— Unverified 0Advancing Medical Imaging with Language Models: A Journey from N-grams to ChatGPT Apr 11, 2023 Diagnostic Image Captioning
— Unverified 0ImageCaptioner^2: Image Captioner for Image Captioning Bias Amplification Assessment Apr 10, 2023 Image Captioning
— Unverified 0Model-Agnostic Gender Debiased Image Captioning Apr 7, 2023 Image Captioning model
Code Code Available 0Towards Self-Explainability of Deep Neural Networks with Heatmap Captioning and Large-Language Models Apr 5, 2023 Explainable Artificial Intelligence (XAI) Image Captioning
— Unverified 0Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data Apr 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Cross-Domain Image Captioning with Discriminative Finetuning Apr 4, 2023 Descriptive Image Captioning
— Unverified 0Grand Challenge On Detecting Cheapfakes Apr 3, 2023 Image Captioning
Code Code Available 0Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations Mar 29, 2023 Image Captioning Instance Segmentation
— Unverified 0Variational Distribution Learning for Unsupervised Text-to-Image Generation Mar 28, 2023 Image Captioning Image Generation
— Unverified 0Open-Vocabulary Object Detection using Pseudo Caption Labels Mar 23, 2023 Image Captioning Knowledge Distillation
— Unverified 0Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings Mar 20, 2023 Image Captioning Retrieval
— Unverified 0Multi-modal reward for visual relationships-based image captioning Mar 19, 2023 Caption Generation Deep Reinforcement Learning
— Unverified 0Visual Information Matters for ASR Error Correction Mar 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning Mar 15, 2023 Image Captioning
— Unverified 0Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images Mar 13, 2023 Common Sense Reasoning Explanation Generation
— Unverified 0Learning Combinatorial Prompts for Universal Controllable Image Captioning Mar 11, 2023 controllable image captioning Image Captioning
— Unverified 0Adapting Contrastive Language-Image Pretrained (CLIP) Models for Out-of-Distribution Detection Mar 10, 2023 Anomaly Detection Image Captioning
Code Code Available 0Interpretable Visual Question Answering Referring to Outside Knowledge Mar 8, 2023 Diversity Image Captioning
— Unverified 0Graph Neural Networks in Vision-Language Image Understanding: A Survey Mar 7, 2023 Image Captioning Image Retrieval
— Unverified 0Comparative study of Transformer and LSTM Network with attention mechanism on Image Captioning Mar 5, 2023 Image Captioning
— Unverified 0Language Is Not All You Need: Aligning Perception with Language Models Feb 27, 2023 All Image Captioning
— Unverified 0Tuning computer vision models with task rewards Feb 16, 2023 Colorization Image Captioning
— Unverified 0See Your Heart: Psychological states Interpretation through Visual Creations Feb 11, 2023 Emotion Classification Image Captioning
— Unverified 0Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning Feb 9, 2023 Few-Shot Learning Image Captioning
— Unverified 0Nemesis: Neural Mean Teacher Learning-Based Emotion-Centric Speaker Feb 9, 2023 Image Captioning
— Unverified 0Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning Feb 8, 2023 Caption Generation Decoder
— Unverified 0KENGIC: KEyword-driven and N-Gram Graph based Image Captioning Feb 7, 2023 Image Captioning
— Unverified 0Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning Feb 4, 2023 Caption Generation Coherence Evaluation
Code Code Available 0DEVICE: DEpth and VIsual ConcEpts Aware Transformer for TextCaps Feb 3, 2023 Image Captioning Optical Character Recognition (OCR)
— Unverified 0