Aligning MAGMA by Few-Shot Learning and Finetuning Oct 18, 2022 Few-Shot Learning Image Captioning
— Unverified 0Probing Cross-modal Semantics Alignment Capability from the Textual Perspective Oct 18, 2022 Image Captioning Sentence
— Unverified 0Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training Oct 17, 2022 Image Captioning Network Interpretation
Code Code Available 0Generating image captions with external encyclopedic knowledge Oct 10, 2022 Caption Generation Image Captioning
— Unverified 0MMT: Image-guided Story Ending Generation with Multimodal Memory Transformer Oct 10, 2022 Decoder Image Captioning
Code Code Available 0Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning Oct 4, 2022 Image Captioning Sentence
Code Code Available 0Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity Oct 3, 2022 Audio captioning Image Captioning
— Unverified 0JPG - Jointly Learn to Align: Automated Disease Prediction and Radiology Report Generation Oct 1, 2022 cross-modal alignment Disease Prediction
— Unverified 0On the Effects of Video Grounding on Language Models Oct 1, 2022 Image Captioning Question Answering
— Unverified 0DeltaNet: Conditional Medical Report Generation for COVID-19 Diagnosis Oct 1, 2022 COVID-19 Diagnosis Decoder
— Unverified 0Multi-view and Cross-view Brain Decoding Oct 1, 2022 Brain Decoding Image Captioning
— Unverified 0Medical Image Captioning via Generative Pretrained Transformers Sep 28, 2022 Caption Generation Descriptive
— Unverified 0DRAMA: Joint Risk Localization and Captioning in Driving Sep 22, 2022 Image Captioning
— Unverified 0Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia Sep 21, 2022 Articles Image Captioning
— Unverified 0Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering Sep 21, 2022 Image Captioning Optical Character Recognition (OCR)
— Unverified 0LAVIS: A Library for Language-Vision Intelligence Sep 15, 2022 Benchmarking Image Captioning
— Unverified 0OmniVL:One Foundation Model for Image-Language and Video-Language Tasks Sep 15, 2022 Action Classification Action Recognition
— Unverified 0PaLI: A Jointly-Scaled Multilingual Language-Image Model Sep 14, 2022 Decoder Few-Shot Image Classification
— Unverified 0PreSTU: Pre-Training for Scene-Text Understanding Sep 12, 2022 Decoder Image Captioning
— Unverified 0Every picture tells a story: Image-grounded controllable stylistic story generation Sep 4, 2022 Image Captioning Image to text
— Unverified 0vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM Sep 3, 2022 Decoder Image Captioning
— Unverified 0Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks Aug 22, 2022 All Cross-Modal Retrieval
Code Code Available 0A Medical Semantic-Assisted Transformer for Radiographic Report Generation Aug 22, 2022 Image Captioning Medical Report Generation
— Unverified 0Target-oriented Sentiment Classification with Sequential Cross-modal Semantic Graph Aug 19, 2022 Decoder Image Captioning
Code Code Available 0GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement Aug 18, 2022 Grounded Situation Recognition Image Captioning
Code Code Available 0ILLUME: Rationalizing Vision-Language Models through Human Interactions Aug 17, 2022 Image Captioning Question Answering
Code Code Available 0Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2 Aug 9, 2022 Attribute Image Captioning
— Unverified 0Distinctive Image Captioning via CLIP Guided Group Optimization Aug 8, 2022 Image Captioning
— Unverified 0RadTex: Learning Efficient Radiograph Representations from Text Reports Aug 5, 2022 Classification Decoder
— Unverified 0Prompt Tuning for Generative Multimodal Pretrained Models Aug 4, 2022 Image Captioning Visual Entailment
— Unverified 0Neuro-Symbolic Learning: Principles and Applications in Ophthalmology Jul 31, 2022 Common Sense Reasoning Image Captioning
— Unverified 0Retrieval-Augmented Transformer for Image Captioning Jul 26, 2022 Image Captioning Retrieval
— Unverified 0Efficient Modeling of Future Context for Image Captioning Jul 22, 2022 Image Captioning Sentence
Code Code Available 0Rethinking the Reference-based Distinctive Image Captioning Jul 22, 2022 Attribute Benchmarking
Code Code Available 0LineCap: Line Charts for Data Visualization Captioning Models Jul 15, 2022 Data Visualization Deep Learning
Code Code Available 0A Baseline for Detecting Out-of-Distribution Examples in Image Captioning Jul 12, 2022 Image Captioning Out of Distribution (OOD) Detection
— Unverified 0Adaptive Fine-Grained Predicates Learning for Scene Graph Generation Jul 11, 2022 Fine-Grained Image Classification Graph Generation
— Unverified 0Predicting Word Learning in Children from the Performance of Computer Vision Systems Jul 7, 2022 Image Captioning
— Unverified 0Exploring the sequence length bottleneck in the Transformer for Image Captioning Jul 7, 2022 Image Captioning
Code Code Available 0Are metrics measuring what they should? An evaluation of image captioning task metrics Jul 4, 2022 Image Captioning
— Unverified 0American == White in Multimodal Language-and-Image AI Jul 1, 2022 Image Captioning Question Answering
— Unverified 0MilaNLP at SemEval-2022 Task 5: Using Perceiver IO for Detecting Misogynous Memes with Text and Image Modalities Jul 1, 2022 Image Captioning
Code Code Available 0ZoDIAC: Zoneout Dropout Injection Attention Calculation Jun 28, 2022 Image Captioning image-classification
Code Code Available 0Competence-based Multimodal Curriculum Learning for Medical Report Generation Jun 24, 2022 Image Captioning Medical Report Generation
— Unverified 0DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection Jun 20, 2022 Image Captioning Image Generation
— Unverified 00/1 Deep Neural Networks via Block Coordinate Descent Jun 19, 2022 10-shot image generation
— Unverified 0A Self-Guided Framework for Radiology Report Generation Jun 19, 2022 Image Captioning Medical Report Generation
— Unverified 0Image Captioning based on Feature Refinement and Reflective Decoding Jun 16, 2022 Decoder Image Captioning
— Unverified 0A Unified Sequence Interface for Vision Tasks Jun 15, 2022 Image Captioning Instance Segmentation
— Unverified 0Measuring Representational Harms in Image Captioning Jun 14, 2022 Fairness Image Captioning
— Unverified 0