OG-SGG: Ontology-Guided Scene Graph Generation. A Case Study in Transfer Learning for Telepresence Robotics Feb 21, 2022 BIG-bench Machine Learning Graph Generation
Code Code Available 0RankDVQA: Deep VQA based on Ranking-inspired Hybrid Training Feb 17, 2022 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 0Delving Deeper into Cross-lingual Visual Question Answering Feb 15, 2022 Inductive Bias Question Answering
Code Code Available 0Privacy Preserving Visual Question Answering Feb 15, 2022 Privacy Preserving Question Answering
— Unverified 0An experimental study of the vision-bottleneck in VQA Feb 14, 2022 Object Question Answering
— Unverified 0Can Open Domain Question Answering Systems Answer Visual Knowledge Questions? Feb 9, 2022 Open-Domain Question Answering Question Answering
— Unverified 0NEWSKVQA: Knowledge-Aware News Video Question Answering Feb 8, 2022 Common Sense Reasoning Management
— Unverified 0OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework Feb 7, 2022 Image Captioning image-classification
Code Code Available 0Grounding Answers for Visual Questions Asked by Visually Impaired People Feb 4, 2022 Question Answering Visual Question Answering
Code Code Available 0Webly Supervised Concept Expansion for General Purpose Vision Models Feb 4, 2022 Human-Object Interaction Detection Image Retrieval
— Unverified 0Compositionality as Lexical Symmetry Jan 30, 2022 Data Augmentation Inductive Bias
Code Code Available 0Transformer Module Networks for Systematic Generalization in Visual Question Answering Jan 27, 2022 Question Answering Systematic Generalization
Code Code Available 0Learning to Compose Diversified Prompts for Image Emotion Classification Jan 26, 2022 Classification Emotion Classification
— Unverified 0MGA-VQA: Multi-Granularity Alignment for Visual Question Answering Jan 25, 2022 Question Answering Visual Question Answering
— Unverified 0SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering Jan 25, 2022 Question Answering Visual Question Answering
— Unverified 0Question Generation for Evaluating Cross-Dataset Shifts in Multi-modal Grounding Jan 24, 2022 Question Answering Question Generation
— Unverified 0KAT: A Knowledge Augmented Transformer for Vision-and-Language Jan 16, 2022 Answer Generation Decoder
— Unverified 0All You May Need for VQA are Image Captions Jan 16, 2022 All Image Captioning
— Unverified 0Task Formulation Matters When Learning Continuously: A Case Study in Visual Question Answering Jan 16, 2022 Continual Learning Incremental Learning
— Unverified 0Probing the Role of Positional Information in Vision-Language Models Jan 16, 2022 Contrastive Learning Image-text matching
— Unverified 0Retrieving Visual Facts For Few-Shot Visual Question Answering Jan 16, 2022 Language Modeling Language Modelling
— Unverified 0MANGO: Enhancing the Robustness of VQA Models via Adversarial Noise Generation Jan 16, 2022 Logical Reasoning Question Answering
— Unverified 0CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks Jan 15, 2022 Question Answering Visual Commonsense Reasoning
— Unverified 0A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering Jan 14, 2022 Generative Question Answering Image to text
— Unverified 0Towards Automated Error Analysis: Learning to Characterize Errors Jan 13, 2022 Common Sense Reasoning Meta-Learning
— Unverified 0On the Efficacy of Co-Attention Transformer Layers in Visual Question Answering Jan 11, 2022 POS Question Answering
— Unverified 0Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training Jan 11, 2022 Decoder Image Captioning
— Unverified 0COIN: Counterfactual Image Generation for VQA Interpretation Jan 10, 2022 counterfactual Image Generation
— Unverified 0Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety Jan 4, 2022 Decoder Deep Learning
— Unverified 0Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering Jan 1, 2022 Generative Question Answering Image to text
— Unverified 0Query and Attention Augmentation for Knowledge-Based Explainable Reasoning Jan 1, 2022 Question Answering Visual Question Answering
Code Code Available 0Towards General Purpose Vision Systems: An End-to-End Task-Agnostic Vision-Language Architecture Jan 1, 2022 Question Answering Visual Question Answering
— Unverified 0V-Doc: Visual Questions Answers With Documents Jan 1, 2022 Question Answering Question Generation
— Unverified 0Does CLIP Benefit Visual Question Answering in the Medical Domain as Much as it Does in the General Domain? Dec 27, 2021 Articles Medical Visual Question Answering
— Unverified 0Multi-Image Visual Question Answering Dec 27, 2021 Question Answering Visual Question Answering
Code Code Available 0General Greedy De-bias Learning Dec 20, 2021 image-classification Image Classification
Code Code Available 0Task-Oriented Multi-User Semantic Communications Dec 19, 2021 Image Retrieval Machine Translation
— Unverified 0Zero-shot and Few-shot Learning with Knowledge Graphs: A Comprehensive Survey Dec 18, 2021 Data Augmentation Few-Shot Learning
— Unverified 0Understanding Attention for Vision-and-Language Tasks Dec 17, 2021 Image Generation Image Retrieval
— Unverified 03D Question Answering Dec 15, 2021 3D geometry Question Answering
— Unverified 0Improving and Diagnosing Knowledge-Based Visual Question Answering via Entity Enhanced Knowledge Injection Dec 13, 2021 Common Sense Reasoning Knowledge Graph Embeddings
— Unverified 0Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation Dec 10, 2021 Image-text matching Image-text Retrieval
— Unverified 0MoCA: Incorporating Multi-stage Domain Pretraining and Cross-guided Multimodal Attention for Textbook Question Answering Dec 6, 2021 Language Modelling Question Answering
— Unverified 0Curriculum Learning Effectively Improves Low Data VQA Dec 1, 2021 Question Answering Visual Question Answering
— Unverified 0Robust Visual Reasoning via Language Guided Neural Module Networks Dec 1, 2021 Question Answering Referring Expression
— Unverified 0eaVQA: An Experimental Analysis on Visual Question Answering Models Dec 1, 2021 Question Answering Visual Question Answering
— Unverified 0Scallop: From Probabilistic Deductive Databases to Scalable Differentiable Reasoning Dec 1, 2021 Logical Reasoning Question Answering
— Unverified 0LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering Nov 29, 2021 Diversity Question Answering
— Unverified 0Scene Graph Generation with Geometric Context Nov 25, 2021 Activity Recognition Graph Generation
— Unverified 0A Confidence-Based Interface for Neuro-Symbolic Visual Question Answering Nov 21, 2021 Question Answering Translation
— Unverified 0