Propagating Over Phrase Relations for One-Stage Visual Grounding | Aug 1, 2020 | Phrase Grounding, Relational Reasoning | Unverified
Spatially Aware Multimodal Transformers for TextVQA | Jul 23, 2020 | Optical Character Recognition (OCR), Spatial Reasoning | Code Available
Visual Relation Grounding in Videos | Jul 17, 2020 | Question Answering, Relation | Code Available
Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder | Jul 13, 2020 | Question Answering, Visual Grounding | Unverified
Multi-Granularity Modularized Network for Abstract Visual Reasoning | Jul 9, 2020 | Visual Grounding, Visual Reasoning | Unverified
Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation | Jul 3, 2020 | Contrastive Learning, Knowledge Distillation | Code Available
Knowledge Supports Visual Language Grounding: A Case Study on Colour Terms | Jul 1, 2020 | Diagnostic, Object | Unverified
Fast visual grounding in interaction: bringing few-shot learning with neural networks to an interactive robot | Jun 1, 2020 | Few-Shot Learning, Transfer Learning | Unverified
Visual Grounding Annotation of Recipe Flow Graph | May 1, 2020 | Visual Grounding | Unverified
Visual Grounding of Learned Physical Models | Apr 28, 2020 | Visual Grounding | Code Available
Deep Multimodal Neural Architecture Search | Apr 25, 2020 | Decoder, Image-text matching | Code Available
Visual Grounding Methods for VQA are Working for the Wrong Reasons! | Apr 12, 2020 | Question Answering, Visual Grounding | Code Available
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation | Mar 31, 2020 | Knowledge Distillation, Object | Unverified
Giving Commands to a Self-driving Car: A Multimodal Reasoner for Visual Grounding | Mar 19, 2020 | Object, Referring Expression Comprehension | Unverified
Visual Grounding in Video for Unsupervised Word Translation | Mar 11, 2020 | Translation, Visual Grounding | Code Available
Guessing State Tracking for Visual Dialogue | Feb 24, 2020 | Visual Grounding | Code Available
Emergent Communication with World Models | Feb 22, 2020 | Visual Grounding | Unverified
Learning Cross-modal Context Graph for Visual Grounding | Feb 13, 2020 | Graph Matching, Graph Neural Network | Code Available
Exploring Context, Attention and Audio Features for Audio Visual Scene-Aware Dialog | Dec 20, 2019 | Audio Classification, Visual Grounding | Unverified
Connecting Vision and Language with Localized Narratives | Dec 6, 2019 | Form, Image Captioning | Code Available
Compositional Temporal Visual Grounding of Natural Language Event Descriptions | Dec 4, 2019 | Visual Grounding | Unverified
OptiBox: Breaking the Limits of Proposals for Visual Grounding | Nov 29, 2019 | Image Captioning, Visual Grounding | Unverified
Learning Cross-modal Context Graph for Visual Grounding | Nov 20, 2019 | Graph Matching, Graph Neural Network | Code Available
Leveraging Past References for Robust Language Grounding | Nov 1, 2019 | Object, Referring Expression | Unverified
Countering Language Drift via Visual Grounding | Sep 10, 2019 | Language Modeling, Language Modelling | Unverified
Language learning using Speech to Image retrieval | Sep 9, 2019 | Grounded language learning, Image Retrieval | Unverified
Differentiable Disentanglement Filter: an Application Agnostic Core Concept Discovery Probe | Sep 4, 2019 | Disentanglement, Visual Grounding | Unverified
A Fast and Accurate One-Stage Approach to Visual Grounding | Aug 18, 2019 | Referring Expression, Referring Expression Comprehension | Code Available
Multimodal Unified Attention Networks for Vision-and-Language Interactions | Aug 12, 2019 | Question Answering, Visual Grounding | Unverified
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks | Aug 6, 2019 | Image Retrieval, Question Answering | Code Available
Differentiable Disentanglement Filter: an Application Agnostic Core Concept Discovery Probe | Jul 17, 2019 | Disentanglement, Visual Grounding | Unverified
Transfer Learning from Audio-Visual Grounding to Speech Recognition | Jul 9, 2019 | speech-recognition, Speech Recognition | Unverified
Joint Visual Grounding with Language Scene Graphs | Jun 9, 2019 | Referring Expression, Visual Grounding | Unverified
Visually Grounded Neural Syntax Acquisition | Jun 7, 2019 | Visual Grounding | Unverified
Learning to Compose and Reason with Language Tree Structures for Visual Grounding | Jun 5, 2019 | Visual Grounding, Visual Reasoning | Unverified
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval | Apr 24, 2019 | Retrieval, Visual Grounding | Unverified
Semantic query-by-example speech search using visual grounding | Apr 15, 2019 | Retrieval, Semantic Retrieval | Code Available
Modularized Textual Grounding for Counterfactual Resilience | Apr 7, 2019 | Attribute, counterfactual | Code Available
VQD: Visual Query Detection in Natural Scenes | Apr 4, 2019 | Referring Expression, Referring Expression Comprehension | Unverified
Revisiting Visual Grounding | Apr 3, 2019 | Image Retrieval, Retrieval | Unverified
Learning semantic sentence representations from visually grounded language without lexical knowledge | Mar 27, 2019 | Grounded language learning, Learning Semantic Representations | Code Available
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment | Mar 27, 2019 | Image Retrieval, Phrase Grounding | Unverified
Dual Attention Networks for Visual Reference Resolution in Visual Dialog | Feb 25, 2019 | AI Agent, Question Answering | Code Available
You Only Look & Listen Once: Towards Fast and Accurate Visual Grounding | Feb 12, 2019 | object-detection, Object Detection | Code Available
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded | Feb 11, 2019 | Image Captioning, Question Answering | Unverified
Learning to Assemble Neural Module Tree Networks for Visual Grounding | Dec 8, 2018 | Dependency Parsing, Natural Language Visual Grounding | Unverified
Multi-task Learning of Hierarchical Vision-Language Representation | Dec 3, 2018 | Multi-Task Learning, Question Answering | Unverified
Being data-driven is not enough: Revisiting interactive instruction giving as a challenge for NLG | Nov 1, 2018 | Text Generation, Visual Grounding | Unverified
Overcoming Language Priors in Visual Question Answering with Adversarial Regularization | Oct 8, 2018 | Question Answering, Visual Grounding | Unverified
Beyond task success: A closer look at jointly learning to see, ask, and GuessWhat | Sep 10, 2018 | Multi-Task Learning, Reinforcement Learning | Code Available