Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | Mar 27, 2024 | Image Classification Image Comprehension | Code Available 75
Hawk: Learning to Understand Open-World Video Anomalies | May 27, 2024 | Anomaly Detection Question Answering | Code Available 35
An Annotated Corpus of Reference Resolution for Interpreting Common Grounding | Nov 18, 2019 | Coreference Resolution Goal-Oriented Dialog | Code Available 15
Visual Dialogue State Tracking for Question Generation | Nov 12, 2019 | Dialogue State Tracking Question Generation | Code Available 15
Video Dialog as Conversation about Objects Living in Space-Time | Jul 8, 2022 | Object Relational Reasoning | Code Available 15
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline | Dec 5, 2019 | Language Modelling Representation Learning | Code Available 15
The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training | May 25, 2022 | Conditional Text Generation Out-of-Distribution Detection | Code Available 15
Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation | Feb 22, 2019 | Question Generation Question-Generation | Code Available 15
History for Visual Dialog: Do we really need it? | May 8, 2020 | Visual Dialog | Code Available 15
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning | Mar 20, 2017 | Deep Reinforcement Learning reinforcement-learning | Code Available 15
Visual Dialog | Nov 26, 2016 | AI Agent Chatbot | Code Available 15
VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution | May 29, 2022 | AI Agent coreference-resolution | Code Available 15
VD-BERT: A Unified Vision and Dialog Transformer with BERT | Apr 28, 2020 | Answer Generation Visual Dialog | Code Available 15
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7 | Jun 1, 2018 | Video Description Visual Dialog | Code Available 15
Iterative Context-Aware Graph Inference for Visual Dialog | Apr 5, 2020 | Graph Attention Graph Embedding | Code Available 15
Where Are You? Localization from Embodied Dialog | Nov 16, 2020 | Navigate Visual Dialog | Code Available 15
Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual Dialog | Feb 12, 2018 | Goal-Oriented Dialog Reinforcement Learning | Code Available 15
Hierarchical Question-Image Co-Attention for Visual Question Answering | May 31, 2016 | Visual Dialog Visual Question Answering | Code Available 15
Multi-View Attention Network for Visual Dialog | Apr 29, 2020 | Visual Dialog | Code Available 15
Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer | Apr 14, 2020 | Graph Learning Graph structure learning | Code Available 15
Ensemble of MRR and NDCG models for Visual Dialog | Apr 15, 2021 | AI Agent Visual Dialog | Code Available 15
Unified Multimodal Model with Unlikelihood Training for Visual Dialog | Nov 23, 2022 | Answer Generation Chatbot | Code Available 15
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model | Jun 5, 2017 | Informativeness Metric Learning | Code Available 05
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser | Sep 6, 2021 | Diversity Reinforcement Learning (RL) | Code Available 05
Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs | Nov 26, 2019 | All Visual Dialog | Code Available 05
DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue | Nov 17, 2019 | feature selection Question Answering | Code Available 05
Two Causal Principles for Improving Visual Dialog | Nov 24, 2019 | Visual Dialog Vocal Bursts Valence Prediction | Code Available 05
Dual Attention Networks for Visual Reference Resolution in Visual Dialog | Feb 25, 2019 | AI Agent Question Answering | Code Available 05
TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning Baselines | Oct 31, 2019 | Attribute Question Answering | Code Available 05
UNITER-Based Situated Coreference Resolution with Rich Multimodal Input | Dec 7, 2021 | coreference-resolution Coreference Resolution | Code Available 05
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog | Dec 18, 2019 | AI Agent Decoder | Code Available 05
TAB-VCR: Tags and Attributes based VCR Baselines | Dec 1, 2019 | Attribute Question Answering | Code Available 05
Recursive Visual Attention in Visual Dialog | Dec 6, 2018 | Question Answering Visual Dialog | Code Available 05
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data | Jul 24, 2020 | Visual Dialog Visual Question Answering (VQA) | Code Available 05
SeqDialN: Sequential Visual Dialog Network in Joint Visual-Linguistic Representation Space | Aug 1, 2021 | Visual Dialog | Code Available 05
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts | May 24, 2023 | Dialogue State Tracking Image Retrieval | Code Available 05
Perceptual Score: What Data Modalities Does Your Model Perceive? | Oct 27, 2021 | Question Answering Visual Dialog | Code Available 05
SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space | Aug 2, 2020 | Visual Dialog | Code Available 05
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation | May 24, 2021 | Referring Expression Referring Expression Comprehension | Code Available 05
Improving Generative Visual Dialog by Answering Diverse Questions | Sep 23, 2019 | Reinforcement Learning Representation Learning | Code Available 05
Dialog-based Interactive Image Retrieval | May 1, 2018 | Image Retrieval reinforcement-learning | Code Available 05
Factor Graph Attention | Apr 11, 2019 | Graph Attention Question Answering | Code Available 05
LAVIS: A Library for Language-Vision Intelligence | Sep 15, 2022 | Benchmarking Image Captioning | Code Available 05
Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient | Jul 2, 2018 | Deep Reinforcement Learning Policy Gradient Methods | Code Available 05
Ask No More: Deciding when to guess in referential visual dialogue | May 17, 2018 | Decision Making Visual Dialog | Code Available 05
Reasoning Visual Dialogs with Structural and Partial Observations | Apr 11, 2019 | Graph Neural Network Visual Dialog | Code Available 05
Examining Cooperation in Visual Dialog Models | Dec 4, 2017 | Visual Dialog | Code Available 05
Collecting Visually-Grounded Dialogue with A Game Of Sorts | Sep 10, 2023 | Coreference Resolution Image Retrieval | Code Available 05
Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue | Oct 1, 2020 | Question Generation Question-Generation | Code Available 05
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog | Mar 7, 2019 | coreference-resolution Coreference Resolution | Code Available 05