Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Mar 27, 2024 Image Classification Image Comprehension
Code Code Available 7Hawk: Learning to Understand Open-World Video Anomalies May 27, 2024 Anomaly Detection Question Answering
Code Code Available 3Answerer in Questioner's Mind: Information Theoretic Approach to Goal-Oriented Visual Dialog Feb 12, 2018 Goal-Oriented Dialog Reinforcement Learning
Code Code Available 1Video Dialog as Conversation about Objects Living in Space-Time Jul 8, 2022 Object Relational Reasoning
Code Code Available 1An Annotated Corpus of Reference Resolution for Interpreting Common Grounding Nov 18, 2019 Coreference Resolution Goal-Oriented Dialog
Code Code Available 1The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training May 25, 2022 Conditional Text Generation Out-of-Distribution Detection
Code Code Available 1Where Are You? Localization from Embodied Dialog Nov 16, 2020 Navigate Visual Dialog
Code Code Available 1Unified Multimodal Model with Unlikelihood Training for Visual Dialog Nov 23, 2022 Answer Generation Chatbot
Code Code Available 1History for Visual Dialog: Do we really need it? May 8, 2020 Visual Dialog
Code Code Available 1Visual Dialog Nov 26, 2016 AI Agent Chatbot
Code Code Available 1Ensemble of MRR and NDCG models for Visual Dialog Apr 15, 2021 AI Agent Visual Dialog
Code Code Available 1Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation Feb 22, 2019 Question Generation Question-Generation
Code Code Available 1Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning Mar 20, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7 Jun 1, 2018 Video Description Visual Dialog
Code Code Available 1Multi-View Attention Network for Visual Dialog Apr 29, 2020 Visual Dialog
Code Code Available 1VD-BERT: A Unified Vision and Dialog Transformer with BERT Apr 28, 2020 Answer Generation Visual Dialog
Code Code Available 1VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution May 29, 2022 AI Agent coreference-resolution
Code Code Available 1Visual Dialogue State Tracking for Question Generation Nov 12, 2019 Dialogue State Tracking Question Generation
Code Code Available 1Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer Apr 14, 2020 Graph Learning Graph structure learning
Code Code Available 1Hierarchical Question-Image Co-Attention for Visual Question Answering May 31, 2016 Visual Dialog Visual Question Answering
Code Code Available 1Iterative Context-Aware Graph Inference for Visual Dialog Apr 5, 2020 Graph Attention Graph Embedding
Code Code Available 1Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline Dec 5, 2019 Language Modelling Representation Learning
Code Code Available 1Building Task-Oriented Visual Dialog Systems Through Alternative Optimization Between Dialog Policy and Language Generation Sep 6, 2019 Decoder Reinforcement Learning
— Unverified 0Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog May 8, 2018 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0Multi-Modal Open-Domain Dialogue Oct 2, 2020 Visual Dialog
— Unverified 0Effective questions in referential visual dialogue Jul 1, 2020 Visual Dialog
— Unverified 0Modality-Balanced Models for Visual Dialogue Jan 17, 2020 Visual Dialog
— Unverified 0Adversarial Robustness of Visual Dialog Jul 6, 2022 Adversarial Robustness Visual Dialog
— Unverified 0Enhancing Visual Dialog State Tracking through Iterative Object-Entity Alignment in Multi-Round Conversations Aug 13, 2024 dialog state tracking Dialogue State Tracking
— Unverified 0ENRICH4ALL: A First Luxembourgish BERT Model for a Multilingual Chatbot Jun 1, 2022 Chatbot Language Modeling
— Unverified 0Gold Seeker: Information Gain from Policy Distributions for Goal-oriented Vision-and-Langauge Reasoning Dec 16, 2018 Reinforcement Learning Visual Dialog
— Unverified 0Image-Question-Answer Synergistic Network for Visual Dialog Feb 26, 2019 Visual Dialog
— Unverified 0A survey on knowledge-enhanced multimodal learning Nov 19, 2022 Conditional Image Generation Factual Visual Question Answering
— Unverified 0Modeling Coreference Relations in Visual Dialog Mar 6, 2022 Question Answering Visual Dialog
— Unverified 0Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog Feb 1, 2019 Question Answering Visual Dialog
— Unverified 0Discourse Analysis for Evaluating Coherence in Video Paragraph Captions Jan 17, 2022 Video Captioning Visual Dialog
— Unverified 0A Generative Adversarial Density Estimator Jun 1, 2019 Density Estimation Visual Dialog
— Unverified 0Learning to Ground Visual Objects for Visual Dialog Sep 13, 2021 Visual Dialog
— Unverified 0Grounded Agreement Games: Emphasizing Conversational Grounding in Visual Dialogue Settings Aug 29, 2019 Chatbot Visual Dialog
— Unverified 0Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning Nov 21, 2017 Question Answering Reinforcement Learning
— Unverified 0Granular Multimodal Attention Networks for Visual Dialog Oct 13, 2019 Visual Dialog
— Unverified 0GoG: Relation-aware Graph-over-Graph Network for Visual Dialog Sep 17, 2021 coreference-resolution Coreference Resolution
— Unverified 0Learning Goal-Oriented Visual Dialog Agents: Imitating and Surpassing Analytic Experts Jul 24, 2019 Imitation Learning reinforcement-learning
— Unverified 0Making History Matter: History-Advantage Sequence Training for Visual Dialog Feb 25, 2019 Answer Generation Decoder
— Unverified 0How to Fool Systems and Humans in Visually Grounded Interaction: A Case Study on Adversarial Attacks on Visual Dialog Jan 16, 2022 Visual Dialog
— Unverified 0ICCV23 Visual-Dialog Emotion Explanation Challenge: SEU_309 Team Technical Report Jul 13, 2024 Explanation Generation Language Modeling
— Unverified 0Generative Visual Dialogue System via Adaptive Reasoning and Weighted Likelihood Estimation Feb 26, 2019 Visual Dialog
— Unverified 0Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning Apr 15, 2022 Contrastive Learning Question Answering
— Unverified 0FlipDial: A Generative Model for Two-Way Visual Dialogue Feb 11, 2018 Visual Dialog Vocal Bursts Valence Prediction
— Unverified 0Connecting Language and Vision to Actions Jul 1, 2018 Image Captioning Language Modeling
— Unverified 0