V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts Mar 3, 2025 Contrastive Learning Text Retrieval
— Unverified 0V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts Jan 1, 2025 Contrastive Learning Text Retrieval
— Unverified 0Enhancing Visual Dialog State Tracking through Iterative Object-Entity Alignment in Multi-Round Conversations Aug 13, 2024 dialog state tracking Dialogue State Tracking
— Unverified 0ICCV23 Visual-Dialog Emotion Explanation Challenge: SEU_309 Team Technical Report Jul 13, 2024 Explanation Generation Language Modeling
— Unverified 0Hawk: Learning to Understand Open-World Video Anomalies May 27, 2024 Anomaly Detection Question Answering
Code Code Available 3Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Mar 27, 2024 Image Classification Image Comprehension
Code Code Available 7FlexCap: Describe Anything in Images in Controllable Detail Mar 18, 2024 Attribute Dense Captioning
— Unverified 0VD-GR: Boosting Visual Dialog with Cascaded Spatial-Temporal Multi-Modal GRaphs Oct 25, 2023 Visual Dialog
— Unverified 0Collecting Visually-Grounded Dialogue with A Game Of Sorts Sep 10, 2023 Coreference Resolution Image Retrieval
Code Code Available 0Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations Aug 30, 2023 Explanation Generation Question Answering
— Unverified 0PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts May 24, 2023 Dialogue State Tracking Image Retrieval
Code Code Available 0Unified Multimodal Model with Unlikelihood Training for Visual Dialog Nov 23, 2022 Answer Generation Chatbot
Code Code Available 1A survey on knowledge-enhanced multimodal learning Nov 19, 2022 Conditional Image Generation Factual Visual Question Answering
— Unverified 0Knowledge Transfer with Visual Prompt in multi-modal Dialogue Understanding and Generation Oct 1, 2022 Dialogue Understanding Knowledge Distillation
— Unverified 0LAVIS: A Library for Language-Vision Intelligence Sep 15, 2022 Benchmarking Image Captioning
Code Code Available 0Video Dialog as Conversation about Objects Living in Space-Time Jul 8, 2022 Object Relational Reasoning
Code Code Available 1Adversarial Robustness of Visual Dialog Jul 6, 2022 Adversarial Robustness Visual Dialog
— Unverified 0ENRICH4ALL: A First Luxembourgish BERT Model for a Multilingual Chatbot Jun 1, 2022 Chatbot Language Modeling
— Unverified 0VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution May 29, 2022 AI Agent coreference-resolution
Code Code Available 1The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training May 25, 2022 Conditional Text Generation Out-of-Distribution Detection
Code Code Available 1UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog May 1, 2022 Contrastive Learning Representation Learning
— Unverified 0Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning Apr 15, 2022 Contrastive Learning Question Answering
— Unverified 0Reasoning with Multi-Structure Commonsense Knowledge in Visual Dialog Apr 10, 2022 Logical Reasoning Sentence
— Unverified 0Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene Mar 16, 2022 Visual Dialog
Code Code Available 0Modeling Coreference Relations in Visual Dialog Mar 6, 2022 Question Answering Visual Dialog
— Unverified 0VU-BERT: A Unified framework for Visual Dialog Feb 22, 2022 Language Modeling Language Modelling
— Unverified 0Discourse Analysis for Evaluating Coherence in Video Paragraph Captions Jan 17, 2022 Video Captioning Visual Dialog
— Unverified 0How to Fool Systems and Humans in Visually Grounded Interaction: A Case Study on Adversarial Attacks on Visual Dialog Jan 16, 2022 Visual Dialog
— Unverified 0UNITER-Based Situated Coreference Resolution with Rich Multimodal Input Dec 7, 2021 coreference-resolution Coreference Resolution
Code Code Available 0Region under Discussion for visual dialog Nov 1, 2021 Visual Dialog
— Unverified 0Enriching Language Models with Visually-grounded Word Vectors and the Lancaster Sensorimotor Norms Nov 1, 2021 Visual Dialog
— Unverified 0Perceptual Score: What Data Modalities Does Your Model Perceive? Oct 27, 2021 Question Answering Visual Dialog
Code Code Available 0ViDA-MAN: Visual Dialog with Digital Humans Oct 26, 2021 speech-recognition Speech Recognition
— Unverified 0Evaluating and Improving Interactions with Hazy Oracles Oct 19, 2021 Object Tracking Referring Expression
— Unverified 0The Impact of Answers in Referential Visual Dialog Oct 1, 2021 Question Generation Question-Generation
— Unverified 0Variational Disentangled Attention for Regularized Visual Dialog Sep 29, 2021 Question Answering Visual Dialog
— Unverified 0GoG: Relation-aware Graph-over-Graph Network for Visual Dialog Sep 17, 2021 coreference-resolution Coreference Resolution
— Unverified 0Learning to Ground Visual Objects for Visual Dialog Sep 13, 2021 Visual Dialog
— Unverified 0Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser Sep 6, 2021 Diversity Reinforcement Learning (RL)
Code Code Available 0SeqDialN: Sequential Visual Dialog Network in Joint Visual-Linguistic Representation Space Aug 1, 2021 Visual Dialog
Code Code Available 0Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation May 24, 2021 Referring Expression Referring Expression Comprehension
Code Code Available 0Ensemble of MRR and NDCG models for Visual Dialog Apr 15, 2021 AI Agent Visual Dialog
Code Code Available 1Visual-Textual Alignment for Graph Inference in Visual Dialog Dec 1, 2020 Visual Dialog
— Unverified 0Where Are You? Localization from Embodied Dialog Nov 16, 2020 Navigate Visual Dialog
Code Code Available 1Reasoning Over History: Context Aware Visual Dialog Nov 2, 2020 coreference-resolution Coreference Resolution
— Unverified 0Multi-Modal Open-Domain Dialogue Oct 2, 2020 Visual Dialog
— Unverified 0Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue Oct 1, 2020 Question Generation Question-Generation
Code Code Available 0SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space Aug 2, 2020 Visual Dialog
Code Code Available 0Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data Jul 24, 2020 Visual Dialog Visual Question Answering (VQA)
Code Code Available 0Effective questions in referential visual dialogue Jul 1, 2020 Visual Dialog
— Unverified 0