| Adaptive User-Centered Multimodal Interaction towards Reliable and Trusted Automotive Interfaces | Nov 7, 2022 | multimodal interaction | —Unverified | 0 |
| Adaptive User-centered Neuro-symbolic Learning for Multimodal Interaction with Autonomous Systems | Sep 11, 2023 | Incremental Learningmultimodal interaction | —Unverified | 0 |
| A look under the hood of the Interactive Deep Learning Enterprise (No-IDLE) | Jun 27, 2024 | AnatomyDeep Learning | —Unverified | 0 |
| A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety | May 1, 2016 | multimodal interaction | —Unverified | 0 |
| A multi-stage augmented multimodal interaction network for fish feeding intensity quantification | Jun 17, 2025 | Decision Makingmultimodal interaction | —Unverified | 0 |
| Analyzing Multimodal Interaction Strategies for LLM-Assisted Manipulation of 3D Scenes | Oct 29, 2024 | 3D scene Editingmultimodal interaction | —Unverified | 0 |
| An Evaluation Framework for Multimodal Interaction | May 1, 2018 | Gesture Recognitionmultimodal interaction | —Unverified | 0 |
| A novel multimodal dynamic fusion network for disfluency detection in spoken utterances | Nov 27, 2022 | multimodal interaction | —Unverified | 0 |
| A POMDP-based Multimodal Interaction System Using a Humanoid Robot | Oct 1, 2016 | Face Recognitionmultimodal interaction | —Unverified | 0 |
| A Review of Temporal Aspects of Hand Gesture Analysis Applied to Discourse Analysis and Natural Conversation | Dec 17, 2013 | multimodal interactionSystematic Literature Review | —Unverified | 0 |
| A Survey of Interactive Generative Video | Apr 30, 2025 | Autonomous Drivingmultimodal interaction | —Unverified | 0 |
| A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models | Jul 25, 2024 | Data Augmentationmultimodal interaction | —Unverified | 0 |
| Automatized Generation of Alphabets of Symbols | Jul 16, 2017 | multimodal interaction | —Unverified | 0 |
| BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI | Mar 20, 2024 | Image Generationmultimodal interaction | —Unverified | 0 |
| HUMBO: Bridging Response Generation and Facial Expression Synthesis | May 24, 2019 | Dialogue Generationmultimodal interaction | —Unverified | 0 |
| Chat-to-Design: AI Assisted Personalized Fashion Design | Jul 3, 2022 | multimodal interactionNatural Language Understanding | —Unverified | 0 |
| CMATH: Cross-Modality Augmented Transformer with Hierarchical Variational Distillation for Multimodal Emotion Recognition in Conversation | Nov 15, 2024 | Emotion RecognitionEmotion Recognition in Conversation | —Unverified | 0 |
| Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer | Dec 24, 2024 | Gesture Recognitionmultimodal interaction | —Unverified | 0 |
| Immersive Multimedia Communication: State-of-the-Art on eXtended Reality Streaming | Mar 27, 2025 | multimodal interaction | —Unverified | 0 |
| Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Jul 25, 2024 | Image to textLanguage Modeling | —Unverified | 0 |
| DeepSORT-Driven Visual Tracking Approach for Gesture Recognition in Interactive Systems | May 11, 2025 | Gesture Recognitionmultimodal interaction | —Unverified | 0 |
| Dual Convolutional LSTM Network for Referring Image Segmentation | Jan 30, 2020 | DecoderImage Segmentation | —Unverified | 0 |
| Dyadic Speech-based Affect Recognition using DAMI-P2C Parent-child Multimodal Interaction Dataset | Aug 20, 2020 | multimodal interaction | —Unverified | 0 |
| Dynamic Hand Gesture-Featured Human Motor Adaptation in Tool Delivery using Voice Recognition | Sep 20, 2023 | Gesture RecognitionHand Gesture Recognition | —Unverified | 0 |
| EMMI -- Empathic Multimodal Motivational Interviews Dataset: Analyses and Annotations | Jun 24, 2024 | multimodal interaction | —Unverified | 0 |
| EmotiW 2018: Audio-Video, Student Engagement and Group-Level Affect Prediction | Aug 23, 2018 | Emotion Recognitionmultimodal interaction | —Unverified | 0 |
| Expanding the Role of Affective Phenomena in Multimodal Interaction Research | May 18, 2023 | multimodal interaction | —Unverified | 0 |
| FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection | Jan 8, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| From Modal to Multimodal Ambiguities: a Classification Approach | Apr 4, 2017 | ClassificationGeneral Classification | —Unverified | 0 |
| RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba | Aug 16, 2024 | AllMamba | —Unverified | 0 |
| Robi Butler: Multimodal Remote Interaction with a Household Robot Assistant | Sep 30, 2024 | multimodal interaction | —Unverified | 0 |
| RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions | Jun 1, 2022 | multimodal interaction | —Unverified | 0 |
| SBAT: Video Captioning with Sparse Boundary-Aware Transformer | Jul 23, 2020 | Machine Translationmultimodal interaction | —Unverified | 0 |
| Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset | Nov 3, 2019 | Graph MatchingImage Retrieval | —Unverified | 0 |
| Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment | Oct 10, 2022 | Articlesmultimodal interaction | —Unverified | 0 |
| Shaping a social robot's humor with Natural Language Generation and socially-aware reinforcement learning | Nov 1, 2018 | multimodal interactionreinforcement-learning | —Unverified | 0 |
| SocialInteractionGAN: Multi-person Interaction Sequence Generation | Mar 10, 2021 | Decodermultimodal interaction | —Unverified | 0 |
| Symbol Emergence in Robotics: A Survey | Sep 29, 2015 | multimodal interactionSurvey | —Unverified | 0 |
| The VoxWorld Platform for Multimodal Embodied Agents | Jun 1, 2022 | multimodal interaction | —Unverified | 0 |
| Toward Multimodal Interaction in Scalable Visual Digital Evidence Visualization Using Computer Vision Techniques and ISS | Aug 1, 2018 | Managementmultimodal interaction | —Unverified | 0 |
| Phase Diagram of Vision Large Language Models Inference: A Perspective from Interaction across Image and Instruction | Nov 1, 2024 | multimodal interaction | —Unverified | 0 |
| Proceedings of the 1st Workshop on Speech and Multimodal Interaction in Assistive Environments | Jul 1, 2012 | multimodal interaction | —Unverified | 0 |
| Retrospective Learning from Interactions | Oct 17, 2024 | multimodal interaction | —Unverified | 0 |
| ReVision: A Dataset and Baseline VLM for Privacy-Preserving Task-Oriented Visual Instruction Rewriting | Feb 20, 2025 | Image Captioningmultimodal interaction | —Unverified | 0 |
| Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum | Apr 27, 2024 | Contrastive LearningEmotion Recognition | —Unverified | 0 |
| Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering | May 9, 2022 | multimodal interactionQuestion Answering | CodeCode Available | 0 |
| Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions | Aug 2, 2024 | Benchmarkingmultimodal interaction | CodeCode Available | 0 |
| ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle | Nov 3, 2021 | Emotion Recognitionmultimodal interaction | CodeCode Available | 0 |
| Recurrent Multimodal Interaction for Referring Image Segmentation | Mar 23, 2017 | Image Segmentationmultimodal interaction | CodeCode Available | 0 |
| Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions | Nov 7, 2022 | Contrastive Learningmultimodal interaction | CodeCode Available | 0 |