| Phase Diagram of Vision Large Language Models Inference: A Perspective from Interaction across Image and Instruction | Nov 1, 2024 | multimodal interaction | —Unverified | 0 | 0 |
| Proceedings of the 1st Workshop on Speech and Multimodal Interaction in Assistive Environments | Jul 1, 2012 | multimodal interaction | —Unverified | 0 | 0 |
| Adaptive User-centered Neuro-symbolic Learning for Multimodal Interaction with Autonomous Systems | Sep 11, 2023 | Incremental Learningmultimodal interaction | —Unverified | 0 | 0 |
| Retrospective Learning from Interactions | Oct 17, 2024 | multimodal interaction | —Unverified | 0 | 0 |
| ReVision: A Dataset and Baseline VLM for Privacy-Preserving Task-Oriented Visual Instruction Rewriting | Feb 20, 2025 | Image Captioningmultimodal interaction | —Unverified | 0 | 0 |
| Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum | Apr 27, 2024 | Contrastive LearningEmotion Recognition | —Unverified | 0 | 0 |
| RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba | Aug 16, 2024 | AllMamba | —Unverified | 0 | 0 |
| Robi Butler: Multimodal Remote Interaction with a Household Robot Assistant | Sep 30, 2024 | multimodal interaction | —Unverified | 0 | 0 |
| RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions | Jun 1, 2022 | multimodal interaction | —Unverified | 0 | 0 |
| SBAT: Video Captioning with Sparse Boundary-Aware Transformer | Jul 23, 2020 | Machine Translationmultimodal interaction | —Unverified | 0 | 0 |