| Phase Diagram of Vision Large Language Models Inference: A Perspective from Interaction across Image and Instruction | Nov 1, 2024 | multimodal interaction | Unverified | 0 |
| Proceedings of the 1st Workshop on Speech and Multimodal Interaction in Assistive Environments | Jul 1, 2012 | multimodal interaction | Unverified | 0 |
| Retrospective Learning from Interactions | Oct 17, 2024 | multimodal interaction | Unverified | 0 |
| ReVision: A Dataset and Baseline VLM for Privacy-Preserving Task-Oriented Visual Instruction Rewriting | Feb 20, 2025 | Image Captioning, multimodal interaction | Unverified | 0 |
| Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum | Apr 27, 2024 | Contrastive Learning, Emotion Recognition | Unverified | 0 |
| Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering | May 9, 2022 | multimodal interaction, Question Answering | Code Available | 0 |
| Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions | Aug 2, 2024 | Benchmarking, multimodal interaction | Code Available | 0 |
| ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle | Nov 3, 2021 | Emotion Recognition, multimodal interaction | Code Available | 0 |
| Recurrent Multimodal Interaction for Referring Image Segmentation | Mar 23, 2017 | Image Segmentation, multimodal interaction | Code Available | 0 |
| Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions | Nov 7, 2022 | Contrastive Learning, multimodal interaction | Code Available | 0 |