| HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation | Jan 1, 2023 | multimodal interaction, Object | —Unverified | 0 | 0 |
| Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Jul 25, 2024 | Image to text, Language Modeling | —Unverified | 0 | 0 |
| Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction | Mar 16, 2024 | Adversarial Robustness, Image-text Retrieval | —Unverified | 0 | 0 |
| Corpus of Multimodal Interaction for Collaborative Planning | Jun 1, 2019 | multimodal interaction | —Unverified | 0 | 0 |
| Integration of Multimodal Interaction as Assistance in Virtual Environments | Jul 1, 2012 | multimodal interaction | —Unverified | 0 | 0 |
| Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving | Feb 12, 2025 | Math, multimodal interaction | —Unverified | 0 | 0 |
| InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback | May 29, 2025 | multimodal interaction | —Unverified | 0 | 0 |
| InterMulti: Multi-view Multimodal Interactions with Text-dominated Hierarchical High-order Fusion for Emotion Analysis | Dec 20, 2022 | Emotion Recognition, multimodal interaction | —Unverified | 0 | 0 |
| Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer | Dec 24, 2024 | Gesture Recognition, multimodal interaction | —Unverified | 0 | 0 |
| LLM-Assisted Visual Analytics: Opportunities and Challenges | Sep 4, 2024 | Management, multimodal interaction | —Unverified | 0 | 0 |