| HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation | Jan 1, 2023 | multimodal interactionObject | —Unverified | 0 | 0 |
| Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Jul 25, 2024 | Image to textLanguage Modeling | —Unverified | 0 | 0 |
| Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction | Mar 16, 2024 | Adversarial RobustnessImage-text Retrieval | —Unverified | 0 | 0 |
| Corpus of Multimodal Interaction for Collaborative Planning | Jun 1, 2019 | multimodal interaction | —Unverified | 0 | 0 |
| Integration of Multimodal Interaction as Assistance in Virtual Environments | Jul 1, 2012 | multimodal interaction | —Unverified | 0 | 0 |
| Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving | Feb 12, 2025 | Mathmultimodal interaction | —Unverified | 0 | 0 |
| InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback | May 29, 2025 | multimodal interaction | —Unverified | 0 | 0 |
| InterMulti:Multi-view Multimodal Interactions with Text-dominated Hierarchical High-order Fusion for Emotion Analysis | Dec 20, 2022 | Emotion Recognitionmultimodal interaction | —Unverified | 0 | 0 |
| Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer | Dec 24, 2024 | Gesture Recognitionmultimodal interaction | —Unverified | 0 | 0 |
| LLM-Assisted Visual Analytics: Opportunities and Challenges | Sep 4, 2024 | Managementmultimodal interaction | —Unverified | 0 | 0 |
| CMATH: Cross-Modality Augmented Transformer with Hierarchical Variational Distillation for Multimodal Emotion Recognition in Conversation | Nov 15, 2024 | Emotion RecognitionEmotion Recognition in Conversation | —Unverified | 0 | 0 |
| Chat-to-Design: AI Assisted Personalized Fashion Design | Jul 3, 2022 | multimodal interactionNatural Language Understanding | —Unverified | 0 | 0 |
| HUMBO: Bridging Response Generation and Facial Expression Synthesis | May 24, 2019 | Dialogue Generationmultimodal interaction | —Unverified | 0 | 0 |
| MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning | Oct 9, 2022 | Image-text Retrievalmultimodal interaction | —Unverified | 0 | 0 |
| Memory-Inspired Temporal Prompt Interaction for Text-Image Classification | Jan 26, 2024 | Classificationimage-classification | —Unverified | 0 | 0 |
| BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI | Mar 20, 2024 | Image Generationmultimodal interaction | —Unverified | 0 | 0 |
| Immersive Multimedia Communication: State-of-the-Art on eXtended Reality Streaming | Mar 27, 2025 | multimodal interaction | —Unverified | 0 | 0 |
| Toward Multimodal Interaction in Scalable Visual Digital Evidence Visualization Using Computer Vision Techniques and ISS | Aug 1, 2018 | Managementmultimodal interaction | —Unverified | 0 | 0 |
| Automatized Generation of Alphabets of Symbols | Jul 16, 2017 | multimodal interaction | —Unverified | 0 | 0 |
| A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models | Jul 25, 2024 | Data Augmentationmultimodal interaction | —Unverified | 0 | 0 |
| MODfinity: Unsupervised Domain Adaptation with Multimodal Information Flow Intertwining | Jan 1, 2025 | Domain AdaptationModel Selection | —Unverified | 0 | 0 |
| A Survey of Interactive Generative Video | Apr 30, 2025 | Autonomous Drivingmultimodal interaction | —Unverified | 0 | 0 |
| Adaptive User-Centered Multimodal Interaction towards Reliable and Trusted Automotive Interfaces | Nov 7, 2022 | multimodal interaction | —Unverified | 0 | 0 |
| Multimodal Interaction-aware Motion Prediction for Autonomous Street Crossing | Aug 21, 2018 | motion predictionmultimodal interaction | —Unverified | 0 | 0 |
| A Review of Temporal Aspects of Hand Gesture Analysis Applied to Discourse Analysis and Natural Conversation | Dec 17, 2013 | multimodal interactionSystematic Literature Review | —Unverified | 0 | 0 |