| A Survey of Interactive Generative Video | Apr 30, 2025 | Autonomous Drivingmultimodal interaction | —Unverified | 0 |
| A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models | Jul 25, 2024 | Data Augmentationmultimodal interaction | —Unverified | 0 |
| Automatized Generation of Alphabets of Symbols | Jul 16, 2017 | multimodal interaction | —Unverified | 0 |
| BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI | Mar 20, 2024 | Image Generationmultimodal interaction | —Unverified | 0 |
| HUMBO: Bridging Response Generation and Facial Expression Synthesis | May 24, 2019 | Dialogue Generationmultimodal interaction | —Unverified | 0 |
| Chat-to-Design: AI Assisted Personalized Fashion Design | Jul 3, 2022 | multimodal interactionNatural Language Understanding | —Unverified | 0 |
| CMATH: Cross-Modality Augmented Transformer with Hierarchical Variational Distillation for Multimodal Emotion Recognition in Conversation | Nov 15, 2024 | Emotion RecognitionEmotion Recognition in Conversation | —Unverified | 0 |
| Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer | Dec 24, 2024 | Gesture Recognitionmultimodal interaction | —Unverified | 0 |
| Immersive Multimedia Communication: State-of-the-Art on eXtended Reality Streaming | Mar 27, 2025 | multimodal interaction | —Unverified | 0 |
| Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Jul 25, 2024 | Image to textLanguage Modeling | —Unverified | 0 |