| RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba | Aug 16, 2024 | AllMamba | Unverified | 0 |
| Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions | Aug 2, 2024 | Benchmarking, multimodal interaction | Code Available | 0 |
| A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models | Jul 25, 2024 | Data Augmentation, multimodal interaction | Unverified | 0 |
| Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Jul 25, 2024 | Image to text, Language Modeling | Unverified | 0 |
| Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational Agents | Jul 1, 2024 | Emotional Intelligence, Emotion Classification | Code Available | 0 |
| HGNET: A Hierarchical Feature Guided Network for Occupancy Flow Field Prediction | Jul 1, 2024 | Autonomous Driving, multimodal interaction | Unverified | 0 |
| A look under the hood of the Interactive Deep Learning Enterprise (No-IDLE) | Jun 27, 2024 | Anatomy, Deep Learning | Unverified | 0 |
| OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents | Jun 27, 2024 | Decoder, Imitation Learning | Unverified | 0 |
| EMMI -- Empathic Multimodal Motivational Interviews Dataset: Analyses and Annotations | Jun 24, 2024 | multimodal interaction | Unverified | 0 |
| Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum | Apr 27, 2024 | Contrastive Learning, Emotion Recognition | Unverified | 0 |
| BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI | Mar 20, 2024 | Image Generation, multimodal interaction | Unverified | 0 |
| Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction | Mar 16, 2024 | Adversarial Robustness, Image-text Retrieval | Unverified | 0 |
| On the Arrow of Inference | Feb 22, 2024 | counterfactual, Counterfactual Reasoning | Unverified | 0 |
| Memory-Inspired Temporal Prompt Interaction for Text-Image Classification | Jan 26, 2024 | Classification, image-classification | Unverified | 0 |
| Dynamic Hand Gesture-Featured Human Motor Adaptation in Tool Delivery using Voice Recognition | Sep 20, 2023 | Gesture Recognition, Hand Gesture Recognition | Unverified | 0 |
| Adaptive User-centered Neuro-symbolic Learning for Multimodal Interaction with Autonomous Systems | Sep 11, 2023 | Incremental Learning, multimodal interaction | Unverified | 0 |
| Expanding the Role of Affective Phenomena in Multimodal Interaction Research | May 18, 2023 | multimodal interaction | Unverified | 0 |
| A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT | Mar 7, 2023 | multimodal interaction | Code Available | 0 |
| HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation | Jan 1, 2023 | multimodal interaction, Object | Unverified | 0 |
| InterMulti: Multi-view Multimodal Interactions with Text-dominated Hierarchical High-order Fusion for Emotion Analysis | Dec 20, 2022 | Emotion Recognition, multimodal interaction | Unverified | 0 |
| A novel multimodal dynamic fusion network for disfluency detection in spoken utterances | Nov 27, 2022 | multimodal interaction | Unverified | 0 |
| Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions | Nov 7, 2022 | Contrastive Learning, multimodal interaction | Code Available | 0 |
| Adaptive User-Centered Multimodal Interaction towards Reliable and Trusted Automotive Interfaces | Nov 7, 2022 | multimodal interaction | Unverified | 0 |
| Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment | Oct 10, 2022 | Articles, multimodal interaction | Unverified | 0 |
| MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning | Oct 9, 2022 | Image-text Retrieval, multimodal interaction | Unverified | 0 |