| Foundations and Recent Trends in Multimodal Mobile Agents: A Survey | Nov 4, 2024 | multimodal interactionSurvey | CodeCode Available | 2 |
| Phase Diagram of Vision Large Language Models Inference: A Perspective from Interaction across Image and Instruction | Nov 1, 2024 | multimodal interaction | —Unverified | 0 |
| Analyzing Multimodal Interaction Strategies for LLM-Assisted Manipulation of 3D Scenes | Oct 29, 2024 | 3D scene Editingmultimodal interaction | —Unverified | 0 |
| LLMs Can Evolve Continually on Modality for X-Modal Reasoning | Oct 26, 2024 | Continual Learningmultimodal interaction | CodeCode Available | 1 |
| Retrospective Learning from Interactions | Oct 17, 2024 | multimodal interaction | —Unverified | 0 |
| Spatio-Temporal 3D Point Clouds from WiFi-CSI Data via Transformer Networks | Oct 7, 2024 | multimodal interaction | CodeCode Available | 1 |
| Robi Butler: Multimodal Remote Interaction with a Household Robot Assistant | Sep 30, 2024 | multimodal interaction | —Unverified | 0 |
| Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations | Sep 8, 2024 | Emotion RecognitionMamba | CodeCode Available | 1 |
| LLM-Assisted Visual Analytics: Opportunities and Challenges | Sep 4, 2024 | Managementmultimodal interaction | —Unverified | 0 |
| RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba | Aug 16, 2024 | AllMamba | —Unverified | 0 |