| LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference | Jun 26, 2024 | multimodal interaction | Code Available | 2 |
| EMMI -- Empathic Multimodal Motivational Interviews Dataset: Analyses and Annotations | Jun 24, 2024 | multimodal interaction | Unverified | 0 |
| Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum | Apr 27, 2024 | Contrastive Learning, Emotion Recognition | Unverified | 0 |
| Narrative Action Evaluation with Prompt-Guided Multimodal Interaction | Apr 22, 2024 | Action Quality Assessment, multimodal interaction | Code Available | 1 |
| Cooperative Sentiment Agents for Multimodal Sentiment Analysis | Apr 19, 2024 | Disentanglement, Emotion Recognition | Code Available | 1 |
| Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want | Mar 29, 2024 | Instruction Following, Language Modelling | Code Available | 2 |
| BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI | Mar 20, 2024 | Image Generation, multimodal interaction | Unverified | 0 |
| Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction | Mar 16, 2024 | Adversarial Robustness, Image-text Retrieval | Unverified | 0 |
| On the Arrow of Inference | Feb 22, 2024 | counterfactual, Counterfactual Reasoning | Unverified | 0 |
| Memory-Inspired Temporal Prompt Interaction for Text-Image Classification | Jan 26, 2024 | Classification, image-classification | Unverified | 0 |