| Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models | Jun 30, 2024 | Hallucinationmultimodal interaction | CodeCode Available | 1 | 5 |
| LLMs Can Evolve Continually on Modality for X-Modal Reasoning | Oct 26, 2024 | Continual Learningmultimodal interaction | CodeCode Available | 1 | 5 |
| MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts | Nov 16, 2023 | Binary ClassificationDescriptive | CodeCode Available | 1 | 5 |
| Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering | Sep 10, 2021 | multimodal interactionNatural Language Understanding | CodeCode Available | 1 | 5 |
| Dialogue-based generation of self-driving simulation scenarios using Large Language Models | Oct 26, 2023 | multimodal interactionSelf-Driving Cars | CodeCode Available | 1 | 5 |
| Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions | Nov 7, 2022 | Contrastive Learningmultimodal interaction | CodeCode Available | 0 | 5 |
| Towards Explainable Multimodal Depression Recognition for Clinical Interviews | Jan 27, 2025 | Decision MakingDepression Detection | CodeCode Available | 0 | 5 |
| ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding | May 25, 2025 | Chart UnderstandingLogical Reasoning | CodeCode Available | 0 | 5 |
| Recurrent Multimodal Interaction for Referring Image Segmentation | Mar 23, 2017 | Image Segmentationmultimodal interaction | CodeCode Available | 0 | 5 |
| Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering | May 9, 2022 | multimodal interactionQuestion Answering | CodeCode Available | 0 | 5 |
| MIRe: Enhancing Multimodal Queries Representation via Fusion-Free Modality Interaction for Multimodal Retrieval | Nov 13, 2024 | Image ComprehensionInformation Retrieval | CodeCode Available | 0 | 5 |
| Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational Agents | Jul 1, 2024 | Emotional IntelligenceEmotion Classification | CodeCode Available | 0 | 5 |
| MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering | Nov 1, 2021 | multimodal interactionMultiple-choice | CodeCode Available | 0 | 5 |
| ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle | Nov 3, 2021 | Emotion Recognitionmultimodal interaction | CodeCode Available | 0 | 5 |
| Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions | Aug 2, 2024 | Benchmarkingmultimodal interaction | CodeCode Available | 0 | 5 |
| A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT | Mar 7, 2023 | multimodal interaction | CodeCode Available | 0 | 5 |
| DeepSORT-Driven Visual Tracking Approach for Gesture Recognition in Interactive Systems | May 11, 2025 | Gesture Recognitionmultimodal interaction | —Unverified | 0 | 0 |
| A Review of Temporal Aspects of Hand Gesture Analysis Applied to Discourse Analysis and Natural Conversation | Dec 17, 2013 | multimodal interactionSystematic Literature Review | —Unverified | 0 | 0 |
| Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic | Jul 25, 2024 | Image to textLanguage Modeling | —Unverified | 0 | 0 |
| A POMDP-based Multimodal Interaction System Using a Humanoid Robot | Oct 1, 2016 | Face Recognitionmultimodal interaction | —Unverified | 0 | 0 |
| Corpus of Multimodal Interaction for Collaborative Planning | Jun 1, 2019 | multimodal interaction | —Unverified | 0 | 0 |
| HGNET: A Hierarchical Feature Guided Network for Occupancy Flow Field Prediction | Jul 1, 2024 | Autonomous Drivingmultimodal interaction | —Unverified | 0 | 0 |
| Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer | Dec 24, 2024 | Gesture Recognitionmultimodal interaction | —Unverified | 0 | 0 |
| Guidelines for creating man-machine multimodal interfaces | Jan 29, 2019 | multimodal interaction | —Unverified | 0 | 0 |
| Graph-based Fine-grained Multimodal Attention Mechanism for Sentiment Analysis | Nov 16, 2021 | multimodal interactionMultimodal Sentiment Analysis | —Unverified | 0 | 0 |