| Agent AI: Surveying the Horizons of Multimodal Interaction | Jan 7, 2024 | multimodal interaction | Code Available | 2 |
| MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts | Nov 16, 2023 | Binary Classification, Descriptive | Code Available | 1 |
| Dialogue-based generation of self-driving simulation scenarios using Large Language Models | Oct 26, 2023 | multimodal interaction, Self-Driving Cars | Code Available | 1 |
| MM-BigBench: Evaluating Multimodal Models on Multimodal Content Comprehension Tasks | Oct 13, 2023 | multimodal interaction, Multimodal Reasoning | Code Available | 1 |
| Dynamic Hand Gesture-Featured Human Motor Adaptation in Tool Delivery using Voice Recognition | Sep 20, 2023 | Gesture Recognition, Hand Gesture Recognition | Unverified | 0 |
| Adaptive User-centered Neuro-symbolic Learning for Multimodal Interaction with Autonomous Systems | Sep 11, 2023 | Incremental Learning, multimodal interaction | Unverified | 0 |
| CFN-ESA: A Cross-Modal Fusion Network with Emotion-Shift Awareness for Dialogue Emotion Recognition | Jul 28, 2023 | Emotion Recognition, Emotion Recognition in Conversation | Code Available | 1 |
| Multi-Grained Multimodal Interaction Network for Entity Linking | Jul 19, 2023 | Contrastive Learning, Descriptive | Code Available | 1 |
| A Facial Expression-Aware Multimodal Multi-task Learning Framework for Emotion Recognition in Multi-party Conversations | Jul 1, 2023 | Emotion Recognition, Emotion Recognition in Conversation | Code Available | 1 |
| Generative Multimodal Entity Linking | Jun 22, 2023 | Entity Linking, In-Context Learning | Code Available | 1 |
| Expanding the Role of Affective Phenomena in Multimodal Interaction Research | May 18, 2023 | multimodal interaction | Unverified | 0 |
| Segment and Track Anything | May 11, 2023 | Autonomous Driving, multimodal interaction | Code Available | 4 |
| Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval | Mar 22, 2023 | Image-text matching, Language Modeling | Code Available | 2 |
| A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT | Mar 7, 2023 | multimodal interaction | Code Available | 0 |
| HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation | Jan 1, 2023 | multimodal interaction, Object | Unverified | 0 |
| InterMulti: Multi-view Multimodal Interactions with Text-dominated Hierarchical High-order Fusion for Emotion Analysis | Dec 20, 2022 | Emotion Recognition, multimodal interaction | Unverified | 0 |
| A novel multimodal dynamic fusion network for disfluency detection in spoken utterances | Nov 27, 2022 | multimodal interaction | Unverified | 0 |
| Adaptive User-Centered Multimodal Interaction towards Reliable and Trusted Automotive Interfaces | Nov 7, 2022 | multimodal interaction | Unverified | 0 |
| Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions | Nov 7, 2022 | Contrastive Learning, multimodal interaction | Code Available | 0 |
| Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment | Oct 10, 2022 | Articles, multimodal interaction | Unverified | 0 |
| MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning | Oct 9, 2022 | Image-text Retrieval, multimodal interaction | Unverified | 0 |
| On the Horizon: Interactive and Compositional Deepfakes | Sep 5, 2022 | multimodal interaction | Unverified | 0 |
| Chat-to-Design: AI Assisted Personalized Fashion Design | Jul 3, 2022 | multimodal interaction, Natural Language Understanding | Unverified | 0 |
| The VoxWorld Platform for Multimodal Embodied Agents | Jun 1, 2022 | multimodal interaction | Unverified | 0 |
| RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions | Jun 1, 2022 | multimodal interaction | Unverified | 0 |
| Multilevel Hierarchical Network with Multiscale Sampling for Video Question Answering | May 9, 2022 | multimodal interaction, Question Answering | Code Available | 0 |
| Graph-based Fine-grained Multimodal Attention Mechanism for Sentiment Analysis | Nov 16, 2021 | multimodal interaction, Multimodal Sentiment Analysis | Unverified | 0 |
| ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle | Nov 3, 2021 | Emotion Recognition, multimodal interaction | Code Available | 0 |
| MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering | Nov 1, 2021 | multimodal interaction, Multiple-choice | Code Available | 0 |
| Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering | Sep 10, 2021 | multimodal interaction, Natural Language Understanding | Code Available | 1 |
| Dynamic Modality Interaction Modeling for Image-Text Retrieval | Jul 11, 2021 | cross-modal alignment, Cross-Modal Retrieval | Code Available | 1 |
| Neural dSCA: demixing multimodal interaction among brain areas during naturalistic experiments | Jun 5, 2021 | Dimensionality Reduction, Experimental Design | Unverified | 0 |
| SocialInteractionGAN: Multi-person Interaction Sequence Generation | Mar 10, 2021 | Decoder, multimodal interaction | Unverified | 0 |
| ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision | Feb 5, 2021 | Cross-Modal Retrieval, Image Retrieval | Code Available | 1 |
| Dyadic Speech-based Affect Recognition using DAMI-P2C Parent-child Multimodal Interaction Dataset | Aug 20, 2020 | multimodal interaction | Unverified | 0 |
| SBAT: Video Captioning with Sparse Boundary-Aware Transformer | Jul 23, 2020 | Machine Translation, multimodal interaction | Unverified | 0 |
| Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal Transformer | Jul 1, 2020 | multimodal interaction, Multi-modal Named Entity Recognition | Code Available | 1 |
| Dual Convolutional LSTM Network for Referring Image Segmentation | Jan 30, 2020 | Decoder, Image Segmentation | Unverified | 0 |
| Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset | Nov 3, 2019 | Graph Matching, Image Retrieval | Unverified | 0 |
| Corpus of Multimodal Interaction for Collaborative Planning | Jun 1, 2019 | multimodal interaction | Unverified | 0 |
| HUMBO: Bridging Response Generation and Facial Expression Synthesis | May 24, 2019 | Dialogue Generation, multimodal interaction | Unverified | 0 |
| Guidelines for creating man-machine multimodal interfaces | Jan 29, 2019 | multimodal interaction | Unverified | 0 |
| Shaping a social robot's humor with Natural Language Generation and socially-aware reinforcement learning | Nov 1, 2018 | multimodal interaction, reinforcement-learning | Unverified | 0 |
| EmotiW 2018: Audio-Video, Student Engagement and Group-Level Affect Prediction | Aug 23, 2018 | Emotion Recognition, multimodal interaction | Unverified | 0 |
| Multimodal Interaction-aware Motion Prediction for Autonomous Street Crossing | Aug 21, 2018 | motion prediction, multimodal interaction | Unverified | 0 |
| Toward Multimodal Interaction in Scalable Visual Digital Evidence Visualization Using Computer Vision Techniques and ISS | Aug 1, 2018 | Management, multimodal interaction | Unverified | 0 |
| An Evaluation Framework for Multimodal Interaction | May 1, 2018 | Gesture Recognition, multimodal interaction | Unverified | 0 |
| Automatized Generation of Alphabets of Symbols | Jul 16, 2017 | multimodal interaction | Unverified | 0 |
| From Modal to Multimodal Ambiguities: a Classification Approach | Apr 4, 2017 | Classification, General Classification | Unverified | 0 |
| Recurrent Multimodal Interaction for Referring Image Segmentation | Mar 23, 2017 | Image Segmentation, multimodal interaction | Code Available | 0 |