SOTAVerified

multimodal interaction

Papers

Showing 51100 of 106 papers

TitleStatusHype
Agent AI: Surveying the Horizons of Multimodal InteractionCode2
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction ExpertsCode1
Dialogue-based generation of self-driving simulation scenarios using Large Language ModelsCode1
MM-BigBench: Evaluating Multimodal Models on Multimodal Content Comprehension TasksCode1
Dynamic Hand Gesture-Featured Human Motor Adaptation in Tool Delivery using Voice Recognition0
Adaptive User-centered Neuro-symbolic Learning for Multimodal Interaction with Autonomous Systems0
CFN-ESA: A Cross-Modal Fusion Network with Emotion-Shift Awareness for Dialogue Emotion RecognitionCode1
Multi-Grained Multimodal Interaction Network for Entity LinkingCode1
A Facial Expression-Aware Multimodal Multi-task Learning Framework for Emotion Recognition in Multi-party ConversationsCode1
Generative Multimodal Entity LinkingCode1
Expanding the Role of Affective Phenomena in Multimodal Interaction Research0
Segment and Track AnythingCode4
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person RetrievalCode2
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPTCode0
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation0
InterMulti:Multi-view Multimodal Interactions with Text-dominated Hierarchical High-order Fusion for Emotion Analysis0
A novel multimodal dynamic fusion network for disfluency detection in spoken utterances0
Adaptive User-Centered Multimodal Interaction towards Reliable and Trusted Automotive Interfaces0
Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness PredictionsCode0
Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment0
MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning0
On the Horizon: Interactive and Compositional Deepfakes0
Chat-to-Design: AI Assisted Personalized Fashion Design0
The VoxWorld Platform for Multimodal Embodied Agents0
RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions0
Multilevel Hierarchical Network with Multiscale Sampling for Video Question AnsweringCode0
Graph-based Fine-grained Multimodal Attention Mechanism for Sentiment Analysis0
ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving VehicleCode0
MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question AnsweringCode0
Temporal Pyramid Transformer with Multimodal Interaction for Video Question AnsweringCode1
Dynamic Modality Interaction Modeling for Image-Text RetrievalCode1
Neural dSCA: demixing multimodal interaction among brain areas during naturalistic experiments0
SocialInteractionGAN: Multi-person Interaction Sequence Generation0
ViLT: Vision-and-Language Transformer Without Convolution or Region SupervisionCode1
Dyadic Speech-based Affect Recognition using DAMI-P2C Parent-child Multimodal Interaction Dataset0
SBAT: Video Captioning with Sparse Boundary-Aware Transformer0
Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal TransformerCode1
Dual Convolutional LSTM Network for Referring Image Segmentation0
Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset0
Corpus of Multimodal Interaction for Collaborative Planning0
HUMBO: Bridging Response Generation and Facial Expression Synthesis0
Guidelines for creating man-machine multimodal interfaces0
Shaping a social robot's humor with Natural Language Generation and socially-aware reinforcement learning0
EmotiW 2018: Audio-Video, Student Engagement and Group-Level Affect Prediction0
Multimodal Interaction-aware Motion Prediction for Autonomous Street Crossing0
Toward Multimodal Interaction in Scalable Visual Digital Evidence Visualization Using Computer Vision Techniques and ISS0
An Evaluation Framework for Multimodal Interaction0
Automatized Generation of Alphabets of Symbols0
From Modal to Multimodal Ambiguities: a Classification Approach0
Recurrent Multimodal Interaction for Referring Image SegmentationCode0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.