SOTAVerified

multimodal interaction

Papers

Showing 51100 of 106 papers

TitleStatusHype
RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba0
Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory InstructionsCode0
A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models0
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic0
Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational AgentsCode0
HGNET: A Hierarchical Feature Guided Network for Occupancy Flow Field Prediction0
A look under the hood of the Interactive Deep Learning Enterprise (No-IDLE)0
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents0
EMMI -- Empathic Multimodal Motivational Interviews Dataset: Analyses and Annotations0
Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum0
BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI0
Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction0
On the Arrow of Inference0
Memory-Inspired Temporal Prompt Interaction for Text-Image Classification0
Dynamic Hand Gesture-Featured Human Motor Adaptation in Tool Delivery using Voice Recognition0
Adaptive User-centered Neuro-symbolic Learning for Multimodal Interaction with Autonomous Systems0
Expanding the Role of Affective Phenomena in Multimodal Interaction Research0
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPTCode0
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation0
InterMulti:Multi-view Multimodal Interactions with Text-dominated Hierarchical High-order Fusion for Emotion Analysis0
A novel multimodal dynamic fusion network for disfluency detection in spoken utterances0
Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness PredictionsCode0
Adaptive User-Centered Multimodal Interaction towards Reliable and Trusted Automotive Interfaces0
Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment0
MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning0
On the Horizon: Interactive and Compositional Deepfakes0
Chat-to-Design: AI Assisted Personalized Fashion Design0
RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions0
The VoxWorld Platform for Multimodal Embodied Agents0
Multilevel Hierarchical Network with Multiscale Sampling for Video Question AnsweringCode0
Graph-based Fine-grained Multimodal Attention Mechanism for Sentiment Analysis0
ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving VehicleCode0
MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question AnsweringCode0
Neural dSCA: demixing multimodal interaction among brain areas during naturalistic experiments0
SocialInteractionGAN: Multi-person Interaction Sequence Generation0
Dyadic Speech-based Affect Recognition using DAMI-P2C Parent-child Multimodal Interaction Dataset0
SBAT: Video Captioning with Sparse Boundary-Aware Transformer0
Dual Convolutional LSTM Network for Referring Image Segmentation0
Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset0
Corpus of Multimodal Interaction for Collaborative Planning0
HUMBO: Bridging Response Generation and Facial Expression Synthesis0
Guidelines for creating man-machine multimodal interfaces0
Shaping a social robot's humor with Natural Language Generation and socially-aware reinforcement learning0
EmotiW 2018: Audio-Video, Student Engagement and Group-Level Affect Prediction0
Multimodal Interaction-aware Motion Prediction for Autonomous Street Crossing0
Toward Multimodal Interaction in Scalable Visual Digital Evidence Visualization Using Computer Vision Techniques and ISS0
An Evaluation Framework for Multimodal Interaction0
Automatized Generation of Alphabets of Symbols0
From Modal to Multimodal Ambiguities: a Classification Approach0
Recurrent Multimodal Interaction for Referring Image SegmentationCode0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.