SOTAVerified

multimodal interaction

Papers

Showing 51100 of 106 papers

TitleStatusHype
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation0
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic0
Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction0
Corpus of Multimodal Interaction for Collaborative Planning0
Integration of Multimodal Interaction as Assistance in Virtual Environments0
Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving0
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback0
InterMulti:Multi-view Multimodal Interactions with Text-dominated Hierarchical High-order Fusion for Emotion Analysis0
Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer0
LLM-Assisted Visual Analytics: Opportunities and Challenges0
CMATH: Cross-Modality Augmented Transformer with Hierarchical Variational Distillation for Multimodal Emotion Recognition in Conversation0
Chat-to-Design: AI Assisted Personalized Fashion Design0
HUMBO: Bridging Response Generation and Facial Expression Synthesis0
MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning0
Memory-Inspired Temporal Prompt Interaction for Text-Image Classification0
BlendScape: Enabling End-User Customization of Video-Conferencing Environments through Generative AI0
Immersive Multimedia Communication: State-of-the-Art on eXtended Reality Streaming0
Toward Multimodal Interaction in Scalable Visual Digital Evidence Visualization Using Computer Vision Techniques and ISS0
Automatized Generation of Alphabets of Symbols0
A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models0
MODfinity: Unsupervised Domain Adaptation with Multimodal Information Flow Intertwining0
A Survey of Interactive Generative Video0
Adaptive User-Centered Multimodal Interaction towards Reliable and Trusted Automotive Interfaces0
Multimodal Interaction-aware Motion Prediction for Autonomous Street Crossing0
A Review of Temporal Aspects of Hand Gesture Analysis Applied to Discourse Analysis and Natural Conversation0
Neural dSCA: demixing multimodal interaction among brain areas during naturalistic experiments0
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents0
On the Arrow of Inference0
On the Horizon: Interactive and Compositional Deepfakes0
A POMDP-based Multimodal Interaction System Using a Humanoid Robot0
Phase Diagram of Vision Large Language Models Inference: A Perspective from Interaction across Image and Instruction0
Proceedings of the 1st Workshop on Speech and Multimodal Interaction in Assistive Environments0
Adaptive User-centered Neuro-symbolic Learning for Multimodal Interaction with Autonomous Systems0
Retrospective Learning from Interactions0
ReVision: A Dataset and Baseline VLM for Privacy-Preserving Task-Oriented Visual Instruction Rewriting0
Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum0
RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba0
Robi Butler: Multimodal Remote Interaction with a Household Robot Assistant0
RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions0
SBAT: Video Captioning with Sparse Boundary-Aware Transformer0
Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset0
A novel multimodal dynamic fusion network for disfluency detection in spoken utterances0
Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment0
Shaping a social robot's humor with Natural Language Generation and socially-aware reinforcement learning0
SocialInteractionGAN: Multi-person Interaction Sequence Generation0
An Evaluation Framework for Multimodal Interaction0
Analyzing Multimodal Interaction Strategies for LLM-Assisted Manipulation of 3D Scenes0
A multi-stage augmented multimodal interaction network for fish feeding intensity quantification0
Symbol Emergence in Robotics: A Survey0
A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.