SOTAVerified

multimodal interaction

Papers

Showing 26-50 of 106 papers

Title | Status | Hype
Generative Multimodal Entity Linking | Code | 1
Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering | Code | 1
Dynamic Modality Interaction Modeling for Image-Text Retrieval | Code | 1
ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision | Code | 1
Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal Transformer | Code | 1
A multi-stage augmented multimodal interaction network for fish feeding intensity quantification | - | 0
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback | - | 0
ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding | Code | 0
DeepSORT-Driven Visual Tracking Approach for Gesture Recognition in Interactive Systems | - | 0
A Survey of Interactive Generative Video | - | 0
Immersive Multimedia Communication: State-of-the-Art on eXtended Reality Streaming | - | 0
ReVision: A Dataset and Baseline VLM for Privacy-Preserving Task-Oriented Visual Instruction Rewriting | - | 0
Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving | - | 0
Towards Explainable Multimodal Depression Recognition for Clinical Interviews | Code | 0
FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection | - | 0
MODfinity: Unsupervised Domain Adaptation with Multimodal Information Flow Intertwining | - | 0
Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer | - | 0
CMATH: Cross-Modality Augmented Transformer with Hierarchical Variational Distillation for Multimodal Emotion Recognition in Conversation | - | 0
Generative AI in Multimodal User Interfaces: Trends, Challenges, and Cross-Platform Adaptability | - | 0
MIRe: Enhancing Multimodal Queries Representation via Fusion-Free Modality Interaction for Multimodal Retrieval | Code | 0
Phase Diagram of Vision Large Language Models Inference: A Perspective from Interaction across Image and Instruction | - | 0
Analyzing Multimodal Interaction Strategies for LLM-Assisted Manipulation of 3D Scenes | - | 0
Retrospective Learning from Interactions | - | 0
Robi Butler: Multimodal Remote Interaction with a Household Robot Assistant | - | 0
LLM-Assisted Visual Analytics: Opportunities and Challenges | - | 0
Page 2 of 5

No leaderboard results yet.