SOTAVerified

multimodal interaction

Papers

Showing 2650 of 106 papers

TitleStatusHype
Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language ModelsCode1
LLMs Can Evolve Continually on Modality for X-Modal ReasoningCode1
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction ExpertsCode1
Temporal Pyramid Transformer with Multimodal Interaction for Video Question AnsweringCode1
Dialogue-based generation of self-driving simulation scenarios using Large Language ModelsCode1
Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness PredictionsCode0
Towards Explainable Multimodal Depression Recognition for Clinical InterviewsCode0
ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart UnderstandingCode0
Recurrent Multimodal Interaction for Referring Image SegmentationCode0
Multilevel Hierarchical Network with Multiscale Sampling for Video Question AnsweringCode0
MIRe: Enhancing Multimodal Queries Representation via Fusion-Free Modality Interaction for Multimodal RetrievalCode0
Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational AgentsCode0
MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question AnsweringCode0
ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving VehicleCode0
Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory InstructionsCode0
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPTCode0
DeepSORT-Driven Visual Tracking Approach for Gesture Recognition in Interactive Systems0
A Review of Temporal Aspects of Hand Gesture Analysis Applied to Discourse Analysis and Natural Conversation0
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic0
A POMDP-based Multimodal Interaction System Using a Humanoid Robot0
Corpus of Multimodal Interaction for Collaborative Planning0
HGNET: A Hierarchical Feature Guided Network for Occupancy Flow Field Prediction0
Computer Vision-Driven Gesture Recognition: Toward Natural and Intuitive Human-Computer0
Guidelines for creating man-machine multimodal interfaces0
Graph-based Fine-grained Multimodal Attention Mechanism for Sentiment Analysis0
Show:102550
← PrevPage 2 of 5Next →

No leaderboard results yet.