SOTAVerified

Human Interaction Recognition

Human Interaction Recognition (HIR) is a field of study that involves the development of computer algorithms to detect and recognize human interactions in videos, images, or other multimedia content. The goal of HIR is to automatically identify and analyze the social interactions between people, their body language, and facial expressions.

Papers

Showing 110 of 22 papers

TitleStatusHype
Dynamic Scene Understanding from Vision-Language Representations0
OV-HHIR: Open Vocabulary Human Interaction Recognition Using Cross-modal Integration of Large Language Models0
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action RecognitionCode1
Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational AgentsCode0
Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches0
SkateFormer: Skeletal-Temporal Transformer for Human Action RecognitionCode2
Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction RecognitionCode0
A Two-stream Hybrid CNN-Transformer Network for Skeleton-based Human Interaction Recognition0
Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action RecognitionCode1
Human-to-Human Interaction Detection0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1H-LSTCMAccuracy94.03Unverified
2Co-LSTSMAccuracy92.88Unverified
3Donahue et al.Accuracy80.13Unverified