SOTAVerified

Human Interaction Recognition

Human Interaction Recognition (HIR) is a field of study that involves the development of computer algorithms to detect and recognize human interactions in videos, images, or other multimedia content. The goal of HIR is to automatically identify and analyze the social interactions between people, their body language, and facial expressions.

Papers

Showing 122 of 22 papers

TitleStatusHype
Dynamic Scene Understanding from Vision-Language Representations0
OV-HHIR: Open Vocabulary Human Interaction Recognition Using Cross-modal Integration of Large Language Models0
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action RecognitionCode1
Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational AgentsCode0
Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches0
SkateFormer: Skeletal-Temporal Transformer for Human Action RecognitionCode2
Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction RecognitionCode0
A Two-stream Hybrid CNN-Transformer Network for Skeleton-based Human Interaction Recognition0
Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action RecognitionCode1
Human-to-Human Interaction Detection0
WiFi-TCN: Temporal Convolution for Human Interaction Recognition based on WiFi signal0
SkeleTR: Towards Skeleton-based Action Recognition in the Wild0
Two-person Graph Convolutional Network for Skeleton-based Human Interaction RecognitionCode0
IGFormer: Interaction Graph Transformer for Skeleton-based Human Interaction Recognition0
A Prospective Approach for Human-to-Human Interaction Recognition from Wi-Fi Channel Data using Attention Bidirectional Gated Recurrent Neural Network with GUI Application Implementation0
Slow-Fast Auditory Streams For Audio RecognitionCode1
Human Interaction Recognition Framework based on Interacting Body Part Attention0
Three-Stream Fusion Network for First-Person Interaction Recognition0
Interaction Relational Network for Mutual Action RecognitionCode0
Hierarchical Long Short-Term Concurrent Memory for Human Interaction Recognition0
Deep Convolutional Poses for Human Interaction Recognition in Monocular Videos0
Facial Descriptors for Human Interaction Recognition In Still Images0
Show:102550

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SkateFormerAccuracy (Cross-Setup)93.2Unverified
2CHASE(CTR-GCN)Accuracy (Cross-Setup)92.3Unverified
3ISTA-NetAccuracy (Cross-Setup)91.7Unverified
4SkeleTRAccuracy (Cross-Setup)88.3Unverified
5IGFormerAccuracy (Cross-Setup)86.5Unverified
6LSTM-IRNAccuracy (Cross-Setup)79.6Unverified
#ModelMetricClaimedVerifiedStatus
1SkateFormerAccuracy (Cross-Subject)97.1Unverified
2CHASE(CTR-GCN)Accuracy (Cross-Subject)96.5Unverified
3SkeleTRAccuracy (Cross-Subject)94.9Unverified
4IGFormerAccuracy (Cross-Subject)93.6Unverified
5LSTM-IRN'fc1inter+intraAccuracy (Cross-Subject)90.5Unverified
#ModelMetricClaimedVerifiedStatus
1H-LSTCMAccuracy98.33Unverified
2Co-LSTSMAccuracy95Unverified
3Raptis et al.Accuracy93.3Unverified
4Donahue et al.Accuracy85Unverified
#ModelMetricClaimedVerifiedStatus
1H-LSTCMAccuracy94.03Unverified
2Co-LSTSMAccuracy92.88Unverified
3Donahue et al.Accuracy80.13Unverified
#ModelMetricClaimedVerifiedStatus
1Slow-Fast(Finetune by Fivewin team)Top-1 accuracy %55.11Unverified
#ModelMetricClaimedVerifiedStatus
1LSTM-IRN'fc1inter+intraAccuracy98.2Unverified
#ModelMetricClaimedVerifiedStatus
1IGFormerAccuracy98.4Unverified
#ModelMetricClaimedVerifiedStatus
1LSTM-IRN'fc1inter+intraAccuracy (Set 1)98.3Unverified