SOTAVerified

Activity Recognition

Human Activity Recognition is the problem of identifying events performed by humans given a video input. It is formulated as a binary (or multiclass) classification problem of outputting activity class labels. Activity Recognition is an important problem with many societal applications including smart surveillance, video search/retrieval, intelligent robots, and other monitoring systems.

Source: Learning Latent Sub-events in Activity Videos Using Temporal Attention Filters

Papers

Showing 876900 of 1322 papers

TitleStatusHype
STS Classification with Dual-stream CNN0
Student Activity Recognition in Classroom Environments using Transfer Learning0
Super Normal Vector for Activity Recognition Using Depth Sequences0
Surgical Activity Recognition in Robot-Assisted Radical Prostatectomy using Deep Learning0
SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery0
SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition0
Tapestry of Time and Actions: Modeling Human Activity Sequences using Temporal Point Process Flows0
TASKED: Transformer-based Adversarial learning for human activity recognition using wearable sensors via Self-KnowledgE Distillation0
WiFi-TCN: Temporal Convolution for Human Interaction Recognition based on WiFi signal0
Template co-updating in multi-modal human activity recognition systems0
Temporal Action Segmentation with High-level Complex Activity Labels0
Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction0
Temporally Consistent Dynamic Scene Graphs: An End-to-End Approach for Action Tracklet Generation0
Temporally Robust Global Motion Compensation by Keypoint-based Congealing0
Temporal Reasoning Graph for Activity Recognition0
TENT: Connect Language Models with IoT Sensors for Zero-Shot Activity Recognition0
Text me the data: Generating Ground Pressure Sequence from Textual Descriptions for HAR0
The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes0
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary0
The Evolution of First Person Vision Methods: A Survey0
The Potential of Vision-Language Models for Content Moderation of Children's Videos0
The Sweet-Home speech and multimodal corpus for home automation interaction0
Three Branches: Detecting Actions With Richer Features0
Three-Stream Fusion Network for First-Person Interaction Recognition0
Three-stream network for enriched Action Recognition0
Show:102550
← PrevPage 36 of 53Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Structured Keypoint PoolingAccuracy93.4Unverified
2Semi-Supervised Hard Attention (SSHA); pretrained on Deepmind Kinetics datasetAccuracy90.4Unverified
3Human Skeletons + Change DetectionAccuracy90.25Unverified
4Separable Convolutional LSTMAccuracy89.75Unverified
5SPIL ConvolutionAccuracy89.3Unverified
6Flow Gated NetworkAccuracy87.25Unverified
#ModelMetricClaimedVerifiedStatus
1FocusCLIPTop-3 Accuracy (%)10.47Unverified
2CLIPTop-3 Accuracy (%)6.49Unverified
#ModelMetricClaimedVerifiedStatus
1Boutaleb et al.1:1 Accuracy97.91Unverified
#ModelMetricClaimedVerifiedStatus
1all-landmark-modelActivity Recognition0.76Unverified